ActiveloopAC

Senior AI Search Engineer

Activeloop provides an open-source data lake, Deep Lake, that simplifies the management of data for deep learning applications, accelerating AI product development.

Activeloop
United States only

At Activeloop we are transforming the way organizations harness their data for AI with our Deep Lake and Multi-modal AI Search. Whether you're answering critical clinical questions or searching across vast repositories of scientific papers, we empower you to index, search and organize billions of documents, images, and videos intuitively using natural language powered by Large Language Models. Join us in making data more accessible and actionable than ever before.

We're looking for an AI Search Engineer who possesses a deep understanding of large-scale information retrieval systems, deep learning, databases, and RAG architectures. The ideal candidate will have expertise in developing and optimizing search algorithms, implementing efficient indexing techniques, and leveraging RAG to enhance AI-powered search and question-answering systems.

What You Will Be Doing

As an AI Search Engineer, you will play a pivotal role in designing, developing, and deploying advanced search and retrieval systems that leverage RAG techniques to solve complex information access challenges. You will collaborate with software engineers, customers, and business stakeholders to develop AI search solutions that deliver significant value to the organization and our clients.

Key Responsibilities

RAG System Research and Implementation: Lead the design and implementation of advanced retrieval systems like Deep Memory by Activeloop, delivering optimized RAG systems across the entire value chain - from embedding or model fine-tuning to retrieval optimization with custom algorithms, to enhance knowledge retrieval accuracy.

Search Algorithm Optimization: Develop and refine search algorithms, including semantic search, hybrid search, and multi-modal search techniques, to improve retrieval performance and relevance ranking.

Vector Database Integration: Implement and optimize vector storage and indexing solutions within Deep Lake, ensuring efficient similarity search capabilities for high-dimensional embeddings used in RAG systems.

Query Understanding and Processing: Design and implement advanced query processing pipelines, including query expansion, intent recognition, and contextual interpretation to enhance search precision.

Information Retrieval Model Development: Create and fine-tune machine learning models specifically for information retrieval tasks, such as document ranking, query-document relevance scoring, and zero-shot retrieval.

Performance Evaluation and Metrics: Establish comprehensive evaluation frameworks for search and RAG systems, including relevance assessments, A/B testing, and user satisfaction metrics to continually improve system performance.

Scalability and Efficiency: Optimize RAG and search systems for high throughput and low latency, ensuring they can handle large-scale datasets and real-time query processing demands.

Data Ingestion and Indexing: Develop efficient data ingestion pipelines and indexing strategies to support rapid updates and real-time search capabilities across diverse data types and sources.



What We Need to See

  • Master's or PhD degree in Computer Science, Machine Learning, Statistics, or a related field.

  • Strong programming skills in one or more programming languages, such as Python, or C++, and extensive experience with machine learning libraries, such as TensorFlow, PyTorch, Llama Index, LangChain etc.

  • Proven experience in developing and deploying complex machine learning models in production environments, including experience with cloud-based platforms, edge devices, or embedded systems.

  • Strong understanding of advanced machine learning algorithms, such as deep learning, reinforcement learning, RAGs, and ensemble methods, and experience with model optimization techniques, such as hyper-parameter tuning, model compression, and quantization.

  • Solid understanding of data pre-processing, feature engineering, and data quality assurance techniques, and experience with large-scale, complex data sets.

  • Excellent problem-solving skills and ability to analyze and interpret complex data sets to extract meaningful insights and drive decision-making.

Ways To Stand Out From The Crowd

  • You have trained deep learning models in a distributed manner.

  • You are a highly motivated, curious, hardworking explorer in the field of AI

  • You have publications in top-tier machine learning and AI conferences such as ICML, NeurIPS, and CVPR (highly desirable).

  • You have a builder attitude and you love building cool things that matter

  • You're excited to work closely with the founding team in developing hyper-scalable software for ML

  • You can proactively identify and anticipate problems and provide tangible solutions

  • You enjoy the startup journey of building an endurable, scalable business

Please clearly indicate in your application where you heard about this job opportunity.


Why Join Activeloop?

Activeloop Deep Lake is at the forefront of transitioning from traditional software to AI, accelerating AI deployment across various industries. Our products empower advanced LLMs, generative models, and computer vision models. Trusted by industry leaders we are expanding our team to further advance AI applications. We pride ourselves on being an inclusive, equal opportunity workplace, committed to diversity and accessibility for all applicants.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Location requirements

Hiring timezones

United States +/- 0 hours

About Activeloop

Learn more about Activeloop and their company culture.

View company profile

Activeloop frees deep learning teams from building complex data infrastructure so they can develop AI products faster. Deep Lake open source for researchers & nascent teams enables automatic connection of unstructured data, including audio, video, image, and point cloud data, to machine learning models. Their product, Deep Lake open-source, supports data streaming, scalable machine learning pipelines, and dataset version control for distributed workloads.

Additionally, teams can access over 200 machine learning datasets like MNIST, COCO, CIFAR, ImageNet, or GTZAN formatted for Deep Lake, which is curated by the community. Activeloop serves reputable companies such as Intel, Airbus, and Matterport, promoting operational efficiency and reducing costs. The company emphasizes the importance of good data for great models, aiming to democratize access to optimized data for all, including students, researchers, and startups.

Claim this profileActiveloop logoAC

Activeloop

Chief executive officer

Davit Buniatyan

Employees live in

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

2 remote jobs at Activeloop

Explore the variety of open remote roles at Activeloop, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Activeloop

Remote companies like Activeloop

Find your next opportunity by exploring profiles of companies that are similar to Activeloop. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 85,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Activeloop hiring Senior AI Search Engineer • Remote (Work from Home) | Himalayas