Browse 36 exciting Retrieval jobs hiring now. Check out companies hiring, such as Guidehouse, Pear VC, and MagicSchool AI, in Worcester, Shreveport, and Madison.
Lead AI/ML Engineer role to build and operationalize secure LLM, RAG, and inference pipelines in GovCloud for high‑security adjudication workflows.
FlowGen Labs is hiring a hands-on Software Engineering Lead to architect and deliver scalable, secure cloud-native and AI-enabled platform services and enterprise integrations.
MagicSchool is hiring a Staff Context Engineer to design and scale context, retrieval, and memory systems that power reliable, token-efficient AI agents for millions of teachers.
Snap is hiring a seasoned C++ engineer to design and optimize large-scale ML infrastructure, feature stores, and retrieval systems that power Snapchat’s ranking and recommendation products.
Lead a team to design, build, and deploy production-grade generative AI and multimodal systems for a global digital innovation agency while working fully remotely.
Build and own full-stack features at Human Delta to help enterprises reliably adopt and govern AI, working closely with founders and early customers in San Francisco.
Lead the design and implementation of knowledge graphs, RAG retrieval pipelines, and memory systems that power MagicSchool's AI agents for millions of educators.
Work as an Applied ML engineer at Basis to design, build, and operate production-grade agent systems that automate accounting workflows and improve via rigorous evaluation and instrumentation.
Arlo is hiring an Applied AI Engineer to design, build, and productionize LLM-driven features that transform how members navigate and access healthcare.
Experienced AI/ML software engineer needed to build and deploy RAG-based chatbots, data pipelines, and cloud-native AI applications for federal missions.
Lead the design and productionization of scalable transformer-based LLMs and multi-agent AI systems to power Seven AI's next-generation cybersecurity platform.
Intuitive is hiring a Staff Agentic AI Developer to architect and build safe, scalable autonomous agent systems for clinical and complex workflows using C#, Python, and cloud-native services.
Auctor seeks a Senior Backend Software Engineer in New York to architect and build scalable backend systems powering AI-first enterprise services.
Work on Agentforce at Salesforce to build scalable generative AI services, integrate LLMs with citations and guardrails, and deploy ML-driven systems used by millions.
Blackbird.AI is hiring a Staff AI/ML Engineer to turn advanced LLM and ML techniques into dependable, explainable production capabilities that operate at scale.
Work on NeMo Retriever to optimize and containerize LLM/MLLM models and build MLOps pipelines that deliver low-latency, production-grade inference for retrieval-augmented AI systems.
Lawrence Livermore National Laboratory is recruiting a Document Operations Specialist to manage and digitize facility records and support records control and compliance for strategic deterrence facilities.
Lead the AI & Results Quality program at a high-growth AI company to define evaluation processes, build reusable quality tooling, and drive measurable improvement across search and generative product surfaces.
Technical Lead (freelance) to design and implement a Vertex AI-based RAG search platform, lead a small engineering team, and establish cloud and DevOps foundations for a short-term, high-impact build.
Senior Backend Engineer needed to design and scale the backend systems that power Junior’s LLM-driven research platform in a fast-moving, in-person NYC startup environment.
Founding AI Engineer to prototype and productionize LLM-driven legal AI systems at Norm Law/Norm Ai, working closely with lawyers and product teams to transform expert workflows.
Lead the design and productionization of agentic AI systems and an evaluation platform to power Night Shift, Flock Safety’s investigator-facing LLM agent product.
Human Delta is hiring an early Forward Deployed Engineer in San Francisco to design and ship reliable, customer-facing AI infrastructure that bridges product, engineering, and operations.
Design and ship large-scale recommendation and ranking systems at Quizlet to personalize discovery and study experiences for millions of learners.
Quizlet is hiring a Senior Applied AI Engineer to build and scale personalization, ranking, retrieval, and LLM systems that drive measurable improvements in learner engagement and retention.
Sequen AI is hiring a Senior Research Engineer to develop and deploy state-of-the-art ranking, embedding, and retrieval models for its personalized discovery platform.
Lead the design and production of large-scale personalization and recommendation systems at Quizlet to improve learning outcomes for millions of students.
Work with NomadicML founders to train and productionize large-scale vision-language models that reason about motion in real-world video for autonomy and robotics.
Lead the technical design and implementation of A1’s foundational LLM systems—training pipelines, inference stacks, and deployment architecture—for a global consumer AI product.
Lead high-priority AI product initiatives at Harvey, owning strategy through launch and partnering closely with founders, engineering, and customers to scale enterprise-grade solutions.
Lead the architecture and delivery of safe, reliable, large-scale ML and agentic systems as a Staff Machine Learning Engineer on the Stripe Assistant team.
Lead the design and delivery of large-scale personalization and recommendation systems at Quizlet, shaping technical direction and mentoring teams to make learning uniquely tailored for millions of users.
Lead product strategy for context, memory, and retrieval systems at MagicSchool to ensure LLM-driven agents behave reliably and effectively for millions of teachers.
Security Risk Advisors is hiring an Enterprise Full Stack Developer to build and scale their AI platform, integrating RAG, agent orchestration, and enterprise APIs across frontend and backend systems.
Lead the technical direction and implementation of large-scale, high-performance retrieval platforms at Pinterest to power recommendations across the product.
Lead the applied LLM systems effort at Plaud to design reasoning pipelines, productionize RAG and memory features, and optimize model inference for reliable, user-centered AI experiences.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 26