Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + meaningful equity (founding tier)
Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.
About the Role
We’re designing the future of enterprise AI infrastructure — grounded in agents, retrieval-augmented generation (RAG), knowledge graphs, and multi-tenant governance.
We’re looking for an ML/AI Research Engineer to join our AI Lab and lead the design, training, evaluation, and optimization of agent-native AI models. You'll work at the intersection of LLMs, vector search, graph reasoning, and reinforcement learning — building the intelligence layer that sits on top of our enterprise data fabric.
This isn’t a prompt engineer role. It’s full-cycle ML: from data curation and fine-tuning to evaluation, interpretability, and deployment — with cost-awareness, alignment, and agent coordination all in scope.
Core Responsibilities
Fine-tune and evaluate open-source LLMs (e.g. LLaMA 3, Mistral, Falcon, Mixtral) for enterprise use cases with both structured and unstructured data
Build and optimize RAG pipelines using LangChain, LangGraph, LlamaIndex, or Dust — integrated with our vector DBs and internal knowledge graph
Train agent architectures (ReAct, AutoGPT, BabyAGI, OpenAgents) using enterprise task data
Develop embedding-based memory and retrieval chains with token-efficient chunking strategies
Create reinforcement learning pipelines to optimize agent behaviors (e.g. RLHF, DPO, PPO)
Establish scalable evaluation harnesses for LLM and agent performance, including synthetic evals, trace capture, and explainability tools
Contribute to model observability, drift detection, error classification, and alignment
Optimize inference latency and GPU resource utilization across cloud and on-prem environments
Desired Experience
Model Training:
Deep experience fine-tuning open-source LLMs using HuggingFace Transformers, DeepSpeed, vLLM, FSDP, LoRA/QLoRA
Worked with both base and instruction-tuned models; familiar with SFT, RLHF, DPO pipelines
Comfortable building and maintaining custom training datasets, filters, and eval splits
Understand tradeoffs in batch size, token window, optimizer, precision (FP16, bfloat16), and quantization
RAG + Knowledge Graphs:
Experience building enterprise-grade RAG pipelines integrated with real-time or contextual data
Familiar with LangChain, LangGraph, LlamaIndex, and open-source vector DBs (Weaviate, Qdrant, FAISS)
Experience grounding models with structured data (SQL, graph, metadata) + unstructured sources
Bonus: Worked with Neo4j, Puppygraph, RDF, OWL, or other semantic modeling systems
Agent Intelligence:
Experience training or customizing agent frameworks with multi-step reasoning and memory
Understand common agent loop patterns (e.g. Plan→Act→Reflect), memory recall, and tools
Familiar with self-correction, multi-agent communication, and agent ops logging
Optimization:
Strong background in token cost optimization, chunking strategies, reranking (e.g. Cohere, Jina), compression, and retrieval latency tuning
Experience running models under quantized (int4/int8) or multi-GPU settings with inference tuning (vLLM, TGI)
Preferred Tech Stack
LLM Training & Inference: HuggingFace Transformers, DeepSpeed, vLLM, FlashAttention, FSDP, LoRA
Agent Orchestration: LangChain, LangGraph, ReAct, OpenAgents, LlamaIndex
Vector DBs: Weaviate, Qdrant, FAISS, Pinecone, Chroma
Graph Knowledge Systems: Neo4j, Puppygraph, RDF, Gremlin, JSON-LD
Storage & Access: Iceberg, DuckDB, Postgres, Parquet, Delta Lake
Evaluation: OpenLLM Evals, Trulens, Ragas, LangSmith, Weight & Biases
Compute: Ray, Kubernetes, TGI, Sagemaker, LambdaLabs, Modal
Languages: Python (core), optionally Rust (for inference layers) or JS (for UX experimentation)
Soft Skills & Mindset
Startup DNA: resourceful, fast-moving, and capable of working in ambiguity
Deep curiosity about agent-based architectures and real-world enterprise complexity
Comfortable owning model performance end-to-end: from dataset to deployment
Strong instincts around explainability, safety, and continuous improvement
Enjoy pair-designing with product and UX to shape capabilities, not just APIs
Why This Role Matters
This role is foundational to our thesis: that agents + enterprise data + knowledge modeling can create intelligent infrastructure for real-world, multi-billion-dollar workflows. Your work won’t be buried in research reports — it will be productionized and activated by hundreds of users and hundreds of thousands of decisions. If this is your dream role - we would love to hear from you.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Fabrion is seeking a founding Product Designer to craft AI-driven, scalable workflows for industrial enterprise collaboration in the San Francisco Bay Area.
Innovative AI startup in the San Francisco Bay Area is looking for a seasoned Data Engineer to create scalable data fabrics and semantic infrastructures fueling next-gen enterprise AI applications.
AbbVie is seeking an experienced Principal Research Scientist I in North Chicago to lead peptide synthesis process development from research through commercialization.
Drive innovation in filmic pressure-sensitive labels as a Product Development Specialist at Avery Dennison’s North America R&D team.
Drive AI innovation as Lead Research Scientist focused on video and audio generative technologies at Canva’s San Francisco hub.
Lead clinical development programs for innovative consumer eye care products at AbbVie, shaping scientific strategies and collaborations.
Analytical Separations Scientist role at Eurofins Scientific in Malvern, PA, focusing on advanced analytical testing and method validation.
Contribute to innovative neurotechnology as a Biocompatibility Intern developing and optimizing in-vitro biocompatibility assays at Neuralink.
Seeking an experienced Open Source Technology Innovation Analyst to enhance intelligence capabilities for critical federal missions at Agile Defense.
Conduct neuroscience research involving electrophysiology and mouse behavioral studies in a dynamic university laboratory setting.
Kimley-Horn is looking for a proactive Land Planning Analyst to contribute to urban and land use planning projects in their Austin office.
Innovate and lead valve technology development as a Senior R&D Engineer within Valmet’s dynamic team in Shrewsbury, MA.
Lead cutting-edge packaging R&D projects at Kraft Heinz, driving innovation and sustainability in packaging technologies with global impact.
Licensed Landscape Architect with project management expertise sought for a hybrid role at Design Workshop’s Austin studio to lead teams and manage key client projects.
Lead clinical development strategy and oversee global clinical trials as Senior Director at Noven Pharmaceuticals.