DevOps Engineer
Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + meaningful equity (founding tier)
Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.
About the Role
We're building an AI-native, multi-tenant enterprise platform for complex domains in industrial verticals. In this architecture, DevOps isn't just about shipping features — it's about operationalizing intelligent agents, ensuring traceability across AI systems, and supporting mission-critical ML infrastructure at scale.
We're looking for a DevOps engineer who can own infrastructure from Day 1, automating everything from CI/CD and observability to cloud governance and security. You’ll work with a highly technical team building real-time AI pipelines and multi-agent systems. If you want to be the person who keeps the platform fast, secure, reliable, and explainable, this is your role.
Responsibilities
Build and maintain scalable cloud infrastructure across AWS/GCP/Azure with a focus on secure, tenant-isolated deployments
Own and evolve CI/CD systems (e.g. GitHub Actions, ArgoCD) with progressive rollout, testing, and rollback flows
Establish observability tooling across services, agents, and pipelines (OpenTelemetry, Prometheus, Grafana, Sentry)
Implement policy-as-code (OPA, Rego) for deployment safety, RBAC, audit logging, and approval workflows
Define and enforce SLAs, uptime targets (99.99%+), incident response, and remediation workflows
Secure infrastructure: IAM, VPC, encryption, key management, image scanning, secrets rotation
Automate deployments, infrastructure provisioning (Terraform, Helm), and environment replication
What We’re Looking For
Core Experience:
4 to 10+ years in DevOps, platform engineering, or SRE roles on production-grade systems
Strong experience with Docker, Kubernetes (EKS/GKE), Terraform or Pulumi
Hands-on experience deploying and monitoring distributed cloud-native systems
Familiar with GitOps practices, CI/CD design, progressive delivery, and secure SDLC
Clear understanding of how to implement monitoring, alerting, and failure simulation in dynamic environments
Engineering Mindset:
Obsessed with reliability, latency, uptime, and repeatability
Security-aware and compliance-conscious
Proactive — you don’t wait for alerts to fix things
Comfortable collaborating with backend, AI, and data teams
Bonus: Agent-Native / MLOps Capabilities
We’re building an agentic, AI-native platform from the ground up. Experience here isn’t required, but would be a strong differentiator:
Experience running LLM orchestration frameworks (e.g. LangChain, LangGraph, Dust, ReAct agents)
Building retrieval-augmented generation (RAG) pipelines — and deploying them safely and repeatably
Familiarity with vector DBs (Weaviate, Qdrant, Pinecone) and embedding pipelines
Monitoring and governing long-running or multi-agent chains
Auditability and replay systems for agent decision-making
Serving fine-tuned or open-source LLMs with model versioning and GPU scaling (e.g. vLLM, TGI)
Interest in auto-remediation using agents (e.g. observability + alert → insight → response via LLM)
Why This Role Matters
DevOps is the nervous system of the platform — every agent, every data fabric component, every pipeline flows through what you build. This is a rare opportunity to design that system early, the right way, and future-proof it for scale, compliance, and trust.
If you're excited by intelligent systems, distributed data, and deeply technical infrastructure problems — and you want your work to have immediate real-world impact — we’d love to hear from you.