Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU.
If you'd like to build the world's best deep learning cloud, join us.
*Note: This position requires presence in our San Francisco, San Jose or Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.
The Lambda Core Services team builds and operates release engineering, cloud automation, and workflow systems for our AI cloud product suite. We provide CI/CD tooling and artifact management to support the build/deploy process for our services. We also automate configuration of our AWS and other SaaS resources and manage AWS usage for all of Lambda engineering. Keeping the internal product and engineering teams moving quickly and delivering quality is what makes us tick.
Along with the Platform Engineering organization, we help to build the foundations that unlock product excellence and a highly reliable experience for our customers.
About the Role:
We are seeking a seasoned Engineering Manager with deep experience in both release engineering and the management of large-scale cloud deployments. You will hire and guide a team of platform engineers in building out critical pillars of our stack. You will lead the team in designing, deploying, scaling, and supporting these solutions.
Your role is not just to manage people, but to coordinate the delivery of platform solutions to engineering customers within Lambda. This is a unique opportunity to work at the intersection of platform engineering and the rapidly evolving field of AI infrastructure.
What You’ll Do
Team Leadership & Management:
Grow/Hire, lead, and mentor a team of high-performing platform engineers and SREs.
Foster a culture of technical excellence, collaboration, and customer service.
Conduct regular one-on-one meetings, provide constructive feedback, and support career development for team members.
Drive outcomes by managing project priorities, deadlines, and deliverables.
Technical Strategy & Execution:
Work with the engineering team to drive strategy for internal CI/CD and Cloud services.
Develop self-service abstractions to make our platform tooling easier to adopt and use.
Lead the broader engineering organization in best-practices adoption of CI/CD, Workflow, and Cloud services.
Manage costs of both vendors and internally developed platforms.
Lead team in the continued development of our existing CI/CD solutions based on Buildkite and Github Actions.
Lead team in the expansion of our Terraform / Atlantis infrastructure automation platform.
Guide Lambda engineering in utilization of AWS services in line with our technical standards.
Guide team in problem identification, requirements gathering, solution ideation, and stakeholder alignment on engineering RFCs.
Identify gaps in our platform engineering posture and drive resolution.
Lead the team in supporting our internal customers from across Lambda engineering.
Cross-Functional Collaboration:
Work closely with Lambda product engineering teams on requirements and planning to meet their needs.
Work to understand the needs of engineering teams and drive our Platform solutions towards self-service.
Manage a short list of vendors that provide SaaS solutions used at Lambda.
You
Experience:
7+ years of experience in either Release Engineering or Platform Engineering with at least 3 years in a management or lead role.
Demonstrated experience leading a team of engineers and SREs on complex, cross-functional projects in a fast-paced startup environment.
Experience managing, monitoring, and scaling CI/CD platforms.
Deep experience using and operating AWS services.
Solid background in software engineering and the SDLC.
Strong project management skills, leading planning, project execution, and delivery of team outcomes on schedule.
Experience building a high-performance team through deliberate hiring, upskilling, performance-management, and expectation setting.
Nice to Have
Experience:
Experience driving cross-functional engineering management initiatives (coordinating events, strategic planning, coordinating large projects).
Experience driving organizational improvements (processes, systems, etc.)
Experience managing AWS service usage across a broader engineering organization.
Experience in AWS spend management.
Experience designing solutions using Temporal workflows; ability to act as an internal consultant for Temporal.
Experience with Kubernetes.
Experience designing scalable distributed systems.
Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
About Lambda
Founded in 2012, ~400 employees (2025) and growing fast
We offer generous cash & equity compensation
Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
Health, dental, and vision coverage for you and your dependents
Wellness and Commuter stipends for select roles
401k Plan with 2% company match (USA employees)
Flexible Paid Time Off Plan that we all actually use
A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lambda is hiring a hands-on Director of Real Estate & Workplace to lead facilities strategy, vendor relationships, and workplace experience for its San Francisco offices in a fast-growing AI infrastructure company.
Egen is hiring a Senior Application Engineer to build scalable, full-stack, cloud-native applications that connect AI/ML models to engaging user experiences.
Snap Inc is hiring a Level 4 Full Stack Engineer to build performant web and mobile experiences and contribute to core engineering systems for Snapchat and AR products.
Lead a high-impact engineering team to design and deliver a modern, scalable SaaS platform while shaping technical roadmap and team culture.
Lead platform engineering initiatives to design, automate, and operate secure, production-ready cloud infrastructure for large-scale services at Palo Alto Networks.
Work with a mission-driven team to build backend systems and ML-enabled APIs that convert classroom audio into meaningful instructional insights for schools worldwide.
Mid-level Java developer needed to build microservices on a greenfield project for Northstrat, requiring TS/SCI clearance and onsite availability in Sterling, VA or Aurora, CO.
Senior Backend Engineer to design and implement scalable, low-latency cloud services for a remote-friendly, mission-driven platform.
Experienced backend engineer sought to design and deliver scalable, secure services for a fast-moving, remote US-based development team.
Design, build, and operate high-performance Go backend services from design to production as part of a small, high-ownership engineering team in New York City.
DMI is seeking a Software Engineer to design, develop, test, and deploy software solutions supporting government and commercial customers in the Tysons Corner/McLean area.
Lead architecture and implementation of a scalable Python-based pricing platform to optimize revenue and monetization across multiple product surfaces in a remote-first fintech environment.
Senior front-end engineer needed to craft secure, high-performance TypeScript/React UIs for Anduril's Intelligence Systems, integrating with embedded platforms and real-time backend services.
Design and ship production generative AI and LLM systems at Promise to improve access to public benefits and automate complex workflows across government and utilities.
Lambda provides Artificial Intelligence and Machine Learning infrastructure to companies like Apple, Intel, Microsoft, MIT, Harvard, the Federal Government, and the DOD. Were headquartered in the Dogpatch and are a short walk from the 22nd Street ...
12 jobs