The Role
We’re searching for Staff Compiler Engineers to architect and build the ML backend (compiler, runtime, and debugger) for our next-generation OTPUs. You will own integration with PyTorch, TensorFlow, JAX, and MXNet down to our low-level kernel drivers. Your mission will be to create seamless support for a broad ecosystem of large AI models, and to ensure we are pushing the limits of their performance envelope by partnering closely with hardware and modelling teams to understand device trade-offs.
Responsibilities
Ownership: Define and deliver the technical vision and roadmap for your team, unlocking the strategic technical and business goals essential to the success of Flux.
Collaboration: Partner closely with hardware engineers to align compiler, runtime, and debugger requirements with the OTPU design; ensure software and hardware are designed together to deliver maximum performance.
Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware.
Optimize Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximize throughput.
Mentor & Define Standards: Lead code reviews, coach peers, and define best practices in ML backend and performance engineering for your team.
Guide Technology Direction: Stay ahead of GPU, AI accelerator, and optical computing trends; propose and prototype innovations.
Skills & Experience
7+ years of experience in software engineering with a focus on C/C++ programming.
Extensive experience in ML framework internals, compilers, low-level programming, and optimisation techniques.
Extensive experience optimising TensorFlow, PyTorch, or JAX deep learning models.
Extensive experience with multiple toolchains such as LLVM, OpenXLA/XLA, MLIR, and TVM.
Practical experience applying machine learning in high-performance computing contexts.
Strong problem-solving skills and the ability to think critically and creatively.
Experience in fast-paced, dynamic work environments.
Excellent teamwork and communication skills, with the ability to collaborate effectively with cross-functional teams.
Bachelor's degree in computer science, electrical engineering, telecoms engineering, mathematics, or a related field.
Personal projects are a key differentiating factor and hold more weight than other requirements.
Compensation & Benefits
Starting salary: $275,000 - $336,000, depending on experience.
Generous stock options in a rapidly growing AI company
Based in our office in central San Francisco
To foster collaboration in our high-growth environment, we require all employees to work from our SF office and live within a 45-minute commute. We offer an additional incentive of $24,000 per year for those living within 20 minutes.
Due to U.S. export control regulations, candidates’ eligibility to work at Flux depends on their most recent citizenship or permanent residency status. We are generally unable to consider applicants whose most recent citizenship or permanent residence is in certain restricted countries (currently including Iran, North Korea, Syria, Cuba, Russia, Belarus, China, Hong Kong, Macau, and Venezuela). Applicants who have subsequently obtained citizenship or permanent residency in another country not subject to these restrictions may still be eligible.
We do not accept unsolicited CVs from recruitment agencies, will not be liable for any fees, and prohibit unauthorised use of our company name in recruitment activities.