About Etched
Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents.
Job Summary
Sohu pushes the boundaries of what’s possible in AI model performance. But Sohu is just our first product - we are looking for talented software engineers to help us build the software, systems, and infrastructure that let us build specialized ASICs faster than any other company on Earth.
This is a broad role. We are looking for hungry, talented software engineers who can contribute to many parts of the systems that make Etched run. You’ll lead the charge in designing and scaling the distributed systems that power our development and deployment pipelines. This is an opportunity to invent high-performance systems that orchestrate fleets of machines across hybrid superclusters, simulate trillions of operations, and move code from idea to silicon faster than ever before.
While this role will work closely with infrastructure (part of the job will be setting up physical supercomputing systems), candidates for this position do not necessarily need infrastructure engineering experience. If you’re obsessed with building robust software for developers to use, designing systems that harness massive compute, and solving problems most companies never encounter, we’d love to talk.
What you’ll do
Build Reliable, High-Performance Systems for Engineering at Scale: Design the core frameworks and workflows that enable Etched’s engineers to iterate faster than anyone else in hardware. This includes building software that drives hybrid superclusters, with an extremely high compute-to-engineer ratio – your work will orchestrate thousands of machines to simulate, verify, and build chips continuously, with performance and correctness guaranteed.
Engineer World-Class Continuous Integration for Hardware: Invent and implement fault-tolerant, high-throughput CI pipelines for our entire backend – simulation, synthesis, CDC, and more. You’ll help build systems that surface correctness and performance regressions across nightly flows, and make high-assurance silicon development as continuous and automated as software deployment.
Scale Bare-Metal Superclusters and Hybrid Infrastructure: Work hands-on with on-prem systems to design compute infrastructure purpose-built for AI chip development. You’ll be orchestrating physical clusters across racks and clouds, thinking deeply about CPU/memory topology, EDA workloads, and how to saturate every core, all the time.
Design High-Quality Tools That Engineers Love Using: EDA workflows are stuck in the early 2000s – non-reproducible, slow, and hard to use. You’ll build from first principles: new interfaces, new dev workflows, and new execution engines that treat compute as cheap and time as sacred. Your tools will turn hours into minutes for the rest of our team.
Representative projects
Tools for Everything: Build a fully automated, correctness-first CI system that continuously runs performance regressions, CDC checks, simulations, and synthesis flows. Make chip verification as fast and seamless as shipping web code.
Massively Parallel Workload Scheduler: Design and deploy a dynamic scheduler for orchestrating compute-heavy workloads across thousands of on-prem and cloud cores. It should balance throughput, reliability, and fault tolerance while abstracting the underlying hardware complexity from users.
On-Demand, GPU-Backed Dev Environments: Create infrastructure that launches reproducible, zero-downtime interactive environments – backed by GPU nodes and hardened against flaky hardware. Developers should be able to spin up environments instantly, run massive workloads, and never worry about cluster volatility.
Scale-Aware Workload Migration: Prototype systems that intelligently migrate CI and simulation workloads across heterogeneous infrastructure, optimizing for performance and minimizing downtime—especially under variable hardware availability and cost constraints.
High-Fidelity Fault Simulation: Develop fault injection and synthetic testing frameworks that simulate hardware failures, degraded networking, and extreme load conditions – validating that our pipelines remain reliable under real-world stress.
You may be a good fit if you
Have Strong Systems Programming Skills: You’re fluent in languages like Go, Rust, or C++ and have familiarity with scripting as well.
Embrace Deep Technical Challenges: You’re not afraid to dive into kernel internals, debug tricky hardware-software interactions, or design low-latency pipelines from first principles.
Are Obsessed With Developer Productivity: You want to build tools that help great engineers move faster and with greater confidence.
Treat Infrastructure as Code: You believe great infrastructure is software – clean, testable, versioned, and designed for maintainability.
Strong candidates may also have experience with
Designing and operating large-scale CI systems, build farms, or simulation clusters.
Architecting hybrid infrastructure across bare-metal and cloud providers.
Optimizing compute environments for performance-critical workloads, including memory-bound simulations and multi-node orchestration.
Working in environments where correctness, reproducibility, and performance are existential requirements.
Base Compensation Range
$150,000 - $275,000
Benefits
Full medical, dental, and vision packages, with generous premium coverage
Housing subsidy of $2,000/month for those living within walking distance of the office
Daily lunch and dinner in our office
Relocation support for those moving to San Jose (Santana Row)
How we’re different
Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.
We are a fully in-person team in San Jose (Santana Row), and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Contribute your expertise in geometric modeling as a Senior Software Engineer at Higharc, a funded startup redefining home design and construction.
Software Engineer needed at Bank of America to develop, test, and deploy software solutions supporting business objectives within a hybrid work model.
Drive innovation in conversational AI as a Senior Software Developer at Susquehanna, leveraging your expertise in AI/ML and software engineering to optimize internal enterprise systems.
Experienced software engineer needed to lead lifecycle management and cross-team coordination for software releases at CVS Health.
Aptiv seeks a Localization Software Developer to design and implement cutting-edge localization algorithms for automotive safety and connectivity.
Pioneer AI-driven legal tech innovation as a Full Stack Engineer with Thomson Reuters’ Innovation Team, developing end-to-end software solutions in a hybrid work setting.
Contribute to Fable Security's AI-driven cybersecurity platform as a Software Engineering Intern building innovative backend and full-stack features.
Foodsmart seeks a technically skilled Senior Software Engineer - Backend to develop scalable APIs and cloud-based backend solutions in a fully remote environment.
Help transform healthcare by building cutting-edge AI solutions as a Fullstack AI Engineer at Freed in San Francisco with a hybrid work model.
Senior Software Engineer in Test Automation needed at Palo Alto Networks to build and automate testing for cutting-edge cybersecurity cloud services.
Suno seeks a skilled Machine Learning Infrastructure Engineer to develop and optimize cutting-edge ML systems in a music-focused AI platform.
Contribute to Voxel’s innovative platform by developing scalable and secure backend systems powering real-time industrial video analytics for workplace safety.
Experienced software engineer needed at NVIDIA to develop scalable GPU management solutions in Kubernetes environments.
by burning the transformer architecture into our chips, we’re creating the world’s most powerful servers for transformer inference.
14 jobs