Compute is a commodity. We think people should buy it like one.
Startups shouldn’t be forced to buy a year’s worth of compute time in order to get market rate and compute providers shouldn’t go bankrupt because they can’t fully book their clusters.
At SF Compute, our goal is to solve this issue the same way this was solved for every other commodity — by building a venue where compute contracts are traded in real-time and by bringing a new kind of participant into the supply chain, traders.
If we succeed, buyers will be able to get a good price for any order, whether it’s 32 H100s for a month or 8,000 H100s for an hour, and sellers will instantly book out their clusters because traders will speculatively buy them, for a spread. Every FLOP will flow through us somewhere in the supply chain. What Brent is for oil, we will be for compute.
As an engineer working on Large Scale Inference you will build and scale software systems that accept, distribute, and purchase compute for inference work loads. We're working with a cutting edge technology company as our partner. You will play a key role in building out and defining the inference products at SFC. You will work closely with a small competent team driving high impact revenue generating changes. You will have the opportunity to deliver the highest quality, most affordable inference products at scale.
On a day-to-day basis, you might…
Quantify, optimize, and visualize the unit economics of inference
Design solutions to maximize compute utilization
Create automated compute purchasing software to optimally fulfill inference job demand
Scale inference software pipeline systems to hundred trillion token scale
This role may be a good fit for you if…
You enjoy the craftsmanship of software
You’re a thoughtful high-agency engineer
Have a deep appreciation for reliable fault tolerant systems
Are a strong communicator and work well with others
Have SQL experience
Bonus points if you…
Have previously worked at a startup
Have interest in market dynamics
Have previously scaled distributed work scheduling systems
Are comfortable working in Rust
Have some Kubernetes or terraform experience
Team members are offered a competitive salary along with equity in the company
Yes, we sponsor visas and work permits
We match 401(k) plans up to 4%
We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums
We offer unlimited paid time off as well as 10+ observed holidays
We offer biological, adoptive, and foster parents paid time off to spend quality time with family
We cover lunch daily for employees
You can buy as many books for the office as you want
The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment.
We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethical origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.
We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Francisco’s Fair Chance Ordinance and California’s ban-the-box laws.
If you require reasonable accommodation for any reason, please reach out to us at [email protected].
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Roblox is hiring a Senior Software Engineer - Release to develop and operate multi-platform build, CI, and release automation for the Game Engine team.
Senior full-stack engineer needed to design and ship accessible, scalable web experiences and serverless backends for a global edtech company, based in Utah.
Build and ship production-grade Solidity smart contracts and supporting backend systems for a leading decentralized finance platform with global offices in NYC, Denver, and the Cayman Islands.
Experienced SailPoint IdentityNow developer needed to implement advanced workflows, integrations, and governance controls for a D.C.-based IT consulting firm.
Kanopi Studios seeks a remote WordPress Engineer to develop custom themes, plugins, and Gutenberg blocks while contributing to performant, accessible, and well-documented client websites.
Help build Jerry.ai’s AllCar™ app as an entry-level Software Engineer, writing production code across frontend and backend services while learning from experienced engineers in a fast-growing startup.
Visa seeks a hybrid Software Engineer in Highlands Ranch to design, build, and maintain scalable payment systems and web services used worldwide.
First Resonance is seeking a Senior Site Reliability Engineer to lead SRE practices, automate CI/CD and infrastructure, and ensure the ION platform is scalable and highly reliable for advanced mobility customers.
Mastercard’s Ethoca team is hiring a Senior Principal Software Engineer to lead architecture and hands-on delivery of large-scale, cloud-native payment systems.
Mapbox seeks a Software Development Engineer II to develop scalable, high-throughput backend APIs and services for Maps, Traffic, and MapGPT using Python, TypeScript/Node.js, and AWS.
Experienced C#/.NET engineer needed to design and build cloud-native SaaS solutions for Geoforce's GPS asset tracking platform, working remotely with up to 25% national travel.
GDIT is hiring a senior Azure-focused Cloud Developer Advisor to architect secure, scalable cloud and data platforms and advise federal programs on DevSecOps, migrations, and operational readiness.
Experienced SRE leader needed to architect secure, automated cloud platforms and champion reliability, observability, and CI/CD best practices for a Washington, D.C. IT consultancy.
A large, low-cost H100 cluster you can rent by the hour
2 jobs