Job details

Founding Engineer (AI / ML)

Our Company

Pax Historia is defining a new category of gameplay using the latest advancements in generative AI. Our platform brings together the depth of grand strategy with the creative freedom of a sandbox experience, all fueled by a passionate community that creates and remixes scenarios on our platform.

Our community publishes hundreds of scenarios per day, plays millions of rounds a week, and is growing quickly. In addition, we’re backed by Y Combinator, Pace Capital, and Z Fellows. Your work will immediately ship to a product used by hundreds of thousands of players.

The Role

We’re hiring a founding-level ML systems engineer to work in person, full-time, in San Francisco (Dogpatch). You will report directly to the cofounders.

Our current position:

The latest closed source models play our game with reasonably good quality, but they’re incredibly expensive.
Open source models are much more affordable, but users almost never select them because their performance on our platform is poor.
Prompts and harnesses are largely identical between models.
We have a working internal eval system (with vast room for improvement).

What you’ll do:

Build and run the infrastructure needed to rigorously tailor harnesses and prompts to each AI model individually to squeeze out maximum performance.
Train domain-specific models to close or even eliminate the gap between open and closed models in their weight class at playing Pax.
Reduce costs associated with closed source models by optimizing caching strategies.
Further improve performance of closed source models by training tuned endpoints.
Evaluate and improve embedding and reranker performance in the places we use them.
Enable entirely new user experiences based on upcoming world models.

TL;DR: Your work will directly make the game more affordable and more fun.

Resources you’ll have:

Trillions of tokens of prompt and response logs from millions of gameplay trajectories.
Tens of thousands of user preference votes per day (coming soon; pairing algorithm ideas described here).
Generous access to compute (six-figure budget now, with a pathway to seven figures if results are promising).
Points of contact with many of the teams pushing the envelope of inference at scale (Chutes, OpenRouter, CanopyWave, and more).

How performance will be measured:

We understand that results may take months to materialize. Your north star metric will be user-preference win-rate: how often players prefer your system over off-the-shelf options running on the same inference budget.
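As a rough illustration of that metric, here is a minimal sketch of how a win-rate could be computed from pairwise preference votes. The vote labels, the function names, and the choice to drop ties are all assumptions made for this example, not a description of our actual eval system. A Wilson score interval is included because a raw win-rate on a small vote count can be misleading.

```python
from math import sqrt


def win_rate(votes):
    """Return (win rate, decisive vote count) for the candidate system.

    `votes` holds hypothetical labels: "candidate", "baseline", or "tie".
    Ties are dropped here, which is one assumption among several possible.
    """
    votes = list(votes)
    wins = sum(1 for v in votes if v == "candidate")
    losses = sum(1 for v in votes if v == "baseline")
    decisive = wins + losses
    if decisive == 0:
        raise ValueError("no decisive votes")
    return wins / decisive, decisive


def wilson_interval(p, n, z=1.96):
    """Wilson score interval for a proportion p over n trials (z=1.96 is roughly 95%)."""
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Example with made-up counts: 60 wins, 40 losses, 10 ties.
votes = ["candidate"] * 60 + ["baseline"] * 40 + ["tie"] * 10
rate, n = win_rate(votes)
low, high = wilson_interval(rate, n)
print(f"win rate {rate:.2f} over {n} decisive votes, interval ({low:.3f}, {high:.3f})")
```

With these made-up counts the lower bound of the interval sits only just above 50%, which is why a comparison like this usually needs far more than a hundred votes before it says much.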

This is an intensive role, and you should expect to work around 50-60 hours per week for the first few months; after that, hours may begin to decrease. There is potential for slight flexibility (i.e., one hybrid day per week), but we have a strong preference for candidates who can commit to in-person work.

Culture

Most of our players have discovered Pax Historia organically (friends, YouTube reviews) and have stuck around because we truly care about the game we’re building. That’s why we want every one of our employees to care deeply about our product too. History, fantasy, or sci-fi nerds are especially welcome, but if you can articulate why you’d be excited to work on our game, we’d love to hear from you.

Pax Historia is still an extremely small company, so you’ll be working directly with the cofounders and a few other employees. You should be self-driven, a team player, and willing to advocate for your ideas. The cofounders will not be hand-holding: their leadership strategy is to ‘get out of the way’ of employees to let them do their best work.

Finally, flexibility is also very important. Since we are scaling very rapidly and still working with a small team, you should come to work willing to help solve a variety of problems on the fly.

Your Qualifications

Core Competencies:

You have shipped ML systems to real users and operated them in production.
You have made explicit cost/quality tradeoffs in deployed systems.
You have debugged and fixed unexpected model failures in production (e.g., expert hot-spots or structured output errors).
You have designed, critiqued, or iterated on evaluation frameworks and understand their failure modes.

Product & Ownership Mindset

You bias toward leverage and compounding improvements (better evals, better feedback loops, better infrastructure).
You are willing to work on the “boring” but important problems like instrumentation, data hygiene, debugging, and reliability.
You take ownership of problems and are comfortable advocating for your ideas (while remaining open to evidence).
You know when to say “no” to yourself and us when something isn’t worth the complexity or risk.

Nice to Have

Experience with preference modeling, pairwise ranking, or human-in-the-loop evaluation systems.
Background in games, simulations, storytelling systems, or other domains where qualitative judgment matters.
Experience operating systems at high request volume.
Prior work at an early-stage startup or as a founding engineer.

What We Don’t Require

A specific degree, academic pedigree, or publication record.
Prior game industry experience.
Perfect knowledge of every technique listed above.

Compensation

The salary range is $150,000 to $240,000, depending on your relevant qualifications and experience. For truly exceptional fits (senior/staff-level), we may be willing to go above the posted range.

Selected candidates can expect to receive 0.25% to 1+% equity, vesting monthly over four years with a 12-month cliff. We will also offer a non-matching 401(k) plan.

This job listing is for a W-2 employee position. We are unfortunately unable to sponsor visas (other than O-1) at this time. Pax Historia is an equal opportunity employer and does not discriminate on the basis of race, color, religion, sex, gender identity or expression, sexual orientation, national origin, age, disability, or veteran status.





Funding: Seed

Department: Software Engineering

Seniority level: Senior Level

Team size: No info