Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Frontend Developer — LLM Evaluation & Experiment Visualization image - Rise Careers
Job details

Frontend Developer — LLM Evaluation & Experiment Visualization

Our Client is a well-funded nonprofit research organization focused on measuring frontier AI capabilities—especially agentic / autonomous capabilities and the ability of models to conduct AI R&D, because those capabilities can create outsized societal and security risk if they scale faster than our ability to evaluate and govern them.


Their work is unusually “real-world” compared to typical benchmarks: they build evaluations with high realism and measure performance against skilled-human baselines (often multi-hour tasks), and publish research on how quickly models are improving at completing long tasks.


You’d be building the UI that turns messy LLM evaluation outputs into clear, explorable artifacts that researchers can trust.


What you’ll do

- Build React + TypeScript interfaces for exploring LLM evaluation results and experiment outputs.

- Design and implement data visualizations that make model behavior, metrics, and results easy to inspect.

- Build workflows that support end-to-end traceability of LLM runs (prompts → intermediate steps → decisions → outputs).

- Partner closely with researchers; iterate quickly while balancing clarity, accuracy, and performance.


Tech stack / must-haves

- React + TypeScript

- Hands-on with at least one major visualization library: D3, Plotly, Vega/Vega-Lite, Visx, Three.js, Highcharts, ECharts


Why this matters

- Their mission is to give society and AI labs grounded answers to: “What can frontier models actually do?” and “When do capabilities become dangerous?”

- The team includes researchers and engineers with backgrounds across top AI orgs and programs (e.g., OpenAI, DeepMind, and alumni of OxfordCaltechMIRI, and ML interpretability programs).


Location

- Onsite in the San Francisco Bay Area (relocation sponsored).


Contact [email protected].


Average salary estimate

$170000 / YEARLY (est.)
min
max
$140000K
$200000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Inclusive & Diverse
Empathetic
Take Risks
Transparent & Candid
Feedback Forward
Mission Driven
Collaboration over Competition
Work/Life Harmony
Maternity Leave
Paternity Leave
Snacks
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
401K Matching
Paid Sick Days
Paid Time-Off
Paid Volunteer Time

Help build the Data Experience for Spotify's Backstage Portal by developing TypeScript/React UIs, Node.js backends, and connectors that make enterprise dataset metadata discoverable and actionable.

Photo of the Rise User
Posted 12 hours ago

Lead Orum's frontend architecture and developer experience by owning the design system, shared component libraries, tooling, and performance standards for a high-scale, real-time sales platform.

Photo of the Rise User
Posted 12 hours ago

Pearly is hiring a hybrid NYC-based Software Engineer to build scalable payments and platform infrastructure using TypeScript, GraphQL, and PostgreSQL.

Photo of the Rise User
Posted 2 hours ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Family Medical Leave
Maternity Leave
Paternity Leave
Lactation Facilities
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Time-Off
Paid Volunteer Time

Lead the design and delivery of scalable enterprise SaaS at Veeva Systems as a Principal Full Stack Engineer, building cloud software that accelerates life sciences innovation.

Posted 2 hours ago

Lead the development of scalable, secure back-end services and streaming data pipelines for a fast-growing, data-driven fintech platform focused on syndicated loans.

Photo of the Rise User

Build and own full-stack features at Human Delta to help enterprises reliably adopt and govern AI, working closely with founders and early customers in San Francisco.

Railroad19 is seeking experienced Cloud Full Stack Python engineers to build serverless AWS applications, develop full-stack features with React, and advise clients on enterprise-grade solutions.

Photo of the Rise User
Aretum Hybrid No location specified
Posted 13 hours ago

Aretum is hiring a Senior Software Engineer to lead development of .NET/GraphQL/Postgres-based applications for federal modernization initiatives in a remote Agile setting.

Photo of the Rise User

Lead the development of scalable RL training systems and procedural scenario generation to accelerate safe robotic deliveries at Serve Robotics.

Senior C++ Full-Stack Engineer needed to build and optimize production C++ systems and full-stack tooling for AI data pipelines and evaluation workflows at a fast-moving AI infrastructure company.

Photo of the Rise User
Posted 3 hours ago

Zencore seeks a Principal Architect with strong Google Cloud expertise to lead technical delivery and drive cloud modernization initiatives for enterprise customers.

Photo of the Rise User
ClickHouse Hybrid United States (remote)
Posted 14 hours ago

Senior Cloud Engineer to design and operate secure, highly available ClickHouse Cloud platforms for regulated and mission-critical environments across cloud, hybrid, and on‑prem deployments.

Photo of the Rise User
Posted 15 hours ago

Lead full‑stack development for Elsevier's CK AI Nursing team, building React and Node.js applications within a Micro Focus front-end to support clinical and educational workflows.

MATCH
Calculating your matching score...
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
December 27, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!