Browse 65 exciting jobs hiring in Data Evaluation now. Check out companies hiring such as Alignerr, People Data Labs, Bioscope AI in Columbus, Dallas, Grand Rapids.
Alignerr seeks a Principal Python Engineer to design and optimize high-performance ML data, annotation, and evaluation infrastructure for leading AI labs on a remote contract basis.
Experienced Solutions Engineer needed to lead technical evaluations and act as a trusted advisor for customers buying people and company data at PDL.
Experienced Rust systems engineer needed to build and optimize high-performance AI data pipelines and full-stack tooling for model evaluation and data-quality workflows.
Lead the design and implementation of evaluation pipelines, uncertainty quantification, and QA systems to ensure clinical AI outputs are reliable and trustworthy.
Lead the measurement and experimentation strategy for Scribd's user-generated content ecosystem, turning ambiguous product questions into rigorous, high-impact analytics and AI evaluation.
Lead data reporting, quantitative research, and CRM management at NYC's Office of Financial Empowerment to measure and advance financial health for New Yorkers.
Acelero Learning is hiring a Reporting & Evaluation Manager to design and deliver KPIs and operational reports that inform program and business decisions in a remote capacity.
Provide technical leadership for design, integration, and sustainment of airborne and ground instrumentation systems to support Air Force test and evaluation at Nellis AFB.
Experienced Rust engineer wanted to build and optimize high-performance ML data and evaluation infrastructure for Alignerr on a remote, part-time to near-full-time contract basis.
Alignerr is hiring a Senior C++ Full-Stack Engineer to build and optimize production C++ systems that power AI data pipelines and evaluation tooling on a remote, contract basis.
Alignerr is hiring a Senior Python Full-Stack Engineer (contract, remote) to build reliable, high-performance backend and tooling for large-scale data annotation and AI evaluation workflows.
Alignerr is hiring a Senior C# Full-Stack Engineer to build and optimize backend services and tooling for AI data pipelines and evaluation workflows on a remote, contract basis.
Senior Rust Backend & Tooling Engineer (contract, remote) to build high-performance services and developer tooling for AI data pipelines and evaluation workflows at Alignerr.
Senior C# Full-Stack Engineer (contract, remote) to build high-performance C# systems and full-stack tooling for AI data, evaluation, and annotation pipelines.
MORSE Corp is hiring a cleared Data Scientist to evaluate, develop, and transition machine‑learning algorithms for national security applications working on challenging, multidisciplinary T&E efforts.
Provide technical leadership to ARPA-H and external performers by designing and validating healthcare datasets and evaluation frameworks for AI-driven rare disease diagnostics.
Boeing is looking for Software Engineers specializing in LLMs to design, implement, and integrate AI/ML capabilities into aerospace and autonomy systems at its Tukwila, WA site.
Lead the integration, qualification, and operational test campaigns for X-BAT aircraft at Shield AI, guiding multi-disciplinary teams from design through first flight and operational readiness.
Lead a team of Impact Managers and school partnerships to drive measurable student impact, ensure AmeriCorps compliance, and strengthen school-community relationships for City Year Philadelphia.
Risant Health is hiring a Washington, D.C.-based Compensation and Benefits Specialist to manage total rewards programs, administer benefits, and support compensation strategy and compliance.
Join TheIncLab as a Machine Learning Engineer to develop, train, and evaluate ML models (PyTorch/TensorFlow) in a hybrid, mission-focused environment supporting defense and aerospace systems.
Work remotely as a contract Senior Python Full-Stack Engineer to build scalable evaluation and data infrastructure powering model training, benchmarking, and quality assurance at Alignerr.
Arcade seeks an experienced technical lead to build and run DataOps, annotation, and evaluation systems that power generative AI-driven product design.
A Senior C++ Full-Stack Engineer is needed to build and optimize high-performance C++ systems and full-stack tooling for large-scale AI data and evaluation infrastructure on a flexible remote contract.
Experienced Python full-stack engineer wanted to build and optimize backend services and tooling for AI data pipelines and evaluation workflows at Alignerr (remote, contract, 20–40 hrs/week).
Work remotely as a Senior C++ Full-Stack Engineer at Alignerr to build and optimize high-performance C++ services and tooling for AI data pipelines and evaluation workflows.
Alignerr is looking for a Senior Rust Full-Stack Engineer to design and optimize high-performance Rust systems and tooling that power AI data pipelines and model evaluation workflows.
Senior Rust Full-Stack Engineer needed to build and optimize production-grade AI data pipelines and tooling for model training and evaluation at a remote-first AI infrastructure firm.
Senior Python full-stack engineer sought to build and optimize scalable AI data pipelines and tooling for Alignerr’s model training and evaluation workflows on a part-time remote contract.
A senior C++ full-stack systems engineer is needed to build reliable, high-performance infrastructure and tooling for AI data pipelines and evaluation workflows at a remote-first AI-focused company.
Work remotely as a senior C++ engineer building and optimizing high-performance systems and full-stack tooling for AI data pipelines and evaluation workflows.
A senior Rust engineer is needed to build and optimize high-performance ML data and evaluation infrastructure for Alignerr’s AI research and production workflows on a part‑time remote contract.
Lead and execute high-impact AI data programs at Handshake, coordinating large distributed teams and partnering with frontier AI labs to drive revenue, quality, and scalable delivery.
Active Minds is hiring a Manager of Equity and Inclusion to design trainings, lead Employee Network Groups, manage equity projects, and analyze DEI data to advance mental-health equity across the organization.
Mercor is hiring a Research Engineer to develop post-training, RLVR, and large-scale evaluation pipelines that materially improve LLM behavior in production settings.
Experienced full-stack engineer needed to build and optimize C# systems and tooling that power large-scale AI data and evaluation workflows at Alignerr.
Handshake is hiring experienced commercial pilots to remotely evaluate and refine AI model outputs using real-world aviation expertise.
Lead production-grade LLM and AI agent development at Everstar to accelerate nuclear deployment through rigorous evals, fine-tuning, and synthetic data pipelines.
Support Entrata’s Brand Impact team as a Marketing Intern focused on research, evaluation, and strategic planning for social impact partnerships nationwide.
Work as a contract Senior C++ Full-Stack Engineer building high-performance C++ systems and full-stack tooling to support large-scale AI data, annotation, and evaluation workflows for leading labs.
Experienced C# backend engineer needed to build and optimize high-performance services and full-stack tooling for AI data pipelines and evaluation workflows at Alignerr.
Alignerr is hiring a Senior C# Full-Stack Engineer to build high-performance backend services and tooling for AI data pipelines and evaluation systems on a remote, contract basis.
Senior Python systems engineer needed to build and optimize distributed ML data pipelines and evaluation tooling for Alignerr on a 20–40 hour/week remote contract.
Alignerr is hiring a Senior Rust engineer to build high-performance backend services and developer tooling for AI data pipelines and evaluation workflows on a part-time contract basis.
Work on production-grade data ingestion, UI components, evaluation systems, and agent tooling at a small engineering company focused on automating pre-construction engineering workflows.
MagicSchool seeks a Senior LLM Quality Analyst to lead prompt engineering experiments, maintain quality reporting, and ensure AI outputs meet educators' classroom needs.
Experienced Python backend engineer needed to build and optimize scalable data and evaluation tooling for cutting-edge AI research workflows at Alignerr.
Senior Rust systems engineer needed to architect and optimize high-performance backend and tooling for AI data pipelines and evaluation workflows at Alignerr.
City Year New York seeks a Talent Pathways Senior Manager to launch and run an 18-month Work Study Pilot that recruits and supports college interns while building college and agency partnerships and measuring program impact.
Alignerr seeks a senior C++ full-stack engineer to build high-performance backend and tooling for AI data pipelines and model evaluation on a flexible remote contract.
Below 50k*
1
|
50k-100k*
0
|
Over 100k*
0
|