
Staff AI Research Scientist - Evaluation, Handshake AI

About Handshake AI

Handshake is building the career network for the AI economy. Our three-sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired.

Handshake AI is a human data labeling business that leverages the scale of the largest early career network. We work directly with the world’s leading AI research labs to build a new generation of human data products. From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain-specific data and evaluation at scale.

This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts.

Now’s a great time to join Handshake. Here’s why:

  • Leading the AI Career Revolution: Be part of the team redefining work in the AI economy for millions worldwide.

  • Proven Market Demand: Deep employer partnerships across Fortune 500s and the world’s leading AI research labs.

  • World-Class Team: Leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, just to name a few.

  • Capitalized & Scaling: $3.5B valuation from top investors including Kleiner Perkins, True Ventures, Notable Capital, and more.

About the Role

As a Staff Research Scientist, you will drive frontier research on how we define the intelligence of frontier models: developing benchmarks and measurements that help the research community understand how large language models (LLMs) understand, reason, and interact with human knowledge. You will:

  • Lead teams of researchers to produce original research in LLM evaluation methodologies, interpretability, and human-AI knowledge alignment.

  • Develop novel frameworks and assessment techniques that reveal deep insights into model capabilities, limitations, and emergent behaviors.

  • Collaborate with engineers to translate research breakthroughs into scalable benchmarks, evaluation systems, and standards.

  • Pioneer new approaches to measuring reasoning, alignment, and trustworthiness in frontier AI systems.

  • Author high-quality code to enable large-scale experimentation, reproducible evaluation, and knowledge assessment workflows.

  • Publish in top-tier conferences and journals, establishing new directions in the science of AI evaluation.

  • Work cross-functionally with leadership, engineers, and external partners to set industry standards for responsible AI evaluation and alignment.

Desired Capabilities

  • PhD or equivalent research experience in machine learning, computer science, cognitive science, or related fields with focus on AI evaluation, interpretability, or model understanding.

  • 6+ years of post-doctoral academic or industry experience in a research-first environment.

  • Strong background in LLM research, evaluation methodologies, and/or foundational AI assessment techniques.

  • Proven ability to independently design, lead, and execute evaluation research programs with novel data types end-to-end.

  • Deep proficiency in Python and PyTorch for large-scale model analysis, benchmarking, and evaluation.

  • Experience building or leading novel benchmark development, systematic model assessment, or interpretability studies.

  • Strong publication record in post-training, evaluation, or interpretability that demonstrates field-defining contributions.

  • Ability to clearly communicate complex insights and influence both technical and non-technical stakeholders.

Extra Credit

  • Experience with RLHF, agent modeling, or AI alignment research.

  • Familiarity with data-centric AI approaches, synthetic data generation, or human-in-the-loop systems.

  • Understanding of challenges in scaling foundation models (training stability, safety, inference efficiency).

  • Contributions to open-source libraries or research tooling.

  • Interest in the societal impact, deployment ethics, and governance of frontier AI systems.

Perks

Handshake delivers benefits that help you feel supported—and thrive at work and in life.

The benefits below are for full-time US employees.

🎯 Ownership: Equity in a fast-growing company

💰 Financial Wellness: 401(k) match, competitive compensation, financial coaching

🍼 Family Support: Paid parental leave, fertility benefits, parental coaching

💝 Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend

📚 Growth: $2,000 learning stipend, ongoing development

💻 Remote & Office: Stipends for home office setup, internet, commuting, and free lunch/gym in our SF office

🏝 Time Off: Flexible PTO, 15 holidays + 2 flex days, winter #ShakeBreak where our whole office closes for a week!

🤝 Connection: Team outings & referral bonuses

Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers.

Handshake Glassdoor Company Review: 4.1 stars
Handshake DE&I Review: 3.9 stars
CEO of Handshake: Garrett Lord

Average salary estimate

$240,000 / year (est.)
Min: $180,000
Max: $300,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Our mission at Handshake is to give all students the chance to build the career they want, no matter where they’re from or what school they attend.

BENEFITS & PERKS
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
October 7, 2025