Lead LMArena’s open-source research program to design, run, and publish rigorous, reproducible evaluations and datasets that shape global AI model benchmarking and community tools.
Lead LMArena’s product security efforts by designing and building scalable defenses against bots, Sybil attacks, and adversarial behavior across product, infra, and data pipelines.