Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Software Engineer (AI Performance) image - Rise Careers
Job details

Software Engineer (AI Performance)

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds. 

Gimlet is spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware with previous successful exits.

Gimlet Labs is seeking a Software Engineer focused on AI Performance. You will be researching and implementing techniques to drive performance and quality optimizations across the latest AI models. You will implement techniques such as quantization, KV caching, and FlashAttention to enable inference efficiency. You will design parallelism strategies to distribute data and workloads across compute nodes at production scale. You will dive deep into GPU code and kernel optimizations to accelerate AI workloads.

Responsibilities:

  • Evaluating and implementing cutting-edge AI research for model performance and efficiency 

  • Architecting infrastructure for distributed AI workloads across both the software stack and GPU kernel layers

  • Profiling, benchmarking, and analyzing system performance, identifying bottlenecks and optimization opportunities in execution runtimes targeting various hardware systems

Qualifications:

  • Bachelor’s degree in computer science, engineering, applied mathematics or comparable area of study

  • Experience with performance optimization

Preferred Qualifications:

  • Graduate degree in computer science, engineering, applied mathematics or comparable area of study

  • Familiarity with compilers and compiler frameworks such as MLIR

  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks

  • Software development experience with Python, C++, and CUDA

Average salary estimate

$187500 / YEARLY (est.)
min
max
$150000K
$225000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Gimlet Labs logo

What it's like to work at Gimlet Labs

Read Reviews
Similar Jobs
Photo of the Rise User
Posted 4 hours ago

Workday is hiring a Software Development Engineer Intern to build real-world features, learn from experienced engineers, and contribute to platform engineering during a 12-week in-person summer program in Pleasanton, CA.

Photo of the Rise User
Posted 6 hours ago

Lead architecture and development of Clair's real-time fintech systems, building scalable, secure APIs and transaction processing that deliver instant pay to millions of users.

Photo of the Rise User
Posted 3 hours ago

National General seeks a hands-on full-stack Software Engineer Consultant I to build and maintain cloud-ready, test-driven applications for insurance products in a remote role.

Photo of the Rise User
Posted 8 hours ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Lead a multidisciplinary DevOps/SRE team to build and operate scalable, GitHub-first CI/CD and multi-cloud GPU inference infrastructure for NVIDIA's AI products.

Photo of the Rise User
Posted 4 hours ago

Senior-level embedded engineer needed to develop and validate bare-metal infrastructure, board bring-up, and secure drivers for ARM-based SoC platforms supporting critical defense systems.

Photo of the Rise User
Posted 4 hours ago

Lead development of photometric display calibration algorithms and on-device tools for Anduril's AR/VR systems, applying deep expertise in C/C++, computer vision, and display metrology to production environments.

Photo of the Rise User

Lead the development and deployment of pickup/dropoff motion-planning algorithms at Zoox to improve robotaxi driving behavior across complex real-world scenarios.

Photo of the Rise User
Posted 9 hours ago

Bjak is looking for an experienced ML Ops Engineer to optimize, serve, and scale open-source LLMs into production for high-impact global AI products in a hybrid remote/New York role.

Photo of the Rise User
Posted 5 hours ago

Proto Labs is hiring a Senior Software Engineer (Contractor) to lead AX 2012 → D365 F&O upgrades, build integrations, and modernize finance/order systems for their Maple Plain, MN operations.

GDIT Hybrid USA DC Washington
Posted 1 hour ago

Lead operational excellence for federal cloud systems as a Senior Azure Engineer at GDIT, focusing on Azure monitoring, IaC automation, incident response, and cost optimization.

Photo of the Rise User
Posted 1 hour ago

Boeing is hiring an Associate Software Systems Engineer to support systems engineering, requirements, and Agile delivery for space-focused software and distributed computing solutions in Herndon, VA.

Photo of the Rise User
Posted 2 hours ago

Lead Gravie's engineering efforts on customer and member web portals as a hands-on technical lead focused on full-stack development, reliability, and team growth.

Photo of the Rise User

ENS Solutions is hiring a senior IDAM-focused Software Engineer to design, integrate, and support enterprise identity and access management systems for DoD/IC environments under an active TS/SCI clearance with CI poly.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
September 29, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!