Job details

Solutions Architect, Inference Deployments

We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, showcasing NVIDIA’s GPU technology and Kubernetes. As a Solutions Architect (Inference Focus), you’ll collaborate closely with our engineering, DevOps, and customer success teams to drive enterprise AI adoption. Together, we'll bring generative AI to production!

What you'll be doing:

  • Help customers craft, deploy, and maintain scalable, GPU-accelerated inference pipelines on Kubernetes for large language models (LLMs) and generative AI workloads.

  • Tune performance using TensorRT/TensorRT-LLM, NVIDIA NIM, and Triton Inference Server to improve GPU utilization and model efficiency.

  • Collaborate with cross-functional teams (engineering, product) and offer technical mentorship to customers implementing AI at scale.

  • Architect zero-downtime deployments, autoscaling (e.g., HPA or an equivalent with custom metrics), and integration with cloud-native tools (e.g., OpenTelemetry, Prometheus, Grafana).

What we need to see:

  • 5+ years in solutions architecture with a proven track record of moving AI inference from POC to production on Kubernetes.

  • Experience architecting GPU allocation using NVIDIA GPU Operator and NVIDIA NIM Operator, including troubleshooting complex GPU orchestration, optimizing with Multi-Instance GPU (MIG), and ensuring efficient utilization in Kubernetes environments.

  • Proficiency with TensorRT-LLM, Triton, and TensorRT for model optimization and serving.

  • Demonstrated success optimizing LLMs for low-latency inference in enterprise environments.

  • BS or equivalent experience in CS/Engineering.

Ways to stand out from the crowd:

  • Prior experience deploying NVIDIA NIM microservices for multi-model inference.

  • Serverless inference experience, including knowledge of FaaS patterns (e.g., Google Cloud Run, AWS Lambda, NVCF) with NVIDIA GPUs.

  • NVIDIA Certified AI Engineer or similar.

  • Active contributions to Kubernetes SIGs or AI inference projects (e.g., KServe, Dynamo, SGLang or similar).

  • Familiarity with networking concepts that support multi-node inference, such as MPI, LWS, or similar.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until August 8, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA Glassdoor company rating: 4.6
CEO of NVIDIA: Jensen Huang

Average salary estimate

$191,875 / year (est.)
min: $148,000
max: $235,750

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.


NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

BADGES
Changemaker
Diversity Champion
Family Friendly
Global Citizen
Work&Life Balance
CULTURE VALUES
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
BENEFITS & PERKS
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave
EMPLOYMENT TYPE
Full-time, unknown
DATE POSTED
August 6, 2025