Join our team as an intern to build the future of inference, GPU optimization, and AI infrastructure. You'll work full-time as an engineer directly with the team, helping to define our technical direction and build the core systems that power our GPU optimization platform.
Build scalable infrastructure for AI model training and inference
Lead technical decisions and architecture choices
GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns.
Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads.
LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation).
Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.
Publications or open-source contributions in inference, GPU computing, or ML/AI for code are a plus.
Hands-on experience with large-scale experiments, benchmarking, and performance tuning.