Browse 5 exciting jobs hiring in Rocm now. Check out companies hiring such as Sciforium, TensorWave, FM in Orlando, Virginia Beach, Denver.
Lead the architecture and hands-on development of Sciforium’s high-performance model serving platform, spanning GPU kernels, runtimes, distributed scheduling, and Python APIs to deliver low-latency multimodal inference.
Senior ML Solutions Engineer needed to lead CUDA-to-ROCm portability and kernel-level performance optimization for GPU-based ML workloads at TensorWave.
Build high-performance front-end tooling and optimize GPU-level kernels at Sciforium to accelerate our AI serving platform and bridge the UI with low-level infrastructure.
Develop and validate next-generation low-level GPU compute runtimes and system software at Intel, driving performance and reliability for AI, HPC and data-center workloads.
Lead engineering efforts to design and scale ProRata Attribution systems—covering content understanding, distributed serving, knowledge systems, and agentic backend workflows—at ProRataAI's Bellevue office.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
5
|