Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Infrastructure Engineer (Infiniband) image - Rise Careers
Job details

Infrastructure Engineer (Infiniband)

We are seeking an Infrastructure Engineer with a focus on InfiniBand/NCCL to join our Infrastructure Engineering team. Our engineers design and build automation, tooling, and systems that bridge the gap between physical infrastructure and the platforms that power large-scale AI/ML and HPC workloads.

This role combines the breadth of a core infrastructure engineer with a specialty in high-performance networking and GPU communication. You’ll help ensure our InfiniBand fabric and NCCL stack are tuned, reliable, and efficient at scale — supporting some of the world’s largest GPU clusters.

This is a fully remote position, although candidates must be based in the continental United States. Unfortunately, we are unable to provide sponsorship for this role.

Responsibilities

  • Design, build, and maintain automation, APIs, and frameworks to manage physical infrastructure at scale.

  • Develop and extend systems for server lifecycle management.

  • Implement and tune InfiniBand networking and NCCL configurations for multi-GPU communication.

  • Collaborate with Network, Platform, and Infrastructure Operations teams to support new infrastructure rollouts.

  • Diagnose and improve performance across GPU, NVSwitch, PCIe, and InfiniBand layers.
    Write clear design documents and technical documentation to capture best practices.

Qualifications

  • 8+ years of professional experience in infrastructure engineering, HPC, or related domains.

  • Strong experience with Linux in production environments.

  • Proficiency in Python or similar languages for automation.

  • Deep understanding of InfiniBand networking (CX7 HCAs, fabrics, partitioning, GPUDirect).

  • Familiarity with NCCL, CUDA, and GPU topology optimization.

  • Knowledge of containerization and orchestration concepts.

  • Strong written and verbal communication skills.

Ideal Experiences

  • Experience with Dell PowerEdge XE9680 or other GPU-dense servers.

  • Prior work with NVIDIA H100s, NVSwitch, and large-scale NCCL testing.

  • Familiarity with Mellanox OFED, UCX, and Redfish/iDRAC for management.

  • Broader experience across infrastructure areas (storage, virtualization, networking).

Culture

  • Enjoy collaborating with a motivated, execution-focused team.

  • Comfortable operating with autonomy while aligning to company objectives.

  • Value precision, documentation, and knowledge-sharing.

  • Excited to grow as both a domain specialist (InfiniBand/NCCL) and a generalist infrastructure engineer.

Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter. 

Average salary estimate

$220000 / YEARLY (est.)
min
max
$180000K
$260000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User

An experienced infrastructure engineer with deep observability expertise to build scalable telemetry, monitoring, and alerting systems for large-scale GPU and server fleets at a remote-first company.

Photo of the Rise User
IonQ Hybrid Bothell, Washington, United States
Posted 6 hours ago

IonQ is hiring a Senior Optical Engineer to design and deliver precision optical and laser subsystems for trapped-ion quantum computers in Bothell, WA.

Photo of the Rise User
Posted 14 hours ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Work on OpenAI's Robotics team as a junior-to-mid electrical engineer designing, building, and bringing up electronics for next-generation robotic systems in San Francisco with mentorship from experienced industry engineers.

Photo of the Rise User
Posted 3 hours ago

AbbVie is looking for a Senior Project Engineer to lead complex manufacturing and packaging projects, drive risk-mitigated designs, and support regulatory audit readiness.

Photo of the Rise User
Posted 12 hours ago

Serve as a Systems Engineer on an Agile team to design system architecture, lead integration and testing, and ensure mission-aligned solutions for a cleared, government-focused environment.

Photo of the Rise User
Baxter Hybrid Round Lake, Illinois
Posted 9 hours ago

Baxter is hiring a Principal Automation Engineer to lead automation, equipment design, and Industry 4.0 initiatives at its Round Lake manufacturing site while ensuring regulatory-compliant, reliable production.

Photo of the Rise User
Posted 21 hours ago
Mission Driven
Social Impact Driven
Passion for Exploration
Reward & Recognition

SpaceX seeks a Mechanical Engineer (Facilities) to own design and analysis of campus fluid and HVAC systems, ensuring manufacturability and operational reliability across the Hawthorne site.

Photo of the Rise User
Mission Driven
Social Impact Driven
Passion for Exploration
Reward & Recognition

SpaceX is hiring a Manufacturing Engineer to develop and scale injection molding processes for Starlink consumer hardware aimed at high-volume production.

Photo of the Rise User
Posted 18 hours ago

Entry-level civil engineer role at Dewberry’s Richmond office focusing on site/civil design, stormwater and drainage systems, permitting, and hydrologic/hydraulic analysis.

Photo of the Rise User
Baxter Hybrid Marion, North Carolina
Posted 10 hours ago

Lead controls engineering and automation initiatives at Baxter’s Marion manufacturing site, applying deep PLC, robotics, and validation expertise to ensure safe, compliant, and efficient production.

Photo of the Rise User
Posted 19 hours ago

Moog is hiring a fall 2025 Manufacturing Engineering intern in Christiansburg, VA to support continuous improvement, data analytics, and testing efforts within its Industrial Group.

Photo of the Rise User
Mission Driven
Social Impact Driven
Passion for Exploration
Reward & Recognition

Lead mechanical design and packaging of flight‑critical avionics and sensor systems at SpaceX, owning hardware performance from concept through launch and beyond.

Goken America seeks an experienced Lead Dimensional Engineer to lead flush-and-gap craftsmanship, dimensional variation analysis (DVA) and measurement strategies for automotive development programs.

Photo of the Rise User
Anduril Industries Hybrid Boston, Massachusetts, United States
Posted 21 hours ago

Lead the design and delivery of maritime INS, sensor-fusion, and estimation solutions at Anduril to advance autonomous maritime navigation and system architecture.

voltage park is building a new class of cloud infrastructure from the ground up. join us, we're hiring!

7 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
August 23, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!