Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
AI Infrastructure Engineer image - Rise Careers
Job details

AI Infrastructure Engineer

At TensorWave, we're leading the charge in AI compute, building a versatile cloud platform that's driving the next generation of AI innovation. We're focused on creating a foundation that empowers cutting-edge advancements in intelligent computing, pushing the boundaries of what's possible in the AI landscape.

About the Role:

We are looking for an AI Infrastructure Engineer with a passion for high-performance computing and distributed systems. The ideal candidate will support our vision by developing and managing the compute infrastructure that underpins our innovative AI cloud services. This role involves building and maintaining robust AI clusters, ensuring optimal performance and reliability for our clients' most demanding workloads.

Responsibilities:

  • Collaborate with a dynamic IT team to design, deploy, and maintain high-performance AI compute clusters supporting both AMD and NVIDIA GPU technologies.

  • Lead initiatives to optimize cluster performance, resource utilization, and job scheduling to maximize efficiency across diverse AI workloads.

  • Ensure system reliability, performance, and security for cloud services, implementing monitoring solutions and automated recovery systems.

  • Work closely with the AI development team to align infrastructure capabilities with the evolving needs of TensorWave's cloud platform.

  • Troubleshoot and resolve complex infrastructure issues across Linux systems, networking, and distributed computing environments, providing expert guidance to maintain high service levels.

  • Implement and maintain configuration management, deployment automation, and infrastructure-as-code practices.

Essential Skills & Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or related field.

  • At least 5 years of relevant experience in infrastructure engineering, with a focus on supporting high-performance computing (HPC) and AI applications.

  • Expert-level Linux system administration skills across multiple distributions.

  • Strong experience with clustered computing environments (GPU, CPU, or hybrid clusters).

  • Solid understanding of networking fundamentals, including TCP/IP, routing protocols, and high-speed interconnects.

  • Experience with container technologies (Docker, Kubernetes), job schedulers (Slurm, PBS), and configuration management tools.

  • Familiarity with AMD and NVIDIA GPU ecosystems, CUDA, ROCm, and their infrastructure requirements.

  • Exceptional debugging and problem-solving abilities with a methodical approach to complex system issues.

  • Demonstrated ability to learn new technologies quickly and adapt to rapidly evolving infrastructure needs.

We're looking for resilient, adaptable people to join our team—folks who enjoy collaborating and tackling tough challenges. We're all about offering real opportunities for growth, letting you dive into complex problems and make a meaningful impact through creative solutions. If you're a driven contributor, we encourage you to explore opportunities to make an impact at TensorWave. Join us as we redefine the possibilities of intelligent computing.

What We Bring:

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

  • Stock Options

  • 100% paid Medical, Dental, and Vision insurance

  • Life and Voluntary Supplemental Insurance

  • Short Term Disability Insurance

  • Flexible Spending Account

  • 401(k)

  • Flexible PTO

  • Paid Holidays

  • Parental Leave

  • Mental Health Benefits through Spring Health

Average salary estimate

$175000 / YEARLY (est.)
min
max
$140000K
$210000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
insightsoftware Hybrid Remote, Remote, United States
Posted 13 hours ago

Experienced Oracle DBA needed to administer Oracle databases (11g/12c/19c) and lead migrations to AWS RDS for a global financial reporting and analytics software provider.

Photo of the Rise User
Global Lending Services Hybrid Greenville, South Carolina
Posted 19 hours ago

Contribute to GLS's enterprise security posture as an entry-level IT Security Analyst responsible for monitoring threats, maintaining security systems, and supporting incident response and risk mitigation.

Wyetech Hybrid Annapolis Junction, Maryland
Posted 22 hours ago

Wyetech is hiring a senior Exploitation Analyst to perform advanced vulnerability and exploitation analysis in support of classified federal missions, requiring active TS/SCI with polygraph and extensive cyber experience.

Photo of the Rise User

Provide on-site PC, Mac, printer, and basic network support as an on-call independent contractor for Geeks on Site in the Greensboro-High Point area.

Acuity seeks an experienced Information Assurance Analyst/Engineer to produce security authorization documentation and support RMF-based cloud security activities for federal clients.

Photo of the Rise User
NBCUniversal Hybrid 100 Universal City Plaza, Universal City, CALIFORNIA
Posted 10 hours ago

NBCUniversal is hiring an Enterprise IT Mobile Manager to lead mobile strategy, operations, cost management, and vendor relationships for corporate mobile services at Universal City.

The Material and License Lead Coordinator will manage IT acquisitions and software license oversight for TOS at Arnold AFB, ensuring compliant tracking, reporting, and process improvements across mission systems.

Photo of the Rise User

Geeks on Site is hiring on-call IT Field Technicians in the Appleton area to provide hands-on PC, Mac, network, and printer support for residential and small-business customers.

Photo of the Rise User

Geeks on Site is hiring on-call, 1099 field IT technicians to provide local onsite PC, Mac, printer, and network support in the Allentown-Bethlehem-Easton area.

Avint Hybrid No location specified
Posted 23 hours ago

Avint is hiring a senior Cybersecurity SME with an active Top Secret clearance to lead RMF/A&A efforts, advise leadership, and manage cybersecurity posture for DoD-affiliated systems.

Posted 2 hours ago

CalAmp seeks an Information Security Analyst to support SOC operations, cloud security, and compliance efforts for its connected intelligence IoT solutions.

Photo of the Rise User
USC Hybrid Los Angeles, CA - University Park Campus
Posted 19 hours ago

USC seeks an Attack Surface Management Analyst to monitor and reduce risk across campus digital assets—combining vulnerability assessment, ASM tooling, and cross-team remediation to protect on-prem, cloud, OT and application environments.

Photo of the Rise User

Experienced field IT technician needed to join a national on-call network providing PC, Mac, network, and printer/scanner support for residential and small business customers.

Supercharge your large-scale PyTorch LLM workloads with our cloud powered by AMD MI300X

16 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
September 3, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!