Our Mission
At Palo Alto Networks® everything starts and ends with our mission:
Being the cybersecurity partner of choice, protecting our digital way of life.
Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.
Who We Are
We believe collaboration thrives in person. That’s why most of our teams work from the office full time, with flexibility when it’s needed. This model supports real-time problem-solving, stronger relationships, and the kind of precision that drives great outcomes.
Your Career
With Prisma AIRS, Palo Alto Networks is building the world's most comprehensive AI security platform. Organizations are increasingly building complex ecosystems of AI models, applications, and agents, creating dynamic new attack surfaces with risks that traditional security approaches cannot address. In response,Prisma AIRS delivers model security, posture management, AI red teaming, and runtime protection. Our customers can confidently deploy AI-driven innovation while ensuring a formidable security posture from development through runtime.
As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas, including AI-Native Security (LLM, AI Agent, Model Supply-Chain, Runtime AI) and the broader LLM ecosystem security. You will leverage this research to identify and bring up new product opportunities. You will collaborate closely with engineering teams to deploy models, ensuring maximum product impact. Furthermore, you will foster cross-functional collaboration and serve as an AI thought leader both within the company and in the security/LLM community. Beyond individual contribution, you will lead complex technical projects, mentor senior engineers, and set the standard for performance, scalability, and engineering excellence across the organization. Your decisions will have a profound and lasting impact on our ability to deliver cutting-edge AI security solutions at a massive scale.
Your Impact
Lead the architectural design of a highly scalable, low-latency, and resilient ML inference platform capable of serving a diverse range of models for real-time security applications.
Define technical approaches to less-defined product requirements, ensuring the best fit between product features and technical implementation. Explore new product opportunities by maintaining a deep understanding of LLM and Generative AI research trends.
Technical Leadership: Provide technical leadership and mentorship to the team, driving best practices in MLOps, software engineering, and system design.
Strategic Optimization: Drive the strategy for model and system performance, guiding research and implementation of advanced optimization techniques like custom kernels, hardware acceleration, and novel serving frameworks.
Set The Standard: Establish and enforce engineering standards for automated model deployment, robust monitoring, and operational excellence for all production ML systems.
Cross-Functional Vision: Act as a key technical liaison to other principal engineers, architects, and product leaders to shape the future of the Prisma AIRS platform and ensure end-to-end system cohesion.
Solve the Hardest Problems: Tackle the most ambiguous and challenging technical problems in large-scale inference, from mitigating novel security threats to achieving unprecedented performance goals.
Your Experience
BS/MS or Ph.D. in Computer Science, a related technical field, or equivalent practical experience.
Extensive professional experience in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models at scale.
Expert-level programming skills in Python are required; experience in a systems language like Go, Java, or C++ is nice to have.
Deep, hands-on experience designing and building large-scale distributed systems on a major cloud platform (GCP, AWS, Azure, or OCI).
Proven track record of leading the architecture of complex ML systems and MLOps pipelines using technologies like Kubernetes and Docker.
Mastery of ML frameworks (TensorFlow, PyTorch) and extensive experience with advanced inference optimization tools (ONNX, TensorRT).
A strong understanding of popular model architectures (e.g., Transformers, CNNs, GNNs) is a must. A deeper understanding of attention mechanisms and related knowledge is a plus.
Demonstrated expertise with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas are a significant plus.
Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton Language, is a plus.
Experience with data infrastructure technologies (e.g., Kafka, Spark, Flink) is great to have.
Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI, Tekton) is a plus.
The Team
Our Prisma AIRS team is a group of highly motivated and innovative engineers and researchers dedicated to solving the most challenging problems in AI security. We thrive in a collaborative environment where we value creativity, ownership, and a commitment to excellence. You will have the opportunity to work with cutting-edge technology and make a significant impact on the future of cybersecurity.
Compensation Disclosure
The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $185,200 - $299,450/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.
Our Commitment
We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.
We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at [email protected].
Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.
All your information will be kept confidential according to EEO guidelines.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead the design and delivery of a cloud-native Network Management System as a Principal Software Engineer driving frontend and full-stack solutions at Palo Alto Networks.
Lead the design and delivery of AI-enhanced developer platforms at Palo Alto Networks to radically improve developer productivity and software quality across global cloud infrastructure.
LPL Financial seeks an Engineer II Full Stack (.NET) to design and build scalable advisor onboarding solutions using .NET Core, Angular, AWS, and AI technologies.
Experienced frontend engineer needed to build scalable, data-rich AIOps UI features using React and TypeScript while driving engineering best practices and mentoring teammates at Palo Alto Networks.
Help build the product surface for a new liquid market for GPU offtake as a Product Engineer focused on frontend-driven, high-impact features across cloud and market products.
Lead the design and implementation of enterprise generative AI platforms at NVIDIA as a Principal Software Engineer, shaping scalable architecture, reliability, and product delivery.
CCBC is seeking entry- and mid-level programmers to develop, maintain, and support enterprise administrative applications in a remote work model.
Experienced staff-level engineer needed to lead design and implementation of secure, high-performance messaging protocols and mobile SDKs in a remote, open-source driven environment.
High-impact full-stack engineering role at a YC-backed crypto trading startup building real-time trading systems and product features end-to-end in Austin.
CompassX seeks an entry-level Software Engineer to design and build new cloud-facing applications while receiving hands-on mentorship in a remote, CST/EST-friendly role.
Senior Python Software Engineer needed to architect and build scalable, low-latency backend data pipelines and workflow orchestration for high-volume real-time systems.
Blue Origin is hiring a Senior Software Product Engineer to architect, develop, and manage mission-critical embedded and application software for rocket engine test and ground support systems.
Lead a cross-functional engineering team at Narmi to design and ship a greenfield Financial Intelligence product that delivers cash flow analysis and actionable financial insights for business banking customers.
Lead and grow a high-performing engineering team in San Francisco to design and ship OpenAI’s core monetization and ads infrastructure with a strong focus on safety, privacy, and scalability.
Experienced full-stack engineer needed to lead modernization of early childhood education platforms using React, TypeScript, Node.js, AWS and to support legacy ColdFusion systems during the transition.
Being the cybersecurity partner of choice, protecting our digital way of life.
133 jobs