Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 700 of the world's leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io.
Astro Private Cloud (APC) is the self-hosted, enterprise-grade distribution of Astronomer’s data-orchestration platform, purpose-built for enterprises that need full control of infrastructure, security, and data. Fortune 500 companies, top financial institutions, and other highly regulated organizations run APC to bring the power of Apache Airflow into their own secure environments—from private clouds to air-gapped data centers.
We're seeking a Senior Software Engineer, Infrastructure to join the Astro Private Cloud team at Astronomer. In this role, you'll leverage your expertise in Kubernetes, cloud platforms (AWS, Azure, GCP, OpenShift), and observability tools (ELK, Prometheus) to improve the observability, performance, reliability, and scalability of our on-premise offering.
You’ll work at the intersection of distributed systems, cloud-native infrastructure, and data platform tooling. If you enjoy solving complex problems in Kubernetes, care about engineering excellence, and thrive in a fast-moving, collaborative environment—this role is for you.
Take ownership of the health, performance, and scalability of different parts of the platform.
Support the rollout and deployment of new features and installations to drive rapid growth and iteration.
Collaborate with engineering teams to ensure the platform is built with reliability and operational excellence in mind.
Lead automation initiatives to reduce manual effort and increase system efficiency.
Build tools to streamline deployment processes and enhance observability in large-scale environments.
Conduct thorough root cause analyses and document findings through detailed post-mortems.
Create and maintain clear, comprehensive documentation for critical systems and processes.
Participate in an on-call rotation, including direct collaboration with select customers.
Thrive in a dynamic, fast-paced environment with evolving priorities and challenges.
3+ years of hands-on experience deploying, operating, and troubleshooting Kubernetes clusters in production environments.
5+ years of software engineering experience with a strong command of Python and/or Go.
Proficiency in automation and scripting using Shell, Python, or similar languages.
Experience with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
Proven ability to manage and scale distributed systems across major cloud platforms (AWS, Azure, or GCP).
Solid experience with CI/CD tools, such as CircleCI, Jenkins, or similar systems.
Strong understanding of Linux systems, networking protocols, and core infrastructure components.
Skilled in deploying, maintaining, and monitoring complex application stacks and services.
Excellent troubleshooting and analytical skills, with a proactive approach to problem-solving.
Experience with scale testing, disaster recovery strategies, and capacity planning.
Knowledge of service mesh technologies such as Istio or Envoy.
Familiarity with TypeScript and JavaScript.
Exposure to Apache Airflow and its ecosystem.
Experience working with OpenShift and the Red Hat Marketplace.
Proficiency with observability tools like Prometheus, Grafana, and the ELK stack.
The estimated salary for this role ranges from $180,000 - $210,000 based on leveling and geography, along with an equity component and a comprehensive benefits package. This range is merely an estimate; actual compensation may deviate from this range based on skills, experience, and qualifications.
#LI-Hybrid
At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Experienced iOS engineer needed to build and maintain clinical-grade health features and data visualizations for WHOOP’s Boston-based Healthcare team.
Anyscale is hiring new graduate software engineers to build scalable ML and distributed systems infrastructure powered by Ray.
Senior DevOps Engineer (Azure) to drive Terraform-based IaC, improve cloud reliability, and accelerate containerization at Degreed.
Backend Engineer role at a well-funded AI startup, responsible for building scalable backend services, APIs, and data platforms using Go/Python in a hybrid Silicon Valley team.
Help design and implement high-performance, secure distributed systems and networking protocols in Go to advance StrongDM's Zero Trust PAM platform for enterprise customers.
Build and deploy mission-focused full-stack software for military logistics at Rune Technologies, working closely with users to deliver reliable, field-ready solutions.
Netflix is looking for a Site Reliability Engineer (L4) to enhance resilience, automation, and incident response for its streaming infrastructure in a remote role.
Help build the trust layer for the internet as a full-stack engineer at Eigen Labs, turning R&D concepts into polished, production-ready web services and demos.
Be the technical owner for blockchain integrations and product delivery at Inversion, building infrastructure and bespoke solutions that power portfolio companies and fintech partners.
At Salesforce, this Lead Software Engineer role will drive endpoint security architecture and implementation to strengthen enterprise-wide defenses across diverse platforms.
Help build reusable, high-performance web UI frameworks at Netflix using JavaScript, TypeScript, React, and GraphQL to power server-driven experiences across client platforms.
NBCUniversal is hiring a Sr. Site Reliability Engineer to architect and support enterprise SharePoint and Power Platform solutions that enable scalable, secure Microsoft 365 business applications.
Great Gray seeks a Senior DevOps Engineer to modernize Azure infrastructure, lead the GitHub Actions migration, and elevate CI/CD and observability for a growing platform team.
Astronomer is a platform for data engineering. The company was founded in 2015 and is based in Cincinnati, Ohio.
7 jobs