Twenty is seeking a Site Reliability Engineer for an on-site position at Fort Meade, MD to ensure the reliability, performance, and availability of our mission-critical cyber technologies that protect democracies worldwide. We're looking for someone with 5+ years of experience in site reliability engineering, DevOps, or cloud operations, with deep expertise in AWS, Docker containerization, and secure enclave environments. In this role, you'll be the guardian of our AI-powered graph database applications running in closed AWS environments, ensuring operational readiness for systems that process real-time cyber operation data at machine speed. You'll monitor, troubleshoot, and optimize our containerized microservices architecture, implement robust monitoring and alerting systems, and serve as the critical link between our Arlington engineering teams and the operational requirements at Fort Meade. You'll join a world-class product and engineering team that delivers mission-critical solutions for U.S. national security, working in highly secure environments to maintain systems that operate at the speed of cyber warfare. If you're passionate about ensuring system reliability in high-stakes environments while making a direct impact on national security, we want to talk to you.
At Twenty, we're taking on one of the most critical challenges of our time: defending democracies in the digital age. We develop revolutionary technologies that operate at the intersection of cyber and electromagnetic domains, where the speed of operations exceeds human sensing and complexity transcends conventional boundaries. Our team doesn't just solve problems – we deliver game-changing outcomes that directly impact national security. We're pragmatic optimists who understand that while our mission of protecting America and its allies is challenging, success is possible.
Ensure availability and performance of AI-powered cyber applications running in secure AWS enclaves at Fort Meade
Monitor and maintain Docker containerized microservices architecture across development, staging, and production environments
Implement and manage comprehensive monitoring, logging, and alerting systems to proactively identify and resolve issues before they impact operations
Manage and optimize AWS infrastructure within closed enclave environments, ensuring compliance with government security requirements
Automate deployment pipelines and infrastructure provisioning using Infrastructure as Code (IaC) principles
Perform capacity planning and scaling operations to ensure systems can handle real-time cyber operation data loads
Lead incident response efforts for system outages or performance degradation, coordinating with Arlington engineering teams as needed
Conduct root cause analysis for system failures and implement preventive measures to avoid recurrence
Maintain detailed runbooks and documentation for operational procedures and emergency response protocols
Serve as the primary technical liaison between Fort Meade operations and Twenty's Arlington engineering teams
Work closely with government stakeholders to understand operational requirements and ensure system configurations meet mission needs
Provide technical support and training to end users on system functionality and troubleshooting procedures
5+ years of professional experience in site reliability engineering, DevOps, or cloud operations
Expert-level proficiency with Amazon Web Services (AWS) including EC2, ECS, RDS, CloudWatch, and networking services
Advanced experience with Docker containerization and container orchestration platforms
Strong knowledge of Linux/Unix systems administration and command-line tools
Proficiency with Infrastructure as Code tools (Terraform, CloudFormation, or similar)
Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, or similar)
Knowledge of CI/CD pipelines and automated deployment practices
Understanding of networking concepts, security groups, and VPC configurations in AWS environments
Experience working in secure, air-gapped, or enclave environments
Understanding of government security requirements and compliance frameworks
Knowledge of container security best practices and vulnerability management
Familiarity with logging and auditing requirements for government systems
Strong troubleshooting and problem-solving skills with ability to work under pressure
Experience with incident management and on-call responsibilities
Proven ability to write clear technical documentation and runbooks
Understanding of database administration and performance tuning (particularly graph databases like Neo4j)
Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent practical experience
Must be eligible to obtain and maintain a TS/SCI security clearance
U.S. citizenship required
Ability to work on-site at Fort Meade, MD with occasional travel to Arlington, VA
Previous experience supporting mission-critical systems in government or defense environments
Background in cyber operations or intelligence systems support
Experience with graph databases (Neo4j) and GraphQL APIs
Knowledge of AI/ML system operations and monitoring
Certifications in AWS, Kubernetes, or site reliability engineering
Experience with NATS or other message queue systems
Experience with Agile development methodologies and cross-functional collaboration
Knowledge of performance testing and load testing methodologies
Understanding of disaster recovery and business continuity planning
Scripting experience in Python, Bash, or Go
Experience with configuration management tools (Ansible, Chef, Puppet)
Familiarity with service mesh technologies and microservices patterns