Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
SES Corporation

Reliability Engineer

SES Corporation

Reliability Engineer responsible for U.S. Air Force Cloud One Architecture.

Posted 6/30/2026full-timeBoston • Massachusetts • 🇺🇸 United StatesSeniorLeadWebsite

Tech Stack

Tools & technologies
AWSAzureCloudDistributed SystemsDNSDockerGoGoogle Cloud PlatformGrafanaKubernetesLinuxPrometheusPythonTCP/IPTerraform

About the role

Key responsibilities & impact
  • This role supports the U.S. Air Force Cloud One Architecture and Common Shared Services contract
  • The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission‑critical systems
  • This role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement
  • The reliability engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities

Requirements

What you’ll need
  • Bachelors and eight (8) years or more of experience; Masters and six (6) years or more of experience. Additional experience may be accepted in lieu of degree.
  • Active Secret clearance at a minimum required to start
  • US citizenship required
  • Experience with cloud platforms (AWS, Azure, OCI, or GCP), including managed services
  • Experience with containerized environments (Docker, Kubernetes)
  • Familiarity with CI/CD pipelines and deployment automation
  • SLOs and error budgets
  • Capacity modeling and performance testing
  • Strong understanding of:
  • Distributed systems and high‑availability architectures
  • Linux/Windows system administration
  • Networking fundamentals (DNS, TCP/IP, load balancing)
  • Hands-on experience with:
  • Monitoring and observability tools (e.g., Prometheus, Grafana, ELK/Elastic, Datadog, Azure Monitor)
  • Infrastructure as Code (Terraform, ARM, CloudFormation)
  • Scripting or programming languages (Python, Bash, Go, PowerShell, or similar)
  • Experience supporting incident management and on‑call operations

Benefits

Comp & perks
  • Medical
  • Dental
  • Vision
  • AD&D
  • STD
  • LTD
  • Company paid Life Insurance
  • 401k with employer contribution
  • Paid Time Off
  • Pet Insurance

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Capacity ModelingPerformance TestingLinux System AdministrationWindows System AdministrationNetworking Fundamentals (DNS, TCP/IP, Load Balancing)Scripting Languages (Python, Bash, Go, PowerShell)
Certifications
Active Secret Clearance