
Senior Site Reliability Engineer
MetroStar
full-time
Posted on:
Location Type: Hybrid
Location: Maryland • Virginia • United States
Visit company websiteExplore more
Salary
💰 $170,000 - $248,000 per year
Job Level
Tech Stack
About the role
- Design, deploy, and maintain mission-critical application workloads on virtualized or containerized environments (e.g., VMWare or Kubernetes), ensuring scalability, availability, and compliance with government requirements.
- Develop and sustain automated CI/CD pipelines, monitoring, and configuration management workflows to support reliable software delivery and operational observability across development, integration, staging, and production environments.
- Provision, configure, and maintain developer environments and toolchains to support rapid, secure, and efficient development workflows, enabling mission-aligned software delivery.
- Identify developer friction across the software development lifecycle and implement solutions to reduce that friction and provide developer-first environments.
- Establish and maintain a high level of customer trust and confidence through deep technical expertise, and use creativity to provide innovative solutions that fit the customer’s mission needs.
Requirements
- Active Top Secret with SCI eligibility security clearance.
- Certification meeting DoD 8140 (e.g., Security+, or higher).
- Bachelor’s degree in Computer Science or related engineering field is preferred; relevant experience may substitute.
- 7+ years of experience in software development, systems engineering, or operations roles with responsibility for availability, performance, and reliability of production systems.
- Demonstrated experience blending software engineering and systems administration practices to support highly available, scalable applications.
- Experience designing and managing monitoring, alerting, and observability solutions to meet defined Service Level Objectives.
- Experience leading or participating in incident response, root cause analysis, and continuous improvement activities.
- Experience with Ansible and Desired State Configuration.
- Experience with GitLab CI/CD automation and Bash scripting.
- Experience supporting container-native storage and object storage solutions (e.g., MinIO, S3-compatible services, and PortWorx).
- Experience with enterprise load-balancing solutions (e.g., F5 or similar platforms).
- Ability to contribute immediately with minimal ramp-up in a mission-critical operational environment.
Benefits
- Health, dental, and vision insurance
- 401(k) retirement plan with company match
- Paid time off (PTO) and holidays
- Parental Leave and dependent care
- Flexible work arrangements
- Professional development opportunities
- Employee assistance and wellness programs
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
CI/CD pipelinesBash scriptingAnsibleDesired State Configurationmonitoring solutionsalerting solutionsobservability solutionscontainer-native storageobject storageenterprise load-balancing
Soft Skills
creativitycustomer trusttechnical expertiseproblem-solvingcollaborationcommunicationleadershipincident responseroot cause analysiscontinuous improvement
Certifications
Top Secret with SCI eligibilityDoD 8140Security+