FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Site Reliability Engineer
ArctiqSite Reliability Engineer engaged in maintaining reliability engineering for government systems. Developing automation and CI/CD pipelines while ensuring system health through monitoring.
Tech Stack
Tools & technologiesAnsibleAWSAzureCloudDockerGoGoogle Cloud PlatformGrafanaKubernetesPrometheusPythonTerraform
About the role
Key responsibilities & impact- Implement and maintain dashboards and alerting rules using Prometheus, Grafana, or ELK Stack.
- Support the identification of Service Level Indicators (SLIs).
- Develop and maintain Infrastructure as Code (IaC) scripts using Terraform and Ansible to ensure repeatable, error-free deployments.
- Maintain automated deployment pipelines, ensuring security scans and automated tests are integrated into the workflow.
- Participate in on-call rotations and assist in troubleshooting system outages.
- Contribute to blameless post-mortem reports to drive continuous improvement.
- Identify repetitive manual tasks and develop automation to reduce "toil," allowing the team to focus on high-value engineering.
Requirements
What you’ll need- 3–5 years of experience in SRE, DevOps, or Systems Engineering roles.
- Proficiency in scripting languages (Python, Go, or Bash).
- Hands-on experience with containerization (Docker, Kubernetes) and cloud platforms (AWS, Azure, or GCP).
- Familiarity with NIST SP 800-53 security controls.
- Bachelor’s degree in Computer Science or a related technical field.
Benefits
Comp & perks- Health insurance
- Paid time off
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Infrastructure as CodeTerraformAnsiblePythonGoBashDockerKubernetesAWSAzure
Soft Skills
troubleshootingcontinuous improvementautomationcollaborationproblem-solving