
Site Reliability Engineer, SRE
Mida Technologies
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇺🇸 United States
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
AnsibleAWSAzureCloudDistributed SystemsDNSDockerGoGoogle Cloud PlatformGrafanaKubernetesMicroservicesPrometheusPythonTerraform
About the role
- Build and maintain highly available, scalable, and secure cloud infrastructure.
- Develop automation frameworks that streamline deployments, monitoring, and performance optimization.
- Implement and manage observability tools (metrics, logs, tracing) to ensure deep visibility into system behavior.
- Improve reliability through capacity planning, chaos engineering, and failure-mode analysis.
- Own CI/CD pipelines and ensure smooth, automated release processes.
- Collaborate with backend, frontend, data, and product teams to define SLIs, SLOs, and error budgets.
- Manage incident response, root cause analysis, and postmortems to prevent recurrence.
- Optimize system performance and reduce operational costs through proactive engineering.
- Enforce security best practices across infrastructure, deployments, and access management.
- Reduce manual toil by building automation and self-healing systems.
Requirements
- 3+ years experience as an SRE, DevOps Engineer, or Infrastructure Engineer
- Strong experience with cloud platforms (AWS, GCP, or Azure)
- Proficiency with containerization technologies (Docker, Kubernetes)
- Experience managing distributed systems and microservices architecture
- Strong scripting skills in Python, Bash, Go, or similar languages
- Hands-on experience with infrastructure-as-code tools (Terraform, Ansible, Helm)
- Deep understanding of CI/CD pipelines and deployment automation
- Strong grasp of networking, load balancing, DNS, caching, and security concepts
- Experience with monitoring/logging tools like Prometheus, Grafana, Loki, ELK, Datadog, or New Relic
- Ability to run incident response, troubleshoot production issues, and perform root cause analysis.
Benefits
- Competitive compensation + performance bonuses
- Health insurance & wellness benefits
- Flexible, remote-friendly work culture
- Opportunity to build and shape a mission-critical infrastructure powering thousands of users
- High-growth environment with room for leadership and innovation
- A culture dedicated to automation, reliability, and engineering excellence
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
cloud infrastructureautomation frameworksobservability toolscapacity planningchaos engineeringCI/CD pipelinessystem performance optimizationinfrastructure-as-codescripting in Pythoncontainerization technologies
Soft skills
collaborationincident responseroot cause analysisperformance optimizationsecurity best practices