Oowlish

DevOps – Site Reliability Engineer

Oowlish

full-time

Posted on:

Location Type: Remote

Location: Brazil

Visit company website

Explore more

AI Apply
Apply

About the role

  • Deploy and manage web, mobile, and API applications across cloud environments
  • Implement and maintain monitoring and observability tools like NewRelic, Datadog, or Prometheus/Grafana
  • Design and optimize CI/CD pipelines using tools like Azure Pipelines, Jenkins, or CircleCI
  • Manage containerized environments with Docker, Kubernetes, and Helm
  • Build and manage cloud infrastructure on Azure, AWS, or GCP
  • Write automation scripts using Bash and other scripting languages
  • Develop and maintain incident response processes and disaster recovery strategies
  • Collaborate with development, product, and operations teams to improve system reliability and deployment efficiency

Requirements

  • 3+ years of experience in a DevOps, Site Reliability Engineering (SRE), or related role
  • Strong hands-on experience with the deployment of web, mobile, and API applications
  • Expertise in monitoring and observability tools (e.g., NewRelic, Datadog, Prometheus/Grafana)
  • Strong experience with CI/CD pipelines and associated tools (Azure Pipelines, Jenkins, CircleCI)
  • Proficiency with Docker, Kubernetes, and Helm
  • Experience working with cloud platforms like Azure, AWS, or GCP
  • Scripting proficiency in Bash
  • Familiarity with incident response and disaster recovery planning
Benefits
  • Remote work (home office)
  • Competitive compensation based on experience
  • Career development plans with opportunities for significant growth within the company
  • International projects
  • Oowlish English Program (technical and conversational)
  • Oowlish Fitness with TotalPass
  • Games and competitions
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
DevOpsSite Reliability EngineeringCI/CD pipelinesscriptingmonitoring toolsobservability toolsincident responsedisaster recoverycontainerizationcloud infrastructure
Soft Skills
collaborationsystem reliabilitydeployment efficiency