Sinch

Senior Site Reliability Engineer

Sinch

full-time

Posted on:

Origin:  • 🇺🇸 United States • Colorado, Illinois

Visit company website
AI Apply
Manual Apply

Salary

💰 $143,000 - $179,000 per year

Job Level

Senior

Tech Stack

AnsibleAWSCassandraCloudDistributed SystemsElasticSearchGoGoogle Cloud PlatformGrafanaLinuxMicroservicesPrometheusPythonTerraform

About the role

  • Assist in shaping, scaling, and optimizing the infrastructure that underpins Mailgun services.
  • Work closely with product engineering teams to drive improvements, automate workflows, and ensure systems meet high reliability standards.
  • Collaborate with other teams to define and implement system requirements.
  • Design, build, and maintain cloud-based microservices infrastructure.
  • Automate routine operational tasks and remediation processes to improve efficiency and reliability.
  • Proactively fix and resolve issues, collaborating with support teams and using monitoring tools to ensure system health.
  • Ensure datastores operate efficiently and meet performance and availability goals.
  • Contribute to the team's growth by mentoring junior engineers and sharing standard methodologies.
  • Plan and execute strategies for scaling systems and infrastructure as needs grow.
  • Solve complex distributed systems challenges and drive innovation in how email infrastructure is built and operated.

Requirements

  • Strong background in infrastructure, operations, or software engineering with a focus on reliability.
  • Extensive experience working with cloud platforms such as Google Cloud Platform (GCP) or Amazon Web Services (AWS).
  • Proficiency in using configuration management tools like Terraform and Ansible to manage infrastructure.
  • Hands-on experience with modern monitoring and observability tools such as Prometheus, Grafana, and similar technologies.
  • Proven experience with distributed databases (e.g. Cassandra, Elasticsearch) and maintaining their health at scale.
  • Familiarity with distributed event stores and stream-processing platforms.
  • Strong coding skills in at least one modern programming language (Python, Go, etc.).
  • Expertise in running and maintaining production systems in a Linux environment and public cloud infrastructure.
  • Demonstrated expertise in architecting solutions for complex technical challenges and leading initiatives from conception through execution.
  • Strong interpersonal and communication skills with a history of building effective relationships with cross-functional teams.
  • Ability to mentor and guide junior engineers, fostering a collaborative and inclusive team culture.
  • PREFERRED: Experience with container orchestration platforms.
  • PREFERRED: Expertise in CI/CD pipeline automation and infrastructure as code practices.
  • PREFERRED: Knowledge of network architecture and security best practices in cloud environments.
  • PREFERRED: Experience with containerization and microservices architectures.
  • PREFERRED: Advanced problem solving skills, particularly in highly sophisticated and distributed systems.