Thales

Site Reliability Engineer

Thales

full-time

Posted on:

Origin:  • 🇺🇸 United States

Visit company website
AI Apply
Manual Apply

Salary

💰 $114,480 - $184,369 per year

Job Level

Mid-LevelSenior

Tech Stack

AnsibleCloudDistributed SystemsDNSDockerElasticSearchGoGrafanaJenkinsKafkaKubernetesLinuxNoSQLPrometheusPythonRDBMSRedisSaltStackSDLC

About the role

  • Work with other teams across the company to build and deploy infrastructure that supports our platforms.
  • Apply SRE core tenets of measurement (SLI/SLO/SLA), eliminate toil, and reliability modelling
  • Establish metrics for data-driven decisions to increase availability, reliability, and velocity
  • Build and maintain, and evolve SLO and SLI network/system/application baselines
  • Assist with go/no go preplanning, verification/validation, and review of existing and new product/services
  • Proactively analyse data and test the integrity of network/systems to ensure production applications and services are operating optimally
  • Work with internal customers as needed to troubleshoot and resolve business affecting issues
  • Participate in escalations, incident response, RCA’s, and postmortems
  • Participate in 24x7 on-call rotation

Requirements

  • At least 5 years of professional experience within a cloud/web/CDN scale infrastructure
  • Experience with Python and Go. C/C++ a plus
  • Expert knowledge of Linux systems, network programming and protocols TCP, UDP, DNS, TLS/SSL, HTTP
  • Experience with BGP and Anycast routing is a plus
  • Experience with DevOps principles and concepts such as Infrastructure as Code (Ansible/Saltstack), CI/CD (Gitlab, Jenkins, Git), monitoring and visualization (Prometheus, Grafana)
  • Experience with big data technologies such as NoSQL/RDBMS, Redis, ElasticSearch, Kafka
  • Experience with containers and container management (Docker, Kubernetes)
  • Experience analyzing and building data telemetry, modeling, pipelines, UI visualization
  • Experience in developing software, troubleshooting, and monitoring large scale distributed systems
  • Implement software engineering best practices/standards and software development life cycle
  • Working knowledge and experience of Agile software development methodologies
  • Outstanding collaboration and communication, and documentation skills with a proven ability to work cross-functionally
  • BS/MS in computer science, engineering, or a related technical discipline or equivalent experience