Tech Stack
CassandraCloudDNSDockerGoogle Cloud PlatformGrafanaJenkinsKafkaLinuxMicroservicesMySQLPrometheusPythonSMTPSplunkTCP/IPTerraformUnix
About the role
- Work closely with various engineering teams to identify service needs and collaboratively design solutions
- Lead initiatives with service owners to define SLOs and build SLIs to ensure systems are meeting SLAs
- Implement and enforce security best practices
- Reduce Operational Toil and maintain a high degree of automation by adapting IaC first and GitOps principles
- Acquire and maintain significant understanding of Taulia-SAP products and services to ensure timely resolution of production incidents
- Develop and advocate for CI/CD best practices, standards, and processes
- Participate and enhance our 24/7 on call and incident management process
- Mentor junior engineers and cross-functionally work with other engineering teams to enhance productivity
- Scale platform worldwide and work with cutting edge technology to support enterprise SaaS
Requirements
- 10+ years of experience working with an enterprise SaaS solution hosted on a cloud platform
- Deep expertise in managing container based microservices (GKE, Docker, Service Mesh, Observability, and continuous deployment)
- Strong understanding of Unix/Linux operating systems and protocols (HTTPS, TCP/IP, SSL/TLS, PKI, DNS, SMTP, SSH, NTP)
- 7+ years of progressive experience with public cloud
- High degree of proficiency in scripting especially Bash and/or Python
- Experience with code repository management, code merge and quality checks and version control workflows
- Significant hands-on IaC experience managing infrastructure using Terraform, Helm
- 5+ years of hands-on experience implementing and managing Observability tools preferably Prometheus, New Relic, Grafana, ELK, Splunk
- Hands-on experience managing CI/CD pipelines using one or more of the following: Jenkins, ArgoCD, Github Actions, JenkinsX
- Significant experience in participating in 24/7 on-call rotation, managing runbooks and building documentation
- Preferred: 3+ years with Google Cloud Platform
- Preferred: Hands-on experience with database administration especially MySQL and Cassandra
- Preferred: Message Broker systems including ActiveMQ, Kafka
- Preferred: Implementation and management of CDN & WAF Solutions - Cloudflare, Akamai, Amazon CloudFront, Google Cloud CDN