
Site Reliability Engineer II
Veritone
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Salary
💰 $130,000 - $140,000 per year
Tech Stack
About the role
- deploy and maintain a resilient, secure, and efficient SaaS application platform to meet established SLAs
- build and maintain robust CI/CD pipelines
- design and deploy scalable infrastructure optimized for AI/ML workloads
- automate monitoring, management and incident response to achieve an auto-remediation system
- participate in on-call rotation to ensure stability and uptime for platforms
- independently design and develop tools to aid in operations and automation
Requirements
- 7+ years of experience in Linux systems and software management
- expertise with Terraform, Ansible, and cloud platforms like AWS, Azure, and GCP
- experience with large-scale distributed systems, monitoring/alerting systems (Prometheus, Grafana), CI/CD pipelines, container orchestration (Docker, Kubernetes), and programming languages (Go, Java, Python)
- background in implementing security controls, automating deployments, and troubleshooting complex systems
Benefits
- incentive compensation
- health benefits
- retirement benefits
- life insurance
- paid time off
- parental leave and benefits
- other employee perks and benefits
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Linux systemsTerraformAnsibleAWSAzureGCPPrometheusGrafanaDockerKubernetes
Soft Skills
independent designoperationsautomationincident responseon-call rotation