
Site Reliability Engineer
VALCE Talent Solutions
full-time
Posted on:
Location Type: Hybrid
Location: Guadalajara • Mexico
Visit company websiteExplore more
About the role
- Support multiple Diligent datacentres and deployments in the Cloud (AWS, Azure, GCP).
- Lead and improve daily operational processes across production environments.
- Contribute to incident management and response efforts, conduct root cause analysis and lead retrospective reviews for all Diligent applications.
- Build self-service tools and automation pipelines to enhance developer productivity and system reliability.
- Drive observability, cost optimization, and security best practices across cloud environments.
- Collaborate with engineering and SRE teams to improve system performance, maintainability, and scalability.
- Ensure high availability and fault tolerance of cloud services through proactive monitoring and automation.
Requirements
- 3+ years of professional experience in Software Engineering, DevOps, or Site Reliability Engineering.
- Provide support to development teams on monitoring, scalability, and reliability.
- Intermediate-level experience with AWS, including services like EC2, Lambda, ECS, Fargate, S3, IAM, VPC, Route 53, RDS, DynamoDB, and CloudWatch.
- Experience with Infrastructure as Code (IaC) using Terraform, Terraform CDK or AWS CDK.
- Proficiency in designing, implementing, and testing scalable software architectures.
- Strong automation and scripting abilities for operational workflows and cloud infrastructure.
- Demonstrated experience using AI tools to accelerate engineering workflows (code generation, test writing, documentation, RCA analysis)
- Familiarity with LLM APIs and prompt engineering for automation tasks
- Experience with building AI-assisted runbooks, alert triage, or auto-remediation workflows
- Ability to critically evaluate AI-generated code and configurations for correctness, security, and operational risk
- Excellent problem-solving skills.
Benefits
- Flexible work arrangements
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSAzureGCPInfrastructure as CodeTerraformTerraform CDKAWS CDKautomationscriptingsoftware architecture
Soft Skills
problem-solvingcollaborationleadershipincident managementroot cause analysisretrospective reviewscritical evaluation