Tech Stack
Tools & technologies
- AWS
- Cloud
- Docker
- DynamoDB
- Grafana
- Kubernetes
- Prometheus
- Ray
- Splunk
- Terraform
About the role
Key responsibilities & impact
- A cloud observability engineer’s day is about making complex systems understandable, improving signal quality, and enabling faster, smarter debugging across teams.
- Check system health, review alerts/incidents.
- Triage alerts
- Investigate issues
- Improve observability instrumentation
- Build and improve dashboards
- Alert optimization
- Work with development teams and other engineering partners.
- Continuous improvements.
- Release support.
- Design and implement observability frameworks using metrics, logs, and distributed tracing
- Develop dashboards, alerts, and visualizations to monitor system health
- Standardize observability practices across engineering teams (logging, telemetry, tracing)
- Implement and manage native monitoring tools.
- Build alerting systems (avoid alert fatigue)
- Participate in on-call rotations
- Build intelligent alerting
- Improve system reliability and proactively reduce risk
- Identify reliability risks to help harden systems against failure
- Reduction in alert noise / false positives
- Increased observability coverage (% of services instrumented)
- Improved SLO compliance
- Embed observability into applications
- Add tracing/metrics into code
- Standardize logging formats
- Ensure all services are observable end-to-end
Requirements
What you’ll need
- Background in SRE, DevOps, or Cloud Engineering
- Cloud platforms: AWS
- Experience with AWS services including CloudWatch, X-Ray, Lambda, ECS/EKS, API Gateway, RDS, DynamoDB, and S3
- Hands-on experience with an observability tool such as Dynatrace, Splunk, Datadog, OpenTelemetry, Prometheus, Grafana, or AWS Distro for OpenTelemetry
- Strong understanding of containers and orchestration (Docker, Kubernetes)
- Strong understanding of CI/CD pipelines
- Strong understanding of Infrastructure as Code (Terraform, CloudFormation)
Nice to have
- Experience building observability platforms at scale in AWS
- Familiarity with multi-account AWS environments
- Experience with cost optimization for observability (logging/metrics ingestion)
- Experience with high-scale distributed systems
Benefits
Comp & perks
- Health insurance
- Professional development opportunities
ATS Keywords
Hard Skills & Tools
- Cloud Observability
- Alert Optimization
- System Health Monitoring
- Distributed Tracing
- Metrics and Logging
- CI/CD Pipelines
- Infrastructure as Code
- Cost Optimization
- High-Scale Distributed Systems
- Standardized Logging Formats
Soft Skills
- Collaboration
- Problem-Solving
- Continuous Improvement