FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Senior Software Engineer – Grafana Databases, Managed Services
Grafana LabsSenior Engineer managing production-critical streaming infrastructure for Grafana Cloud. Working on multicloud systems ensuring reliability, scaling, and operational excellence.
Tech Stack
Tools & technologiesAWSAzureCloudDistributed SystemsGoGoogle Cloud PlatformKubernetesLinuxTerraform
About the role
Key responsibilities & impact- Operating and evolving multi-cloud streaming clusters and related database infrastructure
- Diagnosing and eliminating cross-layer failure modes
- Designing upgrade and rollout strategies at scale
- Improving observability, automation, and operational ergonomics
- Collaborating with database and platform teams on scaling and performance
Requirements
What you’ll need- 6+ years of engineering experience in SRE, platform engineering, or distributed systems roles
- Experience operating distributed systems in production
- Strong Kubernetes experience in AWS, GCP, or Azure
- Familiarity with infrastructure-as-code tooling (Helm, Terraform, Jsonnet)
- Proficient in at least one programming language (Go preferred)
- Working knowledge of Linux internals, networking, and cloud storage
- Experience in blameless incident response and post-incident reviews
- Clear communicator capable of collaboration across teams
Benefits
Comp & perks- Restricted Stock Units (RSUs)
- Equity
- Bonuses (if applicable)
- 30 days of annual leave
- Company-funded AI tools usage budget
- In-person onboarding
- Global culture of collaboration and shared purpose
- Career growth pathways
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
KubernetesAWSGCPAzureinfrastructure-as-codeHelmTerraformJsonnetGoLinux
Soft Skills
collaborationcommunicationblameless incident responsepost-incident reviews