Principal SRE
Chalkboard
The Principal SRE ensures platform reliability and scalability for Chalkboard’s immersive sports gaming experience, collaborating closely with Engineering, Product, and Data teams to enhance user engagement.
Posted 4/12/2026
Full-time
New York City • New York • 🇺🇸 United States
Lead
💰 $230,000 - $250,000 per year
Tech Stack
Tools & technologies
- Cloud
- Distributed Systems
- Google Cloud Platform
- Kubernetes
- Terraform
About the role
Key responsibilities & impact
- Own platform reliability end-to-end, proactively identifying and mitigating risks before they impact users
- Build and evolve observability (metrics, logs, tracing) to enable fast detection, diagnosis, and resolution of issues
- Scale infrastructure ahead of demand by identifying bottlenecks and implementing durable architecture improvements
- Reduce developer friction by improving CI/CD pipelines, deployment workflows, and internal tooling
- Lead incident response and root cause analysis, driving systemic fixes—not just short-term patches
- Establish and enforce best practices for infrastructure, deployments, and system reliability
- Build reusable, self-service infrastructure that enables teams to ship quickly and safely
- Continuously improve systems through automation and Infrastructure-as-Code
Requirements
What you’ll need
- Cloud Infrastructure (GCP preferred): networking, IAM, databases, storage
- Kubernetes: cluster operations and workload management
- Infrastructure as Code: Terraform, Helm
- CI/CD: GitHub Actions or similar
- Observability: metrics, logging, tracing, alerting
- 8+ years of experience in SRE, platform engineering, or infrastructure roles
- Strong experience with distributed systems and backend architectures
- Proven ability to improve system reliability, scalability, and performance
- Experience building and improving CI/CD pipelines and deployment workflows
- Strong debugging skills using data (logs, metrics, traces)
- Experience leading incident response and driving root cause analysis
- Ability to make pragmatic tradeoffs between speed, reliability, and scale
- Experience partnering across engineering teams to improve developer velocity
Benefits
Comp & perks
- Comprehensive medical, dental, and vision coverage starting day 1, with the majority of premiums covered by Chalkboard
- 401(k) with company match
- Lunch on us every day with a corporate DoorDash account
- Refuel in the office with protein shakes, energy drinks, and a snack buffet
- Flexible time-off policy, plus 10 company holidays and WFH during the holidays
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
- Cloud Infrastructure
- Kubernetes
- Infrastructure as Code
- Terraform
- Helm
- CI/CD
- GitHub Actions
- Observability
- Distributed systems
- Backend architectures
Soft Skills
- Incident response
- Root cause analysis
- System reliability
- Scalability
- Performance improvement
- Debugging
- Pragmatic tradeoffs
- Collaboration
- Leadership
- Problem-solving