Bitdeer Group

Senior SRE, Platform Software Engineer

Senior engineer at Bitdeer who develops and operates SRE platform services. Responsibilities include taking architect-approved designs to production and owning several core services.

Posted 6/24/2026 • Full-time • San Jose • California, Texas • 🇺🇸 United States • Senior

Tech Stack

Tools & technologies
AWS, Bootstrap, Cloud, Elasticsearch, Flux, Go, Google Cloud Platform, Java, Prometheus, Python, Rust, SQL, Vault

About the role

Key responsibilities & impact
  • Take an architect-approved design and turn it into production code that ships through GitOps and the CI/CD release pipeline.
  • Build and operate one or more bounded contexts of the NeoCloud SRE platform.
  • Own one or two services, such as the collection, alerting, correlation, or SLO frameworks.
  • Implement fault-prediction and remediation workflows.
  • Manage hardware lifecycle and data center operations.
  • Develop services for identity, secrets, tenant configuration, and customer bridging.

Requirements

What you’ll need
  • 7+ years of production software engineering experience, including 2 or more years operating what you built (real on-call experience, not just shipping code).
  • Production-depth mastery of at least one systems-grade language—Go (preferred), Rust, or Java.
  • Proficiency in Python for tooling and SDK work.
  • Strong grasp of at-least-once vs. exactly-once trade-offs, idempotency, back-pressure, leader election, consistent hashing, gossip, and fan-out.
  • Experience at production scale with Prometheus, VictoriaMetrics, Mimir, Thanos, Loki, Elasticsearch, Tempo, Jaeger, or OpenTelemetry.
  • Hands-on experience with Argo, Flux, Helm, Kustomize, Cosign signing, signed-bundle promotion, and blast-radius-aware rollouts.
  • Proven experience writing a controller or CRD handling real production traffic, with a deep understanding of watch-cache mechanics, leader election, and reconcile loops.
  • Experience executing end-to-end mTLS bootstrap with certificate rotation.
  • Hands-on experience with HashiCorp Vault or cloud KMS (AWS KMS / GCP KMS).
  • Ability to read a Prometheus query plan, build a recording-rule strategy, and write SQL that joins per-tenant telemetry against analytics-lake tables.
  • Rigorous approach to unit, integration, contract, chaos, and soak testing.
  • Ability to author clear design docs that align with existing platform architecture, create runbooks optimized for 3 AM on-call responses, and write intent-driven PR descriptions.

Benefits

Comp & perks
  • Bitdeer is committed to providing equal employment opportunities in accordance with country, state, and local laws.