The Leaflet

Senior Database Administrator

Senior Database Administrator managing CockroachDB and PostgreSQL for Hard Rock Digital. Evolving multi-region database clusters and ensuring production reliability.

Posted 6/23/2026 • full-time • Remote • Florida • 🇺🇸 United States • Senior

Tech Stack

Tools & technologies
AWS, Azure, Cloud, Go, Google Cloud Platform, Grafana, Java, Linux, Postgres, Prometheus, Python, RDBMS, Shell Scripting, SQL, Terraform, Unix

About the role

Key responsibilities & impact
  • Operate and evolve multi-region CockroachDB clusters and PostgreSQL instances across production, staging, and development.
  • Investigate contention, serialization retries (SQLSTATE 40001), and range hotspots in the transaction paths behind bets, wagers, settlements, and payouts.
  • Diagnose leaseholder placement, monotonic-key write pressure, and cross-region latency that turn one query into several network round trips.
  • Trace a slow query to its execution plan, the plan to optimizer behavior, and the behavior to the schema or application call path that caused it.
  • Catch plan regressions after statistics changes, schema changes, data growth, or releases, then turn the fix into a safer rollout pattern.
  • Assess schema-change and backfill risk before it reaches large tables: lock behavior, retry pressure, capacity impact, and rollback path.
  • Plan capacity, scaling events, and version upgrades that customers don't feel.
  • Build database observability with Prometheus, Grafana, Mimir, Loki, and Snowflake: dashboards backed by real queries, alerting on leading indicators, and SLO tracking.
  • Connect database symptoms to application behavior and business events so alerts point at causes.
  • Join the on-call rota for the data layer, lead incident analysis when the database is part of the failure, and drive blameless postmortems toward durable fixes.
  • Instrument the system to surface degradation before it becomes an incident.
  • Build Go and Python tools that make incidents easier to understand: log collectors, explain-plan analyzers, migration checks, capacity models, and runbook generators.
  • Automate provisioning, configuration, backup, and recovery with Terraform and other infrastructure-as-code tools.
  • Work with platform engineering on CI/CD, deployment safety, and change management for database-touching services.
  • Use Claude Code, Codex, and similar harnesses in daily work to investigate incidents, generate probes, draft runbooks, and write tooling.
  • Keep the harness grounded in logs, traces, metrics, and source code, and verify its output against production facts before you ship it.
  • Implement and audit access controls, authentication, authorization, and encryption in transit and at rest.
  • Support data-protection, audit-logging, and retention requirements for a regulated gaming platform.
  • Build a database center of excellence: documentation, standards, and reusable patterns that other teams build against.
  • Mentor engineers on database fundamentals, distributed-systems behavior, and operational discipline.
  • Read CockroachDB and PostgreSQL internals when a problem needs a code-level answer, and bring what you learn back to the team.

Requirements

What you’ll need
  • 5+ years operating production relational databases as a DBA, Database Reliability Engineer, or Data Platform Engineer.
  • Deep production experience with PostgreSQL, CockroachDB, or another relational database management system, with fluency in the tradeoffs behind isolation, consensus, locality, and query planning.
  • You understand how a SQL engine works under the hood: MVCC, the Volcano iterator execution model, and cost-based optimizer frameworks such as Cascades.
  • You apply that knowledge to indexing strategy and to execution plan analysis.
  • Working knowledge of distributed-systems fundamentals: consensus (Raft and Paxos), distributed transactions, consistency models, and failure modes.
  • You write tools and automation in Python or Go, and you can debug and suggest fixes to line-of-business code in Java or a similar language.
  • Experience with infrastructure-as-code and a monitoring and observability stack such as Grafana or Datadog.
  • UNIX/Linux administration with shell scripting, comfort on a cloud platform (AWS preferred; Azure or GCP welcome), and willingness to join an on-call rota.
  • Clear technical writing and speech. You'll work across teams.

Benefits

Comp & perks
  • Competitive pay and benefits
  • Flexible vacation allowance
  • A hybrid / remote working environment
  • Startup culture backed by a secure, global brand
  • Opportunity to build products enjoyed by millions as part of a passionate team
  • Team gatherings around the US and Europe
  • Employer-sponsored training and conference attendance
  • Opportunity to work in an AI-first environment with access to the tools you need to excel at your job

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PostgreSQL, CockroachDB, SQL, Python, Go, Terraform, UNIX, Linux, distributed systems, infrastructure as code
Soft Skills
technical writing, communication, mentoring, incident analysis, collaboration, problem-solving, leadership, organizational skills, blameless postmortems, capacity planning