Tech Stack
AWSCloudCyber SecurityDynamoDBEC2KafkaKubernetesNoSQLPostgresRedisTerraformVault
About the role
- Architect, deploy, and maintain highly available, secure, and performant database systems across multiple AWS regions. (PostgreSQL and NoSQL)
- Implement automation for database provisioning, scaling, backups, failover, and patch management using Infrastructure as Code (Terraform, Crossplane).
- Establish database observability: design and maintain telemetry for query performance, replication lag, disk I/O, and capacity forecasting.
- Partner with development teams to review schema changes, query plans, and application database interactions for efficiency and scalability.
- Drive incident response for database-related outages, including root cause analysis, remediation, and prevention.
- Maintain compliance with industry standards and security frameworks (encryption, access controls, audit logging).
- Collaborate on disaster recovery planning and ensure tested, reliable backup and restore processes.
- Maintain detailed and accurate technical documentation and operational runbooks.
- Evaluate emerging database technologies and patterns; recommend and implement improvements.
Requirements
- 5+ years of experience in database administration, site reliability engineering, or related roles.
- Proven track record in managing mission-critical database environments in production.
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration across engineering, product, and security teams.
- Ability to manage multiple priorities and deliver in a fast-paced environment.
- Proficiency with PostgreSQL (Aurora, RDS), Redis, DynamoDB, Kafka; Terraform, Crossplane; Kubernetes Operators for databases; CI/CD pipelines (GitLab CI/CD); Monitoring & Observability: Datadog, OpenSearch; Cloud Technologies: AWS, Networking, Security.