
DevOps Engineer
True Platform
full-time
Location Type: Remote
Location: Remote • 🇺🇸 United States
Job Level
Mid-Level, Senior
Tech Stack
AWS, Azure, Cloud, Docker, Go, Google Cloud Platform, Jenkins, Kubernetes, Python, Ruby, Terraform
About the role
- Implement and maintain observability tools and dashboards (e.g., AWS CloudWatch, Datadog, Sentry, OpenTelemetry).
- Go beyond basic CPU/memory metrics; instrument applications for high-value Application Performance Monitoring (APM) traces, custom business metrics, and real-user monitoring (RUM).
- Enhance security monitoring in our observability stack. Implement automated alerts for anomalous behavior, access pattern violations, and potential security threats.
- Implement logging and retention configurations that meet defined data retention policies and relevant standards (e.g., GDPR, CCPA, SOC 2), and ensure PII is appropriately redacted or handled.
- Assist with cloud cost visibility and optimization.
- Analyze infrastructure usage patterns to identify waste, implement aggressive tagging strategies, and recommend rightsizing adjustments to reduce spend.
- Manage Reserved Instances, Savings Plans, and Spot Instance usage to maximize value.
- Manage and enhance our CI/CD pipelines (e.g., GitHub Actions, GitLab CI, Jenkins), optimizing for speed, reliability, and ease of use for developers.
- Integrate security scanning (SAST/DAST/container scanning) and compliance checks directly into the CI pipeline.
- Manage the tooling and processes for deploying applications to AWS EKS / Kubernetes / ECS / Serverless.
- Facilitate modern deployment strategies, such as Blue/Green deployments, Canary releases, and feature-flag rollouts, to minimize blast radius during releases.
- Maintain and evolve our Infrastructure as Code (IaC) base (e.g., Terraform, OpenTofu, CloudFormation, Pulumi).
Requirements
- Experience: 3+ years of hands-on experience in a DevOps, SRE, or Platform Engineering role supporting production environments.
- Cloud Fluency: Strong proficiency with a major cloud provider (AWS preferred, Azure or GCP acceptable).
- Observability Expertise: Proven experience configuring and managing a modern observability stack (logs, metrics, and distributed tracing). You know the difference between a useful alert and noise.
- Infrastructure as Code: Solid experience with Terraform (or equivalent IaC tools) in a collaborative team environment (state management, modules, PR reviews).
- Containers & Orchestration: Strong working knowledge of Docker and container orchestration (Kubernetes experience is highly valued).
- CI/CD: Solid understanding of CI/CD principles and experience building pipelines.
- Scripting: Comfort with scripting languages for automation (Ruby, Bash, Python, or Go).
Benefits
- Time Off: Unlimited PTO because we trust you to manage your time and recharge when you need it. We encourage our team to truly disconnect and come back refreshed.
- Comprehensive Benefits: Our generous benefits package includes medical and dental coverage, competitive 401(k) matching to help you plan for the future, plus gym subsidies to support your health and wellness goals.
- Balanced Growth: We offer intellectually challenging work on meaningful problems while respecting your work/life balance. We believe the best work happens when people have time for life outside of work.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Application Performance Monitoring, real-user monitoring, logging and retention configurations, Infrastructure as Code, CI/CD, scripting, observability stack, cloud cost optimization, security scanning, container orchestration
Soft skills
collaborative team environment, analytical skills, problem-solving, communication