Senior Software Engineer – Reliability Engineering
The Home Depot
Senior Software Engineer developing software solutions to enhance reliability and performance for Home Depot's platform. Leading automation and collaborative efforts to meet operational objectives.
Tech Stack
Tools & technologies
BigQuery, Cloud, Elasticsearch, Google Cloud Platform, Grafana, Node.js, Prometheus, Python, ServiceNow, Terraform
About the role
Key responsibilities & impact
- Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide
- Takes on new opportunities and tough challenges with a sense of urgency, high energy, and enthusiasm
- Consistently achieves results, even under tough circumstances
- Develops test suites (functional, destructive, etc.) to enable successful, rapid deployment of code to production
- Takes a broad view when approaching issues, using a global lens
- Learns through successful and failed experiments when tackling new problems
- Actively seeks ways to grow and be challenged using both formal and informal development channels
- Collaborates with other team members in agile processes
- Creates new and better ways for the organization to be successful
- Works with the Product Team to ensure user stories are valuable, developer ready, easy to understand, and testable
- Delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences
- Adapts approach and demeanor in real-time to match the shifting demands of different situations
- Relates openly and comfortably with diverse groups of people
- Helps grow junior engineers by providing guidance on modern software development frameworks and leading technical discussions
Requirements
What you’ll need
- Must be eighteen years of age or older.
- Must be legally permitted to work in the United States.
- GCP Cloud Infrastructure — BigQuery analytics, ADC auth, cloud-native services
- Observability — Grafana, Prometheus, Kibana/Elasticsearch (WES logs), OCP Health Dashboards
- Terraform Enterprise — Infrastructure as Code
- GitHub — SCM
- GH Copilot + AI Agents — AI-accelerated incident analysis, automated remediation workflows, prompt-engineered operational tooling
- SRE Practices — Production Readiness Review, Capacity Planning, Change Validation, Prod Support, Post-Mortems, SLO Definition & Tracking
- ServiceNow — Incident, Problem, and Change management; trend analysis; RCA grouping
- BigQuery — Incident analytics, problem candidate identification, operational reporting
- PagerDuty — On-call scheduling, escalation paths, push-button paging
- Rundeck — Self-heal automation, push-button remediation jobs
- Atlassian (Jira/Confluence) — RCA documentation, runbooks, architecture diagrams, onboarding
- CyberArk — Privileged access for WMS/DFC log pulls and node access
- Manhattan WMS — Warehouse Management System operations, RF/UI/LM node support
- Python Automation — Operational scripting, BQ pipelines, alert correlation, report generation.
Benefits
Comp & perks
- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Remote work options
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
GCP Cloud Infrastructure, BigQuery, Terraform Enterprise, GitHub, Grafana, Prometheus, Kibana, SRE Practices, Python Automation, Rundeck
Soft Skills
collaboration, communication, adaptability, problem-solving, leadership, creativity, urgency, enthusiasm, results-oriented, mentorship