
Systems Monitoring Engineer
The Hashgraph Association
full-time
Posted on:
Location Type: Remote
Location: India
Visit company websiteExplore more
Tech Stack
About the role
- Design, deploy, and maintain monitoring solutions (Prometheus, Grafana) for DLT-specific metrics (consensus finality, node health, on-chain activity)
- Integrate and manage PagerDuty for rapid, automated incident response
- Deploy and manage Mirror Nodes and RPC relays using Terraform/Ansible across AWS/GCP
- Serve as the L3 escalation point for complex incidents (“ghost transactions,” API anomalies)
Requirements
- 4+ years in DevOps, SRE, or NOC roles (with 1–2 years in Web3/Blockchain environments)
- Deep expertise in Prometheus/Grafana, Linux, Docker/Kubernetes, and scripting (Python, Go, Bash)
- Proven experience with cloud platforms (AWS/GCP) and IaC tools (Terraform)
- Strong understanding of Hedera Hashgraph or EVM-based chains, and ability to interpret ledger APIs
- Familiarity with ITIL/ITSM, DORA, SOC2, or ISO 27001 frameworks
Benefits
- Opportunity to be a part of the world’s leading DLT ecosystem
- Significant career growth potential in a fast growing sector
- Working with colleagues and on projects across the globe
- Open and direct communication, flat structures
- Flexible working hours
- Competitive salary package
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PrometheusGrafanaLinuxDockerKubernetesPythonGoBashTerraformAnsible