
Network Operations Engineer
Polygon Labs
Full-time
Location Type: Remote
Location: Anywhere in Latin America
About the role
- Monitor the health and performance of Polygon Labs’ infrastructure, including blockchain networks, bridges, RPC services, staking systems, and user-facing products
- Track third-party dependencies and identify degradation that may impact the broader ecosystem
- Validate and triage alerts by distinguishing signal from noise, assessing severity, and determining impact
- Escalate confirmed issues to the appropriate SRE or engineering teams with clear, structured context
- Coordinate incident response by ensuring the right stakeholders are engaged, timelines are maintained, and communication is consistent
- Document incidents in real time, including decisions, actions, and outcomes
- Build and improve dashboards, alerting systems, and monitoring coverage to enhance visibility
- Create and maintain runbooks for common failure modes and triage workflows
- Support ecosystem participants such as validators and infrastructure providers when issues intersect with Polygon Labs systems
- Validate user-facing product functionality during incidents, even in the absence of predefined runbooks
Requirements
- Foundational experience working with Linux systems, including navigating filesystems, reading logs, and understanding system processes
- Understanding of core networking concepts (DNS, HTTP, TCP/IP) and the ability to troubleshoot service connectivity issues
- Basic scripting ability (e.g., Python, Bash) to automate tasks or analyze system data
- Exposure to monitoring and observability tools such as Datadog, Grafana, or Prometheus
- Strong written communication skills, with the ability to clearly document incidents and procedures under pressure
- Willingness to work in a shift-based, follow-the-sun environment, and a structured, methodical approach to troubleshooting
- Familiarity with blockchain infrastructure, including node operation or EVM-based systems (preferred)
- Experience with Datadog or similar observability platforms in production environments (preferred)
- Exposure to infrastructure-as-code tools such as Terraform or configuration management tools like Ansible (preferred)
- Previous experience in a network operations center, incident response team, or on-call rotation (preferred)
- Experience working in a remote, globally distributed team (preferred)
Benefits
- Remote-first global workforce
- Industry-leading medical, dental, and vision health insurance*
- 401(k) with a 3% company match*
- $1,500 home office setup allowance (lifetime maximum)
- $200 annual AI allowance
- $75 monthly internet or phone reimbursement
- Flexible time off
- Company-issued laptop
- Egg freezing, mental health, and employee wellness benefits