Polygon Labs

Network Operations Engineer

Employment Type: Full-time

Location Type: Remote

Location: Anywhere in Latin America

About the role

  • Monitor the health and performance of Polygon Labs’ infrastructure, including blockchain networks, bridges, RPC services, staking systems, and user-facing products
  • Track third-party dependencies and identify degradation that may impact the broader ecosystem
  • Validate and triage alerts by distinguishing signal from noise, assessing severity, and determining impact
  • Escalate confirmed issues to the appropriate SRE or engineering teams with clear, structured context
  • Coordinate incident response by ensuring the right stakeholders are engaged, timelines are maintained, and communication is consistent
  • Document incidents in real time, including decisions, actions, and outcomes
  • Build and improve dashboards, alerting systems, and monitoring coverage to enhance visibility
  • Create and maintain runbooks for common failure modes and triage workflows
  • Support ecosystem participants such as validators and infrastructure providers when issues intersect with Polygon Labs systems
  • Validate user-facing product functionality during incidents, even in the absence of predefined runbooks

Requirements

  • Foundational experience working with Linux systems, including navigating filesystems, reading logs, and understanding system processes
  • Understanding of core networking concepts (DNS, HTTP, TCP/IP) and the ability to troubleshoot service connectivity issues
  • Basic scripting ability (e.g., Python, Bash) to automate tasks or analyze system data
  • Exposure to monitoring and observability tools such as Datadog, Grafana, or Prometheus
  • Strong written communication skills, with the ability to clearly document incidents and procedures under pressure
  • Willingness to work in a shift-based, follow-the-sun environment, with a structured and methodical approach to troubleshooting
  • Familiarity with blockchain infrastructure, including node operation or EVM-based systems (preferred)
  • Experience with Datadog or similar observability platforms in production environments (preferred)
  • Exposure to infrastructure-as-code tools such as Terraform or configuration management tools like Ansible (preferred)
  • Previous experience in a network operations center, incident response team, or on-call rotation (preferred)
  • Experience working in a remote, globally distributed team (preferred)

Benefits

  • Remote-first global workforce
  • Industry-leading medical, dental, and vision health insurance*
  • 401(k) with 3% company match*
  • $1,500 home office setup allowance (lifetime max)
  • $200 Annual AI Allowance
  • $75 monthly internet or phone reimbursement
  • Flexible Time Off
  • Company-issued laptop
  • Egg freezing, mental health, and employee wellness benefits