Coralogix

Site Reliability Engineer

full-time

Origin: 🇺🇸 United States • Massachusetts

Job Level

Mid-Level, Senior

Tech Stack

Apache, AWS, Cloud, Go, Grafana, gRPC, Kafka, Kubernetes, Prometheus, Terraform

About the role

  • Work in high-scale environments: the Coralogix data pipeline processes 55Tb of data each day
  • Adopt cutting-edge technologies with end-to-end responsibility
  • Build internal tools to expand platform capabilities
  • Collaborate with R&D to improve the stability and reliability of the system
  • Lead the product roadmap and influence product direction
  • Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management
  • Operate and monitor Kubernetes, Kafka, Prometheus, Thanos, Istio, Argo CD and related cloud infrastructure

Requirements

  • At least 5 years of experience as a DevOps Engineer/SRE in production environments
  • In-depth experience with Kubernetes, particularly operating and monitoring it
  • Strong familiarity with monitoring tools such as Coralogix, Grafana, Prometheus
  • Experience in AWS or other cloud providers
  • Experience with infrastructure as code (Terraform, Crossplane, etc.)
  • Understanding of networking, from the network layers to individual protocols (HTTP, gRPC, SSL)
  • Some software engineering experience, preferably in Golang
  • Advantage: Experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting
  • Advantage: Experience operating data pipelines
  • Advantage: Familiarity with Apache Kafka