Salary
💰 $118,000 - $135,000 per year
Tech Stack
Ansible, AWS, Azure, Chef, Cloud, Grafana, Jenkins, Microservices, Prometheus, Puppet, Python, Ray, Ruby, Splunk, Terraform
About the role
- Founded in 2004, NetBrain is the leader in no-code network automation. Its ground-breaking Next-Gen platform provides IT operations teams with the ability to scale their hybrid multi-cloud connected networks by automating the processes associated with Diagnostic Troubleshooting, Outage Prevention and Protected Change Management.
- Today, over 2,500 of the world’s largest enterprises and managed services providers leverage NetBrain’s platform.
- NetBrain is looking for a Senior Development Operations Engineer to join our Research and Development Team on a hybrid basis at our Burlington, MA headquarters. This position will play an impactful role in our SaaS solution as you help enable the leading preventative network automation solution on the market, powered by agentic AI.
- Collaborate with product, development and security teams to deliver solutions efficiently and securely.
- Design, deploy and manage scalable AWS cloud infrastructure, using scripts and tooling to implement infrastructure as code (Terraform, CloudFormation or AWS CDK).
- Build and maintain CI/CD pipelines with automated testing and deployment gating (CodePipeline, CodeBuild, GitHub Actions or Jenkins).
- Automate provisioning and deployments for microservices and serverless workloads.
- Establish monitoring, tracing and alerting with CloudWatch, Prometheus/Grafana, Datadog and X-Ray.
- Implement centralized logging.
- Manage IAM roles and policies, enforce least-privilege access, and implement encryption (at rest and in transit), secure APIs and AWS WAF.
- Integrate container security practices while supporting compliance efforts with automated controls and documentation.
- Configure and manage VPCs, subnets, routing, NAT gateways and secure connectivity.
- Design and maintain highly available multi-AZ and multi-region architectures.
- Develop and test backup and disaster recovery strategies (AWS Backup, RDS snapshots, S3 versioning).
- Respond to incidents, perform root cause analysis and implement preventative measures.
- Maintain clear technical documentation, runbooks and onboarding guides.
Requirements
- A bachelor’s degree and 5 years of experience with cloud technologies such as AWS, Azure, or Google Cloud; or a master’s degree and 3 years of experience.
- Expertise in programming or scripting in Python, Ruby, or Bash.
- Solid understanding of CI/CD tools.
- AWS Professional Certification is a plus.
- Familiar with provisioning and configuration management tools (e.g., Puppet, Chef, Ansible).
- Familiar with application monitoring tools like ELK, Splunk, or Datadog.
- Familiar with IaC tools such as Terraform or CloudFormation.
- Experience with SaaS project implementation.
- Ability to commute to Burlington, MA on a hybrid work schedule.