Tech Stack
AnsibleAWSAzureChefCitrixCloudDistributed SystemsDNSDockerGoogle Cloud PlatformJavaJenkinsKubernetesPostgresPuppetPythonReactShell ScriptingSplunkSQLTerraform
About the role
- Participate in a Hybrid work schedule that includes working from our Raleigh Office 3 days a week.
- Manage highly scalable and available distributed systems on cloud-based platforms.
- Automate everything - write automation and configuration management code to build scalable, reliable and secure systems.
- Identify key system metrics and ensure adequate monitoring coverage for new and existing services.
- Build monitoring, supervision and resiliency mechanisms to observe, react to, and maintain platform health and performance.
- Manage deployments, upgrades and configurations in a controlled, pre-production and production environment with tight operating perimeters.
- Be part of a DevOps team with rotating after hours on-call support (24/7/365).
- Verify the functionality of components and services and ensures deployment meets expectations; Design, create, and document test plans, possibly code a custom test automation framework.
- Work on complex, major or highly visible tasks in support of multiple projects that require multiple areas of expertise.
- Plan and execute ongoing routine application maintenance tasks, such as production support, and troubleshooting existing information systems; Identify errors and deficiencies as well as develop long and short-term solutions.
- Keep up-to-date with security patches and proactively address security vulnerabilities and compliance.
- Analyze a variety of approaches to larger DevOps issues – provide pros and cons of different approaches to the team to arrive at an agreed upon direction.
- Gather and understand business requirements with a focus on business outcomes.
- Prepare and present formal training to IT staff, as needed.
Requirements
- Strong Windows skills with respect to operations, and troubleshooting.
- Experience operating, tuning, and troubleshooting SQL Server and Postgres in Production as well as database design, backup, and restoration.
- Operational experience with Kubernetes, Docker and HashiCorp products in Production.
- Strong demonstrated experience in Python, Java, or Shell Scripting.
- Expert experience with operations and DevOps in AWS.
- Expert experience with automation/configuration management using Ansible, Terraform, Chef, or Puppet; CI/CD tools, such as Jenkins, Artifactory, GIT, etc.
- The ability to project, manage or act as project lead and provide subject matter expertise and technical direction for certain subjects.
- Strong experience with APM tools such as NewRelic or AppDynamics, as well as Log Analysis and Monitoring tools such as Splunk, CloudWatch, Dynatrace, Nagios, SysDig, Site24x7, etc.
- BS or MS in Computer Science, related field, or equivalent professional experience.
- Experience in network security (DNS, VPN/VPC, IDS/IPS, Subnets/Security Groups/Network ACLs) and technologies supporting compliance (HA/DR, Identity Management, Key Management, WAF and others).
- Be part of a DevOps team with rotating after hours on-call support (24/7/365).
- NICE TO HAVE: Experience with Cloud Network Design (AWS, Azure or GCP).
- NICE TO HAVE: Experience with Fortigate, Citrix Netscaler or other firewalling and Load Balancing Products.
- NICE TO HAVE: Experience with Kubernetes Clusters optimization and security management.
- NICE TO HAVE: Experience managing VPN solutions incorporating 2FA.
- NICE TO HAVE: Understanding of security best practices for banking procedures (ideally PCI or SOC knowledge).
- NICE TO HAVE: Experience with JIRA.