
Senior Infrastructure Engineer
Rithum
full-time
Location Type: Hybrid
Location: Raleigh • North Carolina • United States
Salary
💰 $70,000 - $125,000 per year
About the role
- Support and operate physical datacenter infrastructure, including servers, storage, networking, and rack-and-stack activities.
- Manage and maintain VMware vSphere / ESXi environments, including host builds, upgrades, patching, and troubleshooting.
- Administer enterprise storage platforms such as Pure Storage and/or NetApp, including provisioning, snapshots, replication, and performance monitoring.
- Perform hardware lifecycle activities: installs, replacements, firmware upgrades, and decommissioning.
- Participate in planned maintenance, change control, and incident response for datacenter systems.
- Build infrastructure-as-code software and other tools that give software teams a foundation for rapidly building large-scale cloud systems.
- Establish a golden AMI pipeline and maintain a continuous patching schedule for all cloud resources.
- Operationalize the resiliency and disaster recovery processes for the IaaS landscape.
- Automate key operating and security processes in IaaS workloads.
- Manage infrastructure through a configuration management platform to ensure security and full-stack automation for applications, and drive adoption of the platform and its standards across all applications.
- Drive investigation and resolution of security incidents impacting cloud infrastructure
- Work closely with the security team to address vulnerabilities, gather evidence, and secure our infrastructure.
- Handle the security hardening efforts for major OS distributions to ensure overall compliance and a streamlined service for internal clients
- Participate in the rotating on-call schedule. Ensure that user emergencies, platform alerts, and support requests are addressed.
- Identify and solve problems in cloud and hybrid environments, and create and implement new solutions from scratch.
- Mentor and develop less experienced engineers.
Requirements
- 3+ years’ hands-on experience working in a physical datacenter environment.
- Minimum of 2 days on-site in our data center; additional days may be required based on business needs.
- Based in the Raleigh-Durham (RDU) area, North Carolina
- Strong experience with VMware (vSphere / ESXi) in a production setting.
- Experience administering enterprise storage platforms, preferably Pure Storage and/or NetApp.
- Familiarity with server hardware (Dell, HPE, or similar), RAID, BIOS/firmware management.
- Experience following change management and operational best practices.
- Knowledge of and experience with high availability and scalability.
- Experience with AWS foundations, including compute, networking, storage, and security.
- Experience architecting containerization solutions in cloud environments such as ECS or EKS, with a strong background in systems engineering, especially with tools like Docker and Kubernetes in Linux and containerized environments.
- Expertise in implementing content delivery solutions at the edge
- A thorough understanding of IAM roles, access management, DNS, load balancing, routing, firewalls, and monitoring tools for cloud and hybrid environments
- Experience deploying and supporting AWS network services, including VPC, Subnets, Route Tables, NACLs, Security Groups, TGW, GWLB, VPC Endpoints, Route53.
- Knowledge of AWS compute, data sources, security technologies, and services, including EC2, S3, IAM, ECS, EKS, Load Balancers, SCP, RAM, CloudWatch, CloudTrail, and WAF.
- Experience architecting and securing regulated enterprise-class cloud services with compliance frameworks like SOC2, NIST, and ISO
- Experience with logging and monitoring systems like Datadog and CloudWatch.
- Familiarity with large systems and the ability to diagnose them: how they work and operate at scale, including their edge cases, failure modes, and behaviors.
- Proficient in writing code/scripting in languages like
- Experience automating infrastructure with CDK, Terraform, and Ansible, as well as containerization using EKS or ECS.
- Ability to diagnose complex distributed systems problems, whether at the system, network, or code level.
Benefits
- Medical, dental and vision benefits: Affordable health care plans and company HSA contributions, starting on Day 1
- A 6% 401(k) match
- Competitive time off package with 20 days of Paid Time Off, 9 Company-Paid holidays, 2 paid floating holidays, 7 paid sick days, 2 Wellness days, and 1 Paid Volunteer Day; at 3 years of service PTO increases to 22 days, and at 5 years it increases to 25 days
- 12 weeks primary caregiver leave & 4 weeks secondary caregiver leave
- Accident, critical illness, and hospital indemnity insurance
- Pet insurance
- Legal assistance and identity theft insurance plans
- Life insurance 2x salary
- Access to the Calm app and the Employee Assistance Program
- $65/month remote work stipend for internet
- Culture and team-building activities
- Tuition assistance
- Career development opportunities
- Charitable contribution match up to $250 per year
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
VMware vSphere, ESXi, Pure Storage, NetApp, AWS, Docker, Kubernetes, Terraform, Ansible, CDK
Soft Skills
problem solving, mentoring, communication, team collaboration, incident response, change management, security hardening, operational best practices, resiliency, disaster recovery
Certifications
SOC2, NIST, ISO