Staff Systems Engineer – Cloud Operations, Support
Cadence Design Systems
Staff Systems Engineer focused on cloud operations and support at Cadence. The role manages high-performance computing clusters to keep the cloud infrastructure optimized.
Tech Stack
Tools & technologies: Cloud, Docker, Linux, OpenStack, Perl, Python
About the role
Key responsibilities & impact
- Supporting multiple geographical locations to serve user communities across sites in North America, Europe, and Asia.
- Focusing on improving customer productivity and committing to customer success.
- Driving the overall operational strategy for internal High-Performance Compute (HPC) clusters in Cadence cloud.
- Maintaining, enhancing, and monitoring these clusters, reporting on them, and improving their efficiency.
Requirements
What you’ll need
- 8+ years of technical experience architecting, managing, and improving an HPC environment running Linux.
- At least 3 years working in a global group, coordinating support, strategies, projects, and operations across multiple geographies with a team-oriented approach.
- Solid understanding and proven operational experience with HPC clusters, job submission/management technologies, cloud, and associated management tools.
- Proven experience working directly with engineering teams to collaboratively develop solutions that optimize their working environment (direct EDA experience desired).
- Proven experience in capacity and performance management: optimizing performance, ensuring adequate capacity, working with customers to optimize their workloads, and developing and maintaining key performance indicators.
- A proven process focus shown through documentation, change management, incident management, and problem-resolution activities.
- Extensive hands-on experience with Docker: image management, container orchestration, and troubleshooting.
- Deep expertise in Linux system administration (RHEL preferred), including networking, storage, and performance tuning.
- Familiarity with user authentication and integration using systems like LDAP or Active Directory.
- Hands-on GPU Cluster Management: Experience in configuration, installation, and optimization of GPU server clusters.
- Hands-on technical experience managing GPU VMs, including installing and configuring instances and other services on OpenStack.
- Automation & Monitoring: Develop and maintain automation scripts using languages like Python, Bash, or Perl to streamline system maintenance, deployment, and reporting.
- Strong problem-solving and communication skills with the ability to work in a multi-platform, cross-functional, and geographically distributed team.
Benefits
Comp & perks
- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Professional development
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
HPC environment, Linux, Docker, GPU Cluster Management, OpenStack, Python, Bash, Perl, performance management, capacity management
Soft Skills
problem-solving, communication, team-oriented approach, process focus, collaboration