
Systems Integration Engineer, SW Focused Issue Triage & RCA
Agility Robotics
full-time
Location Type: Hybrid
Location: Fremont • United States
Salary
💰 $170,000 - $221,000 per year
About the role
- Serve as a lead voice in the triage process, providing the expertise required to classify complex failures as software, firmware, or system-level regressions.
- Effectively disposition identified issues to the software organization, providing clean tickets (logs, video clips, and analysis) that allow developers to act quickly.
- Manage and prioritize escalated SW-related investigations, making informed trade-offs to ensure that critical safety or performance risks are addressed first.
- Lead end-to-end investigations into novel failures using deep-dive log review, telemetry analysis, and video diagnostics to pinpoint bugs at the software/hardware interface or unexpected system behaviors.
- Develop and execute scripts or other data visualization tools to parse massive log sets and identify intermittent failure trends.
- Leverage structured methodologies such as 5-Whys or Fishbone to move from a surface-level symptom to a definitive root cause.
- Author and maintain "Gold Standard" RCA reports and troubleshooting guides that improve the technical autonomy of the broader triage team.
- Promote a culture of rigorous documentation and data-driven problem-solving.
- Create reusable diagnostic frameworks that automate the identification of known software issues, increasing the efficiency of the entire R&D loop.
Requirements
- 4+ years of experience in Systems Integration, Software-Hardware interface, or R&D, with a focus on software for complex mechatronic or autonomous systems.
- Proven experience using monitoring and observability platforms (e.g., Datadog, Splunk, or New Relic) to track system health and identify performance anomalies across a fleet.
- Experience interacting with cloud-based storage and databases (e.g., AWS S3, SQL, or NoSQL) to retrieve and manage large-scale telemetry and video datasets.
- Proven track record of navigating highly ambiguous software-hardware intersections to find definitive root causes.
- Experience creating technical documentation or bug reports intended for software engineering audiences.
- Preferred: Experience with HW/SW integration and design on HiL (hardware-in-the-loop).
- Mastery of log parsing via CLI and proficiency in using Python or similar scripting languages for data visualization and failure trend analysis.
- Familiarity with database environments, specifically regarding data retrieval and log management.
- Experience correlating video and/or HW symptoms with system telemetry to identify physical manifestations of software bugs.
- Strong understanding of software stacks in robotics, including communication protocols (e.g., EtherCAT, CAN) and how they manifest in system logs.
- Preferred: Experience with characterizing or troubleshooting HW/SW interactions such as cameras, encoders, IMUs, or other sensors.
Benefits
- 401(k) Plan: Includes a 6% company match.
- Equity: Company stock options.
- Insurance Coverage: 100% company-paid medical, dental, vision, and short/long-term disability insurance for employees.
- Benefit Start Date: Eligible for benefits on your first day of employment.
- Well-Being Support: Employee Assistance Program (EAP).
- Time Off: Exempt employees receive flexible, unlimited PTO and 10 company holidays, including a winter shutdown. Non-exempt employees receive 10 vacation days, paid sick leave, and 10 company holidays, including a winter shutdown, each year.
- On-Site Perks: Catered lunches four times a week and a variety of healthy snacks and refreshments at our Salem and Pittsburgh locations.
- Parental Leave: Generous paid parental leave programs.
- Work Environment: A culture that supports flexible work arrangements.
- Growth Opportunities: Professional development and tuition reimbursement programs.
- Relocation Assistance: Provided for eligible roles.
- Annual Discretionary Bonus: Provided for eligible roles.
Applicant Tracking System Keywords
Hard Skills & Tools
log parsing, Python, data visualization, telemetry analysis, 5-Whys methodology, Fishbone methodology, RCA reporting, bug reporting, HW/SW integration, monitoring platforms
Soft Skills
problem-solving, documentation, communication, leadership, prioritization, analytical thinking, collaboration, critical thinking, attention to detail, adaptability