
Senior Robot Software Triage & RCA Engineer at Agility Robotics
Hybrid - Fremont, CA · Full-time · Hardware · Posted 22 days ago
About the Role
<div class="content-intro"><p>Agility’s commercially deployed humanoids operate alongside teams in warehouses, manufacturing facilities, and distribution centers—tackling physically demanding and repetitive tasks while enabling workers to focus on higher-value work. With industry-leading safety standards and years of proven deployment data, we're pioneering a new era of automation that enhances human potential.</p></div><p style="text-align: left;"><span style="font-size: 10pt;"><span style="font-size: 12pt;"><strong>Role Overview:</strong></span><strong><br></strong>We are seeking a Senior Robot Software Triage & RCA Engineer specialized in Software Issue Triage and Root Cause Analysis (RCA). Your main function is to conduct remote triage, utilizing log parsing, telemetry data, and video analysis, to identify failures with software root causes and ensure they are accurately dispositioned to the appropriate SW development teams. You will conduct deep-dive root cause analysis on novel failures occurring at the hardware-software interface, while simultaneously architecting the diagnostic scripts and tools required to streamline these investigations.</span></p>
<p style="text-align: left;"><span style="font-size: 10pt;">In this role you will move beyond basic data review to navigate ambiguous failure modes, develop automated diagnostic scripts, and create the technical documentation that drives software reliability across the fleet.</span></p>
<p style="text-align: left;"><span style="font-size: 10pt;"><strong>Issue Triage</strong></span></p>
<ul style="text-align: left;">
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Serve as a lead voice in the triage process, providing the expertise required to classify complex failures specifically as software, firmware, or system-level regressions.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Effectively disposition identified issues to the software organization, providing clean tickets (logs, video clips, and analysis) that allow developers to act quickly.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Manage and prioritize escalated SW-related investigations, making informed trade-offs to ensure that critical safety or performance risks are addressed first.</span></li>
</ul>
<p style="text-align: left;"><span style="font-size: 10pt;"><strong>Root Cause Analysis</strong></span></p>
<ul style="text-align: left;">
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Lead end-to-end investigations into novel failures using deep-dive log review, telemetry analysis, and video diagnostics to pinpoint bugs at the software/hardware interface or unexpected system behaviors.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Develop and execute scripts or other data visualization tools to parse massive log sets and identify intermittent failure trends.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Leverage structured methodologies such as 5-Whys or Fishbone to move from a surface-level symptom to a definitive root cause. </span></li>
</ul>
<p style="text-align: left;"><span style="font-size: 10pt;"><strong>Continuous Improvement</strong></span></p>
<ul style="text-align: left;">
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Author and maintain "Gold Standard" RCA reports and troubleshooting guides that improve the technical autonomy of the broader triage team.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Promote a culture of rigorous documentation and data-driven problem-solving.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Create reusable diagnostic frameworks that automate the identification of known software issues, increasing the efficiency of the entire R&D loop.</span></li>
</ul>
<p style="text-align: left;"><span style="font-size: 12pt;"><strong>Qualifications:</strong></span></p>
<p><span style="font-size: 10pt;"><strong>Experience:</strong></span></p>
<ul style="text-align: left;">
<li style="font-size: 10pt;"><span style="font-size: 10pt;">5+ years of experience in Systems Integration, Software-Hardware interface, or R&D with a focus on software for complex mechatronic or autonomous systems.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Proven experience using monitoring and observability platforms (e.g., Datadog, Splunk, or New Relic) to track system health and identify performance anomalies across a fleet.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Experience interacting with cloud-based storage and databases (e.g., AWS S3, SQL, or NoSQL) to retrieve and manage large-scale telemetry and video datasets.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Proven track record of navigating highly ambiguous software-hardware intersections to find definitive root causes.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Experience creating technical documentation or bug reports intended for software engineering audiences.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Preferred: Experience with HW/SW integration and design on HiL. </span></li>
</ul>
<p><span style="font-size: 10pt;"><strong>Technical Expertise:</strong></span></p>
<ul>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Mastery of log parsing via CLI and proficiency in using Python or similar scripting languages for data visualization and failure trend analysis.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Familiarity with database environments, specifically regarding data retrieval and log management.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Experience correlating video and/or HW symptoms with system telemetry to identify physical manifestations of software bugs.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Strong understanding of software stacks in robotics, including communication protocols (e.g., EtherCAT, CAN) and how they manifest in system logs.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Preferred: Experience with characterizing or troubleshooting HW/SW interactions such as cameras, encoders, IMUs, or other sensors. </span></li>
</ul>
<p><span style="font-size: 10pt;"><strong>Skills:</strong></span></p>
<ul style="text-align: left;">
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Ability to tackle ambiguous, unprecedented problems and create reusable, scalable solutions.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Capacity to operate independently on initiatives and proactively anticipate the needs for effective and efficient triage and RCA.</span></li>
<li style="font-size: 10pt;"><span style="font-size: 10pt;">Exceptional ability to synthesize complex telemetry and video data into clear, actionable insights for software engineering stakeholders.</span></li>
</ul>
<p><span style="font-size: 10pt;"><strong>Education:</strong></span></p>
<ul>
<li style="text-align: left; font-size: 10pt;"><span style="font-size: 10pt;">Bachelor’s or Master’s degree in Computer Science, Robotics, Electrical Engineering, or a related field.</span></li>
</ul>
<p>This is a hybrid position at our Fremont, CA office.</p>
<p><em>The final salary offered to a successful candidate will depend on several factors that may include but are not limited to: job-related knowledge, skills, and experience. Agility Robotics is a multi-state employer, and this salary range may not reflect positions based in other locations. These ranges may be modified in the future.</em></p><div class="content-pay-transparency"><div class="pay-input"><div class="title">Anticipated Base Salary Range</div><div class="pay-range"><span>$170,000</span><span class="divider">—</span><span>$221,000 USD</span></div></div></div><div class="content-conclusion"><p><strong><em>In addition to base pay, our competitive total rewards package consists of the following for full-time employees:</em></strong></p>
<ul>
<li><strong>401(k) Plan:</strong> Includes a 6% company match.</li>
<li><strong>Equity:</strong> Company stock options.</li>
<li><strong>Insurance Coverage:</strong> 100% company-paid medical, dental, vision, and short/long-term disability insurance for employees.</li>
<li><strong>Benefit Start Date:</strong> Eligible for benefits on your first day of employment.</li>
<li><strong>Well-Being Support:</strong> Employee Assistance Program (EAP).</li>
<li><strong>Time Off:</strong>
<ul>
<li><strong>Exempt Employees:</strong> Flexible, unlimited PTO and 12 company holidays, including a winter shutdown.</li>
<li><strong>Non-Exempt Employees:</strong> 10 vacation days, paid sick leave, and 12 company holidays, including a winter shutdown, annually.</li>
</ul>
</li>
<li><strong>On-Site Perks:</strong> Catered lunches four times a week and a variety of healthy snacks and refreshments at our Salem and Pittsburgh locations.</li>
<li><strong>Parental Leave:</strong> Generous paid parental leave programs.</li>
<li><strong>Work Environment:</strong> A culture that supports flexible work arrangements.</li>
<li><strong>Growth Opportunities:</strong> Professional development and tuition reimbursement programs.</li>
<li><strong>Relocation Assistance:</strong> Provided for eligible roles.</li>
<li><strong>Annual Discretionary Bonus:</strong> Provided for eligible roles.</li>
</ul>
<p><span style="font-weight: 400;">All of our roles are U.S.-based. Applicants must have current authorization to work in the United States.</span></p>
<p><span style="font-weight: 400;">Agility Robotics is committed to a work environment in which all individuals are treated with respect and dignity. Each individual has the right to work in a professional atmosphere that promotes equal employment opportunities and prohibits unlawful discriminatory practices, including harassment. Therefore, it is the policy of Agility Robotics to ensure equal employment opportunity without discrimination or harassment on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, age, disability, marital status, citizenship, national origin, genetic information, or any other characteristic protected by law. Agility Robotics prohibits any such discrimination or harassment.</span></p>
<p> </p>
<p><strong>Agility Robotics does not accept unsolicited referrals from third-party recruiting agencies. We prioritize direct applicants and encourage all qualified candidates to apply directly through our careers page. If you are represented by a third party, your application may not be considered. To ensure full consideration, please apply directly.</strong></p>
<p> </p>
<p>Apply Now: <a href="https://grnh.se/b444bbd04us" target="_blank">https://grnh.se/b444bbd04us</a></p></div>