© 2026 Pipeline. All rights reserved.

Senior ML/RL Engineer, Behavior Planning at Bot Auto

Houston, TX or San Francisco Bay Area | Full-time | Algorithm | Posted 14 days ago

About the Role

Company Introduction

At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a startup and the wisdom of seasoned experts, our team has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create groundbreaking solutions that propel the future of transportation. Join us and transform your ideas into reality.

Role Overview

We are seeking a Senior ML/RL Engineer to join our Algo team and drive the development of our unified behavioral architecture. In this role, you will help bridge the gap between simulation and the real world by developing a scalable policy framework that represents both our L4 ego-policy and a diverse population of simulated agents. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure our autonomous semi-trucks navigate highways with superhuman safety and precision.

Key Responsibilities

  • Behavioral Modeling: Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate our autonomous driving stack.
  • Safety-Constrained Learning: Lead the research and implementation of advanced RL algorithms to ensure safety metrics are treated as primary constraints in the learning process.
  • Reward & Objective Design: Collaborate with cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.
  • Scalable Training Pipelines: Contribute to the optimization of our large-scale, high-throughput training environments to enable rapid iteration on complex multi-agent scenarios.
  • Model Architecture: Advance our state-of-the-art neural architectures to improve spatial reasoning, long-horizon planning, and interaction modeling.
  • Cross-Team Collaboration: Work closely with Simulation and Planning teams to integrate research-grade models into production-quality, safety-critical software.

Required Qualifications

  • Professional RL Experience: Proven track record of training and deploying deep RL algorithms (e.g., PPO, SAC) for complex, real-world robotic or autonomous systems.
  • Technical Mastery: Expertise in Python and PyTorch; strong understanding of modern deep learning architectures and optimization techniques.
  • Academic Background: MS or PhD in Computer Science, Robotics, or a related quantitative field.
  • Scientific Intuition: Ability to diagnose and solve fundamental challenges in RL training, such as variance management and distribution shift.

Preferred Qualifications

  • Safe RL Specialization: Experience with constrained optimization or safety-critical learning frameworks.
  • Multi-Agent Systems: Background in MARL training stability, including self-play and decentralized execution strategies.
  • Autonomous Driving Domain: Familiarity with vehicle dynamics and behavior planning, particularly for long-haul highway environments.

Additional Information

  • Compensation: Competitive salary based on experience, with opportunities for performance bonuses and equity.
  • Benefits: Comprehensive health insurance, paid time off, and the opportunity to work at the forefront of the autonomous trucking industry.

Why Bot Auto?

We are a small, hyper-focused team on a mission to beat human cost-per-mile through technology. We recently completed the industry’s first fully humanless commercial truckload, proving that our vision is a reality. If you are passionate about AI, safety, and transforming logistics, we want to hear from you.

Related Roles

  • Intern, Software Engineer-Operation Platforms

    Bot Auto

    Houston, TX
  • Autonomy Algorithm Engineer, Planning & Prediction

    Bot Auto

    Houston, TX or San Francisco Bay Area
  • Algorithm Engineer, Autonomy Planning & Prediction

    Bot Auto

    Houston, TX or San Francisco Bay Area
  • Senior Software Engineer, Autonomous Traffic Control

    Bot Auto

    Houston, TX or San Francisco Bay Area
  • Senior Software Engineer, Security Engineering

    Bot Auto

    Houston, TX or San Francisco Bay Area
  • Senior Software Engineer, Workflow Platforms

    Bot Auto

    Houston, TX or San Francisco Bay Area