- Home
- Jobs
- AI & Machine Learning
- World Model Research Scientist- Physical AI

World Model Research Scientist- Physical AI at Kodiak
Mountain View, CAFull-timeAI & Machine LearningPosted about 1 month ago
Apply with PipelineAbout the Role
<div class="content-intro"><p>Kodiak Robotics, Inc. was founded in 2018 and has become a leader in autonomous ground transportation committed to a safer and more efficient future for all. The company has developed an artificial intelligence (AI) powered technology stack purpose-built for commercial trucking and the public sector. The company delivers freight daily for its customers across the southern United States using its autonomous technology. In 2024, Kodiak became the first known company to publicly announce delivering a driverless semi-truck to a customer. Kodiak is also leveraging its commercial self-driving software to develop, test and deploy autonomous capabilities for the U.S. Department of Defense.</p></div><div><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Kodiak is building AI that doesn't just perceive the world, it learns how the physics of the world works. We are developing large-scale generative world models that learn to predict realistic, physically consistent futures from real-world sensor data. This capability serves as the foundation for scalable closed-loop training, validation, and long-tail scenario generation, and is distilled into the onboard models that drive our autonomous trucks. We are looking for a research scientist to lead the design and development of world models capable of generating multi-sensor, multi-view, temporally coherent driving scenarios conditioned on actions, 3D scene context, and text.</span></div>
<div> </div>
<div><strong><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">In this role, you will:</span></strong></div>
<ul>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Design and train generative world models that synthesize realistic multi-camera video and LiDAR conditioned on ego trajectories, 3D scene context, and text</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Research and implement conditional diffusion architectures for driving, including spatiotemporal attention, latent space design, and action-conditioned generation</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Develop techniques for multi-view geometric consistency in generated outputs, drawing on neural rendering, cross-view attention, and 3D-aware generative approaches</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Build methods for joint multimodal generation that maintain cross-sensor consistency between camera, LiDAR, and radar outputs</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Design evaluation frameworks that measure world model quality beyond pixel-level metrics, including scenario fidelity and autoregressive stability</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Scale training pipelines to learn from thousands of hours of real-world driving data across multiple sensor modalities</span></li>
</ul>
<div><strong><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">What you'll bring:</span></strong></div>
<ul>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">PhD in Computer Science, AI, Robotics, or a related field, with a focus on generative modeling, neural rendering, or video synthesis</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Strong publication record or demonstrated research contributions in diffusion models, video generation, neural radiance fields, 3D-aware generative models, or world models</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Experience with neural rendering and view synthesis and an understanding of multi-view geometric consistency</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Proficiency working with multimodal sensor data (camera, LiDAR, radar) and familiarity with 3D representations such as BEV grids, voxel fields, or tri-planes</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Strong implementation skills in Python and PyTorch, with experience training large generative models at scale using distributed training</span></li>
<li style="font-family: helvetica, arial, sans-serif; font-size: 12pt;"><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">Passion for building AI that understands and predicts the physical world to enable safe autonomous driving</span></li>
</ul>
<p><strong><span style="font-family: helvetica, arial, sans-serif; font-size: 12pt;">What We Offer:</span></strong></p>
<ul>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Competitive compensation package including equity and annual bonuses</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Excellent Medical, Dental, and Vision plans through Kaiser Permanente, Cigna, and MetLife (including a medical plan with infertility benefits)</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">MetLife Legal Services, Identity & Fraud Protection, Hospital Indemnity Insurance, Accident Insurance, & Critical Illness Insurance</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Flexible PTO, 10 paid holidays, and generous parental leave policies</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Our office is centrally located in Mountain View, CA</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Office perks: dog-friendly, free catered lunch, a fully stocked kitchen, and free EV charging</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Long Term Disability, Short Term Disability, Life Insurance</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Wellbeing Benefits - Headspace through Cigna, Calm through Kaiser, One Medical, Gympass, Spring Health through Cigna, Rula (mental health navigation) </span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Fidelity 401(k)</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Commuter, FSA, Dependent Care FSA, HSA</span></li>
<li style="font-size: 12pt;"><span style="font-size: 12pt;">Various incentive programs (referral bonuses, patent bonuses, etc.)</span></li>
</ul><div class="content-pay-transparency"><div class="pay-input"><div class="description"><p>The pay range listed below reflects the base salary <strong data-stringify-type="bold">in our SF/Silicon Valley location, </strong>across several internal levels. Actual starting pay will be based on job-related factors including: work location, experience, relevant training, education, skill level and performance during interview. Total compensation at Kodiak includes base pay, equity, bonus and a competitive benefits package</p></div><div class="title">California Pay Range</div><div class="pay-range"><span>$190,000</span><span class="divider">—</span><span>$250,000 USD</span></div></div></div><div class="content-conclusion"><div> </div>
<div>At Kodiak, we strive to build a diverse community working towards our common company goals in a safe and collaborative environment where harassment of any kind is strictly prohibited. Kodiak is committed to equal opportunity employment regardless of race, ethnicity, religion, gender identity, sexual orientation, age, disability, or veteran status, or any other basis protected by applicable law.</div>
<div> </div>
<div>In alignment with its business operations, Kodiak adheres to all relevant statutes, regulations, and administrative prerequisites. Accordingly, roles that carry more sensitive requirements may be limited to candidates that can satisfy additional scrutiny and eligibility for such positions may hinge on verification of a candidate’s residence, U.S. person status, and/or citizenship status. Should the position require, and Kodiak determines that a candidate’s residence, U.S. person status, and/or citizenship status necessitate an export license, bar the candidate from the position, or otherwise fall under national security-related restrictions, Kodiak will consider the candidate for alternative positions unaffected by such restrictions, under terms and conditions set forth at Kodiak’s sole discretion, or, as an alternative, opt not to proceed with the candidate’s application. If applicable, Kodiak may provide visa sponsorship for eligible candidates.</div>
<div> </div>
<div><em>We use a third-party AI tool (Endorsed) to assist in the initial screening of applications. As part of the evaluation process, we provide Endorsed with job requirements and candidate-submitted applications. Final hiring decisions are made by our human recruitment team, and no automated system makes the ultimate decision regarding hiring. Certain features of the platform may qualify it as an Automated Employment Decision Tool (AEDT) under applicable regulations. We began using Endorsed on January 1, 2026. You can review the independent bias audit report covering our use of Endorsed [here](</em><em><a href="https://endorsed.com/local-law-144" target="_blank" data-sk="tooltip_parent">https://endorsed.com/local-law-144</a></em><em>). By submitting your application, you acknowledge that your application may be processed by AI systems as part of the screening and selection process. If you have any questions or would like to request a separate review of your application, please contact </em><em><a href="mailto:[email protected]" target="_blank" data-sk="tooltip_parent">[email protected]</a></em><em> with "Separate Review Request" in the email subject line.</em></div>
<p> </p></div>
Related Roles
Staff Machine Learning Engineer - Deployment
Kodiak
San Francisco Bay AreaApplied AI Engineer - Flywheel Automation & Continuous Learning
Kodiak
Mountain View, CASenior Applied AI Engineer - Multimodal Transformers
Kodiak
San Francisco Bay AreaAutonomous Fleet Specialist
Kodiak
Lancaster, TXSenior Software Engineer, Planning ML Integration
Kodiak
Mountain View, CAOperations Specialist
Kodiak
Lancaster, TX