- Home
- Jobs
- Engineering
- Machine Learning Engineer

Machine Learning Engineer at Abaka AI
Mountain View, CAFull-timeEngineeringPosted about 1 month ago
Apply with PipelineAbout the Role
<div data-page-id="YfRYdcycyo6ib9x3qrFloPeYgdf" data-lark-html-role="root" data-docx-has-block-data="false">
<div class="ace-line ace-line old-record-id-X162dT0rVoIu0yxrJd0lQRhZgSa"><strong>About Abaka AI</strong></div>
<div class="ace-line ace-line old-record-id-X162dT0rVoIu0yxrJd0lQRhZgSa"> </div>
<div class="ace-line ace-line old-record-id-Pq5edBWZLooX9OxnC30l4WfcgFH">Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Generative AI, Embodied AI, and Automotive AI rely on us to power their data pipelines. With our headquarters in Silicon Valley—and teams in Paris, Singapore, and Tokyo—we support global partners with fast, reliable, and scalable data solutions.</div>
<div class="ace-line ace-line old-record-id-U4TldRjkwozN00xmh33lgxX3gih">Our offerings include a diverse catalog of off-the-shelf datasets (image, video, multimodal, reasoning, 3D, and beyond) as well as comprehensive data collection and annotation services. Whether teams need raw data, curated datasets, or full-cycle data engineering, Abaka AI provides the foundation for building high-performance AI systems.</div>
<div class="ace-line ace-line old-record-id-Ea33day3aoYbSwxZc8HlKgRdgde"> </div>
<div class="ace-line ace-line old-record-id-Ea33day3aoYbSwxZc8HlKgRdgde"> </div>
<div class="ace-line ace-line old-record-id-PZdYdqdPVot88DxfNSNloh0SgMc"><strong>About the Role</strong></div>
<div class="ace-line ace-line old-record-id-PZdYdqdPVot88DxfNSNloh0SgMc"> </div>
<div class="ace-line ace-line old-record-id-ItrzdmzQJos0X5xG72xlyyQRgpg">We’re hiring our first Machine Learning Engineer in the United States, a foundational role that will shape how Abaka builds, trains, and optimizes multimodal AI systems. You will own the design and development of scalable training pipelines, work directly with our data engineering and research teams, and help drive the technical roadmap for model development across multiple modalities.</div>
<div class="ace-line ace-line old-record-id-DPJjd78gjoSiryxVPsVlXFPEgrb">As an early member of the engineering team, you will influence core decisions around model training strategy, experimentation frameworks, distributed infrastructure, and internal best practices. Your work will directly impact the performance of frontier models trained on Abaka datasets and will help elevate the technical bar for our clients and partners.</div>
<div class="ace-line ace-line old-record-id-FDoldlK4woxtC4x6VSCldp3agof">If you thrive in high-ownership environments and want to shape the machine learning foundation of a fast-moving AI company, this role offers an opportunity to make an immediate and lasting impact.</div>
<div class="ace-line ace-line old-record-id-IIisduCMholhh3x7uRNlmOJcgwg"> </div>
<div class="ace-line ace-line old-record-id-IIisduCMholhh3x7uRNlmOJcgwg"> </div>
<h4 class="heading-4 ace-line old-record-id-R2xsd4GPcoyWljx1pfsljSciglg"><strong>Responsibilities</strong></h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-Oqoadw4FBollhBxBfqblubOigRb" data-list="bullet">
<div>Design, build, and optimize scalable machine learning pipelines for multimodal model training, fine-tuning, and evaluation across text, image, audio, video, and 3D data.</div>
</li>
<li class="ace-line ace-line old-record-id-MJtqdXK29oKUI1xC2ZKlqvK2gjg" data-list="bullet">
<div>Work closely with data engineering and research teams to develop efficient data workflows, including collection, preprocessing, annotation, versioning, and model integration.</div>
</li>
<li class="ace-line ace-line old-record-id-L0q9dkOGwowtZqxZDtolp8JOg5d" data-list="bullet">
<div>Implement and refine training strategies for large-scale AI systems, including vision, video, and diffusion models, ensuring reproducibility, efficiency, and strong model performance.</div>
</li>
<li class="ace-line ace-line old-record-id-ZAVGdSLpEoq0suxSuChlLtF6gNW" data-list="bullet">
<div>Develop tools and automation frameworks that accelerate model experimentation, hyperparameter tuning, and deployment.</div>
</li>
<li class="ace-line ace-line old-record-id-W6l2d7xuAoO8t3xEnA5l0VzNgNf" data-list="bullet">
<div>Identify and address performance bottlenecks in data or training pipelines to improve throughput, stability, and resource utilization.</div>
</li>
<li class="ace-line ace-line old-record-id-WtJsdGTFHoeNjpxyA8Rl3a6Dg9b" data-list="bullet">
<div>Collaborate with product and infrastructure teams to ensure smooth integration of model outputs into both internal and client-facing applications.</div>
</li>
<li class="ace-line ace-line old-record-id-IfFYd0rT7ozXMNxoEEzl0c5kgyh" data-list="bullet">
<div>Support internal best practices for model governance, experiment tracking, and documentation to maintain high engineering standards and reproducibility.</div>
</li>
</ul>
<div class="ace-line ace-line old-record-id-UVXjducHroiw1txGeaold6QEgGh"> </div>
<h4 class="heading-4 ace-line old-record-id-LP7NdXB3loz3FTxxt0Jl94qRgTf"><strong>Qualifications</strong></h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-FCJJdmfwXo74VbxQx2mlUhZFgbN" data-list="bullet">
<div>Strong academic background in computer science, artificial intelligence, machine learning, or related fields. Master’s degree or Ph.D. is preferred.</div>
</li>
<li class="ace-line ace-line old-record-id-VD6IdtMT5oFkmGxrcENlseEagIh" data-list="bullet">
<div>3+ years of experience in applied machine learning or ML engineering, with a demonstrated ability to deliver production-ready models or pipelines.</div>
</li>
<li class="ace-line ace-line old-record-id-Ijd3dXn8hoNN0Pxona8lSbtCgrg" data-list="bullet">
<div>Proficient in Python and ML frameworks such as PyTorch, TensorFlow, or JAX, with hands-on experience in large-scale distributed training and inference systems.</div>
</li>
<li class="ace-line ace-line old-record-id-RNeCdgcCKoQ8cRxKvWulqmK8gCf" data-list="bullet">
<div>Familiarity with multimodal data processing (e.g., text-image pairing, video understanding, speech-audio modeling) and dataset optimization for model training.</div>
</li>
<li class="ace-line ace-line old-record-id-RWScdqUxyodZIwxwaCulK0digpe" data-list="bullet">
<div>Solid understanding of ML system design, including feature pipelines, data loaders, model serving, and evaluation frameworks.</div>
</li>
<li class="ace-line ace-line old-record-id-P1S5d73ZqoByebxOcrIlZwAugXe" data-list="bullet">
<div>Experience with modern infrastructure tools such as Kubernetes, Ray, Airflow, or MLflow, along with cloud-based training environments (AWS, GCP, Azure).</div>
</li>
<li class="ace-line ace-line old-record-id-WxHpd6AxzoVgM2xhkxjlLh6NgHb" data-list="bullet">
<div>Excellent communication and collaboration skills, capable of working effectively across engineering, research, and product teams to accomplish shared goals.</div>
</li>
<li class="ace-line ace-line old-record-id-O6WsdIPvyolJL2xT70TlKO2IgBh" data-list="bullet">
<div>Self-driven and adaptable, comfortable operating in a fast-paced startup environment, and able to demonstrate strong ownership and urgency in execution.</div>
</li>
</ul>
<div class="ace-line ace-line old-record-id-YGPOd3AHvoJI3lxXZddljWI2gr4"> </div>
<h4 class="heading-4 ace-line old-record-id-UKYSd961qoRTS7x7wCTl75CUgVc"><strong>Compensation & Benefits</strong></h4>
<div class="ace-line ace-line old-record-id-AmJhdzqPlosD5RxvkP1liyYpglc">The base salary range for this position is $175,000 - $275,000 USD annually.</div>
<div class="ace-line ace-line old-record-id-COsbdLJZxovIyzxyQcylwbCHgFb">Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work at Abaka AI. This role is eligible for equity, as well as a comprehensive benefits package (health, dental, vision, PTO, flexible work schedule).</div>
<div class="ace-line ace-line old-record-id-CJ92dQRGToFA1sxmmpolS1xxgve"> </div>
</div>
Related Roles
Data Operations Engineer
Abaka AI
Mountain View, CAResearch Partnerships Manager
Abaka AI
Mountain View, CAQuality Project Associate
Abaka AI
Mountain View, CATechnical Project Associate
Abaka AI
Mountain View, CAResearch Program Associate
Abaka AI
Mountain View, CAData Solutions Engineer
Abaka AI
Mountain View, CA