Senior Data Scientist, LLM at Xometry
Brazil | Full-time | Data Science and Engineering | Posted 3 months ago
About the Role
<div class="content-intro"><p><span style="font-size: 10pt; font-family: arial, helvetica, sans-serif;">Xometry (NASDAQ: XMTR) powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry’s digital marketplace gives manufacturers the critical resources they need to grow their business while also making it easy for buyers at Fortune 1000 companies to tap into global manufacturing capacity.</span></p></div><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Xometry is seeking a <strong>Senior Data Scientist</strong> to join our Generative AI team. The candidate will focus on training and fine-tuning Visual Language Models (VLMs) for multimodal document understanding. The ideal candidate will leverage their expertise in machine learning and computer vision to advance Xometry's capabilities in processing and extracting structured data from complex documents and images. This is a 1-year contract. </span></p>
<p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><strong>Responsibilities:</strong></span></p>
<ul>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Develop, fine-tune, and evaluate Visual Language Models (VLMs) to enhance document understanding, focusing on multimodal data such as text, images, and technical drawings.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Design and implement data preparation, cleaning, and augmentation processes tailored to multimodal model training, ensuring high-quality data pipelines for VLMs.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Leverage transfer learning and pre-trained models to accelerate model development and optimize performance on Xometry’s specific data.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Use cloud resources (e.g., Amazon Web Services) to scale training and fine-tuning processes for VLMs efficiently.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Collaborate with data engineering and machine learning operations (MLOps) teams to deploy VLMs into production and monitor their performance.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Interpret model outputs and improve model accuracy and robustness by applying data analysis and visualization tools (such as Python, Jupyter Notebooks, and SQL).</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Experiment with and implement state-of-the-art model architectures, continuously optimizing VLM performance in a fast-paced, iterative environment.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Work within a team-oriented setting, participating in peer reviews, sharing insights, and contributing to an environment of continuous learning and improvement.</span></li>
</ul>
<p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><strong>Qualifications:</strong></span></p>
<ul>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">A bachelor’s degree is required; an advanced degree (M.S. or PhD) in computer science, data science, machine learning, or a related field is highly preferred.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">5+ years of experience in data science and machine learning, with expertise in Visual Language Models or multimodal machine learning.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Strong experience with machine learning libraries and frameworks such as PyTorch, TensorFlow, or Hugging Face.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Proficiency in Python, including libraries like pandas, numpy, and scikit-learn.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Solid understanding of deep learning techniques and experience with transfer learning, fine-tuning, and model evaluation.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Experience with cloud platforms (e.g., AWS SageMaker) for model training and deployment.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Familiarity with data processing and visualization tools (SQL, Jupyter Notebooks, Looker, etc.) and basic database knowledge (e.g., Snowflake, MongoDB).</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Excellent analytical and problem-solving skills, with a strong ability to work in an environment that values teamwork, innovation, and continuous learning.</span></li>
<li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Familiarity with computer vision tasks and frameworks, as well as experience with multimodal data, is a plus.</span></li>
</ul>
<p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">#LI-Remote</span></p><div class="content-conclusion"><p><span style="font-size: 10pt; font-family: arial, helvetica, sans-serif;">Xometry is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.</span></p>
<p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">For US-based roles: Xometry participates in E-Verify and, after a job offer is accepted, will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.</span></p></div>