Jobs Similar to Pre-training Engineer

Research Scientist - Small Language Models

Fastino 10 days ago

Global

Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap
Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics
Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories

Fastino is building the next generation of LLMs. Their team, boasting alumni from Google Research, Apple, Stanford, and Cambridge, is on a mission to develop specialized, efficient AI and has raised $25M through their seed round.

View details Similar jobs

ML Engineer - Small Language Models

Fastino 11 days ago

Design, build, and deploy the critical small language models that are foundational to Fastino’s product.
As an engineer, you will own the full lifecycle of our state of the art models, from prototyping and data analysis to deployment and monitoring.
Drive the data strategy to continuously improve model performance by analyzing distribution gaps and contributing to synthetic data pipelines.

Fastino is building the next generation of LLMs, with a team of alumni from Google Research, Apple, Stanford, and Cambridge. They have raised $25M through their seed round and are backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.

View details Similar jobs

Data Team Member

Poolside 29 days ago

Europe North America 7w PTO

Improve the quality of pretraining datasets by leveraging your previous experience, intuition and training experiments.
Focus on generating synthetic data at scale and determining the best strategies to leverage such data into training large models.
Closely collaborate with other teams like Pretraining, Postraining, Evals, and Product to define high-quality data needs.

Poolside aims to be the company that builds a world where AI will be the engine behind economically valuable work and scientific progress. They are a remote-first team across Europe and North America that values the quality of their systems.

View details Similar jobs

Staff Research Engineer, Pre-training Data

Reddit 27 days ago

$230,000–$322,000/yr

US

Define technical strategy & architecture for data curriculum pipelines powering next-gen foundation models.
Design & execute dynamic curriculum learning strategies, improving model stability & reasoning.
Engineer logic for serializing Reddit’s complex conversational trees into optimal training contexts.

Reddit is a community-driven platform where users submit, vote, and comment on what interests them. With over 100,000 active communities and 116 million daily active users, they foster open conversations and shared interests.

View details Similar jobs

Senior AI/ML Specialist Solutions Architect

NVIDIA 4 days ago

$225,000–$315,000/yr

US 20w maternity 12w paternity

Architect and optimize distributed training and inference systems for large-scale AI models
Design and deliver customer-focused solutions that maximize performance and business value
Lead the transition of ML pipelines from POC to scalable production systems

The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.

View details Similar jobs

Research Intern – Multimodal Foundation Model for Vision

Sony Corporation of America 7 days ago

$50–$50/hr

US Europe

Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLM), unified models, automatic model compression, optimization and deployement on cloud and edge.
Design or implement state-of-the-art techs on model compression, inference speedup, deployement on harwares, tool automation.
Contribute to library and tool development to support business; or Publish influential research in top-tier conferences and journals.

Sony Corporation of America is the U.S. headquarters of Sony Group Corporation, based in Tokyo, Japan. Sony creates and delivers more entertainment experiences to more people than anyone else on earth.

View details Similar jobs

Computer Vision Platform Engineer

GameChanger 1 day ago

$180,000–$200,000/yr

US Unlimited PTO 20w maternity 12w paternity

Work directly with CV researchers to understand their goals, review their code, and engineer it for reliability and performance at scale.
Profile and optimize performance-sensitive code across both training and real-time inference.
Identify patterns across research efforts and propose standardized, composable abstractions.

GameChanger believes in the life changing impact youth sports have on and off the field. By building the first and best place to experience the youth sports moments important to their community, they are helping families elevate the next generation through youth sports. They are a remote first, dynamic tech company based in New York City, and they are solving some of the biggest challenges in youth sports today.

View details Similar jobs

Principal Engineer

Turing 20 days ago

India

Build the technical roadmap given a business requirement and own the delivery of the same.
Develop and optimize LLM-based solutions : Lead the design, training, fine-tuning, and deployment of large language models, leveraging techniques like prompt engineering, retrieval-augmented generation (RAG), and agent-based architectures.
Codebase ownership : Maintain high-quality, efficient code in Python (using frameworks like LangChain/LangGraph) and SQL, focusing on reusable components, scalability, and performance best practices.

Turing, based in San Francisco, is a research accelerator for frontier AI labs, partnering with global enterprises to deploy advanced AI systems. They accelerate research with data, talent, and training pipelines and build proprietary intelligence systems, recognized among the world's top innovators.

View details Similar jobs

Remote System Software Engineer

Jobgether 10 days ago

Europe

Research, prototype, develop and optimize solutions, tools, and libraries.
Analyse, influence, and improve deep learning libraries and frameworks standards and APIs.
Collaborate with team members and other partners.

Jobgether is a platform that connects job seekers with companies. They leverage AI to match candidates with roles.

View details Similar jobs

Senior Machine Learning Engineer

Loopio 27 days ago

North America Europe Asia

Build and productionize LLM and NLP models across retrieval, summarization, classification, and generative tasks.
Design and implement scalable ML services and inference pipelines in Python using modern ML frameworks.
Translate complex NLP and LLM product requirements into structured engineering plans with clear milestones.

Loopio provides a workplace that recognizes the advantages of working flexibly, operating as a remote-first company. They have established hub regions around the world and foster a supportive culture with opportunities for connection.

View details Similar jobs

AI/ML Engineer

Fastino 11 days ago

Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.

Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge and has developed the GLiNER family of open source models. Fastino has raised $25M through seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.

View details Similar jobs

Senior Machine Learning Scientist

Turnitin 7 days ago

$112,125–$186,875/yr

US

Work with subject matter experts to curate, generate, and annotate data, and create optimal datasets.
Develop and tune Machine Learning models, following best practices to select datasets, architectures, and model parameters.
Write clean, efficient, and modular code, with automated tests and appropriate documentation.

Turnitin partners with educational institutions to promote honesty, consistency, and fairness across all subject areas and assessment types. They are a global organization with team members in over 35 countries that embraces diversity, respects local cultures, and has a remote-centric culture.

View details Similar jobs

Software Engineer, Video Generation

EnCharge AI 3 days ago

Global

Design and build scalable serving infrastructure for video generation models.
Build robust APIs and SDKs that enable customers and partners to integrate video generation into their products.
Develop compelling demo applications that showcase our platform's capabilities.

EnCharge AI is building the next generation AI platform with in-memory-computing architecture that delivers a 10x improvement in compute energy efficiency and performance for AI inference workloads. They are an experienced team of AI researchers, silicon & systems engineers, and architects backed by leading investors.

View details Similar jobs

Principal Machine Learning Scientist

Turnitin 29 days ago

$147,300–$245,000/yr

US

Research and develop Machine Learning models and optimize them for scaled production usage.
Work with colleagues to explore ongoing product issues and recommend innovative ML/AI based solutions.
Work with subject matter experts to curate and generate optimal datasets following responsible data collection and model maintenance practices.

Turnitin is a recognized innovator in the global education space, partnering with educational institutions to promote honesty, consistency, and fairness across all subject areas and assessment types. They are a global organization with team members in over 35 countries, offering a remote-first culture which empowers team members to work with purpose and accountability.

View details Similar jobs

Senior, ML Engineer - Neural Rendering

Torc 8 days ago

$160,800–$212,300/yr

Canada US

Implement the latest research advances in Neural Rendering and generative models.
Translate cutting edge solution in the domain of autonomous driving for high-quality Camera, LiDAR and Radar sensor simulations.
Design, implement, test and deploy shippable production quality software starting from early prototypes using disciplined software development processes.

Torc is dedicated to transforming travel, freight, and business through autonomous vehicle technology. As a part of the Daimler family since 2007, they're focused on creating software for automated trucks, fostering a collaborative, energetic, and team-focused culture.

View details Similar jobs

Principal Software Engineer, AI/ML

Jobgether 7 days ago

Canada

Design and build advanced machine learning models for generative tasks.
Optimize models for performance enhancements and scalability.
Preprocess and manage large datasets for model training.

Jobgether is a platform that connects job seekers with companies. They use an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements.

View details Similar jobs

AI Development Engineer - DevX Platform

Google Chrome 23 hours ago

Global

Design and build robust backend services and microservices that power the DevX platform ecosystem.
Integrate Large Language Models (LLMs) and custom AI models to enable features like semantic code search, automated refactoring, and natural language infrastructure provisioning.
Act as a technical liaison and co-developer with our India-based engineering team, participating in daily stand-ups and code reviews to ensure architectural alignment.

They are developing the DevX platform, a next-generation engineering platform designed to accelerate time-to-market and improve code quality through intelligence. The company seems to be focused on developer tools and AI-driven solutions to enhance the software development lifecycle.

View details Similar jobs

Senior Software Engineer, Machine Learning (EST or EMEA)

AssemblyAI 6 days ago

$195,000–$225,000/yr

EMEA

Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability

AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.

View details Similar jobs

Senior Data Scientist

Exadel 24 days ago

Europe Asia

Collaborate with engineers, data scientists, and business analysts to understand requirements, refine models, and integrate LLMs into AI solutions
Development and implementation of Deep learning algorithms for AI solutions
Preprocess raw data, including text normalization, tokenization, and other techniques, to make it suitable for use with NLP models

Exadel is an AI-first global tech company with 25+ years of engineering leadership. They have 2,000+ team members, and 500+ active projects powering Fortune 500 clients valuing open dialogue, creative freedom, and mentorship.

View details Similar jobs

Applied AI Engineer

Stealth Co. 14 days ago

US

Build, maintain, and scale document ingestion + processing pipelines.
Integrate and productionize LLM-powered workflows.
Improve accuracy, reliability, and cost/performance of models and pipelines.

They are building the AI-native operating system for litigation. Their platform turns chaos into knowledge graphs to provide a lasting edge in high-stakes litigation.

View details Similar jobs

Source Job