Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap
Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics
Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
Fastino is building the next generation of LLMs. Their team, boasting alumni from Google Research, Apple, Stanford, and Cambridge, is on a mission to develop specialized, efficient AI and has raised $25M through their seed round.
Design, build, and deploy the critical small language models that are foundational to Fastino’s product.
As an engineer, you will own the full lifecycle of our state of the art models, from prototyping and data analysis to deployment and monitoring.
Drive the data strategy to continuously improve model performance by analyzing distribution gaps and contributing to synthetic data pipelines.
Fastino is building the next generation of LLMs, with a team of alumni from Google Research, Apple, Stanford, and Cambridge. They have raised $25M through their seed round and are backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.
Improve the quality of pretraining datasets by leveraging your previous experience, intuition and training experiments.
Focus on generating synthetic data at scale and determining the best strategies to leverage such data into training large models.
Closely collaborate with other teams like Pretraining, Postraining, Evals, and Product to define high-quality data needs.
Poolside aims to be the company that builds a world where AI will be the engine behind economically valuable work and scientific progress. They are a remote-first team across Europe and North America that values the quality of their systems.
Engineer logic for serializing Reddit’s complex conversational trees into optimal training contexts.
Reddit is a community-driven platform where users submit, vote, and comment on what interests them. With over 100,000 active communities and 116 million daily active users, they foster open conversations and shared interests.
Architect and optimize distributed training and inference systems for large-scale AI models
Design and deliver customer-focused solutions that maximize performance and business value
Lead the transition of ML pipelines from POC to scalable production systems
The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.
Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLM), unified models, automatic model compression, optimization and deployement on cloud and edge.
Design or implement state-of-the-art techs on model compression, inference speedup, deployement on harwares, tool automation.
Contribute to library and tool development to support business; or Publish influential research in top-tier conferences and journals.
Sony Corporation of America is the U.S. headquarters of Sony Group Corporation, based in Tokyo, Japan. Sony creates and delivers more entertainment experiences to more people than anyone else on earth.
Work directly with CV researchers to understand their goals, review their code, and engineer it for reliability and performance at scale.
Profile and optimize performance-sensitive code across both training and real-time inference.
Identify patterns across research efforts and propose standardized, composable abstractions.
GameChanger believes in the life changing impact youth sports have on and off the field. By building the first and best place to experience the youth sports moments important to their community, they are helping families elevate the next generation through youth sports. They are a remote first, dynamic tech company based in New York City, and they are solving some of the biggest challenges in youth sports today.
Build the technical roadmap given a business requirement and own the delivery of the same.
Develop and optimize LLM-based solutions : Lead the design, training, fine-tuning, and deployment of large language models, leveraging techniques like prompt engineering, retrieval-augmented generation (RAG), and agent-based architectures.
Codebase ownership : Maintain high-quality, efficient code in Python (using frameworks like LangChain/LangGraph) and SQL, focusing on reusable components, scalability, and performance best practices.
Turing, based in San Francisco, is a research accelerator for frontier AI labs, partnering with global enterprises to deploy advanced AI systems. They accelerate research with data, talent, and training pipelines and build proprietary intelligence systems, recognized among the world's top innovators.
Build and productionize LLM and NLP models across retrieval, summarization, classification, and generative tasks.
Design and implement scalable ML services and inference pipelines in Python using modern ML frameworks.
Translate complex NLP and LLM product requirements into structured engineering plans with clear milestones.
Loopio provides a workplace that recognizes the advantages of working flexibly, operating as a remote-first company. They have established hub regions around the world and foster a supportive culture with opportunities for connection.
Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.
Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge and has developed the GLiNER family of open source models. Fastino has raised $25M through seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.
Work with subject matter experts to curate, generate, and annotate data, and create optimal datasets.
Develop and tune Machine Learning models, following best practices to select datasets, architectures, and model parameters.
Write clean, efficient, and modular code, with automated tests and appropriate documentation.
Turnitin partners with educational institutions to promote honesty, consistency, and fairness across all subject areas and assessment types. They are a global organization with team members in over 35 countries that embraces diversity, respects local cultures, and has a remote-centric culture.
Design and build scalable serving infrastructure for video generation models.
Build robust APIs and SDKs that enable customers and partners to integrate video generation into their products.
Develop compelling demo applications that showcase our platform's capabilities.
EnCharge AI is building the next generation AI platform with in-memory-computing architecture that delivers a 10x improvement in compute energy efficiency and performance for AI inference workloads. They are an experienced team of AI researchers, silicon & systems engineers, and architects backed by leading investors.
Research and develop Machine Learning models and optimize them for scaled production usage.
Work with colleagues to explore ongoing product issues and recommend innovative ML/AI based solutions.
Work with subject matter experts to curate and generate optimal datasets following responsible data collection and model maintenance practices.
Turnitin is a recognized innovator in the global education space, partnering with educational institutions to promote honesty, consistency, and fairness across all subject areas and assessment types. They are a global organization with team members in over 35 countries, offering a remote-first culture which empowers team members to work with purpose and accountability.
Implement the latest research advances in Neural Rendering and generative models.
Translate cutting edge solution in the domain of autonomous driving for high-quality Camera, LiDAR and Radar sensor simulations.
Design, implement, test and deploy shippable production quality software starting from early prototypes using disciplined software development processes.
Torc is dedicated to transforming travel, freight, and business through autonomous vehicle technology. As a part of the Daimler family since 2007, they're focused on creating software for automated trucks, fostering a collaborative, energetic, and team-focused culture.
Design and build advanced machine learning models for generative tasks.
Optimize models for performance enhancements and scalability.
Preprocess and manage large datasets for model training.
Jobgether is a platform that connects job seekers with companies. They use an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements.
Design and build robust backend services and microservices that power the DevX platform ecosystem.
Integrate Large Language Models (LLMs) and custom AI models to enable features like semantic code search, automated refactoring, and natural language infrastructure provisioning.
Act as a technical liaison and co-developer with our India-based engineering team, participating in daily stand-ups and code reviews to ensure architectural alignment.
They are developing the DevX platform, a next-generation engineering platform designed to accelerate time-to-market and improve code quality through intelligence. The company seems to be focused on developer tools and AI-driven solutions to enhance the software development lifecycle.
Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability
AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.
Collaborate with engineers, data scientists, and business analysts to understand requirements, refine models, and integrate LLMs into AI solutions
Development and implementation of Deep learning algorithms for AI solutions
Preprocess raw data, including text normalization, tokenization, and other techniques, to make it suitable for use with NLP models
Exadel is an AI-first global tech company with 25+ years of engineering leadership. They have 2,000+ team members, and 500+ active projects powering Fortune 500 clients valuing open dialogue, creative freedom, and mentorship.
Build, maintain, and scale document ingestion + processing pipelines.
Integrate and productionize LLM-powered workflows.
Improve accuracy, reliability, and cost/performance of models and pipelines.
They are building the AI-native operating system for litigation. Their platform turns chaos into knowledge graphs to provide a lasting edge in high-stakes litigation.