Own model serving: Design, build, and maintain low-latency, highly available serving stacks for in-house ML models, and integrate with LLM serving partners.
Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices.
Optimize at scale: Profile and tune throughput, memory, and cost; introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off.
Support the full operational lifecycle of both traditional machine learning systems and emerging generative-AI-driven applications.
Enable scalable training, evaluation, deployment, and monitoring for a wide range of ML and GenAI workloads.
Manage model upgrades, framework versions, regression testing, and maintenance tasks, and maintain performance across systems and solutions.
Achievers' employee recognition and rewards platform empowers organizations to build cultures where people feel seen and valued, every day. They're a team of passionate, thoughtful builders with more than 4.3 million users across 190 countries, who care deeply about their product, their customers, and each other.
Define and evolve the technical vision for AI and agentic systems across products.
Design orchestration, data, and serving patterns that handle global scale with reliability.
Collaborate with AI Research to turn prototypes into extensible, governed production frameworks.
KnowBe4 is a cybersecurity company that puts security first, empowering over 70,000 organizations worldwide to strengthen their security culture. They value radical transparency, extreme ownership, and continuous professional development in a welcoming workplace that encourages all employees to be themselves.
Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
Optimize serverless container workflows for AI workloads, ensuring performance, scalability, and seamless autoscaling.
Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals globally and collaborate with technology partners such as Intel, NVIDIA, Dell, and Equinix.
Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability
AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.
Design and build production-grade AI systems, including RAG pipelines, multi-step agents, and LLM-powered features.
Build comprehensive evaluation and observability frameworks to measure model accuracy, grounding, and quality drift.
Create production-quality Python services to wrap AI logic into secure microservices.
League, founded in 2014, is the leading healthcare consumer experience (CX) platform powered by AI, reaching over 63 million people globally. Payers, providers, and consumer health partners use League’s platform to deliver high-engagement healthcare solutions and improve health outcomes.
Scale the decision-making process for tools for the tvScientific AI team, from our workflows to our training infrastructure to our Kubernetes deployments
Improve the developer experience for the data science team
Upgrade our observability tooling
tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. They leverage massive data and cutting-edge science to automate and optimize TV advertising to drive business outcomes. An Idealab company, tvScientific was co-founded by executives with deep roots in programmatic advertising and digital media.
Architect and ship new backend capabilities that integrate AI-adjacent functionality into Kraken’s core systems.
Design distributed services that meet high standards for reliability, performance, and correctness.
Own end-to-end technical design, from protocol and service boundaries through production deployment.
Kraken is a mission-focused company rooted in crypto values. It aims to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has Krakenites in 70+ countries who speak over 50 languages.
You will design, build, and operate core systems that enable autonomous agents to function reliably in production.
You’ll build production-grade agentic workflows, retrieval and memory systems, multi-model execution, and tool-calling integrations that interact safely with enterprise systems.
You’ll explore new approaches, prototype quickly, and turn what works into durable production systems.
Kindo is an agent automation platform for DevOps and SecOps teams. They help organizations automate high-friction operational work using autonomous agents. Kindo is a small, highly technical team with strong customer traction and real enterprise revenue.
Lead the integration of modern, AI-generated applications with backend enterprise layers.
Design, document, and execute a robust CI/CD and release strategy.
Act as a high-EQ consultant to bridge the gap between our high-velocity delivery model and the client’s internal teams.
CompassX is a boutique business and technology consulting firm helping Fortune 500 and high-growth clients deliver their most strategic initiatives. With over 15 years of proven results, they've expanded across industries; they are a three-time winner of Consulting Magazine’s Best Boutique Firms to Work For.
Design and build scalable and high-performance data software solutions using Golang and Python.
Build and deploy Kubernetes-based systems to manage containerized applications in cloud-native environments.
Collaborate with cross-functional teams to understand and address customer needs, ensuring our systems evolve to meet future requirements.
Machinify is a healthcare intelligence company delivering value, transparency, and efficiency to health plan clients. They deploy a configurable, AI-powered platform and have best-in-class expertise in the payment continuum, serving over 85 health plans and 270 million lives.
Design and build the infrastructure layer powering AI agent systems in production
Develop high-performance Rust services that handle model inference, orchestration, and execution
Architect scalable systems capable of supporting millions of users and high request throughput
Kraken is a mission-focused company rooted in crypto values, aiming to accelerate the global adoption of crypto so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has employees in 70+ countries and is committed to industry-leading security, crypto education, and client support.
Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.
Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge and has developed the GLiNER family of open-source models. Fastino has raised $25M through a seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.
Lead the architecture and evolution of Cresta’s AI Agent integration framework across CCaaS platforms.
Design scalable, extensible backend systems that manage real-time conversation state, session lifecycle, and context propagation.
Establish architectural patterns for AI-to-human handoff that ensure durability, reliability, and seamless customer experience.
Cresta aims to transform customer conversations into a competitive edge. They achieve this by leveraging AI and human intelligence to enhance contact centers, automate processes, and empower team members.
Help build, implement, and deploy agentic platforms within client environments.
Focus on building and deploying scalable AI-driven solutions.
Work closely with client stakeholders, technical leaders, and delivery teams to build and roll out agentic capabilities.
Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. With a team of 500 experts in 20+ countries, they are trusted by leading enterprises including Lululemon, Puma, and Starbucks.
Design, develop, and maintain high-performance, scalable, and secure backend services, primarily using Python and frameworks like FastAPI
Translate ambiguous business and technical requirements into concrete software designs and actionable tasks for cross-functional teams
Operate and maintain production applications at scale, ensuring high availability, performance, and reliability
SmartAsset is an online destination for consumer-focused financial information and advice, whose mission is helping people make smart financial decisions, reaching over an estimated 59 million people each month. Valued at over $1 billion, SmartAsset has earned recognition on the Inc. 5000 and Deloitte Technology Fast 500 lists.
You will define, build, and evolve foundational systems that enable autonomous agents to operate reliably in production.
You’ll explore new approaches, prototype quickly, and turn what works into durable platform foundations.
You’ll identify high-leverage architectural improvements, abstractions, and guardrails that expand what the platform can do while keeping it reliable, secure, observable, and maintainable under real-world conditions.
Kindo is an agent automation platform for DevOps and SecOps teams, helping organizations automate high-friction operational work using autonomous agents. They are a small, highly technical team with strong customer traction and real enterprise revenue, where engineers have direct ownership over critical systems.
Designing and developing the core platform that enables the efficient deployment, scaling, and management of LLMs and multi-agent systems.
Building specialized infrastructure to support long-running agentic workflows, including state management, tool-calling interfaces, and complex reasoning loops.
Scaling inference for LLMs to handle global demand while optimizing for latency, throughput, and cost.
Clarity AI is a global tech company founded in 2017 with a unique mission: bringing societal impact to markets. They leverage AI and machine learning technologies to provide top international investors, governments, companies, and consumers with the right data, methodologies, and tools to make more informed decisions. They are now a team of more than 300 highly passionate and curious individuals from all over the world.
Design, develop, and maintain scalable and robust backend architectures for Cresta’s AI Agent solutions and proprietary models.
Collaborate with cross-functional teams, including frontend engineers and machine learning engineers, to ensure seamless integration of AI Agents into Cresta’s customer solutions.
Lead initiatives to enhance system scalability and reliability in production environments, focusing on backend services that support AI functionalities.
Cresta aims to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. They are combining AI and human intelligence to help contact centers discover customer insights and empower every team member to work smarter and faster.
Architect and optimize distributed training and inference systems for large-scale AI models
Design and deliver customer-focused solutions that maximize performance and business value
Lead the transition of ML pipelines from POC to scalable production systems
The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.
You will work to build, maintain and improve our Torc ML frameworks.
You have built ML solutions that have reached production.
You want to build, maintain, grow, and improve our ML platform.
Torc has been a leader in autonomous driving since 2007. Now a part of the Daimler family, they are focused solely on developing software for automated trucks to transform how the world moves freight.