Build and operate scalable backend services and internal APIs for the AI platform.
Integrate LLMs and AI tool execution into reliable, production-ready workflows.
Own production reliability for AI platform infrastructure through observability, alerting, and incident response.
MaintainX is the world's leading Asset and Work Intelligence platform for industrial and frontline environments. They are a modern IoT-enabled cloud-based tool for reliability, safety, and operations on physical equipment and facilities, powering operational excellence for 13,000+ businesses. MaintainX recently completed a $150 million Series D round, at a valuation of $2.5 billion.
Implement AI governance policies in collaboration with NBCU Legal, Privacy, and Cyber teams.
Build monitoring and reporting frameworks for AI models and tools, emphasizing cost, tagging, and AI FinOps principles.
Develop and manage ML/AI Ops pipelines including CI/CD for models using GitHub Actions or Jenkins.
NBCUniversal is a media and entertainment company that creates world-class content distributed across film, television, and streaming. They also have global theme park destinations, consumer products, and experiences. NBCUniversal is a subsidiary of Comcast Corporation, and it strives to attract and develop a talented workforce that reflects our world by championing an inclusive culture.
Design and manage AWS infrastructure for AI services.
Implement Infrastructure as Code using Terraform.
Collaborate with cross-functional teams to enhance performance.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements. Their system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Support and evolve the reliability of platforms used by the AI Research team.
Ensure production services meet expectations for availability, latency, and operational readiness.
Build and maintain Kubernetes-based services on GCP using infrastructure-as-code and GitOps.
Algolia is a pioneer and market leader in AI Search, empowering 17,000+ businesses to deliver blazing-fast, predictive search and browse experiences. They have raised $150 million in Series D funding, quadrupling their valuation to $2.25 billion, to invest in their market-leading platform.
Design, build and maintain agentic workflows using low-code/no-code orchestration platforms and enterprise LLMs.
Translate business needs into AI logic by dissecting requests and re-engineering processes to be compatible with how LLMs and agents work.
Integrate agents with enterprise platforms via APIs and webhooks and determine how to extract, clean, and contextually feed data into AI models.
Lookout, Inc. is an endpoint to cloud security company purpose-built for the intersection of enterprise and personal data. They safeguard data across devices, apps, networks and clouds through their unified, cloud-native security platform and are trusted by enterprises of all sizes, government agencies and millions of consumers.
Design, implement, and maintain high-performance ML training and inference platforms.
Ship tools that allow any ML engineer to deploy a model in minutes, not days.
Improve scalability, reliability, and cost efficiency of model training and serving systems.
Speechify's mission is to make sure that reading is never a barrier to learning. With nearly 200 people around the globe working in a 100% distributed setting, Speechify's team includes frontend and backend engineers, AI research scientists, and others.
Architect and deploy secure, scalable infrastructure using Terraform, CloudFormation, or similar tools.
Ensure the platform meets strict SLA requirements for enterprise clients, minimizing downtime.
Implement comprehensive monitoring, logging, and alerting to provide deep visibility into system health.
Filevine provides cloud-based workflow tools for legal professionals, helping them manage their organizations and serve their clients. They are recognized as a fast-growing and innovative technology company with a team of passionate professionals.
Architect and operate large-scale, mission-critical cloud infrastructures.
Build and maintain Copilot Studio / Microsoft Virtual Agents solutions.
Use AI-assisted development tools to accelerate development and improve code quality.
Coderio designs and delivers scalable digital solutions for global companies. With a strong technical foundation and a product-oriented mindset, their teams lead complex software projects from architecture to execution. They value autonomy, clear communication, and technical excellence.
Partner closely with data engineering and data science teams to enable reliable data pipelines, analytics, and ML workflows.
Support, operate, and optimize Databricks and Snowflake environments in production.
Monitor, troubleshoot, and optimize systems for performance, reliability, and cost efficiency.
Life360's mission is to keep people close to the ones they love with their mobile app and Tile tracking devices, empowering members to protect what they care about most with services like location sharing and crash detection. Life360 has more than 750 remote-first employees and enhances everyday family life with seamless coordination.
Design, build, and scale enterprise-grade AI/ML systems that power internal workflows and external-facing AI/ML platforms.
Develop a production-ready Generative AI and MLOps platform with reusable components used to deploy multiple AI solutions across Natera’s business units.
Implement cloud-native infrastructure for large-scale model training and serving using Kubernetes, MLflow, Terraform, and AWS-native services.
Natera is a global leader in cell-free DNA (cfDNA) testing. They are dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.
Work hands-on with the infrastructure that supports our distributed and highly scalable services.
Gather requirements from customers and adapt manifests and software to support new environments.
Automate and optimize the release pipeline to make it as frictionless as possible.
Arize AI is transforming the world by providing a leading AI observability and evaluation platform. They empower AI engineers to ship high-performing, reliable agents and applications, unifying build, test, and run in a single workspace, with over 150 leading enterprises as customers.
Partner directly with customers to deploy and configure AI agents for clinical documentation, scheduling, billing, communications, and workflow automation.
Maintain AI governance frameworks that ensure safe, reliable agent operations in production healthcare environments.
Work hands-on with foundation model APIs (OpenAI, Claude, Gemini) and the Canvas SDK to troubleshoot and optimize agent behavior.
We're accelerating everyday medicine with an EMR platform built for healthcare automation. We create modern, elegant front- and back-end tooling to enable new ways for developers and clinicians to integrate data, automate workflows, and collaborate to solve healthcare's toughest challenges. We are institutionally backed and funded by notable health tech companies.
Design cloud-native architectures for agentic AI workloads using Kubernetes/EKS, Terraform, Docker, serverless APIs, AWS Batch, and async orchestration frameworks.
Define agentic system patterns using LangChain, LangGraph, Autogen, LlamaIndex, Pinecone, and other multi-agent frameworks; ensure consistency of prompt/tool design.
Architect vector database, RAG, embeddings pipelines, and model-serving endpoints (LLM/SLM) with strong emphasis on scalability and latency management.
AHEAD builds platforms for digital business by weaving together advances in cloud infrastructure, automation and analytics, and software delivery, helping enterprises deliver on the promise of digital transformation. They prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard.
Automate infrastructure provisioning, configuration management, monitoring, and operational workflows using IaC and scripting languages.
Own the deployment, maintenance, and lifecycle management of systems supporting engineering, leveraging deep expertise in Kubernetes.
Troubleshoot complex infrastructure and application issues, driving root-cause analysis and developing long-term remediation solutions.
SingleStore delivers the cloud-native database with the speed and scale to power the world’s data-intensive applications. They are venture-backed and headquartered in San Francisco with offices in Sunnyvale, Raleigh, Seattle, Boston, London, Lisbon, Bangalore, Dublin and Kyiv.
Design and deliver scalable AI systems that connect models, data, and products.
Turn research prototypes into secure, reliable, production-ready services.
Build pipelines and serving layers that power adaptive, real-time features.
KnowBe4 is a cybersecurity company that puts security first, offering an AI-driven Human Risk Management platform. They empower over 70,000 organizations worldwide to strengthen their security culture and transform their workforce into their strongest security asset.
Build Containerized Agent Systems: Design and implement systems that leverage Docker containers as the ideal runtime for AI agents, ensuring isolation, scalability, and portability.
Expand cagent: Maintain and evolve the open-source cagent project, adding new capabilities for containerized agent deployment and orchestration.
Agent Runtime Development: Build robust infrastructure for packaging, deploying, and managing agents in containers.
Docker makes app development easier so developers can focus on what matters. They are a remote-first team that spans the globe, united by a passion for innovation and great developer experiences, with over 20 million monthly users and 20 billion image pulls.
Ensure the smooth operation and high availability of Clarifai's core services.
Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency.
Design and implement scalable, secure, and cost-effective infrastructure solutions.
Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have raised $100M in funding and are committed to building a diverse, inclusive, and globally distributed team.
Design and implement MLOps pipelines to automate model training, deployment, monitoring, and management.
Lead and mentor a team of MLOps Engineers, fostering an inclusive and collaborative environment that encourages innovation and continuous learning.
Collaborate with Data Scientists and ML Engineers to ensure models are production-ready, scalable, and maintainable.
Egen is a fast-growing and entrepreneurial company with a data-first mindset. They bring together the best engineering talent working with the most advanced technology platforms, including Google Cloud and Salesforce, to help clients drive action and impact through data and insights.
Design, implement, and maintain robust, containerized, and reproducible pipelines for model training, evaluation, and deployment—across both batch and real-time settings.
Build and manage ML services, APIs, and model serving infrastructure using tools like MLflow, Amazon SageMaker, and Feature Store.
Set up and maintain monitoring, observability, and alerting systems to ensure high availability and performance (including model/data drift, feature logging, and inference latency).
AUTO1 Group Technology drives innovation in the used car market across Europe. They operate at the intersection of software engineering, data science, and DevOps, helping bring state-of-the-art ML models—such as large-scale recommendation systems and transformer-based neural networks—safely into production.
Define the technical vision and lead architecture decisions.
Design and implement systems for AI agents using Docker containers.
Maintain and evolve the open-source cagent project.
Docker makes app development easier, allowing developers to focus on their priorities. They are a remote-first team with over 20 million monthly users and 20 billion image pulls, and their tools are trusted by startups and Fortune 100 companies.