You own uptime, observability, incident response, and root cause analysis.
Own the AWS architecture.
Make ML pipelines reliable.
Ferra is building AI infrastructure for structural steel estimation. They process large-scale construction drawing PDFs, run computer vision + LLM pipelines, and generate structured steel graphs, takeoffs, and export-ready models. The team is small and technical, which means high ownership, fast decisions, and work that has a direct impact on the core product.
Advanced knowledge of AWS services including ML services (AWS SageMaker and AWS Step Functions).
Experience with ML monitoring and automation tools (MLflow, SageMaker Pipelines).
Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion.
Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability
AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.
Act as the overall technical authority for the programme, owning architectural decisions, execution patterns, and technical quality across all workstreams.
Define and enforce standard migration patterns for moving ML workloads from Databricks into AWS SageMaker, while managing exceptions for complex or legacy cases.
Lead and contribute across areas such as AWS SageMaker-based ML execution, Databricks to SageMaker migration, and Python-based ML workloads.
CreateFuture is a digital consultancy that builds digital products and services. They have over 500 people and a safe, supportive, and friendly culture.
Architect, implement, and maintain production-grade, low-latency ML services.
Collaborate with data scientists, product managers, and engineers.
Design and support experimentation frameworks to test hypotheses and measure improvements.
Smart Working connects skilled professionals with global teams and products for full-time, long-term roles. They are one of the highest-rated workplaces on Glassdoor and value integrity, excellence, and ambition for their employees' personal and professional growth.
Manage cloud infrastructure and optimize costs, particularly in AWS environments using Terraform and Python.
Design, develop, and maintain CI/CD pipelines and infrastructure for AI model training and deployment.
Ensure platform scalability and efficient resource utilization.
NEORIS, now part of EPAM Systems, is a Digital Accelerator that helps companies step into the future. With more than 20 years of experience as Digital Partners to some of the world’s leading organizations, they are over 4,000 professionals across 11 countries and foster a multicultural, startup-minded culture that promotes innovation, continuous learning, and the delivery of high-impact solutions for their clients.
Build and manage the full ML lifecycle—from experiment tracking to model deployment and retraining.
Implement ML-specific CI/CD (e.g., CML, Kubeflow Pipelines) to automate the promotion of models to production.
Architect distributed systems for large-scale model inference.
Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group, recognized as Hungary’s most attractive employer in 2025. They provide IT and telecommunications services with more than 5300 employees, serving hundreds of large customers in Germany and other European countries.
Own SentiLink’s real-time ML model monitoring domain.
Own our ML experimentation, model tracking, and versioning infrastructure.
Drive improvements to the model development process.
SentiLink provides identity and risk solutions for secure transactions. They are backed by investors like Craft Ventures and Andreessen Horowitz, recognized by Forbes Fintech 50, and have offices across the U.S. and India.
Own model serving: Design, build, and maintain low-latency, highly available serving stacks for in-house ML models and for integrations with LLM serving partners.
Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices.
Optimize at scale: Profile and tune throughput, memory, and cost; introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off.
Cresta aims to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines AI and human intelligence to help contact centers discover customer insights and automate conversations.
Design, develop, and maintain high-performance, scalable, and secure backend services, primarily using Python and frameworks like FastAPI
Translate ambiguous business and technical requirements into concrete software designs and actionable tasks for cross-functional teams
Operate and maintain production applications at scale, ensuring high availability, performance, and reliability
SmartAsset is an online destination for consumer-focused financial information and advice, whose mission is helping people make smart financial decisions, reaching over an estimated 59 million people each month. Valued at over $1 billion, SmartAsset has earned recognition on the Inc. 5000 and Deloitte Technology Fast 500 lists.
Partner with stakeholders to tackle technical problems at scale, building framework-agnostic services.
Establish roadmap and architecture for Wealthsimple’s Machine Learning platform.
Build highly performant, scalable systems, contributing to our ML platform on Kubernetes, Bedrock, and SageMaker.
Wealthsimple aims to provide financial freedom by making financial services transparent and low-cost. As the largest fintech company in Canada, with over 1,500 employees, they manage over $100 billion in assets and foster a collaborative and quality-focused culture.
Support the full operational lifecycle of both traditional machine learning systems and emerging generative AI driven applications.
Enable scalable training, evaluation, deployment, and monitoring for a wide range of ML and GenAI workloads.
Manage model upgrades, framework versions, regression testing, and maintenance tasks while maintaining performance across systems and solutions.
Achievers' employee recognition and rewards platform empowers organizations to build cultures where people feel seen and valued, every day. They're a team of passionate, thoughtful builders with more than 4.3 million users across 190 countries, who care deeply about their product, their customers, and each other.
You will work to build, maintain and improve our Torc ML frameworks.
You have built ML solutions that have reached production.
You want to build, maintain, grow, and improve our ML platform.
Torc has been a leader in autonomous driving since 2007. Now a part of the Daimler family, they are focused solely on developing software for automated trucks to transform how the world moves freight.
Build and maintain CI/CD pipelines and infrastructure-as-code.
Lead observability and monitoring initiatives.
Truelogic is a nearshore staff augmentation services provider headquartered in New York. They deliver technology solutions to companies of all sizes, helping them achieve their digital transformation goals with a team of 600+ highly skilled tech professionals based in Latin America.
Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
Focus on automation so we can spend energy where it matters.
Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.
Design, build, and maintain machine learning model productionization infrastructure.
Streamline model training, validation, and deployment in collaboration with the data science team.
Implement robust monitoring and alerting for model performance, drift, and data quality.
The Athletic delivers in-depth coverage of sports, teams, and athletes. Their newsroom of 500+ full-time staff covers hundreds of professional and college teams across North American markets and football clubs.
Build and productionize reusable MLOps components supporting scalable and reliable ML workflows.
Establish strong ML lifecycle practices including experiment tracking, evaluation, and reproducibility.
Enable robust and monitored ML systems aligned with healthcare-grade reliability and compliance requirements.
Neko Health aims to shift healthcare from treating illness to preventing it, using advanced, non‑invasive technology and clinical expertise to deliver early, actionable health insights. The company has nearly 100 full-time engineers and supports a flexible workplace that prioritizes work-life balance.
Partner with engineering leadership, EMs, and Product Managers to define and deliver AI products.
Architect scalable, high-performance systems that support a growing number of AI-powered products.
Drive technical strategy and make architectural decisions that compound, enabling the team to ship more AI experiences faster.
Webflow is building the world’s leading AI-native Digital Experience Platform as a remote-first company built on trust, transparency, and a whole lot of creativity. They empower teams to design, launch, and optimize for the web without barriers, from entrepreneurs launching their first idea to global enterprises scaling their digital presence.
Experience with Infrastructure as Code (Terraform, CloudFormation, CDK)
Experience with container orchestration (ECS, Docker)
Kunai builds full-stack technology solutions for banks, credit and payment networks, infrastructure providers, and their customers. They help clients modernize, capitalize on emerging trends, and evolve their business for the coming decades by remaining tech-agnostic and human-centered.
Design, build, maintain, and operate scalable streaming and batch data pipelines.
Work with AWS services, including Redshift, EMR, and ECS, to support data processing and analytics workloads.
Develop and maintain data workflows using Python and SQL.
Southworks helps companies with software development and digital transformation. They focus on solving complex problems and delivering innovative solutions.