Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
Optimize serverless container workflows for AI workloads, ensuring performance, scalability, and seamless autoscaling.
Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals globally and collaborate with technology partners such as Intel, NVIDIA, Dell, and Equinix.
Architect and optimize distributed training and inference systems for large-scale AI models
Design and deliver customer-focused solutions that maximize performance and business value
Lead the transition of ML pipelines from POC to scalable production systems
The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.
Own model serving: Design, build, and maintain low-latency, highly available serving stacks for in-house ML models, and integrate with LLM serving partners.
Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices.
Optimize at scale: Profile and tune throughput, memory, and cost; introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off.
Cresta aims to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines AI and human intelligence to help contact centers discover customer insights and automate conversations.
Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability
AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.
Develop and maintain backend systems and services for generative AI and agentic workflows.
Integrate AI-driven capabilities across the Seismic platform, working with data scientists and AI engineers.
Monitor and optimize the performance of agentic workflows, ensuring low-latency query responses.
Seismic provides sales enablement solutions, leveraging AI to enhance sales and marketing organizations. They focus on improving productivity and sales outcomes through their AI engine, Seismic Aura, integrated into their enablement cloud.
Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
Focus on packaging and integrating new ML models into the platform, using Python and common ML frameworks.
Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They power everything from real-time communication and streaming to enterprise AI and secure web applications, with over 550 professionals globally and partnerships with technology leaders.
Work with customers to develop requirements and scope for new AI/ML projects.
Develop computer vision and machine learning based solutions for inspection platforms.
Analyze large datasets to extract meaningful insights and drive business decisions.
Loram provides advanced insights into inspection data collected for customers worldwide. The company has a small, collaborative team managing the entire project lifecycle, offering employees an outsized impact on inspections and maintenance recommendations.
Collaborate closely with business stakeholders and other engineers to deliver impactful solutions.
Integrate services and product features with databases and messaging queues.
Contribute to the development of our MLOps tools for ML models.
Trellis is rewriting the insurance experience from the inside out. They are a profitable, fast-growing Series A startup backed by General Catalyst, QED, NYCA, and Amex Ventures that brings clarity and ease to insurance shopping.
Advanced knowledge of AWS services including ML services (AWS SageMaker and AWS Step Functions).
Experience with ML monitoring and automation tools (MLflow, SageMaker Pipelines).
Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion.
Partner with Sales to qualify opportunities and lead technical discovery.
Design and deliver compelling demos adapting to both technical and executive audiences.
Own the technical POC process ensuring clean handoffs into production.
Andromeda Cluster gives early-stage startups access to scaled AI infrastructure. They work with AI labs, data centers, and cloud providers to deliver compute when and where it’s needed most to build the liquidity layer for global AI compute.
Own SentiLink’s real-time ML model monitoring domain.
Own our ML experimentation, model tracking, and versioning infrastructure.
Drive improvements to the model development process.
SentiLink provides identity and risk solutions for secure transactions. They are backed by investors like Craft Ventures and Andreessen Horowitz, recognized by Forbes Fintech 50, and have offices across the U.S. and India.
Work directly with CV researchers to understand their goals, review their code, and engineer it for reliability and performance at scale.
Profile and optimize performance-sensitive code across both training and real-time inference.
Identify patterns across research efforts and propose standardized, composable abstractions.
GameChanger believes in the life-changing impact youth sports have on and off the field. By building the first and best place to experience the youth sports moments important to their community, they are helping families elevate the next generation through youth sports. They are a remote-first, dynamic tech company based in New York City, and they are solving some of the biggest challenges in youth sports today.
Partner with stakeholders to tackle technical problems at scale, building framework-agnostic services.
Establish roadmap and architecture for Wealthsimple’s Machine Learning platform.
Build highly performant, scalable systems, contributing to our ML platform on Kubernetes, Bedrock, and SageMaker.
Wealthsimple aims to provide financial freedom by making financial services transparent and low-cost. As the largest fintech company in Canada, with over 1,500 employees, they manage over $100 billion in assets and foster a collaborative and quality-focused culture.
You will work to build, maintain and improve our Torc ML frameworks.
You have built ML solutions that have reached production.
You want to build, maintain, grow, and improve our ML platform.
Torc has been a leader in autonomous driving since 2007. Now a part of the Daimler family, they are focused solely on developing software for automated trucks to transform how the world moves freight.
TrueML is a mission-driven financial software company that aims to create better customer experiences for distressed borrowers. The TrueML team includes inspired data scientists, financial services industry experts and customer experience fanatics building technology to serve people.
Design, develop, and maintain high-quality software solutions using Python.
Contribute to the design and evolution of scalable and maintainable software architectures.
Deploy, operate, and monitor applications in cloud environments (AWS, Azure, or GCP).
Lynx Analytics works on real-world AI and advanced analytics solutions with measurable business impact. They have a collaborative culture that values real outcomes, offering high ownership and rapid learning opportunities.
Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.
Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge, and has developed the GLiNER family of open-source models. Fastino has raised $25M in a seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.
Design, develop, and maintain high-performance, scalable, and secure backend services, primarily using Python and frameworks like FastAPI
Translate ambiguous business and technical requirements into concrete software designs and actionable tasks for cross-functional teams
Operate and maintain production applications at scale, ensuring high availability, performance, and reliability
SmartAsset is an online destination for consumer-focused financial information and advice, whose mission is helping people make smart financial decisions, reaching over an estimated 59 million people each month. Valued at over $1 billion, SmartAsset has earned recognition on the Inc. 5000 and Deloitte Technology Fast 500 lists.
Design, build, and maintain machine learning model productionization infrastructure.
Streamline model training, validation, and deployment in collaboration with the data science team.
Implement robust monitoring and alerting for model performance, drift, and data quality.
The Athletic delivers in-depth coverage of sports, teams, and athletes. Their newsroom of 500+ full-time staff covers hundreds of professional and college teams across North American markets and football clubs.
Lead projects end to end and contribute to impactful platform initiatives.
Partner with engineers, scientists, product managers and business teams to identify opportunities.
Design and ship components of a new platform architecture to enable multi-tenancy and scaling.
Freenome is working to detect cancer in its earliest, most treatable stages using a routine blood draw. Freenome is an equal-opportunity employer who values diversity and does not discriminate.