Lead the implementation, monitoring, and continuous improvement of security, governance, and trust controls for AI systems.
Define trustworthy and untrustworthy AI behavior and ensure it is measurable in production for security event analysis.
Translate governance principles into technical and operational requirements that product and platform teams can adopt.
YipitData is a market research and analytics firm for the disruptive economy. They analyze billions of alternative data points daily, providing insights on various markets and are backed by The Carlyle Group and Norwest Venture Partners.
Build and maintain CI/CD pipelines and deployment infrastructure.
Leverage AI to automate analysis and resolution of production issues.
Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production.
Own and evolve CI/CD pipelines using GitHub Actions and OIDC-based authentication for microservices and agentic workloads.
Automate infrastructure provisioning using Infrastructure as Code tools such as Terraform and CloudFormation.
Operate and scale our Kubernetes platform, including autoscaling, ingress, and multi-tenant isolation for enterprise customers.
Zingtree is a next-generation intelligent process automation platform reimagining customer experience operations for enterprise support leaders. It is a small team with high ownership, emphasizing automation, collaboration, and transparency.
Own and operate GPU and accelerator clusters for AI training, inference, and experimentation, ensuring reliability and cost-efficiency.
Build and optimize scheduling, orchestration, and serving systems using frameworks like vLLM and Triton to improve latency, throughput, and memory efficiency.
Partner with ML engineers to remove workflow bottlenecks and build observability for GPU utilization, capacity, and incident response.
Kraken is a crypto exchange platform building premium financial products for traders and institutions, accelerating global crypto adoption. It is a mission-driven, fully remote company with a world-class team of crypto experts spread across more than 70 countries.
Implement and manage AI-powered tools, copilots, and workflow automations from POC to production, owning the full technical lifecycle.
Design, deploy, and maintain cloud infrastructure on AWS and Azure, including IAM, VPCs, security groups, multi-account strategies, and cost optimization.
Own reliability, observability, and security controls across all AI and cloud services, including incident response, debugging complex multi-service environments, and driving continuous improvement.
Dragos is dedicated to arming customers with best-in-class technology, threat intelligence, and services to protect their systems. They're a remote-first culture with operations in North America, Europe, the Middle East, and APAC, looking for mission-oriented teammates who embody their core values of authenticity, transparency, and trust.
Own the security strategy for frontier model access and MCP governance.
Architect the identity and trust model for non-human agents and set the adversarial defense posture for AI systems in production.
Secure the shared knowledge layer and build AI supply chain integrity into the platform.
Life360's mission is to keep people close to the ones they love through its mobile app and tracking devices, providing services like location sharing and crash detection. It is a remote-first company with over 500 employees, serving nearly 96 million monthly users across more than 180 countries.
Define and evolve the architecture and roadmap for enterprise‑scale Data and AI platforms.
Design and build multi‑tenant, multi‑region, highly available AI platforms with governance.
Lead capacity planning and cost optimization strategies for GPU and CPU workloads.
NEORIS accelerates growth in Ibero‑America, combining global engineering with regional expertise. With over 60,000 professionals across 55+ countries, they offer technical specialization career paths and value responsibility, collaboration, creativity, and commitment.
Architect, implement, and maintain AI agent orchestration platforms for multi-agent delegation, session management, and streaming interfaces.
Design and build Model Context Protocol (MCP) servers to expose domain-specific capabilities as composable tools with rigorous security and isolation.
Deploy and operate containerized services on AWS using Terraform, optimizing for cost, reliability, and observability while partnering with ML and data engineers.
Natera is a global leader in cell-free DNA testing dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing part of standard care. The company consists of a dedicated team of professionals from world-class institutions who are stretched and challenged while working to change genetic disease management.
Own and evolve a scalable observability platform spanning metrics, logs, traces, and events.
Design telemetry pipelines ingesting data from GPUs, CPUs, networking, containers, APIs, and BMC/Redfish.
Design and implement noise-resistant alerting systems to improve signal quality and reduce operational load.
Lightning AI builds an end-to-end platform for developing, training, and deploying AI systems, designed to take ideas from research to production with less friction. They combine developer-first software with cost-efficient, large-scale compute, serving solo researchers, startups, and large enterprises.
Lead implementation engagements end-to-end, running the implementation support program.
Architect agent workflows with customers, designing agent architectures for specific use cases.
Build alongside customers, writing and iterating on agent prompts, skills, and configurations.
Warp is building the platform for agentic development, evolving from a terminal to a full agentic development environment. With over 750k active developers, it's one of the fastest-growing startups in the AI development space, backed by venture capital firms and passionate angels.
Responsible for the foundational security posture of our organization.
Architect and build preventative guardrails and mitigate new risks introduced by first and third-party AI agents in our Enterprise.
Develop and set the long term roadmap for agentic AI identity and posture management, ensuring cohesive strategies for reducing risk from agentic AI use.
Twilio is shaping the future of communications, delivering innovative solutions to hundreds of thousands of businesses and empowering millions of developers worldwide to craft personalized customer experiences. Our dedication to remote-first work, and strong culture of connection and global inclusion means that no matter your location, you’re part of a vibrant team with diverse experiences making a global impact each day.
Own the technical direction of Remote's SRE/Platform domain.
Define and drive the reliability strategy across the platform.
Identify and lead AI enablement initiatives across the engineering organisation.
Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.
Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
Optimize serverless container workflows for AI workloads, ensuring performance, scalability, and seamless autoscaling.
Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals globally and power everything from real-time communication and streaming to enterprise AI and secure web applications.
Own and evolve Quansight's cloud infrastructure across AWS, Azure, and GCP.
Build, deploy, and maintain internal dashboards and reporting for operations and project management.
Lead infrastructure engagements for clients from scoping and architecture through delivery, upskilling client teams.
Quansight is rooted in the Python and PyData ecosystems. They provide services ranging from open-source software development to training and consulting, believing in a culture of do-ers, learners, and collaborators.
Architecting core platform primitives while setting the standard for technical excellence and safety.
Designing durable, production-grade systems that power patient engagement and provider workflows at scale.
Owning the AI foundation that determines how generative AI shows up across mental healthcare.
Rula is dedicated to treating the whole person, not just the symptoms, and aims to create a world where mental health is embraced as an integral part of one's overall well-being. They are a remote-first company that hires in most U.S. states and are dedicated to having a culture of inclusion.
Define, implement, and maintain the AI security strategy across Deel's infrastructure and product ecosystem.
Lead security assessments and threat modeling for AI/ML models, LLM integrations, and agentic AI systems.
Evaluate and deploy AI Security Posture Management (AISPM) and AI Detection & Response (AIDR) solutions.
Deel is the all-in-one payroll and HR platform for global teams with a vision to unlock global opportunity. They are among the largest globally distributed companies with a team of 7,000 spanning more than 100 countries with a connected and dynamic culture.
Design, implement, and maintain reliable, scalable, and secure infrastructure, applications, and tooling, with a focus on our ML/AI pipelines and workloads
Write clean, maintainable code, and perform peer code-reviews
Write clear and concise documentation and engage in cross-team communication and knowledge sharing
Bright Machines is a next-generation, AI-enabled manufacturer focused on data center infrastructure assembly operations. The company utilizes AI-based robotics and software to assemble AI infrastructure hardware products for hyperscalers and leading OEMs, employing under 500 employees, with a culture rooted in innovation and expertise.
Provide technical leadership for infrastructure, reliability, and observability.
Own the observability stack using Datadog and CloudWatch.
Design and evolve AWS infrastructure for reliability, security, scalability, and cost efficiency.
Topstep is an engaging working environment that ranges from fully remote to hybrid. They foster a culture of collaboration by keeping cameras on during meetings and maintaining a robust Slack environment for communication.
Design end-to-end AI integration architectures connecting LLM APIs, vector databases, and inference systems to existing backend infrastructure.
Build reusable ML infrastructure components like feature pipelines, model serving layers, and evaluation frameworks that multiple portfolio companies standardize on.
Establish AI system integration best practices and governance patterns that become repeatable playbooks across the holding company.
Emergence is a thematic holding company backed by the Pritzker Organization focused exclusively on acquiring and scaling category-defining software businesses. They invest in focused portfolios, specialized operating groups with deep domain expertise and proven playbooks.
Design, develop, test, and deploy secure production systems across cloud-native and edge appliance deployments.
Embed with cross-functional teams to advise on infrastructure and security best practices that perform in production.
Own end-to-end infrastructure outcomes for critical programs and harden the artifact pipeline for consistent builds across all deployments.
Onebrief develops collaboration and AI-powered workflow software specifically designed for military staffs to enhance their efficiency and effectiveness. The company is fully remote, employs a team of veterans and technologists, and is valued at $2.15B with significant funding from top-tier investors, fostering a culture of ownership, excellence, and serious teamwork.