Design, build, and operate scheduled and event-driven data pipelines for simulation outputs, telemetry, logs, dashboards, and scenario metadata
Build and operate data storage systems (structured and semi-structured) optimized for scale, versioning, and replay
Support analytics, reporting, and ML workflows by exposing clean, well-documented datasets and APIs
Onebrief provides collaboration and AI-powered workflow software designed specifically for military staffs, valued at $2.15B. They operate as an all-remote company with a team spanning veterans from all forces and global organizations, and technologists from leading-edge software companies.
Design, build, and maintain distributed data pipelines that power Spotify Wrapped data stories and personalized experiences for more than 300M users globally.
Partner with Data Scientists to evaluate and operationalize new Wrapped story concepts, balancing personalization, scalability, and eligibility requirements.
Build scalable systems that process large-scale listening data and generate insights that celebrate users’ unique listening journeys.
The Personalization team makes deciding what to play next easier and more enjoyable for every listener. They are behind some of Spotify’s most-loved features. Join them and you’ll keep millions of users listening by making great recommendations to each and every one of them.
Build and improve AI-native products and data-driven systems.
Rapidly prototype and iterate on AI-powered features.
Analyze, evaluate, and improve model outputs and system reliability.
FutureProofing is a talent platform focused on embedding high-caliber technical talent into startups building real AI-driven products. They work at the intersection of startup execution and real AI product development, helping companies build and improve production AI systems.
Design and build ETL processes in collaboration with software and model development teams.
Create and maintain scalable data infrastructure.
Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.
OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.
Deliver high-trust analysis on clear timelines with explicit assumptions and limits.
Translate ambiguous business problems into crisp analysis scopes, metrics definitions, and reproducible systems.
Partner with GTM teams to move insights to actions and proactively surface anomalies or opportunities.
Rundoo empowers independent supply stores like local hardware stores and mom-and-pop nurseries with best-in-class technology to modernize their outdated systems in a $1T building materials industry. The team of builders, sellers, and industry veterans is backed by leading investors, has raised $18M, and is growing quickly to bring modern tech to an overlooked industry.
Develop Analysis Frameworks: Design and maintain scalable frameworks for client resolutions during POCs, onboarding, and production.
Build Monitoring & Alerting Systems: Create dashboards and real-time alert systems to monitor model performance and identify issues.
Drive Closed-Loop Model Improvement: Systematically monitor customer data to integrate real-world feedback into model development.
Socure builds identity trust infrastructure for the digital economy, verifying identities and preventing fraud in real time. The company hires critical thinkers who act like owners and cares deeply about solving customer problems with a high-performing team.
Lead architecture and hands-on development of distributed systems supporting healthcare data workflows.
Design and implement scalable data pipelines for large-scale datasets.
Partner with Product and Data teams to translate healthcare requirements into scalable architectures.
Zeta Global is an AI-Powered Marketing Cloud that leverages advanced artificial intelligence (AI) and consumer signals, helping marketers acquire, grow, and retain customers efficiently. Founded in 2007, Zeta is headquartered in New York City with offices around the world, fostering a culture of trust and belonging.
Own and maintain data pipeline architectures, ensuring reliability and monitoring.
Manage and evolve data modeling environments for analysts and engineers.
Implement observability for data systems, detecting issues early and continuously monitoring data quality.
Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.
Design, build, and maintain scalable data pipelines
Develop and optimize ETL processes to support data products
Work with structured and unstructured data across SQL and NoSQL systems
They are seeking a Data Engineer to support the development of data products that power critical business functions. They seem to have a collaborative, cross-functional Agile environment where you'll partner closely with technical and business teams to deliver high-quality data solutions.
Design & build data observability platforms and metrics.
Build metadata driven pipeline solutions.
Fuze Health puts patients first and tirelessly addresses the most pressing needs in healthcare. They empower millions to digitally connect with care providers, essential health resources and needed treatments. The company is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors.
Building pipelines that augment documents with metadata.
Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
Optimizing and evaluating our core utils, which do things like extracting and resolving citations.
We are hiring a senior software/data engineer to help build the largest case law dataset. Our data coverage includes US laws and court decisions and powers our lawyer-facing AI platform and B2B data services.
Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
Speechify's mission is to make sure that reading is never a barrier to learning by offering text-to-speech products. They are a fully distributed company with nearly 200 employees around the globe, including engineers and scientists from top companies and programs.
Analyze behavioral patterns to identify trends and opportunities to improve adoption, engagement, and retention.
Design experiments and measurement approaches to evaluate the impact of product and marketing changes.
Build models and datasets that improve how teams segment customers, forecast outcomes, and identify trends.
Customer.io's platform is used by over 8,000 companies to send billions of emails, push notifications, in-app messages, and SMS every day. They power automated communication that people actually want to receive, helping teams send smarter messages using real-time behavioral data.
Design and implement batch and real time ingestion pipelines from internal and external sources.
Implement automated data quality checks, observability, and SLA monitoring.
Optimise datasets and pipelines for analytics, ML training, and API consumption.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.
Quickly get up-to-speed on Zscaler’s SecOps platform, utilizing Python and APIs to configure, customize, and automate data transformations and workflows.
Partner with cybersecurity subject matter experts (SMEs) to onboard new data pipelines and map diverse IT and security sources to fulfill specific customer use cases.
Proactively troubleshoot pipeline health and audit customer data across environments to identify quality issues, flag security gaps, and define clear remediation steps.
Zscaler accelerates digital transformation to ensure customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, they leverage the world’s largest security data lake to power their cloud-native Zero Trust Exchange platform. They build high-performing teams that can make an impact quickly and with high quality.
Instrument fal's core infrastructure to capture CPU, GPU, and request-level signals.
Build ingestion pipelines from partner APIs, compute vendors, and internal services into BigQuery.
Design and operate the ETL backbone that powers cost, margin, and usage analytics.
Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production at scale.
Design, build, and own scalable data pipelines and systems that power analytics, machine learning, and business operations.
Drive system design for data architecture, owning data models and storage solutions to create scalable foundations for the team.
Collaborate with engineering, product, and data teams to translate business needs into technical solutions, ensuring data quality and performance standards.
Goodway Group is a remote-first, data-driven, and technology-enabled digital media and marketing services firm with a 90+ year history, offering the security of an established company with a start-up feel. It is a diverse team of strategists, practitioners, technologists, and data scientists that is recognized as a top workplace and a certified partner to The Trade Desk.
Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
Architect and evolve our CI/CD processes - enhancing the testing environment and observability.
Enhance our Claude Code / LLM development support capabilities - creating tools / skills / agents that give our LLMs more context and help us continually improve their abilities to debug, create code, and maintain systems.
Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees and delivers peace of mind and enhances everyday family life.
Build streaming and batch pipelines that ingest, normalise, and distribute market, trading, and portfolio data.
Build the self-serve tooling so other teams publish, consume, and build on data products without waiting.
Own data contracts and schema evolution; keep schema changes from turning into multi-team coordination events.
Keyrock is a change-maker in the digital asset space, renowned for its partnerships and innovation. They have over 250 team members around the world with diverse backgrounds and hubs in London, Brussels, and Singapore, hosting regular online and offline hangouts.