Design, build, and operate high-scale data ingestion and replication systems from production data stores into the data lakehouse.
Build and maintain reliable, scalable data platform infrastructure capable of handling petabytes of data across analytics, AI, and operational use cases.
Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams move and access data safely.
Samsara is the pioneer of the Connected Operations Cloud, enabling organizations that depend on physical operations to harness IoT data for actionable insights. As a publicly traded company, Samsara fosters a growth-oriented culture and serves industries that represent over 40% of global GDP.
Build, maintain, and run CI/CD pipelines and infrastructure-as-code for the Smile Digital Health platform.
Provision, configure, and operate cloud-based Spark clusters and distributed data processing environments.
Design and maintain scalable, secure infrastructure templates and deployment automation across cloud environments.
Smile Digital Health makes it easy for healthcare stakeholders to collect and exchange data with our leading FHIR-based data liberation platform. At its heart, the Smile platform enables people and organizations to better manage healthcare data; the company was #19 on Deloitte's Technology Fast 50 Ranking for 2024!
Design and deliver robust, high-scale routing experiences for Data Pipelines for Twilio Segment.
Operate always-available, complex distributed systems in cloud environments.
Collaborate cross-functionally with design, product, and other engineers to define solutions.
Twilio is shaping the future of communications, delivering innovative solutions to hundreds of thousands of businesses and empowering millions of developers worldwide. The company is remote-first with a strong culture of connection and global inclusion, and employs a diverse team of Twilions.
Lead, manage, and mentor a group of data engineers.
Own the strategic design, development, and evolution of the Common Event Bus (CEB) architecture.
Actively manage technical debt and guide the team through migrations from legacy processing frameworks and data services into a modernized data stack.
TrueML is a mission-driven financial software company that aims to create better customer experiences for distressed borrowers. The TrueML team includes inspired data scientists, financial services industry experts and customer experience fanatics building technology to serve people.
Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.
Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.
Design and implement robust data infrastructure in AWS, using Spark with Scala.
Evolve our core data pipelines to efficiently scale for our massive growth.
Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.
tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.
Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
Collaborate cross-functionally with AI/ML and Product teams to implement new features.
Proactively identify and resolve bottlenecks in our complex ETL processes.
Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Be the Analytics Engineering lead within the Sales and Marketing organization.
Be the data steward for Sales and Marketing: architect and improve the collection of underlying data.
Develop and maintain robust data pipelines and workflows for data ingestion, processing, and transformation.
Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and millions of daily active unique visitors, Reddit is one of the internet’s largest sources of information.
Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
Contribute to automated testing suites and optimize data operations for integrity and security.
Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.
Contribute to the design and implementation of scalable data solutions.
Build and optimize batch and streaming ingestion pipelines.
Ensure data quality, reliability, and performance across pipelines and datasets.
Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.
Design, build, and maintain scalable data infrastructure using modern cloud technologies.
Develop robust batch and streaming data pipelines to ingest, process, and serve data.
Contribute to the implementation of a modern data lakehouse architecture.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. The system identifies the top-fitting candidates and shares this shortlist with the hiring company.
Design the technical architecture of the Databricks Data Warehouse and act as the pattern reviewer for the team.
Build and optimize secure self-service frameworks for batch and streaming data so the same request is never solved by hand twice.
Treat the platform like production software by defining SLOs, owning observability, and leading incident triage.
Tilt uses machine learning and mobile-first products to provide credit based on over 250 real-time financial signals, not just credit scores. With millions of customers worldwide, the company is building a new credit system for working people.
Owns organizational-wide data architecture, defining standards, patterns, and designs that our teams will implement.
Reviews data-related designs and implementations across teams for architectural consistency, performance, and scalability.
Designs and develops data pipelines, integrations, and platform features with performance and scalability in mind.
Tenna provides a platform that revolutionizes construction equipment fleet operations. They provide innovative solutions to customers looking for competitive ways to better manage and track their assets, such as heavy and light equipment, large fleets, tools, and materials. They value quality-obsessed, gritty, continuous learners, and collaborative problem solvers.
Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
Architect and evolve our CI/CD processes - enhancing the testing environment and observability.
Enhance our Claude Code / LLM development support capabilities - creating tools / skills / agents that give our LLMs more context and help us continually improve their abilities to debug, create code, and maintain systems.
Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees and delivers peace of mind and enhances everyday family life.
Design, implement, and maintain a streaming-first data platform.
Build and operate a hybrid Databricks and Snowflake architecture.
Champion platform standards, tooling, and best practices across the data organization.
1Password is building a foundation for a safe, productive digital future to unleash employee productivity without compromising security. They have over 180,000 businesses trusting their teams; its culture prioritizes collaboration, clear communication, and alignment with core values.
Lead the design and evolution of scalable financial data systems supporting commissions, incentives, and payments.
Build and maintain robust data pipelines using Python, SQL, Spark, and Terraform for accuracy and performance.
Define technical strategy and roadmap for financial data operations in collaboration with stakeholders.
Our partner is a fast-growing technology company building financial data infrastructure for insurance operations. They have a remote-friendly work environment and emphasize engineering excellence and cross-functional collaboration.
Build streaming and batch pipelines that ingest, normalise, and distribute market, trading, and portfolio data.
Build the self-serve tooling so other teams publish, consume, and build on data products without waiting.
Own data contracts and schema evolution; keep schema changes from turning into multi-team coordination events.
Keyrock is a change-maker in the digital asset space, renowned for its partnerships and innovation. They have over 250 team members around the world with diverse backgrounds and hubs in London, Brussels, and Singapore, hosting regular online and offline hangouts.
Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.
YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.
Design, build, and maintain scalable backend services and APIs that power Chattermill’s core analytics platform.
Architect reliable, maintainable distributed systems and contribute to the evolution of backend service design and infrastructure.
Own end-to-end delivery of backend engineering workstreams, from technical scoping and architecture through to implementation, testing, observability, and production support.
Chattermill helps large successful brands like Uber, Amazon, and Wise put their customers at the centre of everything they do. Using best-in-class tech in a fast-evolving AI space, their Customer Experience Intelligence platform continuously analyses feedback to help clients identify what to do next.