Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Design & build data observability platforms and metrics.
Build metadata driven pipeline solutions.
Fuze Health puts patients first and tirelessly addresses the most pressing needs in healthcare. They empower millions to digitally connect with care providers, essential health resources and needed treatments. The company is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors.
Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.
Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.
Owns organizational-wide data architecture, defining standards, patterns, and designs that our teams will implement.
Reviews data-related designs and implementations across teams for architectural consistency, performance, and scalability.
Designs and develops data pipelines, integrations, and platform features with performance and scalability in mind.
Tenna provides a platform that revolutionizes construction equipment fleet operations. They provide innovative solutions to customers looking for competitive ways to better manage and track their assets, such as heavy and light equipment, large fleets, tools, and materials. They value quality-obsessed, gritty, continuous learners, and collaborative problem solvers.
Build, maintain, and operate data pipelines and curated data products across Snowflake, Airflow (MWAA), AWS, Python, and SQL.
Implement observability and data quality controls and build monitoring for freshness, volume, schema, distribution, and lineage.
Define and enforce data platform standards, establish orchestration patterns, DAG anti-patterns, deployment practices, observability standards, data quality patterns, and operational runbooks used across the organization.
Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
Architect and evolve our CI/CD processes - enhancing the testing environment and observability.
Enhance our Claude Code / LLM development support capabilities - creating tools / skills / agents that give our LLMs more context and help us continually improve their abilities to debug, create code, and maintain systems.
Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees and delivers peace of mind and enhances everyday family life.
Build infrastructure and data automation pipelines to ingest, process, and load data from various sources.
Collaborate with stakeholders and data science teams to develop data products aligned with organizational goals.
Develop data analysis tools to provide insights and capture key metrics.
Columbia General is seeking a Senior Data Engineer to help transform data into actionable insights that drive decision-making. The company fosters a dynamic, collaborative environment that supports growth and innovation.
Lead architecture, system design and engineering efforts for high-scale, data-intensive B2B systems.
Design and implement batch and real-time processing architectures that are reliable, observable, and performant.
Mentor and coach engineers at all levels, and actively contribute to Omada’s engineering community.
Omada Health is a digital care provider that empowers people to achieve their health goals through sustainable behavioral change. They have served more than two million members and strive to build an inclusive culture where differences are celebrated.
Lead workspace architecture, Unity Catalog governance, and cluster policy design for client tenant organizations.
Perform tenant discovery, requirements gathering, source profiling, and security classification for new data intake requests.
Develop end-to-end technical designs for tenant onboarding, including Data Sharing Agreements and SLA documentation.
M9 Solutions provides IT services and solutions to the Federal Government, mobilizing skilled people and technologies for improved performance and sustainable change. With 15+ years of proven delivery and growth, the company has been recognized as an Inc. 5000 Fastest-Growing Private Company multiple times and values diverse perspectives.
Design, develop, and maintain robust and scalable data pipelines using Apache Spark and cloud-native data services.
Build, optimize, and support ETL/ELT workflows to enable analytics, reporting, and downstream applications.
Implement and manage data solutions using Databricks, Delta Lake, and Unity Catalog.
Onebridge, a Marlabs Company, is a global AI and Data Analytics Consulting Firm that empowers organizations worldwide to drive better outcomes through data and technology. Since 2005, they have partnered with some of the largest healthcare, life sciences, financial services, and government entities across the globe.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Design and implement batch and real time ingestion pipelines from internal and external sources.
Implement automated data quality checks, observability, and SLA monitoring.
Optimise datasets and pipelines for analytics, ML training, and API consumption.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.
Design, build, and maintain scalable data infrastructure using modern cloud technologies.
Develop robust batch and streaming data pipelines to ingest, process, and serve data.
Contribute to the implementation of a modern data lakehouse architecture.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. The system identifies the top-fitting candidates and shares this shortlist with the hiring company.
Contribute to the design and implementation of scalable data solutions.
Build and optimize batch and streaming ingestion pipelines.
Ensure data quality, reliability, and performance across pipelines and datasets.
Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.
Build streaming and batch pipelines that ingest, normalise, and distribute market, trading, and portfolio data.
Build the self-serve tooling so other teams publish, consume, and build on data products without waiting.
Own data contracts and schema evolution; keep schema changes from turning into multi-team coordination events.
Keyrock is a change-maker in the digital asset space, renowned for its partnerships and innovation. They have over 250 team members around the world with diverse backgrounds and hubs in London, Brussels, and Singapore, hosting regular online and offline hangouts.
Build, maintain, and run CI/CD pipelines and infrastructure-as-code for the Smile Digital Health platform.
Provision, configure, and operate cloud-based Spark clusters and distributed data processing environments.
Design and maintain scalable, secure infrastructure templates and deployment automation across cloud environments.
Smile Digital Health makes it easy for healthcare stakeholders to collect and exchange data with our leading FHIR-based data liberation platform. At its heart, the Smile platform enables people and organizations to better manage healthcare data; the company was #19 on Deloitte's Technology Fast 50 Ranking for 2024!
Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.
Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.
Design and implement robust data infrastructure in AWS, using Spark with Scala.
Evolve our core data pipelines to efficiently scale for our massive growth.
Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.
tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.
Architect and evolve Affirm's lakehouse analytics platform, driving strategy around Snowflake, Apache Iceberg, and Spark.
Design and implement RBAC and dynamic data masking policies in Snowflake, ensuring data access is secure, compliant, and auditable.
Lead the technical direction of analytics engineering practices, including data modeling, transformation pipelines (dbt), and data quality frameworks.
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They pride themselves on their culture across engineering of engaging in thorough technical design review, operational excellence, and capable incident response and analysis.