Work cross-functionally with Product and subject matter experts to conceptualize, prototype, and build data solutions
Connect disparate datasets (e.g. claims, contract rates, demographics data) to empower internal and external stakeholders
Build and maintain data engineering systems that support AI use cases, including scalable ingestion pipelines, feature generation, and downstream products
Turquoise Health aims to make healthcare pricing simpler, more transparent, and lower cost. They are a Series B startup backed by top VCs, with an accomplished team that is passionate about improving healthcare.
Build and manage business data pipelines and transform Firefox telemetry data into structured datasets.
Partner with data scientists, product, and marketing teams to turn datasets into models and metrics.
Ensure data accuracy and performance using observability tools and resolve data issues.
Mozilla Corporation is a technology company backed by a non-profit that has shaped the internet, creating brands like Firefox. With millions of users globally, they work in areas including AI and social media while remaining focused on making the internet better for people.
Design, implement, and maintain robust, scalable data pipelines to support AI, analytics, and operational reporting
Own and evolve the data warehouse architecture, ensuring it meets performance, flexibility, and governance needs
Ensure data integrity, availability, lineage, and observability across complex pipelines
Remote People is building the infrastructure to power borderless teams. Their technology handles global payroll, benefits, taxes, and compliance, enabling businesses to compliantly hire anyone anywhere at the push of a button. They are a growing, international family.
Configure a chat client (VSCode) to interact with the configured MCP Server(s)
Create new MCP Tools to capture needed metrics from the running server
Implement the RAG workflow using the HCL Informix database, Actian MCP for HCL Informix, and the Actian HCL Vector Blade for HCL Informix.
Actian believes data should be a competitive advantage. They are a trusted leader in data management, integration, and analytics with a global team of experts and a culture of innovation.
Responsible for building core infrastructure software (pipelines, APIs, data modelling) as part of our client's data platform team.
Coach & mentor other engineers to support the growth of their technical expertise.
Implementing the appropriate technologies for scaling data access patterns, batch processing, and data streaming for soft real-time consumption.
YLD is a software engineering and design consultancy that creates digital capabilities for their clients. The company has offices in London, Lisbon, and Porto and aims to attract, inspire, develop, and retain extraordinary people.
Architect, design, implement, and operate end-to-end data engineering solutions using Agile methodology.
Develop and manage robust data integrations with external vendors and organizations (including complex API integrations).
Collaborate closely with Data Analysts, Data Scientists, DBAs, and cross-functional teams to understand requirements and deliver high-impact data solutions.
SmartAsset is an online destination for consumer-focused financial information and advice, whose mission is helping people make smart financial decisions, reaching over an estimated 59 million people each month. A successful $110 million Series D funding round in 2021 valued the company at over $1 billion.
Design and develop high‑performance data converters for multi‑sensor autonomous‑driving data.
Design, build, and optimize large‑scale ingestion and transformation pipelines capable of processing petabyte‑scale autonomous‑driving sensor data.
Implement automated data validation, quality checks, and lineage tracking to ensure reliability of production datasets.
Torc has been a leader in autonomous driving since 2007 and is now part of the Daimler family. They are focused solely on developing software for automated trucks to transform how the world moves freight and have a collaborative, energetic, and team-focused culture.
Create innovative solutions for handling petabytes of data with billions of rows & joins.
Create real-time and offline feature generation pipelines and manage our data infrastructure to be reliable and fast!
Develop and productionize data pipelines for our ML models in both bare-metal and cloud environments.
Kayzen is a mobile demand-side platform (DSP) dedicated to democratizing programmatic advertising. They enable leading apps, agencies, media buyers, and brands to run programmatic customer acquisition, retargeting, and brand performance campaigns through their self-serve and managed service options.
Build and maintain robust data pipelines processing large volumes of data
Update and optimise our data platform for speed, scalability and cost
Develop processes and tools to monitor and analyse model performance and data accuracy
Moniepoint is Africa's all-in-one financial ecosystem, empowering businesses and their customers with seamless payment, banking, credit, and management tools. They processed $182 billion in 2023 and are Nigeria’s largest merchant acquirer, cultivating a culture of innovation, teamwork, and growth.
Lead the architecture and evolution of scalable, distributed data pipelines, ensuring high availability and performance at scale
Build and maintain distributed web scraping systems using tools such as Playwright, Selenium, and BeautifulSoup
Integrate AI and LLMs into engineering workflows for code generation, automation, and optimization
MercatorAI is building scalable data infrastructure to power high-quality, data-driven decision making at scale. As an early-stage company, the team is focused on creating robust, future-ready systems that can handle complex data ingestion, transformation, and delivery across a growing national footprint.
Build pipelines to load data from various systems into Dataiku via S3 or Snowflake.
Increase the robustness of existing production pipelines, identify bottlenecks, and set up robust monitoring, testing processes, and documentation templates.
Build custom applications and integrations to automate manual tasks related to customer operations to help Product Operations / Support / SRE in their day-to-day activities
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.
Collaborate closely with business stakeholders and other engineers to deliver impactful solutions.
Integrate services and product features with databases and messaging queues.
Contribute to the development of our MLOps tools for ML models.
Trellis is rewriting the insurance experience from the inside out. They are a profitable, fast-growing Series A startup backed by General Catalyst, QED, NYCA, and Amex Ventures that brings clarity and ease to insurance shopping.
Extend, optimize, and maintain core data models for reports, machine learning, and generative AI.
Implement automation and operationalize ML models to streamline operational processes and improve efficiency.
Partner with engineering, product, and analytics teams to deliver seamless integrations and customer-facing data products.
Boulevard provides a client experience platform for appointment-based, self-care businesses, helping customers enhance client experiences. They value diversity and inclusivity, offering equal opportunities and aiming to create a supportive work environment.
Lead and grow a team of data engineers, providing mentorship and technical guidance.
Own execution of customer integrations across multiple product lines, ensuring on-time delivery.
Improve data quality and pipeline reliability by investing in better alerting and resilience.
Afresh is the leading AI company in fresh food, partnering with grocers to order billions of dollars of fresh food. They are on a mission to eliminate food waste and make fresh food accessible to all, and have saved 200M lbs of food waste in 2025 alone.
Own end-to-end development of agentic and AI-integrated workflows: design, implementation, testing, deployment, and maintenance
Build MCP servers, CLIs, APIs, and microservices that connect AI models to business systems: Salesforce, BigQuery, Slack, HubSpot, email, calendars, analytics tools
Scope high-impact automation problems autonomously by shadowing Sales, Customer Success, and Marketing teams to identify efficiency gaps
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana, the open source visualization tool, around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack.
Design, build, and maintain data products that support R&D, analytics, Lab, and scientific workflows.
Build and maintain data pipelines for large and complex datasets ensuring high data quality.
Partner with scientists and engineers to translate research needs into reusable data assets.
Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health. They aim to make personalized genetic testing and diagnostics part of the standard of care to protect health and enable earlier and more targeted interventions that lead to longer, healthier lives.
Design, develop, and maintain scalable data pipelines using cloud data services.
Serve as a technical leader, defining data engineering standards and best practices.
Lead the design and implementation of optimized data models in our cloud data warehouse.
Constant Contact empowers people by giving them the help and tools they need to grow online. They are energized by new challenges and possibilities, and they celebrate diversity and inclusion with programs in place to bring people together.
QAD Inc. is a leading provider of adaptive, cloud-based enterprise software and services for global manufacturing companies. They help customers in various industries rapidly adapt to change and innovate for competitive advantage.
Design, build, and maintain reliable ETL pipelines, integrating data from multiple sources into the Google Cloud Data Warehouse.
Own the product data structure, mapping product features and behaviors to analytics-ready data models, and define meaningful KPIs.
Act as the primary bridge between Backend Engineering and BI, owning the flow from data production to analytics consumption.
TuoTempo, part of the Docplanner group since 2019, develops the market-leading CRM solution dedicated to hospitals, medical centers, and health insurance providers. The platform manages and automates the entire patient journey, centralizing contacts, communications, and processes in a single modular system integrated with the software already used by organizations.
Design, build, and maintain scalable batch and real-time data pipelines that power analytics, experimentation, and machine learning
Partner cross-functionally with analytics, product, engineering and operations to deliver high-quality data solutions that drive measurable business impact
Champion data quality, reliability, and observability by implementing best practices in testing, monitoring, lineage, and incident response
Gopuff is reimagining how people purchase everyday essentials, from snacks to household goods to alcohol, all delivered in minutes. They are assembling a team of thinkers, dreamers and risk-takers who know the value of peace of mind in an unpredictable world.