Source Job

20 jobs similar to Manager Data Extraction - OpenData EMEA (Remote)

Jobs ranked by similarity.

North America

  • Lead, train, and manage our in-house data labeling team.
  • Define, execute, and continuously improve data annotation processes with a very high attention to detail.
  • Ensure high-quality data outputs and meet rigorous accuracy and consistency standards.

Reducto provides a complete toolkit for handling any workflow by understanding documents the way a human would. They have raised over $100M and partner with hundreds of companies, from leading AI teams to enterprise costumers across FAANG and top trading firms.

Global 6w PTO

  • Development of various services in Python: integration with marketing partners, obtaining data from various sources.
  • Creation and support of processes on Airflow.
  • Supporting the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.

Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world and we are proud to be a two-time “Great Place to Work” winner.

Global

  • Building pipelines that augment documents with metadata.
  • Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
  • Optimizing and evaluating our core utils, which do things like extracting and resolving citations.

We are hiring a senior software/data engineer to help build the largest case law dataset. Our data coverage includes US laws and court decisions and powers our lawyer-facing AI platform and B2B data services.

$180,000–$290,000/yr
Americas Unlimited PTO 12w maternity 12w paternity

  • Own the scrape product end-to-end.
  • Make 'just works' actually true, pushing the 'just works' rate from great to unbeatable, one long-tail failure mode at a time.
  • Obsess over the output, not just the fetch.

Firecrawl is the easiest way to extract data from the web. Developers use them to reliably convert URLs into LLM-ready markdown or structured data with a single API call. They're a small, fast-moving, technical team building essential infrastructure superintelligence will use to gather data on the web.

LATAM

  • Design, build, and maintain scalable data pipelines
  • Develop and optimize ETL processes to support data products
  • Work with structured and unstructured data across SQL and NoSQL systems

They are seeking a Data Engineer to support the development of data products that power critical business functions. They seem to have a collaborative, cross-functional Agile environment where you'll partner closely with technical and business teams to deliver high-quality data solutions.

Europe

  • Lead technical strategy for AddSearch's mature search platform and evolving RAG solution.
  • Mentor a team of 6 experienced engineers and foster a culture of technical excellence.
  • Collaborate with CEO, product, sales, and marketing leadership on product roadmap.

Saas.group turbocharges promising B2B SaaS ventures, unlocking their full potential. As a Software-as-a-Service portfolio powerhouse, they specialize in acquiring small software treasures and polishing them into industry stars, with a dynamic, fully remote team of nearly 380+ colleagues spanning 50+ countries.

  • Continue building and maturing our data platform.
  • Lead a team of data engineers, own the technical roadmap for our cloud data stack (Snowflake, Azure, dbt), and partner closely with stakeholders.
  • Establish and enforce organization-wide ETL/ELT best practices, naming conventions, and code review standards.

Magna Legal Services provides end-to-end legal support services to law firms, corporations, and governmental agencies throughout the nation. As an end-to-end service provider, they offer strategic advantages to their clients by offering legal support services at every stage of their legal proceedings.

$190,000–$210,000/yr
US Unlimited PTO

  • Lead, coach, and develop a team of analytics engineers and/or data engineers.
  • Ensure on-time delivery of client data integrations by owning enterprise data model standards and maintaining consistent, governed data definitions.
  • Oversee client data pipelines using modern tooling (dbt, Airflow, Snowflake, AWS, Python) to ensure reliable operation and uptime.

SmarterDx builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.

Global

  • Guide and develop a remote team of researchers responsible for identifying and maintaining global healthcare reference data while ensuring compliance and quality standards.
  • Design data production processes and drive improvements in both process and technology, fully owning team performance metrics for efficiency and quality.
  • Collaborate with engineering teams on automation, escalate risks early, and thrive in a fast-paced, multi-cultural remote environment requiring strong leadership and operational skills.

Veeva Systems is a mission-driven pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As a fast-growing public benefit corporation with over $3B in revenue, it fosters a culture of speed, customer success, and employee success in a flexible, Work Anywhere environment.

South America

  • Lead and mentor a team of data engineers, ensuring best practices and high-quality deliverables.
  • Own and drive the end-to-end migration strategy from Snowflake on AWS to Azure Databricks and DBT.
  • Architect and oversee scalable, secure, and reliable data pipelines and infrastructure in Azure.

Coforge is a global digital services and solutions provider. They offer services in areas like cloud, data, and engineering. The company seems to have a culture of non-discrimination and hires based on skills.

$160,000–$190,000/yr
US Canada Unlimited PTO

  • Own and maintain data pipeline architectures, ensuring reliability and monitoring.
  • Manage and evolve data modeling environments for analysts and engineers.
  • Implement observability for data systems, detecting issues early and continuously monitoring data quality.

Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.

LATAM

  • Build and optimize scalable data pipelines using Python and dbt.
  • Design and maintain Snowflake warehouse structures, database tables, and performant data models.
  • Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.

We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.

Mexico

  • Contribute to the design and implementation of scalable data solutions.
  • Build and optimize batch and streaming ingestion pipelines.
  • Ensure data quality, reliability, and performance across pipelines and datasets.

Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.

$193,600–$253,000/yr
US

  • Lead architecture, system design and engineering efforts for high-scale, data-intensive B2B systems.
  • Design and implement batch and real-time processing architectures that are reliable, observable, and performant.
  • Mentor and coach engineers at all levels, and actively contribute to Omada’s engineering community.

Omada Health is a digital care provider that empowers people to achieve their health goals through sustainable behavioral change. They have served more than two million members and strive to build an inclusive culture where differences are celebrated.

South America

  • Design, build, and own scalable data pipelines and systems that power analytics, machine learning, and business operations.
  • Drive system design for data architecture, owning data models and storage solutions to create scalable foundations for the team.
  • Collaborate with engineering, product, and data teams to translate business needs into technical solutions, ensuring data quality and performance standards.

Goodway Group is a remote-first, data-driven, and technology-enabled digital media and marketing services firm with a 90+ year history, offering the security of an established company with a start-up feel. It is a diverse team of strategists, practitioners, technologists, and data scientists that is recognized as a top workplace and a certified partner to The Trade Desk.

$140,000–$175,000/yr
US

  • Deploy new data pipelines.
  • Design & build data observability platforms and metrics.
  • Build metadata driven pipeline solutions.

Fuze Health puts patients first and tirelessly addresses the most pressing needs in healthcare. They empower millions to digitally connect with care providers, essential health resources and needed treatments. The company is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors.

Global

  • Build streaming and batch pipelines that ingest, normalise, and distribute market, trading, and portfolio data.
  • Build the self-serve tooling so other teams publish, consume, and build on data products without waiting.
  • Own data contracts and schema evolution; keep schema changes from turning into multi-team coordination events.

Keyrock is a change-maker in the digital asset space, renowned for its partnerships and innovation. They have over 250 team members around the world with diverse backgrounds and hubs in London, Brussels, and Singapore, hosting regular online and offline hangouts.

Global

  • Co-design, prepare and deliver KPIs to measure commercial and sales performance.
  • Design and develop requested KPIs from different areas and countries.
  • Define and implement initiatives to improve information homologation and standardization.

Encora is a global company that offers Software and Digital Engineering solutions. They hire professionals based solely on skills and do not discriminate, focusing on practices like Cloud Services, Product Engineering, Data & Analytics, and AI.

Argentina Brazil Mexico Colombia Costa Rica

  • Transform raw data into business insights, working closely with stakeholders.
  • Apply software engineering principles like version control to the analytics codebase.
  • Implement validation checks and automated testing procedures to manage data quality.

Newsela is a leading education technology company dedicated to meaningful classroom learning for every student. They deliver integrated, AI-powered solutions designed to unlock student engagement, empower teachers, and drive meaningful learning outcomes.

$99,500–$136,800/yr
US 14w maternity

  • Craft fault-tolerant data pipelines and distributed systems to support millions of students.
  • Effective communicator who collaborates well with distributed engineering, product, and design teams.
  • Ensure that timely, accurate data and metrics are delivered consistently.

Renaissance is a global leader in pre-K–12 education technology. Their solutions help educators analyze, customize, and plan personalized learning paths for students. They are used in over one-third of US schools and in more than 100 countries worldwide.