Source Job

Europe

  • Build and scale data infrastructure powering targeting, identity, and measurement capabilities.
  • Optimize core ETL/ELT pipelines and ensure operational reliability with documented SLAs.
  • Implement privacy-compliant data methodologies meeting GDPR/CCPA standards.

Python Spark Airflow AWS Kubernetes

20 jobs similar to Senior Data Engineer

Jobs ranked by similarity.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

$4,200–$5,200/mo
Global

  • Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
  • Process and integrate data from multiple formats and sources (JSON, CSV, XML).
  • Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.

I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.

Canada Unlimited PTO 12w maternity 12w paternity

  • Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
  • Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
  • Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.

Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.

Mexico

  • Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
  • Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
  • Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.

Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.

  • Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
  • Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
  • Implement data contracts between back-office and the platform.

Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.

LATAM

  • Build and optimize scalable data pipelines using Python and dbt.
  • Design and maintain Snowflake warehouse structures, database tables, and performant data models.
  • Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.

We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.

US

  • Build scalable Python-based data pipelines and backend services for analytics workflows.
  • Design software systems using object-oriented programming and sound engineering practices.
  • Create and support platforms for analytics development, model training, and model deployment.

Experian is a global data and technology company that powers opportunities for people and businesses worldwide across markets like financial services, healthcare, and automotive. With a team of 25,200 people in 32 countries, Experian invests in advanced technologies and its people to unlock the power of data.

Canada

  • Design, build, and operate high-scale data ingestion and replication systems from production data stores into the data lakehouse.
  • Build and maintain reliable, scalable data platform infrastructure capable of handling petabytes of data across analytics, AI, and operational use cases.
  • Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams move and access data safely.

Samsara is the pioneer of the Connected Operations Cloud, enabling organizations that depend on physical operations to harness IoT data for actionable insights. As a publicly traded company, Samsara fosters a growth-oriented culture and serves industries that represent over 40% of global GDP.

Germany Unlimited PTO

  • Translate business requirements into technical specifications for AWS-based data architecture.
  • Manage and optimize Data Warehouses & Data Lakes.
  • Design and administer data architecture in compliance with PCI, GDPR, and CCPA.

Aevi is a payments technology company providing a cloud-based platform for payments and transaction data. The company operates globally with offices in Europe, Australia, and the US and fosters a culture of innovation, diversity, and flexibility.

Canada

  • Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
  • Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
  • Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.

Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of legacy, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.

US

  • Design and maintain production-grade ETL/ELT pipelines in a multi-hundred terabyte Snowflake environment.
  • Translate client loyalty program requirements into dimensional models and platform tables.
  • Build reliable, event-driven data architecture to support AI-powered loyalty products.

Kobie is a leader in loyalty solutions, helping brands build lasting emotional connections with consumers. Named a Top Workplace in the USA, the company fosters a collaborative, growth-focused culture with a diverse suite of benefits and flexible work arrangements.

Global

  • Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
  • Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
  • Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.

Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.

US

  • Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
  • Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
  • Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.

Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.

US Unlimited PTO

  • Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
  • Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
  • Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.

YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.

Brazil

  • Design, develop, and maintain scalable ETL/ELT data pipelines to ingest, transform, and load data into data warehouses.
  • Implement and monitor data quality frameworks, ensuring accuracy, consistency, and reliability across datasets.
  • Collaborate with data scientists, analysts, and business stakeholders to deliver effective data solutions.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They focus on efficient and fair candidate evaluation through technology.

US Canada

  • Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
  • Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
  • Participate in architectural decisions and evangelize data engineering best practices.

OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.

Global

  • Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
  • Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
  • Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.

Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.

Brazil

  • Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
  • Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
  • Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.

This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.

Canada

  • Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
  • Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
  • Define data governance policies and leverage AI tools to streamline data pipeline development.

For over four decades, PAR Technology Corporation has been a leader in restaurant technology, empowering brands worldwide to create lasting connections with their guests. With over 100,000 restaurants in more than 110 countries, we embrace a 'Better Together' ethos and offer comprehensive software and hardware solutions.

Latin America

  • Design, develop, and maintain ETL data engineering processes using Python (PySpark) and Azure Synapse Analytics.
  • Apply expertise in data warehousing to create effective data storage structures in a Massively Parallel Processing SQL Pool.
  • Collaborate with cross-functional teams to understand data requirements and provide support for data-related initiatives.

Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.