Source Job

Canada

  • Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
  • Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
  • Define data governance policies and leverage AI tools to streamline data pipeline development.

Python SQL AWS PySpark Databricks

20 jobs similar to Data Engineer

Jobs ranked by similarity.

US Canada

  • Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
  • Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
  • Participate in architectural decisions and evangelize data engineering best practices.

OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.

Canada

  • Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
  • Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
  • Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.

Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of legacy, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.

Global

  • Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
  • Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
  • Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.

Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.

Mexico

  • Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
  • Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
  • Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.

Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.

US Unlimited PTO

  • Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
  • Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
  • Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.

YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.

$4,200–$5,200/mo
Global

  • Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
  • Process and integrate data from multiple formats and sources (JSON, CSV, XML).
  • Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.

I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

Latin America

  • Develop and maintain data models for core package application and reporting databases.
  • Monitor execution and performance of daily pipelines and escalate issues.
  • Collaborate with analytics and business teams to improve data models and pipelines.

Bluelight Consulting is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.

  • Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
  • Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
  • Implement data contracts between back-office and the platform.

Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.

Canada

  • Architect and lead the implementation of an enterprise lakehouse on Databricks across major clouds.
  • Design scalable batch and streaming data pipelines using PySpark, Spark SQL, and Delta Live Tables.
  • Define and enforce platform standards for data modeling, CI/CD, governance, and cost optimization.

Bounteous is a premier end-to-end digital transformation consultancy partnering with ambitious brands to create digital solutions. With over 4,000 expert team members across the Americas, APAC, and EMEA, we deliver innovative solutions in Strategy, Analytics, Digital Engineering, Cloud, Data & AI, Experience Design, and Marketing.

Brazil

  • Design, develop, and maintain scalable ETL/ELT data pipelines to ingest, transform, and load data into data warehouses.
  • Implement and monitor data quality frameworks, ensuring accuracy, consistency, and reliability across datasets.
  • Collaborate with data scientists, analysts, and business stakeholders to deliver effective data solutions.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They focus on efficient and fair candidate evaluation through technology.

Brazil

  • Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
  • Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
  • Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.

This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.

US

  • Design and maintain production-grade ETL/ELT pipelines in a multi-hundred terabyte Snowflake environment.
  • Translate client loyalty program requirements into dimensional models and platform tables.
  • Build reliable, event-driven data architecture to support AI-powered loyalty products.

Kobie is a leader in loyalty solutions, helping brands build lasting emotional connections with consumers. Named a Top Workplace in the USA, the company fosters a collaborative, growth-focused culture with a diverse suite of benefits and flexible work arrangements.

Canada Unlimited PTO 12w maternity 12w paternity

  • Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
  • Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
  • Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.

Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.

US

  • Play a crucial role in helping client organizations transform raw data into reliable, well-modeled assets that drive business decisions.
  • Design, build, and maintain scalable data pipelines and ELT workflows, with Databricks as the primary platform.
  • Collaborate with data engineers, analysts, and clients on end-to-end data requirements and project delivery.

Velir is an established mid-sized agency with a top-tier portfolio of clients, ranging from the world’s largest non-profits to Fortune 500 brands. Our culture is built on a foundation of trust, collaboration, and continued improvement, and we are a remote first company that offers competitive pay and excellent benefits.

US

  • Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
  • Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
  • Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.

Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.

US

  • Own data pipeline development. Build and maintain reliable pipelines that ingest, transform, and deliver healthcare data across the organization.
  • Design warehouse data models. Create scalable schemas and data structures that support analytics, reporting, and evidence generation.
  • Lead data transformation strategy. Establish frameworks and standards that improve consistency, maintainability, and performance.

Pivotal Health builds a technology platform to help healthcare providers get paid fairly in complex reimbursement landscapes. The company is a collaborative, low-ego team on a mission to make healthcare reimbursement fairer.

LATAM

  • Build and optimize scalable data pipelines using Python and dbt.
  • Design and maintain Snowflake warehouse structures, database tables, and performant data models.
  • Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.

We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.

Mexico 3w PTO

  • Collaborate with U.S.-based clients to architect data solutions and translate requirements into technical specifications.
  • Design and build batch and real-time data pipelines, automate ETL processes, and ensure data accuracy and security.
  • Leverage advanced SQL, Python, and cloud data warehouse technologies to drive data-driven decision making for clients.

3Pillar Global is an AI transformation partner that helps enterprises build AI-native products and intelligent agents. With teams across North America, Europe, Latin America, and Asia, they foster a global, collaborative culture focused on modernizing and competing in the digital era.

Latin America

  • Design, develop, and maintain ETL data engineering processes using Python (PySpark) and Azure Synapse Analytics.
  • Apply expertise in data warehousing to create effective data storage structures in a Massively Parallel Processing SQL Pool.
  • Collaborate with cross-functional teams to understand data requirements and provide support for data-related initiatives.

Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.