Source Job

Global

  • Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
  • Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
  • Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.

SQL Python PySpark Microsoft Fabric Databricks

20 jobs similar to Data Engineer

Jobs ranked by similarity.

Global

  • Design, build, and maintain scalable data pipelines in Microsoft Fabric using pipelines, Dataflows Gen2, and notebooks.
  • Integrate and consolidate data from multiple enterprise sources (ERP, CRM, APIs) into a centralized Lakehouse platform.
  • Develop and manage Bronze, Silver, and Gold layers, ensuring data is structured, clean, and business-ready.

Anord Mardix, a Flex company, is a global leader in critical power solutions supporting industries from financial institutions to data centers. The Flex family has ~160,000 members in 30 countries with a values-driven, high-performance culture focused on doing the right thing, collaboration, and resilience.

Brazil

  • Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
  • Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
  • Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.

This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.

US Unlimited PTO

  • Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
  • Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
  • Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.

YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.

$145,000–$200,000/yr
US Unlimited PTO

  • Design and build ETL processes in collaboration with software and model development teams.
  • Create and maintain scalable data infrastructure.
  • Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.

OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.

Global

  • Design and build end-to-end data pipelines across the RAW, Silver, and Gold layers of the Medallion Architecture.
  • Architect data ingestion, transformation, standardization, and serving processes, that structure data flows from diverse and heterogeneous sources into a coherent analytical foundation.
  • Model data for analytical consumption following Data Warehouse best practices, including Star Schema design and dimensional modeling suited for business intelligence and AI-readiness.

CI&T is a tech transformation specialist, uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters around the world, they’ve built partnerships with more than 1,000 clients during their 30 years of history, valuing diverse identities and life experiences.

$4,200–$5,200/mo
Global

  • Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
  • Process and integrate data from multiple formats and sources (JSON, CSV, XML).
  • Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.

I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.

EU

  • Design and optimize scalable data pipelines and architectures for Data & AI initiatives.
  • Build cloud-native solutions using Azure, Databricks, and big data technologies.
  • Collaborate with business stakeholders to deliver data-driven solutions and contribute to a strong data culture.

Redcare Pharmacy is Europe's leading e-pharmacy, driven by innovation and a mission to ensure every human has access to health. The company fosters a collaborative and healthy work environment, with a team passionate about cutting-edge technology and data-driven solutions.

LATAM

  • Build and optimize scalable data pipelines using Python and dbt.
  • Design and maintain Snowflake warehouse structures, database tables, and performant data models.
  • Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.

We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.

$110,000–$125,000/yr
US Unlimited PTO 12w paternity

  • Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
  • Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
  • Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.

InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.

Canada Unlimited PTO 12w maternity 12w paternity

  • Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
  • Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
  • Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.

Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.

Canada

  • Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
  • Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
  • Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.

Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of legacy, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.

US

  • Design, build, and operate data pipelines for analytics and AI/ML capabilities.
  • Architect ingestion, transformation, and storage pipelines across diverse data sources.
  • Implement data models suitable for analytics and BI consumption.

Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They identify the top-fitting candidates and share the shortlist directly with the hiring company.

US

  • Owns organizational-wide data architecture, defining standards, patterns, and designs that our teams will implement.
  • Reviews data-related designs and implementations across teams for architectural consistency, performance, and scalability.
  • Designs and develops data pipelines, integrations, and platform features with performance and scalability in mind.

Tenna provides a platform that revolutionizes construction equipment fleet operations. They provide innovative solutions to customers looking for competitive ways to better manage and track their assets, such as heavy and light equipment, large fleets, tools, and materials. They value quality-obsessed, gritty, continuous learners, and collaborative problem solvers.

US

  • Build and maintain ELT pipelines using Fivetran and custom API integrations via Dagster, including setting up new source connectors and troubleshooting.
  • Manage and improve our Dagster orchestration — scheduling jobs, debugging failures, and keeping pipeline runtime tight.
  • Maintain and improve our GitHub Actions CI/CD workflows to ensure safe, reliable deployment of dbt models into production.

Wisp is on a mission to put healthcare back in patients’ hands by connecting them with hassle-free sexual + reproductive care online. They are a growing, fully-remote team in the United States looking for collaborators who are committed to their mission.

$86,400–$138,600/yr
US

  • Design, develop, and maintain scalable data pipelines and infrastructure.
  • Build and optimize data warehouses, databases, and data models.
  • Implement and maintain data governance and security practices.

Jobgether is a company that uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They connect candidates with companies; their culture is collaborative and inclusive, focused on innovation and growth.

US

  • Design and implement medallion architecture using Delta Lake.
  • Build and optimize scalable data pipelines using Apache Spark.
  • Implement Unity Catalog for data lineage and access control.

V2 Strategic Advisors transforms media and advertising sales organizations with management and technology consulting. They are a lean, elite team of consultants, technical architects, and data experts who operate with the rigor of global management consulting and the energy and agility of a startup.

  • Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
  • Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
  • Implement data contracts between back-office and the platform.

Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.

$75,000–$110,000/yr
US 5w PTO

  • Support the architecture, design, and development of scalable analytics and reporting solutions across enterprise data platforms.
  • Partner with business stakeholders to define analytical strategies, frame problems, and deliver insights that drive decision-making.
  • Design and implement end-to-end data pipelines and workflows using modern big data and cloud technologies.

Cotiviti provides payment accuracy and analytics-driven solutions, focusing on healthcare and retail sectors. They are committed to fostering a diverse and inclusive environment where team members can grow and thrive.

$90,000–$120,000/yr
US 4w PTO

  • Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
  • Collaborate cross-functionally with AI/ML and Product teams to implement new features.
  • Proactively identify and resolve bottlenecks in our complex ETL processes.

Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.

US

  • Design and build scalable cloud data pipelines for high-volume manufacturing and IoT data using Spark, Kafka, Airflow, and Delta Lake.
  • Implement medallion/lakehouse architectures on Databricks, Snowflake, AWS, or Azure with strong SQL and Python proficiency.
  • Apply manufacturing domain expertise in MES, SCADA, ERP, and industrial protocols to bridge OT/IT systems for real-time data extraction.

We are a Digital Product Engineering company that builds products, services, and experiences that inspire, excite, and delight. We have 17000+ experts across 39 countries and our culture is dynamic and non-hierarchical.