Source Job

Germany Unlimited PTO

  • Translate business requirements into technical specifications for AWS-based data architecture.
  • Manage and optimize Data Warehouses & Data Lakes.
  • Design and administer data architecture in compliance with PCI, GDPR, and CCPA.

SQL AWS Kafka Python Java

20 jobs similar to Data Engineer

Jobs ranked by similarity.

Latin America

  • Design and evolve scalable data platforms, ensuring reliability and governance.
  • Define data architecture standards, models, and integration patterns for business needs.
  • Collaborate with stakeholders to translate requirements into cloud-based data solutions.

Nortal shapes digital transformation with complex solutions for global enterprises and the public sector. With over 25 years of experience and 160+ new hires yearly, the company fosters a culture of autonomy, open communication, and diversity.

India

  • Design scalable data pipelines and backend systems from the ground up.
  • Leverage AWS and GCP for real-time and batch processing.
  • Manage databases and Data Warehouses, optimizing ETL workflows.

Delivery Solutions, a UPS company, is looking for a Senior Data Engineer to join their team. They are a growing company.

Mexico

  • Contribute to the design and implementation of scalable data solutions.
  • Build and optimize batch and streaming ingestion pipelines.
  • Ensure data quality, reliability, and performance across pipelines and datasets.

Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.

  • Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
  • Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
  • Implement data contracts between back-office and the platform.

Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.

$110,000–$125,000/yr
US Unlimited PTO 12w paternity

  • Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
  • Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
  • Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.

InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.

Global

  • Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
  • Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
  • Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.

Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.

Latin America

  • Develop and maintain data models for core package application and reporting databases.
  • Monitor execution and performance of daily pipelines and escalate issues.
  • Collaborate with analytics and business teams to improve data models and pipelines.

Bluelight Consulting is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

US

  • Design, implement, and maintain ingestion paths for multiple data stream types.
  • Support relational database selection, schema design, and configuration.
  • Coordinate with the platform security team on S3 encryption and access policy requirements.

LMI is a digital solutions provider dedicated to accelerating government impact with innovation and speed. With a focus on agility and collaboration, LMI serves the defense, space, healthcare, and energy sectors helping agencies navigate complexity and outpace change.

Global

  • Lead data architecture, pipeline development, and data integrations on a generative AI platform to automate enterprise workflows.
  • Design and implement multi-zone enterprise data lakes on AWS S3 with batch and streaming ingestion pipelines.
  • Develop and deploy ML models on AWS SageMaker for use cases like lead scoring and predictive maintenance.

Capnexus is a comprehensive services provider specializing in designing, building, and supporting retail software. The company follows a build-as-a-service model with a culture built on outcomes and delivery, employing outstanding professionals across various platforms and verticals.

Bulgaria Georgia Poland Romania

  • Assess current source systems, data flows, integrations, and architectural constraints to identify gaps and risks.
  • Define the target-state data platform architecture, recommending tools like Azure, Databricks, or AWS based on business value and scalability.
  • Support AI readiness assessment and provide input into data governance, security, and compliance principles.

Exadel is a global tech company with over 25 years of engineering leadership, serving Fortune 500 clients like HBO, Microsoft, Google, and Starbucks. With 2,000+ team members and 500+ active projects, we foster an ambitious, collaborative culture.

$4,200–$5,200/mo
Global

  • Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
  • Process and integrate data from multiple formats and sources (JSON, CSV, XML).
  • Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.

I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.

US Unlimited PTO

  • Support clients to take control of their data and get value out of it by defining a reference architecture.
  • Define and implement a roadmap on data architecture, data management, business intelligence, or analytics solutions.
  • Design large data platforms to enable Data Engineers, Analysts & scientists.

3Pillar Global is a company that powers the brands you know. They are a fast-growing global team of thousands of professionals who deliver exceptional products and services for their clients.

$190,000–$280,500/yr
US Canada

  • Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
  • Architect and evolve our CI/CD processes - enhancing the testing environment and observability.
  • Enhance our Claude Code / LLM development support capabilities - creating tools / skills / agents that give our LLMs more context and help us continually improve their abilities to debug, create code, and maintain systems.

Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees and delivers peace of mind and enhances everyday family life.

Canada

  • Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
  • Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
  • Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.

Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of legacy, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.

$145,000–$200,000/yr
US Unlimited PTO

  • Design and build ETL processes in collaboration with software and model development teams.
  • Create and maintain scalable data infrastructure.
  • Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.

OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.

US

  • Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
  • Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
  • Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.

Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.

Global

  • Build streaming and batch pipelines that ingest, normalise, and distribute market, trading, and portfolio data.
  • Build the self-serve tooling so other teams publish, consume, and build on data products without waiting.
  • Own data contracts and schema evolution; keep schema changes from turning into multi-team coordination events.

Keyrock is a change-maker in the digital asset space, renowned for its partnerships and innovation. They have over 250 team members around the world with diverse backgrounds and hubs in London, Brussels, and Singapore, hosting regular online and offline hangouts.

Global

  • Be the Snowflake and dbt expert, guiding best practices and making key design decisions for a new multi-dimensional data model and data warehouse.
  • Build and maintain pipelines to ingest data from Salesforce and other sources, and transform data using dbt with Medallion architecture.
  • Work closely with the Business Intelligence team to monitor and implement enhancement requests for the data warehouse to meet stakeholder reporting demands.

Lyra Health is the leading provider of mental health solutions for employers, supporting over 20 million people globally. The company has delivered 13 million sessions of mental health care and is transforming access through its AI-powered platform Lyra Empower.

Latin America

  • Design, develop, and maintain ETL data engineering processes using Python (PySpark) and Azure Synapse Analytics.
  • Apply expertise in data warehousing to create effective data storage structures in a Massively Parallel Processing SQL Pool.
  • Collaborate with cross-functional teams to understand data requirements and provide support for data-related initiatives.

Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.