Source Job

US Unlimited PTO

  • Design and improve data pipelines that process large, multi-modal datasets from internal and external sources into AI model training datasets.
  • Evolve our data storage layer to support analytics, schema evolution, reproducibility, and efficient data access.
  • Collaborate with ML engineers to improve the performance and reliability of Python-based data processing workflows.

ETL Python AWS Kubernetes

20 jobs similar to Data Engineer

Jobs ranked by similarity.

North America Asia Unlimited PTO

  • Design, implement, and maintain distributed ingestion pipelines for structured and unstructured data.
  • Build scalable ETL/ELT workflows to transform, validate, and enrich datasets for AI/ML model training and analytics.
  • Support preprocessing of unstructured assets for training pipelines, including format conversion, normalization, augmentation, and metadata extraction.

Meshy is a leading 3D generative AI company transforming content creation by enabling the creation of 3D models from text and images. They have a global team distributed across North America, Asia, and Oceania and are backed by venture capital firms like Sequoia and GGV, with $52 Million in funding.

Global

  • Design, build, and operate scheduled and event-driven data pipelines for simulation outputs, telemetry, logs, dashboards, and scenario metadata
  • Build and operate data storage systems (structured and semi-structured) optimized for scale, versioning, and replay
  • Support analytics, reporting, and ML workflows by exposing clean, well-documented datasets and APIs

Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. They transform this work, making the staff faster, smarter, and more efficient. The company is all-remote with employees working alongside customers; it was founded in 2019 and has raised $320m+.

US

  • Architect and lead the evolution of our modern data platform.
  • Design and build production LLM pipelines and infrastructure that power intelligent operations.
  • Own end-to-end data acquisition and integration architecture across diverse sources.

Brightwheel is the largest, fastest growing, and most loved platform in early ed. They are trusted by millions of educators and families every day. The team is passionate, talented, and customer-focused and embodies their Leadership Principles in their work and culture.

US

  • Build and own pipelines for the creation, curation, and processing of large-scale multimodal datasets.
  • Build and own ETL and CDC streams from Postgres and ClickHouse to analytics warehouses.
  • Manage production databases (Postgres, ClickHouse) and optimize for performance and reliability

Runway is building AI to simulate the world through merging art and science. They believe that world models are at the frontier of progress in artificial intelligence. The Runway team consists of creative, open minded, caring and ambitious people who are determined to change the world.

$135,500–$200,000/yr
US

  • Architect, design, implement, and operate end-to-end data engineering solutions.
  • Develop and manage robust data integrations with external vendors.
  • Collaborate closely with Data Analysts, Data Scientists, DBAs, and cross-functional teams.

SmartAsset is an online destination for consumer-focused financial information and advice, helping people make smart financial decisions. With over 59 million people reached each month, they operate SmartAsset Advisor Marketing Platform (AMP) to connect consumers with fiduciary financial advisors.

US Unlimited PTO

  • Design, build, and maintain robust data pipelines.
  • Own and scale ETL/ELT processes using tools like dbt, BigQuery, and Python.
  • Build modular data models that power analytics, product features, and LLM agents.

Jobgether is a platform that uses AI to match candidates with jobs. They aim to review applications quickly and fairly, ensuring the top-fitting candidates are identified and shared with hiring companies.

Data Engineer

Egen
$124,800–$145,600/hr

  • Migrate data and analytics workloads from BigQuery to Snowflake
  • Develop and optimize ETL/ELT pipelines using Python and SQL
  • Build analytics-ready datasets for reporting and dashboards

Egen is a fast-growing and entrepreneurial company with a data-first mindset. They bring together the best engineering talent working with the most advanced technology platforms to help clients drive action and impact through data and insights.

Brazil Canada US Latin America

  • Work alongside Caylent’s Architects, Engineering Managers, and Engineers to deliver AWS solutions.
  • Build solutions defined in project backlogs, writing production-ready, well-tested, and documented code across cloud environments.
  • Participate in Agile ceremonies such as daily standups, sprint planning, retrospectives, and demos.

Caylent is a cloud native services company that helps organizations bring the best out of their people and technology using Amazon Web Services (AWS). They are a global company and operate fully remote with employees in Canada, the United States, and Latin America fostering a community of technological curiosity.

$210,746–$240,000/yr
US

  • Design, build, and operate ETL pipelines at scale.
  • Design data structure for data products.
  • Develop and operate API/tools related to data products and machine learning products.

Mercari is a company that provides a marketplace platform. They value teamwork and provide career growth opportunities as the company continues to expand.

Global

  • Architect and maintain robust, scalable, and secure data infrastructure on AWS leveraging Databricks.
  • Design, develop, and maintain data pipelines, primarily using tools like Airbyte and custom-built services in Go, to automate data ingestion and ETL processes.
  • Oversee the creation and maintenance of the data lake, ensuring efficient storage, high data quality, and effective partitioning, organization, performance, monitoring and alerting.

Trust Wallet is the leading non-custodial cryptocurrency wallet, trusted by over 200 million people worldwide to securely manage and grow their digital assets. They aim to be a trusted personal companion — helping users safely navigate Web3, the on-chain economy, and the emerging AI-powered future.

US

  • Architect and maintain scalable, secure, and high-performing data pipelines to support analytics, reporting, and operational needs.
  • Develop and deploy production-grade data engineering code, ensuring reliability and performance across environments.
  • Manage end-to-end data workflows, including ingestion, transformation, modeling, and validation for multiple business systems.

Onebridge, a Marlabs Company, is a global AI and Data Analytics Consulting Firm that empowers organizations worldwide to drive better outcomes through data and technology. Since 2005, they have partnered with some of the largest healthcare, life sciences, financial services, and government entities across the globe.

$110,572–$145,000/yr
US Unlimited PTO

  • Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and design data models and schemas that facilitate data analysis and reporting
  • Design, develop, and maintain scalable and efficient data pipelines and ETL processes to ingest, process, and transform large volumes of data from various sources into usable formats
  • Build and optimize data storage and processing systems, including data warehouses, data lakes, and big data platforms, using AWS services such as Amazon Redshift, AWS Glue, AWS EMR, AWS S3, and AWS Lambda, to enable efficient data retrieval and analysis

ATPCO is the world's primary source for air fare content. They hold over 200 million fares across 160 countries and the travel industry relies on their technology and data solutions. ATPCO believes in flexibility, trust, and a culture where your wellbeing comes first.

$96,050–$113,000/yr
US

  • Creating and maintaining optimal data pipeline architecture.
  • Assembling large, complex data sets that meet functional & non-functional business requirements.
  • Building the infrastructure required for optimal extraction, transformation and loading of data from a wide variety of data sources using relevant technologies.

Mercer Advisors works with families to help them amplify and simplify their financial lives through integrated financial planning, investment management, tax, estate, and insurance services. They serve over 31,300 families in more than 90 cities across the U.S. and are ranked the #1 RIA Firm in the nation by Barron’s.

Latin America Canada US

  • Work alongside engineers, engineering managers, and project managers to deliver AWS solutions.
  • Guide Cayliens and Customers alike through Agile ceremonies like stand-ups and retrospectives.
  • Translate customer requirements into a workable backlog of tickets for engineers.

Caylent is a cloud native services company that helps organizations bring the best out of their people and technology using Amazon Web Services (AWS). They operate fully remote with employees in Canada, the United States, and Latin America and foster a community of technological curiosity.

Europe

  • Build ETL/ELT pipelines for extracting data from sources and placing it in target destinations.
  • Transform data into formats usable by AI-based solutions.
  • Manage datasets for AI model training and fine-tuning.

Jobgether is an AI-powered platform that connects job seekers with employers. They use AI to match candidates with roles and ensure applications are reviewed quickly and fairly.

$130,000–$130,000/yr
Americas Unlimited PTO

  • Build and evolve our semantic layer, design, document, and optimize dbt models.
  • Develop and maintain ETL/orchestration pipelines to ensure reliable and scalable data flow.
  • Partner with data analysts, scientists, and stakeholders to enable high-quality data access and experimentation.

Customer.io's platform is used by over 7,500 companies to send billions of emails, push notifications, in-app messages, and SMS every day. They power automated communication and help teams send smarter, more relevant messages using real-time behavioral data; their culture values empathy, transparency, and responsibility.

US

  • Design and maintain ETL pipelines that ingest, process, and load data into AWS Neptune.
  • Develop and evolve graph data models representing relationships across users, sessions, devices, and security events.
  • Integrate diverse data sources including S3, relational databases, streaming services, and APIs into a cohesive graph architecture

Keeper Security is transforming cybersecurity for organizations around the world with next-generation privileged access management. Keeper’s zero-trust and zero-knowledge cybersecurity solutions are FedRAMP and StateRAMP Authorized, FIPS 140-2 validated, as well as SOC 2 and ISO 27001 certified.

Data Engineer

540
US Unlimited PTO

  • Develop and maintain data pipelines and ETL processes using Python in AWS environments
  • Write and deploy AWS Lambda functions for data processing tasks
  • Collaborate with team members using Git for version control and code collaboration

540 is a forward-thinking company that the government turns to in order to #getshitdone. They break down barriers, build impactful technology, and solve mission-critical problems.

Latin America Canada US

  • Work alongside Caylent’s Engineers, Engineering Managers, and Project Managers to deliver AWS solutions.
  • Translate customer requirements into a workable backlog of tickets for engineers and delegate tickets to a team.
  • Troubleshoot and resolve issues in customer dev, test, and production environments, and automate software testing.

Caylent is a cloud native services company that helps organizations bring the best out of their people and technology using Amazon Web Services (AWS). At Caylent, their people always come first, and they operate fully remote with employees in Canada, the United States, and Latin America.

US Unlimited PTO

  • Partner with clients and implementation teams to understand data distribution requirements.
  • Design and develop data pipelines integrating with Databricks and Snowflake, ensuring accuracy and integrity.
  • Lead architecture and implementation of solutions for health plan clients, optimizing cloud-based technologies.

Abacus Insights is changing the way healthcare works by unlocking the power of data to enable the right care at the right time. Backed by $100M from top VCs, they're tackling big challenges in an industry that’s ready for change with a bold, curious, and collaborative team.