Source Job

Kenya

  • Support the design, build, and optimization of scalable data pipelines using cloud-based proprietary or open-source data platforms.
  • Implement best practices for data transformation and cleansing to ensure data integrity and reliability.
  • Ensure the stability, performance, and reliability of data platforms through proactive monitoring, CI/CD practices, and production issue resolution.

PySpark Python SQL Databricks Power BI

20 jobs similar to Associate, Data Engineering - DISC Project

Jobs ranked by similarity.

Global Unlimited PTO

  • Design, build, and maintain production data pipelines using Python, Prefect, Airflow, Jenkins or any other orchestration framework multi-phase algorithmic workflows.
  • Build and optimize advanced SQL transformations in Snowflake, including window functions, CTEs, stored procedures, UDFs, and semi-structured data processing.
  • Build and maintain dbt models for data transformation, identity resolution, and slowly changing dimension (SCD Type 2) tracking across 80+ models and multiple pipeline stages.

Kalibri helps to redefine and rebuild the hotel industry. They are looking for passionate, energetic, and hardworking people with an entrepreneurial spirit, who dream big and challenge the status quo; their team is working on cutting-edge solutions for the industry.

Europe

  • Build pipelines to load data from various systems into Dataiku via S3 or Snowflake.
  • Increase the robustness of existing production pipelines, identify bottlenecks, and set up a robust monitoring, testing processes, and documentation templates.
  • Build custom applications and integrations to automate manual tasks related to customer operations to help Product Operations / Support / SRE in their day-to-day activities

Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.

$106,000–$120,000/yr
US

  • Lead the technical onboarding of partner institutions onto UDTS.
  • Design, build, and maintain scalable data pipelines and architectures.
  • Collaborate with team members to set engineering standards and guide data infrastructure strategy.

DataKind is a non-profit organization that uses data science and AI to address global challenges. They work with various sectors like health, humanitarian action, climate, economic opportunity, and education to create data-driven tools.

$122,400–$195,500/yr
US

  • Contribute to architecture and implement robust data pipelines.
  • Drive the creation of a secure, compliant, and privacy-focused data warehousing solution.
  • Partner with the data analytics team to deliver a data platform that supports accurate, actionable reporting.

Headspace provides access to lifelong mental health support. They combine evidence-based content, clinical care, and innovative technology to help millions of members around the world get support that’s effective and personalized. The company values making the mission matter, iterating to great, owning the outcome, and connecting with courage.

$118,000–$148,000/yr
US

  • Design, build, and maintain scalable batch and real-time data pipelines that power analytics, experimentation, and machine learning
  • Partner cross-functionally with analytics, product, engineering and operations to deliver high-quality data solutions that drive measurable business impact
  • Champion data quality, reliability, and observability by implementing best practices in testing, monitoring, lineage, and incident response

Gopuff is reimagining how people purchase everyday essentials, from snacks to household goods to alcohol, all delivered in minutes. They are assembling a team of thinkers, dreamers and risk-takers who know the value of peace of mind in an unpredictable world.

Brazil

  • Design and implement data ingestion and transformation pipelines using PySpark/SparkSQL on Databricks.
  • Own data pipelines end-to-end in production: freshness, correctness, availability, and SLA adherence.
  • Build and maintain Delta Lake tables following medallion architecture patterns.

Pismo, founded in 2016, provides a comprehensive processing platform for banking, card issuing, and financial market infrastructure. With over 500 employees across more than 10 countries and now part of Visa, they empower firms to build and launch financial products rapidly with high security and availability standards.

Europe Asia

  • Create innovative solutions for handling peta-bytes of data with billions of rows & joins.
  • Create real time and offline features generation pipelines to managing our data infrastructure to be reliable and fast!
  • Develop and productionize data pipelines for our ML models in both bare-metal and the cloud environment.

Kayzen is a mobile demand-side platform (DSP) dedicated to democratizing programmatic advertising. They enable leading apps, agencies, media buyers, and brands to run programmatic customer acquisition, retargeting, and brand performance campaigns through their self-serve and managed service options.

$110,000–$135,000/yr
Canada

  • Design and implement scalable data architectures to support business needs.
  • Build and optimize data pipelines, ensuring data accessibility and security.
  • Develop and maintain data models, databases, and data lakes, with robust data governance.

Terawatt Infrastructure delivers large scale, turnkey charging solutions for companies rapidly deploying AV and EV fleets. With a growing portfolio of sites across the US, Terawatt is building the permanent transportation and logistics infrastructure of tomorrow through capital, real estate, development, and site operations solutions.

US North America

  • Enable self-service analytics for all team members by designing clean, intuitive data models and metrics through dbt, empowering employees to make informed, data-driven decisions.
  • Develop and refine custom data pipelines that ingest data from operational systems to our analytics platform, handling both streaming and batch data using third-party tooling and home-grown solutions
  • Maintain and optimize the data platform infrastructure, focusing on data quality, ELT efficiency, and platform hygiene.

Auto Integrate makes leased vehicle maintenance frictionless for millions of customers in the USA and Canada. The business is managed by a small, global team within Fleetio, combining the resources of a scaled SaaS company with the agility of a niche market leader.

$179,469–$242,811/yr
US

  • Lead and grow a team of data engineers, providing mentorship and technical guidance.
  • Own execution of customer integrations across multiple product lines, ensuring on-time delivery.
  • Improve data quality and pipeline reliability by investing in better alerting and resilience.

Afresh is the leading AI company in fresh food, partnering with grocers to order billions of dollars of fresh food. They are on a mission to eliminate food waste and make fresh food accessible to all and has saved 200M lbs of food waste in 2025 alone.

$140,000–$160,000/yr
US 4w PTO

  • Own and evolve the data infrastructure that powers Clever's core data products.
  • Maintain and improve data pipeline reliability, monitoring and resolving pipeline failures.
  • Design and implement ingestion for new operational data sources that support Clever's speed-to-match initiative.

Clever Real Estate is a venture-backed technology company aiming to revolutionize real estate transactions. They have built a leading online education platform helping consumers save money and have earned a 4.9 TrustPilot rating with over 3,800 reviews.

$219,625–$235,675/yr
US Unlimited PTO

  • Define and work within our data governance practices, including a catalog/dictionary and management of data quality.
  • Manage lights-out data operations of our ETL/ELT pipelines ranging from streaming inputs to batch file loads, to support customer reporting, development, and operations.
  • Untangle, normalize, synthesize as needed to permit joining and comparisons from disparate sources, and further analysis including ML processing.

Evermore is a technology company that administers Smart Benefits to connect people to products and services. They are backed by leading investors including General Catalyst, Define Ventures, Lightspeed Venture Partners, Pinegrove Capital Partners, and Qiming Venture Partners.

$180,000–$220,000/yr
US Unlimited PTO 14w maternity

  • Design, build, and maintain databases that power Hologram's operations.
  • Build and maintain ETL pipelines that move and transform data reliably.
  • Audit existing pipelines and data models, identify complexity, and refactor bad decisions.

Hologram is building the future of IoT connectivity, delivering internet access to millions of connected devices worldwide. They process over 5 billion transactions per month across their global infrastructure and values a fun, upbeat, and remote-first team united by their mission.

Europe Asia

  • Design, implement, and maintain robust, scalable data pipelines to support AI, analytics, and operational reporting
  • Own and evolve the data warehouse architecture, ensuring it meets performance, flexibility, and governance needs
  • Ensure data integrity, availability, lineage, and observability across complex pipelines

Remote People is building the infrastructure to power borderless teams. Their technology handles global payroll, benefits, taxes, and compliance, enabling businesses to compliantly hire anyone anywhere at the push of a button. They are a growing, international family.

US 5w PTO

  • Design, develop, and optimize ETL pipelines using Epic Caboodle Console, SQL Server SSIS, Microsoft Fabric, Azure Databricks , Azure Data Lake Storage , and Azure Data Factory.
  • Build and maintain star-schema data structures including facts, dimensions, SCDs, aggregates, and bridge tables.
  • Develop user-facing BI assets, including Power BI data models, dashboards, and semantic layers.

OHSU's Business Intelligence & Advanced Analytics (BIAA) team supports population health, value-based care, financial analytics, and data-driven clinical operations across the OHSU Health System. They are Portland's largest employer, offering opportunities to learn and advance in a system of hospitals and clinics across Oregon and Southwest Washington.

US

  • Independently lead and deliver billable client engagements.
  • Design and implement scalable, secure, and performant data solutions in Microsoft Fabric.
  • Facilitate workshops and guide clients through ambiguous problem spaces.

Atmosera empowers businesses to redefine what's possible with modern technology and human expertise. As a Microsoft Partner with seven specializations, GitHub AI Partner of the Year, a member of the GitHub Advisory Board, and a member of the prestigious Microsoft Intelligent Security Association (MISA), Atmosera expertly delivers cutting-edge, integrated solutions that deliver business value.

Global

  • Design, develop, and maintain data pipelines using Azure Databricks
  • Build and optimize data transformations using PySpark and SQL in Databricks
  • Implement and maintain Lakehouse architectures using Delta Lake

Miratech is a global IT services and consulting company that brings together enterprise and start-up innovation, supporting digital transformation for some of the world's largest enterprises. They retain nearly 1000 full-time professionals, and their annual growth rate exceeds 25%.

$205,000–$220,000/yr
US

  • Partner with Sales and Field Engineering to design and architect complex, enterprise-grade solutions tailored to customer needs.
  • Lead the implementation of custom solutions within customer environments across multi-cloud and hybrid architectures.
  • Optimize solutions for performance, scalability, and reliability in production environments.

Striim is a unified data integration and streaming platform that connects clouds, data, and applications. We believe and expect all of our employees to operate as one with unlimited potential and dignity.

Latin America

  • Strengthen the real estate MLS data platform squad.
  • Build robust data pipelines and backend services.
  • Evaluate and monitor infrastructure to improve systems over time.

Luxury Presence is building the AI growth platform for real estate. Backed by Bessemer Venture Partners and other top investors, we're a Series C company on track to hit $100M in annual recurring revenue in the next six months. Founded in 2016, Luxury Presence has grown to a global team and has raised $89 million to date.

$140,400–$224,250/yr
US

  • Lead the implementation of a resilient, privacy-first data platform architecture.
  • Lead the design, infrastructure, and tooling decisions for platform optimization.
  • Develop AI-ready architecture by creating semantic layers that define and standardize business logic.

Headspace provides access to lifelong mental health support. They combine evidence-based content, clinical care, and innovative technology to help millions of members around the world get support that’s effective and personalized. They value connecting with courage, ownership, and iterating to great.