Remote Data Jobs · Spark

Job listings

  • Enable efficient consumption of domain data as a product by delivering and promoting strategically designed actionable datasets and data models
  • Build, maintain, and improve rock-solid data pipelines using a broad range of technologies like AWS Redshift, Trino, Spark, Airflow, and Kafka streaming for real-time processing
  • Support teams without data engineers in building decentralised data solutions and product integrations, for example, around DynamoDB
  • Act as a data ambassador, promoting the value of data and our data platform among engineering teams and enabling cooperation

OLX operates consumer brands that facilitate trade to build a more sustainable world. They have colleagues around the world who serve millions of people every month.

US 4w PTO

  • You will lead a high-impact team responsible for maintaining and evolving Voleon’s core analytics pipelines and data models.
  • You’ll translate the needs of researchers and data scientists into robust, scalable, and reproducible analytics systems.
  • You’ll collaborate with leaders across Research, Engineering, ProdOps, and Data Infrastructure to define a clear roadmap for analytics enablement.

Voleon is a technology company applying state-of-the-art machine learning to real-world financial problems. With over a decade of leadership in the industry, they've built a multibillion-dollar asset management firm and continue to drive ambitious innovations.

  • Design, build, and evolve scalable data pipelines and systems that ensure financial accuracy and integrity at scale.
  • Explore and apply Spotify’s Data and AI ecosystem to solve engineering problems and improve developer efficiency.
  • Partner closely with Finance and Audiobooks stakeholders to understand their needs and deliver financially impactful features.

Spotify transformed music listening forever when they launched in 2008. Today, they are the world’s most popular audio streaming subscription service.

$0–$200,000/yr

  • Architect and maintain robust data pipelines to transform diverse data inputs.
  • Integrate data from various sources into a unified platform.
  • Build APIs with AI assistance to enable secure access to consolidated insights.

Abusix is committed to making the internet a safer place. They are a globally distributed team that spans multiple countries and thrives in a culture rooted in trust, ownership, and collaboration.

  • Build, manage, and operationalize data pipelines for marketing use cases.
  • Develop a comprehensive understanding of customer and marketing data requirements.
  • Transform large data sets into targeted customer audiences for personalized experiences.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

  • Design, develop, and maintain scalable and robust data pipelines.
  • Create solutions for data ingestion, transformation, and modeling using Databricks, Spark/PySpark, Cloudera, and Azure Data Factory (ADF).
  • Ensure the quality, integrity, and usability of data throughout the entire pipeline.

CI&T specializes in technological transformation, uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters worldwide, they have partnered with over 1,000 clients during their 30-year history, with a focus on Artificial Intelligence.

  • Design and develop scalable data pipelines and infrastructure to process large volumes of data efficiently
  • Collaborate with cross-functional teams to ensure data integrity, accessibility, and usability
  • Implement and maintain data quality measures throughout the data lifecycle

CI&T is a tech transformation specialist, uniting human expertise with AI to create scalable tech solutions. With over 8,000 employees around the world, they've built partnerships with more than 1,000 clients over their 30-year history.

  • Design and develop scalable data pipelines and infrastructure to process large volumes of data efficiently
  • Collaborate with cross-functional teams to ensure data integrity, accessibility, and usability
  • Implement and maintain data quality measures throughout the data lifecycle

CI&T is a tech transformation specialist, uniting human expertise with AI to create scalable tech solutions. With over 8,000 employees around the world, they have a culture that values diverse identities and life experiences, fostering a diverse, inclusive, and safe work environment.

  • Prototype, iterate, and ship algorithms to production in close collaboration with Product, Data Engineering, and Software teams.

Mirakl provides eCommerce software solutions that enable enterprises to drive growth and efficiency in their online business. With over 350 employees in France and offices in 7 countries, Mirakl is considered a Great Place to Work company that is pioneering the platform economy.

  • Design, build, and maintain robust data pipelines and data models to support program outcomes, grant-funded activities, and long-term impact measurement
  • Develop and maintain self-service dashboards and reports that enable teams and leadership to make informed, data-driven decisions
  • Analyze trends and patterns across multiple datasets to surface actionable insights that guide strategy, partnerships, and program interventions

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies top-fitting candidates, and this shortlist is then shared directly with the hiring company.