Source Job

Spain

  • Design, develop, and maintain ETL and data transformation processes.
  • Implement and support Spark-based data pipelines and contribute to data integration initiatives.
  • Collaborate in Agile teams and participate in DevOps practices and CI/CD processes.

SQL Spark Scala Python Java

20 jobs similar to Big Data Developer

Jobs ranked by similarity.

Spain

  • Design, develop, and maintain backend data processing solutions using Apache Spark.
  • Write and optimize SQL queries for data extraction, transformation, and analysis.
  • Develop scalable data pipelines and ETL processes, collaborating with cross-functional teams.

Talan is an international advisory group specializing in innovation and transformation through technology. The company has 5,000 employees and an annual turnover of €600M, and has been recognized as a Great Place to Work in Spain and Poland.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

Europe

  • Design and deliver end-to-end data platforms for analytics, BI, machine learning, and AI-ready data products.
  • Build and optimise scalable ETL/ELT pipelines with Databricks, Spark/PySpark, SQL, and Python.
  • Apply data quality, governance, and security standards across the platform and mentor engineers.

Tieto Tech Consulting provides design-led, data-centric, and AI-powered digital engineering & consulting services to enterprises worldwide. They focus on diversity, equity, and inclusion, fostering an inspiring workplace with a global team.

Global

  • Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
  • Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
  • Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.

Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.

Latin America

  • Design, develop, and maintain ETL data engineering processes using Python (PySpark) and Azure Synapse Analytics.
  • Apply expertise in data warehousing to create effective data storage structures in a Massively Parallel Processing SQL Pool.
  • Collaborate with cross-functional teams to understand data requirements and provide support for data-related initiatives.

Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.

$123,696–$254,667/yr
US

  • Design and implement robust data infrastructure in AWS, using Spark with Scala.
  • Evolve our core data pipelines to efficiently scale for our massive growth.
  • Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.

tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.

US Canada

  • Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
  • Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
  • Participate in architectural decisions and evangelize data engineering best practices.

OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.

Brazil

  • Support the development of scalable data pipelines and platform components, following established frameworks and guidance from senior engineers.
  • Apply software engineering best practices, including coding standards, version control, testing, and documentation, to deliver reliable and maintainable code.
  • Collaborate with engineers, product owners, and cross-functional teams in an agile environment to support feature development and delivery commitments.

Experian is a global data and technology company that powers opportunities for people and businesses around the world. With 25,200 employees across 32 countries, it has a people-centric, inclusive, and purpose-driven culture recognized as a World's Best Workplace.

  • Translate business requirements into highly available data solutions using PySpark, SQL, and Python.
  • Collaborate with cross-functional teams including machine learning engineers and software developers.
  • Implement data pipelines and manage ETL processes with data warehousing fundamentals.

We are a multinational team that believes technology is the answer to today's business challenges. Since 2016, we have helped clients translate technology into success, combining the talent of Latin American professionals with Swiss organizational skills.

US Unlimited PTO

  • Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
  • Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
  • Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.

YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.

US

  • Design and build scalable cloud data pipelines for high-volume manufacturing and IoT data using Spark, Kafka, Airflow, and Delta Lake.
  • Implement medallion/lakehouse architectures on Databricks, Snowflake, AWS, or Azure with strong SQL and Python proficiency.
  • Apply manufacturing domain expertise in MES, SCADA, ERP, and industrial protocols to bridge OT/IT systems for real-time data extraction.

We are a Digital Product Engineering company that builds products, services, and experiences that inspire, excite, and delight. We have 17,000+ experts across 39 countries and our culture is dynamic and non-hierarchical.

Brazil

  • Design, develop, and maintain scalable ETL/ELT data pipelines to ingest, transform, and load data into data warehouses.
  • Implement and monitor data quality frameworks, ensuring accuracy, consistency, and reliability across datasets.
  • Collaborate with data scientists, analysts, and business stakeholders to deliver effective data solutions.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They focus on efficient and fair candidate evaluation through technology.

US EMEA

  • Design, build, and maintain distributed data pipelines that power Spotify Wrapped data stories and personalized experiences for more than 300M users globally.
  • Partner with Data Scientists to evaluate and operationalize new Wrapped story concepts, balancing personalization, scalability, and eligibility requirements.
  • Build scalable systems that process large-scale listening data and generate insights that celebrate users’ unique listening journeys.

The Personalization team makes deciding what to play next easier and more enjoyable for every listener. They are behind some of Spotify’s most-loved features. Join them and you’ll keep millions of users listening by making great recommendations to each and every one of them.

United States

  • Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
  • Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
  • Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.

Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.

Brazil

  • Design, build, and evolve large-scale, cloud-based data platforms supporting analytics, machine learning, and business intelligence.
  • Lead the development and optimization of scalable ETL/ELT pipelines for batch and near real-time processing.
  • Define data architecture standards, modeling approaches, and governance frameworks across projects.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They process applications using AI to ensure fair review and share shortlisted candidates directly with employers.

US 4w PTO

  • Solve unique data-lake challenges by transforming and normalizing highly varied partner datasets.
  • Design robust batch-processing pipelines for massive datasets from internal, external, and public sources.
  • Collaborate with cross-functional teams to support their data infrastructure needs.

MissionWired helps progressive nonprofits and political campaigns create revolutionary fundraising strategies. They have raised over $4.5 billion in donations and value innovation, inclusion, and social change.

US 4w PTO

  • Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
  • Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
  • Contribute to automated testing suites and optimize data operations for integrity and security.

Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.

India 5w PTO 26w maternity 2w paternity

  • Design, build, and launch sophisticated data models and visualizations supporting multiple products.
  • Optimize pipelines, frameworks, and systems for easier development of data artifacts.
  • Collaborate with cross-functional teams and embody core values such as ownership and customer focus.

Outreach provides the only complete agentic AI platform for revenue teams. The company is used by world-leading enterprises like Databricks, SAP, Siemens, and Verizon and promotes a culture of diversity and inclusion.

Costa Rica LATAM

  • Lead design and ownership of scalable data pipelines using SQL, Python, dbt, and cloud-native tools on Snowflake.
  • Architect and optimize Snowflake environments, including cost governance, performance tuning, and cloud integration.
  • Enforce data governance frameworks, security protocols, and data quality standards across the platform.

BlueCloud is a Snowflake Elite Partner and 2026 CoCo Catalyst Partner of the Year, helping enterprises migrate to AI-ready Snowflake platforms. With 450+ consultants and 200+ transformations, they deliver data solutions 40–50% faster across multiple industries.

Europe

  • Build and scale data infrastructure powering targeting, identity, and measurement capabilities.
  • Optimize core ETL/ELT pipelines and ensure operational reliability with documented SLAs.
  • Implement privacy-compliant data methodologies meeting GDPR/CCPA standards.

Kargo creates powerful moments of connection between brands and consumers to build businesses. With 600+ employees and offices across the US, UK, Australia, and Ireland, they take a creative science approach to deliver unique ad experiences across premium platforms.