Source Job

Spain

  • Design, develop, and maintain backend data processing solutions using Apache Spark.
  • Write and optimize SQL queries for data extraction, transformation, and analysis.
  • Develop scalable data pipelines and ETL processes, collaborating with cross-functional teams.

Apache Spark SQL Java Python Git

7 jobs similar to Mid-Level Spark Developer

Jobs ranked by similarity.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

Global

  • Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
  • Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
  • Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.

Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.

$123,696–$254,667/yr
US

  • Design and implement robust data infrastructure in AWS, using Spark with Scala.
  • Evolve our core data pipelines to efficiently scale for our massive growth.
  • Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.

tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.

  • Design, develop and maintain backend components using Java and OOP principles
  • Build and optimize SQL queries over large datasets (Oracle / PostgreSQL)
  • Contribute to microservices architectures (Spring Boot)

Talan is an international consulting group specializing in innovation and business transformation through technology. With over 7,200 consultants in 21 countries and a turnover of €850M, they are committed to delivering impactful, future-ready solutions.

Latin America

  • Design and maintain ETL processes using Python (PySpark) and Azure Synapse Analytics.
  • Collaborate with data scientists and analysts to deliver high-quality analytical data for decision-making.
  • Optimize data pipelines for scalability and performance while ensuring data accuracy and compliance.

Bluelight is a leading software consultancy focused on designing and developing innovative technology to improve users' lives. With a presence across the United States and Central/South America, the company values quality, customer satisfaction, and a collaborative culture where each team member can grow.

US 4w PTO

  • Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
  • Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
  • Contribute to automated testing suites and optimize data operations for integrity and security.

Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.

US

  • Design and build scalable cloud data pipelines for high-volume manufacturing and IoT data using Spark, Kafka, Airflow, and Delta Lake.
  • Implement medallion/lakehouse architectures on Databricks, Snowflake, AWS, or Azure with strong SQL and Python proficiency.
  • Apply manufacturing domain expertise in MES, SCADA, ERP, and industrial protocols to bridge OT/IT systems for real-time data extraction.

We are a Digital Product Engineering company that builds products, services, and experiences that inspire, excite, and delight. We have 17000+ experts across 39 countries and our culture is dynamic and non-hierarchical.