Support the development of scalable data pipelines and platform components, following established frameworks and guidance from senior engineers.
Apply software engineering best practices, including coding standards, version control, testing, and documentation, to deliver reliable and maintainable code.
Collaborate with engineers, product owners, and cross-functional teams in an agile environment to support feature development and delivery commitments.
Design, develop, and maintain ETL and data transformation processes.
Implement and support Spark-based data pipelines and contribute to data integration initiatives.
Collaborate in Agile teams and participate in DevOps practices and CI/CD processes.
Talan is an international advisory group specializing in innovation and transformation through technology, with 5,000 employees and an annual turnover of 600M€. They offer services in consulting, data & technology, cloud & application services, and service centers of excellence.
Design, develop, and maintain scalable ETL/ELT data pipelines to ingest, transform, and load data into data warehouses.
Implement and monitor data quality frameworks, ensuring accuracy, consistency, and reliability across datasets.
Collaborate with data scientists, analysts, and business stakeholders to deliver effective data solutions.
Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They focus on efficient and fair candidate evaluation through technology.
Design, develop, and maintain ETL data engineering processes using Python (PySpark) and Azure Synapse Analytics.
Apply expertise in data warehousing to create effective data storage structures in a Massively Parallel Processing SQL Pool.
Collaborate with cross-functional teams to understand data requirements and provide support for data-related initiatives.
Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.
Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.
This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.
Design, build, and evolve large-scale, cloud-based data platforms supporting analytics, machine learning, and business intelligence.
Lead the development and optimization of scalable ETL/ELT pipelines for batch and near real-time processing.
Define data architecture standards, modeling approaches, and governance frameworks across projects.
Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They process applications using AI to ensure fair review and share shortlisted candidates directly with employers.
Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.
InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.
Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.
Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.
Design and deliver end-to-end data platforms for analytics, BI, machine learning and AI-ready data products
Build and optimise scalable ETL/ELT pipelines with Databricks, Spark/PySpark, SQL and Python
Apply data quality, governance and security standards across the platform and mentor engineers
Tieto Tech Consulting provides design-led, data-centric, and AI-powered digital engineering & consulting services to enterprises worldwide. They focus on diversity, equity, and inclusion, fostering an inspiring workplace with a global team.
Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
Participate in architectural decisions and evangelize data engineering best practices.
OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.
Participate in all initiatives to satisfy compliance, security, and regulatory needs while designing and building data platform capabilities.
Implement processes and tools enabling product engineering and BI teams to consume an extensible, governed data platform.
Provide technical guidance and mentorship on data platform adoption and maintain existing software services code.
SRS Acquiom delivers a platform and services to manage complex M&A and loan agency transactions efficiently. Based in Denver with offices across the US and in London and Amsterdam, the company has supported over 11,500 transactions globally since 2007, fostering a culture of entrepreneurial energy, growth, and innovation.
Design, implement, and maintain microservices using Python, Go, and Java.
Build and scale cloud-based software products, online services, and data pipelines for millions of transactions.
Collaborate on security, reliability, and automation while supporting testing and troubleshooting.
Ocrolus helps lenders automate workflows with an AI platform that processes nearly one million credit applications monthly with over 99% accuracy. Trusted by 400+ customers including Better Mortgage and SoFi, it is a fast-growing, remote-first team of builders and problem solvers.
Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.
Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.
Design, develop, and maintain backend data processing solutions using Apache Spark.
Write and optimize SQL queries for data extraction, transformation, and analysis.
Develop scalable data pipelines and ETL processes, collaborating with cross-functional teams.
Talan is an international advisory group specializing in innovation and transformation through technology. The company has 5,000 employees and an annual turnover of 600M€, and has been recognized as a Great Place to Work in Spain and Poland.
Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.
Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.
Build scalable Python-based data pipelines and backend services for analytics workflows.
Design software systems using object-oriented programming and sound engineering practices.
Create and support platforms for analytics development, model training, and model deployment.
Experian is a global data and technology company that powers opportunities for people and businesses worldwide across markets like financial services, healthcare, and automotive. With a team of 25,200 people in 32 countries, Experian invests in advanced technologies and its people to unlock the power of data.
Design and implement robust data infrastructure in AWS, using Spark with Scala.
Evolve our core data pipelines to efficiently scale for our massive growth.
Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.
tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.
Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
Contribute to automated testing suites and optimize data operations for integrity and security.
Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.
Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
Define data governance policies and leverage AI tools to streamline data pipeline development.
For over four decades, PAR Technology Corporation has been a leader in restaurant technology, empowering brands worldwide to create lasting connections with their guests. With over 100,000 restaurants in more than 110 countries, we embrace a 'Better Together' ethos and offer comprehensive software and hardware solutions.
Collaborate with U.S.-based clients to architect data solutions and translate requirements into technical specifications.
Design and build batch and real-time data pipelines, automate ETL processes, and ensure data accuracy and security.
Leverage advanced SQL, Python, and cloud data warehouse technologies to drive data-driven decision making for clients.
3Pillar Global is an AI transformation partner that helps enterprises build AI-native products and intelligent agents. With teams across North America, Europe, Latin America, and Asia, they foster a global, collaborative culture focused on modernizing and competing in the digital era.
Design and build scalable cloud data pipelines for high-volume manufacturing and IoT data using Spark, Kafka, Airflow, and Delta Lake.
Implement medallion/lakehouse architectures on Databricks, Snowflake, AWS, or Azure with strong SQL and Python proficiency.
Apply manufacturing domain expertise in MES, SCADA, ERP, and industrial protocols to bridge OT/IT systems for real-time data extraction.
We are a Digital Product Engineering company that builds products, services, and experiences that inspire, excite, and delight. We have 17000+ experts across 39 countries and our culture is dynamic and non-hierarchical.