Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.
InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.
Architect Spark-driven workflows at scale and design data platforms as products for internal teams.
Develop and maintain end-to-end data pipelines and backend ingestion workflows across multiple sources.
Champion Samsara's cultural principles and mentor junior team members to drive data-driven decisions.
Samsara is the pioneer of the Connected Operations Cloud, enabling organizations to harness IoT data for actionable insights to improve safety, efficiency, and sustainability. As a recently public company, it fosters a culture of rapid career development, ownership, and high performance.
Build and scale data infrastructure powering targeting, identity, and measurement capabilities.
Optimize core ETL/ELT pipelines and ensure operational reliability with documented SLAs.
Implement privacy-compliant data methodologies meeting GDPR/CCPA standards.
Kargo creates powerful moments of connection between brands and consumers to build businesses. With 600+ employees and offices across the US, UK, Australia, and Ireland, they take a creative science approach to deliver unique ad experiences across premium platforms.
Design, build, and operate high-scale data ingestion and replication systems from production data stores into the data lakehouse.
Build and maintain reliable, scalable data platform infrastructure capable of handling petabytes of data across analytics, AI, and operational use cases.
Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams move and access data safely.
Samsara is the pioneer of the Connected Operations Cloud, enabling organizations that depend on physical operations to harness IoT data for actionable insights. As a publicly traded company, Samsara fosters a growth-oriented culture and serves industries that represent over 40% of global GDP.
Build and maintain data pipelines for analytics, ML, and product applications.
Design scalable data infrastructure with a focus on quality and observability.
Collaborate with cross-functional teams to understand data needs and implement solutions.
Prolific builds human data infrastructure to power the next wave of AI innovation. They are a remote-first company focused on ethical data collection and a mission-driven culture.
Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.
Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.
Design, build, and scale cloud-based data platforms that support critical business insights and operations.
Collaborate with cross-functional teams to extract, load, and transform data using cloud-native principles.
Mentor engineers and drive architectural direction through code reviews, testing, and documentation.
Credit Acceptance is a leading used car finance company in the United States. It has a world-class culture shaped by dedicated Team Members and a shared drive to succeed.
Architect and implement scalable ETL and data pipelines for real-time risk management and advanced analytics.
Design, develop, and optimize distributed data storage solutions for high performance and reliability at scale.
Drive schema evolution, data modeling, and pipeline orchestration with ownership of end-to-end data flow.
Oscilar builds the most advanced AI Risk Decisioning™ Platform for banks, fintechs, and digitally native organizations to manage fraud, credit, and compliance risk. The company is mission-driven with a remote-first culture and team members from Meta, Uber, Citi, and Confluent.
Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.
Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.
Design and deliver end-to-end data platforms for analytics, BI, machine learning and AI-ready data products.
Build and optimise scalable ETL/ELT pipelines with Databricks, Spark/PySpark, SQL and Python.
Apply data quality, governance and security standards across the platform and mentor engineers.
Tieto Tech Consulting provides design-led, data-centric, and AI-powered digital engineering & consulting services to enterprises worldwide. They focus on diversity, equity, and inclusion, fostering an inspiring workplace with a global team.
Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.
YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.
Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
Participate in architectural decisions and evangelize data engineering best practices.
OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.
Build scalable Python-based data pipelines and backend services for analytics workflows.
Design software systems using object-oriented programming and sound engineering practices.
Create and support platforms for analytics development, model training, and model deployment.
Experian is a global data and technology company that powers opportunities for people and businesses worldwide across markets like financial services, healthcare, and automotive. With a team of 25,200 people in 32 countries, Experian invests in advanced technologies and its people to unlock the power of data.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark) or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between the back-offices and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.
Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.
Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.
Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.
Design, develop, and maintain backend data processing solutions using Apache Spark.
Write and optimize SQL queries for data extraction, transformation, and analysis.
Develop scalable data pipelines and ETL processes, collaborating with cross-functional teams.
Talan is an international advisory group specializing in innovation and transformation through technology. The company has 5,000 employees and an annual turnover of 600M€, and has been recognized as a Great Place to Work in Spain and Poland.
Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
Define data governance policies and leverage AI tools to streamline data pipeline development.
For over four decades, PAR Technology Corporation has been a leader in restaurant technology, empowering brands worldwide to create lasting connections with their guests. Serving over 100,000 restaurants in more than 110 countries, we embrace a 'Better Together' ethos and offer comprehensive software and hardware solutions.
Play a crucial role in helping client organizations transform raw data into reliable, well-modeled assets that drive business decisions.
Design, build, and maintain scalable data pipelines and ELT workflows, with Databricks as the primary platform.
Collaborate with data engineers, analysts, and clients on end-to-end data requirements and project delivery.
Velir is an established mid-sized agency with a top-tier portfolio of clients, ranging from the world’s largest non-profits to Fortune 500 brands. Our culture is built on a foundation of trust, collaboration, and continued improvement, and we are a remote-first company that offers competitive pay and excellent benefits.
Design, develop, and maintain ETL and data transformation processes.
Implement and support Spark-based data pipelines and contribute to data integration initiatives.
Collaborate in Agile teams and participate in DevOps practices and CI/CD processes.
Talan is an international advisory group specializing in innovation and transformation through technology, with 5,000 employees and an annual turnover of 600M€. They offer services in consulting, data & technology, cloud & application services, and service centers of excellence.