Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
Contribute to automated testing suites and optimize data operations for integrity and security.
Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.
Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.
InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.
Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
Collaborate cross-functionally with AI/ML and Product teams to implement new features.
Proactively identify and resolve bottlenecks in our complex ETL processes.
Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
Participate in architectural decisions and evangelize data engineering best practices.
OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.
Own day-to-day administration, configuration, and health of Oura's global Databricks environment.
Contribute to data pipeline development and Spark workload optimization across cross-functional growth areas.
Manage workspace governance including access controls, cluster policies, cost monitoring, and security configurations.
Oura empowers people to own their inner potential through award-winning products that help them gain deeper knowledge of readiness, activity, and sleep quality. They are a quickly growing company focused on helping people live healthier and happier lives, ensuring team members have what they need to do their best work.
Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.
Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.
Design, build, and launch sophisticated data models and visualizations supporting multiple products.
Optimize pipelines, frameworks, and systems for easier development of data artifacts.
Collaborate with cross-functional teams and embody core values such as ownership and customer focus.
Outreach provides the only complete agentic AI platform for revenue teams. The company is used by world-leading enterprises like Databricks, SAP, Siemens, and Verizon and promotes a culture of diversity and inclusion.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark) or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Lead the design and evolution of scalable financial data systems supporting commissions, incentives, and payments.
Build and maintain robust data pipelines using Python, SQL, Spark, and Terraform for accuracy and performance.
Define technical strategy and roadmap for financial data operations in collaboration with stakeholders.
Our partner is a fast-growing technology company building financial data infrastructure for insurance operations. They have a remote-friendly work environment and emphasize engineering excellence and cross-functional collaboration.
Design and deliver end-to-end data platforms for analytics, BI, machine learning and AI-ready data products.
Build and optimise scalable ETL/ELT pipelines with Databricks, Spark/PySpark, SQL and Python.
Apply data quality, governance and security standards across the platform and mentor engineers.
Tieto Tech Consulting provides design-led, data-centric, and AI-powered digital engineering & consulting services to enterprises worldwide. They focus on diversity, equity, and inclusion, fostering an inspiring workplace with a global team.
Design and build scalable components for high-throughput data ingestion and processing.
Develop systems for storing and serving batch data, and contribute to API services and event-driven applications.
Optimize data storage and retrieval for high throughput, security, and ease of access, and mentor peers.
Phaidra builds AI-powered control systems for industrial facilities, using reinforcement learning to optimize automation. The company is fully remote with a global team of 100+ employees, emphasizing a culture of transparency, collaboration, ownership, and empathy.
Design, develop, and maintain scalable ETL/ELT data pipelines to ingest, transform, and load data into data warehouses.
Implement and monitor data quality frameworks, ensuring accuracy, consistency, and reliability across datasets.
Collaborate with data scientists, analysts, and business stakeholders to deliver effective data solutions.
Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They focus on efficient and fair candidate evaluation through technology.
Design, build, and maintain distributed data pipelines that power Spotify Wrapped data stories and personalized experiences for more than 300M users globally.
Partner with Data Scientists to evaluate and operationalize new Wrapped story concepts, balancing personalization, scalability, and eligibility requirements.
Build scalable systems that process large-scale listening data and generate insights that celebrate users’ unique listening journeys.
The Personalization team makes deciding what to play next easier and more enjoyable for every listener. They are behind some of Spotify’s most-loved features. Join them and you’ll keep millions of users listening by making great recommendations to each and every one of them.
Design, develop, and maintain ETL and data transformation processes.
Implement and support Spark-based data pipelines and contribute to data integration initiatives.
Collaborate in Agile teams and participate in DevOps practices and CI/CD processes.
Talan is an international advisory group specializing in innovation and transformation through technology, with 5,000 employees and an annual turnover of 600M€. They offer services in consulting, data & technology, cloud & application services, and service centers of excellence.
Architect and maintain cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases.
Design and automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI.
Manage end-to-end data lifecycles including ingestion, storage, processing, and delivery of structured and unstructured data.
Trustonic makes smartphones affordable for the many, enabling global access to devices and digital finance through secure smartphone locking technology. They partner with mobile carriers, retailers, and financiers across 30+ countries, and pride themselves on a diverse, inclusive culture that values doing the right thing for each other, the community, and the planet.
Design and implement robust data infrastructure in AWS, using Spark with Scala.
Evolve our core data pipelines to efficiently scale for our massive growth.
Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.
tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.
Design, build, and operate high-scale data ingestion and replication systems from production data stores into the data lakehouse.
Build and maintain reliable, scalable data platform infrastructure capable of handling petabytes of data across analytics, AI, and operational use cases.
Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams move and access data safely.
Samsara is the pioneer of the Connected Operations Cloud, enabling organizations that depend on physical operations to harness IoT data for actionable insights. As a publicly traded company, Samsara fosters a growth-oriented culture and serves industries that represent over 40% of global GDP.
Design the technical architecture of the Databricks Data Warehouse and act as the pattern reviewer for the team.
Build and optimize secure self-service frameworks for batch and streaming data so the same request is never solved by hand twice.
Treat the platform like production software by defining SLOs, owning observability, and leading incident triage.
Tilt uses machine learning and mobile-first products to provide credit based on over 250 real-time financial signals, not just credit scores. With millions of customers worldwide, the company is building a new credit system for working people.
Architect and optimize large-scale data platforms on Google Cloud using BigQuery.
Design and build unified batch and streaming pipelines for high-volume workloads.
Mentor engineers and set technical direction for data architecture and governance.
Egen is a fast-growing and entrepreneurial company with a data-first mindset, using advanced technology platforms like Google Cloud and Salesforce to help clients drive action through data. They are committed to being a place where the best people choose to work, are dedicated to learning, and thrive on solving tough problems.
Design, develop, and maintain backend data processing solutions using Apache Spark.
Write and optimize SQL queries for data extraction, transformation, and analysis.
Develop scalable data pipelines and ETL processes, collaborating with cross-functional teams.
Talan is an international advisory group specializing in innovation and transformation through technology. The company has 5,000 employees and an annual turnover of 600M€, and has been recognized as a Great Place to Work in Spain and Poland.