Development of various services in Python: integration with marketing partners, obtaining data from various sources.
Creation and support of processes on Airflow.
Supporting the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.
Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world and we are proud to be a two-time “Great Place to Work” winner.
Assess current pipelines and data architecture to produce a prioritized plan for change.
Design durable data and ML systems grounded in customer needs with documented tradeoffs.
Harden pipelines, upgrade data architecture, and raise standards for observability and reliability.
FutureFit AI's core mission is to help more people get to better jobs faster and cheaper, with a focus on those facing barriers to opportunity. Their team of 30-50 across the US and Canada fosters a high trust, high intensity culture with a will to win.
Own day-to-day administration, configuration, and health of Oura's global Databricks environment.
Contribute to data pipeline development and Spark workload optimization across cross-functional growth areas.
Manage workspace governance including access controls, cluster policies, cost monitoring, and security configurations.
Oura empowers people to own their inner potential through award-winning products that help gain deeper knowledge of readiness, activity, and sleep quality. They are a quickly growing company focused on helping people live healthier and happier lives, ensuring team members have what they need to do their best work.
Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.
YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.
Lead industrialization and automation initiatives across development and deployment processes.
Design, maintain, and evolve internal development and deployment tooling around dbt, Airflow, and Snowflake.
Implement monitoring, alerting, and observability capabilities to maximize platform stability and operational efficiency.
Talan is an international advisory group focused on innovation and transformation through technology. They have 5000 employees and a turnover of 600M€, offering services to support organizations' transformation through consulting, data & technology, cloud & application services, and service centers of excellence.
Architect and optimize Snowflake architecture for enterprise scalability and future data product commercialization.
Drive agentic engineering using tools like Snowflake Cortex, Cursor, and UiPath to automate workflows and deploy AI agents.
Establish data observability frameworks and operationalize MLOps pipelines for robust ML model lifecycle management.
Versapay automates accounts receivable, removing barriers to collecting and reconciling B2B payments. With over 10,000 customers and 5M+ companies transacting on the platform, they process over 110M transactions and $257B annually.
Build and scale data infrastructure powering targeting, identity, and measurement capabilities.
Optimize core ETL/ELT pipelines and ensure operational reliability with documented SLAs.
Implement privacy-compliant data methodologies meeting GDPR/CCPA standards.
Kargo creates powerful moments of connection between brands and consumers to build businesses. With 600+ employees and offices across the US, UK, Australia, and Ireland, they take a creative science approach to deliver unique ad experiences across premium platforms.
Own the data ingestion layer end-to-end, including migration to open-source tooling (dlt) and maintaining reliability as the stack evolves.
Manage dbt models, tests, documentation, and the semantic layer that defines metrics across the business.
Build monitoring, failure alerting, and anomaly detection so issues surface proactively and trace root causes from dashboard to source.
Lola Blankets is a fast-growing comfort and lifestyle brand on a mission to make the world a cozier place. They are a lean, open-source-leaning, fast-moving, and opinionated team that values strong judgment and execution.
Design and maintain data pipelines using Azure Databricks, PySpark, and SQL.
Build and optimize Lakehouse architectures with Delta Lake and Azure Data Factory.
Collaborate with BI teams and support Power BI while evolving cloud-based data platforms.
Miratech is a global IT services and consulting company that helps visionaries change the world through digital transformation for some of the world's largest enterprises. With nearly 1,000 full-time professionals and a culture of Relentless Performance, they have achieved over 99% project success since 1989.
Design and maintain production-grade ETL/ELT pipelines in a multi-hundred terabyte Snowflake environment.
Translate client loyalty program requirements into dimensional models and platform tables.
Build reliable, event-driven data architecture to support AI-powered loyalty products.
Kobie is a leader in loyalty solutions, helping brands build lasting emotional connections with consumers. Named a Top Workplace in the USA, the company fosters a collaborative, growth-focused culture with a diverse suite of benefits and flexible work arrangements.
Design, build, and maintain robust data infrastructure to support business intelligence, analytics, and machine learning initiatives.
Optimize workflows and implement secure, scalable storage solutions using modern data engineering practices.
Collaborate with cross-functional teams to translate business needs into effective technical solutions.
Truelogic is a leading provider of nearshore staff augmentation services headquartered in New York. With a team of over 600 skilled tech professionals based in Latin America, they deliver top-tier technology solutions to companies of all sizes, fostering a culture of innovation and growth.
Architect and maintain cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases.
Design and automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI.
Manage end-to-end data lifecycles including ingestion, storage, processing, and delivery of structured and unstructured data.
Trustonic makes smartphones affordable for the many, enabling global access to devices and digital finance through secure smartphone locking technology. They partner with mobile carriers, retailers, and financiers across 30+ countries, and pride themselves on a diverse, inclusive culture that values doing the right thing for each other, the community, and the planet.
Build and optimize scalable data pipelines using Python and dbt.
Design and maintain Snowflake warehouse structures, database tables, and performant data models.
Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.
We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Design the technical architecture of the Databricks Data Warehouse and act as the pattern reviewer for the team.
Build and optimize secure self-service frameworks for batch and streaming data so the same request is never solved by hand twice.
Treat the platform like production software by defining SLOs, owning observability, and leading incident triage.
Tilt uses machine learning and mobile-first products to provide credit based on over 250 real-time financial signals, not just credit scores. With millions of customers worldwide, the company is building a new credit system for working people.
Design, develop, and maintain ETL data engineering processes using Python (PySpark) and Azure Synapse Analytics.
Apply expertise in data warehousing to create effective data storage structures in a Massively Parallel Processing SQL Pool.
Collaborate with cross-functional teams to understand data requirements and provide support for data-related initiatives.
Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.
Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.
Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.
Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.
Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.
Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.
InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.
Build and lead a high-performance product engineering team focused on innovation, accountability, and reliability.
Develop scalable reliability, risk management, and operational governance capabilities for production systems.
Drive alignment across Platform Engineering, SRE, Infrastructure, and product teams to deliver long-term technical roadmap outcomes.
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without hidden fees or compounding interest. It is a publicly traded, remote-first company with competitive benefits and a culture focused on innovation and people.