Drive data science initiatives with ML models for predictive maintenance, anomaly detection, and root cause analysis in manufacturing.
Build scalable cloud data pipelines for high-volume IoT data using Spark, Kafka, Airflow, and Delta Lake.
Apply statistical methods (time series, regression, clustering) and design experiments to validate process changes and quantify business impact.
Nagarro is a Digital Product Engineering company that builds products, services, and experiences. With over 17,000 experts across 39 countries, they foster a dynamic, non-hierarchical work culture and are scaling rapidly.
Design and implement batch and real time ingestion pipelines from internal and external sources.
Implement automated data quality checks, observability, and SLA monitoring.
Optimise datasets and pipelines for analytics, ML training, and API consumption.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.
Build infrastructure and data automation pipelines to ingest, process, and load data from various sources.
Collaborate with stakeholders and data science teams to develop data products aligned with organizational goals.
Develop data analysis tools to provide insights and capture key metrics.
Columbia General is seeking a Senior Data Engineer to help transform data into actionable insights that drive decision-making. The company fosters a dynamic, collaborative environment that supports growth and innovation.
Build and Maintain Bronze/Silver Layer Pipelines: You will ensure core data sources lands accurately, on time, and with full lineage.
Lead Data Ingestion, Transformation, and Enrichment: You will own the end-to-end pipeline from raw file landing through cleansed, conformed staging tables, including deduplication, standardization, code mapping, and entity resolution.
Develop Automated Ingestion Pipelines: You will use Snowpipe, Matillion, or custom solutions with reliability, observability, and minimal manual intervention in mind.
Precision AQ is building a centralized Data Hub to consolidate fragmented data infrastructure, establish enterprise-wide data governance, and enable AI-ready analytics across our life sciences portfolio. This is a foundational initiative, not a maintenance role.
Enable efficient data access by creating and maintaining data pipelines.
Collaborate with ML engineers to design and maintain automation for machine learning training, quality assessment, and model release process.
Build data infrastructure from the vast amount of data for analytics, hypothesis testing and company metrics.
Eneba is building an open, safe, and sustainable marketplace for gamers. Their marketplace supports close to 20m+ active users and provides trust and safety.
Design, develop, and maintain robust and scalable data pipelines using Apache Spark and cloud-native data services.
Build, optimize, and support ETL/ELT workflows to enable analytics, reporting, and downstream applications.
Implement and manage data solutions using Databricks, Delta Lake, and Unity Catalog.
Onebridge, a Marlabs Company, is a global AI and Data Analytics Consulting Firm that empowers organizations worldwide to drive better outcomes through data and technology. Since 2005, they have partnered with some of the largest healthcare, life sciences, financial services, and government entities across the globe.
Owns organizational-wide data architecture, defining standards, patterns, and designs that our teams will implement.
Reviews data-related designs and implementations across teams for architectural consistency, performance, and scalability.
Designs and develops data pipelines, integrations, and platform features with performance and scalability in mind.
Tenna provides a platform that revolutionizes construction equipment fleet operations. They provide innovative solutions to customers looking for competitive ways to better manage and track their assets, such as heavy and light equipment, large fleets, tools, and materials. They value quality-obsessed, gritty, continuous learners, and collaborative problem solvers.
Build, maintain, and run CI/CD pipelines and infrastructure-as-code for the Smile Digital Health platform.
Provision, configure, and operate cloud-based Spark clusters and distributed data processing environments.
Design and maintain scalable, secure infrastructure templates and deployment automation across cloud environments.
Smile Digital Health makes it easy for healthcare stakeholders to collect and exchange data with our leading FHIR-based data liberation platform. At its heart, the Smile platform enables people and organizations to better manage healthcare data; the company was #19 on Deloitte's Technology Fast 50 Ranking for 2024!
Design and architect RAG pipelines using Hugging Face, LangChain, and Open AI API for enterprise cloud environments.
Apply OEE, Six Sigma, SPC, and lean methodologies to drive measurable gains in yield, uptime, and efficiency.
Bridge OT/IT systems for real-time data extraction using industrial protocols like OPC-UA, MQTT, and Modbus.
Nagarro is a digital product engineering company that builds products, services, and experiences across all devices and digital mediums. We have over 17,000 experts across 39 countries and a dynamic, non-hierarchical work culture.
Design, build, and operate data pipelines for analytics and AI/ML capabilities.
Architect ingestion, transformation, and storage pipelines across diverse data sources.
Implement data models suitable for analytics and BI consumption.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They identify the top-fitting candidates and share the shortlist directly with the hiring company.
Design and build ETL processes in collaboration with software and model development teams.
Create and maintain scalable data infrastructure.
Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.
OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.
Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.
Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.
Design and build end-to-end data pipelines across the RAW, Silver, and Gold layers of the Medallion Architecture.
Architect data ingestion, transformation, standardization, and serving processes, that structure data flows from diverse and heterogeneous sources into a coherent analytical foundation.
Model data for analytical consumption following Data Warehouse best practices, including Star Schema design and dimensional modeling suited for business intelligence and AI-readiness.
CI&T is a tech transformation specialist, uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters around the world, they’ve built partnerships with more than 1,000 clients during their 30 years of history, valuing diverse identities and life experiences.
Design, develop, and maintain data pipelines using Azure Databricks.
Build and optimize data transformations using PySpark and SQL in Databricks.
Implement and maintain Lakehouse architectures using Delta Lake.
Miratech helps visionaries change the world with enterprise and start-up innovation, supporting digital transformation for some of the world's largest enterprises. They are a values-driven organization with nearly 1000 full-time professionals and an annual growth rate exceeding 25%.
Design & build data observability platforms and metrics.
Build metadata driven pipeline solutions.
Fuze Health puts patients first and tirelessly addresses the most pressing needs in healthcare. They empower millions to digitally connect with care providers, essential health resources and needed treatments. The company is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors.
Lead workspace architecture, Unity Catalog governance, and cluster policy design for client tenant organizations.
Perform tenant discovery, requirements gathering, source profiling, and security classification for new data intake requests.
Develop end-to-end technical designs for tenant onboarding, including Data Sharing Agreements and SLA documentation.
M9 Solutions provides IT services and solutions to the Federal Government, mobilizing skilled people and technologies for improved performance and sustainable change. With 15+ years of proven delivery and growth, the company has been recognized as an Inc. 5000 Fastest-Growing Private Company multiple times and values diverse perspectives.
Design, build, and operate data services and APIs used by application teams.
Own technical design for major features, including API shape, data contracts, versioning, and backward compatibility.
Improve observability, correctness, and operational maturity of data products.
PlayOn Sports operates one of the largest real-time data ecosystems in high school sports. Backed by KKR, their family of brands empowers schools with innovative solutions and exceptional service and focuses on solving real challenges, learning quickly, and creating impactful solutions.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.
Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.