Design and build ETL processes in collaboration with software and model development teams.
Create and maintain scalable data infrastructure.
Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.
OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits into the open-source community, and they value freedom, teamwork, accountability, and uncompromising quality.
Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.
Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With a legacy of over 36 years, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.
Develop Python services that integrate with marketing partners and collect data from various sources.
Create and support processes on Airflow.
Support the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.
Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world and we are proud to be a two-time “Great Place to Work” winner.
Design, build, and operate data pipelines for analytics and AI/ML capabilities.
Architect ingestion, transformation, and storage pipelines across diverse data sources.
Implement data models suitable for analytics and BI consumption.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They identify the top-fitting candidates and share the shortlist directly with the hiring company.
Own and maintain data pipeline architectures, ensuring reliability and monitoring.
Manage and evolve data modeling environments for analysts and engineers.
Implement observability for data systems, detecting issues early and continuously monitoring data quality.
Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.
Build and modernize data pipelines and integrations to improve processing efficiency.
Engineer data and analytics components and improve their reliability and performance.
Support testing, documentation, and O&M transition materials.
DMI is a leading provider of digital services and technology solutions, headquartered in Tysons Corner, VA. With a focus on end-to-end managed IT services, the company supports public sector agencies and commercial enterprises around the globe.
Building pipelines that augment documents with metadata.
Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
Optimizing and evaluating our core utilities, which handle tasks such as extracting and resolving citations.
We are hiring a senior software/data engineer to help build the largest case law dataset. Our data coverage includes US laws and court decisions and powers our lawyer-facing AI platform and B2B data services.
Design, implement, and maintain data pipelines and ETL processes supporting ingestion, transformation, and validation of mission data.
Develop and optimize data models and schemas across relational and non-relational databases to support system integrations and analytics.
Collaborate with system architects, integration developers, and data analysts to ensure data consistency, security, and integrity across cloud environments.
INflow Federal, founded in 2013, delivers cutting-edge solutions to the Department of War (DoW) and Joint Force operations. It is a mission-driven small business in which Veterans make up over 50% of the workforce, and it invests deeply in professional growth, well-being, and innovation.
Build and Maintain Bronze/Silver Layer Pipelines: You will ensure core data sources land accurately, on time, and with full lineage.
Lead Data Ingestion, Transformation, and Enrichment: You will own the end-to-end pipeline from raw file landing through cleansed, conformed staging tables, including deduplication, standardization, code mapping, and entity resolution.
Develop Automated Ingestion Pipelines: You will use Snowpipe, Matillion, or custom solutions with reliability, observability, and minimal manual intervention in mind.
Precision AQ is building a centralized Data Hub to consolidate fragmented data infrastructure, establish enterprise-wide data governance, and enable AI-ready analytics across our life sciences portfolio. This is a foundational initiative, not a maintenance role.
Design and optimize scalable data pipelines and architectures for Data & AI initiatives.
Build cloud-native solutions using Azure, Databricks, and big data technologies.
Collaborate with business stakeholders to deliver data-driven solutions and contribute to a strong data culture.
Redcare Pharmacy is Europe's leading e-pharmacy, driven by innovation and a mission to ensure every human has access to health. The company fosters a collaborative and healthy work environment, with a team passionate about cutting-edge technology and data-driven solutions.
Quickly get up to speed on Zscaler’s SecOps platform, using Python and APIs to configure, customize, and automate data transformations and workflows.
Partner with cybersecurity subject matter experts (SMEs) to onboard new data pipelines and map diverse IT and security sources to fulfill specific customer use cases.
Proactively troubleshoot pipeline health and audit customer data across environments to identify quality issues, flag security gaps, and define clear remediation steps.
Zscaler accelerates digital transformation to ensure customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, they leverage the world’s largest security data lake to power their cloud-native Zero Trust Exchange platform. They build high-performing teams that can make an impact quickly and with high quality.
Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.
YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.
Lead workspace architecture, Unity Catalog governance, and cluster policy design for client tenant organizations.
Perform tenant discovery, requirements gathering, source profiling, and security classification for new data intake requests.
Develop end-to-end technical designs for tenant onboarding, including Data Sharing Agreements and SLA documentation.
M9 Solutions provides IT services and solutions to the Federal Government, mobilizing skilled people and technologies for improved performance and sustainable change. With 15+ years of proven delivery and growth, the company has been recognized as an Inc. 5000 Fastest-Growing Private Company multiple times and values diverse perspectives.
Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.
Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.
Instrument fal's core infrastructure to capture CPU, GPU, and request-level signals.
Build ingestion pipelines from partner APIs, compute vendors, and internal services into BigQuery.
Design and operate the ETL backbone that powers cost, margin, and usage analytics.
Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production at scale.
Develop and maintain data models for core package application and reporting databases.
Monitor execution and performance of daily pipelines and escalate issues.
Collaborate with analytics and business teams to improve data models and pipelines.
Bluelight Consulting is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.
Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
Architect and evolve our CI/CD processes, enhancing the testing environment and observability.
Enhance our Claude Code / LLM development support capabilities by creating tools, skills, and agents that give our LLMs more context and help us continually improve their ability to debug, write code, and maintain systems.
Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees; it delivers peace of mind and enhances everyday family life.
Take full ownership of INFOnline's central data platform from raw event ingress through reporting delivery, defining the roadmap and driving execution.
Establish engineering standards, mentor others, and drive AI-native engineering practices to increase speed and quality.
Collaborate with Product, Customer Success, and Leadership to translate business requirements into scalable technical solutions.
INFOnline powers digital audience measurement for the German and Austrian media industry, processing billions of events to deliver trusted reach and engagement metrics. Part of saas.group, they are a small, friendly team focused on modernizing infrastructure and moving to a cloud-native architecture on GCP.
Design, develop, and maintain robust and scalable data pipelines using Apache Spark and cloud-native data services.
Build, optimize, and support ETL/ELT workflows to enable analytics, reporting, and downstream applications.
Implement and manage data solutions using Databricks, Delta Lake, and Unity Catalog.
Onebridge, a Marlabs Company, is a global AI and Data Analytics Consulting Firm that empowers organizations worldwide to drive better outcomes through data and technology. Since 2005, they have partnered with some of the largest healthcare, life sciences, financial services, and government entities across the globe.