Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.
Design and build ETL processes in collaboration with software and model development teams.
Create and maintain scalable data infrastructure.
Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.
OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.
Own and maintain data pipeline architectures, ensuring reliability and monitoring.
Manage and evolve data modeling environments for analysts and engineers.
Implement observability for data systems, detecting issues early and continuously monitoring data quality.
Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.
Instrument fal's core infrastructure to capture CPU, GPU, and request-level signals.
Build ingestion pipelines from partner APIs, compute vendors, and internal services into BigQuery.
Design and operate the ETL backbone that powers cost, margin, and usage analytics.
Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production at scale.
Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.
Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Build and maintain an internal Data warehouse that aggregates and organizes data from various source systems.
Build dynamic reports to facilitate complex business analyses and enable business users to answer their most important data-driven questions.
Help communicate across different business groups in order to bring a source of truth to data.
Revinate is a direct booking platform that leads the hospitality industry in driving direct revenue and increased profitability. With their products and people, they give hoteliers the superpowers they need to crush their goals, shifting share away from OTAs and driving tangible results. They power 1.1 billion Rich Guest Profiles across 12,500+ hotels to drive over $24 billion in direct revenue.
Building pipelines that augment documents with metadata.
Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
Optimizing and evaluating our core utils, which do things like extracting and resolving citations.
We are hiring a senior software/data engineer to help build the largest case law dataset. Our data coverage includes US laws and court decisions and powers our lawyer-facing AI platform and B2B data services.
Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
Process and integrate data from multiple formats and sources (JSON, CSV, XML).
Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.
I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.
Build and optimize scalable data pipelines using Python and dbt.
Design and maintain Snowflake warehouse structures, database tables, and performant data models.
Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.
We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.
Collaborate with stakeholders to gather reporting and data infrastructure requirements.
Design, build, and maintain automated dashboards and scalable analytics infrastructure.
Develop, optimize, and maintain large-scale ETL pipelines for campaign reporting and analytics.
ItD blends diversity, innovation, and integrity with real business results as a woman- and minority-led firm. They reject any strong hierarchy, empowering them to deliver excellent results and thrive in a dynamic environment with empowerment and recognition.
Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.
This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.
Design, build, and maintain complex data processing systems ensuring data integrity and optimizing pipelines for efficiency and scalability.
Provide mentorship for junior team members and lead by demonstrating best delivery practices.
Proactively identify improvement opportunities for dashboards and support clients in solving critical business challenges.
Incubeta is a marketing intelligence company that helps businesses leverage data and technology to drive digital growth. They are an equal opportunity employer with a focus on diversity and inclusion, and they foster a culture of collaboration and continuous learning.
Builds and modernizes data pipelines and integrations to improve processing efficiency.
Engineers data and analytics components and improves reliability/performance.
Supports testing, documentation, and O&M transition materials.
DMI is a leading provider of digital services and technology solutions, headquartered in Tysons Corner, VA. With a focus on end-to-end managed IT services, the company supports public sector agencies and commercial enterprises around the globe.
Design, develop, and maintain scalable data pipelines and infrastructure.
Build and optimize data warehouses, databases, and data models.
Implement and maintain data governance and security practices.
Jobgether is a company that uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They connect candidates with companies; their culture is collaborative and inclusive, focused on innovation and growth.
Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
Collaborate cross-functionally with AI/ML and Product teams to implement new features.
Proactively identify and resolve bottlenecks in our complex ETL processes.
Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
Design, configure, and maintain AI agent workflows using Cursor and Claude Code for automated data system architecture.
Build and maintain a RAW → Base → Data Marts pipeline using dbt Core and implement business logic at the transformation layer.
Build comprehensive test suites using Great Expectations and ensure data quality through manual inspection and automation.
BiOptimizers helps people go from baseline health to peak biological performance with science-backed supplements and wellness tools. As a remote-first company, their globally distributed team focuses on clarity, autonomy, and operational excellence.
Architect and lead the design of data systems serving both operational business stakeholders and product/engineering teams.
Extend internal AI platform and bring software engineering rigor to data work including testing, CI/CD, and code review.
Build and own data models, partner with product engineering, and mentor teammates to raise the technical bar.
ZipRecruiter is a leading online employment marketplace powered by AI-driven intelligent matching technology. The company has the #1 rated job search app on iOS & Android and connects job seekers with millions of businesses.
Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.
Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.
Design, build, and operate data pipelines for analytics and AI/ML capabilities.
Architect ingestion, transformation, and storage pipelines across diverse data sources.
Implement data models suitable for analytics and BI consumption.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They identify the top-fitting candidates and share the shortlist directly with the hiring company.