Development of Python services: integration with marketing partners and data collection from various sources.
Creation and support of processes on Airflow.
Supporting the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.
Design & build data observability platforms and metrics.
Build metadata-driven pipeline solutions.
Fuze Health puts patients first and tirelessly addresses the most pressing needs in healthcare. They empower millions to digitally connect with care providers, essential health resources and needed treatments. The company is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors.
Design, build, and maintain scalable data pipelines.
Develop and optimize ETL processes to support data products.
Work with structured and unstructured data across SQL and NoSQL systems.
They are seeking a Data Engineer to support the development of data products that power critical business functions. They offer a collaborative, cross-functional Agile environment where you'll partner closely with technical and business teams to deliver high-quality data solutions.
Design, build, and maintain scalable data infrastructure using modern cloud technologies.
Develop robust batch and streaming data pipelines to ingest, process, and serve data.
Contribute to the implementation of a modern data lakehouse architecture.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. The system identifies the top-fitting candidates and shares this shortlist with the hiring company.
Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
Collaborate cross-functionally with AI/ML and Product teams to implement new features.
Proactively identify and resolve bottlenecks in our complex ETL processes.
Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
Lead, coach, and develop a team of analytics engineers and/or data engineers.
Ensure on-time delivery of client data integrations by owning enterprise data model standards and maintaining consistent, governed data definitions.
Oversee client data pipelines using modern tooling (dbt, Airflow, Snowflake, AWS, Python) to ensure reliable operation and uptime.
SmarterDx builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark) or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Be the Analytics Engineering lead within the Sales and Marketing organization.
Be the data steward for Sales and Marketing: architect and improve the collection of underlying data.
Develop and maintain robust data pipelines and workflows for data ingestion, processing, and transformation.
Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and millions of daily active unique visitors, Reddit is one of the internet’s largest sources of information.
Lead a team of 6-8 analysts, including hiring and performance management.
Manage the team’s workload together with product managers.
Ensure the quality of the extracted data.
Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, they surpassed $3B in revenue in their last fiscal year with extensive growth potential ahead.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Design and build ETL processes in collaboration with software and model development teams.
Create and maintain scalable data infrastructure.
Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.
OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvest 3% of profits in the open-source community, and value freedom, teamwork, accountability, and uncompromising quality.
Build and maintain data transformation pipelines with robust testing.
Design, implement, and maintain models with complex domain and business logic.
Optimize data storage and retrieval processes for improved performance and scalability.
Accorded is seeking experienced professionals to join their team. They are located in the San Francisco Bay Area, are committed to creating a diverse and inclusive work environment, and do not discriminate.
Build and own end-to-end data pipelines in Snowflake — from raw ingestion through transformation to serving layers for AI products.
Partner with ML engineers and data scientists to build and maintain AI-specific data infrastructure.
Consolidate fragmented data sources across the organization into reliable, automated pipelines.
Power Digital is a tech-enabled growth firm at the intersection of marketing, consulting, and data intelligence. They ignite revenue and brand recognition for leading and emerging companies. They are a people-first firm with a focus on diversity and have a dynamic team of consultative marketers, creatives, analysts and technologists.
Build and optimize scalable data pipelines using Python and dbt.
Design and maintain Snowflake warehouse structures, database tables, and performant data models.
Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.
We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.
Design, build, and operate data pipelines for analytics and AI/ML capabilities.
Architect ingestion, transformation, and storage pipelines across diverse data sources.
Implement data models suitable for analytics and BI consumption.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They identify the top-fitting candidates and share the shortlist directly with the hiring company.
Design and administer cloud-native data systems using AWS services like Glue, Lambda, Redshift, and S3 to build scalable data architectures.
Develop and maintain reliable ETL processes using Python and SQL to ingest, clean, and transform complex healthcare data, optimizing pipeline performance.
Implement data security, governance, and compliance measures while collaborating with product and analytics teams to translate business needs into technical solutions.
Evio is a pharmacy solutions company founded by and working with health plans to implement transformative specialty medication initiatives. It is a lean, independent entity with six owner health plans serving over 20 million members, investing heavily in a strong, intentional team culture and values.
Design and implement batch and real-time ingestion pipelines from internal and external sources.
Implement automated data quality checks, observability, and SLA monitoring.
Optimise datasets and pipelines for analytics, ML training, and API consumption.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.
Design, develop, and maintain scalable data pipelines and infrastructure.
Build and optimize data warehouses, databases, and data models.
Implement and maintain data governance and security practices.
Jobgether is a company that uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They connect candidates with companies; their culture is collaborative and inclusive, focused on innovation and growth.
Develop and deliver advanced data analytics and reporting solutions to support operational decision-making across business teams.
Build and maintain Python-based data pipelines, automation scripts, and API integrations for scalable analytics workflows.
Design and optimize complex SQL Server queries, stored procedures, and functions with a focus on performance and reliability.
Jobgether is a company that uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements. They identify top-fitting candidates.
Build and Maintain Bronze/Silver Layer Pipelines: You will ensure core data sources land accurately, on time, and with full lineage.
Lead Data Ingestion, Transformation, and Enrichment: You will own the end-to-end pipeline from raw file landing through cleansed, conformed staging tables, including deduplication, standardization, code mapping, and entity resolution.
Develop Automated Ingestion Pipelines: You will use Snowpipe, Matillion, or custom solutions with reliability, observability, and minimal manual intervention in mind.
Precision AQ is building a centralized Data Hub to consolidate fragmented data infrastructure, establish enterprise-wide data governance, and enable AI-ready analytics across our life sciences portfolio. This is a foundational initiative, not a maintenance role.