Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
Collaborate cross-functionally with AI/ML and Product teams to implement new features.
Proactively identify and resolve bottlenecks in our complex ETL processes.
Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
Design, build, and maintain scalable data pipelines
Develop and optimize ETL processes to support data products
Work with structured and unstructured data across SQL and NoSQL systems
They are seeking a Data Engineer to support the development of data products that power critical business functions. They seem to have a collaborative, cross-functional Agile environment where you'll partner closely with technical and business teams to deliver high-quality data solutions.
Support the architecture, design, and development of scalable analytics and reporting solutions across enterprise data platforms.
Partner with business stakeholders to define analytical strategies, frame problems, and deliver insights that drive decision-making.
Design and implement end-to-end data pipelines and workflows using modern big data and cloud technologies.
Cotiviti provides payment accuracy and analytics-driven solutions, focusing on healthcare and retail sectors. They are committed to fostering a diverse and inclusive environment where team members can grow and thrive.
Collaborate with stakeholders to gather reporting and data infrastructure requirements.
Design, build, and maintain automated dashboards and scalable analytics infrastructure.
Develop, optimize, and maintain large-scale ETL pipelines for campaign reporting and analytics.
ItD blends diversity, innovation, and integrity with real business results as a woman- and minority-led firm. They reject any strong hierarchy, empowering them to deliver excellent results and thrive in a dynamic environment with empowerment and recognition.
Contribute to the design and implementation of scalable data solutions.
Build and optimize batch and streaming ingestion pipelines.
Ensure data quality, reliability, and performance across pipelines and datasets.
Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.
Build infrastructure and data automation pipelines to ingest, process, and load data from various sources.
Collaborate with stakeholders and data science teams to develop data products aligned with organizational goals.
Develop data analysis tools to provide insights and capture key metrics.
Columbia General is seeking a Senior Data Engineer to help transform data into actionable insights that drive decision-making. The company fosters a dynamic, collaborative environment that supports growth and innovation.
Development of various services in Python: integration with marketing partners, obtaining data from various sources.
Creation and support of processes on Airflow.
Supporting the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.
Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world and we are proud to be a two-time “Great Place to Work” winner.
Design, build, and maintain scalable data infrastructure using modern cloud technologies.
Develop robust batch and streaming data pipelines to ingest, process, and serve data.
Contribute to the implementation of a modern data lakehouse architecture.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. The system identifies the top-fitting candidates and shares this shortlist with the hiring company.
Independently deliver analytical projects across the consumer credit lifecycle, including acquisition, account management and collections
Build statistical and machine learning models through all phases of development, from design through training, evaluation, validation and implementation
Use a broad set of technologies: SQL, PySpark, Python, AWS and more to obtain insights from large volumes of data
Experian is a global data and technology company, powering opportunities for people and businesses around the world. We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more. They have an amazing team of 25,200 people in 32 countries.
Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.
Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Design & build data observability platforms and metrics.
Build metadata driven pipeline solutions.
Fuze Health puts patients first and tirelessly addresses the most pressing needs in healthcare. They empower millions to digitally connect with care providers, essential health resources and needed treatments. The company is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors.
Own and maintain data pipeline architectures, ensuring reliability and monitoring.
Manage and evolve data modeling environments for analysts and engineers.
Implement observability for data systems, detecting issues early and continuously monitoring data quality.
Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.
Build and own end-to-end data pipelines in Snowflake — from raw ingestion through transformation to serving layers for AI products.
Partner with ML engineers and data scientists to build and maintain AI-specific data infrastructure.
Consolidate fragmented data sources across the organization into reliable, automated pipelines.
Power Digital is a tech-enabled growth firm at the intersection of marketing, consulting, and data intelligence. They ignite revenue and brand recognition for leading and emerging companies. They are a people-first firm with a focus on diversity and have a dynamic team of consultative marketers, creatives, analysts and technologists.
Design, build, and operate data pipelines for analytics and AI/ML capabilities.
Architect ingestion, transformation, and storage pipelines across diverse data sources.
Implement data models suitable for analytics and BI consumption.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They identify the top-fitting candidates and share the shortlist directly with the hiring company.
Design and implement batch and real time ingestion pipelines from internal and external sources.
Implement automated data quality checks, observability, and SLA monitoring.
Optimise datasets and pipelines for analytics, ML training, and API consumption.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.
Identify opportunities to attract new users, increase engagement, and drive retention.
Influence strategic roadmaps through data-driven insights into user behaviors and needs.
Drive experimentation from design through execution and analysis to maximize learnings.
Reddit is a community-driven platform fostering open and authentic conversations. With over 100,000 active communities and millions of daily active users, it stands as a prominent source of online information, poised for rapid innovation and growth.
Enable efficient data access by creating and maintaining data pipelines.
Collaborate with ML engineers to design and maintain automation for machine learning training, quality assessment, and model release process.
Build data infrastructure from the vast amount of data for analytics, hypothesis testing and company metrics.
Eneba is building an open, safe, and sustainable marketplace for gamers. Their marketplace supports close to 20m+ active users and provides trust and safety.
Builds and modernizes data pipelines and integrations to improve processing efficiency.
Engineers data and analytics components and improves reliability/performance.
Supports testing, documentation, and O&M transition materials.
DMI is a leading provider of digital services and technology solutions, headquartered in Tysons Corner, VA. With a focus on end-to-end managed IT services, the company supports public sector agencies and commercial enterprises around the globe.
Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
Process and integrate data from multiple formats and sources (JSON, CSV, XML).
Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.
I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.