Source Job

US Unlimited PTO

  • Own the gold data layer by transforming silver tables into curated, semantically rich datasets for AI model development.
  • Reverse-engineer data semantics by collaborating with product engineers, clinical experts, and analyzing SQL queries and stored procedures.
  • Build pipelines for reuse, automate quality filtering and synthesis, and maintain reproducible dataset snapshots.

Python SQL PySpark Databricks Data Engineering

20 jobs similar to Senior Research Data Engineer

Jobs ranked by similarity.

Canada Unlimited PTO

  • Build and own the gold data layer between the silver Lakehouse data and AI model development teams.
  • Reverse-engineer data semantics by collaborating with product engineers, clinical experts, and reading SQL queries and stored procedures.
  • Curate datasets across modalities, build reusable pipelines, and automate quality, filtering, and synthesis for AI research needs.

PointClickCare is a leading health tech company that helps providers deliver exceptional care through a platform serving over 30,000 provider organizations. Founder-led and privately held, the company reinvests in R&D and has been recognized by Forbes as a top private cloud company and one of Canada's Most Admired Corporate Cultures.

US Unlimited PTO

  • Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
  • Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
  • Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.

YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.

US

  • Design and implement medallion architecture using Delta Lake.
  • Build and optimize scalable data pipelines using Apache Spark.
  • Implement Unity Catalog for data lineage and access control.

V2 Strategic Advisors transforms media and advertising sales organizations with management and technology consulting. They are a lean, elite team of consultants, technical architects, and data experts who operate with the rigor of global management consulting and the energy and agility of a startup.

US

  • Architect and lead the design of data systems serving both operational business stakeholders and product/engineering teams.
  • Extend internal AI platform and bring software engineering rigor to data work including testing, CI/CD, and code review.
  • Build and own data models, partner with product engineering, and mentor teammates to raise the technical bar.

ZipRecruiter is a leading online employment marketplace powered by AI-driven intelligent matching technology. The company has the #1 rated job search app on iOS & Android and connects job seekers with millions of businesses.

US Unlimited PTO

  • Build and operate production-grade ingestion pipelines from clinical, operational, and third-party systems into a Databricks lakehouse.
  • Develop and maintain dbt models to transform raw data into clean, documented, analytics-ready datasets.
  • Establish data quality, testing, and monitoring practices to ensure pipeline reliability and performance.

Zócalo Health is a tech-enabled, community-oriented primary care organization serving underserved populations with culturally competent care. Founded in 2021, the company is backed by leading healthcare investors and is scaling rapidly with a focus on value-based care.

US

  • Play a crucial role in helping client organizations transform raw data into reliable, well-modeled assets that drive business decisions.
  • Design, build, and maintain scalable data pipelines and ELT workflows, with Databricks as the primary platform.
  • Collaborate with data engineers, analysts, and clients on end-to-end data requirements and project delivery.

Velir is an established mid-sized agency with a top-tier portfolio of clients, ranging from the world’s largest non-profits to Fortune 500 brands. Our culture is built on a foundation of trust, collaboration, and continued improvement, and we are a remote first company that offers competitive pay and excellent benefits.

US

  • Works as a positive team member to deliver quality data applications within scope, on time, and within budget.
  • Develops strategies for managing complex data sets and integrates disparate data sources for improved reporting.
  • Mentors other analysts on regulation adherence and follows HIPAA standards.

Emory University is a leading research university that fosters excellence and attracts world-class talent to innovate today and prepare leaders for the future. The organization offers a diverse and inclusive environment, with a focus on academic excellence and equal opportunity.

US

  • Lead Databricks lakehouse execution across bronze, silver, and gold with maintainable pipelines and curated outputs.
  • Own common data model implementation, conformed dimensions, fact design, and reusable gold datasets for reporting and AI/BI.
  • Design scalable data models for SAP, manufacturing, finance, sales, supply chain, and operational domains.

Sonny's Enterprises is the world's largest manufacturer of conveyorized car wash equipment, parts, and supplies. The company has a culture focused on innovation and is proudly designed and built in the USA.

Mexico

  • Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
  • Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
  • Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.

Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.

Brazil

  • Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
  • Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
  • Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.

This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.

US 16w maternity 12w paternity

  • Orchestrate High-Velocity Workflows: Leverage advanced agentic coding tools (e.g., Cursor, multi-agent environments) to dramatically accelerate feature prototyping, code generation, and test coverage.
  • Own the Guardrails & Quality: Act as the ultimate reviewer and architect; define the specifications, establish repo-context guardrails, and review AI-accelerated output for hidden security risks, scale bottlenecks, and architectural alignment.
  • Build Scalable Application and Data Layers: Design, build, and maintain our data pipelines and application to service our hundreds of users.

EvolutionIQ provides technology to improve insurance claims handling. The company is experiencing massive growth and has been named a top workplace, prioritizing its team.

US

  • Design, build, and optimize large-scale data and analytics platforms on the Databricks Lakehouse.
  • Architect and maintain scalable ETL/ELT pipelines using PySpark, Spark SQL, and Delta Lake.
  • Implement medallion data architectures, enforce data quality, and manage Unity Catalog for governance.

Bounteous is a premier end-to-end digital transformation consultancy that partners with ambitious brands to create digital solutions. With over 4,000 expert team members across the Americas, APAC, and EMEA, they deliver innovative strategies and technical expertise.

Canada

  • Architect Spark-driven workflows at scale and design data platforms as products for internal teams.
  • Develop and maintain end-to-end data pipelines and backend ingestion workflows across multiple sources.
  • Champion Samsara's cultural principles and mentor junior team members to drive data-driven decisions.

Samsara is the pioneer of the Connected Operations Cloud, enabling organizations to harness IoT data for actionable insights to improve safety, efficiency, and sustainability. As a recently public company, it fosters a culture of rapid career development, ownership, and high performance.

US

  • Design and deploy ML-based anomaly detection pipelines to flag data discrepancies early in the ETL process.
  • Build AI-assisted field mapping and classification tooling to accelerate schema mapping across data conversion cycles.
  • Develop automated data quality scoring pipelines and maintain lightweight AI tooling for staff to operate and extend.

Derex Technologies Inc specializes in providing IT consulting, staffing solutions and software services. Since 1996, the company has delivered customized IT talent solutions to global clients across North America, serving industries like finance, healthcare, and government.

US

  • Own data pipeline development. Build and maintain reliable pipelines that ingest, transform, and deliver healthcare data across the organization.
  • Design warehouse data models. Create scalable schemas and data structures that support analytics, reporting, and evidence generation.
  • Lead data transformation strategy. Establish frameworks and standards that improve consistency, maintainability, and performance.

Pivotal Health builds a technology platform to help healthcare providers get paid fairly in complex reimbursement landscapes. The company is a collaborative, low-ego team on a mission to make healthcare reimbursement fairer.

Canada

  • Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
  • Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
  • Define data governance policies and leverage AI tools to streamline data pipeline development.

For over four decades, PAR Technology Corporation has been a leader in restaurant technology, empowering brands worldwide to create lasting connections with their guests. With over 100,000 restaurants in more than 110 countries, we embrace a 'Better Together' ethos and offer comprehensive software and hardware solutions.

Global

  • Architect and scale data systems for analytics, ML/AI products, reporting, and APIs.
  • Own the full data lifecycle including ingestion, transformation, modeling, validation, and serving.
  • Partner with Data Science to productionize models and build reliable data foundations for AI-driven products.

Vidmob is the creative data company that provides scoring software and analytics to help marketers and agencies drive business results through improved creative effectiveness. They partner with top marketers and agencies worldwide and operate the industry's most robustly instrumented human-reinforcement learning model for creativity.

AI Engineer

LMI
$111,426–$192,890/yr
US

  • Design and develop data pipelines, scoring algorithms, and API infrastructure to power AI-driven matching and recommendation capabilities.
  • Build and maintain integrations between the matching engine and an existing program management platform.
  • Collaborate with SMEs to build, test, and refine user-configurable matching logic.

LMI is dedicated to accelerating government impact with innovation and speed, bringing commercial-grade platforms and mission-ready AI to federal agencies. Headquartered in Tysons, Virginia, they are committed to delivering impactful results that strengthen missions and drive lasting value.

$103,500–$192,000/yr
US

  • Design and build dbt models for Finance & Accounting reporting and analysis.
  • Design the semantic layer and metrics definitions for both humans and AI agents.
  • Maintain documentation and auditability of data pipelines in a SOX-controlled CI/CD environment.

Life360's mission is to keep people close to the ones they love through their mobile app, Tile tracking devices, and Pet GPS tracker. They empower members to protect what they care about most with services like location sharing and safe driver reports. Life360 has more than 500 remote-first employees and is growing.

US

  • Develop and implement scalable AI/ML solutions for generative AI models including large language models and multimodal architectures.
  • Design multi-year vision and shape the direction of crucial generative AI areas such as text generation, image synthesis, and personalized content.
  • Partner with product management and stakeholders to identify use cases, analyze patterns, and maintain compliance in healthcare AI.

Aledade is a healthcare technology company that builds web applications and data pipelines to support primary care. They are a large organization with a culture focused on engineering excellence, observability, and incremental delivery.