Source Job

US Unlimited PTO

  • Design, build, and maintain robust, scalable batch and streaming data processing, storage, and integration pipelines.
  • Take full ownership of building features from the ground up, mentoring and leading your team to deliver the right solutions.
  • Partner with data scientists and engineers to create semantic data models and integrate applications across Shift5 componentry.

Python GoLang Java Apache Airflow AWS

20 jobs similar to Staff Data Engineer

Jobs ranked by similarity.

United States

  • Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
  • Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
  • Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.

Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.

Brazil

  • Design, develop, and maintain scalable ETL/ELT data pipelines to ingest, transform, and load data into data warehouses.
  • Implement and monitor data quality frameworks, ensuring accuracy, consistency, and reliability across datasets.
  • Collaborate with data scientists, analysts, and business stakeholders to deliver effective data solutions.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They focus on efficient and fair candidate evaluation through technology.

Global Unlimited PTO

  • Architect and maintain cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases.
  • Design and automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI.
  • Manage end-to-end data lifecycles including ingestion, storage, processing, and delivery of structured and unstructured data.

Trustonic makes smartphones affordable for the many, enabling global access to devices and digital finance through secure smartphone locking technology. They partner with mobile carriers, retailers, and financiers across 30+ countries, and pride themselves on a diverse, inclusive culture that values doing the right thing for each other, the community, and the planet.

US Unlimited PTO

  • Build and maintain data pipelines with DBT, Airflow, DynamoDB, and AWS Lambda, and develop APIs that serve data to the product.
  • Own the full stack of client-facing analytics and admin tools, ensuring robust testing and monitoring.
  • Contribute front-end code to integrate data into product experiences and build telemetry for analytics and machine learning models.

Vivian Health is the largest healthcare jobs marketplace, using an AI-driven platform to connect healthcare professionals with job opportunities and enhance hiring efficiency for employers. Backed by over $75 million in capital from investors like Thoma Bravo, the company has won awards for being a top remote workplace and has a culture focused on innovation and collaboration.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

US Canada

  • Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
  • Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
  • Participate in architectural decisions and evangelize data engineering best practices.

OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.

US Unlimited PTO 14w maternity 14w paternity

  • Design and build scalable data pipelines using Python and SQL to ingest, transform, and curate data from internal and external sources.
  • Implement schema validation, data quality checks, and job monitoring to ensure trustworthy data outputs.
  • Collaborate with analytics engineers, architects, and product partners to define technical requirements and deliver data products.

Cohere Health provides a clinical intelligence platform that uses AI to connect health plans and providers, improving care quality and reducing costs. We are a growing company recognized as a top LinkedIn startup and backed by leading investors, fostering a supportive, growth-oriented culture.

US

  • Lead workspace architecture, Unity Catalog governance, and cluster policy design for client tenant organizations.
  • Perform tenant discovery, requirements gathering, source profiling, and security classification for new data intake requests.
  • Develop end-to-end technical designs for tenant onboarding, including Data Sharing Agreements and SLA documentation.

M9 Solutions provides IT services and solutions to the Federal Government, mobilizing skilled people and technologies for improved performance and sustainable change. With 15+ years of proven delivery and growth, the company has been recognized as an Inc. 5000 Fastest-Growing Private Company multiple times and values diverse perspectives.

UK

  • Build and maintain data pipelines for analytics, ML, and product applications.
  • Design scalable data infrastructure with a focus on quality and observability.
  • Collaborate with cross-functional teams to understand data needs and implement solutions.

Prolific builds human data infrastructure to power the next wave of AI innovation. They are a remote-first company focused on ethical data collection and mission-driven culture.

US Unlimited PTO 15w maternity 15w paternity

  • Partner with the Director of Data to translate platform vision into engineering execution.
  • Lead the design and implementation of Kimball-style dimensional models and star schema architectures.
  • Design, build, and maintain production-grade data pipelines with a focus on reliability and scalability.

BlueLabs is an analytics services and technology company that helps organizations use data for social and commercial impact. With over 400 clients and a diverse team of analysts, scientists, engineers, and strategists, they foster a supportive, collaborative, and mission-driven culture.

Canada

  • Design, build, and operate high-scale data ingestion and replication systems from production data stores into the data lakehouse.
  • Build and maintain reliable, scalable data platform infrastructure capable of handling petabytes of data across analytics, AI, and operational use cases.
  • Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams move and access data safely.

Samsara is the pioneer of the Connected Operations Cloud, enabling organizations that depend on physical operations to harness IoT data for actionable insights. As a publicly traded company, Samsara fosters a growth-oriented culture and serves industries that represent over 40% of global GDP.

US 4w PTO

  • Solving unique data-lake challenges by transforming and normalizing highly varied partner datasets.
  • Designing robust batch-processing pipelines for massive datasets from internal, external, and public sources.
  • Collaborating with cross-functional teams to support their data infrastructure needs.

MissionWired helps progressive nonprofits and political campaigns create revolutionary fundraising strategies. They have raised over $4.5 billion in donations and value innovation, inclusion, and social change.

US 4w PTO

  • Design and build scalable, well-tested DBT models and develop maintainable ELT pipelines with observability and reliability.
  • Partner with product managers, engineers, and stakeholders to understand data needs and provide actionable paths forward.
  • Mentor other engineers, document logic and methodology, and promote transparency across the organization.

Imagine Pediatrics is a tech-enabled, pediatrician-led medical group reimagining care for children with special health care needs, delivering 24/7 virtual and in-home medical, behavioral, and social care. The company is a fast-paced startup environment focused on compassion and innovation, with an unwavering commitment to children with medical complexity.

US

  • Design and build scalable data pipelines, clean room environments, and privacy-safe integrations for NBCUniversal’s data collaboration ecosystem.
  • Implement identity resolution logic and configure secure, role-based access controls across data platforms.
  • Optimize query performance and operational reliability, including monitoring, cost tracking, and incident response.

NBCUniversal is one of the world's leading media and entertainment companies, creating and distributing content across film, television, and streaming. A subsidiary of Comcast Corporation, it champions an inclusive culture and has a rich tradition of community service.

US 4w PTO

  • Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
  • Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
  • Contribute to automated testing suites and optimize data operations for integrity and security.

Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.

US Canada India England Unlimited PTO

  • Own end-to-end delivery of core data platform components including schema mapping, normalization, and validation pipelines.
  • Drive technical architecture for an AI-native data warehouse serving institutional financial clients.
  • Build AI evaluation infrastructure to ensure trustworthy outputs in high-stakes financial data contexts.

Juniper Square is a private market operations platform that unifies technology, data, and fund administration services for over 2,300 GPs. With 1,000+ employees and $350M+ in funding, the company has a founder-led culture focused on ambitious, meaningful work and diverse perspectives.

US

  • Own data pipeline development. Build and maintain reliable pipelines that ingest, transform, and deliver healthcare data across the organization.
  • Design warehouse data models. Create scalable schemas and data structures that support analytics, reporting, and evidence generation.
  • Lead data transformation strategy. Establish frameworks and standards that improve consistency, maintainability, and performance.

Pivotal Health builds a technology platform to help healthcare providers get paid fairly in complex reimbursement landscapes. The company is a collaborative, low-ego team on a mission to make healthcare reimbursement fairer.

United States

  • Lead the design and evolution of scalable financial data systems supporting commissions, incentives, and payments.
  • Build and maintain robust data pipelines using Python, SQL, Spark, and Terraform for accuracy and performance.
  • Define technical strategy and roadmap for financial data operations in collaboration with stakeholders.

Our partner is a fast-growing technology company building financial data infrastructure for insurance operations. They have a remote-friendly work environment and emphasize engineering excellence and cross-functional collaboration.

Canada

  • Architect Spark-driven workflows at scale and design data platforms as products for internal teams.
  • Develop and maintain end-to-end data pipelines and backend ingestion workflows across multiple sources.
  • Champion Samsara's cultural principles and mentor junior team members to drive data-driven decisions.

Samsara is the pioneer of the Connected Operations Cloud, enabling organizations to harness IoT data for actionable insights to improve safety, efficiency, and sustainability. As a recently public company, it fosters a culture of rapid career development, ownership, and high performance.

Global

  • Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
  • Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
  • Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.

Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.