Remote Data Jobs · Spark

Job listings

$190,800–$267,100/yr

  • Be the Analytics Engineering lead within the Sales and Marketing organization.
  • Be the data steward for Sales and Marketing: architect and improve the data collection.
  • Develop and maintain robust data pipelines and workflows for data ingestion and transformation.

Reddit is a community-driven platform built on shared interests and trust, fostering open and authentic conversations. With over 100,000 active communities and approximately 116 million daily active unique visitors, it serves as a major source of information on the internet.

US Unlimited PTO 20w maternity 14w paternity

  • Build and operate data pipelines using AWS-native data tools and distributed processing frameworks.
  • Operate and improve core data platform services, addressing incidents, performance issues, and operational toil.
  • Partner with data producers and consumers to onboard pipelines, troubleshoot issues, and improve platform usability.

Fetch is a platform that millions of people use to earn rewards for buying brands they love, and a whole lot more. With investments from SoftBank, Univision, and Hamilton Lane, and partnerships with Fortune 500 companies, it is reshaping how brands and consumers connect in the marketplace. Ranked as one of America’s Best Startup Employers by Forbes, Fetch fosters a people-first culture rooted in trust, accountability, and innovation.

  • Develop big data applications for Synchrony in the Hadoop ecosystem.
  • Participate in the agile development process including backlog grooming, coding, code reviews, testing and deployment.
  • Work independently to develop analytic applications using technologies such as Hadoop, NoSQL, in-memory data grids, Kafka, Spark, and Ab Initio.

Synchrony is a premier consumer financial services company delivering customized financing programs across key industries including retail, health, auto, travel and home, along with award-winning consumer banking products. With more than $139 billion in sales financed and 68.5 million active accounts, they bring deep industry expertise, actionable data insights, innovative solutions and differentiated digital experiences to improve the success of every business and the quality of each life.

$125,000–$150,000/yr

  • Design, implement, and optimize robust and scalable data pipelines using SQL, Python, and cloud-based ETL tools such as Databricks.
  • Enhance our overarching data architecture strategy, assisting in decisions related to data storage, consumption, integration, and management within cloud environments.
  • Partner with data scientists, BI teams, and other engineering teams to understand and translate complex data requirements into actionable engineering solutions.

The New York Blood Center appears to be a medical organization. They are looking for a Senior Data Engineer to join their team.

US Unlimited PTO

  • Define strategic roadmaps and outcomes for clients.
  • Act as a data engineering SME for upcoming engagements.
  • Oversee the development and implementation of data standards.

Jobgether is a company that uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring employer; the final decision and next steps are managed by their internal team.

  • Design, build, and maintain infrastructure for data ingestion, processing, and analysis.
  • Translate business requirements into technical solutions in collaboration with stakeholders.
  • Ensure data quality, integrity, and security throughout the data lifecycle.

Jobgether is a platform that connects job seekers with companies. They use AI-powered matching to ensure applications are reviewed quickly and fairly.

  • Define, plan, and execute analyses regarding innovation initiatives, methodology development and improvement as well as process automation.
  • Prototype solutions and support pilot programs for R&D purposes, including trend analyses, representation/sampling, bias reduction, indirect estimation, data integration, automation, and generalization.
  • Apply test-driven development to build scalable data processing applications.

NIQ is the world’s leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. NIQ is an Advent International portfolio company with operations in 100+ markets, covering more than 90% of the world’s population.

  • Design, implement, and maintain distributed ingestion pipelines for structured and unstructured data.
  • Build scalable ETL/ELT workflows to transform, validate, and enrich datasets for AI/ML model training and analytics.
  • Support preprocessing of unstructured assets for training pipelines, including format conversion, normalization, augmentation, and metadata extraction.

Meshy is a leading 3D generative AI company transforming content creation by enabling the creation of 3D models from text and images. They have a global team distributed across North America, Asia, and Oceania and are backed by venture capital firms like Sequoia and GGV, with $52 million in funding.

$133,109–$239,596/yr

  • Take ownership of end-to-end client analytical projects, from initial solution design and data integrity evaluation through to final documentation and implementation.
  • Serve as an internal subject matter expert for complex analytics; be hands-on, using advanced Python and Spark to build scalable, production-ready solutions.
  • Act as a key consultant for clients and internal partners; perform advanced analytics and explain complex methodologies and results to both technical and non-technical audiences.

Experian is a global data and technology company, powering opportunities for people and businesses around the world. As a FTSE 100 Index company with corporate headquarters in Dublin, Ireland, they have a team of 22,500 people across 32 countries.

$67,000–$157,000/yr
US 4w PTO

  • Design, develop, and optimize data pipelines and ETL processes to ensure high-quality data is available for analysis.
  • Analyze complex datasets to identify trends, patterns, and actionable insights that drive business performance.
  • Implement data quality checks and governance best practices to ensure data accuracy and reliability.

Modeling Data Solutions is seeking an experienced data analytics engineer to join its personal lines property team. This is an opportunity to join the US Data Science Infrastructure department and support the creation of cutting-edge pricing programs.