Remote Data Jobs · Scala

Job listings

Mactores is seeking a highly skilled and innovative Spark Engineer to design, develop, optimize, and operationalize high-performance data pipelines and applications using Apache Spark. This role requires hands-on expertise in distributed data processing, ETL engineering, performance tuning, and cluster management.

$150,000–$180,000/yr

Staff Data Engineers are responsible for being interpersonal force multipliers across the organization, augmenting our internal capabilities to manage and leverage client data effectively. Staff Data Engineers work closely with functional leadership and data solutions team members to understand common client business needs and requirements, and to identify and develop data engineering solutions that meet those needs.

You’ll play a key role in designing and scaling the infrastructure and pipelines that power the product features, analytics, and machine learning across Sonatype. You’ll work closely with stakeholders across product, engineering, and business teams to ensure data is reliable, accessible, and actionable. This role is ideal for someone who thrives on solving complex data challenges at scale and enjoys building high-quality, maintainable systems.

You'll play a crucial role in developing and maintaining scalable data pipelines and infrastructure to drive data analytics and machine learning solutions for our clients. Design and create robust, scalable data pipelines using Databricks, Apache Spark, and SQL to transform and process large datasets efficiently. You will collaborate with data architects to design data models and architecture that support data analytics and machine learning applications.

Europe · Unlimited PTO

Seeking a highly skilled and experienced Senior Data Engineer to lead the development and management of our data platform. This pivotal role will focus on supporting critical data needs and developing foundational data models that are essential for advancing our cross-border remittances business. Managing users, scaling compute resources, and ensuring the platform operates optimally and efficiently will be key components of this role.

Develop and maintain scalable data pipelines using Scala and Apache Spark, focusing on performance and reuse. Structure and evolve the medallion architecture (bronze, silver, gold), ensuring governance, traceability, and data quality. Implement solutions for data ingestion, transformation, and delivery in cloud and lakehouse environments. Define and apply data engineering best practices, including versioning, automated testing, and CI/CD.

$14,640–$16,092/yr

As a Data Engineer Intern, you will work with an experienced data team to design, develop, and maintain our data systems and infrastructure. This internship provides hands-on experience with large datasets, building data pipelines, and using tools for data processing and analysis. You'll be contributing to projects that impact business decisions.

Unlimited PTO

As the foundational Data Engineer for HTS Media, you will own the data infrastructure that powers our entire advertising business. Your mission is to build a robust, scalable data foundation from the ground up that transforms raw data into the trusted, high-quality datasets that power advertiser-facing reporting and enable future ML models. You will be responsible for the full lifecycle of data, from building real-time data pipelines to modeling data in our warehouse for analytics and reporting.

We’re seeking a seasoned Staff Data Engineer to lead the design, development, and scaling of our modern data platform. This role is ideal for someone who thrives on designing and building robust data systems. You’ll be instrumental in shaping our data infrastructure, driving governance, and building scalable APIs that power real-time and batch analytics.

$125,000–$155,000/yr
US · 5w PTO · 10w maternity

We are looking for innovative and creative individuals who seize opportunities to uncover hidden drivers, impacts, and key influences to support our product, leadership, and clinical teams by applying optimization and statistical methods to a variety of data. You will work closely with the clinical program and product teams to support decision-making and will dig into a wide range of strategic and clinical problems.