Source Job

Global

  • Query and process large datasets using Trino (SQL).
  • Work with data in AWS environment using PySpark.
  • Build audience segments based on website activity, call data, behavioral patterns and segment rules.

SQL PySpark Excel Power BI Python

20 jobs similar to Big Data Analyst

Jobs ranked by similarity.

  • Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
  • Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
  • Implement data contracts between back-office and the platform.

Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.

Global

  • Design and implement batch and real time ingestion pipelines from internal and external sources.
  • Implement automated data quality checks, observability, and SLA monitoring.
  • Optimise datasets and pipelines for analytics, ML training, and API consumption.

Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.

US Unlimited PTO

  • Build infrastructure and data automation pipelines to ingest, process, and load data from various sources.
  • Collaborate with stakeholders and data science teams to develop data products aligned with organizational goals.
  • Develop data analysis tools to provide insights and capture key metrics.

Columbia General is seeking a Senior Data Engineer to help transform data into actionable insights that drive decision-making. The company fosters a dynamic, collaborative environment that supports growth and innovation.

Mexico

  • Contribute to the design and implementation of scalable data solutions.
  • Build and optimize batch and streaming ingestion pipelines.
  • Ensure data quality, reliability, and performance across pipelines and datasets.

Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.

$123,696–$254,667/yr
US

  • Design and implement robust data infrastructure in AWS, using Spark with Scala.
  • Evolve our core data pipelines to efficiently scale for our massive growth.
  • Store data in optimal engines and formats, matching your designs to our performance needs and cost factors.

tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification.

Costa Rica

  • Analyze large, complex datasets to generate actionable business insights.
  • Develop and optimize advanced SQL queries, joins, and data transformations.
  • Create dashboards and reports using data visualization tools like Tableau, Power BI, or Looker.

Coforge is a company that provides business solutions. They aim to hire professionals based solely on their skills and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.

$75,000–$110,000/yr
US 5w PTO

  • Support the architecture, design, and development of scalable analytics and reporting solutions across enterprise data platforms.
  • Partner with business stakeholders to define analytical strategies, frame problems, and deliver insights that drive decision-making.
  • Design and implement end-to-end data pipelines and workflows using modern big data and cloud technologies.

Cotiviti provides payment accuracy and analytics-driven solutions, focusing on healthcare and retail sectors. They are committed to fostering a diverse and inclusive environment where team members can grow and thrive.

$70,560–$81,120/yr
Global

  • Enable efficient data access by creating and maintaining data pipelines.
  • Collaborate with ML engineers to design and maintain automation for machine learning training, quality assessment, and model release process.
  • Build data infrastructure from the vast amount of data for analytics, hypothesis testing and company metrics.

Eneba is building an open, safe, and sustainable marketplace for gamers. Their marketplace supports close to 20m+ active users and provides trust and safety.

$100,649–$174,459/yr
US 4w PTO

  • Independently deliver analytical projects across the consumer credit lifecycle, including acquisition, account management and collections
  • Build statistical and machine learning models through all phases of development, from design through training, evaluation, validation and implementation
  • Use a broad set of technologies: SQL, PySpark, Python, AWS and more to obtain insights from large volumes of data

Experian is a global data and technology company, powering opportunities for people and businesses around the world. We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more. They have an amazing team of 25,200 people in 32 countries.

$190,000–$280,500/yr
US Canada

  • Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
  • Architect and evolve our CI/CD processes - enhancing the testing environment and observability.
  • Enhance our Claude Code / LLM development support capabilities - creating tools / skills / agents that give our LLMs more context and help us continually improve their abilities to debug, create code, and maintain systems.

Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees and delivers peace of mind and enhances everyday family life.

Global

  • Design and build end-to-end data pipelines across the RAW, Silver, and Gold layers of the Medallion Architecture.
  • Architect data ingestion, transformation, standardization, and serving processes, that structure data flows from diverse and heterogeneous sources into a coherent analytical foundation.
  • Model data for analytical consumption following Data Warehouse best practices, including Star Schema design and dimensional modeling suited for business intelligence and AI-readiness.

CI&T is a tech transformation specialist, uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters around the world, they’ve built partnerships with more than 1,000 clients during their 30 years of history, valuing diverse identities and life experiences.

Global 6w PTO

  • Development of various services in Python: integration with marketing partners, obtaining data from various sources.
  • Creation and support of processes on Airflow.
  • Supporting the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.

Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world and we are proud to be a two-time “Great Place to Work” winner.

$65,705–$87,606/yr
Canada

  • Design, build, and maintain scalable data infrastructure using modern cloud technologies.
  • Develop robust batch and streaming data pipelines to ingest, process, and serve data.
  • Contribute to the implementation of a modern data lakehouse architecture.

Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. The system identifies the top-fitting candidates and shares this shortlist with the hiring company.

Global

  • Design, develop, and maintain data pipelines using Azure Databricks.
  • Build and optimize data transformations using PySpark and SQL in Databricks.
  • Implement and maintain Lakehouse architectures using Delta Lake.

Miratech helps visionaries change the world with enterprise and start-up innovation, supporting digital transformation for some of the world's largest enterprises. They are a values-driven organization with nearly 1000 full-time professionals and an annual growth rate exceeding 25%.

US EMEA

  • Design, build, and maintain distributed data pipelines that power Spotify Wrapped data stories and personalized experiences for more than 300M users globally.
  • Partner with Data Scientists to evaluate and operationalize new Wrapped story concepts, balancing personalization, scalability, and eligibility requirements.
  • Build scalable systems that process large-scale listening data and generate insights that celebrate users’ unique listening journeys.

The Personalization team makes deciding what to play next easier and more enjoyable for every listener. They are behind some of Spotify’s most-loved features. Join them and you’ll keep millions of users listening by making great recommendations to each and every one of them.

Latin America

  • Develop and maintain ETL processes using Python (PySpark) within Azure Synapse Analytics to ensure efficient data handling.
  • Design data storage structures using expertise in data warehousing and extract data from sources like REST APIs, SQL tables, and CSV files.
  • Collaborate with teams to understand data requirements, implement quality checks, and ensure data security and compliance.

Bluelight is a leading software consultancy that designs and develops innovative technology to enhance users' lives. The company fosters a collaborative environment, has a presence across the United States and Central/South America, and is in an exciting phase of expansion.

US

  • Collaborate with stakeholders to gather reporting and data infrastructure requirements.
  • Design, build, and maintain automated dashboards and scalable analytics infrastructure.
  • Develop, optimize, and maintain large-scale ETL pipelines for campaign reporting and analytics.

ItD blends diversity, innovation, and integrity with real business results as a woman- and minority-led firm. They reject any strong hierarchy, empowering them to deliver excellent results and thrive in a dynamic environment with empowerment and recognition.

Canada

  • Be the Analytics Engineering lead within the Sales and Marketing organization.
  • Be the data steward for Sales and Marketing: architect and improve the collection of underlying data.
  • Develop and maintain robust data pipelines and workflows for data ingestion, processing, and transformation.

Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and millions of daily active unique visitors, Reddit is one of the internet’s largest sources of information.

India

  • Design scalable data pipelines and backend systems from the ground up.
  • Leverage AWS and GCP for real-time and batch processing.
  • Manage databases and Data Warehouses, optimizing ETL workflows.

Delivery Solutions, a UPS company, is looking for a Senior Data Engineer to join their team. They are a growing company.

Mexico

  • Architect and develop Analytics and Reporting solutions.
  • Implement and configure reporting technologies like Tableau and MicroStrategy.
  • Drive automation capabilities into reporting solutions.

Cotiviti helps healthcare payers identify and recover overpayments. They offer a competitive benefits package and value innovation, collaboration, and making a difference in healthcare.