Remote Data Jobs · Python

Job listings

$75,000–$90,000/yr

  • Develop and maintain capacity models for compute, storage, and network infrastructure across global environments.
  • Build and productionize advanced time‑series forecasts (e.g., ARIMA/ETS, Prophet, XGBoost/LightGBM) to predict demand, saturation points, and runway.
  • Conduct scenario modeling (“what‑if”) on deployment plans, workload changes, demand spikes, and hardware refresh strategies.

Vultr is on a mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible for enterprises and AI innovators around the world. With 32 global cloud data center locations, Vultr is trusted by hundreds of thousands of active customers across 185 countries.

$147,900–$203,000/yr
US 4w PTO

  • Drive the development of next‑generation foundation models for wearable biosignals that power Oura’s health sensing features.
  • Design and validate deep learning approaches for large‑scale wearable time‑series and running rigorous evaluations on research and clinical datasets.
  • Partner with cross‑functional teams to translate evidence into shipped features and scientific publications.

Oura's mission is to empower every person to own their inner potential. Their award-winning products help their global community gain a deeper knowledge of their readiness, activity, and sleep quality by using their Oura Ring and its connected app. Oura is a quickly growing company focused on helping people live healthier and happier lives, and they ensure that their team members have what they need to do their best work — both in and out of the office.

  • Lead, mentor, and develop a high-performing data engineering squads delivering production-grade pipelines and services.
  • Set technical and operational standards for quality, documentation, and reliability.
  • Partner with Program Management to plan, prioritise, and track delivery against sprint goals.

Forbes Digital Marketing Inc. is a high-growth digital media and technology company dedicated to helping consumers make confident, informed decisions about their money, health, and everyday life. We combine data-driven content, rigorous experimentation, and modern engineering to power a portfolio of global products and partnerships.

  • Design the Future of Ads Identity: Develop/employ probabilistic models for identity resolution.
  • Advance Lift Methodologies & Experimentation: Own the statistical rigor behind Reddit’s Brand and Conversion Lift products.
  • Maximize Signal for Predictive Performance: Define the strategy for new signal sources.

Reddit is a community of communities built on shared interests and trust, fostering open conversations. With over 100,000 active communities and approximately 116 million daily active unique visitors, Reddit is a significant source of information.

  • Design, develop, and apply DS solutions to inform improvements in advertiser experience and Reddit's ad platform
  • Analyze large-scale datasets to identify trends, patterns, and insights that can be used to improve the effectiveness of our advertising platform
  • Collaborate with product managers and engineers to define product requirements and translate them into data science solutions

Reddit is a community of communities built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and approximately 116 million daily active unique visitors, Reddit is one of the internet’s largest sources of information.

  • Analyze large-scale datasets to identify trends, patterns, and insights that can be used to improve the effectiveness of our advertising platform
  • Develop ML models & DS methods to for improved anomaly detection, prediction, pattern recognition
  • Communicate findings and recommendations to stakeholders across the organization

Reddit is a community of communities built on shared interests, passion, and trust, and is home to open and authentic conversations. With 100,000+ active communities and approximately 116 million daily active unique visitors, Reddit is one of the internet’s largest sources of information.

$57–$72/hr

  • Lead development of advanced models and experimentation frameworks.
  • Design, build, and maintain end‑to‑end machine learning pipelines in a cloud based framework.
  • Develop and refine NLP pipelines to extract structured information from unstructured clinical documentation.

Emory Healthcare is an academic medical center located in Atlanta, Georgia. They are committed to providing reasonable accommodations to qualified individuals with disabilities and is an equal opportunity employer.

US Unlimited PTO

  • Developing and enhancing the Redbox integrated data layer across Measurement, Audience, Optimization, and Insights.
  • Partnering with clients, account leadership, strategy, planning, and Redbox team leads to develop best-in-class Artificial Intelligence strategies and solutions.
  • Ensuring key Redbox data infrastructure is resilient and future-proofed.

Crossmedia is a global media independent committed to doing media and business the right way via TRUST, REASON and the pursuit of HAPPINESS. Crossmedia US was founded in NY in 2000 and is one of the largest minority-owned full-service media planning & buying agencies in the nation with 500+ Crossmedians worldwide.

  • Apply data engineering best practices to analytics code.
  • Build and maintain composable data models and optimize SQL query performance.
  • Transform raw data into business insights and create data visualizations.

Newsela is an instructional content platform that supercharges reading engagement and learning in every subject. I don't have enough information to infer company size/culture.