Remote Data Jobs · North America

Job listings

  • Design, implement, and maintain distributed ingestion pipelines for structured and unstructured data.
  • Build scalable ETL/ELT workflows to transform, validate, and enrich datasets for AI/ML model training and analytics.
  • Support preprocessing of unstructured assets for training pipelines, including format conversion, normalization, augmentation, and metadata extraction.

Meshy is a leading 3D generative AI company transforming content creation by enabling the creation of 3D models from text and images. They have a global team distributed across North America, Asia, and Oceania and are backed by venture capital firms like Sequoia and GGV, with $52 Million in funding.

  • Devising and reporting on integration development plans and strategies.
  • Implement robust and innovative architectures that leverage the full potential of ServiceNow’s Workflow Data Fabric to support data ingestion.
  • Act as an SME to solve complex user issues related to Integrations solutions.

ServiceNow, founded in 2004, provides AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. They offer an intelligent cloud-based platform that connects people, systems, and processes to empower organizations.

$0–$200,000/yr

  • Architect and maintain robust data pipelines to transform diverse data inputs.
  • Integrate data from various sources into a unified platform.
  • Build APIs with AI assistance to enable secure access to consolidated insights.

Abusix is committed to making the internet a safer place. They are a globally distributed team that spans multiple countries and thrives in a culture rooted in trust, ownership, and collaboration.

$113,174–$171,720/yr

  • Contributing groundbreaking research with a particular focus on Wikipedia’s core content policies, the integrity of its knowledge and communities, and the resilience of its model.
  • Discovering, reading, and evaluating existing research and press coverage on Wikipedia content and its core content policies
  • Designing and executing thorough experiments to collect, clean, and analyze data, and/or to develop and evaluate machine learning models

The Wikimedia Foundation operates Wikipedia and other Wikimedia free knowledge projects with the vision of a world where everyone can freely share knowledge. It is a charitable, not-for-profit organization that relies on donations and has offices in San Francisco, California.

  • Design complex LLM prompts that accurately represent real customer journeys and service interactions.
  • Partner with Field Engineers to transform raw data into structured, high-quality tasks for model training.
  • Annotate and review tasks to ensure strict quality standards and alignment with expected customer outcomes.

Welo Data works with technology companies to provide datasets that are high-quality, ethically sourced, relevant, diverse, and scalable to supercharge their AI models.

$140,000–$220,000/yr

  • Partner closely with AI Decisioning customers and internal engineering teams.
  • Diagnose model behavior, tune ML levers, and analyze incrementality.
  • Explain insights to marketers and executives.

Hightouch is the modern AI platform for marketing and growth teams, partnering with industry leaders like Domino’s, Chime, Spotify, Ramp, Whoop, Grammarly, and over 1000 others.