Source Job

United States

  • Own end-to-end design and reliability of large-scale data acquisition systems using AI and LLMs for self-healing pipelines.
  • Build and maintain data serving layers, ETL/ELT pipelines, and reporting systems for real-time insights.
  • Collaborate with engineering and product leadership to shape AI-native data infrastructure.

Python SQL GCP LLM Data Engineering

20 jobs similar to Senior AI Data Engineer

Jobs ranked by similarity.

US

  • Build full-stack web applications (FastAPI/Flask + React/TypeScript) that put data and AI into stakeholders' hands.
  • Integrate LLMs and ML for features like classification, extraction, summarization, copilots, and agentic workflows.
  • Deploy and operate applications on GCP, manage CI/CD, and own app security and product definition.

DDN is a global leader in AI and multi-cloud data management at scale, powering many of the world's most demanding AI data centers. The company has over two decades of innovation, a customer-centric culture, and a team of passionate professionals.

US Unlimited PTO

  • Own the gold data layer by transforming silver tables into curated, semantically rich datasets for AI model development.
  • Reverse-engineer data semantics by collaborating with product engineers, clinical experts, and analyzing SQL queries and stored procedures.
  • Build pipelines for reuse, automate quality filtering and synthesis, and maintain reproducible dataset snapshots.

PointClickCare is a leading health tech company that helps providers deliver exceptional care through AI and data. Privately held and founder-led, they serve over 30,000 organizations, were recognized as a top private cloud company by Forbes, and have one of Canada's Most Admired Corporate Cultures.

Canada

  • Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
  • Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
  • Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.

Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of legacy, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.

US 24w maternity 24w paternity

  • Design and build production-grade LLM-powered agents and workflows for enterprise-scale AI solutions.
  • Develop and optimize RAG pipelines, agent reasoning patterns, and evaluation frameworks to measure model quality.
  • Collaborate with Engineering, Product, and cross-functional teams to translate business requirements into impactful AI systems.

Smartsheet builds AI-powered strategic planning and work execution agents through SmartAssist, an intelligent agent platform. The company is a publicly traded, large enterprise with a collaborative, inclusive culture that values diverse perspectives and engineering rigor.

US 6w PTO

  • Build multi-agent AI systems and automation platforms for Marketing Operations at scale.
  • Design and implement LLM integrations, backend services, and agentic workflows.
  • Partner with cross-functional teams to identify high-impact automation problems and ship measurable solutions.

Grafana Labs is the company behind the open observability cloud, offering a fully managed platform with AI capabilities to help organizations monitor and optimize their systems. With over 1,600 team members across 40+ countries and backed by top investors, we maintain a 100% remote, collaborative culture rooted in open source and innovation.

India

  • Lead AI innovation by researching and prototyping solutions using LLMs and Computer Vision for complex data extraction.
  • Architect scalable, cost-effective AI services and data processing pipelines for processing millions of documents daily.
  • Act as a force multiplier by mentoring engineering teams and driving mission-critical initiatives to production.

AlphaSense provides AI-driven market intelligence and search to help companies make informed decisions. Founded in 2011, it employs over 2,000 people globally and is trusted by over 6,000 enterprise customers, including a majority of the S&P 500.

Canada

  • Design, build, and operate data pipelines processing terabytes of transactional data daily using Airflow, BigQuery, and GCP services.
  • Own end-to-end data models and transformations powering merchant analytics, operational reporting, and ML features.
  • Improve data quality, lineage, and observability through testing, alerting, and validation frameworks.

Narvar is building the data infrastructure behind the post-purchase experiences of hundreds of millions of consumers, powering analytics, ML, and merchant-facing products for over 1,500 brand partners. The company serves 125+ million consumers worldwide across 38 countries and 55 languages, fostering a culture of innovation, collaboration, and inclusivity.

Latin America Unlimited PTO

  • Develop long-term technical vision and design scalable data systems.
  • Build and maintain production data pipelines using Python and integrate external APIs.
  • Mentor engineers and uphold standards for engineering excellence.

Correlation One is the largest provider of AI and data workforce development programs globally, having trained over 500,000 professionals across 11 countries. They work with Fortune 500 enterprises and government agencies to close skills gaps, and foster a culture of empowerment and diversity.

UK

  • Build and maintain data pipelines for analytics, ML, and product applications.
  • Design scalable data infrastructure with a focus on quality and observability.
  • Collaborate with cross-functional teams to understand data needs and implement solutions.

Prolific builds human data infrastructure to power the next wave of AI innovation. They are a remote-first company focused on ethical data collection and mission-driven culture.

US 16w maternity 12w paternity

  • Orchestrate High-Velocity Workflows: Leverage advanced agentic coding tools (e.g., Cursor, multi-agent environments) to dramatically accelerate feature prototyping, code generation, and test coverage.
  • Own the Guardrails & Quality: Act as the ultimate reviewer and architect; define the specifications, establish repo-context guardrails, and review AI-accelerated output for hidden security risks, scale bottlenecks, and architectural alignment.
  • Build Scalable Application and Data Layers: Design, build, and maintain our data pipelines and application to service our hundreds of users.

EvolutionIQ provides technology to improve insurance claims handling. The company is experiencing massive growth and has been named a top workplace, prioritizing its team.

United States

  • Own the reliability of event-driven messaging with backpressure, idempotency, and dead-letter handling.
  • Build and operate infrastructure for LLM orchestration workloads at scale.
  • Maintain production support for CI infrastructure including on-call responsibilities and incident response.

Scorpion is a leading provider of technology and services for local businesses, helping them understand market dynamics and improve marketing. The company fosters a culture of constant improvement and unbeatable teamwork, valuing winning mindsets and genuine care.

Global

  • Design and architect AI capabilities on a cutting-edge iPaaS platform, working with technologies like LLM, RAG, Azure AI, and AWS Bedrock.
  • Build robust, scalable AI systems that run 24/7/365, collaborating with engineers, product management, and operations.
  • Mentor team members, use data-driven decision-making, and stay current with emerging AI and cloud computing trends.

Jitterbit is a leading data, application, and process workflow automation solution, rooted in iPaaS and fueled by an ambitious vision to integrate critical business processes. The company empowers enterprises of all sizes to accelerate their digital journey and is recognized in Gartner MQ for seven straight years, with a distributed, fun, fast-paced, and performance-oriented culture.

United States

  • Design and deliver production AI and agentic systems across document intelligence, workflow automation, and copilots.
  • Own architecture decisions for LLM-based systems, including retrieval, tool use, orchestration, memory, and evaluation.
  • Manage evals and observability for production AI, ensuring system accuracy and detecting regressions.

Maxwell is a mortgage technology and fulfillment company on a mission to make lending simpler, faster, and more accessible. It is a remote-first team that takes craft seriously and moves with intention, building a cutting-edge AI company in mortgage technology.

Global

  • Build and maintain agentic systems collecting and processing signals from social platforms, on-chain data, and ads ecosystems.
  • Design pipelines to ingest from external APIs, enrich data, and build semantic layers for both humans and AI agents.
  • Build full-stack applications and dashboards making intelligence accessible across client accounts.

Serotonin is a go-to-market firm for transformative technologies, specializing in marketing, strategy, recruiting, and legal services. With a global team of 90 across 15 countries, they have supported over 300 clients since 2020, offering end-to-end solutions across all major marketing channels.

Mexico

  • Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
  • Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
  • Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.

Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.

Global

  • Design and build LLM-powered AI components for internal tools and user-facing applications.
  • Develop systems for data retrieval, embeddings, and intelligent automation workflows.
  • Collaborate with cross-functional teams on AI use cases and contribute to technical vision.

Jobgether powers job matching using AI to connect candidates with opportunities. They emphasize a remote-first, collaborative culture with high autonomy and flat structure.

Asia

  • Design and scale backend services integrating Generative AI and RAG for production use.
  • Develop AI agents to automate workflows and build pipelines transforming data into actionable insights.
  • Optimize retrieval systems and collaborate with product and engineering teams to deliver AI features.

Beyondsoft is a leading mid-sized IT and consulting company that combines modern technologies and proven methodologies to tailor solutions. With a global presence spanning four continents, our diverse team thrives on innovation and collaboration.

Canada Unlimited PTO 12w maternity 12w paternity

  • Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
  • Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
  • Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.

Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.

US Canada

  • Ship AI to production by building tool-using LLM agents for grant discovery and drafting, with robust evaluation and observability.
  • Build trustworthy backends with high-quality, tested code, data pipelines, and reliability practices like alerts and incident response.
  • Collaborate with Product, Design, and GTM to scope features, run experiments, and raise engineering standards through clear code and reviews.

Instrumentl is a profitable, hypergrowth, YC-backed SaaS platform building the operating system for grant-funded organizations. More than 5,500 nonprofits use Instrumentl to discover, track, and win grant funding, and the company has grown over 40% year over year.

Brazil

  • Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
  • Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
  • Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.

This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.