Source Job

Global

  • Building pipelines that augment documents with metadata.
  • Building systems to ensure the reliability and accuracy of hundreds of web scrapers.
  • Optimizing and evaluating our core utils, which do things like extracting and resolving citations.

Python SQL PostgreSQL GCP

20 jobs similar to Senior Software/Data Engineer

Jobs ranked by similarity.

$145,000–$200,000/yr
US Unlimited PTO

  • Design and build ETL processes in collaboration with software and model development teams.
  • Create and maintain scalable data infrastructure.
  • Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.

OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.

Europe

  • Design, build, and own backend systems that transform raw enterprise data into structured representations.
  • Develop and maintain scalable data processing pipelines, semantic storage layers, and versioning systems.
  • Own features end-to-end, from data modeling and backend logic to API design and integration layers.

Pragmatike is recruiting on behalf of a fast-growing, AI-first enterprise software company building a next-generation semantic intelligence layer for large-scale business data. They value transparency, ownership, and long-term impact.

$160,000–$190,000/yr
US Canada Unlimited PTO

  • Own and maintain data pipeline architectures, ensuring reliability and monitoring.
  • Manage and evolve data modeling environments for analysts and engineers.
  • Implement observability for data systems, detecting issues early and continuously monitoring data quality.

Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.

$180,000–$290,000/yr
Americas Unlimited PTO 12w maternity 12w paternity

  • Own the scrape product end-to-end.
  • Make 'just works' actually true, pushing the 'just works' rate from great to unbeatable, one long-tail failure mode at a time.
  • Obsess over the output, not just the fetch.

Firecrawl is the easiest way to extract data from the web. Developers use them to reliably convert URLs into LLM-ready markdown or structured data with a single API call. They're a small, fast-moving, technical team building essential infrastructure superintelligence will use to gather data on the web.

Europe

  • Lead a team of 6-8 analysts, including hiring and performance management
  • Manage the team’s workload together with product managers
  • Ensure the quality of the extracted data

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, they surpassed $3B in revenue in their last fiscal year with extensive growth potential ahead.

$90,000–$120,000/yr
US 4w PTO

  • Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
  • Collaborate cross-functionally with AI/ML and Product teams to implement new features.
  • Proactively identify and resolve bottlenecks in our complex ETL processes.

Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.

India

  • Design scalable data pipelines and backend systems from the ground up.
  • Leverage AWS and GCP for real-time and batch processing.
  • Manage databases and Data Warehouses, optimizing ETL workflows.

Delivery Solutions, a UPS company, is looking for a Senior Data Engineer to join their team. They are a growing company.

Global

  • Design and implement batch and real time ingestion pipelines from internal and external sources.
  • Implement automated data quality checks, observability, and SLA monitoring.
  • Optimise datasets and pipelines for analytics, ML training, and API consumption.

Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.

$180,000–$225,000/yr
US

  • Instrument fal's core infrastructure to capture CPU, GPU, and request-level signals.
  • Build ingestion pipelines from partner APIs, compute vendors, and internal services into BigQuery.
  • Design and operate the ETL backbone that powers cost, margin, and usage analytics.

Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production at scale.

LATAM

  • Design, build, and maintain scalable data pipelines
  • Develop and optimize ETL processes to support data products
  • Work with structured and unstructured data across SQL and NoSQL systems

They are seeking a Data Engineer to support the development of data products that power critical business functions. They seem to have a collaborative, cross-functional Agile environment where you'll partner closely with technical and business teams to deliver high-quality data solutions.

Canada Unlimited PTO

  • Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
  • Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
  • Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.

Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.

$4,200–$5,200/mo
Global

  • Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
  • Process and integrate data from multiple formats and sources (JSON, CSV, XML).
  • Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.

I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.

$120,000–$210,000/yr
Europe

  • Enable systematic exploration and materially improve exploration success rates.
  • Build data pipelines and tooling for deriving advanced human and machine insights from exploration data.
  • Develop expertise in KoBold’s Data Systems and deeply understanding how they impact exploration.

KoBold builds AI models for mineral exploration and deploys those models to guide decisions in exploration programs. In the six years since founding, KoBold has become the largest independent mineral exploration company and the largest exploration technology developer.

US Canada

  • Design, build, and maintain backend services that power core business workflows
  • Own projects end-to-end, from early design through production and iteration
  • Design APIs and data models that are clear, stable, and easy to work with

Short Story is an award-winning, technology-powered retailer dedicated to petite women 5'4" and under. As a fast-growing startup, they're revolutionizing retail with a data-driven learning system that leverages customer feedback to create tailored products.

Canada

  • Be the Analytics Engineering lead within the Sales and Marketing organization.
  • Be the data steward for Sales and Marketing: architect and improve the collection of underlying data.
  • Develop and maintain robust data pipelines and workflows for data ingestion, processing, and transformation.

Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and millions of daily active unique visitors, Reddit is one of the internet’s largest sources of information.

Global

  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.

Speechify's mission is to make sure that reading is never a barrier to learning by offering text-to-speech products. They are a fully distributed company with nearly 200 employees around the globe, including engineers and scientists from top companies and programs.

India

  • Quickly get up-to-speed on Zscaler’s SecOps platform, utilizing Python and APIs to configure, customize, and automate data transformations and workflows.
  • Partner with cybersecurity subject matter experts (SMEs) to onboard new data pipelines and map diverse IT and security sources to fulfill specific customer use cases.
  • Proactively troubleshoot pipeline health and audit customer data across environments to identify quality issues, flag security gaps, and define clear remediation steps.

Zscaler accelerates digital transformation to ensure customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, they leverage the world’s largest security data lake to power their cloud-native Zero Trust Exchange platform. They build high-performing teams that can make an impact quickly and with high quality.

$120,000–$140,000/yr
US

  • Build and maintain data transformation pipelines with robust testing.
  • Design, implement, and maintain models with complex domain and business logic.
  • Optimize data storage and retrieval processes for improved performance and scalability.

Accorded is seeking experienced professionals to join their team. They are located in the San Francisco Bay Area, committed to creating a diverse and inclusive work environment and do not discriminate.

  • Shaping the Python language ecosystem with a strong product and platform mindset.
  • Architecting, building and delivering high-impact solutions that uplift the Python developer experience.
  • Advocating for Python engineering best practices across the organization.

Canva is a design platform that empowers users to create professional-quality graphics. They offer an inclusive culture with employees across multiple locations.