Source Job

India

  • Design, build, and scale cloud-based data platforms that support critical business insights and operations.
  • Collaborate with cross-functional teams to extract, load, and transform data using cloud-native principles.
  • Mentor engineers and drive architectural direction through code reviews, testing, and documentation.

Python Apache Spark Databricks AWS Data Modeling

20 jobs similar to Staff Software Engineer, Data

Jobs ranked by similarity.

Europe

  • Design and deliver end-to-end data platforms for analytics, BI, machine learning and AI-ready data products
  • Build and optimise scalable ETL/ELT pipelines with Databricks, Spark/PySpark, SQL and Python
  • Apply data quality, governance and security standards across the platform and mentor engineers

Tieto Tech Consulting provides design-led, data-centric, and AI-powered digital engineering & consulting services to enterprises worldwide. They focus on diversity, equity, and inclusion, fostering an inspiring workplace with a global team.

US

  • Build scalable Python-based data pipelines and backend services for analytics workflows.
  • Design software systems using object-oriented programming and sound engineering practices.
  • Create and support platforms for analytics development, model training, and model deployment.

Experian is a global data and technology company that powers opportunities for people and businesses worldwide across markets like financial services, healthcare, and automotive. With a team of 25,200 people in 32 countries, Experian invests in advanced technologies and its people to unlock the power of data.

Canada

  • Architect Spark-driven workflows at scale and design data platforms as products for internal teams.
  • Develop and maintain end-to-end data pipelines and backend ingestion workflows across multiple sources.
  • Champion Samsara's cultural principles and mentor junior team members to drive data-driven decisions.

Samsara is the pioneer of the Connected Operations Cloud, enabling organizations to harness IoT data for actionable insights to improve safety, efficiency, and sustainability. As a recently public company, it fosters a culture of rapid career development, ownership, and high performance.

Brazil

  • Support the development of scalable data pipelines and platform components, following established frameworks and guidance from senior engineers.
  • Apply software engineering best practices, including coding standards, version control, testing, and documentation, to deliver reliable and maintainable code.
  • Collaborate with engineers, product owners, and cross-functional teams in an agile environment to support feature development and delivery commitments.

Experian is a global data and technology company that powers opportunities for people and businesses around the world. With 25,200 employees across 32 countries, it has a people-centric, inclusive, and purpose-driven culture recognized as a World's Best Workplace.

Brazil

  • Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
  • Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
  • Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.

This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.

Europe

  • Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
  • Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
  • Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.

InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.

Canada

  • Architect and lead the implementation of an enterprise lakehouse on Databricks across major clouds.
  • Design scalable batch and streaming data pipelines using PySpark, Spark SQL, and Delta Live Tables.
  • Define and enforce platform standards for data modeling, CI/CD, governance, and cost optimization.

Bounteous is a premier end-to-end digital transformation consultancy partnering with ambitious brands to create digital solutions. With over 4,000 expert team members across the Americas, APAC, and EMEA, we deliver innovative solutions in Strategy, Analytics, Digital Engineering, Cloud, Data & AI, Experience Design, and Marketing.

Europe

  • Design, build, and maintain backend services, REST APIs, databases, and big data pipelines that power customer-facing insights and analytics.
  • Implement and maintain near-real-time stream-based data processing pipelines in collaboration with batch-oriented data refresh workflows.
  • Scale data processing and insights generation pipelines to handle growing volumes of activity data while managing infrastructure costs.

Backstory helps companies understand the state of their revenue business by answering questions that span customer interactions, sales activity, pipeline health, and deal execution. Headquartered in San Francisco, CA, Backstory is backed by Y Combinator and top investors, and is listed in the top 20 percent of Inc 5000 companies.

Canada

  • Design, build, and operate high-scale data ingestion and replication systems from production data stores into the data lakehouse.
  • Build and maintain reliable, scalable data platform infrastructure capable of handling petabytes of data across analytics, AI, and operational use cases.
  • Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams move and access data safely.

Samsara is the pioneer of the Connected Operations Cloud, enabling organizations that depend on physical operations to harness IoT data for actionable insights. As a publicly traded company, Samsara fosters a growth-oriented culture and serves industries that represent over 40% of global GDP.

US

  • Lead workspace architecture, Unity Catalog governance, and cluster policy design for client tenant organizations.
  • Perform tenant discovery, requirements gathering, source profiling, and security classification for new data intake requests.
  • Develop end-to-end technical designs for tenant onboarding, including Data Sharing Agreements and SLA documentation.

M9 Solutions provides IT services and solutions to the Federal Government, mobilizing skilled people and technologies for improved performance and sustainable change. With 15+ years of proven delivery and growth, the company has been recognized as an Inc. 5000 Fastest-Growing Private Company multiple times and values diverse perspectives.

India 5w PTO 26w maternity 2w paternity

  • Design, build, and launch sophisticated data models and visualizations supporting multiple products.
  • Optimize pipelines, frameworks, and systems for easier development of data artifacts.
  • Collaborate with cross-functional teams and embody core values such as ownership and customer focus.

Outreach provides the only complete agentic AI platform for revenue teams. The company is used by world leading enterprises like Databricks, SAP, Siemens, and Verizon and promotes a culture of diversity and inclusion.

Global Unlimited PTO

  • Architect and maintain cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases.
  • Design and automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI.
  • Manage end-to-end data lifecycles including ingestion, storage, processing, and delivery of structured and unstructured data.

Trustonic makes smartphones affordable for the many, enabling global access to devices and digital finance through secure smartphone locking technology. They partner with mobile carriers, retailers, and financiers across 30+ countries, and pride themselves on a diverse, inclusive culture that values doing the right thing for each other, the community, and the planet.

Canada

  • Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
  • Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
  • Define data governance policies and leverage AI tools to streamline data pipeline development.

For over four decades, PAR Technology Corporation has been a leader in restaurant technology, empowering brands worldwide to create lasting connections with their guests. With over 100,000 restaurants in more than 110 countries, we embrace a 'Better Together' ethos and offer comprehensive software and hardware solutions.

Brazil

  • Lead and support Data Engineering teams, driving technical excellence and mentoring engineers in scalable data solutions.
  • Design and oversee data solutions using Databricks, Apache Spark, and cloud platforms like AWS, Azure, or GCP.
  • Collaborate with clients and stakeholders to define technical strategies and ensure high-quality, business-driven outcomes.

CI&T helps large enterprises transform AI potential into real business impact with AI deployment, AI-native execution, and tech-integrated solutions. With 30 years of experience and 8,000 employees across 25+ countries, they foster a collaborative culture focused on innovation and growth.

Europe

  • Build and scale data infrastructure powering targeting, identity, and measurement capabilities.
  • Optimize core ETL/ELT pipelines and ensure operational reliability with documented SLAs.
  • Implement privacy-compliant data methodologies meeting GDPR/CCPA standards.

Kargo creates powerful moments of connection between brands and consumers to build businesses. With 600+ employees and offices across the US, UK, Australia, and Ireland, they take a creative science approach to deliver unique ad experiences across premium platforms.

UK

  • Build and maintain data pipelines for analytics, ML, and product applications.
  • Design scalable data infrastructure with a focus on quality and observability.
  • Collaborate with cross-functional teams to understand data needs and implement solutions.

Prolific builds human data infrastructure to power the next wave of AI innovation. They are a remote-first company focused on ethical data collection and mission-driven culture.

US 4w PTO

  • Leverage test-driven development to deliver backend systems and user interfaces for healthcare data integration.
  • Design, implement, and maintain data models, ETL processes, and APIs for performance and scalability.
  • Contribute to automated testing suites and optimize data operations for integrity and security.

Bellese is a mission-driven digital services company pioneering innovative technology solutions in civic healthcare. With a collaborative, remote-first culture, the team is focused on improving public health outcomes through service design and skilled engineering.

US Unlimited PTO

  • Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
  • Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
  • Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.

YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.

United States

  • Lead the design and evolution of scalable financial data systems supporting commissions, incentives, and payments.
  • Build and maintain robust data pipelines using Python, SQL, Spark, and Terraform for accuracy and performance.
  • Define technical strategy and roadmap for financial data operations in collaboration with stakeholders.

Our partner is a fast-growing technology company building financial data infrastructure for insurance operations. They have a remote-friendly work environment and emphasize engineering excellence and cross-functional collaboration.

United States

  • Develop customer engagement strategies and coach junior Solutions Architects on use case prioritization.
  • Influence stakeholders at all levels through complex engagements with the wider cloud ecosystem.
  • Contribute to Databricks' technical community engagement by leading workshops, seminars, and meet-ups.

Databricks is the data and AI company, providing a unified platform for data, analytics, and AI. More than 10,000 organizations worldwide, including over 50% of the Fortune 500, rely on Databricks, and the company fosters a culture of proactiveness, customer-centricity, and innovation.