Architect batchโฏ+โฏstream pipelines (Airflow, Kafka, dbt) for diverse structured and unstructured marked data. Provide reusable SDKs in Python and Go for internal data producers. Implement and tune S3, columnโoriented and timeโseries data storage for petabyteโscale analytics; own partitioning, compression, TTL, versioning and cost optimisation. Develop internal libraries for schema management, data contracts, validation and lineage.
Job listings
The Data Science team at Fieldguide leverages novel, proprietary datasets to build Fieldguide Insights โ a product that delivers new-to-the-industry visibility into Audit and Advisory execution and performance. You'll work closely with product, design, engineering, and customer teams to build analytics and data science models that uncover key performance drivers.
Join the Data Engineering team to design, build, and scale AI-driven features at Filigran. The mission involves transforming real-world security problems into intelligent product capabilities. Responsibilities include designing ML models, maintaining AI/ML pipelines, and collaborating with engineers to integrate into open-source platforms.
Develop state-of-the-art models for information retrieval as part of our Search team. Focus on advancing semantic search techniques to improve accuracy and efficiency, involving working with a wide range of novel technologies. Collaborate with other teams to integrate your work into our search infrastructure. Engage in research collaborations and publish your work in top-tier conferences and journals.
This role combines advanced data analytics with Docebo LMS administration to provide actionable insights that drive training effectiveness and organizational performance. The Learning Analyst will design and implement comprehensive reporting frameworks, develop key learning metrics, and leverage data analytics tools to transform learning data into strategic business intelligence. Working closely with cross-functional teams, this role ensures our learning initiatives are effectively tracked and optimized.
This role support the implementation and automation of key rotation policies and secure credential storage across our cloud infrastructure. You will collaborate with the Data Engineering team to ensure proper access control and encryption practices are enforced in all ETL pipelines and data services and assist in developing monitoring and alerting systems for key usage and rotation schedules.
Lead a talented team of data scientists who will drive design, development, and deployment of autonomous vehicle performance KPIs ranging from straightforward to advanced machine learning outputs. Part of the Safety Assurance for Effective Autonomous Driving Software (SAFE-ADS) department, this position serves as the central body for automated driving system (ADS) safety. The ideal candidate will be an expert in data science, statistical modeling, and data development across the entire data maturity curve.
A highly skilled Senior Data Operations Engineer is needed to join the team to design and implement robust data architectures using Databricks, AWS, and Azure platforms. The ideal candidate will have experience in big data technologies, cloud platforms, and data engineering to provide data pipeline support, operations and engineering across multiple data domains and data platforms.
As a U.S. Department of Defense SkillBridge fellow or intern, you will be embedded within functional and cross functional teams directly supporting our lines of business. You will be equipped, trained, and treated as one of us. Candidates are matched to teams within engineering and data science divisions and involved in day-to-day operations and tasks.
We are seeking a highly skilled Lead Data Engineer with strong expertise in PySpark, SQL, and Python, as well as a solid understanding of ETL and data warehousing principles. The ideal candidate will have a proven track record of designing, building, and maintaining scalable data pipelines in a collaborative and fast-paced environment.