We are seeking a Data Engineer with deep experience in managing and processing high-volume IoT data to enable the cloud-based training of machine learning models that power real-time inference on edge devices. In this role, you will architect and maintain cloud-based data infrastructure and pipelines that support ML workflows for training, validation, and deployment of machine learning models optimized for deployment in edge environments.
You will design data workflows to support model training, evaluation, and retraining cycles for deployment on edge devices. The candidate will architect and maintain scalable data pipelines to ingest, process, store, and access large volumes of structured and semi-structured RFID time-series data from edge networks. You will also implement robust ETL/ELT workflows for preparing data for cloud-based ML model training and evaluation. Monitor and optimize data pipelines for performance, reliability, and cost across edge-to-cloud infrastructure.