Job Description
We are seeking a Data Engineer with a focus on NLP to build and optimize the data pipelines that fuel our Ukrainian LLM and Kyivstar’s NLP initiatives. In this role, you will design robust ETL/ELT processes to collect, process, and manage large-scale text and metadata, enabling our data scientists and ML engineers to develop cutting-edge language models. You will work at the intersection of data engineering and machine learning, ensuring that our datasets and infrastructure are reliable, scalable, and tailored to the needs of training and evaluating NLP models in a Ukrainian language context.
This position involves designing, developing, and maintaining ETL/ELT pipelines for gathering, transforming, and storing text data, implementing web scraping and data collection services, setting up and managing cloud-based data infrastructure, automating data processing workflows, and maintaining analytical databases. You will also collaborate with Data Scientists and NLP Engineers to build data features and datasets for machine learning models, implement data quality checks, and manage data security and compliance.
The company offers the option to work remotely, performance bonuses, training opportunities, health and life insurance, a wellbeing program, and reimbursement of expenses for Kyivstar mobile communication.
About Kyivstar.Tech
Kyivstar.Tech is a Ukrainian hybrid IT company and a resident of Diia.City that aims to change lives by creating technological solutions and products.