Job Description
We’re looking for a passionate and experienced Data Engineer to join our team and help us build scalable and efficient ETL pipelines in the Google Cloud Platform (GCP) ecosystem. The ideal candidate will bring strong experience in PySpark, Python, and SQL, with a solid understanding of data pipeline orchestration and performance optimization. You will play a key role in transforming raw data into actionable insights that drive business value.
Responsibilities include designing, building, and maintaining scalable ETL pipelines using PySpark and GCP-native tools, and working with GCP infrastructure and services (e.g., BigQuery, Dataflow, Cloud Composer, Cloud Functions, Pub/Sub). You will also develop and optimize data ingestion, transformation, and loading processes.
Other responsibilities include ensuring data quality, reliability, and performance across pipelines, as well as monitoring, troubleshooting, and enhancing data workflows and systems.
About Truelogic
Truelogic, headquartered in New York, is a leading provider of nearshore staff augmentation services, delivering technology solutions to companies of all sizes.