Job Description
Develop scalable, robust infrastructure and processes using tools such as Airflow, Spark, and Kafka. Implement systems to detect and address potential data issues promptly. Assist in designing and implementing solutions to track and manage data across pipelines. Contribute to the design and improvement of the shared data platform, enabling critical use cases such as product analytics, bot detection, and image classification. Identify and implement improvements in system reliability, maintainability, and performance.
About Wikimedia Foundation
The Wikimedia Foundation operates Wikipedia and the other Wikimedia free knowledge projects. Its vision is a world in which everyone can freely share in the sum of all knowledge.