Job Description

The Institutional Data Initiative (IDI) is a new research center working to advance society’s relationship with knowledge by expanding access to, and deepening our understanding of, the data that underpins AI. We do this by collaborating with library, government, and academic institutions to publish their knowledge collections as AI training sets. The technical capabilities of our Principal Engineers define the depth of analysis and inquiry at IDI, and Principal Engineers develop and deploy repeatable methods and pipelines. The person in this role will think creatively about extracting and manipulating data to unlock knowledge collections that have been stubbornly inaccessible, sometimes for centuries. Their understanding of machine learning and AI fundamentals will help them identify areas of high impact and use models to facilitate this work. Beyond data, Principal Engineers also help build a community around IDI’s work to enable outside collaborators.

As a Principal Engineer, you will: develop, refine, and evaluate methods for analyzing and augmenting corpora; research, train, and evaluate machine learning models; write and contribute to open-source software; provide technical leadership; build and lead development of multiple discrete projects; draft technical communications; be a technological ambassador; and engage with partners.

About Harvard University

By working at Harvard University, you join a vibrant community that advances Harvard's world-changing mission in meaningful ways and inspires innovation and collaboration.
