Text Mining the Hathi Trust
This workshop will introduce students to data analysis using HathiTrust’s repository (containing over 15 million books), especially its extracted features datasets, as well as its Data Portal’s tools for accessing copyrighted data. Students will learn not only how to navigate HathiTrust’s search functionalities, but also to build their own custom datasets. It is an onerous process to access HathiTrust’s data analytics tools, and we will walk through each step of accessing a Data Capsule. We will experiment with both Voyant-Tools for user-friendly text analysis, as well as some easy to adapt scripts in R for text analysis.
There will be four workshops in this series. You are not required to come to everyone, but consistent attendance is encouraged.
- Thursday, January 31, 2019 Show more dates
- 10:00am - 11:00am
- Paley Library Digital Scholarship Center (DSC)
- Main Campus