Text Mining the Hathi Trust

Text Mining the Hathi Trust

This workshop will introduce students to data analysis using HathiTrust’s repository (containing over 15 million books), especially its extracted features datasets, as well as its Data Portal’s tools for accessing copyrighted data. Students will learn not only how to navigate HathiTrust’s search functionalities, but also to build their own custom datasets. It is an onerous process to access HathiTrust’s data analytics tools, and we will walk through each step of accessing a Data Capsule. We will experiment with both Voyant-Tools for user-friendly text analysis, as well as some easy to adapt scripts in R for text analysis.

There will be four workshops in this series. You are not required to come to everyone, but consistent attendance is encouraged.

Date:
Thursday, February 21, 2019
Time:
10:00am - 11:00am
Location:
Paley Library Digital Scholarship Center (DSC)
Campus:
Main Campus
Registration has closed.