Data Curation: Collecting, Organizing, and Managing
The old computer-science maxim “garbage in, garbage out” applies strongly to digital humanities projects today. Much of the data you obtain during the research process will need to be organized, reformatted, and cleaned before it can be analyzed in visualization and mapping platforms. Participants in this three-part workshop will transform a messy Twitter dataset, gathered by the DSC, into a workable spreadsheet. Session 1 will focus on downloading the data and converting it into an appropriate file format. Session 2 will use this dataset to explore file management systems and basic spreadsheet (Excel) cleaning techniques. Session 3 will introduce participants to the more advanced cleaning application OpenRefine.
There will be three workshops in this series. You are not required to come to every one, but consistent attendance is encouraged.
Tuesdays at 10AM: September 4, 11, and 18
Related LibGuide: Data Management by Gretchen Sneff
- Dates & Times:
  - 10:00am - 11:00am, Tuesday, September 4, 2018
  - 10:00am - 11:00am, Tuesday, September 11, 2018
  - 10:00am - 11:00am, Tuesday, September 18, 2018
- Paley Library Digital Scholarship Center (DSC)
- Main Campus