Data Cleaning



Mar 15, 3:30 – 5:00 pm
Center for Digital Humanities
Firestone Library, Floor B

Do you have messy data? Is the mess getting in the way of your analysis? Does Excel crash whenever you open *that* file? Don’t despair! Help is on the way. Center for Digital Humanities graduate fellow Phil Gleissner is hosting a one-hour workshop to help you get past the mess in your data set and on to the analysis and visualizations you actually want to be doing. We will be using OpenRefine, the open-source power tool for data cleaning. This is a hands-on workshop, so please bring your own laptop with OpenRefine pre-installed, if possible. (Excellent video tutorials are also available. If you have difficulty installing OpenRefine, come 30 minutes early and CDH staff will help you get up and running.) A sample (messy) data set will be provided, but participants are encouraged to bring their own data sets for consultation.