Intermediate Data Cleaning Using Basic Programming
Tuesday, April 23, 2019 - 10:00am to 11:30pm
Room 308, Howard-Tilton Memorial Library
In this intermediate-level workshop, attendees will explore possibilities for more advanced data cleaning and validating using a free, open-source tool (OpenRefine). Basic OpenRefine functionality such as splitting cell data and clustering will be quickly reviewed. However, the focus of the workshop will be using Jython (Java-based Python) scripts, regular expressions, and external validation tools.
Requirements: Knowledge of spreadsheets and basic data structures.
Note: This session is 1.5 hours; We recommend bringing your own laptop so that you can take the configured software with you when you go.
Live stream via Zoom: https://tulane.zoom.us/j/659499859?pwd=M05qZ3J3MndIVGUwM1RueEJxLzJsQT09
Password for stream: Cleaning
Presenter: Rachel Tillay