Introduction to Data Cleaning

Anyone who works with data soon finds that a freshly gathered dataset often has problems, such as missing or incorrect information, along with other serious concerns. Data scientists therefore often need to clean their data before they begin their analysis. Dr. Andy Walsh will give an overview presentation on data cleaning, including available Python tools, from 2-2:50 in Thackeray 524 on Wednesday, January 28.


Speaker Bio: Dr. Walsh has a PhD in molecular microbiology and immunology from the Bloomberg School of Public Health at Johns Hopkins University. Andy also held a postdoc in Computational Biology at Carnegie Mellon, which led to the Pittsburgh start-up Health Monitoring Systems (HMS). HMS collects emergency room data and is contracted by the US Centers for Disease Control and Prevention, as well as many state governments, to detect potential outbreaks of syndromes (conversations with Andy in the fall of 2019 were very interesting!). Dr. Walsh is the Chief Science Officer at Health Monitoring Systems, where he develops statistical methods for public health surveillance. He is also the author of the book "Faith Across the Multiverse", which is an apology for science written for people of faith. Andy also contributed a very good chapter on Evolutionary Programming and Ant Foraging Optimization to Jeffrey Wheeler's Optimization text.
 

Tuesday, February 3, 2026 - 14:00 to 14:45

Thackeray 524

Speaker Information
Andy Walsh