visdat, skimr, and assertr use case: Exploring and understanding a new data set


Hi all! I gave a talk and wrote a corresponding blog post a couple of weeks ago around strategies for working with a new data set, and it just so happened that all of the packages I recommended are part of rOpenSci :nail_care: So it makes sense to share it here (thank you @stefanie for the nudge :purple_heart:)!

I used visdat, skimr, and assertr to demonstrate how I might approach a data set on TTC (Toronto’s public transit) subway delays. I had discovered some funky (i.e., buggy) features in the data, so I wanted to learn more about it and evaluate the assumptions I had made about it.

Talk: Slides, GitHub Repo
Blog post:

Both the talk and the blog post were very well received, so big thanks to Nick Tierney, @elinw, @michaelquinn32, and Tony Fischetti for building and maintaining fabulous tools!