visdat, skimr, and assertr use case: Exploring and understanding a new data set

sharla · March 16, 2019, 4:14pm

Hi all! I gave a talk and wrote a corresponding blog post a couple of weeks ago around strategies for working with a new data set, and it just so happened that all of the packages I recommended are part of rOpenSci. So it makes sense to share it here (thank you @stefanie for the nudge)!

I used visdat, skimr, and assertr to demonstrate how I might approach a data set on TTC (Toronto’s public transit) subway delays, after discovering there are some funky (i.e., buggy) features in the data and wanting to learn more about the data + evaluate any assumptions I had about it.
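
For anyone who hasn't used the three packages together, here is a minimal sketch of what that kind of first pass can look like. It is not the analysis from the talk: it assumes R's built-in `airquality` data as a stand-in for the TTC delays data, and the specific checks (positive temperatures, months between 5 and 9, no missing days) are illustrative assumptions only. The real walkthrough is in the blog post linked below.

```r
library(visdat)
library(skimr)
library(assertr)

# Stand-in data: the built-in airquality data set (NOT the TTC delays data)
delays <- airquality

# 1. visdat: see column types and missing values at a glance
vis_dat(delays)
vis_miss(delays)

# 2. skimr: per-column summary statistics, grouped by column type
skim(delays)

# 3. assertr: write assumptions down as checks.
#    Each call returns the data unchanged if the check passes
#    and raises an error if it fails.
checked <- verify(delays, Temp > 0)                      # row-wise logical expression
checked <- assert(checked, within_bounds(5, 9), Month)   # predicate applied to a column
checked <- assert(checked, not_na, Day)                  # no missing values in Day
```

Going from plots to summaries to assertions moves from "what does this look like?" to "does it match what I assumed?", and the assertr step turns those assumptions into checks that fail loudly if the data changes later.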

Talk: Slides, GitHub Repo
Blog post: https://sharla.party/posts/new-data-strategies/

Both the talk and the blog post were very well received, so big thanks to Nick Tierney, @elinw, @michaelquinn32, and Tony Fischetti for building and maintaining fabulous tools!
