Slides and some thoughts on a talk about reproducibility

noamross · January 27, 2016, 11:54pm

I recently gave a presentation on reproducibility to my organization: “Reproducibility from a Mostly Selfish Point of View.” You can find the slides here on figshare.

I am grateful to the following for source material and inspiration:

Florian Markowetz’s great article “Five selfish reasons to work reproducibly”
Karl Broman’s reproducible research course introduction
A series of great suggestions on the literature from Gabe Becker
ROpenSci’s reproducibility guide
Titus Brown’s latest post on the SWC listserv

I broke the talk into two sections: “Why” and “How”. Why was, as the title suggests, primarily focused on the benefits of reproducibility to us, and I proceeded from avoiding negatives (risk avoidance) to creating positives (more impact). In How I tried to be very high-level, talking about major concepts in reproducibility, and then talking generally about the tools that I have used for each, emphasizing that they may not be the right tools for everyone. Then we had a discussion about the most promising areas and tools to start with.

This went remarkably well. A few quick thoughts:

Many people are primed for this topic. The steady drum of reproducibility-related stories in the science press over the past few years has heightened awareness of this stuff.
Despite my avoiding a focus on openness, open-data and code mandates came up a lot, because people want to reduce the effort involved in getting code and data in shape to share per these requirements.
The biggest response in terms of positives came from talking about impact, rather than risk management or productivity (though those resonated as well). As I put it, “If we’re going to share, let’s share impressively.” There was a lot of talk about new things we could do, and new audiences we could reach, with additional research products that emerge from reproducible workflows. An example I like is Andrew Rambaut’s MERS data. Rambaut did the work of aggregating this for his own analyses, but by releasing (and updating) the data set with a nice little D3 data viewer on top of it he provided the community with a great tool and increased the visibility of his own work.

jhollist · January 28, 2016, 1:09am

Noam,

This is all fantastic, and timely! I roped myself into doing something similar, albeit for a different crowd: Fed Managers. That being said, a lot of what you include is going to be relevant even for them, especially the last three slides. I think including that information – caveats, realizing that reproducibility occurs along a gradient, and identifying resources required is important. Anyway, thanks for sharing this. I plan to borrow heavily!

Cheers,
Jeff

p.s. links to Karl’s course materials and the reproducibility guide is busted.

sckott · January 28, 2016, 6:15am

@noamross nice, interesting stuff

I think I missed how this connects to reproducibility - maybe I didn’t read the slides closely enough can you explain?

p.s. hope you don’t mind, fixed two broken links in your post

noamross · January 28, 2016, 2:51pm

The slides are pretty sparse, as I mostly just riffed on them, so I don’t think you missed anything. The point I was making was this: When you do you science in reproducible fashion, every step along the way is a potential output, not just the manuscript. It’s a much smaller leap to dress up and publish your data set, workflow or method when they are prepared this way.

(Thanks for fixing the links!)

Topic		Replies	Views
Implementing Increased Transparency and Reproducibility in Economics General Q&A reproducibility , economics	0	618	February 7, 2021
Community Call - Reproducible Research with R Blog reproducibility , openscience , community-call	6	1235	August 1, 2019
Workflow best practices General Q&A	11	1612	December 16, 2021
rOpenSci \| Teaching targets with Penguins Blog	0	236	July 20, 2023
rOpenSci \| targets: Democratizing Reproducible Analysis Pipelines Blog	0	317	February 5, 2021

Slides and some thoughts on a talk about reproducibility

Related topics