How do you review code that accompanies a research project or paper? Help rOpenSci plan a Community Call

cboettig · September 7, 2018, 3:34am

I’m percent with Hao’s comment about how refactoring makes the notion of unit tests pretty hard in research code. In software development, I often have a reasonable idea what the API should look like, so even while I still need to do a deal of refactoring of internals, it doesn’t break anything. In contrast, research code often isn’t even functions to begin with. Still, it maybe possible to think of a somewhat different testing/assertion paradigm for (rapidly evolving) research scripts. For instance, there was an interesting effort to develop a testing framework for Rmds at the 2017 unconf; https://github.com/ropenscilabs/testrmd.

Ironically, I think the more common problem I encounter in trainee code is too little refactoring. Am I alone in thinking this? I believe it’s easy to get stuck feeling “this giant block of code took me 2 weeks to write, I’ll just continue to modify and extend it”, when the more sensible thing is more often to re-write into smaller functions, not bigger ones. Refactoring a script that already “works” can seem like a waste of time and only risk breaking things. When is it time to refactor, and how do folks encourage refactoring?

Topic		Replies	Views
Community Call Summary - Code Review in the Lab Blog r , codereview , community , community-call	0	633	November 29, 2018
Call for contributors for a paper on code review Software-Review	1	859	September 24, 2018
How rOpenSci uses Code Review to Promote Reproducible Science Blog onboarding , codereview	0	665	September 1, 2017
How could the onboarding / package review process be even better? Package Development	16	3872	September 20, 2016
Use of R package review guidelines in independent manuscript review UseCases software-peer-review , dev-guide	2	1605	August 8, 2019

How do you review code that accompanies a research project or paper? Help rOpenSci plan a Community Call

Related topics