Sam Swift

  • samswift@cmu.edu

Open Methodology

Open Methodology refers to a set of research practices that allow full transparency of methods and analysis. Like the movements for Open Access publication and Open Source software, Open Methodology has the potential to eliminate barriers to the efficient exchange of ideas. In research those barriers slow research progress and invite ethical transgression. Full openness requires three elements not usually required by journals in the social sciences:

Freely available raw data

The low marginal cost of publishing materials online means there is no reason to be limited to what can reasonably fit into a journal. Data is the most valuable piece of any empirical work and sharing it publicly allows more minds to interpret and learn from the results.

Freely available statistical code

Statistical methods are routinely described in published product of research, but with analysis the devil is always in the details. Even with the raw data and the published description of analyses it can be difficult to replicate the exact results published. By publishing the full code from data manipulation to significance tests, there need not be any ambiguity about the reported results. This has benefits of transparency but also of establishing stronger norms and standardizing procedures within a field.

Open Source statistical computing

Having access to freely available data and statistical code is important, but there are limits to its usefulness if the code is written for use with proprietary software I don't own. Most proprietary statistical packages come with significant costs that introduce real barriers to scholars fully understanding one another's work. This problem is particularly large in interdisciplinary research where different fields have different software norms that decrease the likelihood of smooth statistical communication. With open source statistical computing software, anyone in the world has the tools to join a scholarly discussion.
R is the most widely used open source statistical package/language and is freely available and runs on all operating systems. I recommend these resources for those looking to try it out:

Sound Interesting?

I do my best to follow these practices in my own research but I am interested in doing more to institutionalize the concept. If it's interesting to you, send me an e-mail.