New Article Published: Reflections on reproducible research, in Reflections on Language Documentation 20 Years after Himmelmann 1998
In 1998 Nikolas Himmelmann wrote "Documentary and descriptive linguistics”, an article for the journal Linguistics (I’m not going to link because it’s paywalled, but you should be able to find a copy floating around online). Himmelmann 1998 is an often cited work that makes a case for the work of documentation (collecting data) being as important as the work of description (describing data) in understanding how languages work.
To look at the 20 years since this publication, Bradley McDonnell, Andrea L. Berez-Kroeker and Gary Holton brought together over 30 linguists to reflect on how the field has changed and grown in that time. This is a Special Publication of Language Documentation & Conservation, available open access.
From the introduction to the special issue:
[W]e invited 38 experts from around the world to reflect on either particular issues within the realm of language documentation or particular regions where language documentation projects are being carried out. The issues addressed in this volume represent a broad and diverse set of topics from multiple perspectives and for multiple purposes that continue to be relevant to documentary linguists and language communities. Some topics have been hotly debated over the past two decades, while others have emerged more recently.
All of the chapters are short and to the point (they’re around 10 pages each). I has a great time writing with Andrea Berez-Kroeker about how language documentation has changed over the last two decades in regards to data management and sharing. Himmelmann talks about the importance of sharing primary data, but the nature of that data, and how we can share it has changed so much since 1998.
Abstract
Reproducibility in language documentation and description means that the analysis given in descriptive publication is presented in a way that allows the reader to access the data on which the claims are based, to verify the analysis for themself. Linguists, including Himmelmann, have long pointed to the centrality of documentation data to linguistic description. Over the twenty years since Himmelmann’s 1998 paper we have seen a growth in digital archiving, and the rise of the Open Access movement. Although there is good infrastructure in place to make reproducible research possible, few descriptive publications clearly link to underlying data, and very little documentation data is publicly accessible. We discuss some of the institutional roadblocks to reproducibility, including a lack of support for the development of published primary data. We also look at what work on language documentation and description can learn from the recent replication crisis in psychology.
Reference
Gawne, Lauren and Andrea L. Berez-Kroeker. 2018. Reflections on reproducible research. In McDonnell, Bradley, Andrea L. Berez-Kroeker, and Gary Holton. (Eds.) Reflections on Language Documentation 20 Years after Himmelmann 1998. Language Documentation & Conservation Special Publication no. 15. [PP 22-32] Honolulu: University of Hawai‘i Press. [Open Access PDF]












