Persistent identification and citation of software

Catherine Mary Jones, Brian Matthews, Ian Gent, Tom Griffin, Jonathan Tedds

Research output: Contribution to journalArticlepeer-review


Software underpins the academic research process across disciplines. To be able to understand, use/reuse and preserve data, the software code that generated, analysed or presented the data will need to be retained and executed. An important part of this process is being able to persistently identify the software concerned. This paper discusses the reasons for doing so and introduces a model of software entities to enable better identification of what is being identified.

The DataCite metadata schema provides a persistent identification scheme and we consider how this scheme can be applied to software. We then explore examples of persistent identification and reuse. The examples show the differences and similarities of software used in academic research, which has been written and reused at different scales. The key concepts of being able to identify what precisely is being used and provide a mechanism for appropriate credit are important to both of them.
Original languageEnglish
Pages (from-to)104-114
JournalInternational Journal of Digital Curation
Issue number2
Publication statusPublished - 4 Jul 2017


Dive into the research topics of 'Persistent identification and citation of software'. Together they form a unique fingerprint.

Cite this