From websites to Wikidata: digitising Scotland’s stories

Jennifer Shaw, Grace Young, Mark Cranston, Sara Thomas, Kirsty S. Ross*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Data connected to our cultural and industrial heritage are often collated by passionate volunteers, typically in the form of blogs and websites. Decades of effort is invested in these sites, which are then vulnerable to data loss due to technical failures, lack/loss of volunteers, or link rot (Chapekis et al., 2024). In collaboration with undergraduate Computer Science students from the University of St Andrews and Wikimedia UK, two unique datasets (created from the websites for Women’s History Scotland and Scottish Brick History) were extracted, cleaned, uploaded to Wikidata, then used to generate creative websites to allow readers to interact with the data. This data paper will explore the pluses and minuses of our approach, and how we would tackle similar datasets in the future.
Original languageEnglish
Article number33
Number of pages7
JournalJournal of Open Humanities Data
Volume12
DOIs
Publication statusPublished - 25 Feb 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education
  2. SDG 5 - Gender Equality
    SDG 5 Gender Equality

Keywords

  • Scottish brick history
  • Wikidata
  • Undergraduate research
  • Open access
  • Web scraping
  • Human-computer interaction

Fingerprint

Dive into the research topics of 'From websites to Wikidata: digitising Scotland’s stories'. Together they form a unique fingerprint.

Cite this