Description
This Knowledge Graph represents the "Edinburgh Ladies’ Debating Society" collection (years: 1865 - 1880) in RDF (ttl format). The collection consists of the complete runs of two Edinburgh journals, ‘The Attempt’ (10 volumes, 1865-74) and its successor ‘The Ladies’ Edinburgh Magazine’ (6 volumes, 1875-80). These publications were produced by a leading Edinburgh women’s club, known during the period as the Edinburgh Essay Society or the Ladies’ Edinburgh Essay Society, but subsequently as the Ladies’ Edinburgh Debating Society. The Society existed from 1865 to 1935. The raw dataset is provided by the NLS at this link. Like other NLS data collections, it is originally provided using two XML schemas: METS for descriptive, structural, technical and administrative metadata (title, author, publisher, etc.); and ALTO for encoding the OCR text of each page.
In this work, we extracted the information from the METS and ALTO XML files using the defoe tool, for which we developed a new information extraction query, and created a new Knowledge Graph called LadiesDebating-KG. The LadiesDebating-KG uses the NLS Ontology to represent the extracted information. Furthermore, during the information extraction phase, we employed several techniques to mitigate two common OCR errors: the long-S and line-break hyphenation.
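To make the two OCR issues concrete, here is a minimal Python sketch of how they might be mitigated. It is an illustration only, not the defoe implementation: the function names (`fix_long_s`, `fix_line_break_hyphenation`, `clean_ocr_text`) and the sample sentence are hypothetical, and the sketch assumes that the long-S survives in the OCR output as the literal character `ſ` (U+017F).

```python
import re

# Matches a word fragment ending in a hyphen at a line break, followed by
# its continuation at the start of the next line, e.g. "maga-\nzine".
_LINE_BREAK_HYPHEN = re.compile(r"(\w+)-\s*\n\s*(\w+)")


def fix_long_s(text: str) -> str:
    """Replace the long-s glyph (U+017F) with a regular 's'."""
    return text.replace("\u017f", "s")


def fix_line_break_hyphenation(text: str) -> str:
    """Rejoin words that were split by a hyphen at the end of a line."""
    return _LINE_BREAK_HYPHEN.sub(r"\1\2", text)


def clean_ocr_text(text: str) -> str:
    """Apply both fixes (hypothetical helper, not part of defoe)."""
    return fix_line_break_hyphenation(fix_long_s(text))


# Made-up sample text, used only to exercise the two fixes.
sample = "The E\u017f\u017fay Society's maga-\nzine appeared yearly."
print(clean_ocr_text(sample))
```

A sketch like this has known limits: it would also merge genuine hyphenated compounds that happen to break across lines (for example "well-" / "known"), and it does not handle a long-S that the OCR engine has misread as "f". Handling those cases would likely need a dictionary check.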
The LadiesDebating-KG contains 38,279 RDF triples. It covers 2 series and 16 volumes: the 'The Attempt' series has 10 volumes and the 'The Ladies’ Edinburgh Magazine' series has 6 volumes. Each series has an Editor, mmsid, Shelf-Locator, publication year, etc. Each volume has several pages, each carrying its text. The data model of the LadiesDebating-KG can be found here.
Date made available | 2022 |
---|---|
Publisher | Zenodo |