Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language
Loading...
Access rights
openAccess
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2022-08-11
Department
Major/Subject
Mcode
Degree programme
Language
en
Pages
10
70-79
70-79
Series
CEUR Workshop Proceedings, Volume 3184
Abstract
This paper presents knowledge extraction and natural language processing methods used to enrich the knowledge graph of the plenary debates (textual transcripts of speeches) of the Parliament of Finland. This knowledge graph includes some 960 000 speeches (1907–2021) interlinked with a prosopographical knowledge graph about the politicians. A recent subset of the speeches was used to extract named entities and topical keywords for semantic searching and browsing the data and for data analysis. The process is based on linguistic analysis, named entity linking, and automatic subject indexing. The results were included into the ParliamentSampo knowledge graph in a SPARQL endpoint. This data can be used for studying parliamentary language and culture in Digital Humanities research and for developing applications, such as the ParliamentSampo portal.Description
Funding Information: Acknowledgements Our work is part of the Semantic Parliament project18, funded by the Academy of Finland and is also related to the EU project InTaVia19 and the EU COST action Nexus Linguarum20. The project uses the computing resources of the CSC – IT Center for Science. Publisher Copyright: © 2022 Copyright for this paper by its authors. | openaire: EC/H2020/101004825/EU//InTaVia
Keywords
digital humanities, linked data, natural language processing, parliamentary studies
Other note
Citation
Tamper, M, Leal, R, Sinikallio, L, Leskinen, P, Tuominen, J & Hyvönen, E 2022, ' Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language ', CEUR Workshop Proceedings, vol. 3184, pp. 70-79 . < http://ceur-ws.org/Vol-3184/TEXT2KG_Paper_5.pdf >