Plenary debates of the parliament of Finland as linked open data and in parla-CLARIN markup

Loading...
Thumbnail Image

Access rights

openAccess
publishedVersion

URL

Journal Title

Journal ISSN

Volume Title

A4 Artikkeli konferenssijulkaisussa

Date

2021-08-01

Major/Subject

Mcode

Degree programme

Language

en

Pages

17

Series

3rd Conference on Language, Data and Knowledge, LDK 2021, OpenAccess Series in Informatics ; Volume 93

Abstract

This paper presents a knowledge graph created by transforming the plenary debates of the Parliament of Finland (1907-) into Linked Open Data (LOD). The data, totaling over 900 000 speeches, with automatically created semantic annotations and rich ontology-based metadata, are published in a Linked Open Data Service and are used via a SPARQL API and as data dumps. The speech data is part of larger LOD publication FinnParla that also includes prosopographical data about the politicians. The data is being used for studying parliamentary language and culture in Digital Humanities in several universities. To serve a wider variety of users, the entirety of this data was also produced using Parla-CLARIN markup. We present the first publication of all Finnish parliamentary debates as data. Technical novelties in our approach include the use of both Parla-CLARIN and an RDF schema developed for representing the speeches, integration of the data to a new Parliament of Finland Ontology for deeper data analyses, and enriching the data with a variety of external national and international data sources.

Description

| openaire: EC/H2020/101004825/EU//InTaVia Funding Information: Acknowledgements Thanks to Ari Apilo, Sari Wilenius, and Päivikki Karhula of PoF for providing material for the project. Our work was funded by the Academy of Finland as part of the Semantic Parliament project, the EU project InTaVia: In/Tangible European Heritage1, and is related to the COST action NexusLinguarum2 on linguistic data science. CSC – IT Center for Science, Finland, provided computational resources for the work. Publisher Copyright: © Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela, and Eero Hyvönen; licensed under Creative Commons License CC-BY 4.0

Keywords

Digital humanities, Linked open data, Parla-CLARIN, Parliamentary data, Plenary debates

Other note

Citation

Sinikallio, L, Drobac, S, Tamper, M, Leal, R, Koho, M, Tuominen, J, Mela, M L & Hyvönen, E 2021, Plenary debates of the parliament of Finland as linked open data and in parla-CLARIN markup . in D Gromann, G Serasset, T Declerck, J P McCrae, J Gracia, J Bosque-Gil, F Bobillo & B Heinisch (eds), 3rd Conference on Language, Data and Knowledge, LDK 2021 ., 8, OpenAccess Series in Informatics, vol. 93, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, International Conference on Language, Data, and Knowledge, Zaragoza, Spain, 01/09/2021 . https://doi.org/10.4230/OASIcs.LDK.2021.8