Browsing by Author "Leskinen, Petri"
Now showing 1 - 20 of 54
Results Per Page
Sort Options
Item AATOS – A configurable tool for automatic annotation(2017) Tamper, Minna; Leskinen, Petri; Ikkala, Esko; Oksanen, Arttu; Mäkelä, Eetu; Heino, Erkki; Tuominen, Jouni; Koho, Mikko; Hyvönen, Eero; Department of Computer Science; Professorship Hyvönen EeroThis paper presents an automatic annotation tool AATOS for providing documents with semantic annotations. The tool links entities found from the texts to ontologies defined by the user. The application is highly configurable and can be used with different natural language Finnish texts. The application was developed as a part of the WarSampo (http://seco.cs.aalto.fi/projects/sotasampo/en/) and Semantic Finlex (http://seco.cs.aalto.fi/projects/lawlod/en/) projects and tested using Kansa Taisteli magazine articles and consolidated Finnish legislation of Semantic Finlex. The quality of the automatic annotation was evaluated by measuring precision and recall against existing manual annotations. The results showed that the quality of the input text, as well as the selection and configuration of the ontologies impacted the results.Item Advancing Disambiguation of Actors Against Multiple Linked Open Data Sources(2023-10-09) Wahjoe, Muhammad; Drobac, Senka; Leskinen, Petri; Perustieteiden korkeakoulu; Hyvönen, EeroDisambiguation is an important step in the semantic data transformation process. In this scope, the process sought to eliminate the ambiguity of which person a record is describing. \emph{Constellation of Correspondence} or CoCo is a data integration project focused on historical epistolary data. In its data transformation flow, actor records from source data are linked to actor entities in an external linked open data source to enrich the actors' information with metadata found in external databases. This work presents an advanced disambiguation system for CoCo data transformation flow. The system has managed to deliver a reliable and flexible linking system that provides advantages,hi such as the incorporation of an additional external database, novel linking rule definition and implementation, and a more transparent linking result provenance presentation and management. This work also evaluates linking process performance in various linking cases by employing the help of a human expert judge to evaluate whether the proposed valid link made by the linking systems are indeed accurate or not. The system and the proposed rule configuration delivers a satisfactory performance on the easier, more common case but still struggles to deliver good precision on rarer edge cases. There are insightful observations made regarding the data that was observed during the development and evaluation of the system. Firstly is the importance of naming similarity in determining a link between two actors and the imperfection of name similarity in the majority of the valid linking case. This observation justifies the need for dissimilarity tolerance in naming comparison despite the importance of naming similarity. This imperfect state of the systems inspires the several future works that this work proposes. The proposed future works are the further fine-tuning of the linking rule and selection rule and the advancing the evaluation by increasing the completeness of the evaluation and the research of a more automated evaluation process.Item Akatemiasampo-portaali ja -datapalvelu henkilöiden ja henkilöryhmien historialliseen tutkimukseen (AcademySampo Portal and Data Service for Biographical and Prosopographical Research)(Informaatiotutkimuksen yhdistys, 2021-05-01) Hyvönen, Eero; Leskinen, Petri; Rantala, Heikki; Ikkala, Esko; Tuominen, Jouni; Tietotekniikan laitos; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML); Professorship Hyvönen EeroAcademySampo – Academic people in Finland 1640–1899 is a portal and a Linked Open Data service on the Semantic Web. AcademySampo contains richly interlinked open data about all people that have got academic education in Finland in 1640–1899. The system is targeted to researchers and the general public for biographical and prosopographical research. This review gives an overview on how AcademySampo can be utilized in practise with its novel digital humanities tools included in the portal and by using the data service via APIs.Item Analyses of Networks of Politicians Based on Linked Data: Case ParliamentSampo - Parliament of Finland on the Semantic Web(Springer, 2022-08-29) Pokkimäki, Henna; Leskinen, Petri; Tamper, Minna; Hyvönen, Eero; Department of Computer Science; Chiusano, Silvia; Cerquitelli, Tania; Wrembel, Robert; Nørvåg, Kjetil; Catania, Barbara; Vargas-Solar, Genoveva; Zumpano, Ester; Professorship Hyvönen Eero; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML); Aalto UniversityIn parliamentary debates the speakers make reference to each other. By extracting and linking named entities from the speeches it is possible to construct reference networks and use them for analysing networks of politicians and parties and their debates. This paper presents how such a network can be constructed automatically, based on a speech corpus 2015–2022 of the Parliament of Finland, and be used as a basis for network analysis.Item Analyzing and Visualizing Prosopographical Linked Data Based on Biographies(2018) Leskinen, Petri; Hyvönen, Eero; Tuominen, Jouni; Department of Computer Science; Fokkens, Antske; ter Braake, Serge; Sluijter, Ronald; Arthur, Paul; Wandl-Vogt, Eveline; Professorship Hyvönen EeroThis paper shows how faceted search on biographical data can be utilized as a flexible basis for filtering target groups of people and, in particular, how generic data analysis and visualizations tools can then be applied for solving prosopographical research questions based on the filtered data. This idea is demonstrated and evaluated in practice by presenting two application case studies: 1) linked data extracted from a printed registry of over 10 000 alumni (1867–1992) of the prominent Finnish high school Norssi, and 2) a knowledge graph extracted from 13 000 short biographies of significant Finnish people (from 3rd century to present times) in the National Biography of Finland. In both cases, the data is enriched by linking their entities with several other external datasets.Item Analyzing biography collections historiographically as Linked Data: Case National Biography of Finland(IOS PRESS, 2022-08-30) Tamper, Minna; Leskinen, Petri; Hyvönen, Eero; Valjus, Risto; Keravuori, Kirsi; Professorship Hyvönen Eero; Computer Science Professors; Finnish Literature Society; Department of Computer ScienceBiographical collections are available on the Web for close reading. However, the underlying texts can also be used for data analysis and distant reading, if the documents are available as data. Such data is usable for creating intelligent user interfaces to biographical data, including Digital Humanities tooling for visualizations, data analysis, and knowledge discovery in biographical and prosopographical research. In this paper, we re-use biographical collection data from a historiographical perspective for analyzing the underlying collection. For example: What kind of people have been included in the collection? Does the language used for describing female biographees differ from that for men? As a case study, the Finnish National Biography, available as part of the Linked Open Data service and semantic portal BiographySampo – Finnish Biographies on the Semantic Web is used. The analyses show interesting results related to, e.g., how specific prosopographical groups, such as women or professional groups are represented and portrayed. Various novel statistics and network analyses of the biographees are presented. Our analyses give new insights to the editors of the National Biography as well as to researchers in biography, prosopography, and historiography. The presented approach can be applied also to similar biography collections in other countries.Item Analyzing the Lives of Finnish Academic People 1640-1899 in Nordic and Baltic Countries: AcademySampo Data Service and Portal(RWTH Aachen University, 2022) Leskinen, Petri; Hyvönen, Eero; Rantala, Heikki; Department of Computer Science; Professorship Hyvönen Eero; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML)This paper shows how the newly published Linked Open Data (LOD) service and semantic portal “AcademySampo - Finnish Academic People 1640-1899” can be used for Digital Humanities (DH) research. The original primary data, based on some ten man-years of digitization work, covers a significant part of the Finnish university history based on the student registries in 1640-1852 and 1853-1899. They contain biographical descriptions of 28 000 students of the University of Helsinki, originally the Royal Academy of Turku. AcademySampo also sheds light to the academic history of Sweden and Baltic countries through their shared history with Finland in the larger Swedish Empire. The Finnish student registries have been widely used by genealogists and historians by close reading. We argue that unprecedented new possibilities for DH research are now enabled by using AcademySampo: the underlying knowledge graph can be accessed and analyzed using Semantic Web technologies and tools and with the ready-to-use data-analytic tools of the portal. Examples of data-analysis are presented by using the AcademySampo system for studying migrations of students in Finland, Sweden, Russia, and Estonia, history of student nations, inheritance of vocations and social classes, lengths of family lines of students, and network analyses of students. Related analyses have been made before using biographical dictionaries but not for academic history and student registries.Item Bio CRM(2018-01-01) Tuominen, Jouni; Hyvönen, Eero; Leskinen, Petri; Department of Computer Science; Professorship Hyvönen E.Biographies make a promising application case of Linked Data: they can be used, e.g., as a basis for Digital Humanities research in prosopography and as a key data and linking resource in semantic Cultural Heritage (CH) portals. In both use cases, a semantic data model for harmonizing and interlinking heterogeneous data from different sources is needed. This paper presents such a data model, Bio CRM, with the following key ideas: 1) The model is a domain specific extension of CIDOC CRM, making it applicable to not only biographical data but to other CH data, too. 2) The model makes a distinction between enduring unary roles of actors, their enduring binary relationships, and perduing events, where the participants can take different roles modeled as a role concept hierarchy. 3) The model can be used as a basis for semantic data validation and enrichment by reasoning. 4) The enriched data conforming to Bio CRM is targeted to be used by SPARQL queries in a flexible ways using a hierarchy of roles in which participants can be involved in events.Item Bio CRM: A Data Model for Representing Biographical Data for Prosopographical Research(2018) Tuominen, Jouni; Hyvönen, Eero; Leskinen, Petri; Department of Computer Science; Fokkens, Antske; ter Braake, Serge; Sluijter, Ronald; Arthur, Paul; Wandl-Vogt, Eveline; Professorship Hyvönen EeroBiographies make a promising application case of Linked Data: they can be used, e.g., as a basis for Digital Humanities research in prosopography and as a key data and linking resource in semantic Cultural Heritage (CH) portals. In both use cases, a semantic data model for harmonizing and interlinking heterogeneous data from different sources is needed. This paper presents such a data model, Bio CRM, with the following key ideas: 1) The model is a domain specific extension of CIDOC CRM, making it applicable to not only biographical data but to other CH data, too. 2) The model makes a distinction between enduring unary roles of actors, their enduring binary relationships, and perduing events, where the participants can take different roles modeled as a role concept hierarchy. 3) The model can be used as a basis for semantic data validation and enrichment by reasoning. 4) The enriched data conforming to Bio CRM is targeted to be used by SPARQL queries in a flexible ways using a hierarchy of roles in which participants can be involved in events.Item Biografiasampo yhdistää ja rikastaa suomalaiset elämäkerrat linkitettynä datana semanttisessa webissä (Biographysampo links and enriches Finnish biographies as linked data on the Semantic Web(Informaatiotutkimuksen yhdistys, 2021-06-01) Hyvönen, Eero; Leskinen, Petri; Tamper, Minna; Rantala, Heikki; Ikkala, Esko; Tuominen, Jouni; Keravuori, Kirsi; Tietotekniikan laitos; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML); Professorship Hyvönen Eero; Finnish Literature SocietyInformaatiotutkimuksen tavoitteena on kehittää uusia tapoja tuottaa, organisoida ja käyttää tietoa sekä yksilöiden että organisaatioiden näkökulmasta. Tässä katsauksessa esitellään kulttuurihistoriallisen tiedon tuottajia ja käyttäjiä palvelevan ns. Sampo-mallin sovellus Biografiasampo kansalaisille, digitaalisten ihmistieteiden tutkijoille ja uusien sovellusten kehittäjille. Biografiasammon kunnianhimoisena tavoitteena on käynnistää uusi aikakausi elämäkertakokoelmien julkaisemisessa ja käyttämisessä verkossa semanttisen webin teknologioita ja linkitetyn avoimen datan julkaisuperiaatteita hyödyntäen. Innovaationa on luoda kieliteknologian, tekoälyn ja semanttisen webin teknologioiden avulla elämäkertojen teksteistä ja niihin eri lähteissä liittyvistä tietokannoista tietämysverkko (knowledge graph) osana kansallista tietoinfrastruktuuria. Sovelluksen ydinaineistona ovat Kansallisbiografia ja muut Suomalaisen Kirjallisuuden Seuran toimittamat ja julkaisemat pienoiselämäkerrat, yhteensä 13 100 elämäntarinaa, joita on kirjoittanut 980 suomalaista tutkijaa maamme suurimmaksi sanotussa historiantutkimuksen hankkeessa. Elämäkerroista louhittua dataa on rikastettu automaattisen loogisen päättelyn avulla ja linkittämällä sitä 16 muuhun tietolähteeseen. Tietämysverkko on julkaistu linkitetyn avoimen datan Linked Data Finland -palvelussa. Datapalvelun avulla on toteutettu seitsemästä sovellusnäkymästä koostuva älykäs, avoin ja maksuton verkkopalvelu biografiasampo.fi, jolla on ollut noin 50 000 käyttäjää. Sekä järjestelmän elämäkerrat että niistä louhittu data ovat avoimesti käytettävissä datapalveluna Linked Data Finland -alustalla.Item BiographySampo – Publishing and enriching biographies on the semantic web for digital humanities research(Springer, 2019-06-02) Hyvönen, Eero; Leskinen, Petri; Tamper, Minna; Rantala, Heikki; Ikkala, Esko; Tuominen, Jouni; Keravuori, Kirsi; Department of Computer Science; Zaveri, Amrapali; Gray, Alasdair J.G.; Hammar, Karl; Hitzler, Pascal; Lopez, Vanessa; Janowicz, Krzysztof; Fernández, Miriam; Haller, Armin; Professorship Hyvönen Eero; Finnish Literature SocietyThis paper argues for making a paradigm shift in publishing and using biographical dictionaries on the web, based on Linked Data. The idea is to provide the user with enhanced reading experience of biographies by enriching contents with data linking and reasoning. In addition, versatile tooling for (1) biographical research of individual persons as well as for (2) prosopographical research on groups of people are provided. To demonstrate and evaluate the new possibilities, we present the semantic portal “BiographySampo – Finnish Biographies on the Semantic Web”. The system is based on a knowledge graph extracted automatically from a collection of 13 100 textual biographies, enriched with data linking to 16 external data sources, and by harvesting external collection data from libraries, museums, and archives. The portal was released in September 2018 for free public use at http://biografiasampo.fi.Item Combining faceted search with data-analytic visualizations on top of a SPARQL endpoint(2018-01-01) Leskinen, Petri; Miyakita, Goki; Koho, Mikko; Hyvönen, Eero; Department of Computer Science; Professorship Hyvönen EeroThis paper discusses practical experiences on creating data-analytic visualizations in a browser, on top of a SPARQL endpoint based on the results of faceted search. Four use cases related to Digital Humanities research in proposog-raphy are discussed where the SPARQL Faceter tool was used and extended in different ways. The Faceter tool allows the user to select a group of people with shared properties, e.g., people with the same place of birth, gender, profession, or employer. The filtered data can then be visualized, e.g., as column charts, with business graphics, sankey diagrams, or on a map. The use cases examine the potential of visualization as well as automated knowledge discovery in Digital Humanities research.Item Communication now and then: analyzing the Republic of Letters as a communication network(SPRINGER, 2022-05-10) Ureña-Carrion, Javier; Leskinen, Petri; Tuominen, Jouni; van den Heuvel, Charles; Hyvönen, Eero; Kivelä, Mikko; Department of Computer Science; Professorship Saramäki J.; Professorship Hyvönen Eero; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML); Computer Science - Complex Systems (Cxsys); Professorship Kivelä Mikko; Department of Computer Science; Huygens Institute for the History of the NetherlandsHuge advances in understanding patterns of human communication, and the underlying social networks where it takes place, have been made recently using massive automatically recorded data sets from digital communication, such as emails and phone calls. However, it is not clear to what extent these results on human behaviour are artefacts of contemporary communication technology and culture and if the fundamental patterns in communication have changed over history. This paper presents an analysis of historical epistolary metadata with the aim of comparing the underlying historical communication patterns with those of contemporary communication. Our work uses a new epistolary dataset containing metadata on over 150,000 letters sent between the 16th and 19th centuries. The analyses indicate striking resemblances between contemporary and epistolary communication network patterns, including dyadic interactions and ego-level behaviour. Certain aspects of the letter datasets are insufficient to corroborate other similarities or differences for these communication networks. Despite these drawbacks, our work helps confirm that several features of human communication are not artefacts of contemporary mediums or culture, but are likely elements of human behaviour.Item Constellations of Correspondence: a Linked Data Service and Portal for Studying Large and Small Networks of Epistolary Exchange in the Grand Duchy of Finland(RWTH Aachen University, 2022) Tuominen, Jouni; Koho, Mikko; Pikkanen, Ilona; Drobac, Senka; Enqvist, Johanna; Hyvönen, Eero; La Mela, Matti; Leskinen, Petri; Paloposki, Hanna Leena; Rantala, Heikki; Department of Computer Science; Professorship Hyvönen Eero; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML); Finnish Literature Society; University of Helsinki; Uppsala UniversityThis paper presents the vision of aggregating, harmonizing, and publishing letter catalog metadata (information e.g. of senders, receivers and datings of letters) from cultural heritage (CH) institutions in Finland as a single reconciled Linked Open Data (LOD) service and a semantic portal providing data analytical tools for researchers. The research is conducted as part of the consortium research project Constellations of Correspondence (CoCo). The target of the project is to study - for the first time - scattered, heterogeneous epistolary metadata regarding the period of the Grand Duchy of Finland (1809-1917) as one, integrated dataset and make it interoperable and available. This will enable scholars to ask ambitious research questions in the field of computer science and to conduct empirical, bottom-up case studies e.g. on epistolary culture, communicative networks, and heritagization processes. This paper discusses one of the first datasets acquired by the project, the letter collection of the Board of the Finnish Art Society (1846-1901), provided by the Finnish National Gallery, which contains details of c. 1150 letters sent or received by c. 400 actors.Item A Content-Based Recommender System for an Online Video Platform(2019-12-16) Kurkinen, Antti; Leskinen, Petri; Vainionpää, Janne; Perustieteiden korkeakoulu; Hyvönen, EeroAn increasing amount of video content is available for anyone on the Internet. This tremendous amount of content has led us to information overload, where finding relevant content has become a challenging task. Recommender systems try to mitigate this problem by providing personalized recommendations of items that are likely to be interesting to a particular user. Recommender systems utilize different methods to analyze videos and users' behavioral data to generate the recommendations. However, metadata related to the videos might be inaccurate or insufficient to represent the actual content of the video. This research studies how content-based filtering can be used to generate personalized video recommendations, and how speech transcription analysis can enhance the recommendation of videos where sufficient metadata is not available. The recommender system was implemented for an existing online video platform where the content consists mostly of information-centric lecture and conference type videos. The performance of the recommender system was evaluated by conducting offline experiments which simulated users' actions based on an existing dataset from another video platform. The results indicated that the system is able to generate personalized recommendations for users based on their earlier actions on the system. The speech transcription analysis slightly increased precision when used together with user-generated fields. However, the precision and recall of transcript-only approaches were significantly lower than approaches using user-generated fields. The results were consistent with, and within the same magnitude as, results from previous research using the same dataset. However, future work and experiments are needed in order to verify how the system performs with real users, and how meaningful the recommendations are to the users.Item A Data-driven Approach to Create an Ontology of Parliamentary Work: Case Parliament of Finland on the Semantic Web(RWTH Aachen University, 2023) Hyvönen, Eero; Leskinen, Petri; Tuominen, Jouni; Department of Computer Science; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML); Professorship Hyvönen EeroItem Design and outcomes of an acoustic data visualization seminar(2014) Robinson, Philip W.; Patynen, Jukka; Haapaniemi, Aki; Kuusinen, Antti; Leskinen, Petri; Zan-Bi, Morley; Lokki, TapioRecently, the Department of Media Technology at Aalto University offered a seminar entitled Applied Data Analysis and Visualization. The course used spatial impulse response measurements from concert halls as the context to explore high-dimensional data visualization methods. Students were encouraged to represent source and receiver positions, spatial aspects, and temporal development of sound fields, frequency characteristics, and comparisons between halls, using animations and interactive graphics. The primary learning objectives were for the students to translate their skills across disciplines and gain a working understanding of high-dimensional data visualization techniques. Accompanying files present examples of student-generated, animated and interactive visualizations.Item Eduskunnan täysistuntojen puheenvuorojen henkilömainintoihin perustuvien verkostoiden analyysi(2023-03-20) Poikkimäki, Henna; Leskinen, Petri; Perustieteiden korkeakoulu; Hyvönen, EeroParlamentaarisen datan avoimuus on tärkeää muun muassa päätöksenteon läpinäkyvyyden ja parlamentaariseen keskusteluun liittyvän tutkimuksen kannalta. Semanttinen parlamentti konsortiohanke kokosi yhteen kaikki täysistuntojen puheenvuorot ja täysistunnoissa puhuneet henkilöt aina eduskunnan perustamisesta 1907 saakka luomalla linkitetyn avoimen datan infrastruktuurin, Parlamenttisammon. Dataa on rikastettu esimerkiksi tunnistamalla puheenvuoroista henkilömainintoja, joiden pohjalta tässä työssä luodaan kansanedustajien ja puolueiden välisiä verkostoja. Työssä esitellään ensin verkosto- ja viiteanalyysi sekä käydään läpi niiden välisiä yhteyksiä. Työssä muodostetaan viiteanalyysistä tuttuja viittaus- ja yhteisviittausverkostoja sekä bibliografiseen kytkentään perustuvia verkostoja, joita analysoidaan viite- ja verkostoanalyysin menetelmin. Verkostojen avulla pyritään tunnistamaan parlamentaarisissa keskusteluissa aktiivisia ja keskeisiä henkilöitä sekä tutkimaan, onko puheenvuoroissa tehtyjen henkilömainintojen taustalla toistuvia kaavoja. Viittausverkostojen avulla saatiin selville, että usein mainittuja kansanedustajia ovat ministereinä toimineet henkilöt ja itse paljon henkilömainintoja puheenvuoroissaan tekevät kansanedustajat. Paljon viittauksia tekevistä henkilöistä puolestaan kukaan ei toiminut ministerinä ja suurin osa heistä oli opposition edustajia. Yhteisviittausverkoston avulla saatiin selville henkilöt, jotka on mainittu usein samojen kansanedustajien toimesta, kun taas bibliografiseen kytkentään perustuvan verkoston avulla selvitettiin, ketkä kansanedustajista ovat maininneet usein samoja henkilöitä. Suurin vaikuttava tekijä siihen, ketkä henkilöön viittaavat tai keitä henkilö mainitsee puheenvuoroissaan vaikuttaa olevan henkilön puolueen parlamentaarinen rooli, hallitus tai oppositio. Selkeitä kansanedustajien ryhmittymiä ei kuitenkaan löydetty henkilömainintojen perusteella tässä työssä käytetyillä menetelmillä. Otettaessa huomioon puheenvuorojen lauseet, joissa henkilömaininnat tapahtuvat, opposition ja hallituksen poliitikkojen jako muuttuu selkeämmäksi. Lopussa työssä pohditaan, voisiko henkilömainintojen perusteella löytää selkeämpiä kansanedustajien ryhmittymiä ottamalla esimerkiksi paremmin huomioon henkilömainintojen kontekstit.Item Extracting Genealogical Networks of Linked Data from Biographical Texts(Springer, 2019) Leskinen, Petri; Hyvönen, Eero; Department of Computer Science; Hitzler, Pascal; Kirrane, Sabrina; Hartig, Olaf; de Boer, Victor; Schlobach, Stefan; Vidal, Maria-Esther; Maleshkova, Maria; Hammar, Karl; Lasierra, Nelia; Stadtmüller, Steffen; Hose, Katja; Verborgh, Ruben; Professorship Hyvönen EeroThis paper presents the idea and our work of extracting and reassembling a genealogical network automatically from a collection of biographies. The network can be used as a tool for network analysis of historical persons. The data has been published as Linked Data and as an interactive online service as part of the in-use data service and semantic portal BiographySampo—Finnish Biographies on the Semantic Web.Item Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language(RWTH Aachen University, 2022-08-11) Tamper, Minna; Leal, Rafael; Sinikallio, Laura; Leskinen, Petri; Tuominen, Jouni; Hyvönen, Eero; Department of Computer Science; Professorship Hyvönen Eero; Computer Science Professors; Computer Science - Artificial Intelligence and Machine Learning (AIML)This paper presents knowledge extraction and natural language processing methods used to enrich the knowledge graph of the plenary debates (textual transcripts of speeches) of the Parliament of Finland. This knowledge graph includes some 960 000 speeches (1907–2021) interlinked with a prosopographical knowledge graph about the politicians. A recent subset of the speeches was used to extract named entities and topical keywords for semantic searching and browsing the data and for data analysis. The process is based on linguistic analysis, named entity linking, and automatic subject indexing. The results were included into the ParliamentSampo knowledge graph in a SPARQL endpoint. This data can be used for studying parliamentary language and culture in Digital Humanities research and for developing applications, such as the ParliamentSampo portal.
- «
- 1 (current)
- 2
- 3
- »