An Anonymization Tool for Open Data Publication of Legal Documents
Access rights
openAccess
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2022
Department
Major/Subject
Mcode
Degree programme
Language
en
Pages
10
12-21
12-21
Series
CEUR Workshop Proceedings, Volume 3257
Abstract
The EU General Data Protection Regulation (GDPR) requires anonymization of documents containing personal data, such as court decisions, for public use. Doing this manually is costly and time-consuming but can be automated by applying Natural Language Processing (NLP) methods. This paper introduces the ANOPPI tool developed for (semi-)automatic anonymization of Finnish texts. The tool can be used both as a web application and programmatically through a REST API. Evaluation shows that ANOPPI performs well with different types of documents, however, further improving the performance of the named entity recognition and disambiguation methods would enhance the usefulness of the software. The tool is being published as open source for public use by the Ministry of Justice in Finland. A use case of ANOPPI is to publish court decisions on the Web in the LawSampo semantic portal for human close reading and as Linked Open Data for data analysis in legal informatics.Description
Publisher Copyright: © 2022 Copyright for this paper by its authors.
Keywords
anonymization, case law, named entity recognition, pseudonymization
Other note
Citation
Oksanen, A, Hyvönen, E, Tamper, M, Tuominen, J, Ylimaa, H, Löytynoja, K, Kokkonen, M & Hietanen, A 2022, ' An Anonymization Tool for Open Data Publication of Legal Documents ', CEUR Workshop Proceedings, vol. 3257, pp. 12-21 .