Explainable empirical risk minimization
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Zhang, Linli | en_US |
dc.contributor.author | Karakasidis, Georgios | en_US |
dc.contributor.author | Odnoblyudova, Arina | en_US |
dc.contributor.author | Dogruel, Leyla | en_US |
dc.contributor.author | Tian, Yu | en_US |
dc.contributor.author | Jung, Alex | en_US |
dc.contributor.department | Department of Computer Science | en |
dc.contributor.groupauthor | Professorship Jung Alexander | en |
dc.contributor.groupauthor | Computer Science Professors | en |
dc.contributor.groupauthor | Computer Science - Artificial Intelligence and Machine Learning (AIML) - Research area | en |
dc.contributor.groupauthor | Computer Science - Large-scale Computing and Data Analysis (LSCA) - Research area | en |
dc.contributor.groupauthor | Helsinki Institute for Information Technology (HIIT) | en |
dc.contributor.organization | Department of Computer Science | en_US |
dc.contributor.organization | Johannes Gutenberg University Mainz | en_US |
dc.date.accessioned | 2024-01-04T09:17:19Z | |
dc.date.available | 2024-01-04T09:17:19Z | |
dc.date.issued | 2024-03 | en_US |
dc.description | Funding Information: Open Access funding provided by Aalto University. The work in this paper is supported by the following three fundings: funding “Ensemble nowcasting of irradiance/clouds for solar energy using novel machine learning tools—can AI beat physics?" with project number 885377 granted by Austrian Research Promotion Agency (FFG) in 2021, funding “Intelligent Techniques in Condition Monitoring of Electromechanical Energy Conversion Systems" granted by Academy of Finland (decision number 331197) in 2020, and funding“XAI-based software-defined energy networks via packetized management for fossil fuel-free next-generation of industrial cyber-physical systems (X-SDEN)” granted by Academy of Finland (decision number 349966) in 2022. Publisher Copyright: © 2023, The Author(s). | |
dc.description.abstract | The successful application of machine learning (ML) methods increasingly depends on their interpretability or explainability. Designing explainable ML (XML) systems is instrumental for ensuring transparency of automated decision-making that targets humans. The explainability of ML methods is also an essential ingredient for trustworthy artificial intelligence. A key challenge in ensuring explainability is its dependence on the specific human end user of an ML system. The users of ML methods might have vastly different background knowledge about ML principles, with some having formal training in the specific field and others having none. We use information-theoretic concepts to develop a novel measure for the subjective explainability of predictions delivered by a ML method. We construct this measure via the conditional entropy of predictions, given the user signal. Our approach allows for a wide range of user signals, ranging from responses to surveys to biophysical measurements. We use this measure of subjective explainability as a regularizer for model training. The resulting explainable empirical risk minimization (EERM) principle strives to balance subjective explainability and risk. The EERM principle is flexible and can be combined with arbitrary ML models. We present several practical implementations of EERM for linear models and decision trees. Numerical experiments demonstrate the application of EERM to weather prediction and detecting inappropriate language in social media. | en |
dc.description.version | Peer reviewed | en |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Zhang, L, Karakasidis, G, Odnoblyudova, A, Dogruel, L, Tian, Y & Jung, A 2024, 'Explainable empirical risk minimization', Neural Computing and Applications, vol. 36, no. 8, pp. 3983-3996. https://doi.org/10.1007/s00521-023-09269-3 | en |
dc.identifier.doi | 10.1007/s00521-023-09269-3 | en_US |
dc.identifier.issn | 0941-0643 | |
dc.identifier.issn | 1433-3058 | |
dc.identifier.other | PURE UUID: e38d43c7-8c46-4748-87f8-a18f56274b23 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/e38d43c7-8c46-4748-87f8-a18f56274b23 | en_US |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85178895312&partnerID=8YFLogxK | |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/166988522/SCI_Zhang_etal_Neural_Computing_and_Applications.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/125561 | |
dc.identifier.urn | URN:NBN:fi:aalto-202401041250 | |
dc.language.iso | en | en |
dc.publisher | Springer | |
dc.relation.ispartofseries | Neural Computing and Applications | en |
dc.relation.ispartofseries | Volume 36, issue 8, pp. 3983-3996 | en |
dc.rights | openAccess | en |
dc.subject.keyword | Empirical risk minimization | en_US |
dc.subject.keyword | Explainable machine learning | en_US |
dc.subject.keyword | Subjective explainability | en_US |
dc.title | Explainable empirical risk minimization | en |
dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
dc.type.version | publishedVersion |