Multitask Balanced and Recalibrated Network for Medical Code Prediction

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorSun, Weien_US
dc.contributor.authorJi, Shaoxiongen_US
dc.contributor.authorCambria, Eriken_US
dc.contributor.authorMarttinen, Pekkaen_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorProfessorship Marttinen P.en
dc.contributor.groupauthorComputer Science Professorsen
dc.contributor.groupauthorComputer Science - Artificial Intelligence and Machine Learning (AIML) - Research areaen
dc.contributor.organizationAalto Universityen_US
dc.contributor.organizationDepartment of Computer Scienceen_US
dc.contributor.organizationNanyang Technological Universityen_US
dc.date.accessioned2022-11-30T08:39:00Z
dc.date.available2022-11-30T08:39:00Z
dc.date.issued2022-11-09en_US
dc.description| openaire: EC/H2020/101016775/EU//INTERVENE
dc.description.abstractHuman coders assign standardized medical codes to clinical documents generated during patients’ hospitalization, which is error prone and labor intensive. Automated medical coding approaches have been developed using machine learning methods, such as deep neural networks. Nevertheless, automated medical coding is still challenging because of complex code association, noise in lengthy documents, and the imbalanced class problem. We propose a novel neural network, called the Multitask Balanced and Recalibrated Neural Network, to solve these issues. Significantly, the multitask learning scheme shares the relationship knowledge between different coding branches to capture code association. A recalibrated aggregation module is developed by cascading convolutional blocks to extract high-level semantic features that mitigate the impact of noise in documents. Also, the cascaded structure of the recalibrated module can benefit learning from lengthy notes. To solve the imbalanced class problem, we deploy focal loss to redistribute the attention on low- and high-frequency medical codes. Experimental results show that our proposed model outperforms competitive baselines on a real-world clinical dataset called the Medical Information Mart for Intensive Care (MIMIC-III).en
dc.description.versionPeer revieweden
dc.format.extent20
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationSun, W, Ji, S, Cambria, E & Marttinen, P 2022, 'Multitask Balanced and Recalibrated Network for Medical Code Prediction', ACM Transactions on Intelligent Systems and Technology, vol. 14, no. 1, 17, pp. 1–20. https://doi.org/10.1145/3563041en
dc.identifier.doi10.1145/3563041en_US
dc.identifier.issn2157-6904
dc.identifier.issn2157-6912
dc.identifier.otherPURE UUID: ff4c1ec7-74e5-4af8-8a70-f419c10739c6en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/ff4c1ec7-74e5-4af8-8a70-f419c10739c6en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/93167818/Multitask_Balanced_and_Recalibrated_Network_for_Medical_Code_Prediction.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/117993
dc.identifier.urnURN:NBN:fi:aalto-202211306749
dc.language.isoenen
dc.publisherACM
dc.relationinfo:eu-repo/grantAgreement/EC/H2020/101016775/EU//INTERVENEen_US
dc.relation.ispartofseriesACM Transactions on Intelligent Systems and Technologyen
dc.relation.ispartofseriesVolume 14, issue 1, pp. 1–20en
dc.rightsopenAccessen
dc.titleMultitask Balanced and Recalibrated Network for Medical Code Predictionen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files