DeepGraphGO: Graph neural network for large-scale, multispecies protein function prediction
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | You, Ronghui | en_US |
dc.contributor.author | Yao, Shuwei | en_US |
dc.contributor.author | Mamitsuka, Hiroshi | en_US |
dc.contributor.author | Zhu, Shanfeng | en_US |
dc.contributor.department | Department of Computer Science | en |
dc.contributor.groupauthor | Probabilistic Machine Learning | en |
dc.contributor.groupauthor | Helsinki Institute for Information Technology (HIIT) | en |
dc.contributor.groupauthor | Professorship Kaski Samuel | en |
dc.contributor.organization | Fudan University | en_US |
dc.date.accessioned | 2021-08-25T06:54:00Z | |
dc.date.available | 2021-08-25T06:54:00Z | |
dc.date.issued | 2021-07-01 | en_US |
dc.description | Publisher Copyright: © 2021 Oxford University Press. All rights reserved. | |
dc.description.abstract | Motivation: Automated function prediction (AFP) of proteins is a large-scale multi-label classification problem. Two limitations of most network-based methods for AFP are (i) a single model must be trained for each species and (ii) protein sequence information is totally ignored. These limitations cause weaker performance than sequence-based methods. Thus, the challenge is how to develop a powerful network-based method for AFP to overcome these limitations. Results: We propose DeepGraphGO, an end-to-end, multispecies graph neural network-based method for AFP, which makes the most of both protein sequence and high-order protein network information. Our multispecies strategy allows one single model to be trained for all species, indicating a larger number of training samples than existing methods. Extensive experiments with a large-scale dataset show that DeepGraphGO outperforms a number of competing state-of-the-art methods significantly, including DeepGOPlus and three representative network-based methods: GeneMANIA, deepNF and clusDCA. We further confirm the effectiveness of our multispecies strategy and the advantage of DeepGraphGO over so-called difficult proteins. Finally, we integrate DeepGraphGO into the stateof- the-art ensemble method, NetGO, as a component and achieve a further performance improvement. Availability and implementation: https://github.com/yourh/DeepGraphGO. | en |
dc.description.version | Peer reviewed | en |
dc.format.extent | I262-I271 | |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | You, R, Yao, S, Mamitsuka, H & Zhu, S 2021, ' DeepGraphGO : Graph neural network for large-scale, multispecies protein function prediction ', Bioinformatics, vol. 37, pp. I262-I271 . https://doi.org/10.1093/bioinformatics/btab270 | en |
dc.identifier.doi | 10.1093/bioinformatics/btab270 | en_US |
dc.identifier.issn | 1367-4803 | |
dc.identifier.issn | 1460-2059 | |
dc.identifier.other | PURE UUID: bede6bf7-b3d8-4ad7-8ceb-4d7982649482 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/bede6bf7-b3d8-4ad7-8ceb-4d7982649482 | en_US |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85112020570&partnerID=8YFLogxK | en_US |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/66692009/DeepGraphGO.btab270.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/109175 | |
dc.identifier.urn | URN:NBN:fi:aalto-202108258412 | |
dc.language.iso | en | en |
dc.publisher | OXFORD UNIV PRESS INC | |
dc.relation.ispartofseries | Bioinformatics | en |
dc.relation.ispartofseries | Volume 37 | en |
dc.rights | openAccess | en |
dc.title | DeepGraphGO: Graph neural network for large-scale, multispecies protein function prediction | en |
dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
dc.type.version | publishedVersion |