On the Generalization of Equivariant Graph Neural Networks
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Karczewski, Rafał | |
dc.contributor.author | Souza, Amauri H. | |
dc.contributor.author | Garg, Vikas | |
dc.contributor.department | Department of Computer Science | en |
dc.contributor.groupauthor | Probabilistic Machine Learning | en |
dc.contributor.groupauthor | Professorship Kaski Samuel | en |
dc.contributor.groupauthor | Computer Science Professors | en |
dc.contributor.groupauthor | Computer Science - Artificial Intelligence and Machine Learning (AIML) | en |
dc.contributor.groupauthor | Professorship Garg Vikas | en |
dc.date.accessioned | 2024-09-25T06:05:31Z | |
dc.date.available | 2024-09-25T06:05:31Z | |
dc.date.issued | 2024 | |
dc.description | Publisher Copyright: Copyright 2024 by the author(s) | |
dc.description.abstract | E(n)-Equivariant Graph Neural Networks (EGNNs) are among the most widely used and successful models for representation learning on geometric graphs (e.g., 3D molecules). However, while the expressivity of EGNNs has been explored in terms of geometric variants of the Weisfeiler-Leman isomorphism test, characterizing their generalization capability remains open. In this work, we establish the first generalization bound for EGNNs. Our bound depicts a dependence on the weighted sum of logarithms of the spectral norms of the weight matrices (EGNN parameters). In addition, our main result reveals interesting novel insights: i) the spectral norms of the initial layers may impact generalization more than the final ones; ii) ε-normalization is beneficial to generalization ' confirming prior empirical evidence. We leverage these insights to introduce a spectral norm regularizer tailored to EGNNs. Experiments on real-world datasets substantiate our analysis, demonstrating a high correlation between theoretical and empirical generalization gaps and the effectiveness of the proposed regularization scheme. | en |
dc.description.version | Peer reviewed | en |
dc.format.extent | 28 | |
dc.format.mimetype | application/pdf | |
dc.identifier.citation | Karczewski, R, Souza, A H & Garg, V 2024, ' On the Generalization of Equivariant Graph Neural Networks ', Proceedings of Machine Learning Research, vol. 235, pp. 23159-23186 . < https://proceedings.mlr.press/v235/karczewski24a.html > | en |
dc.identifier.issn | 2640-3498 | |
dc.identifier.other | PURE UUID: 579680b1-1be5-4f47-b052-8dda59cc13f2 | |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/579680b1-1be5-4f47-b052-8dda59cc13f2 | |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85203830679&partnerID=8YFLogxK | |
dc.identifier.other | PURE LINK: https://proceedings.mlr.press/v235/karczewski24a.html | |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/159372888/On_the_Generalization_of_Equivariant_Graph_Neural_Networks.pdf | |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/130959 | |
dc.identifier.urn | URN:NBN:fi:aalto-202409256502 | |
dc.language.iso | en | en |
dc.publisher | JMLR | |
dc.relation.ispartofseries | Proceedings of Machine Learning Research | |
dc.relation.ispartofseries | Volume 235, pp. 23159-23186 | |
dc.rights | openAccess | en |
dc.title | On the Generalization of Equivariant Graph Neural Networks | en |
dc.type | A4 Artikkeli konferenssijulkaisussa | fi |
dc.type.version | publishedVersion |