Compositional Generalization in Grounded Language Learning via Induced Model Sparsity

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorSpilsbury, Samen_US
dc.contributor.authorIlin, Alexanderen_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorComputer Science Professorsen
dc.contributor.groupauthorComputer Science - Artificial Intelligence and Machine Learning (AIML) - Research areaen
dc.contributor.organizationDepartment of Computer Scienceen_US
dc.date.accessioned2022-10-19T06:44:18Z
dc.date.available2022-10-19T06:44:18Z
dc.date.issued2022en_US
dc.descriptionFunding Information: We thank Yonatan Bisk for his valuable feedback and suggestions on this work. We also acknowledge the computational resources provided by the Aalto Science-IT project and the support within the Academy of Finland Flagship programme: Finnish Center for Artificial Intelligence (FCAI). Publisher Copyright: © 2022 Association for Computational Linguistics.
dc.description.abstractWe provide a study of how induced model sparsity can help achieve compositional generalization and better sample efficiency in grounded language learning problems. We consider simple language-conditioned navigation problems in a grid world environment with disentangled observations. We show that standard neural architectures do not always yield compositional generalization. To address this, we design an agent that contains a goal identification module that encourages sparse correlations between words in the instruction and attributes of objects, composing them together to find the goal. The output of the goal identification module is the input to a value iteration network planner. Our agent maintains a high level of performance on goals containing novel combinations of properties even when learning from a handful of demonstrations. We examine the internal representations of our agent and find the correct correspondences between words in its dictionary and attributes in the environment.en
dc.description.versionPeer revieweden
dc.format.extent13
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationSpilsbury, S & Ilin, A 2022, Compositional Generalization in Grounded Language Learning via Induced Model Sparsity. in NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, Proceedings of the Student Research Workshop. Association for Computational Linguistics, pp. 143-155, Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Seattle, Washington, United States, 10/07/2022. https://doi.org/10.18653/v1/2022.naacl-srw.19en
dc.identifier.doi10.18653/v1/2022.naacl-srw.19en_US
dc.identifier.isbn978-1-955917-73-5
dc.identifier.otherPURE UUID: 72a1c239-b0c8-4a6f-8ce9-df62d328b88een_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/72a1c239-b0c8-4a6f-8ce9-df62d328b88een_US
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85137561372&partnerID=8YFLogxK
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/88982651/Compositional_Generalization_in_Grounded_Language_Learning_via_Induced_Model_Sparsity.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/117228
dc.identifier.urnURN:NBN:fi:aalto-202210196016
dc.language.isoenen
dc.relation.ispartofConference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologiesen
dc.relation.ispartofseriesNAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Student Research Workshopen
dc.relation.ispartofseriespp. 143-155en
dc.rightsopenAccessen
dc.titleCompositional Generalization in Grounded Language Learning via Induced Model Sparsityen
dc.typeA4 Artikkeli konferenssijulkaisussafi
dc.type.versionpublishedVersion

Files