Interactive Groupwise Comparison for Reinforcement Learning from Human Feedback

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorKompatscher, Jan
dc.contributor.authorShi, Danqing
dc.contributor.authorVarni, Giovanna
dc.contributor.authorWeinkauf, Tino
dc.contributor.authorOulasvirta, Antti
dc.contributor.departmentDepartment of Information and Communications Engineeringen
dc.contributor.departmentELLIS Instituteen
dc.contributor.groupauthorUser Interfacesen
dc.contributor.groupauthorHelsinki Institute for Information Technology (HIIT)en
dc.contributor.organizationDepartment of Information and Communications Engineering
dc.contributor.organizationBEC-INFM
dc.contributor.organizationKTH Royal Institute of Technology
dc.date.accessioned2025-12-02T07:40:13Z
dc.date.available2025-12-02T07:40:13Z
dc.date.issued2025
dc.descriptionPublisher Copyright: © 2025 The Author(s). Computer Graphics Forum published by Eurographics - The European Association for Computer Graphics and John Wiley & Sons Ltd. | openaire: EC/HE/101141916/EU//Artificial User
dc.description.abstractReinforcement learning from human feedback (RLHF) has emerged as a key enabling technology for aligning AI behaviour with human preferences. The traditional way to collect data in RLHF is via pairwise comparisons: human raters are asked to indicate which one of two samples they prefer. We present an interactive visualisation that better exploits the human visual ability to compare and explore whole groups of samples. The interface is comprised of two linked views: 1) an exploration view showing a contextual overview of all sampled behaviours organised in a hierarchical clustering structure; and 2) a comparison view displaying two selected groups of behaviours for user queries. Users can efficiently explore large sets of behaviours by iterating between these two views. Additionally, we devised an active learning approach suggesting groups for comparison. As shown by our evaluation in six simulated robotics tasks, our approach increases the final rewards by 69.34%. It leads to lower error rates and better policies. We open-source the code that can be easily integrated into the RLHF training loop, supporting research on human–AI alignment.en
dc.description.versionPeer revieweden
dc.format.mimetypeapplication/pdf
dc.identifier.citationKompatscher, J, Shi, D, Varni, G, Weinkauf, T & Oulasvirta, A 2025, 'Interactive Groupwise Comparison for Reinforcement Learning from Human Feedback', Computer Graphics Forum. https://doi.org/10.1111/cgf.70290en
dc.identifier.doi10.1111/cgf.70290
dc.identifier.issn0167-7055
dc.identifier.issn1467-8659
dc.identifier.otherPURE UUID: bfe5bd78-aa32-49ef-bfc8-7ebedd3943bc
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/bfe5bd78-aa32-49ef-bfc8-7ebedd3943bc
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/201642771/Interactive_Groupwise_Comparison_for_Reinforcement_Learning_from_Human.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/140813
dc.identifier.urnURN:NBN:fi:aalto-202512028958
dc.language.isoenen
dc.publisherWiley
dc.relationinfo:eu-repo/grantAgreement/EC/HE/101141916/EU//Artificial User
dc.relation.fundinginfoJ.K., S.D., and A.O. received support from the Research Council of Finland (FCAI: 328400, 345604, 341763; Subjective Functions: 357578) and the ERC (AdG project Artificial User: 101141916). TW was supported by the Swedish e‐Science Research Centre, or SeRC. AO acknowledges the research environment provided by ELLIS Institute Finland.
dc.relation.ispartofseriesComputer Graphics Forumen
dc.rightsopenAccessen
dc.rightsCC BY-NC
dc.rights.urihttps://creativecommons.org/licenses/by-nc/4.0/
dc.subject.keywordhuman–computer interfaces,visualisation
dc.subject.keywordinteraction
dc.subject.keywordvisual analytics
dc.titleInteractive Groupwise Comparison for Reinforcement Learning from Human Feedbacken
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Interactive_Groupwise_Comparison_for_Reinforcement_Learning_from_Human.pdf
Size:
1.42 MB
Format:
Adobe Portable Document Format