The Role of ImageNet Classes in Fréchet Inception Distance
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Kynkäänniemi, Tuomas | en_US |
| dc.contributor.author | Karras, Tero | en_US |
| dc.contributor.author | Aittala, Miika | en_US |
| dc.contributor.author | Aila, Timo | en_US |
| dc.contributor.author | Lehtinen, Jaakko | en_US |
| dc.contributor.department | Department of Computer Science | en |
| dc.contributor.groupauthor | Professorship Lehtinen Jaakko | en |
| dc.contributor.groupauthor | Computer Science Professors | en |
| dc.contributor.groupauthor | Computer Science - Visual Computing (VisualComputing) - Research area | en |
| dc.contributor.groupauthor | Computer Science - Artificial Intelligence and Machine Learning (AIML) - Research area | en |
| dc.contributor.groupauthor | Helsinki Institute for Information Technology (HIIT) | en |
| dc.contributor.organization | Nvidia | en_US |
| dc.date.accessioned | 2023-12-11T09:29:06Z | |
| dc.date.available | 2023-12-11T09:29:06Z | |
| dc.date.issued | 2023-05-01 | en_US |
| dc.description | openaire: EC/H2020/866435/EU//PIPE | |
| dc.description.abstract | Fréchet Inception Distance (FID) is the primary metric for ranking models in data-driven generative modeling. While remarkably successful, the metric is known to sometimes disagree with human judgement. We investigate a root cause of these discrepancies, and visualize what FID "looks at" in generated images. We show that the feature space that FID is (typically) computed in is so close to the ImageNet classifications that aligning the histograms of Top-N classifications between sets of generated and real images can reduce FID substantially -- without actually improving the quality of results. Thus, we conclude that FID is prone to intentional or accidental distortions. As a practical example of an accidental distortion, we discuss a case where an ImageNet pre-trained FastGAN achieves a FID comparable to StyleGAN2, while being worse in terms of human evaluation. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 26 | |
| dc.identifier.citation | Kynkäänniemi, T, Karras, T, Aittala, M, Aila, T & Lehtinen, J 2023, The Role of ImageNet Classes in Fréchet Inception Distance. in 11th International Conference on Learning Representations (ICLR 2023). Curran Associates Inc., International Conference on Learning Representations, Kigali, Rwanda, 01/05/2023. <https://arxiv.org/abs/2203.06026> | en |
| dc.identifier.isbn | 9781713899259 | |
| dc.identifier.other | PURE UUID: 06da1068-4c5e-46aa-878d-4eff54b8c2d4 | en_US |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/06da1068-4c5e-46aa-878d-4eff54b8c2d4 | en_US |
| dc.identifier.other | PURE LINK: https://www.proceedings.com/75096.html | en_US |
| dc.identifier.other | PURE LINK: https://arxiv.org/abs/2203.06026 | en_US |
| dc.identifier.other | PURE LINK: https://openreview.net/forum?id=4oXTQ6m_ws8 | en_US |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/124767 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202312117135 | |
| dc.language.iso | en | en |
| dc.relation | info:eu-repo/grantAgreement/EC/H2020/866435/EU//PIPE | en_US |
| dc.relation.ispartof | International Conference on Learning Representations | en |
| dc.relation.ispartofseries | 11th International Conference on Learning Representations (ICLR 2023) | en |
| dc.rights | openAccess | en |
| dc.title | The Role of ImageNet Classes in Fréchet Inception Distance | en |
| dc.type | A4 Article in conference proceedings | en |