A Comparative Study of Classical and Quantum Transformer Models and Their Applications

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorRaasakka, Matti
dc.contributor.authorShah, Jayaditya
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.supervisorRaasakka, Matti
dc.date.accessioned2024-11-19T09:15:50Z
dc.date.available2024-11-19T09:15:50Z
dc.date.issued2024-09-29
dc.description.abstractTransformers are a neural network architecture that enable the utilisation of a larger context frame than traditional neural networks when training deep learning models. With the advent of the ongoing decade, transformers have enabled the application of large language models, improved computer vision, generative models and other large-scale artificial intelligence systems. This thesis investigates the potential of quantum transformers to implement deep learning tasks focusing on the Quixer model, developed by Quantinuum, by comparing a classically simulated version of Quixer to classical transformers. The thesis is motivated by the need to optimize transformer components for large-scale applications, to address the quadratic complexity arising from the self attention mechanism. The results of the thesis indicate that Quixer performs in line with the classical baseline as published in the paper by Quantinuum when reproduced with the same dataset, and model performance follows the trend for another dataset of twice the size. Hence, providing a proof of concept quantum transformers can be considered an effective method for developing large-scale models in the future with the eventual improvement of quantum hardware.en
dc.format.extent36+20
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/131672
dc.identifier.urnURN:NBN:fi:aalto-202411197190
dc.language.isoenen
dc.programmeAalto Bachelor’s Programme in Science and Technologyfi
dc.programme.majorQuantum Technologyen
dc.programme.mcodeSCI3103fi
dc.subject.keywordmachine learningen
dc.subject.keywordphysicsen
dc.subject.keywordquantum circuitsen
dc.subject.keywordquantum transformersen
dc.subject.keywordquantum informationen
dc.subject.keywordtransformer architectureen
dc.titleA Comparative Study of Classical and Quantum Transformer Models and Their Applicationsen
dc.typeG1 Kandidaatintyöfi
dc.type.dcmitypetexten
dc.type.ontasotBachelor's thesisen
dc.type.ontasotKandidaatintyöfi

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Shah_Jayaditya_2024.pdf
Size:
2.15 MB
Format:
Adobe Portable Document Format