Adversarial robustness of GPT-3.5 Turbo
No Thumbnail Available
Files
Nguyen_Anh_2024.pdf (3.93 MB) (opens in new window)
Aalto login required (access for Aalto Staff only).
URL
Journal Title
Journal ISSN
Volume Title
Perustieteiden korkeakoulu |
Bachelor's thesis
Electronic archive copy is available locally at the Harald Herlin Learning Centre. The staff of Aalto University has access to the electronic bachelor's theses by logging into Aaltodoc with their personal Aalto user ID. Read more about the availability of the bachelor's theses.
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Authors
Date
2024-04-26
Department
Major/Subject
Data Science
Mcode
SCI3095
Degree programme
Aalto Bachelor’s Programme in Science and Technology
Language
en
Pages
34+12
Series
Abstract
The rapid advancement of artificial intelligence (AI) models has brought forth a critical need for a thorough examination of potential ethical and security concerns. However, many ethical issues regarding AI are being overlooked, including misinformation, bias, and accuracy. Within the scope of robustness, the study aims to assess the consistency of GPT’s output given the diversity of inputs. The primary objective is to construct a comprehensive framework for assessing the robustness of large language models (LLMs), with a specific emphasis on GPT 3.5. The research takes a proactive stance by developing experiments and methods using Python coding language, incorporating literature review and data analysis. This study answered the question how the algorithmic robustness of Large Language Models can, particularly GPT 3.5 turbo , be effectively assessed. By delving deeply into the assessment of robustness, the research seeks to address challenges associated with the widespread use of powerful language models, fostering a more secure, ethical, and transparent landscape in the field of artificial intelligence.Description
Supervisor
Korpi-Lagg, MaaritThesis advisor
Jung, AlexKeywords
GPT 3.5 turbo, adversarial robustness, artificial intelligence, large language model