Hybird prompt optimization without fine-tuning: Enhancing information extraction and translation in vehicle maintenance records

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorHedman, Anders
dc.contributor.advisorLi, Haibo
dc.contributor.authorWu, Zhiyuan
dc.contributor.schoolSähkötekniikan korkeakoulufi
dc.contributor.schoolSchool of Electrical Engineeringen
dc.contributor.supervisorZhou, Quan
dc.date.accessioned2025-08-19T17:11:21Z
dc.date.available2025-08-19T17:11:21Z
dc.date.issued2025-06-10
dc.description.abstractUnstructured, multilingual maintenance logs hamper automated fault analysis, yet fully fine-tuning large language models (LLMs) is prohibitively costly. We therefore adapt a 70 B-parameter LLaMA-33 model without gradient updates using a two-step Hybrid Prompt Optimization (HPO) scheme that merges expert-written instructions with DSPy’s automatic prompt search. The approach is trained and tested on 1000 real, Volvo Trucks records while running only on CPU. For evaluation we draw 10 disjoint test sets of 50 records each; scores reported here are the mean of those ten runs. Relative to a zero-shot baseline, manual prompts raise structured-field extraction accuracy by 23 %; DSPy adds 27 %; and HPO supplies a further 5 %, yielding a cumulative gain of ≈ 60 %. For full-text translation the same steps deliver 55 %, +2 %, and +2 %, respectively. Paired two-tailed t-tests across the 10 × 50 predictions confirm that all improvements remain significant at α = 0.05. Under the best prompt, the lightweight LLaMA-33-70B reaches 92 % of GPT-4o’s extraction quality and slightly surpasses it in translation—yet consumes only a fraction of the compute budget. These findings show that prompt engineering alone can unlock near–state-of-the-art multilingual extraction and translation, delivering a low-cost alternative to full model fine-tuning for industrial after-sales data.en
dc.format.extent76
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/138120
dc.identifier.urnURN:NBN:fi:aalto-202508196349
dc.language.isoenen
dc.locationP1fi
dc.programmeMaster's Programme in ICT Innovationen
dc.programme.majorAutonomous Systemsen
dc.subject.keywordprompt engineeringen
dc.subject.keywordlarge language modelsen
dc.subject.keywordhybrid optimizationen
dc.subject.keywordinformation extractionen
dc.subject.keywordmultilingual translationen
dc.subject.keywordautomotive industryen
dc.titleHybird prompt optimization without fine-tuning: Enhancing information extraction and translation in vehicle maintenance recordsen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Wu_Zhiyuan_2025.pdf
Size:
3.81 MB
Format:
Adobe Portable Document Format