A business news event detection algorithm with an application to the forest industry

dc.contributorAalto Universityen
dc.contributorAalto-yliopistofi
dc.contributor.advisorMalo, Pekka
dc.contributor.authorNguyen, Khang
dc.contributor.departmentTieto- ja palvelujohtamisen laitosfi
dc.contributor.schoolKauppakorkeakoulufi
dc.contributor.schoolSchool of Businessen
dc.date.accessioned2021-06-13T16:00:31Z
dc.date.available2021-06-13T16:00:31Z
dc.date.issued2021
dc.description.abstractThe forest industry is an important industry that generates billions of euros and employs millions of workers. However, it lacks a particular type of business intelligence enjoyed by other industries, namely the extraction of knowledge from online articles. Despite many studies on this subject, no relevant study exists for the forestry industry due to the lack of a usable dataset. This thesis proposes an event detection algorithm for online articles that can be applied to both general business news and forest industry news. To that end, three research questions are examined. Firstly, the creation of a robust dataset that is inclusive of forest industry news. Secondly, establishing the feasibility of building an event detection algorithm to recognize and classify both general business and forest industry news. Lastly, proposing an optimally performing model for the said algorithm. To build an event detection algorithm, machine learning methods, particularly natural language processing, are used. The proposed solution comprises contextualized word embeddings and a classification model. Those word embeddings are created with BERT, a state-of-the-art model for text handling from Google. For model performance tuning, one approach is implemented to address the class imbalance problem. The evaluation shows that the proposed solution delivers a strong result, which indicates promising practical implementations in the forest industry. Companies in the industry should be potentially able to enjoy an aspect of business intelligence that has been employed in other industries. This thesis is the first to empirically examine the links between online news articles, events detection, and the forest industry. The thesis’s contributions are twofold. First, the thesis provides an annotated dataset for use with different machine learning methods. Secondly, it complements literature on the feasibility of an event detection algorithm applicable to both business and forestry industry news.en
dc.format.extent83+2
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/108056
dc.identifier.urnURN:NBN:fi:aalto-202106137315
dc.language.isoenen
dc.locationP1 Ifi
dc.programmeInformation and Service Management (ISM)en
dc.subject.keywordforest industry newsen
dc.subject.keywordbusiness newsen
dc.subject.keywordnatural language processingen
dc.subject.keywordmachine learningen
dc.subject.keywordclassificationen
dc.subject.keywordevent detectionen
dc.titleA business news event detection algorithm with an application to the forest industryen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotMaisterin opinnäytefi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
master_Nguyen_Khang_2021.pdf
Size:
2.18 MB
Format:
Adobe Portable Document Format