TAIGA: a novel dataset for multitask learning of continuous and categorical forest variables from hyperspectral imagery
Loading...
Access rights
openAccess
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2022
Department
Major/Subject
Mcode
Degree programme
Language
en
Pages
12
Series
IEEE Transactions on Geoscience and Remote Sensing, Volume 60
Abstract
The spectral and spatial resolutions of modern optical Earth observation data are continuously increasing. To fully utilize the data, integrate them with other information sources, and create applications relevant to real-world problems, extensive training data are required. We present TAIGA, an open dataset including continuous and categorical forestry data, accompanied by airborne hyperspectral imagery with a pixel size of 0.7 m. The dataset contains over 70 million labeled pixels belonging to more than 600 forest stands. To establish a baseline on TAIGA dataset for multitask learning, we trained and validated a convolutional neural network to simultaneously retrieve 13 forest variables. Due to the size of the imagery, the training and testing sets were independent, with strictly no overlap for patches up to 45 x 45 pixels. Our retrieval results show that including both spectral and textural information improves the accuracy of mapping key boreal forest structural characteristics, compared with an earlier study including only spectral information from the same image. TAIGA responds to the increased availability of hyperspectral and very high resolution imagery, and includes the forestry variables relevant for forestry and environmental applications. We propose the dataset as a new benchmark for spatial-spectral methods that overcomes the limitations of widely used small-scale hyperspectral datasets.Description
Publisher Copyright: Author
Keywords
boreal forest, convolutional neural networks, Data models, Deep learning, Forestry, Hyperspectral imaging, multitask learning, Soil, Spatial resolution, Training
Other note
Citation
Mottus, M, Pham, P, Halme, E, Molinier, M, Cu, H & Laaksonen, J 2022, ' TAIGA: a novel dataset for multitask learning of continuous and categorical forest variables from hyperspectral imagery ', IEEE Transactions on Geoscience and Remote Sensing, vol. 60 . https://doi.org/10.1109/TGRS.2022.3141217