TAIGA: a novel dataset for multitask learning of continuous and categorical forest variables from hyperspectral imagery

Loading...
Thumbnail Image

Access rights

openAccess

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Date

2022

Major/Subject

Mcode

Degree programme

Language

en

Pages

12

Series

IEEE Transactions on Geoscience and Remote Sensing, Volume 60

Abstract

The spectral and spatial resolutions of modern optical Earth observation data are continuously increasing. To fully utilize the data, integrate them with other information sources, and create applications relevant to real-world problems, extensive training data are required. We present TAIGA, an open dataset including continuous and categorical forestry data, accompanied by airborne hyperspectral imagery with a pixel size of 0.7 m. The dataset contains over 70 million labeled pixels belonging to more than 600 forest stands. To establish a baseline on TAIGA dataset for multitask learning, we trained and validated a convolutional neural network to simultaneously retrieve 13 forest variables. Due to the size of the imagery, the training and testing sets were independent, with strictly no overlap for patches up to 45 x 45 pixels. Our retrieval results show that including both spectral and textural information improves the accuracy of mapping key boreal forest structural characteristics, compared with an earlier study including only spectral information from the same image. TAIGA responds to the increased availability of hyperspectral and very high resolution imagery, and includes the forestry variables relevant for forestry and environmental applications. We propose the dataset as a new benchmark for spatial-spectral methods that overcomes the limitations of widely used small-scale hyperspectral datasets.

Description

Publisher Copyright: Author

Keywords

boreal forest, convolutional neural networks, Data models, Deep learning, Forestry, Hyperspectral imaging, multitask learning, Soil, Spatial resolution, Training

Other note

Citation

Mottus, M, Pham, P, Halme, E, Molinier, M, Cu, H & Laaksonen, J 2022, ' TAIGA: a novel dataset for multitask learning of continuous and categorical forest variables from hyperspectral imagery ', IEEE Transactions on Geoscience and Remote Sensing, vol. 60 . https://doi.org/10.1109/TGRS.2022.3141217