Hyperspectral images of grapevine leaves including healthy leaves and leaves with biotic and abiotic symptoms

Ryckewaert, Maxime; Héran, Daphné; Trani, Jean-Philippe; Mas-Garcia, Silvia; Feilhes, Carole; Prezman, Fanny; Serrano, Eric; Bendoula, Ryad

doi:10.1038/s41597-023-02642-w

Download PDF

Data Descriptor
Open access
Published: 26 October 2023

Hyperspectral images of grapevine leaves including healthy leaves and leaves with biotic and abiotic symptoms

Maxime Ryckewaert^1,2,
Daphné Héran¹,
Jean-Philippe Trani¹,
Silvia Mas-Garcia¹,
Carole Feilhes³,
Fanny Prezman³,
Eric Serrano³ &
…
Ryad Bendoula¹

Scientific Data volume 10, Article number: 743 (2023) Cite this article

1359 Accesses
Metrics details

Subjects

Abstract

A hyperspectral imaging database was collected on two hundred and five grape plant leaves. Leaves were measured with a hyperspectral camera in the visible/near infrared spectral range under controlled conditions. This dataset contains hyperspectral acquisition of grape leaves of seven different varieties. For each variety, acquisitions were performed on healthy leaves and leaves with foliar symptoms caused by different grapevine diseases showing clear symptoms of biotic or abiotic stress on other organs. For each leaf, chemical measurements such as chlorophyll and flavonol contents were also performed.

Automatic detection of methane emissions in multispectral satellite imagery using a vision transformer

Article Open access 14 May 2024

Climate change impacts and adaptations of wine production

Article 26 March 2024

Composition and metabolism of microbial communities in soil pores

Article Open access 27 April 2024

Background & Summary

In a context of population increase, reduction of arable lands and global climatic change, agricultural production still needs to be increased and secured in a durable way with respect to the environment. Improving plant production for food and feed is one of the main challenges for the years to come. To meet this challenge, one of the ways is the early and precise identification of biotic and abiotic symptoms on plant leaves (water stress, disease, or bacteria).

Optical instruments and especially multispectral (MSI) and hyperspectral imaging (HSI) are relevant tools for the automated and non-invasive detection of biotic and abiotic symptoms^1,2,3,4,5.

HSI is highly informative because it has both high spectral and spatial resolutions leading to an increased use in agricultural applications in recent years. For example, new case studies have emerged in the field of plant breeding⁶, crop monitoring such as water stress detection⁷, maturity monitoring⁸, disease detection^9,10 or prediction of biochemical traits^11,12.

Although these various studies are promising, the use of HSI forces current analysis methods to be rethought to exploit this rich amount of information^6,13,14. The training dataset should be as diverse as possible to enable the learning algorithm to be robust. This is particularly true with biological samples which are complex media containing a high variability of response. The data we proposed is designed to be this new kind of HSI library for an application in agriculture and more specifically for identification and detection of different stress symptoms at the grapevine leaf level. For this purpose, a spectral imaging database¹⁵ was collected from two hundred and four grapevine leaves. The leaves were measured with a hyperspectral camera in the visible/near infrared spectral range under controlled conditions. For each leaf, foliar contents (chlorophyll, epidermal flavonol and nitrogen) were measured. This dataset contains the reflectance spectra of grape leaves of seven different varieties. For each variety, acquisitions were performed on healthy leaves and leaves with foliar symptoms caused by different grapevine diseases showing clear symptoms of biotic or abiotic stress.

A subset of this database has already been used to produce a new method of HSI data processing. We have recently published this new approach⁹. In this work, combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) is proposed to detect the Flavescence dorée grapevine disease from hyperspectral imaging. By making this data available to other researchers, we hope to encourage them to do similar work and proposed new algorithms.

Methods

Samples and analyses

Leaves were collected during September 2020, in the south of France (GPS coordinates: 43.84208931745156, 1.8538190583140841). Infected leaves were chosen in order to represent at best the variability of the available symptoms in terms of severity and stage of infection. A similar proportion of the number of leaves of both red and white varieties was collected for this experiment. In total two hundred and four leaves were collected in the fields. All information about leaves and their respective symptoms is summarized in Tables 1–3.

Table 1 Number of healthy and infected leaves per variety.

Full size table

Table 2 Number of images per symptom.

Full size table

Table 3 Number of images per symptom per variety.

Full size table

Each leaf and each vine from which it was extracted were diagnosed by a phytopathology expert. Leaves were extracted from the front face, in the middle of the canopy to avoid the younger and older organs which can present a different physiological behaviour. Regarding healthy leaves, they were selected in the same regions and they were asserted absent of any symptom. However, some of the healthy sample can exhibit slight forms of mechanical or chemical wounds (due to protection, management operations) and some slight damage caused by insects. In order to guarantee that leaf physiological status were not affected by the time delay between collection and acquisition, leaves were carried in controlled temperature and hydric conditions.

Foliar content measurements

For each leaf, foliar content measurements (see Figs. 1–3) were made before sampling. These measurements were carried out with a Dualex 311 scientific+ TM (Force-A, Orsay, France) to provide chlorophyll a + b content (µg/cm2), epidermal flavonols content (in % of relative absorbance) and the crop nitrogen status index (NBI).

Hyperspectral image acquisition

Hyperspectral images were acquired on each individual leaf under controlled conditions in laboratory. Acquisitions of leaf images were performed with a hyperspectral camera (IQ, Specim, Finland). Imaging of grapevine leaves was carried out in the spectral range of 400–900 nm, with a spectral resolution of 7 nm. Illumination was provided by a halogen lamp (Arrilite 750 Plus ARRI, Munich, Germany) and constant angles of −50° and 50° were maintained between the axes of the halogen lamp and the axis of the hyperspectral camera.

For each sample image, the intensity of the reflected light I(λ) was measured. The dark current I_d(λ) i.e. signal without light, was recorded for each acquisition and then subtracted. The intensity I₀(λ) of the light reflected by a certified standard reference (Labsphere, SRS-40-010) was measured to standardise spectra and to prevent from non-linearities of all the instrumentation components (light source, lens, fibers and spectrometer). From these measurements, a reflectance image R(λ) was calculated for each sample, as follows:

$$R(\lambda )=\frac{I(\lambda )-{I}_{d}(\lambda )}{{I}_{0}(\lambda )-{I}_{d}(\lambda )}$$

(1)

Where λ is the wavelength, I(λ) and I₀(λ) are the images of the reflected light intensity from the sample and from the reference respectively, and I_d(λ) is the image in dark

Data Records

The dataset, mentioned scripts and algorithms are available in the INRAE data repository¹⁵.

This dataset contains two table files (description.csv and description_variables.csv), a folder called Data/ and another folder called Code/. The first table file (description.csv) contains the experiment factors and the reference measurements. The second table file (description_variables.csv) contains information about variables used in the first table file.

The folder ‘Data/’ contains 204 folders corresponding to 204 hyperspectral image acquisitions. In this data directory (see Fig. 4), each folder is named with the acquisition date followed by the acquisition number (YYYY-MM-DD_NBR). Reflectance image files are located in the ‘results’ directories and the acquisition date and acquisition number are specified in the file name (REFLECTANCE_YYYY-MM-DD_NBR.dat). These reflectance images are stored in ENVI format containing binary data (.dat) and header file (.hdr). Each reflectance image (.dat) is around 214 MB. The first and the second dimensions correspond to a spatial position (pixels) forming the image composed of 512 × 512 pixels (see Fig. 5). The third dimension refers to spectral variables with two hundred and four spectral bands.

For each image acquisition, raw image, white reference and dark measurements are available in the directory named ‘capture’. All these raw data are also stored in ENVI format.

For each hyperspectral acquisition, a metadata file is produced containing information about the acquisition. In this metadata file, an identification key called global_tag allows the image to be linked to factors in the experimentation or to reference values.

Technical Validation

We analysed part of this dataset in a first publication to classify images of diseased (flavescence dorée) and healthy leaves⁹. In this study, we proposed a methodology based on multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA). In this publication we tried to classify each leaf pixel for each image. For the total pixels to be classified (both infected and healthy pixels), the classification rate achieved 85.1%. Another classification result at the leaf (image) level was also investigated. The classification of an image was obtained by counting the majority class among the image pixels. Out of the thirty seven test images, only two images were misclassified with this method.

The aforementioned publication was based on a subset of the whole dataset. Indeed, the exploitation of this quantity of spectral images to differentiate as many symptoms is a real challenge. Due to the complexity of this dataset and the difficulty of providing masks for each symptom, for initial exploration the average spectrum of the database (see Fig. 6) is calculated, as well as the average spectra per variety (see Fig. 7) and per symptom (see Fig. 8). Then an exploration of the average spectra per leaf is performed by a Principal Component Analysis (PCA).

Mean spectrum and standard deviation of the entire database

The mean spectrum and standard deviation are calculated from all spectra of the two hundred and four measured leaves (see Fig. 6). The average spectrum is typical of a vegetation spectrum. Low values between 400 nm and 500 nm are mainly related to carotenoid and chlorophyll (a + b) contents. The characteristic large peak around 550 nm is attributed to the anthocyanin content. The spectral region between 620 nm and 680 nm is related to the chlorophyll content of the leaves. The red edge between 680 and 750 nm is also typical of vegetation, separates the visible spectral region related to pigments and the plateau between 750 and 1000 nm related to the leaf structure.

Average spectra per variety and per symptom

From this database, average spectra are calculated per variety (see Fig. 7) and per symptom (see Fig. 8). Out of the seven average spectra per variety (see Fig. 7), two spectrum shapes are identified in the spectral region between 500 nm to 700 nm. The three spectra corresponding to ‘Duras’, ‘Fer’ and ‘Gamay’ varieties have lower values around 550 nm while the spectra corresponding to ‘Chardonnay’, ‘Colombard’, ‘Loin de l’œil’ and ‘Mauzac’ have higher values. This difference seems to be related to the anthocyanin content in leaves depending on whether the variety is red or white.

Figure 8 displays the average spectrum for each of the twelve symptoms. The spectrum corresponding to the healthy leaf modality shows the same similarities of a typical vegetation spectrum as described above (see Fig. 6). Although for each symptom the spectra are averaged across all grape varieties, differences are noticeable. For example, ‘deficiency’ and ‘chlorosis’ symptoms differ from other symptoms with higher values from 500 nm to 650 nm. Two other symptoms (‘buffalo treehopper’ and ‘water stress’) also differ in the same spectral range but with lower values. The differences between the average spectra between 600 nm and 700 nm, as well as the dynamics of the red-edge or the shape of the plateau would require further processing.

Principal component analysis

For each image, an average spectrum was calculated from the leaf pixels. Then a principal component analysis was performed on these two hundred and four average spectra. Figure 9 shows PCA scores obtained for the two first components. A few combinations (variety, symptoms) show a particular behaviour on this score plot. For example, scores of ‘Chardonnay’ combined with ‘flavescence dorée’ have positive scores on both axes and are opposite to the negative scores of healthy ‘Chardonnay’ modality. For other varieties and symptoms, scores are more evenly distributed along the two axes. This is can be explained by the preponderance of the ‘flavescence dorée’ and ‘heatlhy’ observations. Another notable observation is the clear distinction of some observations from the rest of the group, such as ‘senescence’ combined with the ‘Duras’ and ‘Mauzac’ variety on PC1 and ‘senescence’ combined with ‘Loin de l’œil’ variety on PC2.

These results should be considered in relation to the loadings of the principal components concerned (see Figs. 10, 11). The first component corresponds to an inverted overall shape of the spectra (see Fig. 6) which could correspond to the total amount of signal received by the camera. The second component shows loadings of positive values between 400 nm and 600 nm with a strong positive value in the 550 nm region which is related to anthocyanin content. This technical validation was only carried out on the first two principal components of a PCA. The availability of this dataset would allow further study through other principal components or even more generally using other methods. This dataset offers great perspectives for further study, such as classification capabilities according to confounding factors, assessment of spectral variability of symptoms according to variety or improvement of the labelling process by selecting only symptomatic areas of the leaf.

Usage Notes

There are many advantages to this database. Firstly, it provides hyperspectral images covering a multitude of grapevine symptoms, including different grape varieties. One of the benefits of this dataset lies in the possibility of developing new analysis methods. On a more practical level, it will be used to study the potential of hyperspectral imaging to detect the symptoms proposed and to identify confounding factors. In addition, measurements are carried out under controlled conditions, guaranteeing the reliability and accuracy of the data collected. In particular, measurements are carried out on whole leaves, which generates an abundance of pixels available.

This dataset has certain limitations. Firstly, there may be an imbalance between the number of images available for each grapevine symptom, which could potentially bias the results. Another limitation is that the small number of images available can be limiting for deep learning approaches.

Code availability

A Python code example is provided in the ‘Code’ folder. This example helps the reader to understand how to open hyperspectral images, to extract spectra, or to display the results obtained after a Principal Component Analysis (PCA) applied on the first image and a PCA on all average spectra where one average spectrum is obtained per leaf/image.

References

Mahlein, A.-K., Steiner, U., Hillnhutter, C., Dehne, H.-W. & Oerke, E.-C. Hyperspectral imaging for small-scale analysis of symptoms caused by different sugar beet diseases. Plant Methods 8, 3, https://doi.org/10.1186/1746-4811-8-3 (2012).
Article PubMed PubMed Central Google Scholar
Sankaran, S., Mishra, A., Ehsani, R. & Davis, C. A review of advanced techniques for detecting plant diseases. Comput. Electron. Agric. 72, 1–13, https://doi.org/10.1016/j.compag.2010.02.007 (2010).
Article Google Scholar
AL-Saddik, H., Simon, J.-C. & Cointault, F. Development of Spectral Disease Indices for ‘Flavescence Doree’ Grapevine Disease Identification. Sensors 17, 2772, https://doi.org/10.3390/s17122772 (2017).
Article PubMed PubMed Central Google Scholar
Albetis, J. et al. On the Potentiality of UAV Multispectral Imagery to Detect Flavescence doree and Grapevine Trunk Diseases. Remote. Sens. 11, 23, https://doi.org/10.3390/rs11010023 (2018).
Article ADS Google Scholar
Sanaeifar, A. et al. Proximal hyperspectral sensing of abiotic stresses in plants. Sci. The Total. Environ. 160652. https://doi.org/10.1016/j.scitotenv.2022.160652 (2022).
Ryckewaert, M. et al. Massive spectral data analysis for plant breeding using parSketch-PLSDA method: Discrimination of sunflower genotypes. Biosyst. Eng. 210, 69–77, https://doi.org/10.1016/j.biosystemseng.2021.08.005 (2021).
Article CAS Google Scholar
Ryckewaert, M. et al. Physiological variable predictions using VIS–NIR spectroscopy for water stress detection on grapevine: Interest in combining climate data using multiblock method. Comput. Electron. Agric. 197, 106973, https://doi.org/10.1016/j.compag.2022.106973 (2022).
Article Google Scholar
Courand, A. et al. Evaluation of a robust regression method (RoBoost-PLSR) to predict biochemical variables for agronomic applications: Case study of grape berry maturity monitoring. Chemom. Intell. Lab. Syst. 221, 104485, https://doi.org/10.1016/j.chemolab.2021.104485 (2022).
Article CAS Google Scholar
Mas Garcia, S. et al. Combination of multivariate curve resolution with factorial discriminant analysis for the detection of grapevine diseases using hyperspectral imaging. A case study: flavescence doree. The Analyst 10.1039.D1AN01735G. https://doi.org/10.1039/D1AN01735G (2021).
Abdelghafour, F. et al. Including measurement effects and temporal variations in VIS-NIRS models to improve early detection of plant disease: Application to Alternaria solani in potatoes. Comput. Electron. Agric. 211, 107947, https://doi.org/10.1016/j.compag.2023.107947 (2023).
Article Google Scholar
Liu, H., Bruning, B., Garnett, T. & Berger, B. The Performances of Hyperspectral Sensors for Proximal Sensing of Nitrogen Levels in Wheat. Sensors 20, 4550. https://doi.org/10.3390/s20164550. Number: 16 Publisher: Multidisciplinary Digital Publishing Institute (2020).
Rubio-Delgado, J., Perez, C. J. & Vega-Rodriguez, M. A. Predicting leaf nitrogen content in olive trees using hyperspectral data for precision agriculture. Precis. Agric. 22, 1–21, https://doi.org/10.1007/s11119-020-09727-1 (2021).
Article CAS Google Scholar
Mishra, P. et al. A generic workflow combining deep learning and chemometrics for processing close-range spectral images to detect drought stress in Arabidopsis thaliana to support digital phenotyping. Chemom. Intell. Lab. Syst. 216, 104373, https://doi.org/10.1016/j.chemolab.2021.104373 (2021).
Article CAS Google Scholar
Gaci, B. et al. A novel approach to combine spatial and spectral information from hyperspectral images. Chemom. Intell. Lab. Syst. 240, 104897, https://doi.org/10.1016/j.chemolab.2023.104897 (2023).
Article CAS Google Scholar
Ryckewaert, M. Hyperspectral images of grape leaves including healthy leaves and leaves with biotic and abiotic symptoms. Recherche Data Gouv, https://doi.org/10.57745/WW7TY7 (2023).

Download references

Acknowledgements

This work was supported by the INTERREG SUDOE SOE3/P2/E0911 Viniot project.

Author information

Authors and Affiliations

ITAP, Univ Montpellier, INRAE, Institut Agro, Montpellier, France
Maxime Ryckewaert, Daphné Héran, Jean-Philippe Trani, Silvia Mas-Garcia & Ryad Bendoula
Inria, LIRMM, University of Montpellier, CNRS, Montpellier, France
Maxime Ryckewaert
IFV, 1920 Route de Lisle-sur-Tarn, 81310, Peyrole, France
Carole Feilhes, Fanny Prezman & Eric Serrano

Authors

Maxime Ryckewaert
View author publications
You can also search for this author in PubMed Google Scholar
Daphné Héran
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Philippe Trani
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Mas-Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Carole Feilhes
View author publications
You can also search for this author in PubMed Google Scholar
Fanny Prezman
View author publications
You can also search for this author in PubMed Google Scholar
Eric Serrano
View author publications
You can also search for this author in PubMed Google Scholar
Ryad Bendoula
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.F., F.P., E.S., M.R. and R.B. conceived the experiment(s), C.F., F.P. and E.S.. conducted the experiment(s), S.M.G., M.R. and D.H. analysed the results. All authors reviewed the manuscript.

Corresponding author

Correspondence to Maxime Ryckewaert.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ryckewaert, M., Héran, D., Trani, JP. et al. Hyperspectral images of grapevine leaves including healthy leaves and leaves with biotic and abiotic symptoms. Sci Data 10, 743 (2023). https://doi.org/10.1038/s41597-023-02642-w

Download citation

Received: 06 February 2023
Accepted: 13 October 2023
Published: 26 October 2023
DOI: https://doi.org/10.1038/s41597-023-02642-w