The laser-sensing real-time bioaerosol detector Rapid-E (Plair SA) produces data prone to devicespecific noise due to laser and detector sensibility, which is critical when transferring classification models between different devices (Matavulj et al. 2021). This study implements convolutional neural networks (CNNs) for pollen classification and tests how models trained with a combination of different datasets perform at two locations. Materials and Methods The study was conducted in San Michele all'Adige, Italy, and Novi Sad, Serbia. Taxonomically determined monospecific pollen samples were collected and injected into the devices to obtain reference datasets used for classification. A multi-modal CNN architecture was applied for pollen classification on one year of Rapid-E data, obtaining daily pollen concentrations which were then compared with the concentrations obtained by light microscopy analysis on Hirst-type volumetric pollen samples. CNNs were trained with different combinations of reference datasets to test if the models were transferable from one device to the other. All networks were additionally trained with the domain adaptation technique (Ganin and Lempinsky 2015) to exclude the device-specific noise. Results and Discussion Models lost performance when trained on data from one and tested on another device. We showed that data of the same pollen taxon from the two devices are more different than data of distinct pollen taxa from one device. Combining all available reference data in a single model enabled the classification of a higher number of pollen taxa in both study locations. The domain adaptation technique improved the performance of transferred models for several pollen taxa. Conclusions CNNs recognize significant differences in data from the two devices which can be solved by combining reference datasets from both locations. When a reference dataset is not available, the domain adaptation can improve the performance of models trained on data from other devices.
Matavulj, P.; Cristofori, A.; Cristofolini, F.; Gottardini, E.; Brdar, S.; Sikoparija, B. (2022). Integration of data from different rapid e-devices supports pollen classification in more locations. In: One health Paestum 2022: 5th MedPalyos Symposium. 16th AIA Congress (Italian Society of Aerobiology Environment and Medicine) “Aerobiologia4.0”, Paestum, 13-18 settembre 2022. Firenze: 93-94. url: https://eventi.infomed-online.it/one-health-2022/ handle: https://hdl.handle.net/10449/78455
Integration of data from different rapid e-devices supports pollen classification in more locations
Cristofori, A.;Cristofolini, F.;Gottardini, E.;
2022-01-01
Abstract
The laser-sensing real-time bioaerosol detector Rapid-E (Plair SA) produces data prone to devicespecific noise due to laser and detector sensibility, which is critical when transferring classification models between different devices (Matavulj et al. 2021). This study implements convolutional neural networks (CNNs) for pollen classification and tests how models trained with a combination of different datasets perform at two locations. Materials and Methods The study was conducted in San Michele all'Adige, Italy, and Novi Sad, Serbia. Taxonomically determined monospecific pollen samples were collected and injected into the devices to obtain reference datasets used for classification. A multi-modal CNN architecture was applied for pollen classification on one year of Rapid-E data, obtaining daily pollen concentrations which were then compared with the concentrations obtained by light microscopy analysis on Hirst-type volumetric pollen samples. CNNs were trained with different combinations of reference datasets to test if the models were transferable from one device to the other. All networks were additionally trained with the domain adaptation technique (Ganin and Lempinsky 2015) to exclude the device-specific noise. Results and Discussion Models lost performance when trained on data from one and tested on another device. We showed that data of the same pollen taxon from the two devices are more different than data of distinct pollen taxa from one device. Combining all available reference data in a single model enabled the classification of a higher number of pollen taxa in both study locations. The domain adaptation technique improved the performance of transferred models for several pollen taxa. Conclusions CNNs recognize significant differences in data from the two devices which can be solved by combining reference datasets from both locations. When a reference dataset is not available, the domain adaptation can improve the performance of models trained on data from other devices.File | Dimensione | Formato | |
---|---|---|---|
2022 One Health Cristofolini.pdf
accesso aperto
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
1.12 MB
Formato
Adobe PDF
|
1.12 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.