DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention.

Bibliographic Details
Title: DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention.
Authors: Wang, Guoying1 (AUTHOR) wgy@zafu.edu.cn, Shi, Bing1 (AUTHOR) 2021611011034@stu.zafu.edu.cn, Yi, Xiaomei1 (AUTHOR) yxm@zafu.edu.cn, Wu, Peng1 (AUTHOR) wp@zafu.edu.cn, Kong, Linjun2 (AUTHOR) klj@zufe.edu.cn, Mo, Lufeng1,3 (AUTHOR) klj@zufe.edu.cn
Superior Title: Animals (2076-2615). Feb2024, Vol. 14 Issue 3, p499. 22p.
Subject Terms: *DEEP learning, *FISH locomotion, *OPTICAL reflection, *IMAGE recognition (Computer vision), *SIGNAL-to-noise ratio, *RECOGNITION (Psychology), *PROBLEM solving
Abstract: Simple Summary: Blurry scenarios often affect the clarity of fish images, posing significant challenges to deep learning models in terms of the accurate recognition of fish species. A method based on deep learning with a diffusion model and an attention mechanism, DiffusionFR, is proposed herein to improve the accuracy of fish species recognition in blurry scenarios caused by light reflections and water ripple noise. Using a self-constructed dataset, BlurryFish, extensive experiments were conducted and the results showed that the proposed two-stage diffusion network model can restore the clarity of blurry fish images to some extent and the proposed learnable attention module is effective in improving the accuracy of fish species recognition. Blurry scenarios, such as light reflections and water ripples, often affect the clarity and signal-to-noise ratio of fish images, posing significant challenges for traditional deep learning models in accurately recognizing fish species. Firstly, deep learning models rely on a large amount of labeled data. However, it is often difficult to label data in blurry scenarios. Secondly, existing deep learning models need to be more effective for the processing of bad, blurry, and otherwise inadequate images, which is an essential reason for their low recognition rate. A method based on the diffusion model and attention mechanism for fish image recognition in blurry scenarios, DiffusionFR, is proposed to solve these problems and improve the performance of species recognition of fish images in blurry scenarios. This paper presents the selection and application of this correcting technique. In the method, DiffusionFR, a two-stage diffusion network model, TSD, is designed to deblur bad, blurry, and otherwise inadequate fish scene pictures to restore clarity, and a learnable attention module, LAM, is intended to improve the accuracy of fish recognition. In addition, a new dataset of fish images in blurry scenarios, BlurryFish, was constructed and used to validate the effectiveness of DiffusionFR, combining bad, blurry, and otherwise inadequate images from the publicly available dataset Fish4Knowledge. The experimental results demonstrate that DiffusionFR achieves outstanding performance on various datasets. On the original dataset, DiffusionFR achieved the highest training accuracy of 97.55%, as well as a Top-1 accuracy test score of 92.02% and a Top-5 accuracy test score of 95.17%. Furthermore, on nine datasets with light reflection noise, the mean values of training accuracy reached a peak at 96.50%, while the mean values of the Top-1 accuracy test and Top-5 accuracy test were at their highest at 90.96% and 94.12%, respectively. Similarly, on three datasets with water ripple noise, the mean values of training accuracy reached a peak at 95.00%, while the mean values of the Top-1 accuracy test and Top-5 accuracy test were at their highest at 89.54% and 92.73%, respectively. These results demonstrate that the method showcases superior accuracy and enhanced robustness in handling original datasets and datasets with light reflection and water ripple noise. [ABSTRACT FROM AUTHOR]
Copyright of Animals (2076-2615) is the property of MDPI and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Academic Search Premier
Full text is not displayed to guests.
Description
Description not available.