Apple_Yolo: Apple Detection Method Based on Channel Pruning and Mixed Distillation in Complicated Environments

35 Pages Posted: 31 May 2024

Chun-Ming Wu

Northeast Electric Power University

Jin Lei

Northeast Electric Power University

Mei-ling Ren

Northeast Electric Power University

Mei-Ruo Li

Northeast Electric Power University

Yu-Xin Ye

Northeast Electric Power University

Zi-mu Jiang

Northeast Electric Power University

Abstract

Rapid and precise positioning of apples, along with intelligent detection, plays a pivotal role in apple picking. However, existing deep-learning-based crop detection methods can require substantial computational resources and memory, which limits their feasibility on mobile devices. This study presents a lightweight apple detection algorithm that addresses the insufficient storage space and restricted computational capacity of apple-picking mobile devices. The method offers two distinct schemes for different levels of computing resources, and the procedure consists of two primary stages. In the first lightweight stage, the lightweight Feature Pyramid Network (LFPN) replaces the original trunk, and lightweight down-sampling convolution (LDConv) is then substituted for the redundant convolutions in the trunk to reduce the number of parameters. Next, the lightweight multi-channel attention mechanism (LMCA) is embedded between the backbone network and the neck network to minimize the effect of irrelevant background. Finally, the model is distilled for the first time using mixed distillation to further enhance its detection performance. In the second lightweight stage, Group_slim channel pruning is used to further reduce redundant channels, and mixed distillation is then applied again to restore the accuracy of the pruned model. The results show that the average precision (AP) of the proposed model is 1% higher than that of the baseline model, with a parameter count of only about 800k. The models of both schemes achieve an inference speed of over 17 frames per second on a central processing unit (CPU).
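The abstract does not define Group_slim or mixed distillation in detail, so the following is only a minimal sketch of the general prune-then-distill idea, not the authors' implementation. It assumes, purely for illustration, that channels are ranked by the magnitude of a per-channel scaling factor (as in network-slimming-style pruning) and that the "mixed" loss is a weighted sum of a softened-logit KL term and a feature-imitation MSE term. The names `select_channels`, `mixed_distillation_loss`, `prune_ratio`, `temperature`, and `alpha` are all hypothetical.

```python
import math


def select_channels(gammas, prune_ratio):
    """Return indices of channels to keep (ASSUMED criterion: smallest
    |gamma| values are pruned first, as in slimming-style pruning)."""
    n_prune = int(len(gammas) * prune_ratio)
    order = sorted(range(len(gammas)), key=lambda i: abs(gammas[i]))
    pruned = set(order[:n_prune])
    return [i for i in range(len(gammas)) if i not in pruned]


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def mixed_distillation_loss(student_logits, teacher_logits,
                            student_feat, teacher_feat,
                            temperature=2.0, alpha=0.5):
    """ASSUMED form of a mixed loss: alpha * logit term + (1 - alpha) * feature term."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # Softened-logit term, scaled by T^2 as is common in logit distillation.
    logit_term = temperature ** 2 * kl_divergence(p_teacher, p_student)
    # Feature-imitation term: mean squared error between feature vectors.
    feat_term = sum((s - t) ** 2 for s, t in zip(student_feat, teacher_feat))
    feat_term /= len(student_feat)
    return alpha * logit_term + (1 - alpha) * feat_term


# Keep the half of the channels with the largest scaling factors.
kept = select_channels([0.9, 0.01, 0.5, 0.02], prune_ratio=0.5)

# Loss between a hypothetical student and teacher on one sample.
loss = mixed_distillation_loss([1.0, 0.2], [2.0, -1.0], [0.3, 0.7], [0.5, 0.5])
```

In a two-stage pipeline like the one described above, a loss of this kind would be minimized once after the architectural changes and again after pruning, with the unpruned model acting as the teacher in the second round; that pairing is likewise an assumption here.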

Keywords: LMCA, LFPN, LDConv, Group_slim, Distillation

Suggested Citation

Wu, Chun-Ming and Lei, Jin and Ren, Mei-ling and Li, Mei-Ruo and Ye, Yu-Xin and Jiang, Zi-mu, Apple_Yolo: Apple Detection Method Based on Channel Pruning and Mixed Distillation in Complicated Environments. Available at SSRN: https://ssrn.com/abstract=4849516 or http://dx.doi.org/10.2139/ssrn.4849516

Chun-Ming Wu

Northeast Electric Power University

China

Jin Lei (Contact Author)

Northeast Electric Power University

China

Mei-ling Ren

Northeast Electric Power University

China

Mei-Ruo Li

Northeast Electric Power University

China

Yu-Xin Ye

Northeast Electric Power University

China

Zi-mu Jiang

Northeast Electric Power University

China
