Abstract: In intensive sheep farms, behavioral changes can indicate abnormalities in a sheep's physical condition. For example, rumination and feeding times change significantly when sheep are sick, so behavioral observation is one way to assess their health. Recognizing animal behavior provides a basis for disease prevention and rational feeding, thereby improving animal health and welfare, and it has therefore long been a focus for researchers and production managers. Traditional manual observation requires continuous human monitoring, and fatigue from long working hours tends to introduce subjective errors into the results. In addition, sensor-based methods that require direct contact with the animal's body tend to stress the animal, affecting its health and production performance. In this study, a deep learning model, AdRes3D-BiLSTM, was proposed that combines a three-dimensional residual convolutional neural network, a bidirectional long short-term memory (BiLSTM) network, and an attention mechanism. The AdRes3D component introduced depthwise separable convolution, which reduces computational complexity and improves network efficiency. Furthermore, an actionnet attention mechanism based on motion principles was embedded in the AdRes3D component, directing the network's focus toward behavioral details. This improved the model's ability to extract key behavioral points across consecutive video frames and strengthened feature extraction in both the temporal and spatial dimensions. The extracted feature vectors were then fed into the BiLSTM module, which filters and updates temporal features bidirectionally, and the final sheep behaviors were recognized. A dataset of 6000 videos was collected to train the proposed model, covering different sheep, time periods, lighting conditions, and poses. An additional 1200 behavioral videos, distinct from those used for training, were selected as the test set. The experimental results demonstrated the effectiveness of the AdRes3D-BiLSTM model, which achieved an overall recognition accuracy of 98.72% across five basic sheep behaviors: standing, lying, feeding, walking, and ruminating. Compared with five alternative network architectures (C3D, R(2+1)D, Res3D, Res3D-LSTM, and Res3D-BiLSTM), the AdRes3D-BiLSTM model achieved notable improvements in recognition metrics. Specifically, relative to these models, AdRes3D-BiLSTM improved precision by 11.32, 6.24, 4.34, 2.04, and 1.52 percentage points, respectively; recall by 11.78, 6.38, 4.38, 2.12, and 1.68 percentage points; F1-score by 11.70, 6.35, 4.38, 2.08, and 1.60 percentage points; and accuracy by 11.97, 6.33, 4.37, 2.32, and 2.01 percentage points.
Furthermore, the proposed method achieved a processing speed of 52.79 frames per second (FPS), confirming its real-time processing capability and meeting practical operational requirements. Additionally, a continuous 24-hour video segment was randomly selected from the collected videos to validate the model's effectiveness in a real-world environment. This study provides new methods and insights for video-based animal behavior recognition and offers a basis for further exploration and implementation in the field.
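To make the described pipeline concrete, the following is a minimal PyTorch sketch of one possible AdRes3D-BiLSTM-style arrangement: a 3D residual backbone built from depthwise separable convolutions, a simplified motion-based attention used here as a stand-in for the actionnet mechanism, and a BiLSTM over per-frame features. All layer sizes, the attention formulation, and the class names are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch only: shapes, layer widths, and the attention design
# below are assumptions for demonstration, not the published architecture.
import torch
import torch.nn as nn


class DepthwiseSeparableConv3d(nn.Module):
    """3D depthwise separable convolution: per-channel spatio-temporal
    filtering followed by a 1x1x1 pointwise projection."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.depthwise = nn.Conv3d(in_ch, in_ch, kernel_size=3, stride=stride,
                                   padding=1, groups=in_ch, bias=False)
        self.pointwise = nn.Conv3d(in_ch, out_ch, kernel_size=1, bias=False)
        self.bn = nn.BatchNorm3d(out_ch)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.act(self.bn(self.pointwise(self.depthwise(x))))


class MotionAttention(nn.Module):
    """Simplified motion attention (assumed stand-in for the actionnet idea):
    channels are re-weighted by the magnitude of frame-to-frame change."""
    def __init__(self, channels):
        super().__init__()
        self.fc = nn.Sequential(nn.Linear(channels, channels), nn.Sigmoid())

    def forward(self, x):                        # x: (B, C, T, H, W)
        diff = x[:, :, 1:] - x[:, :, :-1]        # temporal differences
        motion = diff.abs().mean(dim=(2, 3, 4))  # (B, C) motion energy
        weights = self.fc(motion)[:, :, None, None, None]
        return x * weights                       # re-weight feature channels


class ResidualBlock3d(nn.Module):
    """Residual block built from depthwise separable 3D convolutions."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.conv1 = DepthwiseSeparableConv3d(in_ch, out_ch, stride)
        self.conv2 = DepthwiseSeparableConv3d(out_ch, out_ch)
        self.skip = (nn.Conv3d(in_ch, out_ch, 1, stride=stride)
                     if (in_ch != out_ch or stride != 1) else nn.Identity())

    def forward(self, x):
        return self.conv2(self.conv1(x)) + self.skip(x)


class AdRes3DBiLSTMSketch(nn.Module):
    """Backbone extracts spatio-temporal features from a clip; a BiLSTM then
    models the remaining temporal sequence before classification."""
    def __init__(self, num_classes=5, hidden=128):
        super().__init__()
        self.stem = DepthwiseSeparableConv3d(3, 32)
        self.layer1 = ResidualBlock3d(32, 64, stride=2)
        self.attn = MotionAttention(64)
        self.layer2 = ResidualBlock3d(64, 128, stride=2)
        self.pool = nn.AdaptiveAvgPool3d((None, 1, 1))   # keep the time axis
        self.bilstm = nn.LSTM(128, hidden, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, num_classes)

    def forward(self, clip):                     # clip: (B, 3, T, H, W)
        feat = self.layer2(self.attn(self.layer1(self.stem(clip))))
        feat = self.pool(feat).squeeze(-1).squeeze(-1)   # (B, C, T')
        seq, _ = self.bilstm(feat.transpose(1, 2))       # (B, T', 2*hidden)
        return self.head(seq.mean(dim=1))                # clip-level logits


if __name__ == "__main__":
    model = AdRes3DBiLSTMSketch()
    dummy = torch.randn(2, 3, 16, 112, 112)      # 2 clips of 16 RGB frames
    print(model(dummy).shape)                    # torch.Size([2, 5])
```

In this sketch, the five output logits correspond to the five behavior classes (standing, lying, feeding, walking, and ruminating); the depthwise separable convolutions keep the 3D backbone lightweight, and the BiLSTM reads the pooled per-frame features in both directions before the final classification.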