高级检索

基于ECAPA-TDNN神经网络的牦牛声纹识别算法研究

Research of Yak Voiceprint Recognition Algorithm Based on ECAPA-TDNN Neural Network

  • 摘要: 目的传统的生物体识别是基于生物体固有的轮廓、指纹、血样或者独有的生物体特性进行识别,实施过程繁琐且存在明显的局限性。为建立方便快捷的牦牛声纹识别技术,本研究利用语音识别技术,对牦牛的音频特征进行深度学习模型训练。提出改进的基于 ECAPA-TDNN 网络模型的声纹识别牦牛生物体特征算法。方法本研究以大通牦牛和祁连牦牛为研究对象,研究了不同牦牛的频谱、Mel频谱、LogMel频谱、梅尔倒谱和LogMel特征,并对其音频特征进行深度学习模型训练,提出了基于ECAPA-TDNN 网络模型的声纹识别牦牛生物体特征算法。引入BatchNorm层和Dropout层对ECAPA-TDNN网络进行改进,解决噪音和过拟合问题。同时使用5种不同网络模型 Dropout-capa-tdnn,Ecapa-tdnn,Panns-cnn6,Panns-cnn10,Panns-cnn14和相同音频特征(LogMel频谱)进行牦牛声纹识别实验。结果结果表明,牦牛个体身份识别准确率为96.11%,性别识别准确率为98.83%,年龄段识别准确率为92.35%,相比传统模型具有更高的准确率。结论本研究为牦牛的个体识别提供了理想模型,推进了智慧牧场的语音智能识别技术,以期为促进我国智慧牧业的高质量发展提供技术支撑。

     

    Abstract: Purpose Traditional organism recognition is based on the inherent contour, fingerprint, blood sample or unique biological characteristics of the organism. The implementation process is cumbersome and has obvious limitations. In order to establish a convenient and fast yak voiceprint recognition technology, this study used speech recognition technology to train the deep learning model of yak audio features. An improved voiceprint recognition yak organism feature algorithm based on ECAPA-TDNN network model is proposed. Method In this study, Datong yak and Qilian yak were taken as the research objects, the spectrum, Mel spectrum, LogMel spectrum, Mel cepstrum and LogMel features of different yaks were studied, and their audio features were trained by deep learning model. A voiceprint recognition yak organism feature algorithm based on ECAPA-TDNN network model was proposed. The BatchNorm layer and Dropout layer were introduced to improve the ECAPA-TDNN network to solve the noise and over-fitting problems. The ECAPA-TDNN network is improved, and the BatchNorm layer and Dropout layer were introduced to solve the problem of noise and over-fitting. At the same time, five different network models Dropout-capa-tdnn, Ecapa-tdnn, Panns-cnn6, Panns-cnn10, Panns-cnn14 and the same audio features (LogMel spectrum) were used for yak voiceprint recognition experiments. Result The results showed that the accuracy of yak individual identification is 96.11%, the accuracy of gender identification is 98.83%, and the accuracy of age group identification is 92.35%. Compared with the traditional model, it has higher accuracy. Conclusion This study provides an ideal model for individual identification of yaks and promotes the intelligent speech recognition technology of intelligent pastures, so as to provide technical support for promoting the high-quality development of intelligent animal husbandry in China.

     

/

返回文章
返回