Improved Method for Analyzing Acoustic Signals of Water Environment Based on Convolutional Neural Network SOP

Authors

  • А. О. Oleksii National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”
  • A. A. Verlan National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Norwegian University of Science and Technology

DOI:

https://doi.org/10.31649/1997-9266-2024-177-6-129-134

Keywords:

Convolutional neural networks, second-order pooling, acoustic signal analysis, multiscale convolution

Abstract

The analysis of acoustic signals in marine environments poses numerous challenges, including handling large volumes of data and adapting to rapidly changing environmental conditions. Passive signal reception is often characterized by a low signal-to-noise ratio (SNR), complicating signal processing. Addressing these challenges requires an efficient and versatile approach, which neural networks can provide. Among the various neural network models for acoustic signal analysis, Convolutional Neural Networks (CNNs) are recognized as some of the most effective due to their ability to handle complex data structures and extract meaningful features. Therefore, it is reasonable to explore an efficient CNN-based method that can be modified to enhance the quality of acoustic signal analysis in aquatic environments. An improved approach has been developed, utilizing the CNN SOP network. CQT features were employed as input characteristics for classification. The original two convolutional layers were replaced with a multi-scale convolution using kernels of different sizes, enabling the extraction of both global and local features of the target object, thereby enhancing the model's ability to process diverse signal properties. The extracted features were processed using second-order pooling and then passed to the second-order pooling (SOP) layer. This layer enhances the identification of temporal correlations. The SOP layer outputs feature vectors, which are normalized element-wise using square root and l2 normalization. The normalized data are then fed into a fully connected layer with batch normalization and the ReLU activation function. Subsequently, the data are passed to another fully connected layer with a Softmax activation function, which performs the final classification. To evaluate the performance of the neural network, three datasets were used: two based on real-world underwater objects and one artificial dataset. Each dataset was further augmented with background noise to produce samples with a low signal-to-noise ratio (SNR). In all cases, the proposed improved method demonstrated higher classification accuracy compared to the original method, showcasing its effectiveness in handling noisy data and improving signal analysis.

Author Biographies

А. О. Oleksii, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Post-Graduate Student of the Chair of Software Engineering for Power Industry

A. A. Verlan, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Norwegian University of Science and Technology

Dr. Sc. (Eng), Professor, Professor of the Chair of Software Engineering for Power Industry

References

Y. Miao, Y. V. Zakharov, H. Sun, J. Li, and J. Wang, “Underwater Acoustic Signal Classification Based on Sparse Time–Frequency Representation and Deep Learning,” IEEE Journal of Oceanic Engineering, vol. 46, no. 3, pp. 952-962, 2021. https://doi.org/10.1109/JOE.2020.3039037 .

N. Bach, L. Vu, and V. Nguyen, “Classification of Surface Vehicle Propeller Cavitation Noise Using Spectrogram Processing in Combination with Convolution Neural Network,” Sensors, vol. 21, p. 3353, 2021. https://doi.org/10.3390/s21103353 .

L. Xinwei, Y. Feng, and M. Zhang. “An underwater acoustic target recognition method based on combined feature with automatic coding and reconstruction,” IEEE Access 9, pp. 63841-63854, 2021. https://doi.org/10.1109/ACCESS.2021.3075344 .

M. Ahmad, M. A. Ansari, R. Anwar, B. Shahzad, and A. Ikram, “Deep Learning Based Classification of Underwater Acoustic Signals,” Procedia Computer Science, vol. 235, pp. 1115-1124, 2024. https://doi.org/10.1016/j.procs.2024.04.106 .

K.-I. Kim, M.-I. Pak, B.-P. Chon, and C.-H. Ri, “A Method for Underwater Acoustic Signal Classification Using Convolutional Neural Network Combined with Discrete Wavelet Transform,” International Journal of Wavelets, Multiresolution and Information Processing, vol. 19, 2021. https://doi.org/10.1142/S0219691320500927 .

X. Cao, R. Togneri, X. Zhang, and Y. Yu, “Convolutional Neural Network With Second-Order Pooling for Underwater Target Classification,” IEEE Sensors Journal, vol. 19, no. 8, pp. 3058-3066, 2019. https://doi.org/10.1109/JSEN.2018.2886368 .

D. Santos-Domínguez, S. Torres-Guijarro, A. Cardenal-Lopez, and A. Pena-Gimenez, “ShipsEar: An underwater vessel noise database,” Applied Acoustics, vol. 113, pp. 64-69, 2016. https://doi.org/10.1016/j.apacoust.2016.06.008 .

Вимірювальні системи та програмне забезпечення для морських охоронних систем і дослідницьких полігонів, звіт про НДР (заключ.) НТУУ «КПІ»; кер. роб. Є. Мачуський. Київ, 2012, 104 с. + відеосюжет + CD-ROM. Д/б №2429-п.

Downloads

Abstract views: 5

Published

2024-12-27

How to Cite

[1]
Oleksii А. О. . and A. A. Verlan, “Improved Method for Analyzing Acoustic Signals of Water Environment Based on Convolutional Neural Network SOP”, Вісник ВПІ, no. 6, pp. 129–134, Dec. 2024.

Issue

Section

Information technologies and computer sciences

Metrics

Downloads

Download data is not yet available.