计算机科学
自编码
无线
卷积神经网络
语音编码
人工智能
语音识别
传输(电信)
音频信号
编码器
编码(社会科学)
深度学习
电信
数学
统计
操作系统
作者
Haonan Tong,Zhaohui Yang,Wang Si-hua,Ye Hu,Walid Saad,Changchuan Yin
标识
DOI:10.1109/globecom46510.2021.9685654
摘要
In this paper, the problem of audio based semantic communication is investigated over wireless networks. In the considered model, wireless edge devices must transmit large-sized audio data to a server using semantic communication techniques. The techniques enable the transmission of audio semantic information which captures the contextual features of audio signals. To extract the semantic information from audio signals, a wave to vector (wav2vec) architecture based autoencoder that consists of convolutional neural networks (CNNs) is proposed. The proposed autoencoder enables high-accuracy audio transmission with small amounts of data. To further improve the accuracy of semantic information extraction, federated learning (FL) is implemented over multiple devices and a server. Simulation results show that the proposed algorithm can converge effectively and can reduce the mean square error (MSE) between the recovered audio signals and the source audio signals by nearly 100 times, compared to a traditional coding scheme.
科研通智能强力驱动
Strongly Powered by AbleSci AI