Persuasive Speech Topics Computer Networking

FSTF-AN: Fused Sparse Temporal-Frequency Attentive Network for Multi-Channel Speech Enhancement

Abstract: The Transformer has achieved impressive performance in the multi-channel speech enhancement field; however, it struggles to capture local features, which leads to the loss of speech details.

IEEE

Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task

Abstract: Fine-tuning has become a norm to achieve state-of-the-art performance when employing pre-trained networks like foundation models. These models are typically pre-trained on large-scale ...
