-
EMFORMER & AM-TRF 간단히 보기논문 리뷰 2023. 5. 2. 15:29
Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition
This paper proposes an efficient memory transformer Emformer for low latency streaming speech recognition. In Emformer, the long-range history context is distilled into an augmented memory bank to reduce self-attention’s computation complexity. A cache m
ieeexplore.ieee.org
ISCA Archive (isca-speech.org)
ISCA Archive
Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang Transformer-based acoustic modeling has achieved great success for both hybrid and sequence-to-seq
www.isca-speech.org
transformer transducer를 ASR task에 조금 더 맞게 변형한 AM-TRF와 여기에 메모리를 더욱 효율적으로 쓰고 중복 계산을 줄인 Emformer 구조에 대해서 간단히 알아봅시다.
'논문 리뷰' 카테고리의 다른 글
AudioLM (0) 2023.08.10 Wav2Vec2.0 (0) 2023.08.10 DiffWae: A Versatile Diffusion Model for Audio Synthesis (0) 2023.03.13 Wav2Vec (0) 2022.12.22 Cycle GAN VC 3 and Mask Cycle GAN VC (0) 2022.05.11