CCSRD: Content-Centric Speech Representation Disentanglement Learning for End-to-End Speech Translation
Xiaohu Zhao, Haoran Sun, Yikun Lei, Shaolin Zhu, Deyi Xiong. Findings of EMNLP 2023.

Since speech often contains multiple factors, disentangled representation learning provides a way to extract a separate representation for each of them. In this paper, we propose a content-centric speech representation disentanglement learning framework for speech translation, CCSRD, which decomposes speech representations into content and non-content representations. CCSRD consists of a content encoder that encodes linguistic content information from the speech input, a non-content encoder that models non-linguistic speech features, and a disentanglement module that learns disentangled representations with a cyclic reconstructor, a feature reconstructor and a speaker classifier.
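To make the described layout concrete, below is a minimal PyTorch sketch of such a disentanglement setup. It is not the authors' implementation: the module sizes, the pooling, the specific reconstruction and classification losses, and names such as CCSRDSketch are assumptions used only to illustrate how the named components could fit together.

```python
# Illustrative sketch (not the paper's code): a content encoder, a non-content
# encoder, and a disentanglement module with a feature reconstructor, a speaker
# classifier and a cyclic reconstruction term. All sizes and losses are assumed.
import torch
import torch.nn as nn


class CCSRDSketch(nn.Module):
    def __init__(self, feat_dim=80, hidden_dim=256, num_speakers=100):
        super().__init__()
        # Content encoder: keeps the linguistic information used for translation.
        self.content_encoder = nn.GRU(feat_dim, hidden_dim, batch_first=True)
        # Non-content encoder: models non-linguistic factors (e.g. speaker, style).
        self.noncontent_encoder = nn.GRU(feat_dim, hidden_dim, batch_first=True)
        # Feature reconstructor: rebuilds the input from both codes, so that
        # together they retain the full information of the speech signal.
        self.feature_reconstructor = nn.Linear(2 * hidden_dim, feat_dim)
        # Speaker classifier: applied to the non-content code so that speaker
        # identity is captured there rather than in the content code.
        self.speaker_classifier = nn.Linear(hidden_dim, num_speakers)

    def encode(self, feats):
        content, _ = self.content_encoder(feats)        # (B, T, H)
        noncontent, _ = self.noncontent_encoder(feats)  # (B, T, H)
        return content, noncontent

    def forward(self, feats, speaker_ids):
        content, noncontent = self.encode(feats)

        # Feature reconstruction: both codes together should reproduce the input.
        recon = self.feature_reconstructor(torch.cat([content, noncontent], dim=-1))
        recon_loss = nn.functional.mse_loss(recon, feats)

        # Speaker classification on the non-content code (utterance-level pooling).
        spk_logits = self.speaker_classifier(noncontent.mean(dim=1))
        spk_loss = nn.functional.cross_entropy(spk_logits, speaker_ids)

        # Cyclic reconstruction (one possible reading): re-encode the reconstructed
        # features and require the content code to stay unchanged, discouraging
        # content from leaking into the non-content branch.
        cyc_content, _ = self.content_encoder(recon)
        cyc_loss = nn.functional.mse_loss(cyc_content, content.detach())

        return recon_loss + spk_loss + cyc_loss


# Usage: the content representation would feed the translation decoder, and the
# auxiliary losses above would be added to the translation training objective.
model = CCSRDSketch()
feats = torch.randn(4, 50, 80)            # (batch, frames, mel bins)
speakers = torch.randint(0, 100, (4,))    # dummy speaker labels
loss = model(feats, speakers)
loss.backward()
```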