Seung-Bin Kim, Sang-Hoon Lee, and Seong-Whan Lee
Although there has been significant advancement in the field of speech-to-speech translation, conventional models still require language-parallel speech data between the source and target languages for training. In this paper, we introduce TranSentence, a novel speech-to-speech translation without language-parallel speech data. To achieve this, we first adopt a language-agnostic sentence-level speech encoding that captures the semantic information of speech, irrespective of the language. We then train our model to generate speech based on the encoded embeddings obtained from a language-agnostic sentence-level speech encoder that is pre-trained with diverse languages. With this method, despite training exclusively on the target language's monolingual data, we can generate target language speech in the inference stage using language-agnostic speech embeddings from the source language speech. Furthermore, we extend TranSentence to multilingual speech-to-speech translation. The experimental results demonstrate that TranSentence is superior to other models.
Official implementation of TranSentence: https://github.com/sb-kim-prml/TranSentence_code
TranSentence was trained on the Common Voice 11 dataset and evaluated on the CVSS-C dataset. We used single-speaker unit vocoder for unit-to-waveform conversion.
Source (Spanish) Es una de las universidades más jóvenes de Polonia. |
Reference (English) It is one of the youngest universities in Poland. |
TranSentence (English) It is one of the youngest universities in Poland. |
Source (Spanish) Las obras nunca se han recuperado. |
Reference (English) Works have never been recovered. |
TranSentence (English) The works have never been recovered. |
Source (Spanish) Está dedicada a San Pedro de Verona. |
Reference (English) It's dedicated to saint peter of verona. |
TranSentence (English) It is dedicated to saint peter of verona. |
Source (Spanish) Dos años más tarde, la empresa diversificó sus productos, y comenzó su internacionalización. |
Reference (English) Two years later the company diversified its products and started globalizing. |
TranSentence (English) Two years later the company diversified its products and started its internationalization. |
Source (Spanish) Fue la segunda de nueve hermanos. |
Reference (English) She was the second of nine siblings. |
TranSentence (English) She was the second of nine brothers |
Source (French) Cette nuit-là elle avait reçu une lettre de la part de son père. |
Reference (English) That night she had received a letter from her dad. |
TranSentence (English) That night she had received a letter from her father. |
Source (French) Ses performances lui permettent de prolonger son contrat avec le club. |
Reference (English) His achievements allow him to extend his contract with the club. |
TranSentence (English) His performances allow him to extend his contract with the club. |
Source (French) Quand il était en train de traverser la route. |
Reference (English) While he was crossing the road. |
TranSentence (English) When he was crossing the road. |
Source (French) Il travailla dans un style traditionnel. |
Reference (English) He worked in a traditional style. |
TranSentence (English) He worked in a traditional style. |
Source (French) Il sert dans le Royal Army Medical Corps pendant la première guerre mondiale. |
Reference (English) He serves in the royal army medical corps during the first world war. |
TranSentence (English) He serves in the royal army medical corp during the first world war. |
Source (English) The continuation of the block was not possible because of the disagreement between its components. |
Reference (Spanish) La continuación del bloque no fue posible debido a la discordia entre sus componentes. |
TranSentence (Spanish) La continuación del bloque no fue posible debido a la diferencia entre sus componentes. |
Source (English) In times of the spanish colony the river was not less important. |
Reference (Spanish) En tiempo de la Colonia española, el río no fue menos importante. |
TranSentence (Spanish) En los tiempos de la colonia española el río no fue menos importante. |
Source (English) She is married and has a daughter |
Reference (Spanish) Está casada y tiene una hija. |
TranSentence (Spanish) Está casada y tiene una hija |
Source (English) He was expelled from the argentinian territory and embarked. |
Reference (Spanish) Fue expulsado del territorio argentino y embarcado. |
TranSentence (Spanish) Fue expulsado del territorio argentino y embarcado. |
Source (English) He studied journalism in madrid and law at the university of granada |
Reference (Spanish) Estudió periodismo en Madrid y derecho en la Universidad de Granada. |
TranSentence (Spanish) Estudió periodismo en madrid y derecho en la universidad de la granada. |
Source (English) He is being considered as a commercial failure. |
Reference (French) Il est considéré comme un échec commercial. |
TranSentence (French) Il est considéré comme un échec commercial |
Source (English) The city's history is also marked by its industrial activity. |
Reference (French) L'histoire de la ville est aussi marquée par son activité industrielle. |
TranSentence (French) L'histoire de la ville est également marquée par son activité industrielle. |
Source (English) This is where she lived with her husband. |
Reference (French) C'est là qu'elle vécut avec son époux. |
TranSentence (French) C'est là qu'elle vécut avec son mari. |
Source (English) The style remains nonetheless very similar. |
Reference (French) Le style reste néanmoins très similaire. |
TranSentence (French) Le style reste néanmoins très similaire. |
Source (English) This species is named in reference to its small size. |
Reference (French) Cette espèce est nommée en référence à sa petite taille. |
TranSentence (French) Cette espèce est nommée en référence à sa petite taille. |