Phonetics Embedding Learning with Side Information
收藏Figshare2016-01-19 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Phonetics_Embedding_Learning_with_Side_Information/1277821/1
下载链接
链接失效反馈官方服务:
资源简介:
We show that it is possible to learn an efficient acoustic model using only a<br>small amount of easily available word-level similarity annotations. In contrast<br>to the detailed phonetic labeling required by classical speech recognition<br>technologies, the only information our method requires are pairs of<br>speech excerpts which are known to be similar (same word) and pairs of<br>speech excerpts which are known to be different (different words). An acoustic model is obtained by training shallow and deep neural networks, using an<br>architecture and a cost function well-adapted to the nature of the provided information. The resulting model is evaluated on an ABX minimal-pair discrimination task and is shown to perform much better (11.8% ABX error<br>rate) than raw speech features (19.6%), not far from a fully supervised baseline (best neural network: 9.2%, HMM-GMM: 11%).
创建时间:
2014-12-23



