Automatic Speech Recognition Dataset Augmentation with Pre-Trained Model and Script

Cited 2 time in webofscience Cited 3 time in scopus
  • Hit : 103
  • Download : 0
In this paper, we present a method of enhancing automatic speech recognition dataset with an immature pre-trained model and script. Comparing the chunks obtained from the pre-trained model with the ground truth script, we produce the pair of an audio and its script. In each pair, the audio has exact beginning and end of an utterance, and the script is clear since we use the human-written script. In the experiments on news videos and scripts, it is shown that our method extract automatic speech recognition dataset in exact and effective manner. In addition, the new dataset can be used to train speech synthesizing model.
Publisher
IEEE
Issue Date
2019-02-27
Language
English
Citation

The 6th IEEE International Conference on Big Data and Smart Computing (BigComp2019), The 2nd International Workshop on Dialog Systems (IWDS 2019), pp.649 - 651

ISSN
2375-933X
DOI
10.1109/BIGCOMP.2019.8679441
URI
http://hdl.handle.net/10203/274679
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0