end2end-asr-pytorch - PAD_TOKEN - SOS_TOKEN - EOS_TOKEN
1. End-to-End Speech Recognition on Pytorchhttps://github.com/gentaiscool/end2end-asr-pytorch
Transformer-based Speech Recognition Model
2. end2end-asr-pytorch/utils/constant.py
https://github.com/gentaiscool/end2end-asr-pytorch/blob/master/utils/constant.py
PAD_TOKEN = 0
SOS_TOKEN = 1
EOS_TOKEN = 2
PAD_CHAR = "¶"
SOS_CHAR = "§"
EOS_CHAR = "¤"
[*]Sequence to Sequence models: Attention Models
https://deeplearning.cs.cmu.edu/F21/document/slides/lec18.attention.noanim.pdf
To make it explicit, we will add two additional symbols (in addition to the words) to the base vocabulary
<sos>: Indicates start of a sentence
<eos>: Indicates end of a sentence
[*]Neural Abstractive Text Summarization with Sequence-to-Sequence Models
SOS and EOS represent the start and end of a sequence, respectively.
[*]Listen, Attend and Spell
Here <sos> and <eos> are the special start-of-sentence token, and end-of-sentence tokens, respectively.
[*]END-TO-END SPEECH RECOGNITION WITH ADAPTIVE COMPUTATION STEPS
The decoding targets include 7065 Chinese characters and four special tokens as <UNK> (unknown), <PAD> (padding), <SOS> (start of speech) and <EOS> (end of speech).
References
Yongqiang Cheng, https://yongqiang.blog.csdn.net/
免责声明:如果侵犯了您的权益,请联系站长,我们会及时删除侵权内容,谢谢合作!更多信息从访问主页:qidao123.com:ToB企服之家,中国第一个企服评测及商务社交产业平台。
页:
[1]