|
Recent Advance of Thai Open-Vocabulary Automatic Speech Recognition |
|---|---|
| รหัสดีโอไอ | |
| Creator | Chai Wutiwiwatchai |
| Title | Recent Advance of Thai Open-Vocabulary Automatic Speech Recognition |
| Contributor | Vataya Chunwijitra, Sila Chunwijitra, Phuttapong Sertsi, Sawit Kasuriya, Patcharika Chootrakool, Kwanchiva Thangthai, Chanchai Junlouchai, Kamthorn Krairaksa |
| Publisher | Sirindhorn International Institute of Technology, Bangkadi Campus (SIIT-BKD) |
| Publication Year | 2560 |
| Journal Title | Journal of Intelligent Informatics and Smart Technology |
| Journal Vol. | 1 |
| Page no. | 1-7 |
| Keyword | open-vocabulary, speech recognition, Thai language |
| URL Website | https://ph05.tci-thaijo.org/index.php/JIIST |
| Website title | Journal of Intelligent Informatics and Smart Technology |
| ISSN | 2586-9167 |
| Abstract | We describe the recent development of the NECTEC Thai open-vocabulary automatic speech recognition system. Some of the techniques that were found beneficial over its baseline system are: hybrid word-subword language modeling to enhance the vocabulary coverage in a constraint resource; multi-conditioned noisy acoustic modeling to improve the system robustness and spoken-style language model interpolation using a newly developed large social media speech database; recent state-of-the-art speech features; and lastly, online decoding, speech compression, and Docker-based distributed computing to reduce the processing and data transmission time. These techniques result in a 29.0% word error rate on open-vocabulary noisy speech test sets which is 42.5% relatively low-er than the baseline system. The overall system operates at nearly 1.2xRT which is promising for real applications. |