Speech act classification in Vietnamese utterance and its application in smart mobile voice interaction

We can observe the rapid development of the spoken humanmachine interface, thanks to the big progress in automatic recognition and text to speech technologies. Especially, the mobile virtual assistants are becoming more popular than ever. Recognizing user's intent in interaction is one of the biggest problems while the system is designed and is attracting the attention of researchers. Identifying utterance's speech act in an automated manner is important to reveal user's intention (such as informing, asking, requesting and expressing emotion), which can provide useful indicators to improve the performance of human-machine interaction. Automatic speech act classification has been studied in many languages such as English, Chinese, Slovakian, Arabic, but not yet in Vietnamese. In this paper, we present a speech act scheme suitable for Vietnamese utterances in mobile voice interaction. Also, we built a speech act classifier that achieved 84.36% accuracy. This classifier has been applied in our mobile virtual assistant for Vietnamese (VAV). The experiment result has demonstrated the ability of this classifier such as robust, compact and can work well on mobiles.

Title: Speech act classification in Vietnamese utterance and its application in smart mobile voice interaction
Authors: Ngo, T.-L.
Duong, Q.-V.
Pham, S.-B.
Phan, X.-H.
Keywords: Natural spoken language understanding, Speech act, Vietnamese spoken text processing, User intent identification
Issue Date: 2016
Publisher: Association for Computing Machinery
Citation: Scopus
Abstract: We can observe the rapid development of the spoken humanmachine interface, thanks to the big progress in automatic recognition and text to speech technologies. Especially, the mobile virtual assistants are becoming more popular than ever. Recognizing user's intent in interaction is one of the biggest problems while the system is designed and is attracting the attention of researchers. Identifying utterance's speech act in an automated manner is important to reveal user's intention (such as informing, asking, requesting and expressing emotion), which can provide useful indicators to improve the performance of human-machine interaction. Automatic speech act classification has been studied in many languages such as English, Chinese, Slovakian, Arabic, but not yet in Vietnamese. In this paper, we present a speech act scheme suitable for Vietnamese utterances in mobile voice interaction. Also, we built a speech act classifier that achieved 84.36% accuracy. This classifier has been applied in our mobile virtual assistant for Vietnamese (VAV). The experiment result has demonstrated the ability of this classifier such as robust, compact and can work well on mobiles.
Description: ACM International Conference Proceeding Series Volume 08-09-December-2016, 8 December 2016, Pages 396-402 7th Symposium on Information and Communication Technology, SoICT 2016; Ho Chi Minh; Viet Nam; 8 December 2016 through 9 December 2016; Code 125331
URI: http://dl.acm.org/citation.cfm?doid=3011077.3011119
http://repository.vnu.edu.vn/handle/VNU_123/34054
ISBN: 978-145034815-7
Appears in Collections:Bài báo của ĐHQGHN trong Scopus

Nhận xét

Bài đăng phổ biến từ blog này

Caves in Basalts in Krông Nô Area, Đắk Nông Province, Vietnam

Phân tích, đánh giá cảnh quan phục vụ phát triển nông, lâm nghiệp và bảo tồn đa dạng sinh học các huyện biên giới Tây Nam tỉnh Nghệ An : Luận án TS. Kiểm soát và bảo vệ môi trường: 628501

Food sources of macro-invertebrates in an important mangrove ecosystem of Vietnam determined by dual stable isotope signatures