Speech understanding for spoken dialogue systems: from corpus harvesting to grammar rule induction

Iosif Ilias, Klasinas Ioannis, Athanasopoulou Georgia, Palogiannidi Elisavet, Georgiladakis Spyridon, Louka Katerina, Potamianos Alexandros

URI	http://purl.tuc.gr/dl/dias/184E3D0D-4DCB-48FA-BD04-F7F06DFA4ACA	-
Αναγνωριστικό	https://doi.org/10.1016/j.csl.2017.08.002	-
Αναγνωριστικό	https://www.sciencedirect.com/science/article/pii/S0885230816302613	-
Γλώσσα	en	-
Μέγεθος	26 pages	en
Τίτλος	Speech understanding for spoken dialogue systems: from corpus harvesting to grammar rule induction	en
Δημιουργός	Iosif Ilias	en
Δημιουργός	Ιωσηφ Ηλιας	el
Δημιουργός	Klasinas Ioannis	en
Δημιουργός	Κλασινας Ιωαννης	el
Δημιουργός	Athanasopoulou Georgia	en
Δημιουργός	Αθανασοπουλου Γεωργια	el
Δημιουργός	Palogiannidi Elisavet	en
Δημιουργός	Παλογιαννιδη Ελισαβετ	el
Δημιουργός	Georgiladakis Spyridon	en
Δημιουργός	Γεωργιλαδακης Σπυριδων	el
Δημιουργός	Louka Katerina	en
Δημιουργός	Potamianos Alexandros	en
Δημιουργός	Ποταμιανος Αλεξανδρος	el
Εκδότης	Elsevier	en
Περίληψη	We investigate algorithms and tools for the semi-automatic authoring of grammars for spoken dialogue systems (SDS) proposing a framework that spans from corpora creation to grammar induction algorithms. A realistic human-in-the-loop approach is followed balancing automation and human intervention to optimize cost to performance ratio for grammar development. Web harvesting is the main approach investigated for eliciting spoken dialogue textual data, while crowdsourcing is also proposed as an alternative method. Several techniques are presented for constructing web queries and filtering the acquired corpora. We also investigate how the harvested corpora can be used for the automatic and semi-automatic (human-in-the-loop) induction of grammar rules. SDS grammar rules and induction algorithms are grouped into two types, namely, low- and high-level. Two families of algorithms are investigated for rule induction: one based on semantic similarity and distributional semantic models, and the other using more traditional statistical modeling approaches (e.g., slot-filling algorithms using Conditional Random Fields). Evaluation results are presented for two domains and languages. High-level induction precision scores up to 60% are obtained. Results advocate the portability of the proposed features and algorithms across languages and domains.	en
Τύπος	Peer-Reviewed Journal Publication	en
Τύπος	Δημοσίευση σε Περιοδικό με Κριτές	el
Άδεια Χρήσης	http://creativecommons.org/licenses/by/4.0/	en
Ημερομηνία	2019-11-14	-
Ημερομηνία Δημοσίευσης	2018	-
Θεματική Κατηγορία	Corpora creation	en
Θεματική Κατηγορία	Crowdsourcing	en
Θεματική Κατηγορία	Grammar induction	en
Θεματική Κατηγορία	Semantic similarity	en
Θεματική Κατηγορία	Spoken dialogue systems	en
Θεματική Κατηγορία	Web mining	en
Βιβλιογραφική Αναφορά	E. Iosif, I. Klasinas, G. Athanasopoulou, E. Palogiannidi, S. Georgiladakis, K. Louka and A. Potamianos, "Speech understanding for spoken dialogue systems: from corpus harvesting to grammar rule induction," Comput. Speech Lang., vol. 47, pp. 272-297, Jan. 2018. doi: 10.1016/j.csl.2017.08.002	en

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

Speech understanding for spoken dialogue systems: from corpus harvesting to grammar rule induction

Iosif Ilias, Klasinas Ioannis, Athanasopoulou Georgia, Palogiannidi Elisavet, Georgiladakis Spyridon, Louka Katerina, Potamianos Alexandros

Υπηρεσίες

Εξαγωγή

Κοινοποίηση

Στατιστικά

Μεταδεδομένων & Περιεχομένου σε METS:

Μεταδεδομένων σε Μορφότυπο: