-
arXiv:cs/0008016
Processing Self Corrections in a speech to speech system
Abstract: Speech repairs occur often in spontaneous spoken dialogues. The ability to detect and correct those repairs is necessary for any spoken language system. We present a framework to detect and correct speech repairs where all relevant levels of information, i.e., acoustics, lexis, syntax and semantics can be integrated. The basic idea is to reduce the search space for repairs as soon as possible by…
Submitted 21 August, 2000; originally announced August 2000.
Comments: 5 pages, 2 figures
ACM Class: I.2.7
Journal ref: Proceedings of COLING 2000, Saarbruecken, Germany; 31.7-4.8; pp 1116-1120
-
arXiv:cs/9907021
Architectural Considerations for Conversational Systems -- The Verbmobil/INTARC Experience
Abstract: The paper describes the speech to speech translation system INTARC, developed during the first phase of the Verbmobil project. The general design goals of the INTARC system architecture were time synchronous processing as well as incrementality and interactivity as a means to achieve a higher degree of robustness and scalability. Interactivity means that in addition to the bottom-up (in terms of…
Submitted 14 July, 1999; originally announced July 1999.
Comments: 10 pages, to appear in proceedings of First International Workshop on Human Computer Conversation, Bellagio, Italy
ACM Class: I.2.7
-
arXiv:cs/9809022
Modelling Users, Intentions, and Structure in Spoken Dialog
Abstract: We outline how utterances in dialogs can be interpreted using a partial first order logic. We exploit the capability of this logic to talk about the truth status of formulae to define a notion of coherence between utterances and explain how this coherence relation can serve for the construction of AND/OR trees that represent the segmentation of the dialog. In a BDI model we formalize basic assum…
Submitted 17 September, 1998; originally announced September 1998.
Comments: 17 pages
ACM Class: H.5.2
-
Combining Expression and Content in Domains for Dialog Managers
Abstract: We present work in progress on abstracting dialog managers from their domain in order to implement a dialog manager development tool which takes (among other data) a domain description as input and delivers a new dialog manager for the described domain as output. Thereby we will focus on two topics; firstly, the construction of domain descriptions with description logics and secondly, the interp…
Submitted 13 August, 1998; originally announced August 1998.
Comments: 5 pages, uses conference.sty
Journal ref: Proceedings of DL '98, pp. 126-130, Trento, Italy
-
Research on Architectures for Integrated Speech/Language Systems in Verbmobil
Abstract: The German joint research project Verbmobil (VM) aims at the development of a speech to speech translation system. This paper reports on research done in our group which belongs to Verbmobil's subproject on system architectures (TP15). Our specific research areas are the construction of parsers for spontaneous speech, investigations in the parallelization of parsing and to contribute to the deve…
Submitted 25 June, 1996; originally announced June 1996.
Comments: 6 pages, 2 Postscript figures
Journal ref: accepted for COLING 96
-
Towards Understanding Spontaneous Speech: Word Accuracy vs. Concept Accuracy
Abstract: In this paper we describe an approach to automatic evaluation of both the speech recognition and understanding capabilities of a spoken dialogue system for train time table information. We use word accuracy for recognition and concept accuracy for understanding performance judgement. Both measures are calculated by comparing these modules' output with a correct reference answer. We report evalua…
Submitted 15 May, 1996; originally announced May 1996.
Comments: 4 pages PS, LaTeX2e source importing 2 eps figures, uses icslp.cls, caption.sty, psfig.sty; to appear in the Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP 96)
-
Robust Parsing of Spoken Dialogue Using Contextual Knowledge and Recognition Probabilities
Abstract: In this paper we describe the linguistic processor of a spoken dialogue system. The parser receives a word graph from the recognition module as its input. Its task is to find the best path through the graph. If no complete solution can be found, a robust mechanism for selecting multiple partial results is applied. We show how the information content rate of the results can be improved if the sel…
Submitted 8 May, 1995; originally announced May 1995.
Comments: 4 pages, LaTeX source, 3 PostScript figures, uses epsf.sty and ETRW.sty, to appear in Proceedings of ESCA Workshop on Spoken Dialogue Systems, Denmark, May 30-June 2
-
Anytime Algorithms for Speech Parsing?
Abstract: This paper discusses to which extent the concept of "anytime algorithms" can be applied to parsing algorithms with feature unification. We first try to give a more precise definition of what an anytime algorithm is. We argue that parsing algorithms have to be classified as contract algorithms as opposed to (truly) interruptible algorithms. With the restriction that the transaction being active…
Submitted 21 June, 1994; originally announced June 1994.
Comments: 5 pages, 2 figures
Journal ref: COLING-94