Skip to main content

Showing 1–36 of 36 results for author: Tur, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2208.01448  [pdf, other

    cs.CL cs.LG

    AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

    Authors: Saleh Soltan, Shankar Ananthakrishnan, Jack FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan

    Abstract: In this work, we demonstrate that multilingual large-scale sequence-to-sequence (seq2seq) models, pre-trained on a mixture of denoising and Causal Language Modeling (CLM) tasks, are more efficient few-shot learners than decoder-only models on various tasks. In particular, we train a 20 billion parameter multilingual seq2seq model called Alexa Teacher Model (AlexaTM 20B) and show that it achieves s… ▽ More

    Submitted 3 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

  2. arXiv:2206.07808  [pdf, other

    cs.CL cs.AI cs.LG

    Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

    Authors: Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, ** Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak , et al. (16 additional authors not shown)

    Abstract: We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. Though we train using 70% spoken-form data, our teacher models perform co… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: KDD 2022

    ACM Class: I.2.7

    Journal ref: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  3. arXiv:2204.08582  [pdf, other

    cs.CL cs.AI cs.LG

    MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages

    Authors: Jack FitzGerald, Christopher Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gokhan Tur, Prem Natarajan

    Abstract: We present the MASSIVE dataset--Multilingual Amazon Slu resource package (SLURP) for Slot-filling, Intent classification, and Virtual assistant Evaluation. MASSIVE contains 1M realistic, parallel, labeled virtual assistant utterances spanning 51 languages, 18 domains, 60 intents, and 55 slots. MASSIVE was created by tasking professional translators to localize the English-only SLURP dataset into 5… ▽ More

    Submitted 17 June, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: Preprint; 8 pages

  4. arXiv:2110.00534  [pdf, other

    cs.CV cs.AI cs.CL cs.RO

    TEACh: Task-driven Embodied Agents that Chat

    Authors: Aishwarya Padmakumar, Jesse Thomason, Ayush Shrivastava, Patrick Lange, Anjali Narayan-Chen, Spandana Gella, Robinson Piramuthu, Gokhan Tur, Dilek Hakkani-Tur

    Abstract: Robots operating in human spaces must be able to engage in natural language interaction with people, both understanding and executing instructions, and using conversation to resolve ambiguity and recover from mistakes. To study this, we introduce TEACh, a dataset of over 3,000 human--human, interactive dialogues to complete household tasks in simulation. A Commander with access to oracle informati… ▽ More

    Submitted 28 December, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: Accepted at AAAI 2022; 7 pages main, 28 pages total, 29 figures; Version 3 uses a new test set for EDH instances that restrict evaluation to state changes only on task-relevant objects

  5. arXiv:2106.08484  [pdf, other

    cs.CL cs.HC

    Generative Conversational Networks

    Authors: Alexandros Papangelis, Karthik Gopalakrishnan, Aishwarya Padmakumar, Seokhwan Kim, Gokhan Tur, Dilek Hakkani-Tur

    Abstract: Inspired by recent work in meta-learning and generative teaching networks, we propose a framework called Generative Conversational Networks, in which conversational agents learn to generate their own labelled training data (given some seed data) and then train themselves from that data to perform a given task. We use reinforcement learning to optimize the data generation process where the reward s… ▽ More

    Submitted 16 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: SIGDial 2021

  6. arXiv:2105.11589  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.RO

    VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator

    Authors: Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gokhan Tür, Devi Parikh, Dilek Hakkani-Tür

    Abstract: Interactive robots navigating photo-realistic environments need to be trained to effectively leverage and handle the dynamic nature of dialogue in addition to the challenges underlying vision-and-language navigation (VLN). In this paper, we present VISITRON, a multi-modal Transformer-based navigator better suited to the interactive regime inherent to Cooperative Vision-and-Dialog Navigation (CVDN)… ▽ More

    Submitted 15 March, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: Accepted at Findings of the Annual Meeting of the Association for Computational Linguistics (ACL) 2022, previous version accepted at Visually Grounded Interaction and Language (ViGIL) Workshop at NAACL 2021

    ACM Class: I.2.9

  7. arXiv:2105.11541  [pdf, other

    cs.CV

    Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation

    Authors: Tao Tu, Qing **, Govind Thattai, Gokhan Tur, Prem Natarajan

    Abstract: GuessWhat?! is a two-player visual dialog guessing game where player A asks a sequence of yes/no questions (Questioner) and makes a final guess (Guesser) about a target object in an image, based on answers from player B (Oracle). Based on this dialog history between the Questioner and the Oracle, a Guesser makes a final guess of the target object. Previous baseline Oracle model encodes no visual i… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

  8. arXiv:2103.14580  [pdf, other

    cs.CL

    Correcting Automated and Manual Speech Transcription Errors using Warped Language Models

    Authors: Mahdi Namazifar, John Malik, Li Erran Li, Gokhan Tur, Dilek Hakkani Tür

    Abstract: Masked language models have revolutionized natural language processing systems in the past few years. A recently introduced generalization of masked language models called warped language models are trained to be more robust to the types of errors that appear in automatic or manual transcriptions of spoken language by exposing the language model to the same types of errors during training. In this… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Submitted to INTERSPEECH

  9. arXiv:2101.03431  [pdf, other

    cs.AI cs.CL cs.CV cs.RO

    Are We There Yet? Learning to Localize in Embodied Instruction Following

    Authors: Shane Storks, Qiaozi Gao, Govind Thattai, Gokhan Tur

    Abstract: Embodied instruction following is a challenging problem requiring an agent to infer a sequence of primitive actions to achieve a goal environment state from complex language and visual inputs. Action Learning From Realistic Environments and Directives (ALFRED) is a recently proposed benchmark for this problem consisting of step-by-step natural language instructions to achieve subgoals which compos… ▽ More

    Submitted 9 January, 2021; originally announced January 2021.

    Comments: Accepted to HAI @ AAAI 2021

  10. arXiv:2012.14653  [pdf, other

    cs.CL cs.HC

    Can You be More Social? Injecting Politeness and Positivity into Task-Oriented Conversational Agents

    Authors: Yi-Chia Wang, Alexandros Papangelis, Runze Wang, Zhaleh Feizollahi, Gokhan Tur, Robert Kraut

    Abstract: Goal-oriented conversational agents are becoming prevalent in our daily lives. For these systems to engage users and achieve their goals, they need to exhibit appropriate social behavior as well as provide informative replies that guide users through tasks. The first component of the research in this paper applies statistical modeling techniques to understand conversations between users and human… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

  11. arXiv:2012.00958  [pdf, other

    cs.CL

    Interactive Teaching for Conversational AI

    Authors: Qing **, Feiyang Niu, Govind Thattai, Joel Chengottusseriyil, Qiaozi Gao, Aishwarya Reganti, Prashanth Rajagopal, Gokhan Tur, Dilek Hakkani-Tur, Prem Nataraja

    Abstract: Current conversational AI systems aim to understand a set of pre-designed requests and execute related actions, which limits them to evolve naturally and adapt based on human interactions. Motivated by how children learn their first language interacting with adults, this paper describes a new Teachable AI system that is capable of learning new language nuggets called concepts, directly from end us… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

    Comments: Accepted at Human in the Loop Dialogue Systems Workshop @NeurIPS 2020

  12. arXiv:2011.10731  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering

    Authors: Weixin Liang, Feiyang Niu, Aishwarya Reganti, Govind Thattai, Gokhan Tur

    Abstract: The predominant approach to visual question answering (VQA) relies on encoding the image and question with a "black-box" neural encoder and decoding a single token as the answer like "yes" or "no". Despite this approach's strong quantitative results, it struggles to come up with intuitive, human-readable forms of justification for the prediction process. To address this insufficiency, we reformula… ▽ More

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: NeurIPS KR2ML 2020

  13. arXiv:2011.03023  [pdf, other

    cs.CL cs.AI

    Language Model is All You Need: Natural Language Understanding as Question Answering

    Authors: Mahdi Namazifar, Alexandros Papangelis, Gokhan Tur, Dilek Hakkani-Tür

    Abstract: Different flavors of transfer learning have shown tremendous impact in advancing research and applications of machine learning. In this work we study the use of a specific family of transfer learning, where the target domain is mapped to the source domain. Specifically we map Natural Language Understanding (NLU) problems to QuestionAnswering (QA) problems and we show that in low data regimes this… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

  14. arXiv:2011.01900  [pdf, other

    cs.CL cs.AI

    Warped Language Models for Noise Robust Language Understanding

    Authors: Mahdi Namazifar, Gokhan Tur, Dilek Hakkani Tür

    Abstract: Masked Language Models (MLM) are self-supervised neural networks trained to fill in the blanks in a given sentence with masked tokens. Despite the tremendous success of MLMs for various text based tasks, they are not robust for spoken language understanding, especially for spontaneous conversational speech recognition noise. In this work we introduce Warped Language Models (WLM) in which input sen… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: To appear at IEEE SLT 2021

  15. arXiv:2009.12046  [pdf, other

    cs.CL

    Controllable Text Generation with Focused Variation

    Authors: Lei Shu, Alexandros Papangelis, Yi-Chia Wang, Gokhan Tur, Hu Xu, Zhaleh Feizollahi, Bing Liu, Piero Molino

    Abstract: This work introduces Focused-Variation Network (FVN), a novel model to control language generation. The main problems in previous controlled language generation models range from the difficulty of generating text according to the given attributes, to the lack of diversity of the generated texts. FVN addresses these issues by learning disjoint discrete latent spaces for each attribute inside codebo… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

  16. arXiv:2003.09125  [pdf, other

    eess.AS cs.LG

    Improving Embedding Extraction for Speaker Verification with Ladder Network

    Authors: Fei Tao, Gokhan Tur

    Abstract: Speaker verification is an established yet challenging task in speech processing and a very vibrant research area. Recent speaker verification (SV) systems rely on deep neural networks to extract high-level embeddings which are able to characterize the users' voices. Most of the studies have investigated on improving the discriminability of the networks to extract better embeddings for performance… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

  17. arXiv:2002.07629  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Multi-Task Siamese Neural Network for Improving Replay Attack Detection

    Authors: Patrick von Platen, Fei Tao, Gokhan Tur

    Abstract: Automatic speaker verification systems are vulnerable to audio replay attacks which bypass security by replaying recordings of authorized speakers. Replay attack detection (RA) detection systems built upon Residual Neural Networks (ResNet)s have yielded astonishing results on the public benchmark ASVspoof 2019 Physical Access challenge. With most teams using fine-tuned feature extraction pipelines… ▽ More

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: Submit to INTERSPEECH2020

  18. arXiv:2002.00750  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Joint Contextual Modeling for ASR Correction and Language Understanding

    Authors: Yue Weng, Sai Sumanth Miryala, Chandra Khatri, Runze Wang, Huaixiu Zheng, Piero Molino, Mahdi Namazifar, Alexandros Papangelis, Hugh Williams, Franziska Bell, Gokhan Tur

    Abstract: The quality of automatic speech recognition (ASR) is critical to Dialogue Systems as ASR errors propagate to and directly impact downstream tasks such as language understanding (LU). In this paper, we propose multi-task neural approaches to perform contextual language correction on ASR outputs jointly with LU to improve the performance of both tasks simultaneously. To measure the effectiveness of… ▽ More

    Submitted 28 January, 2020; originally announced February 2020.

    Comments: Accepted at IEEE ICASSP 2020

  19. arXiv:2001.08868  [pdf, other

    cs.CL cs.AI

    Exploration Based Language Learning for Text-Based Games

    Authors: Andrea Madotto, Mahdi Namazifar, Joost Huizinga, Piero Molino, Adrien Ecoffet, Huaixiu Zheng, Alexandros Papangelis, Dian Yu, Chandra Khatri, Gokhan Tur

    Abstract: This work presents an exploration and imitation-learning-based agent capable of state-of-the-art performance in playing text-based computer games. Text-based computer games describe their world to the player through natural language and expect the player to interact with the game using text. These games are of interest as they can be seen as a testbed for language understanding, problem-solving, a… ▽ More

    Submitted 7 June, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted at IJCAI 2020

  20. arXiv:2001.06463  [pdf, other

    cs.HC cs.AI cs.CL

    Plato Dialogue System: A Flexible Conversational AI Research Platform

    Authors: Alexandros Papangelis, Mahdi Namazifar, Chandra Khatri, Yi-Chia Wang, Piero Molino, Gokhan Tur

    Abstract: As the field of Spoken Dialogue Systems and Conversational AI grows, so does the need for tools and environments that abstract away implementation details in order to expedite the development process, lower the barrier of entry to the field, and offer a common test-bed for new ideas. In this paper, we present Plato, a flexible Conversational AI platform written in Python that supports any kind of… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

  21. arXiv:1908.02402  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Flexibly-Structured Model for Task-Oriented Dialogues

    Authors: Lei Shu, Piero Molino, Mahdi Namazifar, Hu Xu, Bing Liu, Huaixiu Zheng, Gokhan Tur

    Abstract: This paper proposes a novel end-to-end architecture for task-oriented dialogue systems. It is based on a simple and practical yet very effective sequence-to-sequence approach, where language understanding and state tracking tasks are modeled jointly with a structured copy-augmented sequential decoder and a multi-label decoder for each slot. The policy engine and language generation tasks are model… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

  22. OCC: A Smart Reply System for Efficient In-App Communications

    Authors: Yue Weng, Huaixiu Zheng, Franziska Bell, Gokhan Tur

    Abstract: Smart reply systems have been developed for various messaging platforms. In this paper, we introduce Uber's smart reply system: one-click-chat (OCC), which is a key enhanced feature on top of the Uber in-app chat system. It enables driver-partners to quickly respond to rider messages using smart replies. The smart replies are dynamically selected according to conversation content using machine lea… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: link to demo: https://www.youtube.com/watch?v=nOffUT7rS0A&t=32s

    Journal ref: KDD 19, August 4-8, 2019, Anchorage, AK, USA

  23. arXiv:1907.05507  [pdf, other

    cs.HC cs.CL

    Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning

    Authors: Alexandros Papangelis, Yi-Chia Wang, Piero Molino, Gokhan Tur

    Abstract: We present the first complete attempt at concurrently training conversational agents that communicate only via self-generated language. Using DSTC2 as seed data, we trained natural language understanding (NLU) and generation (NLG) networks for each agent and let the agents interact online. We model the interaction as a stochastic collaborative game where each agent (player) has a role ("assistant"… ▽ More

    Submitted 24 July, 2019; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: SIGDIAL 2019

  24. arXiv:1811.04369  [pdf, other

    cs.CL cs.AI cs.LG

    User Modeling for Task Oriented Dialogues

    Authors: Izzeddin Gur, Dilek Hakkani-Tur, Gokhan Tur, Pararth Shah

    Abstract: We introduce end-to-end neural network based models for simulating users of task-oriented dialogue systems. User simulation in dialogue systems is crucial from two different perspectives: (i) automatic evaluation of different dialogue models, and (ii) training task-oriented dialogue systems. We design a hierarchical sequence-to-sequence model that first encodes the initial user goal and system tur… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: Accepted at SLT 2018

  25. arXiv:1804.06512  [pdf, other

    cs.CL

    Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

    Authors: Bing Liu, Gokhan Tur, Dilek Hakkani-Tur, Pararth Shah, Larry Heck

    Abstract: In this work, we present a hybrid learning method for training task-oriented dialogue systems through online user interactions. Popular methods for learning task-oriented dialogues include applying reinforcement learning with user feedback on supervised pre-training models. Efficiency of such learning method may suffer from the mismatch of dialogue state distribution between offline training and o… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: To appear in NAACL 2018 as a long paper

  26. arXiv:1801.04871  [pdf, other

    cs.AI cs.CL

    Building a Conversational Agent Overnight with Dialogue Self-Play

    Authors: Pararth Shah, Dilek Hakkani-Tür, Gokhan Tür, Abhinav Rastogi, Ankur Bapna, Neha Nayak, Larry Heck

    Abstract: We propose Machines Talking To Machines (M2M), a framework combining automation and crowdsourcing to rapidly bootstrap end-to-end dialogue agents for goal-oriented dialogues in arbitrary domains. M2M scales to new tasks with just a task schema and an API client from the dialogue system developer, but it is also customizable to cater to task-specific interactions. Compared to the Wizard-of-Oz appro… ▽ More

    Submitted 15 January, 2018; originally announced January 2018.

    Comments: 11 pages, 4 figures

  27. arXiv:1711.10712  [pdf, other

    cs.CL

    End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning

    Authors: Bing Liu, Gokhan Tur, Dilek Hakkani-Tur, Pararth Shah, Larry Heck

    Abstract: In this paper, we present a neural network based task-oriented dialogue system that can be optimized end-to-end with deep reinforcement learning (RL). The system is able to track dialogue state, interface with knowledge bases, and incorporate query results into agent's responses to successfully complete task-oriented dialogues. Dialogue policy learning is conducted with a hybrid supervised and dee… ▽ More

    Submitted 30 November, 2017; v1 submitted 29 November, 2017; originally announced November 2017.

  28. arXiv:1707.02363  [pdf, other

    cs.AI cs.CL

    Towards Zero-Shot Frame Semantic Parsing for Domain Scaling

    Authors: Ankur Bapna, Gokhan Tur, Dilek Hakkani-Tur, Larry Heck

    Abstract: State-of-the-art slot filling models for goal-oriented human/machine conversational language understanding systems rely on deep learning methods. While multi-task training of such models alleviates the need for large in-domain annotated datasets, bootstrap** a semantic parsing model for a new domain using only the semantic frame, such as the back-end API or knowledge graph schema, is still one o… ▽ More

    Submitted 7 July, 2017; originally announced July 2017.

    Comments: 4 pages + 1 references

  29. arXiv:1705.03455  [pdf, other

    cs.CL cs.AI cs.LG

    Sequential Dialogue Context Modeling for Spoken Language Understanding

    Authors: Ankur Bapna, Gokhan Tur, Dilek Hakkani-Tur, Larry Heck

    Abstract: Spoken Language Understanding (SLU) is a key component of goal oriented dialogue systems that would parse user utterances into semantic frame representations. Traditionally SLU does not utilize the dialogue history beyond the previous system turn and contextual ambiguities are resolved by the downstream components. In this paper, we explore novel approaches for modeling dialogue context in a recur… ▽ More

    Submitted 7 July, 2017; v1 submitted 8 May, 2017; originally announced May 2017.

    Comments: 8 + 2 pages, Updated 10/17: Updated typos in abstract, Updated 07/07: Updated Title, abstract and few minor changes

  30. arXiv:1609.03286  [pdf, other

    cs.AI cs.CL

    Knowledge as a Teacher: Knowledge-Guided Structural Attention Networks

    Authors: Yun-Nung Chen, Dilek Hakkani-Tur, Gokhan Tur, Asli Celikyilmaz, Jianfeng Gao, Li Deng

    Abstract: Natural language understanding (NLU) is a core component of a spoken dialogue system. Recently recurrent neural networks (RNN) obtained strong results on NLU due to their superior ability of preserving sequential information over time. Traditionally, the NLU module tags semantic slots for utterances considering their flat structures, as the underlying RNN structure is a linear chain. However, natu… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: 11 pages, 5 figures

  31. arXiv:1401.0509  [pdf, other

    cs.CL cs.LG

    Zero-Shot Learning for Semantic Utterance Classification

    Authors: Yann N. Dauphin, Gokhan Tur, Dilek Hakkani-Tur, Larry Heck

    Abstract: We propose a novel zero-shot learning method for semantic utterance classification (SUC). It learns a classifier $f: X \to Y$ for problems where none of the semantic categories $Y$ are present in the training set. The framework uncovers the link between categories and utterances using a semantic space. We show that this semantic space can be learned by deep neural networks trained on large amounts… ▽ More

    Submitted 7 March, 2014; v1 submitted 20 December, 2013; originally announced January 2014.

  32. Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation

    Authors: G. Tur, D. Hakkani-Tur, A. Stolcke, E. Shriberg

    Abstract: We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmentation of speech into topically coherent units. We propose two methods for combining lexical and prosodic information using hidden Markov models and decision trees. Lexical information is obtained from a speech recognizer, and prosodic features are extracted automatically from speech waveforms. We e… ▽ More

    Submitted 31 May, 2001; originally announced May 2001.

    Comments: 27 pages, 8 figures

    ACM Class: I.2.7

    Journal ref: Computation Linguistics 27(1), 31-57, March 2001

  33. Prosody-Based Automatic Segmentation of Speech into Sentences and Topics

    Authors: E. Shriberg, A. Stolcke, D. Hakkani-Tur, G. Tur

    Abstract: A crucial step in processing speech audio data for information extraction, topic detection, or browsing/playback is to segment the input into sentence and topic units. Speech segmentation is challenging, since the cues typically present for segmenting text (headers, paragraphs, punctuation) are absent in spoken language. We investigate the use of prosody (information gleaned from the timing and… ▽ More

    Submitted 27 June, 2000; originally announced June 2000.

    Comments: 30 pages, 9 figures. To appear in Speech Communication 32(1-2), Special Issue on Accessing Information in Spoken Audio, September 2000

    ACM Class: I.2.7

    Journal ref: Speech Communication 32(1-2), 127-154, September 2000

  34. Morphological Disambiguation by Voting Constraints

    Authors: Kemal Oflazer, Gokhan Tur

    Abstract: We present a constraint-based morphological disambiguation system in which individual constraints vote on matching morphological parses, and disambiguation of all the tokens in a sentence is performed at the end by selecting parses that receive the highest votes. This constraint application paradigm makes the outcome of the disambiguation independent of the rule sequence, and hence relieves the… ▽ More

    Submitted 25 April, 1997; originally announced April 1997.

    Comments: 8 pages, Latex source. To appear in Proceedings of ACL/EACL'97 Compressed postscript also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/acl97.ps.z

  35. arXiv:cmp-lg/9607030  [pdf, ps

    cs.CL

    Using Multiple Sources of Information for Constraint-Based Morphological Disambiguation

    Authors: Gokhan Tur

    Abstract: This thesis presents a constraint-based morphological disambiguation approach that is applicable to languages with complex morphology--specifically agglutinative languages with productive inflectional and derivational morphological phenomena. For morphologically complex languages like Turkish, automatic morphological disambiguation involves selecting for each token morphological parse(s), with t… ▽ More

    Submitted 30 July, 1996; originally announced July 1996.

    Comments: M.Sc. Thesis submitted to the Department of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey. Also available as: ftp://ftp.cs.bilkent.edu.tr/pub/tech-reports/1996/BU-CEIS-9615ps.z

    Report number: BU-CEIS-9615

  36. arXiv:cmp-lg/9604001  [pdf, ps

    cs.CL

    Combining Hand-crafted Rules and Unsupervised Learning in Constraint-based Morphological Disambiguation

    Authors: Kemal Oflazer, Gokhan Tur

    Abstract: This paper presents a constraint-based morphological disambiguation approach that is applicable languages with complex morphology--specifically agglutinative languages with productive inflectional and derivational morphological phenomena. In certain respects, our approach has been motivated by Brill's recent work, but with the observation that his transformational approach is not directly applic… ▽ More

    Submitted 12 April, 1996; v1 submitted 11 April, 1996; originally announced April 1996.

    Comments: gzipped and uuencoded postscript, 13 pages. Also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/emnlp.ps.z