Skip to main content

Showing 1–39 of 39 results for author: Ng, H T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.09821  [pdf, other

    cs.CL

    Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Knowledge in the real world is being updated constantly. However, it is costly to frequently update large language models (LLMs). Therefore, it is crucial for LLMs to understand the concept of temporal knowledge. However, prior works on temporal question answering (TQA) did not emphasize multi-answer and multi-hop types of temporal reasoning. In this paper, we propose a complex temporal question-a… ▽ More

    Submitted 12 July, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To appear in Findings of ACL 2024

  2. arXiv:2311.06807  [pdf, other

    cs.CL

    On the Robustness of Question Rewriting Systems to Questions of Varying Hardness

    Authors: Hai Ye, Hwee Tou Ng, Wenjuan Han

    Abstract: In conversational question answering (CQA), the task of question rewriting~(QR) in context aims to rewrite a context-dependent question into an equivalent self-contained question that gives the same answer. In this paper, we are interested in the robustness of a QR system to questions varying in rewriting hardness or difficulty. Since there is a lack of questions classified based on their rewritin… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: ACL'22, main, long paper

  3. arXiv:2310.14947  [pdf, other

    cs.CL

    System Combination via Quality Estimation for Grammatical Error Correction

    Authors: Muhammad Reza Qorib, Hwee Tou Ng

    Abstract: Quality estimation models have been developed to assess the corrections made by grammatical error correction (GEC) models when the reference or gold-standard corrections are not available. An ideal quality estimator can be utilized to combine the outputs of multiple GEC systems by choosing the best subset of edits from the union of all edits proposed by the GEC base systems. However, we found that… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  4. arXiv:2306.09697  [pdf, other

    cs.CL

    Class-Adaptive Self-Training for Relation Extraction with Incompletely Annotated Training Data

    Authors: Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng

    Abstract: Relation extraction (RE) aims to extract relations from sentences and documents. Existing relation extraction models typically rely on supervised machine learning. However, recent studies showed that many RE datasets are incompletely annotated. This is known as the false negative problem in which valid relations are falsely annotated as 'no_relation'. Models trained with such data inevitably make… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  5. arXiv:2306.08952  [pdf, other

    cs.CL cs.AI

    Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Reasoning about time is of fundamental importance. Many facts are time-dependent. For example, athletes change teams from time to time, and different government officials are elected periodically. Previous time-dependent question answering (QA) datasets tend to be biased in either their coverage of time spans or question types. In this paper, we introduce a comprehensive probing dataset \tempreaso… ▽ More

    Submitted 27 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  6. arXiv:2306.06779  [pdf, other

    cs.CL

    Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering

    Authors: Hai Ye, Qizhe Xie, Hwee Tou Ng

    Abstract: In this work, we study multi-source test-time model adaptation from user feedback, where K distinct models are established for adaptation. To allow efficient adaptation, we cast the problem as a stochastic decision-making process, aiming to determine the best adapted model after adaptation. We discuss two frameworks: multi-armed bandit learning and multi-armed dueling bandits. Compared to multi-ar… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Main conference of ACL 2023

  7. arXiv:2305.15014  [pdf, other

    cs.CL

    Unlocking Temporal Question Answering for Large Language Models Using Code Execution

    Authors: Xingxuan Li, Liying Cheng, Qingyu Tan, Hwee Tou Ng, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) have made significant progress in natural language processing (NLP), and are utilized extensively in various applications. Recent works, such as chain-of-thought (CoT), have shown that intermediate reasoning steps can improve the performance of LLMs for complex reasoning tasks, such as math problems and symbolic question-answering tasks. However, we notice the challeng… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2302.04618  [pdf, other

    cs.CL

    Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study

    Authors: Hai Ye, Yuyang Ding, Juntao Li, Hwee Tou Ng

    Abstract: A deployed question answering (QA) model can easily fail when the test data has a distribution shift compared to the training data. Robustness tuning (RT) methods have been widely studied to enhance model robustness against distribution shifts before model deployment. However, can we improve a model after deployment? To answer this question, we evaluate test-time adaptation (TTA) to improve a mode… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Findings of EMNLP 2022

  9. Grammatical Error Correction: A Survey of the State of the Art

    Authors: Christopher Bryant, Zheng Yuan, Muhammad Reza Qorib, Hannan Cao, Hwee Tou Ng, Ted Briscoe

    Abstract: Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject-verb agreement, but also orthographic and semantic errors, such as misspellings and word choice errors respectively. The field has seen significant progress in the last decade, m… ▽ More

    Submitted 29 April, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Journal ref: Computational Linguistics (2023) 49 (3): 643-701

  10. arXiv:2205.12696  [pdf, other

    cs.CL cs.IR

    Revisiting DocRED -- Addressing the False Negative Problem in Relation Extraction

    Authors: Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng, Sharifah Mahani Aljunied

    Abstract: The DocRED dataset is one of the most popular and widely used benchmarks for document-level relation extraction (RE). It adopts a recommend-revise annotation scheme so as to have a large-scale annotated dataset. However, we find that the annotation of DocRED is incomplete, i.e., false negative samples are prevalent. We analyze the causes and effects of the overwhelming false negative problem in th… ▽ More

    Submitted 16 June, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted by EMNLP 2022

  11. arXiv:2203.10900  [pdf, other

    cs.CL

    Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation

    Authors: Qingyu Tan, Ruidan He, Lidong Bing, Hwee Tou Ng

    Abstract: Document-level Relation Extraction (DocRE) is a more challenging task compared to its sentence-level counterpart. It aims to extract relations from multiple sentences at once. In this paper, we propose a semi-supervised framework for DocRE with three novel components. Firstly, we use an axial attention module for learning the interdependency among entity-pairs, which improves the performance on tw… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: To appear in the Findings of ACL 2022

  12. arXiv:2202.10948  [pdf, other

    cs.CL

    A Semi-supervised Learning Approach with Two Teachers to Improve Breakdown Identification in Dialogues

    Authors: Qian Lin, Hwee Tou Ng

    Abstract: Identifying breakdowns in ongoing dialogues helps to improve communication effectiveness. Most prior work on this topic relies on human annotated data and data augmentation to learn a classification model. While quality labeled dialogue data requires human annotation and is usually expensive to obtain, unlabeled data is easier to collect from various sources. In this paper, we propose a novel semi… ▽ More

    Submitted 19 April, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 9 pages, 2 figures

    Journal ref: Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (2022)

  13. arXiv:2111.01465  [pdf, other

    cs.CL

    System Combination for Grammatical Error Correction Based on Integer Programming

    Authors: Ruixi Lin, Hwee Tou Ng

    Abstract: In this paper, we propose a system combination method for grammatical error correction (GEC), based on nonlinear integer programming (IP). Our method optimizes a novel F score objective based on error types, and combines multiple end-to-end GEC systems. The proposed IP approach optimizes the selection of a single best system for each grammatical error type present in the data. Experiments of the I… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted for RANLP 2021

    Journal ref: RANLP (RECENT ADVANCES IN NATURAL LANGUAGE PROCESSING) (2021)

  14. arXiv:2110.15149  [pdf, other

    cs.CL

    Diversity-Driven Combination for Grammatical Error Correction

    Authors: Wenjuan Han, Hwee Tou Ng

    Abstract: Grammatical error correction (GEC) is the task of detecting and correcting errors in a written text. The idea of combining multiple system outputs has been successfully used in GEC. To achieve successful system combination, multiple component systems need to produce corrected sentences that are both diverse and of comparable quality. However, most existing state-of-the-art GEC approaches are based… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted by ICTAI 2021

  15. arXiv:2109.13724  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Translating from Morphologically Complex Languages: A Paraphrase-Based Approach

    Authors: Preslav Nakov, Hwee Tou Ng

    Abstract: We propose a novel approach to translating from a morphologically complex language. Unlike previous research, which has targeted word inflections and concatenations, we focus on the pairwise relationship between morphologically related words, which we treat as potential paraphrases and handle using paraphrasing techniques at the word, phrase, and sentence level. An important advantage of this fram… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: machine translation, morphologically complex languages, paraphrases (word, phrase, and sentence level), infelctional morphology, derivational morphology, Malay, Indonesian

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: ACL-2011

  16. arXiv:2108.09505  [pdf, other

    cs.CL

    A Hierarchical Entity Graph Convolutional Network for Relation Extraction across Documents

    Authors: Tapas Nayak, Hwee Tou Ng

    Abstract: Distantly supervised datasets for relation extraction mostly focus on sentence-level extraction, and they cover very few relations. In this work, we propose cross-document relation extraction, where the two entities of a relation tuple appear in two different documents that are connected via a chain of common entities. Following this idea, we create a dataset for two-hop relation extraction, where… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

    Comments: Accepted in RANLP 2021

  17. arXiv:2011.11499  [pdf, other

    cs.CL

    Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model

    Authors: Juntao Li, Ruidan He, Hai Ye, Hwee Tou Ng, Lidong Bing, Rui Yan

    Abstract: Recent research indicates that pretraining cross-lingual language models on large-scale unlabeled texts yields significant performance improvements over various cross-lingual and low-resource tasks. Through training on one hundred languages and terabytes of texts, cross-lingual language models have proven to be effective in leveraging high-resource languages to enhance low-resource language proces… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Journal ref: IJCAI-PRICAI2020

  18. arXiv:2010.01535  [pdf, ps, other

    cs.CL

    A Survey of Unsupervised Dependency Parsing

    Authors: Wenjuan Han, Yong Jiang, Hwee Tou Ng, Kewei Tu

    Abstract: Syntactic dependency parsing is an important task in natural language processing. Unsupervised dependency parsing aims to learn a dependency parser from sentences that have no annotation of their correct parse trees. Despite its difficulty, unsupervised parsing is an interesting research direction because of its capability of utilizing almost unlimited unannotated text data. It also serves as the… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: COLING 2020

  19. arXiv:2009.11538  [pdf, other

    cs.CL

    Feature Adaptation of Pre-Trained Language Models across Languages and Domains with Robust Self-Training

    Authors: Hai Ye, Qingyu Tan, Ruidan He, Juntao Li, Hwee Tou Ng, Lidong Bing

    Abstract: Adapting pre-trained language models (PrLMs) (e.g., BERT) to new domains has gained much attention recently. Instead of fine-tuning PrLMs as done in most previous work, we investigate how to adapt the features of PrLMs to new domains without fine-tuning. We explore unsupervised domain adaptation (UDA) in this paper. With the features from PrLMs, we adapt the models trained with labeled data from t… ▽ More

    Submitted 30 November, 2020; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: To appear at EMNLP 2020. 14 pages. Code is available at: https://github.com/oceanypt/CFd

  20. arXiv:2002.09919  [pdf, other

    cs.CL cs.AI

    Do Multi-Hop Question Answering Systems Know How to Answer the Single-Hop Sub-Questions?

    Authors: Yixuan Tang, Hwee Tou Ng, Anthony K. H. Tung

    Abstract: Multi-hop question answering (QA) requires a model to retrieve and integrate information from different parts of a long text to answer a question. Humans answer this kind of complex questions via a divide-and-conquer approach. In this paper, we investigate whether top-performing models for multi-hop questions understand the underlying sub-questions like humans. We adopt a neural decomposition mode… ▽ More

    Submitted 26 January, 2021; v1 submitted 23 February, 2020; originally announced February 2020.

  21. arXiv:1912.03832  [pdf, other

    cs.CL cs.LG

    Effective Attention Modeling for Neural Relation Extraction

    Authors: Tapas Nayak, Hwee Tou Ng

    Abstract: Relation extraction is the task of determining the relation between two entities in a sentence. Distantly-supervised models are popular for this task. However, sentences can be long and two entities can be located far from each other in a sentence. The pieces of evidence supporting the presence of a relation between two entities may not be very direct, since the entities may be connected via some… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

    Comments: Accepted at CoNLL 2019

  22. arXiv:1911.09886  [pdf, other

    cs.CL cs.LG

    Effective Modeling of Encoder-Decoder Architecture for Joint Entity and Relation Extraction

    Authors: Tapas Nayak, Hwee Tou Ng

    Abstract: A relation tuple consists of two entities and the relation between them, and often such tuples are found in unstructured text. There may be multiple relation tuples present in a text and they may share one or both entities among them. Extracting such relation tuples from a sentence is a difficult task and sharing of entities or overlap** entities among the tuples makes it more challenging. Most… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted at AAAI 2020

  23. arXiv:1910.00194  [pdf, other

    cs.CL

    Improved Word Sense Disambiguation Using Pre-Trained Contextualized Word Representations

    Authors: Christian Hadiwinoto, Hwee Tou Ng, Wee Chung Gan

    Abstract: Contextualized word representations are able to give different representations for the same word in different contexts, and they have been shown to be effective in downstream natural language processing tasks, such as question answering, named entity recognition, and sentiment analysis. However, evaluation on word sense disambiguation (WSD) in prior work shows that using contextualized word repres… ▽ More

    Submitted 28 December, 2019; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 10 pages, 2 figures, EMNLP 2019, added URL to the source code

  24. arXiv:1906.06906  [pdf, other

    cs.CL

    An Interactive Multi-Task Learning Network for End-to-End Aspect-Based Sentiment Analysis

    Authors: Ruidan He, Wee Sun Lee, Hwee Tou Ng, Daniel Dahlmeier

    Abstract: Aspect-based sentiment analysis produces a list of aspect terms and their corresponding sentiments for a natural language sentence. This task is usually done in a pipeline manner, with aspect term extraction performed first, followed by sentiment predictions toward the extracted aspect terms. While easier to develop, such an approach does not fully exploit joint information from the two subtasks a… ▽ More

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: Accepted to ACL2019

  25. arXiv:1809.00530  [pdf, other

    cs.CL

    Adaptive Semi-supervised Learning for Cross-domain Sentiment Classification

    Authors: Ruidan He, Wee Sun Lee, Hwee Tou Ng, Daniel Dahlmeier

    Abstract: We consider the cross-domain sentiment classification problem, where a sentiment classifier is to be learned from a source domain and to be generalized to a target domain. Our approach explicitly minimizes the distance between the source and the target instances in an embedded feature space. With the difference between source and target minimized, we then exploit additional information from the ta… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted to EMNLP2018

  26. arXiv:1806.04346  [pdf, other

    cs.CL

    Exploiting Document Knowledge for Aspect-level Sentiment Classification

    Authors: Ruidan He, Wee Sun Lee, Hwee Tou Ng, Daniel Dahlmeier

    Abstract: Attention-based long short-term memory (LSTM) networks have proven to be useful in aspect-level sentiment classification. However, due to the difficulties in annotating aspect-level data, existing public datasets for this task are all relatively small, which largely limits the effectiveness of those neural models. In this paper, we explore two approaches that transfer knowledge from document- leve… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: Accepted to ACL 2018 (short paper)

  27. arXiv:1805.01676  [pdf, other

    cs.CL

    Up** the Ante: Towards a Better Benchmark for Chinese-to-English Machine Translation

    Authors: Christian Hadiwinoto, Hwee Tou Ng

    Abstract: There are many machine translation (MT) papers that propose novel approaches and show improvements over their self-defined baselines. The experimental setting in each paper often differs from one another. As such, it is hard to determine if a proposed approach is really useful and advances the state of the art. Chinese-to-English translation is a common translation direction in MT papers, although… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: LREC 2018 (8 pages, 2 figures)

  28. arXiv:1801.08831  [pdf, other

    cs.CL

    A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction

    Authors: Shamil Chollampatt, Hwee Tou Ng

    Abstract: We improve automatic correction of grammatical, orthographic, and collocation errors in text using a multilayer convolutional encoder-decoder neural network. The network is initialized with embeddings that make use of character N-gram information to better suit this task. When evaluated on common benchmark test data sets (CoNLL-2014 and JFLEG), our model substantially outperforms all prior neural… ▽ More

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: 8 pages, 3 figures, In Proceedings of AAAI 2018

  29. arXiv:1801.08290  [pdf, other

    cs.CL

    A Question-Focused Multi-Factor Attention Network for Question Answering

    Authors: Souvik Kundu, Hwee Tou Ng

    Abstract: Neural network models recently proposed for question answering (QA) primarily focus on capturing the passage-question relation. However, they have minimal capability to link relevant facts distributed across multiple sentences which is crucial in achieving deeper understanding, such as performing multi-sentence reasoning, co-reference resolution, etc. They also do not explicitly focus on the quest… ▽ More

    Submitted 25 January, 2018; originally announced January 2018.

    Comments: 8 pages, AAAI 2018

  30. arXiv:1702.04510  [pdf, other

    cs.CL

    A Dependency-Based Neural Reordering Model for Statistical Machine Translation

    Authors: Christian Hadiwinoto, Hwee Tou Ng

    Abstract: In machine translation (MT) that involves translating between two languages with significant differences in word order, determining the correct word order of translated words is a major challenge. The dependency parse tree of a source sentence can help to determine the correct word order of the translated words. In this paper, we present a novel reordering approach utilizing a neural network and d… ▽ More

    Submitted 15 February, 2017; originally announced February 2017.

    Comments: 7 pages, 3 figures, Proceedings of AAAI-17

    Journal ref: Proceedings of AAAI-17 (2017)

  31. arXiv:1608.01084  [pdf, other

    cs.CL

    To Swap or Not to Swap? Exploiting Dependency Word Pairs for Reordering in Statistical Machine Translation

    Authors: Christian Hadiwinoto, Yang Liu, Hwee Tou Ng

    Abstract: Reordering poses a major challenge in machine translation (MT) between two languages with significant differences in word order. In this paper, we present a novel reordering approach utilizing sparse features based on dependency word pairs. Each instance of these features captures whether two words, which are related by a dependency link in the source sentence dependency parse tree, follow the sam… ▽ More

    Submitted 3 August, 2016; originally announced August 2016.

    Comments: 7 pages, 1 figures, Proceedings of AAAI-16

    Journal ref: Proceedings of AAAI-16. (pp. 2943--2949) (2016)

  32. arXiv:1606.00210  [pdf, other

    cs.CL

    Exploiting N-Best Hypotheses to Improve an SMT Approach to Grammatical Error Correction

    Authors: Duc Tam Hoang, Shamil Chollampatt, Hwee Tou Ng

    Abstract: Grammatical error correction (GEC) is the task of detecting and correcting grammatical errors in texts written by second language learners. The statistical machine translation (SMT) approach to GEC, in which sentences written by second language learners are translated to grammatically correct sentences, has achieved state-of-the-art accuracy. However, the SMT approach is unable to utilize global c… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

    Comments: Accepted for presentation at IJCAI-16

  33. arXiv:1606.00189  [pdf, other

    cs.CL

    Neural Network Translation Models for Grammatical Error Correction

    Authors: Shamil Chollampatt, Kaveh Taghipour, Hwee Tou Ng

    Abstract: Phrase-based statistical machine translation (SMT) systems have previously been used for the task of grammatical error correction (GEC) to achieve state-of-the-art accuracy. The superiority of SMT systems comes from their ability to learn text transformations from erroneous to corrected text, without explicitly modeling error types. However, phrase-based SMT systems suffer from limitations of disc… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

    Comments: Accepted for presentation at IJCAI-16

  34. Improving Statistical Machine Translation for a Resource-Poor Language Using Related Resource-Rich Languages

    Authors: Preslav Ivanov Nakov, Hwee Tou Ng

    Abstract: We propose a novel language-independent approach for improving machine translation for resource-poor languages by exploiting their similarity to resource-rich ones. More precisely, we improve the translation from a resource-poor source language X_1 into a resource-rich language Y given a bi-text containing a limited number of parallel sentences for X_1-Y and a larger bi-text for X_2-Y for some res… ▽ More

    Submitted 22 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 44, pages 179-222, 2012

  35. arXiv:1011.0835  [pdf, other

    cs.CL

    A PDTB-Styled End-to-End Discourse Parser

    Authors: Ziheng Lin, Hwee Tou Ng, Min-Yen Kan

    Abstract: We have developed a full discourse parser in the Penn Discourse Treebank (PDTB) style. Our trained parser first identifies all discourse and non-discourse relations, locates and labels their arguments, and then classifies their relation types. When appropriate, the attribution spans to these relations are also determined. We present a comprehensive evaluation from both component-wise and error-cas… ▽ More

    Submitted 3 November, 2010; originally announced November 2010.

    Comments: 15 pages, 5 figures, 7 tables

    Report number: TRB8/10

    Journal ref: Natural Language Engineering 20 (02), 151 - 184, 2014

  36. arXiv:1010.3995  [pdf, ps, other

    quant-ph cs.DS math.NA

    An iterative approach for amplitude amplification with nonorthogonal measurements

    Authors: H. T. Ng, Franco Nori

    Abstract: Using three coupled harmonic oscillators, we present an amplitude-amplification method for factorization of an integer. We generalize the method in [arXiv:1007.4338] by employing non-orthogonal measurements on the harmonic oscillator. This method can increase the probability of obtaining the factors by repeatedly using the nonlinear interactions between the oscillators and non-orthogonal measureme… ▽ More

    Submitted 6 August, 2012; v1 submitted 19 October, 2010; originally announced October 2010.

    Comments: 21 pages, 5 figures; title changed, major revisions

  37. arXiv:1007.4338  [pdf, ps, other

    quant-ph cs.DS

    A proposal for factorization using Kerr nonlinearities between three harmonic oscillators

    Authors: H. T. Ng, Franco Nori

    Abstract: We propose an alternative method to factorize an integer by using three harmonic oscillators. These oscillators are coupled together via specific Kerr nonlinear interactions. This method can be applied even if two harmonic oscillators are prepared in mixed states. As simple examples, we show how to factorize N=15 and 35 using this approach. The effect of dissipation of the harmonic oscillators on… ▽ More

    Submitted 6 August, 2012; v1 submitted 25 July, 2010; originally announced July 2010.

    Comments: 21 pages, 5 figures; title changed, major revisions

  38. Exemplar-Based Word Sense Disambiguation: Some Recent Improvements

    Authors: Hwee Tou Ng

    Abstract: In this paper, we report recent improvements to the exemplar-based learning approach for word sense disambiguation that have achieved higher disambiguation accuracy. By using a larger value of $k$, the number of nearest neighbors to use for determining the class of a test example, and through 10-fold cross validation to automatically determine the best $k$, we have obtained improved disambiguati… ▽ More

    Submitted 10 June, 1997; originally announced June 1997.

    Comments: 6 pages

    Journal ref: In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing (EMNLP-2), August 1997

  39. arXiv:cmp-lg/9606032  [pdf, ps

    cs.CL

    Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach

    Authors: Hwee Tou Ng, Hian Beng Lee

    Abstract: In this paper, we present a new approach for word sense disambiguation (WSD) using an exemplar-based learning algorithm. This approach integrates a diverse set of knowledge sources to disambiguate word sense, including part of speech of neighboring words, morphological form, the unordered set of surrounding words, local collocations, and verb-object syntactic relation. We tested our WSD program,… ▽ More

    Submitted 29 June, 1996; originally announced June 1996.

    Comments: In Proceedings of ACL96, 8 pages

    Journal ref: ACL-96