Skip to main content

Showing 1–2 of 2 results for author: Gale, W A

Searching in archive cs. Search in all archives.
.
  1. Tagging French Without Lexical Probabilities -- Combining Linguistic Knowledge And Statistical Learning

    Authors: Evelyne Tzoukermann, Dragomir R. Radev, William A. Gale

    Abstract: This paper explores morpho-syntactic ambiguities for French to develop a strategy for part-of-speech disambiguation that a) reflects the complexity of French as an inflected language, b) optimizes the estimation of probabilities, c) allows the user flexibility in choosing a tagset. The problem in extracting lexical probabilities from a limited training corpus is that the statistical model may no… ▽ More

    Submitted 10 October, 1997; originally announced October 1997.

    Comments: uses ypsfig

  2. arXiv:cmp-lg/9407020  [pdf, ps

    cs.CL

    A Sequential Algorithm for Training Text Classifiers

    Authors: David D. Lewis, William A. Gale

    Abstract: The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was developed and tested on a newswire text categorization task. This method, which we call uncertain… ▽ More

    Submitted 24 July, 1994; v1 submitted 24 July, 1994; originally announced July 1994.

    Comments: 10 pages, uuencoded, compressed PostScript; Proc. SIGIR-94 LaTex available from [email protected]