Skip to main content

Showing 1–9 of 9 results for author: Vembu, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:1911.07335  [pdf, other

    cs.CL cs.LG stat.ML

    Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

    Authors: Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan, Rheeya Uppaal, Andrew McCallum

    Abstract: Existing deep active learning algorithms achieve impressive sampling efficiency on natural language processing tasks. However, they exhibit several weaknesses in practice, including (a) inability to use uncertainty sampling with black-box models, (b) lack of robustness to labeling noise, and (c) lack of transparency. In response, we propose a transparent batch active sampling framework by estimati… ▽ More

    Submitted 20 July, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: This is a pre-print of an article published in Springer Machine Learning journal. The final authenticated version is available online at: https://doi.org/10.1007/s10994-020-05897-1

  2. arXiv:1710.08579  [pdf, other

    cs.DL

    Implementing Recommendation Algorithms in a Large-Scale Biomedical Science Knowledge Base

    Authors: Jessica Perrie, Yanqi Hao, Zack Hayat, Recep Colak, Kelly Lyons, Shankar Vembu, Sam Molyneux

    Abstract: The number of biomedical research articles published has doubled in the past 20 years. Search engine based systems naturally center around searching, but researchers may not have a clear goal in mind, or the goal may be expressed in a query that a literature search engine cannot easily answer, such as identifying the most prominent authors in a given field of research. The discovery process can be… ▽ More

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: 21 pages; 5 figures

  3. arXiv:1607.06988  [pdf, other

    cs.LG stat.ML

    Interactive Learning from Multiple Noisy Labels

    Authors: Shankar Vembu, Sandra Zilles

    Abstract: Interactive learning is a process in which a machine learning algorithm is provided with meaningful, well-chosen examples as opposed to randomly chosen examples typical in standard supervised learning. In this paper, we propose a new method for interactive learning from multiple noisy labels where we exploit the disagreement among annotators to quantify the easiness (or meaningfulness) of an examp… ▽ More

    Submitted 23 July, 2016; originally announced July 2016.

  4. arXiv:1408.2552  [pdf, other

    q-bio.PE cs.LG stat.ML

    Comparing Nonparametric Bayesian Tree Priors for Clonal Reconstruction of Tumors

    Authors: Amit G. Deshwar, Shankar Vembu, Quaid Morris

    Abstract: Statistical machine learning methods, especially nonparametric Bayesian methods, have become increasingly popular to infer clonal population structure of tumors. Here we describe the treeCRP, an extension of the Chinese restaurant process (CRP), a popular construction used in nonparametric mixture models, to infer the phylogeny and genotype of major subclonal lineages represented in the population… ▽ More

    Submitted 11 August, 2014; originally announced August 2014.

    Comments: Preprint of an article submitted for consideration in the Pacific Symposium on Biocomputing \c{opyright} 2015; World Scientific Publishing Co., Singapore, 2015; http://psb.stanford.edu/

  5. arXiv:1406.7250  [pdf, other

    q-bio.PE cs.LG stat.ML

    Reconstructing subclonal composition and evolution from whole genome sequencing of tumors

    Authors: Amit G. Deshwar, Shankar Vembu, Christina K. Yung, Gun Ho Jang, Lincoln Stein, Quaid Morris

    Abstract: Tumors often contain multiple subpopulations of cancerous cells defined by distinct somatic mutations. We describe a new method, PhyloWGS, that can be applied to WGS data from one or more tumor samples to reconstruct complete genotypes of these subpopulations based on variant allele frequencies (VAFs) of point mutations and population frequencies of structural variations. We introduce a principled… ▽ More

    Submitted 6 January, 2015; v1 submitted 27 June, 2014; originally announced June 2014.

  6. arXiv:1210.3384  [pdf, other

    cs.LG q-bio.PE q-bio.QM stat.ML

    Inferring clonal evolution of tumors from single nucleotide somatic mutations

    Authors: Wei Jiao, Shankar Vembu, Amit G. Deshwar, Lincoln Stein, Quaid Morris

    Abstract: High-throughput sequencing allows the detection and quantification of frequencies of somatic single nucleotide variants (SNV) in heterogeneous tumor cell populations. In some cases, the evolutionary history and population frequency of the subclonal lineages of tumor cells present in the sample can be reconstructed from these SNV frequency measurements. However, automated methods to do this reconst… ▽ More

    Submitted 2 November, 2013; v1 submitted 11 October, 2012; originally announced October 2012.

  7. arXiv:1206.4661  [pdf

    cs.LG stat.ML

    Predicting accurate probabilities with a ranking loss

    Authors: Aditya Menon, Xiaoqian Jiang, Shankar Vembu, Charles Elkan, Lucila Ohno-Machado

    Abstract: In many real-world applications of machine learning classifiers, it is essential to predict the probability of an example belonging to a particular class. This paper proposes a simple technique for predicting probabilities based on optimizing a ranking loss, followed by isotonic regression. This semi-parametric technique offers both good ranking and regression performance, and models a richer set… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  8. arXiv:1205.2610  [pdf

    cs.LG

    Probabilistic Structured Predictors

    Authors: Shankar Vembu, Thomas Gartner, Mario Boley

    Abstract: We consider MAP estimators for structured prediction with exponential family models. In particular, we concentrate on the case that efficient algorithms for uniform sampling from the output space exist. We show that under this assumption (i) exact computation of the partition function remains a hard problem, and (ii) the partition function and the gradient of the log partition function can be appr… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009). arXiv admin note: substantial text overlap with arXiv:0912.4473

    Report number: UAI-P-2009-PG-557-564

  9. arXiv:0912.4473  [pdf, ps, other

    cs.LG cs.AI

    Learning to Predict Combinatorial Structures

    Authors: Shankar Vembu

    Abstract: The major challenge in designing a discriminative learning algorithm for predicting structured data is to address the computational issues arising from the exponential size of the output space. Existing algorithms make different assumptions to ensure efficient, polynomial time estimation of model parameters. For several combinatorial structures, including cycles, partially ordered sets, permutatio… ▽ More

    Submitted 26 June, 2010; v1 submitted 22 December, 2009; originally announced December 2009.

    Comments: PhD thesis, Department of Computer Science, University of Bonn (submitted, December 2009)