Showing 1–7 of 7 results for author: Kobren, A

  1. arXiv:2010.16336  [pdf, other]

    cs.LG cs.AI cs.CL

    Leveraging Extracted Model Adversaries for Improved Black Box Attacks

    Authors: Naveen Jafer Nizar, Ari Kobren

    Abstract: We present a method for adversarial input generation against black box models for reading comprehension based question answering. Our approach is composed of two steps. First, we approximate a victim black box model via model extraction (Krishna et al., 2020). Second, we use our own white box method to generate input perturbations that cause the approximate model to fail. These perturbed inputs ar…

    Submitted 2 November, 2020; v1 submitted 30 October, 2020; originally announced October 2020.

    Journal ref: Analyzing and interpreting neural networks for NLP, 2020

  2. arXiv:2001.00076  [pdf, other]

    cs.LG cs.DS stat.ML

    Scalable Hierarchical Clustering with Tree Grafting

    Authors: Nicholas Monath, Ari Kobren, Akshay Krishnamurthy, Michael Glass, Andrew McCallum

    Abstract: We introduce Grinch, a new algorithm for large-scale, non-greedy hierarchical clustering with general linkage functions that compute arbitrary similarity between two point sets. The key components of Grinch are its rotate and graft subroutines that efficiently reconfigure the hierarchy as new points arrive, supporting discovery of clusters with complex structure. Grinch is motivated by a new notio…

    Submitted 31 December, 2019; originally announced January 2020.

    Comments: 23 pages (appendix included), published at KDD 2019

  3. arXiv:1907.10165  [pdf, other]

    cs.LG cs.CL stat.ML

    Optimal Transport-based Alignment of Learned Character Representations for String Similarity

    Authors: Derek Tam, Nicholas Monath, Ari Kobren, Aaron Traylor, Rajarshi Das, Andrew McCallum

    Abstract: String similarity models are vital for record linkage, entity resolution, and search. In this work, we present STANCE, a learned model for computing the similarity of two strings. Our approach encodes the characters of each string, aligns the encodings using Sinkhorn Iteration (alignment is posed as an instance of optimal transport), and scores the alignment with a convolutional neural network. W…

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: ACL Long Paper

  4. arXiv:1906.07859  [pdf, other]

    cs.LG stat.ML

    Supervised Hierarchical Clustering with Exponential Linkage

    Authors: Nishant Yadav, Ari Kobren, Nicholas Monath, Andrew McCallum

    Abstract: In supervised clustering, standard techniques for learning a pairwise dissimilarity function often suffer from a discrepancy between the training and clustering objectives, leading to poor cluster quality. Rectifying this discrepancy necessitates matching the procedure for training the dissimilarity function to the clustering algorithm. In this paper, we introduce a method for training the dissimi…

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: Appears in ICML 2019

  5. arXiv:1905.12807  [pdf, other]

    cs.AI cs.LG stat.ML

    Constructing High Precision Knowledge Bases with Subjective and Factual Attributes

    Authors: Ari Kobren, Pablo Barrio, Oksana Yakhnenko, Johann Hibschman, Ian Langmore

    Abstract: Knowledge bases (KBs) are the backbone of many ubiquitous applications and are thus required to exhibit high precision. However, for KBs that store subjective attributes of entities, e.g., whether a movie is "kid friendly", simply estimating precision is complicated by the inherent ambiguity in measuring subjective phenomena. In this work, we develop a method for constructing KBs with tunable prec…

    Submitted 31 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Appears at KDD 2019 Applied Data Science Track, 19 pages

  6. arXiv:1905.11924  [pdf, other]

    cs.DS cs.DL

    Paper Matching with Local Fairness Constraints

    Authors: Ari Kobren, Barna Saha, Andrew McCallum

    Abstract: Automatically matching reviewers to papers is a crucial step of the peer review process for venues receiving thousands of submissions. Unfortunately, common paper matching algorithms often construct matchings suffering from two critical problems: (1) the group of reviewers assigned to a paper does not collectively possess sufficient expertise, and (2) reviewer workloads are highly skewed. In this pa…

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: Appears at KDD 2019 Research Track, 20 pages

  7. arXiv:1704.01858  [pdf, other]

    cs.LG stat.ML

    An Online Hierarchical Algorithm for Extreme Clustering

    Authors: Ari Kobren, Nicholas Monath, Akshay Krishnamurthy, Andrew McCallum

    Abstract: Many modern clustering methods scale well to a large number of data items, N, but not to a large number of clusters, K. This paper introduces PERCH, a new non-greedy algorithm for online hierarchical clustering that scales to both massive N and K, a problem setting we term extreme clustering. Our algorithm efficiently routes new data points to the leaves of an incrementally-built tree. Motivated b…

    Submitted 6 April, 2017; originally announced April 2017.

    Comments: 20 pages. Code available here: https://github.com/iesl/xcluster