Skip to main content

Showing 1–7 of 7 results for author: Kushnir, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2208.00752  [pdf

    cs.CL cs.AI

    Data Collection and Analysis of French Dialects

    Authors: Omar Shaur Choudhry, Paul Omara Odida, Joshua Reiner, Keiron Appleyard, Danielle Kushnir, William Toon

    Abstract: This paper discusses creating and analysing a new dataset for data mining and text analytics research, contributing to a joint Leeds University research project for the Corpus of National Dialects. This report investigates machine learning classifiers to classify samples of French dialect text across various French-speaking countries. Following the steps of the CRISP-DM methodology, this report ex… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 4 pages plus 1 page for references. 4 figures including 1 image

  2. arXiv:2003.10339  [pdf, other

    cs.LG stat.ML

    Diffusion-based Deep Active Learning

    Authors: Dan Kushnir, Luca Venturi

    Abstract: The remarkable performance of deep neural networks depends on the availability of massive labeled data. To alleviate the load of data annotation, active deep learning aims to select a minimal set of training points to be labelled which yields maximal model accuracy. Most existing approaches implement either an `exploration'-type selection criterion, which aims at exploring the joint distribution o… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  3. arXiv:1801.05856  [pdf, other

    cs.SI cs.LG stat.ML

    Active Community Detection with Maximal Expected Model Change

    Authors: Dan Kushnir, Benjamin Mirabelli

    Abstract: We present a novel active learning algorithm for community detection on networks. Our proposed algorithm uses a Maximal Expected Model Change (MEMC) criterion for querying network nodes label assignments. MEMC detects nodes that maximally change the community assignment likelihood model following a query. Our method is inspired by detection in the benchmark Stochastic Block Model (SBM), where we p… ▽ More

    Submitted 20 March, 2020; v1 submitted 10 January, 2018; originally announced January 2018.

  4. arXiv:1712.07242  [pdf, other

    cs.LG

    Linear Time Clustering for High Dimensional Mixtures of Gaussian Clouds

    Authors: Dan Kushnir, Shirin Jalali, Iraj Saniee

    Abstract: Clustering mixtures of Gaussian distributions is a fundamental and challenging problem that is ubiquitous in various high-dimensional data processing tasks. While state-of-the-art work on learning Gaussian mixture models has focused primarily on improving separation bounds and their generalization to arbitrary classes of mixture models, less emphasis has been paid to practical computational effici… ▽ More

    Submitted 1 March, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

  5. Economic and Technological Complexity: A Model Study of Indicators of Knowledge-based Innovation Systems

    Authors: Inga Ivanova, Oivind Strand, Duncan Kushnir, Loet Leydesdorff

    Abstract: The Economic Complexity Index (ECI; Hidalgo & Hausmann, 2009) measures the complexity of national economies in terms of product groups. Analogously to ECI, a Patent Complexity Index (PatCI) can be developed on the basis of a matrix of nations versus patent classes. Using linear algebra, the three dimensions: countries, product groups, and patent classes can be combined into a measure of "Triple He… ▽ More

    Submitted 7 December, 2016; v1 submitted 7 February, 2016; originally announced February 2016.

    Journal ref: Technological Forecasting and Social Change 120 (July 2017) 77-89

  6. arXiv:1512.04214  [pdf

    cs.DL

    The Globalization of Academic Entrepreneurship? The Recent Growth (2009-2014) in University Patenting Decomposed

    Authors: Loet Leydesdorff, Henry Etzkowitz, Duncan Kushnir

    Abstract: The contribution of academia to US patents has become increasingly global. Following a pause, with a relatively flat rate, from 1998 to 2008, the long-term trend of university patenting rising as a share of all patenting has resumed, driven by the internationalization of academic entrepreneurship and the persistence of US university technology transfer. We disaggregate this recent growth in univer… ▽ More

    Submitted 14 December, 2015; originally announced December 2015.

  7. arXiv:1210.6456  [pdf

    cs.DL

    Interactive Overlay Maps for US Patent (USPTO) Data Based on International Patent Classifications (IPC)

    Authors: Loet Leydesdorff, Duncan Kushnir, Ismael Rafols

    Abstract: We report on the development of an interface to the US Patent and Trademark Office (USPTO) that allows for the map** of patent portfolios as overlays to basemaps constructed from citation relations among all patents contained in this database during the period 1976-2011. Both the interface and the data are in the public domain; the freeware programs VOSViewer and/or Pajek can be used for the vis… ▽ More

    Submitted 18 November, 2012; v1 submitted 24 October, 2012; originally announced October 2012.

    Comments: Scientometrics (forthcoming)