Skip to main content

Showing 1–19 of 19 results for author: Nokleby, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2111.04579  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Information-Theoretic Bayes Risk Lower Bounds for Realizable Models

    Authors: Matthew Nokleby, Ahmad Beirami

    Abstract: We derive information-theoretic lower bounds on the Bayes risk and generalization error of realizable machine learning models. In particular, we employ an analysis in which the rate-distortion function of the model parameters bounds the required mutual information between the training samples and the model parameters in order to learn a model up to a Bayes risk constraint. For realizable models, w… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  2. arXiv:2006.05752  [pdf, ps, other

    cs.LG cs.DC math.OC stat.ML

    Anytime MiniBatch: Exploiting Stragglers in Online Distributed Optimization

    Authors: Nuwan Ferdinand, Haider Al-Lawati, Stark C. Draper, Matthew Nokleby

    Abstract: Distributed optimization is vital in solving large-scale machine learning problems. A widely-shared feature of distributed optimization techniques is the requirement that all nodes complete their assigned tasks in each computational epoch before the system can proceed to the next epoch. In such settings, slow nodes, called stragglers, can greatly slow progress. To mitigate the impact of stragglers… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: International Conference on Learning Representations (ICLR), May 2019, New Orleans, LA, USA

    Journal ref: Proc. of the 7th Int. Conf. on Learning Representations (ICLR), May 2019, New Orleans, LA, USA

  3. arXiv:2005.08854  [pdf, other

    cs.LG cs.DC eess.SP math.OC stat.ML

    Scaling-up Distributed Processing of Data Streams for Machine Learning

    Authors: Matthew Nokleby, Haroon Raja, Waheed U. Bajwa

    Abstract: Emerging applications of machine learning in numerous areas involve continuous gathering of and learning from streams of data. Real-time incorporation of streaming data into the learned models is essential for improved inference in these applications. Further, these applications often involve data that are either inherently gathered at geographically distributed entities or that are intentionally… ▽ More

    Submitted 31 August, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 45 pages, 9 figures; preprint of a journal paper published in Proceedings of the IEEE (Special Issue on Optimization for Data-driven Learning and Control)

    Journal ref: Proc. of the IEEE, vol. 108, no. 11, pp. 1984-2012, Nov. 2020

  4. arXiv:2004.07268  [pdf, other

    cs.CV cs.LG

    Learning Furniture Compatibility with Graph Neural Networks

    Authors: Luisa F. Polania, Mauricio Flores, Yiran Li, Matthew Nokleby

    Abstract: We propose a graph neural network (GNN) approach to the problem of predicting the stylistic compatibility of a set of furniture items from images. While most existing results are based on siamese networks which evaluate pairwise compatibility between items, the proposed GNN architecture exploits relational information among groups of items. We present two GNN models, both of which comprise a deep… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at CVPR Workshops

  5. arXiv:1903.07507  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    An Effective Label Noise Model for DNN Text Classification

    Authors: Ishan **dal, Daniel Pressel, Brian Lester, Matthew Nokleby

    Abstract: Because large, human-annotated datasets suffer from labeling errors, it is crucial to be able to train deep neural networks in the presence of label noise. While training image classification models with label noise have received much attention, training text classification models have not. In this paper, we propose an approach to training deep networks that is robust to label noise. This approach… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted at NAACL-HLT 2019 Main Conference Long paper

  6. arXiv:1811.04345  [pdf, other

    cs.LG cs.AI stat.ML

    Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining

    Authors: Ishan **dal, Zhiwei Qin, Xuewen Chen, Matthew Nokleby, Jie** Ye

    Abstract: In this paper, we develop a reinforcement learning (RL) based system to learn an effective policy for carpooling that maximizes transportation efficiency so that fewer cars are required to fulfill the given amount of trip demand. For this purpose, first, we develop a deep neural network model, called ST-NN (Spatio-Temporal Neural Network), to predict taxi trip time from the raw GPS trip data. Seco… ▽ More

    Submitted 10 November, 2018; originally announced November 2018.

    Comments: Accepted at IEEE International Conference on Big Data 2018. arXiv admin note: text overlap with arXiv:1710.04350

  7. arXiv:1810.11499  [pdf, other

    cs.IT cs.LG

    Information Bottleneck Methods for Distributed Learning

    Authors: Parinaz Farajiparvar, Ahmad Beirami, Matthew Nokleby

    Abstract: We study a distributed learning problem in which Alice sends a compressed distillation of a set of training data to Bob, who uses the distilled version to best solve an associated learning problem. We formalize this as a rate-distortion problem in which the training set is the source and Bob's cross-entropy loss is the distortion measure. We consider this problem for unsupervised learning for batc… ▽ More

    Submitted 26 October, 2018; originally announced October 2018.

  8. arXiv:1810.10957  [pdf, other

    cs.IT eess.SP stat.ML

    Tensor Matched Kronecker-Structured Subspace Detection for Missing Information

    Authors: Ishan **dal, Matthew Nokleby

    Abstract: We consider the problem of detecting whether a tensor signal having many missing entities lies within a given low dimensional Kronecker-Structured (KS) subspace. This is a matched subspace detection problem. Tensor matched subspace detection problem is more challenging because of the intertwined signal dimensions. We solve this problem by projecting the signal onto the Kronecker structured subspac… ▽ More

    Submitted 25 October, 2018; originally announced October 2018.

  9. arXiv:1802.08378  [pdf, other

    cs.IT

    Multi-Scale Spectrum Sensing in Dense Multi-Cell Cognitive Networks

    Authors: Nicolo Michelusi, Matthew Nokleby, Urbashi Mitra, Robert Calderbank

    Abstract: Multi-scale spectrum sensing is proposed to overcome the cost of full network state information on the spectrum occupancy of primary users (PUs) in dense multi-cell cognitive networks. Secondary users (SUs) estimate the local spectrum occupancies and aggregate them hierarchically to estimate spectrum occupancy at multiple spatial scales. Thus, SUs obtain fine-grained estimates of spectrum occupanc… ▽ More

    Submitted 6 December, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

    Comments: To appear on IEEE Transactions on Communications

  10. arXiv:1710.04350  [pdf, other

    stat.ML cs.LG

    A Unified Neural Network Approach for Estimating Travel Time and Distance for a Taxi Trip

    Authors: Ishan **dal, Tony, Qin, Xuewen Chen, Matthew Nokleby, Jie** Ye

    Abstract: In building intelligent transportation systems such as taxi or rideshare services, accurate prediction of travel time and distance is crucial for customer experience and resource management. Using the NYC taxi dataset, which contains taxi trips data collected from GPS-enabled taxis [23], this paper investigates the use of deep neural networks to jointly predict taxi trip time and distance. We prop… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

  11. arXiv:1705.03419  [pdf, other

    cs.CV cs.LG stat.ML

    Learning Deep Networks from Noisy Labels with Dropout Regularization

    Authors: Ishan **dal, Matthew Nokleby, Xuewen Chen

    Abstract: Large datasets often have unreliable labels-such as those obtained from Amazon's Mechanical Turk or social media platforms-and classifiers trained on mislabeled datasets often exhibit poor performance. We present a simple, effective technique for accounting for label noise when training deep neural networks. We augment a standard deep network with a softmax layer that models the label noise statis… ▽ More

    Submitted 9 May, 2017; originally announced May 2017.

    Comments: Published at 2016 IEEE 16th International Conference on Data Mining

  12. arXiv:1705.02556  [pdf, other

    cs.IT cs.LG stat.ML

    Classification and Representation via Separable Subspaces: Performance Limits and Algorithms

    Authors: Ishan **dal, Matthew Nokleby

    Abstract: We study the classification performance of Kronecker-structured models in two asymptotic regimes and developed an algorithm for separable, fast and compact K-S dictionary learning for better classification and representation of multidimensional signals by exploiting the structure in the signal. First, we study the classification performance in terms of diversity order and pairwise geometry of the… ▽ More

    Submitted 29 December, 2017; v1 submitted 6 May, 2017; originally announced May 2017.

    Comments: This paper is submitted to IEEE JSTSP Special Issue on Information-Theoretic Methods in Data Acquisition, Analysis, and Processing 2018

    Journal ref: IEEE Journal of Selected Topics in Signal Processing ( Volume: 12 , Issue: 5 , Oct. 2018 )

  13. Stochastic Optimization from Distributed, Streaming Data in Rate-limited Networks

    Authors: Matthew Nokleby, Waheed U. Bajwa

    Abstract: Motivated by machine learning applications in networks of sensors, internet-of-things (IoT) devices, and autonomous agents, we propose techniques for distributed stochastic convex learning from high-rate data streams. The setup involves a network of nodes---each one of which has a stream of data arriving at a constant rate---that solve a stochastic convex optimization problem by collaborating with… ▽ More

    Submitted 6 August, 2018; v1 submitted 25 April, 2017; originally announced April 2017.

    Comments: 16 pages, 6 figures; Accepted for publication in IEEE Transactions on Signal and Information Processing over Networks

    Journal ref: Published in IEEE Trans. Signal Inform. Proc. over Netw., vol. 5, no. 1, pp. 152-167, Mar. 2019

  14. arXiv:1702.07973  [pdf, other

    cs.IT

    Multi-scale Spectrum Sensing in Small-Cell mm-Wave Cognitive Wireless Networks

    Authors: Nicolo Michelusi, Matthew Nokleby, Urbashi Mitra, Robert Calderbank

    Abstract: In this paper, a multi-scale approach to spectrum sensing in cognitive cellular networks is proposed. In order to overcome the huge cost incurred in the acquisition of full network state information, a hierarchical scheme is proposed, based on which local state estimates are aggregated up the hierarchy to obtain aggregate state information at multiple scales, which are then sent back to each cell… ▽ More

    Submitted 25 February, 2017; originally announced February 2017.

    Comments: To appear on ICC 2017

  15. arXiv:1608.01267  [pdf, ps, other

    cs.IT

    Low-Dimensional Sha** for High-Dimensional Lattice Codes

    Authors: Nuwan S. Ferdinand, Brian M. Kurkoski, Matthew Nokleby, Behnaam Aazhang

    Abstract: We propose two low-complexity lattice code constructions that have competitive coding and sha** gains. The first construction, named systematic Voronoi sha**, maps short blocks of integers to the dithered Voronoi integers, which are dithered integers that are uniformly distributed over the Voronoi region of a low-dimensional sha** lattice. Then, these dithered Voronoi integers are encoded us… ▽ More

    Submitted 3 August, 2016; originally announced August 2016.

    Comments: 13 pages

  16. arXiv:1605.02268  [pdf, other

    cs.IT cs.LG stat.ML

    Rate-Distortion Bounds on Bayes Risk in Supervised Learning

    Authors: Matthew Nokleby, Ahmad Beirami, Robert Calderbank

    Abstract: We present an information-theoretic framework for bounding the number of labeled samples needed to train a classifier in a parametric Bayesian setting. We derive bounds on the average $L_p$ distance between the learned classifier and the true maximum a posteriori classifier, which are well-established surrogates for the excess classification error due to imperfect learning. We provide lower and up… ▽ More

    Submitted 17 November, 2017; v1 submitted 7 May, 2016; originally announced May 2016.

    Comments: Revised submission to IEEE Transactions on Information Theory

  17. arXiv:1404.5187  [pdf, other

    cs.IT

    Discrimination on the Grassmann Manifold: Fundamental Limits of Subspace Classifiers

    Authors: Matthew Nokleby, Miguel Rodrigues, Robert Calderbank

    Abstract: We present fundamental limits on the reliable classification of linear and affine subspaces from noisy, linear features. Drawing an analogy between discrimination among subspaces and communication over vector wireless channels, we propose two Shannon-inspired measures to characterize asymptotic classifier performance. First, we define the classification capacity, which characterizes necessary and… ▽ More

    Submitted 10 December, 2014; v1 submitted 21 April, 2014; originally announced April 2014.

    Comments: 19 pages, 4 figures. Revised submission to IEEE Transactions on Information Theory

  18. Toward Resource-Optimal Consensus over the Wireless Medium

    Authors: Matthew Nokleby, Waheed U. Bajwa, Robert Calderbank, Behnaam Aazhang

    Abstract: We carry out a comprehensive study of the resource cost of averaging consensus in wireless networks. Most previous approaches suppose a graphical network, which abstracts away crucial features of the wireless medium, and measure resource consumption only in terms of the total number of transmissions required to achieve consensus. Under a path-loss dominated model, we study the resource requirement… ▽ More

    Submitted 11 February, 2013; v1 submitted 15 August, 2012; originally announced August 2012.

    Comments: 12 pages, 3 figures, to appear in IEEE Journal Selected Topics in Signal Processing, April 2013

    Journal ref: IEEE J. Select. Topics Signal Processing, vol. 7, no. 2, pp. 284-295, Apr. 2013

  19. arXiv:1203.0695  [pdf, ps, other

    cs.IT

    Cooperative Compute-and-Forward

    Authors: Matthew Nokleby, Behnaam Aazhang

    Abstract: We examine the benefits of user cooperation under compute-and-forward. Much like in network coding, receivers in a compute-and-forward network recover finite-field linear combinations of transmitters' messages. Recovery is enabled by linear codes: transmitters map messages to a linear codebook, and receivers attempt to decode the incoming superposition of signals to an integer combination of codew… ▽ More

    Submitted 3 March, 2012; originally announced March 2012.

    Comments: submitted to IEEE Transactions on Information Theory