Skip to main content

Showing 1–23 of 23 results for author: Studer, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2007.13242  [pdf, other

    cs.LG stat.ML

    WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic

    Authors: Renkun Ni, Hong-min Chu, Oscar Castañeda, **-yeh Chiang, Christoph Studer, Tom Goldstein

    Abstract: Low-resolution neural networks represent both weights and activations with few bits, drastically reducing the multiplication complexity. Nonetheless, these products are accumulated using high-resolution (typically 32-bit) additions, an operation that dominates the arithmetic complexity of inference when using extreme quantization (e.g., binary weights). To further optimize inference, we propose a… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

  2. Headless Horseman: Adversarial Attacks on Transfer Learning Models

    Authors: Ahmed Abdelkader, Michael J. Curry, Liam Fowl, Tom Goldstein, Avi Schwarzschild, Manli Shu, Christoph Studer, Chen Zhu

    Abstract: Transfer learning facilitates the training of task-specific classifiers using pre-trained models as feature extractors. We present a family of transferable adversarial attacks against such classifiers, generated without access to the classification head; we call these \emph{headless attacks}. We first demonstrate successful transfer attacks against a victim network using \textit{only} its feature… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

    Comments: 5 pages, 2 figures. Accepted in ICASSP 2020. Code available on https://github.com/zhuchen03/headless-attack.git

  3. arXiv:2003.06693  [pdf, other

    cs.CR cs.LG stat.ML

    Certified Defenses for Adversarial Patches

    Authors: **-Yeh Chiang, Renkun Ni, Ahmed Abdelkader, Chen Zhu, Christoph Studer, Tom Goldstein

    Abstract: Adversarial patch attacks are among one of the most practical threat models against real-world computer vision systems. This paper studies certified and empirical defenses against patch attacks. We begin with a set of experiments showing that most existing defenses, which work by pre-processing input images to mitigate adversarial patches, are easily broken by simple white-box adversaries. Motivat… ▽ More

    Submitted 25 September, 2020; v1 submitted 14 March, 2020; originally announced March 2020.

    Comments: International Conference on Learning Representations, ICLR 2020

  4. arXiv:2001.10509  [pdf, other

    cs.LG eess.SP stat.ML

    MSE-Optimal Neural Network Initialization via Layer Fusion

    Authors: Ramina Ghods, Andrew S. Lan, Tom Goldstein, Christoph Studer

    Abstract: Deep neural networks achieve state-of-the-art performance for a range of classification and inference tasks. However, the use of stochastic gradient descent combined with the nonconvexity of the underlying optimization problems renders parameter learning susceptible to initialization. To address this issue, a variety of methods that rely on random parameter initialization or knowledge distillation… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: Extended version of the CISS 2020 paper containing the proof for convolutional layers

  5. arXiv:1909.13355  [pdf, other

    cs.LG cs.IT eess.SP stat.ML

    Siamese Neural Networks for Wireless Positioning and Channel Charting

    Authors: Eric Lei, Oscar Castañeda, Olav Tirkkonen, Tom Goldstein, Christoph Studer

    Abstract: Neural networks have been proposed recently for positioning and channel charting of user equipments (UEs) in wireless systems. Both of these approaches process channel state information (CSI) that is acquired at a multi-antenna base-station in order to learn a function that maps CSI to location information. CSI-based positioning using deep neural networks requires a dataset that contains both CSI… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

    Comments: Presented at Allerton 2019; 8 pages

  6. arXiv:1908.02878  [pdf, other

    eess.SP cs.IT stat.ML

    Improving Channel Charting with Representation-Constrained Autoencoders

    Authors: Pengzhi Huang, Oscar Castañeda, Emre Gönültaş, Saïd Medjkouh, Olav Tirkkonen, Tom Goldstein, Christoph Studer

    Abstract: Channel charting (CC) has been proposed recently to enable logical positioning of user equipments (UEs) in the neighborhood of a multi-antenna base-station solely from channel-state information (CSI). CC relies on dimensionality reduction of high-dimensional CSI features in order to construct a channel chart that captures spatial and radio geometries so that UEs close in space are close in the cha… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: Presented at the 20th IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 2019

  7. arXiv:1905.08232  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Adversarially robust transfer learning

    Authors: Ali Shafahi, Parsa Saadatpanah, Chen Zhu, Amin Ghiasi, Christoph Studer, David Jacobs, Tom Goldstein

    Abstract: Transfer learning, in which a network is trained on one task and re-purposed on another, is often used to produce neural network classifiers when data is scarce or full-scale training is too costly. When the goal is to produce a model that is not only accurate but also adversarially robust, data scarcity and computational limitations become even more cumbersome. We consider robust transfer learnin… ▽ More

    Submitted 21 February, 2020; v1 submitted 20 May, 2019; originally announced May 2019.

  8. arXiv:1905.05897  [pdf, other

    stat.ML cs.CR cs.LG

    Transferable Clean-Label Poisoning Attacks on Deep Neural Nets

    Authors: Chen Zhu, W. Ronny Huang, Ali Shafahi, Hengduo Li, Gavin Taylor, Christoph Studer, Tom Goldstein

    Abstract: Clean-label poisoning attacks inject innocuous looking (and "correctly" labeled) poison images into training data, causing a model to misclassify a targeted image after being trained on this data. We consider transferable poisoning attacks that succeed without access to the victim network's outputs, architecture, or (in some cases) training data. To achieve this, we propose a new "polytope attack"… ▽ More

    Submitted 16 May, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted to ICML2019

  9. arXiv:1904.12843  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Adversarial Training for Free!

    Authors: Ali Shafahi, Mahyar Najibi, Amin Ghiasi, Zheng Xu, John Dickerson, Christoph Studer, Larry S. Davis, Gavin Taylor, Tom Goldstein

    Abstract: Adversarial training, in which a network is trained on adversarial examples, is one of the few defenses against adversarial attacks that withstands strong attacks. Unfortunately, the high cost of generating strong adversarial examples makes standard adversarial training impractical on large-scale problems like ImageNet. We present an algorithm that eliminates the overhead cost of generating advers… ▽ More

    Submitted 20 November, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

    Comments: Accepted to NeurIPS 2019

  10. arXiv:1809.02104  [pdf, other

    cs.LG cs.CV stat.ML

    Are adversarial examples inevitable?

    Authors: Ali Shafahi, W. Ronny Huang, Christoph Studer, Soheil Feizi, Tom Goldstein

    Abstract: A wide range of defenses have been proposed to harden neural networks against adversarial attacks. However, a pattern has emerged in which the majority of adversarial defenses are quickly broken by new attacks. Given the lack of success at generating robust defenses, we are led to ask a fundamental question: Are adversarial attacks inevitable? This paper analyzes adversarial examples from a theore… ▽ More

    Submitted 3 February, 2020; v1 submitted 6 September, 2018; originally announced September 2018.

    Journal ref: International Conference on Learning Representations, 2019. https://openreview.net/forum?id=r1lWUoA9FQ

  11. arXiv:1807.05247  [pdf, other

    cs.IT eess.SP stat.ML

    Channel Charting: Locating Users within the Radio Environment using Channel State Information

    Authors: Christoph Studer, Saïd Medjkouh, Emre Gönültaş, Tom Goldstein, Olav Tirkkonen

    Abstract: We propose channel charting (CC), a novel framework in which a multi-antenna network element learns a chart of the radio geometry in its surrounding area. The channel chart captures the local spatial geometry of the area so that points that are close in space will also be close in the channel chart and vice versa. CC works in a fully unsupervised manner, i.e., learning is only based on channel sta… ▽ More

    Submitted 21 August, 2018; v1 submitted 13 July, 2018; originally announced July 2018.

    Comments: To appear in IEEE Access

  12. arXiv:1806.03551  [pdf, other

    stat.ML cs.LG eess.SP

    An Estimation and Analysis Framework for the Rasch Model

    Authors: Andrew S. Lan, Mung Chiang, Christoph Studer

    Abstract: The Rasch model is widely used for item response analysis in applications ranging from recommender systems to psychology, education, and finance. While a number of estimators have been proposed for the Rasch model over the last decades, the available analytical performance guarantees are mostly asymptotic. This paper provides a framework that relies on a novel linear minimum mean-squared error (L-… ▽ More

    Submitted 9 June, 2018; originally announced June 2018.

    Comments: To be presented at ICML 2018

  13. arXiv:1806.03547  [pdf, other

    cs.IT eess.SP stat.ML

    Linear Spectral Estimators and an Application to Phase Retrieval

    Authors: Ramina Ghods, Andrew S. Lan, Tom Goldstein, Christoph Studer

    Abstract: Phase retrieval refers to the problem of recovering real- or complex-valued vectors from magnitude measurements. The best-known algorithms for this problem are iterative in nature and rely on so-called spectral initializers that provide accurate initialization vectors. We propose a novel class of estimators suitable for general nonlinear measurement systems, called linear spectral estimators (LSPE… ▽ More

    Submitted 9 June, 2018; originally announced June 2018.

    Comments: To appear at ICML 2018, extended version with supplementary material

  14. arXiv:1804.00792  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks

    Authors: Ali Shafahi, W. Ronny Huang, Mahyar Najibi, Octavian Suciu, Christoph Studer, Tudor Dumitras, Tom Goldstein

    Abstract: Data poisoning is an attack on machine learning models wherein the attacker adds examples to the training set to manipulate the behavior of the model at test time. This paper explores poisoning attacks on neural nets. The proposed attacks use "clean-labels"; they don't require the attacker to have any control over the labeling of training data. They are also targeted; they control the behavior of… ▽ More

    Submitted 10 November, 2018; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: Presented at the NIPS 2018 conference. 11 pages, 4 figures, with a supplementary section of 7 pages, 7 figures. First two authors contributed equally

  15. arXiv:1802.00430  [pdf, other

    stat.ML math.ST stat.ME

    Linearized Binary Regression

    Authors: Andrew S. Lan, Mung Chiang, Christoph Studer

    Abstract: Probit regression was first proposed by Bliss in 1934 to study mortality rates of insects. Since then, an extensive body of work has analyzed and used probit or related binary regression methods (such as logistic regression) in numerous applications and fields. This paper provides a fresh angle to such well-established binary regression methods. Concretely, we demonstrate that linearizing the prob… ▽ More

    Submitted 1 February, 2018; originally announced February 2018.

    Comments: To be presented at CISS (http://ee-ciss.princeton.edu/)

  16. arXiv:1712.09913  [pdf, other

    cs.LG cs.CV stat.ML

    Visualizing the Loss Landscape of Neural Nets

    Authors: Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer, Tom Goldstein

    Abstract: Neural network training relies on our ability to find "good" minimizers of highly non-convex loss functions. It is well-known that certain network architecture designs (e.g., skip connections) produce loss functions that train easier, and well-chosen training parameters (batch size, learning rate, optimizer) produce minimizers that generalize better. However, the reasons for these differences, and… ▽ More

    Submitted 7 November, 2018; v1 submitted 28 December, 2017; originally announced December 2017.

    Comments: NIPS 2018 (extended version, 10.5 pages), code is available at https://github.com/tomgoldstein/loss-landscape

  17. arXiv:1706.02379  [pdf, other

    cs.LG cs.CV stat.ML

    Training Quantized Nets: A Deeper Understanding

    Authors: Hao Li, Soham De, Zheng Xu, Christoph Studer, Hanan Samet, Tom Goldstein

    Abstract: Currently, deep neural networks are deployed on low-power portable devices by first training a full-precision model using powerful hardware, and then deriving a corresponding low-precision model for efficient inference on such systems. However, training models directly with coarsely quantized weights is a key step towards learning on embedded platforms that have limited computing resources, memory… ▽ More

    Submitted 13 November, 2017; v1 submitted 7 June, 2017; originally announced June 2017.

    Comments: NIPS 2017

  18. arXiv:1501.02844  [pdf, other

    stat.ML

    SPRITE: A Response Model For Multiple Choice Testing

    Authors: Ryan Ning, Andrew E. Waters, Christoph Studer, Richard G. Baraniuk

    Abstract: Item response theory (IRT) models for categorical response data are widely used in the analysis of educational data, computerized adaptive testing, and psychological surveys. However, most IRT models rely on both the assumption that categories are strictly ordered and the assumption that this ordering is known a priori. These assumptions are impractical in many real-world scenarios, such as multip… ▽ More

    Submitted 12 January, 2015; originally announced January 2015.

  19. arXiv:1412.5968  [pdf, ps, other

    stat.ML cs.LG

    Quantized Matrix Completion for Personalized Learning

    Authors: Andrew S. Lan, Christoph Studer, Richard G. Baraniuk

    Abstract: The recently proposed SPARse Factor Analysis (SPARFA) framework for personalized learning performs factor analysis on ordinal or binary-valued (e.g., correct/incorrect) graded learner responses to questions. The underlying factors are termed "concepts" (or knowledge components) and are used for learning analytics (LA), the estimation of learner concept-knowledge profiles, and for content analytics… ▽ More

    Submitted 18 December, 2014; originally announced December 2014.

    Journal ref: In Proc. 7th Intl. Conf. on Educational Data Mining, pages 280-283, July 2014

  20. arXiv:1412.5967  [pdf, ps, other

    stat.ML cs.LG

    Tag-Aware Ordinal Sparse Factor Analysis for Learning and Content Analytics

    Authors: Andrew S. Lan, Christoph Studer, Andrew E. Waters, Richard G. Baraniuk

    Abstract: Machine learning offers novel ways and means to design personalized learning systems wherein each student's educational experience is customized in real time depending on their background, learning goals, and performance to date. SPARse Factor Analysis (SPARFA) is a novel framework for machine learning-based learning analytics, which estimates a learner's knowledge of the concepts underlying a dom… ▽ More

    Submitted 18 December, 2014; originally announced December 2014.

    Journal ref: In Proc. 6th Intl. Conf. on Educational Data Mining, pages 90-97, July 2013

  21. arXiv:1312.5734  [pdf, ps, other

    stat.ML cs.LG math.OC stat.AP

    Time-varying Learning and Content Analytics via Sparse Factor Analysis

    Authors: Andrew S. Lan, Christoph Studer, Richard G. Baraniuk

    Abstract: We propose SPARFA-Trace, a new machine learning-based framework for time-varying learning and content analytics for education applications. We develop a novel message passing-based, blind, approximate Kalman filter for sparse factor analysis (SPARFA), that jointly (i) traces learner concept knowledge over time, (ii) analyzes learner concept knowledge state transitions (induced by interacting with… ▽ More

    Submitted 19 December, 2013; originally announced December 2013.

  22. arXiv:1305.1956  [pdf, ps, other

    stat.ML cs.LG

    Joint Topic Modeling and Factor Analysis of Textual Information and Graded Response Data

    Authors: Andrew S. Lan, Christoph Studer, Andrew E. Waters, Richard G. Baraniuk

    Abstract: Modern machine learning methods are critical to the development of large-scale personalized learning systems that cater directly to the needs of individual learners. The recently developed SPARse Factor Analysis (SPARFA) framework provides a new statistical model and algorithms for machine learning-based learning analytics, which estimate a learner's knowledge of the latent concepts underlying a d… ▽ More

    Submitted 9 May, 2013; v1 submitted 8 May, 2013; originally announced May 2013.

  23. arXiv:1303.5685  [pdf, ps, other

    stat.ML cs.LG math.OC stat.AP

    Sparse Factor Analysis for Learning and Content Analytics

    Authors: Andrew S. Lan, Andrew E. Waters, Christoph Studer, Richard G. Baraniuk

    Abstract: We develop a new model and algorithms for machine learning-based learning analytics, which estimate a learner's knowledge of the concepts underlying a domain, and content analytics, which estimate the relationships among a collection of questions and those concepts. Our model represents the probability that a learner provides the correct response to a question in terms of three factors: their unde… ▽ More

    Submitted 19 July, 2013; v1 submitted 22 March, 2013; originally announced March 2013.

    Journal ref: Journal of Machine Learning Research, vol. 15, pp. 1959-2008, June, 2014