Showing 1–50 of 50 results for author: Lei, Q

  1. arXiv:2406.19617  [pdf, ps, other]

    cs.LG cs.IT math.OC

    Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity

    Authors: Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

    Abstract: Optimization of convex functions under stochastic zeroth-order feedback has been a major and challenging question in online learning. In this work, we consider the problem of optimizing second-order smooth and strongly convex functions where the algorithm is only accessible to noisy evaluations of the objective function it queries. We provide the first tight characterization for the rate of the mi…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2403.09164  [pdf, other]

    cs.CL stat.AP

    Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge

    Authors: Li Yizhen, Huang Shaohan, Qi Jiaxing, Quan Lei, Han Dongran, Luan Zhongzhi

    Abstract: No previous work has studied the performance of Large Language Models (LLMs) in the context of Traditional Chinese Medicine (TCM), an essential and distinct branch of medical knowledge with a rich history. To bridge this gap, we present a TCM question dataset named TCM-QA, which comprises three question types: single choice, multiple choice, and true or false, to examine the LLM's capacity for kno…

    Submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2403.06424  [pdf, other]

    stat.ML cs.CV cs.LG

    Bridging Domains with Approximately Shared Features

    Authors: Ziliang Samuel Zhong, Xiang Pan, Qi Lei

    Abstract: Multi-source domain adaptation aims to reduce performance degradation when applying machine learning models to unseen domains. A fundamental challenge is devising the optimal strategy for feature selection. Existing literature is somewhat paradoxical: some advocate for learning invariant features from source domains, while others favor more diverse features. To address the challenge, we propose a…

    Submitted 11 March, 2024; originally announced March 2024.

  4. arXiv:2403.02695  [pdf, other]

    cs.LG

    Controllable Prompt Tuning For Balancing Group Distributional Robustness

    Authors: Hoang Phan, Andrew Gordon Wilson, Qi Lei

    Abstract: Models trained on data composed of different groups or domains can suffer from severe performance degradation under distribution shifts. While recent methods have largely focused on optimizing the worst-group objective, this often comes at the expense of good performance on other groups. To address this problem, we introduce an optimization scheme to achieve good performance across groups and find…

    Submitted 4 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning

  5. arXiv:2402.09478  [pdf, other]

    cs.CR cs.LG

    Data Reconstruction Attacks and Defenses: A Systematic Evaluation

    Authors: Sheng Liu, Zihan Wang, Yuxiao Chen, Qi Lei

    Abstract: Reconstruction attacks and defenses are essential in understanding the data leakage problem in machine learning. However, prior work has centered around empirical observations of gradient inversion attacks, lacks theoretical justifications, and cannot disentangle the usefulness of defending methods from the computational limitation of attacking methods. In this work, we propose to view the problem…

    Submitted 27 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2401.15530  [pdf, ps, other]

    cs.LG cs.IT

    An Information-Theoretic Analysis of In-Context Learning

    Authors: Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

    Abstract: Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustra…

    Submitted 27 January, 2024; originally announced January 2024.

  7. arXiv:2312.10586  [pdf, other]

    cs.CV

    Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI

    Authors: Qinqian Lei, Bo Wang, Robby T. Tan

    Abstract: Detecting human-object interactions (HOI) in a few-shot setting remains a challenge. Existing meta-learning methods struggle to extract representative features for classification due to the limited data, while existing few-shot HOI models rely on HOI text labels for classification. Moreover, some query images may display visual similarity to those outside their class, such as similar backgrounds b…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 9 pages, 4 figures

  8. arXiv:2312.05720  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR

    Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning

    Authors: Jianwei Li, Sheng Liu, Qi Lei

    Abstract: Language models trained via federated learning (FL) demonstrate impressive capabilities in handling complex tasks while protecting user privacy. Recent studies indicate that leveraging gradient information and prior knowledge can potentially reveal training samples within FL setting. However, these investigations have overlooked the potential privacy risks tied to the intrinsic architecture of the…

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

  9. arXiv:2310.13191  [pdf, other]

    cs.CL cs.AI

    Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models

    Authors: Jianwei Li, Qi Lei, Wei Cheng, Dongkuan Xu

    Abstract: The pruning objective has recently extended beyond accuracy and sparsity to robustness in language models. Despite this, existing methods struggle to enhance robustness against adversarial attacks when continually increasing model sparsity and require a retraining process. As humans step into the era of large language models, these issues become increasingly prominent. This paper proposes that the…

    Submitted 10 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  10. arXiv:2310.13183  [pdf, other]

    cs.CV cs.CL

    Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection

    Authors: Jianwei Li, Weizhi Gao, Qi Lei, Dongkuan Xu

    Abstract: It is widely acknowledged that large and sparse models have higher accuracy than small and dense models under the same model size constraints. This motivates us to train a large model and then remove its redundant neurons or weights by pruning. Most existing works pruned the networks in a deterministic way, the performance of which solely depends on a single pruning criterion and thus lacks variet…

    Submitted 10 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  11. arXiv:2307.11030  [pdf, other]

    stat.ML cs.LG

    Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering

    Authors: Yijun Dong, Kevin Miller, Qi Lei, Rachel Ward

    Abstract: Despite the empirical success and practical significance of (relational) knowledge distillation that matches (the relations of) features between teacher and student models, the corresponding theoretical interpretations remain limited for various knowledge distillation paradigms. In this work, we take an initial step toward a theoretical understanding of relational knowledge distillation (RKD), wit…

    Submitted 23 October, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  12. arXiv:2306.12383  [pdf, ps, other]

    cs.LG stat.ML

    Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms

    Authors: Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

    Abstract: In stochastic zeroth-order optimization, a problem of practical relevance is understanding how to fully exploit the local geometry of the underlying objective function. We consider a fundamental setting in which the objective function is quadratic, and provide the first tight characterization of the optimal Hessian-dependent sample complexity. Our contribution is twofold. First, from an informatio…

    Submitted 25 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  13. arXiv:2212.03714  [pdf, other]

    cs.LG cs.CR stat.ML

    Reconstructing Training Data from Model Gradient, Provably

    Authors: Zihan Wang, Jason D. Lee, Qi Lei

    Abstract: Understanding when and how much a model gradient leaks information about the training sample is an important question in privacy. In this paper, we present a surprising result: even without training or memorizing the data, we can fully reconstruct the training samples from a single gradient query at a randomly chosen parameter value. We prove the identifiability of the training data under mild con…

    Submitted 10 June, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  14. arXiv:2210.13983  [pdf, other]

    cs.LG

    Optimization for Amortized Inverse Problems

    Authors: Tianci Liu, Tong Yang, Quan Zhang, Qi Lei

    Abstract: Incorporating a deep generative model as the prior distribution in inverse problems has established substantial success in reconstructing images from corrupted observations. Notwithstanding, the existing optimization approaches use gradient descent largely without adapting to the non-convex nature of the problem and can be sensitive to initial values, impeding further performance improvement. In t…

    Submitted 28 January, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  15. arXiv:2209.14434  [pdf, other]

    cs.CV cs.AI

    Efficient Medical Image Assessment via Self-supervised Learning

    Authors: Chun-Yin Huang, Qi Lei, Xiaoxiao Li

    Abstract: High-performance deep learning methods typically rely on large annotated training datasets, which are difficult to obtain in many clinical applications due to the high cost of medical image labeling. Existing data assessment methods commonly require knowing the labels in advance, which are not feasible to achieve our goal of 'knowing which data to label.' To this end, we formulate and propose a no…

    Submitted 28 September, 2022; originally announced September 2022.

  16. arXiv:2205.05236  [pdf, other]

    cs.SI cs.DB

    Reconnecting the Estranged Relationships: Optimizing the Influence Propagation in Evolving Networks

    Authors: Taotao Cai, Qi Lei, Quan Z. Sheng, Shuiqiao Yang, Jian Yang, Wei Emma Zhang

    Abstract: Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, has recently received significant attention for mass communication and commercial marketing. Existing research efforts dedicated to the IM problem depend on a strong assumption: the selected seed users are willing to spread the information after receiving bene…

    Submitted 10 May, 2022; originally announced May 2022.

  17. arXiv:2203.15664  [pdf, other]

    cs.LG stat.ML

    Nearly Minimax Algorithms for Linear Bandits with Shared Representation

    Authors: Jiaqi Yang, Qi Lei, Jason D. Lee, Simon S. Du

    Abstract: We give novel algorithms for multi-task and lifelong linear bandits with shared representation. Specifically, we consider the setting where we play $M$ linear bandits with dimension $d$, each for $T$ rounds, and these $M$ bandit tasks share a common $k(\ll d)$ dimensional linear representation. For both the multi-task setting where we play the tasks concurrently, and the lifelong setting where we…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 19 pages, 3 figures

  18. arXiv:2202.12230  [pdf, other]

    cs.LG

    Sample Efficiency of Data Augmentation Consistency Regularization

    Authors: Shuo Yang, Yijun Dong, Rachel Ward, Inderjit S. Dhillon, Sujay Sanghavi, Qi Lei

    Abstract: Data augmentation is popular in the training of large neural networks; currently, however, there is no clear theoretical comparison between different algorithmic choices on how to use augmented data. In this paper, we take a step in this direction - we first present a simple and novel analysis for linear regression with label invariant augmentations, demonstrating that data augmentation consistenc…

    Submitted 16 June, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  19. arXiv:2201.09020  [pdf, other]

    cs.LG cs.CY

    Bi-CLKT: Bi-Graph Contrastive Learning based Knowledge Tracing

    Authors: Xiangyu Song, Jianxin Li, Qi Lei, Wei Zhao, Yunliang Chen, Ajmal Mian

    Abstract: The goal of Knowledge Tracing (KT) is to estimate how well students have mastered a concept based on their historical learning of related exercises. The benefit of knowledge tracing is that students' learning plans can be better organised and adjusted, and interventions can be made when necessary. With the recent rise of deep learning, Deep Knowledge Tracing (DKT) has utilised Recurrent Neural Net…

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 12 pages, 2 figures

  20. Origami-inspired soft twisting actuator

    Authors: Diancheng Li, Dongliang Fan, Renjie Zhu, Qiaozhi Lei, Yuxuan Liao, Xin Yang, Yang Pan, Zheng Wang, Yang Wu, Sicong Liu, Hongqiang Wang

    Abstract: Soft actuators have shown great advantages in compliance and morphology matched for manipulation of delicate objects and inspection in a confined space. There is an unmet need for a soft actuator that can provide torsional motion to e.g. enlarge working space and increase degrees of freedom. Towards this goal, we present origami-inspired soft pneumatic actuators (OSPAs) made from silicone. The pro…

    Submitted 2 November, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 9 figures. Soft Robotics (2022)

  21. arXiv:2110.09507  [pdf, other]

    cs.LG stat.ML

    Provable Hierarchy-Based Meta-Reinforcement Learning

    Authors: Kurtland Chua, Qi Lei, Jason D. Lee

    Abstract: Hierarchical reinforcement learning (HRL) has seen widespread interest as an approach to tractable learning of complex modular behaviors. However, existing work either assume access to expert-constructed hierarchies, or use hierarchy-learning heuristics with no provable guarantees. To address this gap, we analyze HRL in the meta-RL setting, where a learner learns latent hierarchical structure duri…

    Submitted 18 October, 2021; originally announced October 2021.

  22. arXiv:2107.06466  [pdf, other]

    cs.LG stat.ML

    Going Beyond Linear RL: Sample Efficient Neural Function Approximation

    Authors: Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

    Abstract: Deep Reinforcement Learning (RL) powered by neural net approximation of the Q function has had enormous empirical success. While the theory of RL has traditionally focused on linear function approximation (or eluder dimension) approaches, little is known about nonlinear RL with neural net approximations of the Q functions. This is the focus of this work, where we study function approximation with…

    Submitted 25 December, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  23. arXiv:2107.04518  [pdf, ps, other]

    cs.LG stat.ML

    Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

    Authors: Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

    Abstract: Bandit problems with linear or concave reward have been extensively studied, but relatively few works have studied bandits with non-concave reward. This work considers a large family of bandit problems where the unknown underlying reward function is non-concave, including the low-rank generalized linear bandit problems and two-layer neural network with polynomial activation bandit problem. For the…

    Submitted 9 July, 2021; originally announced July 2021.

  24. arXiv:2107.02377  [pdf, ps, other]

    cs.LG cs.AI math.OC stat.ML

    A Short Note on the Relationship of Information Gain and Eluder Dimension

    Authors: Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei

    Abstract: Eluder dimension and information gain are two widely used methods of complexity measures in bandit and reinforcement learning. Eluder dimension was originally proposed as a general complexity measure of function classes, but the common examples of where it is known to be small are function spaces (vector spaces). In these cases, the primary tool to upper bound the eluder dimension is the elliptic…

    Submitted 6 July, 2021; originally announced July 2021.

  25. arXiv:2106.12108  [pdf, other]

    cs.LG stat.ML

    Near-Optimal Linear Regression under Distribution Shift

    Authors: Qi Lei, Wei Hu, Jason D. Lee

    Abstract: Transfer learning is essential when sufficient data comes from the source domain, with scarce labeled data from the target domain. We develop estimators that achieve minimax linear risk for linear regression problems under distribution shift. Our algorithms cover different transfer learning settings including covariate shift and model shift. We also consider when data are generated from either lin…

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  26. arXiv:2105.02221  [pdf, other]

    cs.LG stat.ML

    How Fine-Tuning Allows for Effective Meta-Learning

    Authors: Kurtland Chua, Qi Lei, Jason D. Lee

    Abstract: Representation learning has been widely studied in the context of meta-learning, enabling rapid learning of new tasks through shared representations. Recent works such as MAML have explored using fine-tuning-based metrics, which measure the ease by which fine-tuning can achieve good performance, as proxies for obtaining representations. We present a theoretical framework for analyzing representati…

    Submitted 5 May, 2021; originally announced May 2021.

  27. arXiv:2102.11203  [pdf, other]

    cs.LG cs.AI stat.ML

    A Theory of Label Propagation for Subpopulation Shift

    Authors: Tianle Cai, Ruiqi Gao, Jason D. Lee, Qi Lei

    Abstract: One of the central problems in machine learning is domain adaptation. Unlike past theoretical work, we consider a new model for subpopulation shift in the input or representation space. In this work, we propose a provably effective framework for domain adaptation based on label propagation. In our analysis, we use a simple but realistic expansion assumption, proposed in \citet{wei2021theoretical}.…

    Submitted 19 July, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  28. arXiv:2010.05263  [pdf, other]

    cs.LG stat.ML

    Fast Convergence of Langevin Dynamics on Manifold: Geodesics meet Log-Sobolev

    Authors: Xiao Wang, Qi Lei, Ioannis Panageas

    Abstract: Sampling is a fundamental and arguably very important task with numerous applications in Machine Learning. One approach to sample from a high dimensional distribution $e^{-f}$ for some function $f$ is the Langevin Algorithm (LA). Recently, there has been a lot of progress in showing fast convergence of LA even in cases where $f$ is non-convex, notably [53], [39] in which the former paper focuses o…

    Submitted 6 December, 2020; v1 submitted 11 October, 2020; originally announced October 2020.

  29. arXiv:2008.01064  [pdf, other]

    cs.LG stat.ML

    Predicting What You Already Know Helps: Provable Self-Supervised Learning

    Authors: Jason D. Lee, Qi Lei, Nikunj Saunshi, Jiacheng Zhuo

    Abstract: Self-supervised representation learning solves auxiliary prediction tasks (known as pretext tasks) without requiring labeled data to learn useful semantic representations. These pretext tasks are created solely using the input features, such as predicting a missing image patch, recovering the color channels of an image from context, or predicting missing words in text; yet predicting this \textit{…

    Submitted 13 November, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: NeurIPS 2021

  30. arXiv:2007.07244  [pdf, ps, other]

    cs.SD cs.MM eess.AS

    Transformer-XL Based Music Generation with Multiple Sequences of Time-valued Notes

    Authors: Xianchao Wu, Chengyuan Wang, Qinying Lei

    Abstract: Current state-of-the-art AI based classical music creation algorithms such as Music Transformer are trained by employing single sequence of notes with time-shifts. The major drawback of absolute time interval expression is the difficulty of similarity computing of notes that share the same note value yet different tempos, in one or among MIDI files. In addition, the usage of single sequence restri…

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: 9 pages, 7 figures

  31. arXiv:2003.10392  [pdf, other]

    cs.LG stat.ML

    Steepest Descent Neural Architecture Optimization: Escaping Local Optimum with Signed Neural Splitting

    Authors: Lemeng Wu, Mao Ye, Qi Lei, Jason D. Lee, Qiang Liu

    Abstract: Developing efficient and principled neural architecture optimization methods is a critical challenge of modern deep learning. Recently, Liu et al.[19] proposed a splitting steepest descent (S2D) method that jointly optimizes the neural parameters and architectures based on progressively growing network structures by splitting neurons into multiple copies in a steepest descent fashion. However, S2D…

    Submitted 20 June, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

  32. arXiv:2003.08089  [pdf, other]

    cs.LG cs.IT stat.ML

    Solving Inverse Problems with a Flow-based Noise Model

    Authors: Jay Whang, Qi Lei, Alexandros G. Dimakis

    Abstract: We study image inverse problems with a normalizing flow prior. Our formulation views the solution as the maximum a posteriori estimate of the image conditioned on the measurements. This formulation allows us to use noise models with arbitrary dependencies as well as non-linear forward operators. We empirically validate the efficacy of our method on various inverse problems, including compressed se…

    Submitted 1 July, 2021; v1 submitted 18 March, 2020; originally announced March 2020.

  33. arXiv:2002.09434  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Few-Shot Learning via Learning the Representation, Provably

    Authors: Simon S. Du, Wei Hu, Sham M. Kakade, Jason D. Lee, Qi Lei

    Abstract: This paper studies few-shot learning via representation learning, where one uses $T$ source tasks with $n_1$ data per task to learn a representation in order to reduce the sample complexity of a target task for which there is only $n_2 (\ll n_1)$ data. Specifically, we focus on the setting where there exists a good \emph{common representation} between source and target, and our goal is to understa…

    Submitted 30 March, 2021; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: ICLR2021

  34. arXiv:2002.06789  [pdf, other]

    cs.LG stat.ML

    CAT: Customized Adversarial Training for Improved Robustness

    Authors: Minhao Cheng, Qi Lei, Pin-Yu Chen, Inderjit Dhillon, Cho-Jui Hsieh

    Abstract: Adversarial training has become one of the most effective methods for improving robustness of neural networks. However, it often suffers from poor generalization on both clean and perturbed data. In this paper, we propose a new algorithm, named Customized Adversarial Training (CAT), which adaptively customizes the perturbation level and the corresponding label for each training sample in adversari…

    Submitted 17 February, 2020; originally announced February 2020.

  35. arXiv:2002.06768  [pdf, other]

    cs.LG cs.GT stat.ML

    Last iterate convergence in no-regret learning: constrained min-max optimization for convex-concave landscapes

    Authors: Qi Lei, Sai Ganesh Nagarajan, Ioannis Panageas, Xiao Wang

    Abstract: In a recent series of papers it has been established that variants of Gradient Descent/Ascent and Mirror Descent exhibit last iterate convergence in convex-concave zero-sum games. Specifically, \cite{DISZ17, LiangS18} show last iterate convergence of the so called "Optimistic Gradient Descent/Ascent" for the case of \textit{unconstrained} min-max optimization. Moreover, in \cite{Metal} the authors…

    Submitted 21 February, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

  36. arXiv:2001.01279   

    eess.IV cs.CV cs.LG

    Deep Transfer Convolutional Neural Network and Extreme Learning Machine for Lung Nodule Diagnosis on CT images

    Authors: Xufeng Huang, Qiang Lei, Tingli Xie, Yahui Zhang, Zhen Hu, Qi Zhou

    Abstract: Some content of the article needs to be kept secret

    Submitted 28 April, 2020; v1 submitted 5 January, 2020; originally announced January 2020.

    Comments: Some content of the article needs to be kept secret

    Journal ref: Knowledge-Based Systems (2020) 106230

  37. arXiv:1910.07703  [pdf, other]

    cs.LG cs.DC math.NA stat.ML

    Communication-Efficient Asynchronous Stochastic Frank-Wolfe over Nuclear-norm Balls

    Authors: Jiacheng Zhuo, Qi Lei, Alexandros G. Dimakis, Constantine Caramanis

    Abstract: Large-scale machine learning training suffers from two prior challenges, specifically for nuclear-norm constrained problems with distributed systems: the synchronization slowdown due to the straggling workers, and high communication costs. In this work, we propose an asynchronous Stochastic Frank Wolfe (SFW-asyn) method, which, for the first time, solves the two problems simultaneously, while succ…

    Submitted 17 October, 2019; originally announced October 2019.

  38. arXiv:1910.07030  [pdf, other]

    cs.LG stat.ML

    SGD Learns One-Layer Networks in WGANs

    Authors: Qi Lei, Jason D. Lee, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: Generative adversarial networks (GANs) are a widely used framework for learning generative models. Wasserstein GANs (WGANs), one of the most successful variants of GANs, require solving a minmax optimization problem to global optimality, but are in practice successfully trained using stochastic gradient descent-ascent. In this paper, we show that, when the generator is a one-layer network, stochas…

    Submitted 1 July, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

    Comments: 24 pages, 4 figures, ICML2020

  39. arXiv:1906.07437  [pdf, other]

    cs.LG stat.ML

    Inverting Deep Generative models, One layer at a time

    Authors: Qi Lei, Ajil Jalal, Inderjit S. Dhillon, Alexandros G. Dimakis

    Abstract: We study the problem of inverting a deep generative model with ReLU activations. Inversion corresponds to finding a latent code vector that explains observed measurements as much as possible. In most prior works this is performed by attempting to solve a non-convex optimization problem involving the generator. In this paper we obtain several novel theoretical results for the inversion problem. W…

    Submitted 19 June, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

  40. arXiv:1906.02436  [pdf, other]

    cs.LG math.OC stat.ML

    Primal-Dual Block Frank-Wolfe

    Authors: Qi Lei, Jiacheng Zhuo, Constantine Caramanis, Inderjit S. Dhillon, Alexandros G. Dimakis

    Abstract: We propose a variant of the Frank-Wolfe algorithm for solving a class of sparse/low-rank optimization problems. Our formulation includes Elastic Net, regularized SVMs and phase retrieval as special cases. The proposed Primal-Dual Block Frank-Wolfe algorithm reduces the per-iteration cost while maintaining linear convergence rate. The per iteration cost of our method depends on the structural compl…

    Submitted 6 June, 2019; originally announced June 2019.

  41. arXiv:1903.04709  [pdf, other]

    cs.DC cs.NI

    Service Capacity Enhanced Task Offloading and Resource Allocation in Multi-Server Edge Computing Environment

    Authors: Wei Du, Tao Lei, Qiang He, Wei Liu, Qiwang Lei, Hailiang Zhao, Wei Wang

    Abstract: An edge computing environment features multiple edge servers and multiple service clients. In this environment, mobile service providers can offload client-side computation tasks from service clients' devices onto edge servers to reduce service latency and power consumption experienced by the clients. A critical issue that has yet to be properly addressed is how to allocate edge computing resource…

    Submitted 11 March, 2019; originally announced March 2019.

    Comments: This paper has been accepted by Early Submission Phase of ICWS2019

  42. arXiv:1812.00151  [pdf, other]

    cs.LG cs.CR math.OC stat.ML

    Discrete Adversarial Attacks and Submodular Optimization with Applications to Text Classification

    Authors: Qi Lei, Lingfei Wu, Pin-Yu Chen, Alexandros G. Dimakis, Inderjit S. Dhillon, Michael Witbrock

    Abstract: Adversarial examples are carefully constructed modifications to an input that completely change the output of a classifier but are imperceptible to humans. Despite these successful attacks for continuous data (such as image and audio samples), generating adversarial examples for discrete structures such as text has proven significantly more challenging. In this paper we formulate the attacks with…

    Submitted 4 April, 2019; v1 submitted 1 December, 2018; originally announced December 2018.

    Comments: In SysML 2019

  43. arXiv:1809.05259  [pdf, other]

    cs.LG stat.ML

    Random Warping Series: A Random Features Method for Time-Series Embedding

    Authors: Lingfei Wu, Ian En-Hsu Yen, Jinfeng Yi, Fangli Xu, Qi Lei, Michael Witbrock

    Abstract: Time series data analytics has been a problem of substantial interests for decades, and Dynamic Time Warping (DTW) has been the most widely adopted technique to measure dissimilarity between time series. A number of global-alignment kernels have since been proposed in the spirit of DTW to extend its use to kernel-based estimation method such as support vector machine. However, those kernels suffer…

    Submitted 14 September, 2018; originally announced September 2018.

    Comments: AIStats18, Oral Paper, Add code link for generating RWS

  44. arXiv:1803.09327  [pdf, other]

    cs.LG stat.ML

    Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization

    Authors: Jiong Zhang, Qi Lei, Inderjit S. Dhillon

    Abstract: Vanishing and exploding gradients are two of the main obstacles in training deep neural networks, especially in capturing long range dependencies in recurrent neural networks~(RNNs). In this paper, we present an efficient parametrization of the transition matrix of an RNN that allows us to stabilize the gradients that arise in its training. Specifically, we parameterize the transition matrix by it…

    Submitted 25 March, 2018; originally announced March 2018.

    Comments: main text 13 pages, 22 pages including reference and appendix

  45. arXiv:1802.08241  [pdf, other]

    cs.CV cs.LG stat.ML

    Hessian-based Analysis of Large Batch Training and Robustness to Adversaries

    Authors: Zhewei Yao, Amir Gholami, Qi Lei, Kurt Keutzer, Michael W. Mahoney

    Abstract: Large batch size training of Neural Networks has been shown to incur accuracy loss when trained with the current methods. The exact underlying reasons for this are still not completely understood. Here, we study large batch size training through the lens of the Hessian operator and robust optimization. In particular, we perform a Hessian based study to analyze exactly how the landscape of the loss…

    Submitted 2 December, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

    Comments: Presented in NeurIPS'18 conference

    Journal ref: NeurIPS 2018

  46. arXiv:1702.06362  [pdf, other]

    cs.LG

    Negative-Unlabeled Tensor Factorization for Location Category Inference from Highly Inaccurate Mobility Data

    Authors: Jinfeng Yi, Qi Lei, Wesley Gifford, Ji Liu, Junchi Yan

    Abstract: Identifying significant location categories visited by mobile users is the key to a variety of applications. This is an extremely challenging task due to the possible deviation between the estimated location coordinate and the actual location, which could be on the order of kilometers. To estimate the actual location category more precisely, we propose a novel tensor factorization framework, throu…

    Submitted 24 May, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

  47. arXiv:1702.03584  [pdf, other]

    cs.AI cs.LG

    Similarity Preserving Representation Learning for Time Series Clustering

    Authors: Qi Lei, Jinfeng Yi, Roman Vaculin, Lingfei Wu, Inderjit S. Dhillon

    Abstract: A considerable amount of clustering algorithms take instance-feature matrices as their inputs. As such, they cannot directly analyze time series data due to its temporal nature, usually unequal lengths, and complex properties. This is a great pity since many of these algorithms are effective, robust, efficient, and easy to use. In this paper, we bridge this gap by proposing an efficient representa…

    Submitted 2 June, 2019; v1 submitted 12 February, 2017; originally announced February 2017.

  48. arXiv:1612.03301  [pdf, other]

    stat.ML cs.DC cs.IT cs.LG stat.CO

    Gradient Coding

    Authors: Rashish Tandon, Qi Lei, Alexandros G. Dimakis, Nikos Karampatziakis

    Abstract: We propose a novel coding theoretic framework for mitigating stragglers in distributed learning. We show how carefully replicating data blocks and coding across gradients can provide tolerance to failures and stragglers for Synchronous Gradient Descent. We implement our schemes in python (using MPI) to run on Amazon EC2, and show how we compare against baseline approaches in running time and gener…

    Submitted 7 March, 2017; v1 submitted 10 December, 2016; originally announced December 2016.

  49. arXiv:1610.03317  [pdf, other]

    cs.DS cs.LG

    A Greedy Approach for Budgeted Maximum Inner Product Search

    Authors: Hsiang-Fu Yu, Cho-Jui Hsieh, Qi Lei, Inderjit S. Dhillon

    Abstract: Maximum Inner Product Search (MIPS) is an important task in many machine learning applications such as the prediction phase of a low-rank matrix factorization model for a recommender system. There have been some works on how to perform MIPS in sub-linear time recently. However, most of them do not have the flexibility to control the trade-off between search efficient and search quality. In this pa…

    Submitted 11 October, 2016; originally announced October 2016.

  50. arXiv:1509.01404  [pdf, ps, other]

    math.NA cs.CV cs.LG math.OC stat.ML

    Coordinate Descent Methods for Symmetric Nonnegative Matrix Factorization

    Authors: Arnaud Vandaele, Nicolas Gillis, Qi Lei, Kai Zhong, Inderjit Dhillon

    Abstract: Given a symmetric nonnegative matrix $A$, symmetric nonnegative matrix factorization (symNMF) is the problem of finding a nonnegative matrix $H$, usually with much fewer columns than $A$, such that $A \approx HH^T$. SymNMF can be used for data analysis and in particular for various clustering tasks. In this paper, we propose simple and very efficient coordinate descent schemes to solve this proble…

    Submitted 31 May, 2016; v1 submitted 4 September, 2015; originally announced September 2015.

    Comments: 25 pages, 5 figures, 7 tables. Main changes: comparison with another symNMF algorithm (namely, BetaSNMF), and correction of an error in the convergence proof

    Journal ref: IEEE Transactions on Signal Processing 64 (21), pp. 5571-5584, 2016