Skip to main content

Showing 1–40 of 40 results for author: Xie, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.16852  [pdf, other

    cs.LG cs.AI stat.ML

    EM Distillation for One-step Diffusion Models

    Authors: Sirui Xie, Zhisheng Xiao, Diederik P Kingma, Tingbo Hou, Ying Nian Wu, Kevin Patrick Murphy, Tim Salimans, Ben Poole, Ruiqi Gao

    Abstract: While diffusion models can learn complex distributions, sampling requires a computationally expensive iterative process. Existing distillation methods enable efficient sampling, but have notable limitations, such as performance degradation with very few sampling steps, reliance on training data access, or mode-seeking optimization that may fail to capture the full distribution. We propose EM Disti… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.04454  [pdf, other

    cs.LG math.OC stat.ML

    Implicit Bias of AdamW: $\ell_\infty$ Norm Constrained Optimization

    Authors: Shuo Xie, Zhiyuan Li

    Abstract: Adam with decoupled weight decay, also known as AdamW, is widely acclaimed for its superior performance in language modeling tasks, surpassing Adam with $\ell_2$ regularization in terms of generalization and optimization. However, this advantage is not theoretically well-understood. One challenge here is that though intuitively Adam with $\ell_2$ regularization optimizes the $\ell_2$ regularized l… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  3. arXiv:2403.15025  [pdf, other

    cs.LG stat.ML

    Robust Conformal Prediction under Distribution Shift via Physics-Informed Structural Causal Model

    Authors: Rui Xu, Yue Sun, Chao Chen, Parv Venkitasubramaniam, Sihong Xie

    Abstract: Uncertainty is critical to reliable decision-making with machine learning. Conformal prediction (CP) handles uncertainty by predicting a set on a test input, ho** the set to cover the true label with at least $(1-α)$ confidence. This coverage can be guaranteed on test data even if the marginal distributions $P_X$ differ between calibration and test datasets. However, as it is common in practice,… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  4. arXiv:2402.05052  [pdf, other

    cs.LG stat.ML

    Causal Representation Learning from Multiple Distributions: A General Setting

    Authors: Kun Zhang, Shaoan Xie, Ignavier Ng, Yujia Zheng

    Abstract: In many problems, the measured variables (e.g., image pixels) are just mathematical functions of the hidden causal variables (e.g., the underlying concepts or objects). For the purpose of making predictions in changing environments or making proper changes to the system, it is helpful to recover the hidden causal variables $Z_i$ and their causal relations represented by graph $\mathcal{G}_Z$. This… ▽ More

    Submitted 9 April, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  5. arXiv:2310.12462  [pdf, other

    cs.LG cs.CL stat.ML

    Unmasking Transformers: A Theoretical Approach to Data Recovery via Attention Weights

    Authors: Yichuan Deng, Zhao Song, Shenghao Xie, Chiwun Yang

    Abstract: In the realm of deep learning, transformers have emerged as a dominant architecture, particularly in natural language processing tasks. However, with their widespread adoption, concerns regarding the security and privacy of the data processed by these models have arisen. In this paper, we address a pivotal question: Can the data fed into transformers be recovered using their attention weights and… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  6. arXiv:2310.03218  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Energy-Based Prior Model with Diffusion-Amortized MCMC

    Authors: Peiyu Yu, Yaxuan Zhu, Sirui Xie, Xiaojian Ma, Ruiqi Gao, Song-Chun Zhu, Ying Nian Wu

    Abstract: Latent space Energy-Based Models (EBMs), also known as energy-based priors, have drawn growing interests in the field of generative modeling due to its flexibility in the formulation and strong modeling power of the latent space. However, the common practice of learning latent space EBMs with non-convergent short-run MCMC for prior and posterior sampling is hindering the model from further progres… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  7. arXiv:2308.08358  [pdf, ps, other

    cs.LG stat.ML

    Convergence of Two-Layer Regression with Nonlinear Units

    Authors: Yichuan Deng, Zhao Song, Shenghao Xie

    Abstract: Large language models (LLMs), such as ChatGPT and GPT4, have shown outstanding performance in many human life task. Attention computation plays an important role in training LLMs. Softmax unit and ReLU unit are the key structure in attention computation. Inspired by them, we put forward a softmax ReLU regression problem. Generally speaking, our goal is to find an optimal solution to the regression… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  8. arXiv:2306.06510  [pdf, other

    cs.LG stat.ML

    Partial Identifiability for Domain Adaptation

    Authors: Ling**g Kong, Shaoan Xie, Weiran Yao, Yujia Zheng, Guangyi Chen, Petar Stojanov, Victor Akinwande, Kun Zhang

    Abstract: Unsupervised domain adaptation is critical to many real-world applications where label information is unavailable in the target domain. In general, without further assumptions, the joint distribution of the features and the label is not identifiable in the target domain. To address this issue, we rely on the property of minimal changes of causal mechanisms across domains to minimize unnecessary in… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: ICML 2022

  9. arXiv:2306.05751  [pdf, other

    cs.LG stat.ME

    Advancing Counterfactual Inference through Nonlinear Quantile Regression

    Authors: Shaoan Xie, Biwei Huang, Bin Gu, Tongliang Liu, Kun Zhang

    Abstract: The capacity to address counterfactual "what if" inquiries is crucial for understanding and making use of causal influences. Traditional counterfactual inference, under Pearls' counterfactual framework, typically depends on having access to or estimating a structural causal model. Yet, in practice, this causal model is often unknown and might be challenging to identify. Hence, this paper aims to p… ▽ More

    Submitted 27 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

  10. arXiv:2203.08857  [pdf, other

    stat.ML cs.AI cs.LG

    Noisy Tensor Completion via Low-rank Tensor Ring

    Authors: Yuning Qiu, Guoxu Zhou, Qibin Zhao, Shengli Xie

    Abstract: Tensor completion is a fundamental tool for incomplete data analysis, where the goal is to predict missing entries from partial observations. However, existing methods often make the explicit or implicit assumption that the observed entries are noise-free to provide a theoretical guarantee of exact recovery of missing entries, which is quite restrictive in practice. To remedy such drawbacks, this… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  11. arXiv:2112.05090  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Extending the WILDS Benchmark for Unsupervised Adaptation

    Authors: Shiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang

    Abstract: Machine learning systems deployed in the wild are often trained on a source distribution but deployed on a different target distribution. Unlabeled data can be a powerful point of leverage for mitigating these distribution shifts, as it is frequently much more available than labeled data and can often be obtained from distributions beyond the source distribution as well. However, existing distribu… ▽ More

    Submitted 23 April, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  12. Evaluating Effectiveness of Public Health Intervention Strategies for Mitigating COVID-19 Pandemic

    Authors: Shanghong Xie, Wenbo Wang, Qinxia Wang, Yuanjia Wang, Donglin Zeng

    Abstract: Coronavirus disease 2019 (COVID-19) pandemic is an unprecedented global public health challenge. In the United States (US), state governments have implemented various non-pharmaceutical interventions (NPIs), such as physical distance closure (lockdown), stay-at-home order, mandatory facial mask in public in response to the rapid spread of COVID-19. To evaluate the effectiveness of these NPIs, we p… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Journal ref: Statistics in Medicine 41 (9) (2022) 3820-3836

  13. arXiv:2106.09226  [pdf, other

    cs.LG stat.ML

    Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning

    Authors: Colin Wei, Sang Michael Xie, Tengyu Ma

    Abstract: Pretrained language models have achieved state-of-the-art performance when adapted to a downstream NLP task. However, theoretical analysis of these models is scarce and challenging since the pretraining and downstream tasks can be very different. We propose an analysis framework that links the pretraining and downstream tasks with an underlying latent variable generative model of text -- the downs… ▽ More

    Submitted 20 April, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

  14. arXiv:2101.11159  [pdf, other

    stat.ME cs.DM stat.CO

    An Early Stop** Bayesian Data Assimilation Approach for Mixed-Logit Estimation

    Authors: Shanshan Xie, Tim Hillel, Ying **

    Abstract: The mixed-logit model is a flexible tool in transportation choice analysis, which provides valuable insights into inter and intra-individual behavioural heterogeneity. However, applications of mixed-logit models are limited by the high computational and data requirements for model estimation. When estimating on small samples, the Bayesian estimation approach becomes vulnerable to over and under-fi… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

  15. arXiv:2012.04550  [pdf, other

    cs.LG stat.ML

    In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness

    Authors: Sang Michael Xie, Ananya Kumar, Robbie Jones, Fereshte Khani, Tengyu Ma, Percy Liang

    Abstract: Consider a prediction setting with few in-distribution labeled examples and many unlabeled examples both in- and out-of-distribution (OOD). The goal is to learn a model which performs well both in-distribution and OOD. In these settings, auxiliary information is often cheaply available for every input. How should we best leverage this auxiliary information for the prediction task? Empirically acro… ▽ More

    Submitted 7 April, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: ICLR 2021

  16. arXiv:2009.05266  [pdf, other

    cs.LG stat.ML

    GTEA: Inductive Representation Learning on Temporal Interaction Graphs via Temporal Edge Aggregation

    Authors: Siyue Xie, Yiming Li, Da Sun Handason Tam, Xiaxin Liu, Qiu Fang Ying, Wing Cheong Lau, Dah Ming Chiu, Shou Zhi Chen

    Abstract: In this paper, we propose the Graph Temporal Edge Aggregation (GTEA) framework for inductive learning on Temporal Interaction Graphs (TIGs). Different from previous works, GTEA models the temporal dynamics of interaction sequences in the continuous-time space and simultaneously takes advantage of both rich node and edge/ interaction attributes in the graph. Concretely, we integrate a sequence mode… ▽ More

    Submitted 3 May, 2023; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: accepted by PAKDD2023

  17. arXiv:2009.01272  [pdf, other

    cs.LG stat.ML

    Understanding the wiring evolution in differentiable neural architecture search

    Authors: Sirui Xie, Shoukang Hu, Xinjiang Wang, Chunxiao Liu, Jian** Shi, Xunying Liu, Dahua Lin

    Abstract: Controversy exists on whether differentiable neural architecture search methods discover wiring topology effectively. To understand how wiring topology evolves, we study the underlying mechanism of several existing differentiable NAS frameworks. Our investigation is motivated by three observed searching patterns of differentiable NAS: 1) they search by growing instead of pruning; 2) wider networks… ▽ More

    Submitted 25 February, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

    Comments: AISTATS 2021

  18. arXiv:2007.06559  [pdf, other

    cs.LG cs.CV cs.SI stat.ML

    Graph Structure of Neural Networks

    Authors: Jiaxuan You, Jure Leskovec, Kaiming He, Saining Xie

    Abstract: Neural networks are often represented as graphs of connections between neurons. However, despite their wide use, there is currently little understanding of the relationship between the graph structure of the neural network and its predictive performance. Here we systematically investigate how does the graph structure of neural networks affect their predictive performance. To this end, we develop a… ▽ More

    Submitted 27 August, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: ICML 2020, with open-source code

  19. arXiv:2007.04649  [pdf, other

    cs.LG stat.ML

    Learning to Reweight with Deep Interactions

    Authors: Yang Fan, Yingce Xia, Lijun Wu, Shufang Xie, Weiqing Liu, Jiang Bian, Tao Qin, Xiang-Yang Li

    Abstract: Recently, the concept of teaching has been introduced into machine learning, in which a teacher model is used to guide the training of a student model (which will be used in real tasks) through data selection, loss function design, etc. Learning to reweight, which is a specific kind of teaching that reweights training data using a teacher model, receives much attention due to its simplicity and ef… ▽ More

    Submitted 12 January, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted to AAAI-2021

  20. arXiv:2006.16205  [pdf, other

    cs.LG stat.ML

    Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

    Authors: Sang Michael Xie, Tengyu Ma, Percy Liang

    Abstract: We focus on prediction problems with structured outputs that are subject to output validity constraints, e.g. pseudocode-to-code translation where the code must compile. While labeled input-output pairs are expensive to obtain, "unlabeled" outputs, i.e. outputs without corresponding inputs, are freely available (e.g. code on GitHub) and provide information about output validity. We can capture the… ▽ More

    Submitted 24 October, 2023; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: ICML 2021 Long talk

  21. arXiv:2002.10716  [pdf, other

    cs.LG stat.ML

    Understanding and Mitigating the Tradeoff Between Robustness and Accuracy

    Authors: Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy Liang

    Abstract: Adversarial training augments the training set with perturbations to improve the robust error (over worst-case perturbations), but it often leads to an increase in the standard error (on unperturbed test inputs). Previous explanations for this tradeoff rely on the assumption that no predictor in the hypothesis class has low standard and robust error. In this work, we precisely characterize the eff… ▽ More

    Submitted 6 July, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Appearing at International Conference on Machine Learning (ICML) 2020

  22. arXiv:2002.09128  [pdf, other

    cs.LG stat.ML

    DSNAS: Direct Neural Architecture Search without Parameter Retraining

    Authors: Shoukang Hu, Sirui Xie, Hehui Zheng, Chunxiao Liu, Jian** Shi, Xunying Liu, Dahua Lin

    Abstract: If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, performance of the same architecture in the two stages correlates poorly. In this work, we propose a new problem definition for NAS, task-specific end-to-end, based on this observation. We argue that given a computer vision task for which a NAS method is expected, this de… ▽ More

    Submitted 31 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: To appear in CVPR 2020

  23. arXiv:1912.10508  [pdf, other

    cs.IT stat.AP

    Direct and Indirect Effects -- An Information Theoretic Perspective

    Authors: Gabriel Schamberg, William Chapman, Shang-** Xie, Todd P. Coleman

    Abstract: Information theoretic (IT) approaches to quantifying causal influences have experienced some popularity in the literature, in both theoretical and applied (e.g. neuroscience and climate science) domains. While these causal measures are desirable in that they are model agnostic and can capture non-linear interactions, they are fundamentally different from common statistical notions of causal influe… ▽ More

    Submitted 28 July, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

  24. arXiv:1911.01919  [pdf

    stat.AP stat.ML

    Neural Network Based Parameter Estimation Method for the Pareto/NBD Model

    Authors: Shao-Ming Xie

    Abstract: Whether stochastic or parametric, the Pareto/NBD model can only be utilized for an in-sample prediction rather than an out-of-sample prediction. This research thus provides a neural network based extension of the Pareto/NBD model to estimate the out-of-sample parameters, which overrides the estimation burden and the application dilemma of the Pareto/NBD approach. The empirical results indicate tha… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 35 pages, 6 figures, 14 tables

  25. arXiv:1906.06832  [pdf, other

    cs.LG cs.CV stat.ML

    Sample-Efficient Neural Architecture Search by Learning Action Space

    Authors: Linnan Wang, Saining Xie, Teng Li, Rodrigo Fonseca, Yuandong Tian

    Abstract: Neural Architecture Search (NAS) has emerged as a promising technique for automatic neural network design. However, existing MCTS based NAS approaches often utilize manually designed action space, which is not directly related to the performance metric to be optimized (e.g., accuracy), leading to sample-inefficient explorations of architectures. To improve the sample efficiency, this paper propose… ▽ More

    Submitted 31 March, 2021; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: Accepted at TPAMI-2021

  26. arXiv:1906.06032  [pdf, other

    cs.LG stat.ML

    Adversarial Training Can Hurt Generalization

    Authors: Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John C. Duchi, Percy Liang

    Abstract: While adversarial training can improve robust accuracy (against an adversary), it sometimes hurts standard accuracy (when there is no adversary). Previous work has studied this tradeoff between standard and robust accuracy, but only in the setting where no predictor performs well on both objectives in the infinite data limit. In this paper, we show that even when the optimal predictor with infinit… ▽ More

    Submitted 26 August, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  27. arXiv:1901.10517  [pdf, other

    cs.LG stat.ML

    Reparameterizable Subset Sampling via Continuous Relaxations

    Authors: Sang Michael Xie, Stefano Ermon

    Abstract: Many machine learning tasks require sampling a subset of items from a collection based on a parameterized distribution. The Gumbel-softmax trick can be used to sample a single item, and allows for low-variance reparameterized gradients with respect to the parameters of the underlying distribution. However, stochastic optimization involving subset sampling is typically not reparameterizable. To ove… ▽ More

    Submitted 26 February, 2021; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: IJCAI 2019

  28. arXiv:1901.10042  [pdf

    cs.LG cs.CV stat.ML

    Deep Features Analysis with Attention Networks

    Authors: Shipeng Xie, Da Chen, Rong Zhang, Hui Xue

    Abstract: Deep neural network models have recently draw lots of attention, as it consistently produce impressive results in many computer vision tasks such as image classification, object detection, etc. However, interpreting such model and show the reason why it performs quite well becomes a challenging question. In this paper, we propose a novel method to interpret the neural network models with attention… ▽ More

    Submitted 20 January, 2019; originally announced January 2019.

    Comments: In AAAI-19 Workshop on Network Interpretability for Deep Learning

  29. arXiv:1812.09926  [pdf, other

    cs.LG cs.AI stat.ML

    SNAS: Stochastic Neural Architecture Search

    Authors: Sirui Xie, Hehui Zheng, Chunxiao Liu, Liang Lin

    Abstract: We propose Stochastic Neural Architecture Search (SNAS), an economical end-to-end solution to Neural Architecture Search (NAS) that trains neural operation parameters and architecture distribution parameters in same round of back-propagation, while maintaining the completeness and differentiability of the NAS pipeline. In this work, NAS is reformulated as an optimization problem on parameters of a… ▽ More

    Submitted 31 March, 2020; v1 submitted 24 December, 2018; originally announced December 2018.

    Comments: ICLR 2019

  30. arXiv:1812.09028  [pdf, other

    cs.LG cs.RO stat.ML

    NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning

    Authors: Sirui Xie, Junning Huang, Lanxin Lei, Chunxiao Liu, Zheng Ma, Wei Zhang, Liang Lin

    Abstract: Reinforcement learning agents need exploratory behaviors to escape from local optima. These behaviors may include both immediate dithering perturbation and temporally consistent exploration. To achieve these, a stochastic policy model that is inherently consistent through a period of time is in desire, especially for tasks with either sparse rewards or long term information. In this work, we intro… ▽ More

    Submitted 24 December, 2018; v1 submitted 21 December, 2018; originally announced December 2018.

    Comments: To appear in ICLR 2019

  31. arXiv:1811.04480  [pdf, other

    cs.LG stat.ML

    Semi-supervised Deep Representation Learning for Multi-View Problems

    Authors: Vahid Noroozi, Sara Bahaadini, Lei Zheng, Sihong Xie, Weixiang Shao, Philip S. Yu

    Abstract: While neural networks for learning representation of multi-view data have been previously proposed as one of the state-of-the-art multi-view dimension reduction techniques, how to make the representation discriminative with only a small amount of labeled data is not well-studied. We introduce a semi-supervised neural network model, named Multi-view Discriminative Neural Network (MDNN), for multi-v… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: Accepted to IEEE Big Data 2018. 9 Pages

  32. arXiv:1811.03739  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Securing Behavior-based Opinion Spam Detection

    Authors: Shuaijun Ge, Guixiang Ma, Sihong Xie, Philip S. Yu

    Abstract: Reviews spams are prevalent in e-commerce to manipulate product ranking and customers decisions maliciously. While spams generated based on simple spamming strategy can be detected effectively, hardened spammers can evade regular detectors via more advanced spamming strategies. Previous work gave more attention to evasion against text and graph-based detectors, but evasions against behavior-based… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

  33. arXiv:1810.04472   

    cs.LG cs.AI stat.ML

    Domain Confusion with Self Ensembling for Unsupervised Adaptation

    Authors: Jiawei Wang, Zhaoshui He, Chengjian Feng, Zhou** Zhu, Qinzhuang Lin, Jun Lv, Shengli Xie

    Abstract: Data collection and annotation are time-consuming in machine learning, expecially for large scale problem. A common approach for this problem is to transfer knowledge from a related labeled domain to a target one. There are two popular ways to achieve this goal: adversarial learning and self training. In this article, we first analyze the training unstablity problem and the mistaken confusion issu… ▽ More

    Submitted 8 July, 2020; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: The expression is ambiguous, which is not convenient for readers to understand, and in today's view, the conclusion of the paper is of little significance, so it is no longer open

  34. arXiv:1805.10407  [pdf, other

    cs.LG cs.AI stat.ML

    Semi-supervised Deep Kernel Learning: Regression with Unlabeled Data by Minimizing Predictive Variance

    Authors: Neal Jean, Sang Michael Xie, Stefano Ermon

    Abstract: Large amounts of labeled data are typically required to train deep learning models. For many real-world problems, however, acquiring additional data can be expensive or even impossible. We present semi-supervised deep kernel learning (SSDKL), a semi-supervised regression model based on minimizing predictive variance in the posterior regularization framework. SSDKL combines the hierarchical represe… ▽ More

    Submitted 4 March, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: In Proceedings of Neural Information Processing Systems (NeurIPS) 2018

  35. arXiv:1805.08465  [pdf, other

    cs.LG stat.ML

    Beyond Unfolding: Exact Recovery of Latent Convex Tensor Decomposition under Reshuffling

    Authors: Chao Li, Mohammad Emtiyaz Khan, Zhun Sun, Gang Niu, Bo Han, Shengli Xie, Qibin Zhao

    Abstract: Exact recovery of tensor decomposition (TD) methods is a desirable property in both unsupervised learning and scientific data analysis. The numerical defects of TD methods, however, limit their practical applications on real-world data. As an alternative, convex tensor decomposition (CTD) was proposed to alleviate these problems, but its exact-recovery property is not properly addressed so far. To… ▽ More

    Submitted 28 January, 2020; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: AAAI-2020

  36. arXiv:1803.09830  [pdf, other

    stat.ME

    Cox Regression Model Under Dependent Truncation

    Authors: Lior Rennert, Sharon X. Xie

    Abstract: Truncation is a statistical phenomenon that occurs in many time to event studies. For example, autopsy-confirmed studies of neurodegenerative diseases are subject to an inherent left and right truncation, also known as double truncation. When the goal is to study the effect of risk factors on survival, the standard Cox regression model cannot be used when the data is subject to truncation. Existin… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

  37. arXiv:1706.03692  [pdf, other

    cs.LG stat.ML

    SEVEN: Deep Semi-supervised Verification Networks

    Authors: Vahid Noroozi, Lei Zheng, Sara Bahaadini, Sihong Xie, Philip S. Yu

    Abstract: Verification determines whether two samples belong to the same class or not, and has important applications such as face and fingerprint verification, where thousands or millions of categories are present but each category has scarce labeled examples, presenting two major challenges for existing deep learning models. We propose a deep semi-supervised model named SEmi-supervised VErification Networ… ▽ More

    Submitted 14 June, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: 7 pages, 2 figures, accepted to the 2017 International Joint Conference on Artificial Intelligence (IJCAI-17)

  38. arXiv:1409.5185  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Deeply-Supervised Nets

    Authors: Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu

    Abstract: Our proposed deeply-supervised nets (DSN) method simultaneously minimizes classification error while making the learning process of hidden layers direct and transparent. We make an attempt to boost the classification performance by studying a new formulation in deep networks. Three aspects in convolutional neural networks (CNN) style architectures are being looked at: (1) transparency of the inter… ▽ More

    Submitted 25 September, 2014; v1 submitted 18 September, 2014; originally announced September 2014.

    Comments: Patent disclosure, UCSD Docket No. SD2014-313, filed on May 22, 2014

  39. arXiv:1404.4412  [pdf, other

    cs.LG cs.CV stat.ML

    Efficient Nonnegative Tucker Decompositions: Algorithms and Uniqueness

    Authors: Guoxu Zhou, Andrzej Cichocki, Qibin Zhao, Shengli Xie

    Abstract: Nonnegative Tucker decomposition (NTD) is a powerful tool for the extraction of nonnegative parts-based and physically meaningful latent components from high-dimensional tensor data while preserving the natural multilinear structure of data. However, as the data tensor often has multiple modes and is large-scale, existing NTD algorithms suffer from a very high computational complexity in terms of… ▽ More

    Submitted 16 September, 2015; v1 submitted 16 April, 2014; originally announced April 2014.

    Comments: appears in IEEE Transactions on Image Processing, 2015

  40. arXiv:1310.4252  [pdf, ps, other

    stat.ML cs.LG

    Multilabel Consensus Classification

    Authors: Sihong Xie, Xiangnan Kong, **g Gao, Wei Fan, Philip S. Yu

    Abstract: In the era of big data, a large amount of noisy and incomplete data can be collected from multiple sources for prediction tasks. Combining multiple models or data sources helps to counteract the effects of low data quality and the bias of any single model or data source, and thus can improve the robustness and the performance of predictive models. Out of privacy, storage and bandwidth consideratio… ▽ More

    Submitted 15 October, 2013; originally announced October 2013.