Showing 1–50 of 53 results for author: Lin, W

  1. arXiv:2407.01621  [pdf, other]

    cs.LG q-bio.QM stat.ME stat.ML

    Deciphering interventional dynamical causality from non-intervention systems

    Authors: Jifan Shi, Yang Li, Juan Zhao, Siyang Leng, Kazuyuki Aihara, Luonan Chen, Wei Lin

    Abstract: Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational crite…

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2406.06213  [pdf, ps, other]

    cs.LG cs.AI stat.AP stat.ML

    A Statistical Theory of Regularization-Based Continual Learning

    Authors: Xuyang Zhao, Huiyuan Wang, Weiran Huang, Wei Lin

    Abstract: We provide a statistical analysis of regularization-based continual learning on a sequence of linear regression tasks, with emphasis on how different regularization terms affect the model performance. We first derive the convergence rate for the oracle estimator obtained as if all data were available simultaneously. Next, we consider a family of generalized $\ell_2$-regularization algorithms index…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  3. arXiv:2405.15403  [pdf, other]

    cs.LG stat.ML

    Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random

    Authors: Mingming Ha, Xuewen Tao, Wenfang Lin, Qionxu Ma, Wujiang Xu, Linxun Chen

    Abstract: In most practical applications such as recommendation systems, display advertising, and so forth, the collected data often contains missing values and those missing values are generally missing-not-at-random, which deteriorates the prediction performance of models. Some existing estimators and regularizers attempt to achieve unbiased estimation to improve the predictive performance. However, varia…

    Submitted 24 May, 2024; originally announced May 2024.

  4. arXiv:2405.11720  [pdf, other]

    stat.ME stat.AP

    Estimating optimal tailored active surveillance strategy under interval censoring

    Authors: Muxuan Liang, Yingqi Zhao, Daniel W. Lin, Matthew Cooperberg, Yingye Zheng

    Abstract: Active surveillance (AS) using repeated biopsies to monitor disease progression has been a popular alternative to immediate surgical intervention in cancer care. However, a biopsy procedure is invasive and sometimes leads to severe side effects of infection and bleeding. To reduce the burden of repeated surveillance biopsies, biomarker-assistant decision rules are sought to replace the fix-for-all…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 14 pages, 4 figures, 2 tables

  5. arXiv:2405.00308  [pdf]

    cs.CR stat.AP

    FPGA Digital Dice using Pseudo Random Number Generator

    Authors: Michael Lim Kee Hian, Ten Wei Lin, Zachary Wu Xuan, Stephanie-Ann Loy, Maoyang Xiang, T. Hui Teo

    Abstract: The goal of this project is to design a digital dice that displays dice numbers in real-time. The number is generated by a pseudo-random number generator (PRNG) using XORshift algorithm that is implemented in Verilog HDL on an FPGA. The digital dice is equipped with tilt sensor, display, power management circuit, and rechargeable battery hosted in a 3D printed dice casing. By shaking the digital d…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 15 pages, 5 figures

  6. arXiv:2312.05705  [pdf, other]

    cs.LG stat.ML

    Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC

    Authors: Wu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani

    Abstract: Second-order methods such as KFAC can be useful for neural net training. However, they are often memory-inefficient since their preconditioning Kronecker factors are dense, and numerically unstable in low precision as they require matrix inversion or decomposition. These limitations render such methods unpopular for modern mixed-precision training. We address them by (i) formulating an inverse-fre…

    Submitted 15 June, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: A long version of the ICML 2024 paper

  7. arXiv:2312.02213  [pdf, other]

    cs.LG cs.AI cs.DB stat.AP

    JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization

    Authors: Shang-Ching Liu, ShengKun Wang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo, Tsungyao Chang, Jianwei Zhang

    Abstract: In this study, we introduce JarviX, a sophisticated data analytics framework. JarviX is designed to employ Large Language Models (LLMs) to facilitate an automated guide and execute high-precision data analyses on tabular datasets. This framework emphasizes the significance of varying column types, capitalizing on state-of-the-art LLMs to generate concise data insight summaries, propose relevant an…

    Submitted 3 December, 2023; originally announced December 2023.

  8. arXiv:2310.12026  [pdf, other]

    stat.ML cs.LG stat.AP

    Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

    Authors: Mingzhang Yin, Ruijiang Gao, Weiran Lin, Steven M. Shugan

    Abstract: Designing products to meet consumers' preferences is essential for a business's success. We propose the Gradient-based Survey (GBS), a discrete choice experiment for multiattribute product design. The experiment elicits consumer preferences through a sequence of paired comparisons for partial profiles. GBS adaptively constructs paired comparison questions based on the respondents' previous choices…

    Submitted 18 October, 2023; originally announced October 2023.

  9. arXiv:2309.06985  [pdf, other]

    stat.ME math.ST stat.AP stat.ML

    CARE: Large Precision Matrix Estimation for Compositional Data

    Authors: Shucong Zhang, Huiyuan Wang, Wei Lin

    Abstract: High-dimensional compositional data are prevalent in many applications. The simplex constraint poses intrinsic challenges to inferring the conditional dependence relationships among the components forming a composition, as encoded by a large precision matrix. We introduce a precise specification of the compositional precision matrix and relate it to its basis counterpart, which is shown to be asym…

    Submitted 22 March, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 67 pages, 7 figures, to appear in Journal of the American Statistical Association (http://www.tandfonline.com/r/JASA)

  10. arXiv:2307.03136  [pdf, other]

    math.OC cs.LG stat.ML

    Multiplicative Updates for Online Convex Optimization over Symmetric Cones

    Authors: Ilayda Canyakmaz, Wayne Lin, Georgios Piliouras, Antonios Varvitsiotis

    Abstract: We study online convex optimization where the possible actions are trace-one elements in a symmetric cone, generalizing the extensively-studied experts setup and its quantum counterpart. Symmetric cones provide a unifying framework for some of the most important optimization models, including linear, second-order cone, and semidefinite optimization. Using tools from the field of Euclidean Jordan A…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 27 pages, 7 figures, 2 tables

  11. arXiv:2302.09738  [pdf, other]

    stat.ML cs.LG

    Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning

    Authors: Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: Riemannian submanifold optimization with momentum is computationally challenging because, to ensure that the iterates remain on the submanifold, we often need to solve difficult differential equations. Here, we simplify such difficulties for a class of sparse or structured symmetric positive-definite matrices with the affine-invariant metric. We do so by proposing a generalized version of the Riem…

    Submitted 16 March, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: A long version of the ICML 2023 paper. Updated the main text to emphasize challenges of using existing Riemannian methods to estimate sparse and structured SPD matrices

  12. arXiv:2210.08549  [pdf]

    stat.AP cs.AI cs.LG cs.NE stat.ML

    Automatic Emergency Dust-Free solution on-board International Space Station with Bi-GRU (AED-ISS)

    Authors: Po-Han Hou, Wei-Chih Lin, Hong-Chun Hou, Yu-Hao Huang, Jih-Hong Shue

    Abstract: With a rising attention for the issue of PM2.5 or PM0.3, particulate matters have become not only a potential threat to both the environment and human, but also a harming existence to instruments onboard International Space Station (ISS). Our team is aiming to relate various concentration of particulate matters to magnetic fields, humidity, acceleration, temperature, pressure and CO2 concentration…

    Submitted 2 August, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 11 pages, 5 figures, and 1 table

  13. arXiv:2210.03612  [pdf, ps, other]

    stat.ML cs.AI cs.CR cs.CV cs.LG

    1st ICLR International Workshop on Privacy, Accountability, Interpretability, Robustness, Reasoning on Structured Data (PAIR^2Struct)

    Authors: Hao Wang, Wanyu Lin, Hao He, Di Wang, Chengzhi Mao, Muhan Zhang

    Abstract: Recent years have seen advances on principles and guidance relating to accountable and ethical use of artificial intelligence (AI) spring up around the globe. Specifically, Data Privacy, Accountability, Interpretability, Robustness, and Reasoning have been broadly recognized as fundamental principles of using machine learning (ML) technologies on decision-critical and/or privacy-sensitive applicat…

    Submitted 7 October, 2022; originally announced October 2022.

  14. arXiv:2209.08737  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Heterogeneous Federated Learning on a Graph

    Authors: Huiyuan Wang, Xuyang Zhao, Wei Lin

    Abstract: Federated learning, where algorithms are trained across multiple decentralized devices without sharing local data, is increasingly popular in distributed machine learning practice. Typically, a graph structure $G$ exists behind local devices for communication. In this work, we consider parameter estimation in federated learning with data distribution and communication heterogeneity, as well as lim…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 61 pages, 4 figures

  15. arXiv:2203.15209  [pdf, other]

    cs.LG stat.ML

    OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks

    Authors: Wanyu Lin, Hao Lan, Hao Wang, Baochun Li

    Abstract: This paper proposes a new eXplanation framework, called OrphicX, for generating causal explanations for any graph neural networks (GNNs) based on learned latent causal factors. Specifically, we construct a distinct generative model and design an objective function that encourages the generative model to produce causal, compact, and faithful explanations. This is achieved by isolating the causal fa…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022, an oral presentation, source code: https://github.com/WanyuGroup/CVPR2022-OrphicX

  16. arXiv:2110.11562  [pdf, other]

    stat.ME math.ST

    Temporal Point Process Graphical Models

    Authors: Yalong Lyu, Huiyuan Wang, Wei Lin

    Abstract: Many real-world objects can be modeled as a stream of events on the nodes of a graph. In this paper, we propose a class of graphical event models named temporal point process graphical models for representing the temporal dependencies among different components of a multivariate point process. In our model, the intensity of an event stream can depend on the historical events in a nonlinear way. We…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 21 pages, 5 figures

    MSC Class: Primary 62M08; secondary 62H08; 60G08

  17. arXiv:2109.04657  [pdf, other]

    stat.ME

    Principal component analysis for high-dimensional compositional data

    Authors: Jingru Zhang, Wei Lin

    Abstract: Dimension reduction for high-dimensional compositional data plays an important role in many fields, where the principal component analysis of the basis covariance matrix is of scientific interest. In practice, however, the basis variables are latent and rarely observed, and standard techniques of principal component analysis are inadequate for compositional data because of the simplex constraint.…

    Submitted 10 September, 2021; originally announced September 2021.

  18. arXiv:2107.10884  [pdf, other]

    stat.ML cs.LG

    Structured second-order methods via natural gradient descent

    Authors: Wu Lin, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: In this paper, we propose new structured second-order methods and structured adaptive-gradient methods obtained by performing natural-gradient descent on structured parameter spaces. Natural-gradient descent is an attractive approach to design new algorithms in many settings such as gradient-free, adaptive-gradient, and second-order methods. Our structured methods not only enjoy a structural invar…

    Submitted 19 February, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Fixed some typos and added a new figure. ICML 2021 workshop paper. A short version of arXiv:2102.07405 with a focus on optimization tasks

  19. arXiv:2106.10341  [pdf, other]

    stat.CO econ.EM

    Scalable Econometrics on Big Data -- The Logistic Regression on Spark

    Authors: Aurélien Ouattara, Matthieu Bulté, Wan-Ju Lin, Philipp Scholl, Benedikt Veit, Christos Ziakas, Florian Felice, Julien Virlogeux, George Dikos

    Abstract: Extra-large datasets are becoming increasingly accessible, and computing tools designed to handle huge amount of data efficiently are democratizing rapidly. However, conventional statistical and econometric tools are still lacking fluency when dealing with such large datasets. This paper dives into econometrics on big datasets, specifically focusing on the logistic regression on Spark. We review t…

    Submitted 18 June, 2021; originally announced June 2021.

  20. arXiv:2106.04795  [pdf, other]

    cs.LG math.ST stat.ML

    Nonasymptotic theory for two-layer neural networks: Beyond the bias-variance trade-off

    Authors: Huiyuan Wang, Wei Lin

    Abstract: Large neural networks have proved remarkably effective in modern deep learning practice, even in the overparametrized regime where the number of active parameters is large relative to the sample size. This contradicts the classical perspective that a machine learning model must trade off bias and variance for optimal generalization. To resolve this conflict, we present a nonasymptotic generalizati…

    Submitted 30 July, 2023; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: 47 pages, 1 figure

    MSC Class: 62G08 (Primary) 62J07; 68T07 (Secondary)

  21. arXiv:2102.07405  [pdf, other]

    stat.ML cs.LG

    Tractable structured natural gradient descent using local parameterizations

    Authors: Wu Lin, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: Natural-gradient descent (NGD) on structured parameter spaces (e.g., low-rank covariances) is computationally challenging due to difficult Fisher-matrix computations. We address this issue by using \emph{local-parameter coordinates} to obtain a flexible and efficient NGD method that works well for a wide-variety of structured parameterizations. We show four applications where our method (1) genera…

    Submitted 17 January, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: An extended version of the ICML 2021 paper. Note: A workshop (short) paper with a focus on optimization tasks can be found at arXiv:2107.10884

  22. arXiv:2011.13219  [pdf, other]

    cs.RO cs.DC cs.LG cs.MA stat.ML

    Message-Aware Graph Attention Networks for Large-Scale Multi-Robot Path Planning

    Authors: Qingbiao Li, Weizhe Lin, Zhe Liu, Amanda Prorok

    Abstract: The domains of transport and logistics are increasingly relying on autonomous mobile robots for the handling and distribution of passengers or resources. At large system scales, finding decentralized path planning and coordination solutions is key to efficient system performance. Recently, Graph Neural Networks (GNNs) have become popular due to their ability to learn communication policies in dece…

    Submitted 25 April, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: This work has been accepted to the IEEE Robotics and Automation Letters (RA-L) for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  23. arXiv:2007.14573  [pdf, other]

    cs.LG stat.ML

    FIVES: Feature Interaction Via Edge Search for Large-Scale Tabular Data

    Authors: Yuexiang Xie, Zhen Wang, Yaliang Li, Bolin Ding, Nezihe Merve Gürel, Ce Zhang, Minlie Huang, Wei Lin, Jingren Zhou

    Abstract: High-order interactive features capture the correlation between different columns and thus are promising to enhance various learning tasks on ubiquitous tabular data. To automate the generation of interactive features, existing works either explicitly traverse the feature space or implicitly express the interactions via intermediate activations of some designed models. These two kinds of methods s…

    Submitted 1 June, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted by KDD-21

  24. arXiv:2006.14278  [pdf, other]

    cs.LG cs.SI stat.ML

    Graph Structural-topic Neural Network

    Authors: Qingqing Long, Yilun Jin, Guojie Song, Yi Li, Wei Lin

    Abstract: Graph Convolutional Networks (GCNs) achieved tremendous success by effectively gathering local features for nodes. However, GCNs commonly focus more on node features and less on graph structures within the neighborhood, especially higher-order structural patterns. Yet such local structural patterns are shown to be indicative of node properties in numerous fields. In addition, it is not jus…

    Submitted 4 July, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

  25. arXiv:2003.11627  [pdf, other]

    cs.CL cs.LG stat.ML

    Author2Vec: A Framework for Generating User Embedding

    Authors: Xiaodong Wu, Weizhe Lin, Zhilin Wang, Elena Rastorgueva

    Abstract: Online forums and social media platforms provide noisy but valuable data every day. In this paper, we propose a novel end-to-end neural network-based user embedding system, Author2Vec. The model incorporates sentence representations generated by BERT (Bidirectional Encoder Representations from Transformers) with a novel unsupervised pre-training objective, authorship classification, to produce bet…

    Submitted 17 March, 2020; originally announced March 2020.

  26. arXiv:2002.10060  [pdf, other]

    stat.ML cs.LG

    Handling the Positive-Definite Constraint in the Bayesian Learning Rule

    Authors: Wu Lin, Mark Schmidt, Mohammad Emtiyaz Khan

    Abstract: The Bayesian learning rule is a natural-gradient variational inference method, which not only contains many existing learning algorithms as special cases but also enables the design of new algorithms. Unfortunately, when variational parameters lie in an open constraint set, the rule may not satisfy the constraint and requires line-searches which could slow down the algorithm. In this work, we addr…

    Submitted 25 October, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: Fixed typos and updated the abstract (ICML 2020)

  27. arXiv:1912.02254  [pdf, other]

    cs.LG stat.ML

    Deep Model Compression Via Two-Stage Deep Reinforcement Learning

    Authors: Huixin Zhan, Wei-Ming Lin, Yongcan Cao

    Abstract: Besides accuracy, the model size of convolutional neural networks (CNN) models is another important factor considering limited hardware resources in practical applications. For example, employing deep neural networks on mobile systems requires the design of accurate yet fast CNN for low latency in classification and object detection. To fulfill the need, we aim at obtaining CNN models with both hi…

    Submitted 2 July, 2021; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: To appear in ECML/PKDD 21

  28. arXiv:1911.10291  [pdf, other]

    cs.LG cs.CV stat.ML

    Invert and Defend: Model-based Approximate Inversion of Generative Adversarial Networks for Secure Inference

    Authors: Wei-An Lin, Yogesh Balaji, Pouya Samangouei, Rama Chellappa

    Abstract: Inferring the latent variable generating a given test sample is a challenging problem in Generative Adversarial Networks (GANs). In this paper, we propose InvGAN - a novel framework for solving the inference problem in GANs, which involves training an encoder network capable of inverting a pre-trained generator network without access to any training data. Under mild assumptions, we theoretically s…

    Submitted 22 November, 2019; originally announced November 2019.

  29. arXiv:1910.13398  [pdf, ps, other]

    stat.ML cs.LG

    Stein's Lemma for the Reparameterization Trick with Exponential Family Mixtures

    Authors: Wu Lin, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: Stein's method (Stein, 1973; 1981) is a powerful tool for statistical applications, and has had a significant impact in machine learning. Stein's lemma plays an essential role in Stein's method. Previous applications of Stein's lemma either required strong technical assumptions or were limited to Gaussian distributions with restricted covariance structures. In this work, we extend Stein's lemma to…

    Submitted 29 October, 2019; originally announced October 2019.

  30. arXiv:1909.12473  [pdf, other]

    cs.LG stat.ML

    Noisy Batch Active Learning with Deterministic Annealing

    Authors: Gaurav Gupta, Anit Kumar Sahu, Wan-Yi Lin

    Abstract: We study the problem of training machine learning models incrementally with batches of samples annotated with noisy oracles. We select each batch of samples that are important and also diverse via clustering and importance sampling. More importantly, we incorporate model uncertainty into the sampling probability to compensate for poor estimation of the importance scores when the training data is t…

    Submitted 28 October, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

  31. arXiv:1909.06040  [pdf, ps, other]

    cs.LG cs.DC stat.ML

    DL2: A Deep Learning-driven Scheduler for Deep Learning Clusters

    Authors: Yanghua Peng, Yixin Bao, Yangrui Chen, Chuan Wu, Chen Meng, Wei Lin

    Abstract: More and more companies have deployed machine learning (ML) clusters, where deep learning (DL) models are trained for providing various AI-driven services. Efficient resource scheduling is essential for maximal utilization of expensive DL clusters. Existing cluster schedulers either are agnostic to ML workload characteristics, or use scheduling heuristics based on operators' understanding of parti…

    Submitted 13 September, 2019; originally announced September 2019.

  32. arXiv:1908.01146  [pdf, other]

    cs.LG eess.SY stat.ML

    Developing an Unsupervised Real-time Anomaly Detection Scheme for Time Series with Multi-seasonality

    Authors: Wentai Wu, Ligang He, Weiwei Lin, Yi Su, Yuhua Cui, Carsten Maple, Stephen Jarvis

    Abstract: On-line detection of anomalies in time series is a key technique used in various event-sensitive scenarios such as robotic system monitoring, smart sensor networks and data center security. However, the increasing diversity of data sources and the variety of demands make this task more challenging than ever. Firstly, the rapid increase in unlabeled data means supervised learning is becoming less s…

    Submitted 23 April, 2021; v1 submitted 3 August, 2019; originally announced August 2019.

    Comments: 14 pages, 11 figures. IEEE Transactions on Knowledge and Data Engineering (2020)

  33. arXiv:1906.02914  [pdf, other]

    stat.ML cs.LG

    Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations

    Authors: Wu Lin, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: Natural-gradient methods enable fast and simple algorithms for variational inference, but due to computational difficulties, their use is mostly limited to \emph{minimal} exponential-family (EF) approximations. In this paper, we extend their application to estimate \emph{structured} approximations such as mixtures of EF distributions. Such approximations can fit complex, multimodal posterior distr…

    Submitted 6 November, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: Corrected some typos and updated the appendix (ICML 2019)

  34. arXiv:1905.12439  [pdf, other]

    cs.SD cs.CR cs.LG cs.MM stat.ML

    Towards robust audio spoofing detection: a detailed comparison of traditional and learned features

    Authors: Balamurali BT, Kin Wah Edward Lin, Simon Lui, Jer-Ming Chen, Dorien Herremans

    Abstract: Automatic speaker verification, like every other biometric system, is vulnerable to spoofing attacks. Using only a few minutes of recorded voice of a genuine client of a speaker verification system, attackers can develop a variety of spoofing attacks that might trick such systems. Detecting these attacks using the audio cues present in the recordings is an important challenge. Most existing spoofi…

    Submitted 18 June, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Journal ref: IEEE Access. 2019

  35. arXiv:1905.03041  [pdf, other]

    cs.SI cs.LG physics.soc-ph stat.ML

    Tag2Vec: Learning Tag Representations in Tag Networks

    Authors: Junshan Wang, Zhicong Lu, Guojie Song, Yue Fan, Lun Du, Wei Lin

    Abstract: Network embedding is a method to learn low-dimensional representation vectors for nodes in complex networks. In real networks, nodes may have multiple tags but existing methods ignore the abundant semantic and hierarchical information of tags. This information is useful to many network applications and usually very stable. In this paper, we propose a tag representation learning model, Tag2Vec, whi…

    Submitted 24 September, 2020; v1 submitted 19 April, 2019; originally announced May 2019.

    Comments: 6 pages

  36. arXiv:1812.01278  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS stat.ML

    Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy

    Authors: Kin Wah Edward Lin, Balamurali B. T., Enyan Koh, Simon Lui, Dorien Herremans

    Abstract: Separating a singing voice from its music accompaniment remains an important challenge in the field of music information retrieval. We present a unique neural network approach inspired by a technique that has revolutionized the field of vision: pixel-wise image classification, which we combine with cross entropy loss and pretraining of the CNN as an autoencoder on singing voice spectrograms. The p…

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: In Press, Neural Computing and Applications, Springer. 2019

    MSC Class: 68-XX; 68Txx

  37. arXiv:1812.01190  [pdf, other]

    cs.IR cs.LG stat.ML

    EENMF: An End-to-End Neural Matching Framework for E-Commerce Sponsored Search

    Authors: Wenjin Wu, Guojun Liu, Hui Ye, Chenshuang Zhang, Tianshu Wu, Daorui Xiao, Wei Lin, Xiaoyu Zhu

    Abstract: E-commerce sponsored search contributes an important part of revenue for the e-commerce company. In consideration of effectiveness and efficiency, a large-scale sponsored search system commonly adopts a multi-stage architecture. We name these stages as ad retrieval, ad pre-ranking and ad ranking. Ad retrieval and ad pre-ranking are collectively referred to as ad matching in this paper. We propose…

    Submitted 9 December, 2018; v1 submitted 3 December, 2018; originally announced December 2018.

  38. arXiv:1811.07674  [pdf, other]

    cs.LG stat.ML

    An Adaptive Oversampling Learning Method for Class-Imbalanced Fault Diagnostics and Prognostics

    Authors: Wenfang Lin, Zhenyu Wu, Yang Ji

    Abstract: Data-driven fault diagnostics and prognostics suffers from class-imbalance problem in industrial systems and it raises challenges to common machine learning algorithms as it becomes difficult to learn the features of the minority class samples. Synthetic oversampling methods are commonly used to tackle these problems by generating the minority class samples to balance the distributions between maj…

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: 8 pages

  39. arXiv:1806.04854  [pdf, other]

    stat.ML cs.AI cs.LG stat.CO

    Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam

    Authors: Mohammad Emtiyaz Khan, Didrik Nielsen, Voot Tangkaratt, Wu Lin, Yarin Gal, Akash Srivastava

    Abstract: Uncertainty computation in deep learning is essential to design robust and reliable systems. Variational inference (VI) is a promising approach for such computation, but requires more effort to implement and execute compared to maximum-likelihood methods. In this paper, we propose new natural-gradient algorithms to reduce such efforts for Gaussian mean-field VI. Our algorithms can be implemented w…

    Submitted 2 August, 2018; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: Camera ready version

    Journal ref: Thirty-fifth International Conference on Machine Learning, 2018

  40. arXiv:1803.05589  [pdf, other]

    stat.ML

    Variational Message Passing with Structured Inference Networks

    Authors: Wu Lin, Nicolas Hubacher, Mohammad Emtiyaz Khan

    Abstract: Recent efforts on combining deep models with probabilistic graphical models are promising in providing flexible models that are also easy to interpret. We propose a variational message-passing algorithm for variational inference in such models. We make three contributions. First, we propose structured inference networks that incorporate the structure of the graphical model in the inference network…

    Submitted 14 June, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: Added a missing term in the gradient of the lower bound

    Journal ref: ICLR 2018

  41. arXiv:1803.03289  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Neural Network Compression with Single and Multiple Level Quantization

    Authors: Yuhui Xu, Yongzhuang Wang, Aojun Zhou, Weiyao Lin, Hongkai Xiong

    Abstract: Network quantization is an effective solution to compress deep neural networks for practical usage. Existing network quantization methods cannot sufficiently exploit the depth information to generate low-bit compressed network. In this paper, we propose two novel network quantization approaches, single-level network quantization (SLQ) for high-bit quantization and multi-level network quantization…

    Submitted 15 December, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: Published in AAAI18. Code is available at https://github.com/yuhuixu1993/SLQ

  42. Beyond Keywords and Relevance: A Personalized Ad Retrieval Framework in E-Commerce Sponsored Search

    Authors: Su Yan, Wei Lin, Tianshu Wu, Daorui Xiao, Xu Zheng, Bo Wu, Kaipeng Liu

    Abstract: On most sponsored search platforms, advertisers bid on some keywords for their advertisements (ads). Given a search request, ad retrieval module rewrites the query into bidding keywords, and uses these keywords as keys to select Top N ads through inverted indexes. In this way, an ad will not be retrieved even if queries are related when the advertiser does not bid on corresponding keywords. Moreov…

    Submitted 23 April, 2018; v1 submitted 28 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 2018 World Wide Web Conference Pages 1919-1928

  43. arXiv:1711.05560  [pdf, other]

    stat.ML cs.LG

    Variational Adaptive-Newton Method for Explorative Learning

    Authors: Mohammad Emtiyaz Khan, Wu Lin, Voot Tangkaratt, Zuozhu Liu, Didrik Nielsen

    Abstract: We present the Variational Adaptive Newton (VAN) method which is a black-box optimization method especially suitable for explorative-learning tasks such as active learning and reinforcement learning. Similar to Bayesian methods, VAN estimates a distribution that can be used for exploration, but requires computations that are similar to continuous optimization methods. Our theoretical contribution…

    Submitted 15 November, 2017; originally announced November 2017.

  44. arXiv:1710.02704  [pdf, ps, other]

    stat.ME math.ST stat.ML

    Nonsparse learning with latent variables

    Authors: Zemin Zheng, Jinchi Lv, Wei Lin

    Abstract: As a popular tool for producing meaningful and interpretable models, large-scale sparse learning works efficiently when the underlying structures are indeed or close to sparse. However, naively applying the existing regularization methods can result in misleading outcomes due to model misspecification. In particular, the direct sparsity assumption on coefficient vectors has been questioned in real…

    Submitted 7 October, 2017; originally announced October 2017.

    Comments: 30 pages

    MSC Class: 62J

  45. arXiv:1704.08349  [pdf, other]

    stat.ME stat.ML

    SOFAR: large-scale association network learning

    Authors: Yoshimasa Uematsu, Yingying Fan, Kun Chen, Jinchi Lv, Wei Lin

    Abstract: Many modern big data applications feature large scale in both numbers of responses and predictors. Better statistical efficiency and scientific insights can be enabled by understanding the large-scale response-predictor association network structures via layers of sparse latent factors ranked by importance. Yet sparsity and orthogonality have been two largely incompatible goals. To accommodate bot…

    Submitted 26 April, 2017; originally announced April 2017.

  46. arXiv:1608.08225  [pdf, other]

    cond-mat.dis-nn cs.LG cs.NE stat.ML

    Why does deep and cheap learning work so well?

    Authors: Henry W. Lin, Max Tegmark, David Rolnick

    Abstract: We show how the success of deep learning could depend not only on mathematics but also on physics: although well-known mathematical theorems guarantee that neural networks can approximate arbitrary functions well, the class of functions of practical interest can frequently be approximated through "cheap learning" with exponentially fewer parameters than generic ones. We explore how properties freq…

    Submitted 3 August, 2017; v1 submitted 29 August, 2016; originally announced August 2016.

    Comments: Replaced to match version published in Journal of Statistical Physics: https://link.springer.com/article/10.1007/s10955-017-1836-5 Improved refs & discussion, typos fixed. 16 pages, 3 figs

  47. arXiv:1601.04397  [pdf, ps, other]

    stat.ME

    Large Covariance Estimation for Compositional Data via Composition-Adjusted Thresholding

    Authors: Yuanpei Cao, Wei Lin, Hongzhe Li

    Abstract: High-dimensional compositional data arise naturally in many applications such as metagenomic data analysis. The observed data lie in a high-dimensional simplex, and conventional statistical methods often fail to produce sensible results due to the unit-sum constraint. In this article, we address the problem of covariance estimation for high-dimensional compositional data, and introduce a compositi…

    Submitted 17 January, 2016; originally announced January 2016.

  48. arXiv:1511.00146  [pdf, other]

    stat.ML cs.LG stat.CO

    Faster Stochastic Variational Inference using Proximal-Gradient Methods with General Divergence Functions

    Authors: Mohammad Emtiyaz Khan, Reza Babanezhad, Wu Lin, Mark Schmidt, Masashi Sugiyama

    Abstract: Several recent works have explored stochastic gradient methods for variational inference that exploit the geometry of the variational-parameter space. However, the theoretical properties of these methods are not well-understood and these methods typically only apply to conditionally-conjugate models. We present a new stochastic method for variational inference which exploits the geometry of the va…

    Submitted 11 August, 2016; v1 submitted 31 October, 2015; originally announced November 2015.

    Comments: Published in UAI 2016. We have made the following change in this revision: instead of expressing convergence rate results in terms of the iterate difference, we state them in terms of the iterate distance divided by the step-size (a measure of first-order optimality). We also removed some claims about the performance with a fixed step size

  49. arXiv:1501.00738  [pdf, other]

    physics.soc-ph astro-ph.CO cond-mat.stat-mech cs.SI stat.AP

    Zipf's Law from Scale-free Geometry

    Authors: Henry W. Lin, Abraham Loeb

    Abstract: The spatial distribution of people exhibits clustering across a wide range of scales, from household ($\sim 10^{-2}$ km) to continental ($\sim 10^4$ km) scales. Empirical data indicates simple power-law scalings for the size distribution of cities (known as Zipf's law) and the population density fluctuations as a function of scale. Using techniques from random field theory and statistical physics,…

    Submitted 14 February, 2016; v1 submitted 4 January, 2015; originally announced January 2015.

    Comments: 7 pages, 2 figures, accepted for publication in Physical Review E

    Journal ref: Phys. Rev. E 93, 032306 (2016)

  50. Multiscale adaptive smoothing models for the hemodynamic response function in fMRI

    Authors: Jiaping Wang, Hongtu Zhu, Jianqing Fan, Kelly Giovanello, Weili Lin

    Abstract: In the event-related functional magnetic resonance imaging (fMRI) data analysis, there is an extensive interest in accurately and robustly estimating the hemodynamic response function (HRF) and its associated statistics (e.g., the magnitude and duration of the activation). Most methods to date are developed in the time domain and they have utilized almost exclusively the temporal information of fM…

    Submitted 20 December, 2013; originally announced December 2013.

    Comments: Published at http://dx.doi.org/10.1214/12-AOAS609 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS609

    Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 2, 904-935