Skip to main content

Showing 1–41 of 41 results for author: Qin, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.12783  [pdf, other

    stat.ML cs.LG

    Epanechnikov Variational Autoencoder

    Authors: Tian Qin, Wei-Min Huang

    Abstract: In this paper, we bridge Variational Autoencoders (VAEs) [17] and kernel density estimations (KDEs) [25 ],[23] by approximating the posterior by KDEs and deriving an upper bound of the Kullback-Leibler (KL) divergence in the evidence lower bound (ELBO). The flexibility of KDEs makes the optimization of posteriors in VAEs possible, which not only addresses the limitations of Gaussian latent space i… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2402.04550  [pdf, other

    stat.ML cs.LG

    Riemann-Lebesgue Forest for Regression

    Authors: Tian Qin, Wei-Min Huang

    Abstract: We propose a novel ensemble method called Riemann-Lebesgue Forest (RLF) for regression. The core idea in RLF is to mimic the way how a measurable function can be approximated by partitioning its range into a few intervals. With this idea in mind, we develop a new tree learner named Riemann-Lebesgue Tree (RLT) which has a chance to perform Lebesgue type cutting,i.e splitting the node from response… ▽ More

    Submitted 9 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  3. arXiv:2311.02827  [pdf, other

    stat.ML cs.LG stat.AP

    On Subagging Boosted Probit Model Trees

    Authors: Tian Qin, Wei-Min Huang

    Abstract: With the insight of variance-bias decomposition, we design a new hybrid bagging-boosting algorithm named SBPMT for classification problems. For the boosting part of SBPMT, we propose a new tree model called Probit Model Tree (PMT) as base classifiers in AdaBoost procedure. For the bagging part, instead of subsampling from the dataset at each step of boosting, we perform boosted PMTs on each subagg… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  4. arXiv:2310.07990  [pdf

    q-bio.GN cs.IR cs.LG stat.AP

    Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics

    Authors: Chen Zhao, Kuan-Jui Su, Chong Wu, Xuewei Cao, Qiuying Sha, Wu Li, Zhe Luo, Tian Qin, Chuan Qiu, Lan Juan Zhao, Anqi Liu, Lindong Jiang, Xiao Zhang, Hui Shen, Weihua Zhou, Hong-Wen Deng

    Abstract: Background: Missing data is a common challenge in mass spectrometry-based metabolomics, which can lead to biased and incomplete analyses. The integration of whole-genome sequencing (WGS) data with metabolomics data has emerged as a promising approach to enhance the accuracy of data imputation in metabolomics studies. Method: In this study, we propose a novel method that leverages the information f… ▽ More

    Submitted 12 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 19 pages, 3 figures

  5. arXiv:2212.02125  [pdf, other

    stat.ML cs.AI cs.LG

    TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets

    Authors: Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu

    Abstract: We consider an offline reinforcement learning (RL) setting where the agent need to learn from a dataset collected by rolling out multiple behavior policies. There are two challenges for this setting: 1) The optimal trade-off between optimizing the RL signal and the behavior cloning (BC) signal changes on different states due to the variation of the action coverage induced by different behavior pol… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by ICDM-22 (Best Student Paper Runner-Up Awards)

  6. arXiv:2210.05579  [pdf, other

    cs.GT cs.AI cs.LG stat.ML

    Benefits of Permutation-Equivariance in Auction Mechanisms

    Authors: Tian Qin, Fengxiang He, Dingfeng Shi, Wenbing Huang, Dacheng Tao

    Abstract: Designing an incentive-compatible auction mechanism that maximizes the auctioneer's revenue while minimizes the bidders' ex-post regret is an important yet intricate problem in economics. Remarkable progress has been achieved through learning the optimal auction mechanism by neural networks. In this paper, we consider the popular additive valuation and symmetric valuation setting; i.e., the valuat… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  7. arXiv:2205.12418  [pdf, other

    cs.LG cs.AI stat.ML

    Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret

    Authors: Jiawei Huang, Li Zhao, Tao Qin, Wei Chen, Nan Jiang, Tie-Yan Liu

    Abstract: We propose a new learning framework that captures the tiered structure of many real-world user-interaction applications, where the users can be divided into two groups based on their different tolerance on exploration risks and should be treated separately. In this setting, we simultaneously maintain two policies $π^{\text{O}}$ and $π^{\text{E}}$: $π^{\text{O}}$ ("O" for "online") interacts with m… ▽ More

    Submitted 26 February, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 38 pages; NeurIPS 2022

  8. arXiv:2202.06450  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality

    Authors: Jiawei Huang, **glin Chen, Li Zhao, Tao Qin, Nan Jiang, Tie-Yan Liu

    Abstract: Deployment efficiency is an important criterion for many real-world applications of reinforcement learning (RL). Despite the community's increasing interest, there lacks a formal theoretical formulation for the problem. In this paper, we propose such a formulation for deployment-efficient RL (DE-RL) from an "optimization with constraints" perspective: we are interested in exploring an MDP and obta… ▽ More

    Submitted 30 August, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: 49 Pages; ICLR 2022

  9. arXiv:2106.15962  [pdf, other

    cs.LG cs.AI stat.ML

    On the Generative Utility of Cyclic Conditionals

    Authors: Chang Liu, Haoyue Tang, Tao Qin, **tao Wang, Tie-Yan Liu

    Abstract: We study whether and how can we model a joint distribution $p(x,z)$ using two conditional models $p(x|z)$ and $q(z|x)$ that form a cycle. This is motivated by the observation that deep generative models, in addition to a likelihood model $p(x|z)$, often also use an inference model $q(z|x)$ for extracting representation, but they rely on a usually uninformative prior distribution $p(z)$ to define a… ▽ More

    Submitted 6 November, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: NeurIPS'21 camera-ready version

  10. arXiv:2106.06406  [pdf, other

    stat.ML cs.LG cs.SD eess.AS

    PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior

    Authors: Sang-gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon, Tie-Yan Liu

    Abstract: Denoising diffusion probabilistic models have been recently proposed to generate high-quality samples by estimating the gradient of the data density. The framework defines the prior noise as a standard Gaussian distribution, whereas the corresponding data distribution may be more complicated than the standard Gaussian distribution, which potentially introduces inefficiency in denoising the prior n… ▽ More

    Submitted 20 February, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: ICLR 2022. 19 pages, 7 figures, 8 tables. Audio samples: https://speechresearch.github.io/priorgrad/

  11. arXiv:2011.02203  [pdf, other

    cs.LG stat.ML

    Latent Causal Invariant Model

    Authors: Xinwei Sun, Botong Wu, Xiangyu Zheng, Chang Liu, Wei Chen, Tao Qin, Tie-yan Liu

    Abstract: Current supervised learning can learn spurious correlation during the data-fitting process, imposing issues regarding interpretability, out-of-distribution (OOD) generalization, and robustness. To avoid spurious correlation, we propose a Latent Causal Invariance Model (LaCIM) which pursues causal prediction. Specifically, we introduce latent variables that are separated into (a) output-causative f… ▽ More

    Submitted 27 April, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

  12. arXiv:2011.01681  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Causal Semantic Representation for Out-of-Distribution Prediction

    Authors: Chang Liu, Xinwei Sun, **dong Wang, Haoyue Tang, Tao Li, Tao Qin, Wei Chen, Tie-Yan Liu

    Abstract: Conventional supervised learning methods, especially deep ones, are found to be sensitive to out-of-distribution (OOD) examples, largely because the learned representation mixes the semantic factor with the variation factor due to their domain-specific correlation, while only the semantic factor causes the output. To address the problem, we propose a Causal Semantic Generative model (CSG) based on… ▽ More

    Submitted 1 November, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: NeurIPS'21 camera-ready version

  13. arXiv:2007.10791  [pdf, other

    cs.LG cs.CV stat.ML

    Learning to Match Distributions for Domain Adaptation

    Authors: Chaohui Yu, **dong Wang, Chang Liu, Tao Qin, Renjun Xu, Wenjie Feng, Yiqiang Chen, Tie-Yan Liu

    Abstract: When the training and test data are from different distributions, domain adaptation is needed to reduce dataset bias to improve the model's generalization ability. Since it is difficult to directly match the cross-domain joint distributions, existing methods tend to reduce the marginal or conditional distribution divergence using predefined distances such as MMD and adversarial-based discrepancies… ▽ More

    Submitted 26 July, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Preprint. 20 Pages. Code available at https://github.com/**dongwang/transferlearning/tree/master/code/deep/Learning-to-Match

  14. arXiv:2007.04785  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Accuracy Prediction with Non-neural Model for Neural Architecture Search

    Authors: Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Neural architecture search (NAS) with an accuracy predictor that predicts the accuracy of candidate architectures has drawn increasing attention due to its simplicity and effectiveness. Previous works usually employ neural network-based predictors which require more delicate design and are easy to overfit. Considering that most architectures are represented as sequences of discrete symbols which a… ▽ More

    Submitted 19 July, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Code is available at https://github.com/renqianluo/GBDT-NAS

  15. arXiv:2007.04649  [pdf, other

    cs.LG stat.ML

    Learning to Reweight with Deep Interactions

    Authors: Yang Fan, Yingce Xia, Lijun Wu, Shufang Xie, Weiqing Liu, Jiang Bian, Tao Qin, Xiang-Yang Li

    Abstract: Recently, the concept of teaching has been introduced into machine learning, in which a teacher model is used to guide the training of a student model (which will be used in real tasks) through data selection, loss function design, etc. Learning to reweight, which is a specific kind of teaching that reweights training data using a teacher model, receives much attention due to its simplicity and ef… ▽ More

    Submitted 12 January, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted to AAAI-2021

  16. arXiv:2005.08238  [pdf, other

    cs.LG cs.CL stat.ML

    Dual Learning: Theoretical Study and an Algorithmic Extension

    Authors: Zhibing Zhao, Yingce Xia, Tao Qin, Lirong Xia, Tie-Yan Liu

    Abstract: Dual learning has been successfully applied in many machine learning applications including machine translation, image-to-image transformation, etc. The high-level idea of dual learning is very intuitive: if we map an $x$ from one domain to another and then map it back, we should recover the original $x$. Although its effectiveness has been empirically verified, theoretical understanding of dual l… ▽ More

    Submitted 17 May, 2020; originally announced May 2020.

    Comments: 11 pages, 2 figures

  17. arXiv:2002.10389  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Neural Architecture Search

    Authors: Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Neural architecture search (NAS) relies on a good controller to generate better architectures or predict the accuracy of given architectures. However, training the controller requires both abundant and high-quality pairs of architectures and their accuracy, while it is costly to evaluate an architecture and obtain its accuracy. In this paper, we propose SemiNAS, a semi-supervised NAS approach that… ▽ More

    Submitted 3 November, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: NeurIPS 2020

  18. arXiv:2002.04658  [pdf, other

    cs.LG cs.CV stat.ML

    A Non-Intrusive Correction Algorithm for Classification Problems with Corrupted Data

    Authors: Jun Hou, Tong Qin, Kailiang Wu, Dongbin Xiu

    Abstract: A novel correction algorithm is proposed for multi-class classification problems with corrupted training data. The algorithm is non-intrusive, in the sense that it post-processes a trained classification model by adding a correction procedure to the model prediction. The correction procedure can be coupled with any approximators, such as logistic regression, neural networks of various architecture… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  19. arXiv:1911.08717  [pdf, other

    cs.LG stat.ML

    Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation

    Authors: Junliang Guo, Xu Tan, Linli Xu, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Non-autoregressive translation (NAT) models remove the dependence on previous target tokens and generate all target tokens in parallel, resulting in significant inference speedup but at the cost of inferior translation accuracy compared to autoregressive translation (AT) models. Considering that AT models have higher accuracy and are easier to train than NAT models, and both of them share the same… ▽ More

    Submitted 21 November, 2019; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: AAAI 2020

  20. arXiv:1911.06191  [pdf, other

    cs.CL cs.LG stat.ML

    Microsoft Research Asia's Systems for WMT19

    Authors: Yingce Xia, Xu Tan, Fei Tian, Fei Gao, Weicong Chen, Yang Fan, Linyuan Gong, Yichong Leng, Renqian Luo, Yiren Wang, Lijun Wu, **hua Zhu, Tao Qin, Tie-Yan Liu

    Abstract: We Microsoft Research Asia made submissions to 11 language directions in the WMT19 news translation tasks. We won the first place for 8 of the 11 directions and the second place for the other three. Our basic systems are built on Transformer, back translation and knowledge distillation. We integrate several of our rececent techniques to enhance the baseline systems: multi-agent dual learning (MADL… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Accepted to "Fourth Conference on Machine Translation (WMT19)"

  21. arXiv:1911.02140  [pdf, other

    cs.LG cs.AI stat.ML

    Fully Parameterized Quantile Function for Distributional Reinforcement Learning

    Authors: Derek Yang, Li Zhao, Zichuan Lin, Tao Qin, Jiang Bian, Tieyan Liu

    Abstract: Distributional Reinforcement Learning (RL) differs from traditional RL in that, rather than the expectation of total returns, it estimates distributions and has achieved state-of-the-art performance on Atari Games. The key challenge in practical distributional RL algorithms lies in how to parameterize estimated distributions so as to better approximate the true continuous distribution. Existing di… ▽ More

    Submitted 2 August, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019. Code at https://github.com/microsoft/FQF

  22. arXiv:1909.10815  [pdf, other

    cs.LG cs.NE stat.ML

    Balanced One-shot Neural Architecture Optimization

    Authors: Renqian Luo, Tao Qin, Enhong Chen

    Abstract: The ability to rank candidate architectures is the key to the performance of neural architecture search~(NAS). One-shot NAS is proposed to reduce the expense but shows inferior performance against conventional NAS and is not adequately stable. We investigate into this and find that the ranking correlation between architectures under one-shot training and the ones under stand-alone full training is… ▽ More

    Submitted 31 March, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Code and model checkpoints are publicly available at https://github.com/renqianluo/NAO_pytorch

  23. arXiv:1909.06708  [pdf, other

    cs.CL cs.LG stat.ML

    Hint-Based Training for Non-Autoregressive Machine Translation

    Authors: Zhuohan Li, Zi Lin, Di He, Fei Tian, Tao Qin, Liwei Wang, Tie-Yan Liu

    Abstract: Due to the unparallelizable nature of the autoregressive factorization, AutoRegressive Translation (ART) models have to generate tokens sequentially during decoding and thus suffer from high inference latency. Non-AutoRegressive Translation (NART) models were proposed to reduce the inference time, but could only achieve inferior translation accuracy. In this paper, we proposed a novel approach to… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: EMNLP-IJCNLP 2019

  24. arXiv:1906.02762  [pdf, other

    cs.LG cs.CL stat.ML

    Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View

    Authors: Yi** Lu, Zhuohan Li, Di He, Zhiqing Sun, Bin Dong, Tao Qin, Liwei Wang, Tie-Yan Liu

    Abstract: The Transformer architecture is widely used in natural language processing. Despite its success, the design principle of the Transformer remains elusive. In this paper, we provide a novel perspective towards understanding the architecture: we show that the Transformer can be mathematically interpreted as a numerical Ordinary Differential Equation (ODE) solver for a convection-diffusion equation in… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  25. arXiv:1905.10396  [pdf, other

    math.NA cs.LG math.DS physics.comp-ph stat.ML

    Structure-preserving Method for Reconstructing Unknown Hamiltonian Systems from Trajectory Data

    Authors: Kailiang Wu, Tong Qin, Dongbin Xiu

    Abstract: We present a numerical approach for approximating unknown Hamiltonian systems using observation data. A distinct feature of the proposed method is that it is structure-preserving, in the sense that it enforces conservation of the reconstructed Hamiltonian. This is achieved by directly approximating the underlying unknown Hamiltonian, rather than the right-hand-side of the governing equations. We p… ▽ More

    Submitted 19 August, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: 27 pages, 19 figures

    Journal ref: SIAM Journal on Scientific Computing 42 (6), A3704--A3729, 2020

  26. arXiv:1902.10245  [pdf, other

    cs.CL cs.LG stat.ML

    Non-Autoregressive Machine Translation with Auxiliary Regularization

    Authors: Yiren Wang, Fei Tian, Di He, Tao Qin, ChengXiang Zhai, Tie-Yan Liu

    Abstract: As a new neural machine translation approach, Non-Autoregressive machine Translation (NAT) has attracted attention recently due to its high efficiency in inference. However, the high efficiency has come at the cost of not capturing the sequential dependency on the target side of translation, which causes NAT to suffer from two kinds of translation errors: 1) repeated translations (due to indisting… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: AAAI 2019

  27. arXiv:1811.05537  [pdf, other

    math.NA cs.LG cs.NE math.DS stat.ML

    Data Driven Governing Equations Approximation Using Deep Neural Networks

    Authors: Tong Qin, Kailiang Wu, Dongbin Xiu

    Abstract: We present a numerical framework for approximating unknown governing equations using observation data and deep neural networks (DNN). In particular, we propose to use residual network (ResNet) as the basic building block for equation approximation. We demonstrate that the ResNet block can be considered as a one-step method that is exact in temporal integration. We then present two multi-step metho… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

  28. arXiv:1810.12081  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to Teach with Dynamic Loss Functions

    Authors: Lijun Wu, Fei Tian, Yingce Xia, Yang Fan, Tao Qin, Jianhuang Lai, Tie-Yan Liu

    Abstract: Teaching is critical to human society: it is with teaching that prospective students are educated and human civilization can be inherited and advanced. A good teacher not only provides his/her students with qualified teaching materials (e.g., textbooks), but also sets up appropriate learning objectives (e.g., course projects and exams) considering different situations of a student. When it comes t… ▽ More

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: NIPS 2018

  29. arXiv:1809.06858  [pdf, other

    cs.CL cs.LG stat.ML

    FRAGE: Frequency-Agnostic Word Representation

    Authors: Chengyue Gong, Di He, Xu Tan, Tao Qin, Liwei Wang, Tie-Yan Liu

    Abstract: Continuous word representation (aka word embedding) is a basic building block in many neural network-based models used in natural language processing tasks. Although it is widely accepted that words with similar semantics should be close to each other in the embedding space, we find that word embeddings learned in several tasks are biased towards word frequency: the embeddings of high-frequency an… ▽ More

    Submitted 17 March, 2020; v1 submitted 18 September, 2018; originally announced September 2018.

    Comments: To appear in NIPS 2018

  30. arXiv:1808.08866  [pdf, other

    cs.LG cs.AI stat.ML

    A Study of Reinforcement Learning for Neural Machine Translation

    Authors: Lijun Wu, Fei Tian, Tao Qin, Jianhuang Lai, Tie-Yan Liu

    Abstract: Recent studies have shown that reinforcement learning (RL) is an effective approach for improving the performance of neural machine translation (NMT) system. However, due to its instability, successfully RL training is challenging, especially in real-world systems where deep models and large datasets are leveraged. In this paper, taking several large-scale translation tasks as testbeds, we conduct… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  31. arXiv:1808.07233  [pdf, other

    cs.LG stat.ML

    Neural Architecture Optimization

    Authors: Renqian Luo, Fei Tian, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Automatic neural architecture design has shown its potential in discovering powerful neural network architectures. Existing methods, no matter based on reinforcement learning or evolutionary algorithms (EA), conduct architecture search in a discrete space, which is highly inefficient. In this paper, we propose a simple and efficient method to automatic neural architecture design based on continuou… ▽ More

    Submitted 4 September, 2019; v1 submitted 22 August, 2018; originally announced August 2018.

    Comments: NeurIPS 2018. Code available at: https://github.com/renqianluo/NAO

  32. arXiv:1806.02988  [pdf, other

    cs.LG cs.CL stat.ML

    Towards Binary-Valued Gates for Robust LSTM Training

    Authors: Zhuohan Li, Di He, Fei Tian, Wei Chen, Tao Qin, Liwei Wang, Tie-Yan Liu

    Abstract: Long Short-Term Memory (LSTM) is one of the most widely used recurrent structures in sequence modeling. It aims to use gates to control information flow (e.g., whether to skip some information or not) in the recurrent computations, although its practical implementation based on soft gates only partially achieves this goal. In this paper, we propose a new way for LSTM training, which pushes the out… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  33. arXiv:1805.08340  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Reducing Parameter Space for Neural Network Training

    Authors: Tong Qin, Ling Zhou, Dongbin Xiu

    Abstract: For neural networks (NNs) with rectified linear unit (ReLU) or binary activation functions, we show that their training can be accomplished in a reduced parameter space. Specifically, the weights in each neuron can be trained on the unit sphere, as opposed to the entire space, and the threshold can be trained in a bounded interval, as opposed to the real line. We show that the NNs in the reduced p… ▽ More

    Submitted 29 January, 2020; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: 17 pages, 8 figures

  34. arXiv:1707.00415  [pdf, other

    cs.LG stat.ML

    Dual Supervised Learning

    Authors: Yingce Xia, Tao Qin, Wei Chen, Jiang Bian, Nenghai Yu, Tie-Yan Liu

    Abstract: Many supervised learning tasks are emerged in dual forms, e.g., English-to-French translation vs. French-to-English translation, speech recognition vs. text to speech, and image classification vs. image generation. Two dual tasks have intrinsic connections with each other due to the probabilistic correlation between their models. This connection is, however, not effectively utilized today, since p… ▽ More

    Submitted 3 July, 2017; originally announced July 2017.

    Comments: ICML 2017

  35. arXiv:1704.06933  [pdf, other

    cs.CL cs.LG stat.ML

    Adversarial Neural Machine Translation

    Authors: Lijun Wu, Yingce Xia, Li Zhao, Fei Tian, Tao Qin, Jianhuang Lai, Tie-Yan Liu

    Abstract: In this paper, we study a new learning paradigm for Neural Machine Translation (NMT). Instead of maximizing the likelihood of the human translation as in previous works, we minimize the distinction between human translation and the translation given by an NMT model. To achieve this goal, inspired by the recent success of generative adversarial networks (GANs), we employ an adversarial training arc… ▽ More

    Submitted 30 September, 2018; v1 submitted 20 April, 2017; originally announced April 2017.

    Comments: ACML 2018

  36. arXiv:1702.08635  [pdf, other

    cs.LG cs.AI stat.ML

    Learning What Data to Learn

    Authors: Yang Fan, Fei Tian, Tao Qin, Jiang Bian, Tie-Yan Liu

    Abstract: Machine learning is essentially the sciences of playing with data. An adaptive data selection strategy, enabling to dynamically choose different data at various training stages, can reach a more effective model in a more efficient way. In this paper, we propose a deep reinforcement learning framework, which we call \emph{\textbf{N}eural \textbf{D}ata \textbf{F}ilter} (\textbf{NDF}), to explore aut… ▽ More

    Submitted 27 February, 2017; originally announced February 2017.

    Comments: A preliminary version will appear in ICLR 2017, workshop track. https://openreview.net/forum?id=SyJNmVqgg&noteId=SyJNmVqgg

  37. arXiv:1701.03162  [pdf, other

    stat.AP cs.AI cs.LG

    Real-time eSports Match Result Prediction

    Authors: Yifan Yang, Tian Qin, Yu-Heng Lei

    Abstract: In this paper, we try to predict the winning team of a match in the multiplayer eSports game Dota 2. To address the weaknesses of previous work, we consider more aspects of prior (pre-match) features from individual players' match history, as well as real-time (during-match) features at each minute as the match progresses. We use logistic regression, the proposed Attribute Sequence Model, and thei… ▽ More

    Submitted 10 December, 2016; originally announced January 2017.

    Comments: 8 pages, 8 figures

    ACM Class: I.2.1

  38. arXiv:1309.4111  [pdf, other

    stat.ML cs.LG math.ST

    Regularized Spectral Clustering under the Degree-Corrected Stochastic Blockmodel

    Authors: Tai Qin, Karl Rohe

    Abstract: Spectral clustering is a fast and popular algorithm for finding clusters in networks. Recently, Chaudhuri et al. (2012) and Amini et al.(2012) proposed inspired variations on the algorithm that artificially inflate the node degrees for improved statistical performance. The current paper extends the previous statistical estimation results to the more canonical spectral clustering algorithm in a way… ▽ More

    Submitted 16 September, 2013; originally announced September 2013.

  39. arXiv:1307.2302  [pdf, other

    stat.ML

    The blessing of transitivity in sparse and stochastic networks

    Authors: Karl Rohe, Tai Qin

    Abstract: The interaction between transitivity and sparsity, two common features in empirical networks, implies that there are local regions of large sparse networks that are dense. We call this the blessing of transitivity and it has consequences for both modeling and inference. Extant research suggests that statistical inference for the Stochastic Blockmodel is more difficult when the edges are sparse. Ho… ▽ More

    Submitted 1 August, 2013; v1 submitted 8 July, 2013; originally announced July 2013.

  40. arXiv:1206.2380  [pdf, other

    math.ST stat.ME

    The Highest Dimensional Stochastic Blockmodel with a Regularized Estimator

    Authors: Karl Rohe, Tai Qin, Haoyang Fan

    Abstract: In the high dimensional Stochastic Blockmodel for a random network, the number of clusters (or blocks) K grows with the number of nodes N. Two previous studies have examined the statistical estimation performance of spectral clustering and the maximum likelihood estimator under the high dimensional model; neither of these results allow K to grow faster than N^{1/2}. We study a model where, ignorin… ▽ More

    Submitted 1 August, 2013; v1 submitted 11 June, 2012; originally announced June 2012.

  41. arXiv:1204.2296  [pdf, other

    stat.ML math.ST

    Co-clustering for directed graphs: the Stochastic co-Blockmodel and spectral algorithm Di-Sim

    Authors: Karl Rohe, Tai Qin, Bin Yu

    Abstract: Directed graphs have asymmetric connections, yet the current graph clustering methodologies cannot identify the potentially global structure of these asymmetries. We give a spectral algorithm called di-sim that builds on a dual measure of similarity that correspond to how a node (i) sends and (ii) receives edges. Using di-sim, we analyze the global asymmetries in the networks of Enron emails, poli… ▽ More

    Submitted 8 January, 2015; v1 submitted 10 April, 2012; originally announced April 2012.