
Showing 1–42 of 42 results for author: Yao, L

  1. arXiv:2406.05855  [pdf, other]

    cs.LG cs.AI stat.ML

    Self-Distilled Disentangled Learning for Counterfactual Prediction

    Authors: Xinshu Li, Mingming Gong, Lina Yao

    Abstract: The advancements in disentangled representation learning significantly enhance the accuracy of counterfactual predictions by granting precise control over instrumental variables, confounders, and adjustable variables. An appealing method for achieving the independent separation of these factors is mutual information minimization, a task that presents challenges in numerous machine learning scenari…

    Submitted 14 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2405.09810  [pdf, other]

    stat.ME

    Trajectory-Based Individualized Treatment Rules

    Authors: Lanqiu Yao, Thaddeus Tarpey

    Abstract: A core component of precision medicine research involves optimizing individualized treatment rules (ITRs) based on patient characteristics. Many studies used to estimate ITRs are longitudinal in nature, collecting outcomes over time. Yet, to date, methods developed to estimate ITRs often ignore the longitudinal structure of the data. Information available from the longitudinal nature of the data c…

    Submitted 27 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2402.15301  [pdf, other]

    cs.CL cs.LG stat.ME

    Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models

    Authors: Yuzhe Zhang, Yipeng Zhang, Yidong Gan, Lina Yao, Chen Wang

    Abstract: Causal graph recovery is traditionally done using statistical estimation-based methods or based on individual's knowledge about variables of interests. They often suffer from data collection biases and limitations of individuals' knowledge. The advance of large language models (LLMs) provides opportunities to address these problems. We propose a novel method that leverages LLMs to deduce causal re…

    Submitted 18 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  4. arXiv:2402.01827  [pdf, other]

    stat.ME

    Extracting Scalar Measures from Curves

    Authors: Lanqiu Yao, Thaddeus Tarpey

    Abstract: The ability to order outcomes is necessary to make comparisons which is complicated when there is no natural ordering on the space of outcomes, as in the case of functional outcomes. This paper examines methods for extracting a scalar summary from functional or longitudinal outcomes based on an average rate of change which can be used to compare curves. Common approaches used in practice use a cha…

    Submitted 2 February, 2024; originally announced February 2024.

  5. arXiv:2401.14549  [pdf, other]

    stat.ME

    Privacy-preserving Quantile Treatment Effect Estimation for Randomized Controlled Trials

    Authors: Leon Yao, Paul Yiming Li, Jiannan Lu

    Abstract: In accordance with the principle of "data minimization", many internet companies are opting to record less data. However, this is often at odds with A/B testing efficacy. For experiments with units with multiple observations, one popular data minimizing technique is to aggregate data for each unit. However, exact quantile estimation requires the full observation-level data. In this paper, we devel…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to 2023 CODE conference as a parallel presentation

  6. arXiv:2306.07652  [pdf]

    stat.AP q-bio.TO

    Inactivated COVID-19 Vaccination did not affect In vitro fertilization (IVF) / Intra-Cytoplasmic Sperm Injection (ICSI) cycle outcomes

    Authors: Qi Wan, Ying Ling Yao, XingYu Lv, Li Hong Geng, Yue Wang, Enoch Appiah Adu-Gyamfi, Xue Jiao Wang, Yue Qian, Juan Yang, Ming Xing Chend, Zhao Hui Zhong, Yuan Li, Yu Bin Ding

    Abstract: Background: The objective of this study is to evaluate the impact of COVID-19 inactivated vaccine administration on the outcomes of in vitro fertilization (IVF) and intracytoplasmic sperm injection (ICSI) cycles in infertile couples in China. Methods: We collected data from the CYART prospective cohort, which included couples undergoing IVF treatment from January 2021 to September 2022 at Sichuan…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 26 pages, 4 figures and 5 tables

  7. arXiv:2301.02225  [pdf, ps, other]

    stat.ML math.ST q-bio.QM

    $l_{1-2}$ GLasso: $L_{1-2}$ Regularized Multi-task Graphical Lasso for Joint Estimation of eQTL Mapping and Gene Network

    Authors: Wei Miao, Lan Yao

    Abstract: A critical problem in genetics is to discover how gene expression is regulated within cells. Two major tasks of regulatory association learning are: (i) identifying SNP-gene relationships, known as eQTL mapping, and (ii) determining gene-gene relationships, known as gene network estimation. To share information between these two tasks, we focus on the unified model for joint estimation of eQTL ma…

    Submitted 4 January, 2023; originally announced January 2023.

  8. arXiv:2208.06748  [pdf, other]

    cs.LG stat.ME

    Learning to Infer Counterfactuals: Meta-Learning for Estimating Multiple Imbalanced Treatment Effects

    Authors: Guanglin Zhou, Lina Yao, Xiwei Xu, Chen Wang, Liming Zhu

    Abstract: We regularly consider answering counterfactual questions in practice, such as "Would people with diabetes take a turn for the better had they chosen another medication?". Observational studies are growing in significance in answering such questions due to their widespread accumulation and comparatively easier acquisition than Randomized Control Trials (RCTs). Recently, some works have introduced r…

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: 11 pages

  9. arXiv:2206.04907  [pdf, other]

    cs.LG stat.ME

    Efficient Heterogeneous Treatment Effect Estimation With Multiple Experiments and Multiple Outcomes

    Authors: Leon Yao, Caroline Lo, Israel Nir, Sarah Tan, Ariel Evnine, Adam Lerer, Alex Peysakhovich

    Abstract: Learning heterogeneous treatment effects (HTEs) is an important problem across many fields. Most existing methods consider the setting with a single treatment arm and a single outcome metric. However, in many real world domains, experiments are run consistently - for example, in internet companies, A/B tests are run every day to measure the impacts of potential changes across many different metric…

    Submitted 10 June, 2022; originally announced June 2022.

  10. arXiv:2203.03978  [pdf, other]

    cs.LG stat.ML

    Contrastive Conditional Neural Processes

    Authors: Zesheng Ye, Lina Yao

    Abstract: Conditional Neural Processes (CNPs) bridge neural networks with probabilistic inference to approximate functions of Stochastic Processes under meta-learning settings. Given a batch of non-i.i.d function instantiations, CNPs are jointly optimized for in-instantiation observation prediction and cross-instantiation meta-representation adaptation within a generative reconstruction pipeline. Ther…

    Submitted 25 March, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: accepted to CVPR2022

  11. arXiv:2203.03523  [pdf, other]

    stat.ME

    A Single Index Model for Longitudinal Outcomes to Optimize Individual Treatment Decision Rules

    Authors: Lanqiu Yao, Thaddeus Tarpey

    Abstract: A pressing challenge in medical research is to identify optimal treatments for individual patients. This is particularly challenging in mental health settings where mean responses are often similar across multiple treatments. For example, the mean longitudinal trajectories for patients treated with an active drug and placebo may be very similar but different treatments may exhibit distinctly diffe…

    Submitted 7 March, 2022; originally announced March 2022.

  12. arXiv:2105.00991  [pdf, ps, other]

    cs.IR cs.CL stat.CO

    Context-aware Ensemble of Multifaceted Factorization Models for Recommendation Prediction in Social Networks

    Authors: Yunwen Chen, Zuotao Liu, Daqi Ji, Yingwei Xin, Wenguang Wang, Lu Yao, Yi Zou

    Abstract: This paper describes the solution of Shanda Innovations team to Task 1 of KDD-Cup 2012. A novel approach called Multifaceted Factorization Models is proposed to incorporate a great variety of features in social networks. Social relationships and actions between users are integrated as implicit feedbacks to improve the recommendation accuracy. Keywords, tags, profiles, time and some other features…

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: KDD 2012

  13. arXiv:2011.12182  [pdf, other]

    stat.ME

    A New Algorithm for Convex Biclustering and Its Extension to the Compositional Data

    Authors: Binhuan Wang, Lanqiu Yao, Jiyuan Hu, Huilin Li

    Abstract: Biclustering is a powerful data mining technique that allows simultaneously clustering rows (observations) and columns (features) in a matrix-format data set, which can provide results in a checkerboard-like pattern for visualization and exploratory analysis in a wide array of domains. Multiple biclustering algorithms have been developed in the past two decades, among which the convex biclustering…

    Submitted 8 June, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

  14. Spectrum-Guided Adversarial Disparity Learning

    Authors: Zhe Liu, Lina Yao, Lei Bai, Xianzhi Wang, Can Wang

    Abstract: It has been a significant challenge to portray intraclass disparity precisely in the area of activity recognition, as it requires a robust representation of the correlation between subject-specific variation for each activity class. In this work, we propose a novel end-to-end knowledge directed adversarial learning framework, which portrays the class-conditioned intraclass disparity using two comp…

    Submitted 14 July, 2020; originally announced July 2020.

  15. arXiv:2007.06758  [pdf, other]

    cs.IR cs.LG stat.ML

    Recommender Systems for the Internet of Things: A Survey

    Authors: May Altulyan, Lina Yao, Xianzhi Wang, Chaoran Huang, Salil S Kanhere, Quan Z Sheng

    Abstract: Recommendation represents a vital stage in developing and promoting the benefits of the Internet of Things (IoT). Traditional recommender systems fail to exploit ever-growing, dynamic, and heterogeneous IoT data. This paper presents a comprehensive review of the state-of-the-art recommender systems, as well as related techniques and application in the vibrant field of IoT. We discuss several limit…

    Submitted 13 July, 2020; originally announced July 2020.

  16. arXiv:2007.03183  [pdf, other]

    cs.IR cs.LG stat.ML

    MAMO: Memory-Augmented Meta-Optimization for Cold-start Recommendation

    Authors: Manqing Dong, Feng Yuan, Lina Yao, Xiwei Xu, Liming Zhu

    Abstract: A common challenge for most current recommender systems is the cold-start problem. Due to the lack of user-item interactions, the fine-tuned recommender systems are unable to handle situations with new users or new items. Recently, some works introduce the meta-optimization idea into the recommendation scenarios, i.e. predicting the user preference by only a few of past interacted items. The core…

    Submitted 6 July, 2020; originally announced July 2020.

  17. arXiv:2007.02842  [pdf, other]

    cs.LG stat.ML

    Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting

    Authors: Lei Bai, Lina Yao, Can Li, Xianzhi Wang, Can Wang

    Abstract: Modeling complex spatial and temporal correlations in the correlated time series data is indispensable for understanding the traffic dynamics and predicting the future status of an evolving traffic system. Recent works focus on designing complicated graph neural network architectures to capture shared patterns with the help of pre-defined graphs. In this paper, we argue that learning node-specific…

    Submitted 21 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020

  18. arXiv:2007.00767  [pdf, other]

    cs.LG cs.CV stat.ML

    NP-PROV: Neural Processes with Position-Relevant-Only Variances

    Authors: Xuesong Wang, Lina Yao, Xianzhi Wang, Feiping Nie

    Abstract: Neural Processes (NPs) families encode distributions over functions to a latent representation, given context data, and decode posterior mean and variance at unknown locations. Since mean and variance are derived from the same latent space, they may fail on out-of-domain tasks where fluctuations in function values amplify the model uncertainty. We present a new member named Neural Processes with P…

    Submitted 15 June, 2020; originally announced July 2020.

    Comments: 10 pages, 5 figures

  19. arXiv:2005.05556  [pdf, other]

    cs.LG stat.ML

    Agglomerative Neural Networks for Multi-view Clustering

    Authors: Zhe Liu, Yun Li, Lina Yao, Xianzhi Wang, Feiping Nie

    Abstract: Conventional multi-view clustering methods seek for a view consensus through minimizing the pairwise discrepancy between the consensus and subviews. However, the pairwise comparison cannot portray the inter-view relationship precisely if some of the subviews can be further agglomerated. To address the above challenge, we propose the agglomerative analysis to approximate the optimal consensus view,…

    Submitted 12 May, 2020; originally announced May 2020.

  20. arXiv:2004.13245  [pdf, other]

    cs.LG cs.CL stat.ML

    Deep Conversational Recommender Systems: A New Frontier for Goal-Oriented Dialogue Systems

    Authors: Dai Hoang Tran, Quan Z. Sheng, Wei Emma Zhang, Salma Abdalla Hamad, Munazza Zaib, Nguyen H. Tran, Lina Yao, Nguyen Lu Dang Khoa

    Abstract: In recent years, the emerging topics of recommender systems that take advantage of natural language processing techniques have attracted much attention, and one of their applications is the Conversational Recommender System (CRS). Unlike traditional recommender systems with content-based and collaborative filtering approaches, CRS learns and models user's preferences through interactive dialogue c…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 7 pages, 3 figures, 1 table

  21. Are You A Risk Taker? Adversarial Learning of Asymmetric Cross-Domain Alignment for Risk Tolerance Prediction

    Authors: Zhe Liu, Lina Yao, Xianzhi Wang, Lei Bai, Jake An

    Abstract: Most current studies on survey analysis and risk tolerance modelling lack professional knowledge and domain-specific models. Given the effectiveness of generative adversarial learning in cross-domain information, we design an Asymmetric cross-Domain Generative Adversarial Network (ADGAN) for domain scale inequality. ADGAN utilizes the information-sufficient domain to provide extra information to i…

    Submitted 18 April, 2020; originally announced April 2020.

  22. Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation

    Authors: Xiaocong Chen, Chaoran Huang, Lina Yao, Xianzhi Wang, Wei Liu, Wenjie Zhang

    Abstract: Interactive recommendation aims to learn from dynamic interactions between items and users to achieve responsiveness and accuracy. Reinforcement learning is inherently advantageous for coping with dynamic environments and thus has attracted increasing attention in interactive recommendation research. Inspired by knowledge-aware recommendation, we proposed Knowledge-Guided deep Reinforcement learni…

    Submitted 17 April, 2020; originally announced April 2020.

  23. arXiv:2002.02770  [pdf, other]

    stat.ME cs.AI cs.LG stat.ML

    A Survey on Causal Inference

    Authors: Liuyi Yao, Zhixuan Chu, Sheng Li, Yaliang Li, Jing Gao, Aidong Zhang

    Abstract: Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics, for decades. Nowadays, estimating causal effect from observational data has become an appealing research direction owing to the large amount of available data and low budget requirement, compared with randomized controlled trials. Embraced with the rapidl…

    Submitted 5 February, 2020; originally announced February 2020.

  24. arXiv:1910.04689  [pdf, other]

    stat.ML cs.LG

    Graph Spectral Embedding for Parsimonious Transmission of Multivariate Time Series

    Authors: Lihan Yao, Paul Bendich

    Abstract: We propose a graph spectral representation of time series data that 1) is parsimoniously encoded to user-demanded resolution; 2) is unsupervised and performant in data-constrained scenarios; 3) captures event and event-transition structure within the time series; and 4) has near-linear computational complexity in both signal length and ambient dimension. This representation, which we call Laplacia…

    Submitted 10 October, 2019; originally announced October 2019.

  25. arXiv:1907.13359  [pdf, other]

    cs.LG stat.ML

    Deep Neural Network Hyperparameter Optimization with Orthogonal Array Tuning

    Authors: Xiang Zhang, Xiaocong Chen, Lina Yao, Chang Ge, Manqing Dong

    Abstract: Deep learning algorithms have achieved excellent performance lately in a wide range of fields (e.g., computer vision). However, a severe challenge faced by deep learning is the high dependency on hyper-parameters. The algorithm results may fluctuate dramatically under the different configuration of hyper-parameters. Addressing the above issue, this paper presents an efficient Orthogonal Array Tun…

    Submitted 28 February, 2020; v1 submitted 31 July, 2019; originally announced July 2019.

    Journal ref: Published in ICONIP 2019

  26. arXiv:1905.10760  [pdf, other]

    cs.LG cs.IR stat.ML

    DARec: Deep Domain Adaptation for Cross-Domain Recommendation via Transferring Rating Patterns

    Authors: Feng Yuan, Lina Yao, Boualem Benatallah

    Abstract: Cross-domain recommendation has long been one of the major topics in recommender systems. Recently, various deep models have been proposed to transfer the learned knowledge across domains, but most of them focus on extracting abstract transferable features from auxiliary contents, e.g., images and review texts, and the patterns in the rating matrix itself are rarely touched. In this work, inspired…

    Submitted 26 May, 2019; originally announced May 2019.

  27. arXiv:1905.10069  [pdf, other]

    cs.LG cs.AI stat.ML

    STG2Seq: Spatial-temporal Graph to Sequence Model for Multi-step Passenger Demand Forecasting

    Authors: Lei Bai, Lina Yao, Salil. S Kanhere, Xianzhi Wang, Quan. Z Sheng

    Abstract: Multi-step passenger demand forecasting is a crucial task in on-demand vehicle sharing services. However, predicting passenger demand over multiple time horizons is generally challenging due to the nonlinear and dynamic spatial-temporal dependencies. In this work, we propose to model multi-step citywide passenger demand prediction based on a graph and use a hierarchical graph convolutional structu…

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: 7 pages

  28. arXiv:1905.04042  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Prototype Propagation Networks (PPN) for Weakly-supervised Few-shot Learning on Category Graph

    Authors: Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Lina Yao, Chengqi Zhang

    Abstract: A variety of machine learning applications expect to achieve rapid learning from a limited number of labeled data. However, the success of most current models is the result of heavy training on big data. Meta-learning addresses this problem by extracting common knowledge across different tasks that can be quickly adapted to new tasks. However, they do not fully explore weakly-supervised informatio…

    Submitted 2 June, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: Accepted to IJCAI 2019, Code is publicly available at: https://github.com/liulu112601/PPN

  29. Adversarial Variational Embedding for Robust Semi-supervised Learning

    Authors: Xiang Zhang, Lina Yao, Feng Yuan

    Abstract: Semi-supervised learning is sought for leveraging the unlabelled data when labelled data is difficult or expensive to acquire. Deep generative models (e.g., Variational Autoencoder (VAE)) and semisupervised Generative Adversarial Networks (GANs) have recently shown promising performance in semi-supervised classification for the excellent discriminative representing ability. However, the latent cod…

    Submitted 7 May, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 9 pages, Accepted by Research Track in KDD 2019

  30. arXiv:1904.10281  [pdf, other]

    cs.LG cs.CL stat.ML

    Quaternion Knowledge Graph Embeddings

    Authors: Shuai Zhang, Yi Tay, Lina Yao, Qi Liu

    Abstract: In this work, we move beyond the traditional complex-valued representations, introducing more expressive hypercomplex representations to model entities and relations for knowledge graph embeddings. More specifically, quaternion embeddings, hypercomplex-valued embeddings with three imaginary components, are utilized to represent entities. Relations are modelled as rotations in the quaternion space.…

    Submitted 31 October, 2019; v1 submitted 23 April, 2019; originally announced April 2019.

    Comments: Accepted by NeurIPS 2019

  31. arXiv:1904.01638  [pdf, other]

    cs.CV cs.AI eess.IV stat.ML

    A Strong Baseline for Domain Adaptation and Generalization in Medical Imaging

    Authors: Li Yao, Jordan Prosky, Ben Covington, Kevin Lyman

    Abstract: This work provides a strong baseline for the problem of multi-source multi-target domain adaptation and generalization in medical imaging. Using a diverse collection of ten chest X-ray datasets, we empirically demonstrate the benefits of training medical imaging deep learning models on varied patient populations for generalization to out-of-sample domains.

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: Extended abstract of a journal submission

  32. arXiv:1904.00326  [pdf, other]

    cs.LG cs.AI stat.ML

    MedGCN: Medication recommendation and lab test imputation via graph convolutional networks

    Authors: Chengsheng Mao, Liang Yao, Yuan Luo

    Abstract: Laboratory testing and medication prescription are two of the most important routines in daily clinical practice. Developing an artificial intelligence system that can automatically make lab test imputations and medication recommendations can save costs on potentially redundant lab tests and inform physicians of a more effective prescription. We present an intelligent medical system (named MedGCN)…

    Submitted 3 February, 2022; v1 submitted 30 March, 2019; originally announced April 2019.

    Journal ref: Journal of Biomedical Informatics, Volume 127, 2022

  33. arXiv:1811.02757  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Early Prediction of Acute Kidney Injury in Critical Care Setting Using Clinical Notes

    Authors: Yikuan Li, Liang Yao, Chengsheng Mao, Anand Srivastava, Xiaoqian Jiang, Yuan Luo

    Abstract: Acute kidney injury (AKI) in critically ill patients is associated with significant morbidity and mortality. Development of novel methods to identify patients with AKI earlier will allow for testing of novel strategies to prevent or reduce the complications of AKI. We developed data-driven prediction models to estimate the risk of new AKI onset. We generated models from clinical notes within the f…

    Submitted 9 November, 2018; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: 4 pages, 3 figures, accepted by BIBM 2018

  34. arXiv:1809.08106  [pdf, other]

    cs.LG stat.ML

    Distribution Networks for Open Set Learning

    Authors: Chengsheng Mao, Liang Yao, Yuan Luo

    Abstract: In open set learning, a model must be able to generalize to novel classes when it encounters a sample that does not belong to any of the classes it has seen before. Open set learning poses a realistic learning scenario that is receiving growing attention. Existing studies on open set learning mainly focused on detecting novel classes, but few studies tried to model them for differentiating novel c…

    Submitted 23 November, 2018; v1 submitted 19 September, 2018; originally announced September 2018.

  35. arXiv:1806.08079  [pdf, other]

    cs.LG stat.ML

    GrCAN: Gradient Boost Convolutional Autoencoder with Neural Decision Forest

    Authors: Manqing Dong, Lina Yao, Xianzhi Wang, Boualem Benatallah, Shuai Zhang

    Abstract: Random forest and deep neural network are two schools of effective classification methods in machine learning. While the random forest is robust irrespective of the data domain, the deep neural network has advantages in handling high dimensional data. In view that a differentiable neural decision forest can be added to the neural network to fully exploit the benefits of both models, in our work, w…

    Submitted 24 June, 2018; v1 submitted 21 June, 2018; originally announced June 2018.

  36. arXiv:1802.04407  [pdf, other]

    cs.LG stat.ML

    Adversarially Regularized Graph Autoencoder for Graph Embedding

    Authors: Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, Lina Yao, Chengqi Zhang

    Abstract: Graph embedding is an effective method to represent graph data in a low dimensional space for graph analytics. Most existing embedding algorithms typically focus on preserving the topological structure or minimizing the reconstruction errors of graph data, but they have mostly ignored the data distribution of the latent codes from the graphs, which often results in inferior embedding in real-world…

    Submitted 7 January, 2019; v1 submitted 12 February, 2018; originally announced February 2018.

  37. arXiv:1511.04590  [pdf, other]

    cs.CV cs.CL stat.ML

    Oracle performance for visual captioning

    Authors: Li Yao, Nicolas Ballas, Kyunghyun Cho, John R. Smith, Yoshua Bengio

    Abstract: The task of associating images and videos with a natural language description has attracted a great amount of attention recently. Rapid progress has been made in terms of both developing novel algorithms and releasing new datasets. Indeed, the state-of-the-art results on some of the standard datasets have been pushed into the regime where it has become more and more difficult to make significant i…

    Submitted 14 September, 2016; v1 submitted 14 November, 2015; originally announced November 2015.

    Comments: BMVC2016 (Oral paper)

  38. arXiv:1502.08029  [pdf, other]

    stat.ML cs.AI cs.CL cs.CV cs.LG

    Describing Videos by Exploiting Temporal Structure

    Authors: Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville

    Abstract: Recent progress in using recurrent neural networks (RNNs) for image description has motivated the exploration of their application for video description. However, while images are static, working with videos requires modeling their dynamic temporal structure and then properly integrating that information into a natural language description. In this context, we propose an approach that successfully…

    Submitted 30 September, 2015; v1 submitted 27 February, 2015; originally announced February 2015.

    Comments: Accepted to ICCV15. This version comes with code release and supplementary material

  39. arXiv:1409.0585  [pdf, other]

    stat.ML cs.LG

    On the Equivalence Between Deep NADE and Generative Stochastic Networks

    Authors: Li Yao, Sherjil Ozair, Kyunghyun Cho, Yoshua Bengio

    Abstract: Neural Autoregressive Distribution Estimators (NADEs) have recently been shown as successful alternatives for modeling high dimensional multimodal distributions. One issue associated with NADEs is that they rely on a particular order of factorization for $P(\mathbf{x})$. This issue has been recently addressed by a variant of NADE called Orderless NADEs and its deeper version, Deep Orderless NADE.…

    Submitted 1 September, 2014; originally announced September 2014.

    Comments: ECML/PKDD 2014

  40. arXiv:1406.1485  [pdf, other]

    stat.ML cs.LG

    Iterative Neural Autoregressive Distribution Estimator (NADE-k)

    Authors: Tapani Raiko, Li Yao, Kyunghyun Cho, Yoshua Bengio

    Abstract: Training of the neural autoregressive density estimator (NADE) can be viewed as doing one step of probabilistic inference on missing values in data. We propose a new model that extends this inference scheme to multiple steps, arguing that it is easier to learn to improve a reconstruction in $k$ steps rather than to learn to reconstruct in a single inference step. The proposed model is an unsupervi…

    Submitted 5 December, 2014; v1 submitted 5 June, 2014; originally announced June 2014.

    Comments: Accepted at Neural Information Processing Systems (NIPS) 2014

  41. arXiv:1312.5578  [pdf, other]

    cs.LG stat.ML

    Multimodal Transitions for Generative Stochastic Networks

    Authors: Sherjil Ozair, Li Yao, Yoshua Bengio

    Abstract: Generative Stochastic Networks (GSNs) have been recently introduced as an alternative to traditional probabilistic modeling: instead of parametrizing the data distribution directly, one parametrizes a transition operator for a Markov chain whose stationary distribution is an estimator of the data generating distribution. The result of training is therefore a machine that generates samples through…

    Submitted 24 January, 2014; v1 submitted 19 December, 2013; originally announced December 2013.

    Comments: 7 figures, 9 pages, submitted to ICLR14

  42. arXiv:1301.4293  [pdf, ps, other]

    cs.LG stat.ML

    Latent Relation Representations for Universal Schemas

    Authors: Sebastian Riedel, Limin Yao, Andrew McCallum

    Abstract: Traditional relation extraction predicts relations within some fixed and finite target schema. Machine learning approaches to this task require either manual annotation or, in the case of distant supervision, existing structured sources of the same schema. The need for existing datasets can be avoided by using a universal schema: the union of all involved schemas (surface form predicates as in Ope…

    Submitted 28 January, 2013; v1 submitted 17 January, 2013; originally announced January 2013.

    Comments: 4 pages, ICLR workshop