Skip to main content

Showing 1–44 of 44 results for author: Dai, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.19320  [pdf, other

    cs.LG cs.AI stat.ML

    Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

    Authors: Shicong Cen, **cheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Reinforcement learning from human feedback (RLHF) has demonstrated great promise in aligning large language models (LLMs) with human preference. Depending on the availability of preference data, both online and offline RLHF are active areas of investigation. A key bottleneck is understanding how to incorporate uncertainty estimation in the reward function learned from the preference data for RLHF,… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  2. arXiv:2404.13707  [pdf, other

    stat.ME stat.AP

    Robust inference for the unification of confidence intervals in meta-analysis

    Authors: Wei Liang, Haicheng Huang, Hongsheng Dai, Yinghui Wei

    Abstract: Traditional meta-analysis assumes that the effect sizes estimated in individual studies follow a Gaussian distribution. However, this distributional assumption is not always satisfied in practice, leading to potentially biased results. In the situation when the number of studies, denoted as K, is large, the cumulative Gaussian approximation errors from each study could make the final estimation un… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  3. arXiv:2402.08539  [pdf

    cs.LG stat.AP

    Intelligent Diagnosis of Alzheimer's Disease Based on Machine Learning

    Authors: Mingyang Li, Hongyu Liu, Yixuan Li, Zejun Wang, Yuan Yuan, Honglin Dai

    Abstract: This study is based on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset and aims to explore early detection and disease progression in Alzheimer's disease (AD). We employ innovative data preprocessing strategies, including the use of the random forest algorithm to fill missing data and the handling of outliers and invalid data, thereby fully mining and utilizing these limited data re… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  4. arXiv:2401.05414  [pdf, other

    q-fin.ST cs.LG stat.ME

    On the Three Demons in Causality in Finance: Time Resolution, Nonstationarity, and Latent Factors

    Authors: Xinshuai Dong, Haoyue Dai, Yewen Fan, Songyao **, Sathyamoorthy Rajendran, Kun Zhang

    Abstract: Financial data is generally time series in essence and thus suffers from three fundamental issues: the mismatch in time resolution, the time-varying property of the distribution - nonstationarity, and causal factors that are important but unknown/unobserved. In this paper, we follow a causal perspective to systematically look into these three demons in finance. Specifically, we reexamine these iss… ▽ More

    Submitted 12 January, 2024; v1 submitted 28 December, 2023; originally announced January 2024.

  5. arXiv:2307.01389  [pdf, other

    cs.LG stat.ME

    Identification of Causal Relationship between Amyloid-beta Accumulation and Alzheimer's Disease Progression via Counterfactual Inference

    Authors: Haixing Dai, Mengxuan Hu, Qing Li, Lu Zhang, Lin Zhao, Dajiang Zhu, Ibai Diez, Jorge Sepulcre, Fan Zhang, Xingyu Gao, Manhua Liu, Quanzheng Li, Sheng Li, Tianming Liu, Xiang Li

    Abstract: Alzheimer's disease (AD) is a neurodegenerative disorder that is beginning with amyloidosis, followed by neuronal loss and deterioration in structure, function, and cognition. The accumulation of amyloid-beta in the brain, measured through 18F-florbetapir (AV45) positron emission tomography (PET) imaging, has been widely used for early diagnosis of AD. However, the relationship between amyloid-bet… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  6. arXiv:2305.17010  [pdf, other

    cs.LG cs.AI cs.DM stat.ML

    Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets

    Authors: Dinghuai Zhang, Hanjun Dai, Nikolay Malkin, Aaron Courville, Yoshua Bengio, Ling Pan

    Abstract: Combinatorial optimization (CO) problems are often NP-hard and thus out of reach for exact algorithms, making them a tempting domain to apply machine learning methods. The highly structured constraints in these problems can hinder either optimization or sampling directly in the solution space. On the other hand, GFlowNets have recently emerged as a powerful machinery to efficiently sample from com… ▽ More

    Submitted 20 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted by NeurIPS 2023 as spotlight

  7. arXiv:2301.09801  [pdf, other

    cs.CR cs.CY cs.LG stat.ML

    Heterogeneous Domain Adaptation for IoT Intrusion Detection: A Geometric Graph Alignment Approach

    Authors: Jiashu Wu, Hao Dai, Yang Wang, Kejiang Ye, Chengzhong Xu

    Abstract: Data scarcity hinders the usability of data-dependent algorithms when tackling IoT intrusion detection (IID). To address this, we utilise the data rich network intrusion detection (NID) domain to facilitate more accurate intrusion detection for IID domains. In this paper, a Geometric Graph Alignment (GGA) approach is leveraged to mask the geometric heterogeneities between domains for better intrus… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Internet of Things Journal

  8. arXiv:2211.07767  [pdf, other

    stat.ML cs.LG math.OC

    Learning to Optimize with Stochastic Dominance Constraints

    Authors: Hanjun Dai, Yuan Xue, Niao He, Bethany Wang, Na Li, Dale Schuurmans, Bo Dai

    Abstract: In real-world decision-making, uncertainty is important yet difficult to handle. Stochastic dominance provides a theoretically sound approach for comparing uncertain quantities, but optimization with stochastic dominance constraints is often computationally expensive, which limits practical applicability. In this paper, we develop a simple yet efficient approach for the problem, the Light Stochast… ▽ More

    Submitted 24 February, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted to the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  9. arXiv:2210.11021  [pdf, other

    cs.LG cs.AI stat.ML

    Independence Testing-Based Approach to Causal Discovery under Measurement Error and Linear Non-Gaussian Models

    Authors: Haoyue Dai, Peter Spirtes, Kun Zhang

    Abstract: Causal discovery aims to recover causal structures generating the observational data. Despite its success in certain problems, in many real-world scenarios the observed variables are not the target variables of interest, but the imperfect measures of the target variables. Causal discovery under measurement error aims to recover the causal graph among unobserved target variables from observations m… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: accepted to NeurIPS 2022

  10. arXiv:2203.10452  [pdf, other

    cs.LG cs.PL stat.ML

    CrossBeam: Learning to Search in Bottom-Up Program Synthesis

    Authors: Kensen Shi, Hanjun Dai, Kevin Ellis, Charles Sutton

    Abstract: Many approaches to program synthesis perform a search within an enormous space of programs to find one that satisfies a given specification. Prior works have used neural models to guide combinatorial search algorithms, but such approaches still explore a huge portion of the search space and quickly become intractable as the size of the desired program increases. To tame the search space blowup, we… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Published at ICLR 2022

  11. arXiv:2112.00874  [pdf, other

    cs.LG stat.ML

    Neural Stochastic Dual Dynamic Programming

    Authors: Hanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai

    Abstract: Stochastic dual dynamic programming (SDDP) is a state-of-the-art method for solving multi-stage stochastic optimization, widely used for modeling real-world process optimization tasks. Unfortunately, SDDP has a worst-case complexity that scales exponentially in the number of decision variables, which severely limits applicability to only low dimensional problems. To overcome this limitation, we ex… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 24 pages

  12. arXiv:2110.00637  [pdf, other

    cs.LG stat.ML

    ML4C: Seeing Causality Through Latent Vicinity

    Authors: Haoyue Dai, Rui Ding, Yuanyuan Jiang, Shi Han, Dongmei Zhang

    Abstract: Supervised Causal Learning (SCL) aims to learn causal relations from observational data by accessing previously seen datasets associated with ground truth causal relations. This paper presents a first attempt at addressing a fundamental question: What are the benefits from supervision and how does it benefit? Starting from seeing that SCL is not better than random guessing if the learning target i… ▽ More

    Submitted 16 April, 2023; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: causal discovery, supervised causal learning, vicinity, identifiability, learnability

  13. arXiv:2106.02524  [pdf, other

    cs.CL cs.LG stat.ML

    CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

    Authors: James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, T. Greg McKelvey, Hui Dai, Yi Yang, David Sontag

    Abstract: Continuity of care is crucial to ensuring positive health outcomes for patients discharged from an inpatient hospital setting, and improved information sharing can help. To share information, caregivers write discharge notes containing action items to share with patients and their future caregivers, but these action items are easily lost due to the lengthiness of the documents. In this work, we de… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  14. arXiv:2103.14731  [pdf, other

    cs.LG eess.IV eess.SP stat.AP

    Modeling the Nonsmoothness of Modern Neural Networks

    Authors: Runze Liu, Chau-Wai Wong, Huaiyu Dai

    Abstract: Modern neural networks have been successful in many regression-based tasks such as face recognition, facial landmark detection, and image generation. In this work, we investigate an intuitive but understudied characteristic of modern neural networks, namely, the nonsmoothness. The experiments using synthetic data confirm that such operations as ReLU and max pooling in modern neural networks lead t… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

  15. arXiv:2102.02123  [pdf, other

    stat.ME

    Bayesian Fusion: Scalable unification of distributed statistical analyses

    Authors: Hongsheng Dai, Murray Pollock, Gareth Roberts

    Abstract: There has recently been considerable interest in addressing the problem of unifying distributed statistical analyses into a single coherent inference. This problem naturally arises in a number of situations, including in big-data settings, when working under privacy constraints, and in Bayesian model choice. The majority of existing approaches have relied upon convenient approximations of the dist… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  16. arXiv:2009.13697  [pdf, ps, other

    cs.LG cs.GT stat.ML

    A Fast Graph Neural Network-Based Method for Winner Determination in Multi-Unit Combinatorial Auctions

    Authors: Mengyuan Lee, Seyyedali Hosseinalipour, Christopher G. Brinton, Guanding Yu, Huaiyu Dai

    Abstract: The combinatorial auction (CA) is an efficient mechanism for resource allocation in different fields, including cloud computing. It can obtain high economic efficiency and user flexibility by allowing bidders to submit bids for combinations of different items instead of only for individual items. However, the problem of allocating items among the bidders to maximize the auctioneers" revenue, i.e.,… ▽ More

    Submitted 21 December, 2020; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Accepted by Transactions on Cloud Computing

  17. arXiv:2007.14381  [pdf, other

    cs.PL cs.LG stat.ML

    BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration

    Authors: Augustus Odena, Kensen Shi, David Bieber, Rishabh Singh, Charles Sutton, Hanjun Dai

    Abstract: Program synthesis is challenging largely because of the difficulty of search in a large space of programs. Human programmers routinely tackle the task of writing complex programs by writing sub-programs and then analyzing their intermediate results to compose them in appropriate ways. Motivated by this intuition, we present a new synthesis approach that leverages learning to guide a bottom-up sear… ▽ More

    Submitted 30 September, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

  18. arXiv:2006.15820  [pdf, other

    cs.LG cs.AI stat.ML

    Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search

    Authors: Binghong Chen, Chengtao Li, Hanjun Dai, Le Song

    Abstract: Retrosynthetic planning is a critical task in organic chemistry which identifies a series of reactions that can lead to the synthesis of a target product. The vast number of possible chemical transformations makes the size of the search space very big, and retrosynthetic planning is challenging even for experienced chemists. However, existing methods either require expensive return estimation by r… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Presented at ICML 2020

  19. arXiv:2006.15502  [pdf, other

    cs.LG stat.ML

    Scalable Deep Generative Modeling for Sparse Graphs

    Authors: Hanjun Dai, Azade Nazi, Yujia Li, Bo Dai, Dale Schuurmans

    Abstract: Learning graph generative models is a challenging task for deep learning and has wide applicability to a range of domains like chemistry, biology and social science. However current deep neural methods suffer from limited scalability: for a graph with $n$ nodes and $m$ edges, existing deep neural methods require $Ω(n^2)$ complexity by building up the adjacency matrix. On the other hand, many real… ▽ More

    Submitted 28 June, 2020; originally announced June 2020.

    Comments: ICML 2020

  20. arXiv:2006.05082  [pdf, other

    cs.LG stat.ML

    Learning to Stop While Learning to Predict

    Authors: Xinshi Chen, Hanjun Dai, Yu Li, Xin Gao, Le Song

    Abstract: There is a recent surge of interest in designing deep architectures based on the update steps in traditional algorithms, or learning neural networks to improve and replace traditional algorithms. While traditional algorithms have certain stop** criteria for outputting results at different iterations, many algorithm-inspired deep models are restricted to a ``fixed-depth'' for all inputs. Similar… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: Proceedings of the 37th International Conference on Machine Learning

  21. arXiv:2006.03594  [pdf, other

    cs.DC cs.LG cs.NI stat.ML

    From Federated to Fog Learning: Distributed Machine Learning over Heterogeneous Wireless Networks

    Authors: Seyyedali Hosseinalipour, Christopher G. Brinton, Vaneet Aggarwal, Huaiyu Dai, Mung Chiang

    Abstract: Machine learning (ML) tasks are becoming ubiquitous in today's network applications. Federated learning has emerged recently as a technique for training ML models at the network edge by leveraging processing capabilities across the nodes that collect the data. There are several challenges with employing conventional federated learning in contemporary networks, due to the significant heterogeneity… ▽ More

    Submitted 23 October, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: This paper is accepted for publication in IEEE Communications Magazine

  22. arXiv:2004.12905  [pdf, other

    cs.LG cs.CL stat.ML

    Knowledge Base Completion for Constructing Problem-Oriented Medical Records

    Authors: James Mullenbach, Jordan Swartz, T. Greg McKelvey, Hui Dai, David Sontag

    Abstract: Both electronic health records and personal health records are typically organized by data type, with medical problems, medications, procedures, and laboratory results chronologically sorted in separate areas of the chart. As a result, it can be difficult to find all of the relevant information for answering a clinical question about a given medical problem. A promising alternative is to instead o… ▽ More

    Submitted 7 August, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: MLHC 2020

  23. arXiv:2004.07351  [pdf, ps, other

    cs.LG eess.SP stat.ML

    Communication Efficient Federated Learning with Energy Awareness over Wireless Networks

    Authors: Richeng **, Xiaofan He, Huaiyu Dai

    Abstract: In federated learning (FL), reducing the communication overhead is one of the most critical challenges since the parameter server and the mobile devices share the training parameters over wireless links. With such consideration, we adopt the idea of SignSGD in which only the signs of the gradients are exchanged. Moreover, most of the existing works assume Channel State Information (CSI) available… ▽ More

    Submitted 5 September, 2021; v1 submitted 15 April, 2020; originally announced April 2020.

  24. arXiv:2003.07521  [pdf, other

    cs.LG stat.ML

    Energy-Based Processes for Exchangeable Data

    Authors: Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans

    Abstract: Recently there has been growing interest in modeling sets with exchangeability such as point clouds. A shortcoming of current approaches is that they restrict the cardinality of the sets considered or can only express limited forms of distribution over unobserved data. To overcome these limitations, we introduce Energy-Based Processes (EBPs), which extend energy based models to exchangeable data w… ▽ More

    Submitted 8 July, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Journal ref: PMLR 119:2302-2312, 2020

  25. Stochastic-Sign SGD for Federated Learning with Theoretical Guarantees

    Authors: Richeng **, Yufan Huang, Xiaofan He, Huaiyu Dai, Tianfu Wu

    Abstract: Federated learning (FL) has emerged as a prominent distributed learning paradigm. FL entails some pressing needs for develo** novel parameter estimation approaches with theoretical guarantees of convergence, which are also communication efficient, differentially private and Byzantine resilient in the heterogeneous data distribution settings. Quantization-based SGD solvers have been widely adopte… ▽ More

    Submitted 27 September, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Journal ref: Part of this work is published in IEEE Transactions on Neural Networks and Learning Systems, 2024

  26. arXiv:2002.06504  [pdf, other

    cs.LG stat.ML

    Differentiable Top-k Operator with Optimal Transport

    Authors: Yujia Xie, Hanjun Dai, Minshuo Chen, Bo Dai, Tuo Zhao, Hongyuan Zha, Wei Wei, Tomas Pfister

    Abstract: The top-k operation, i.e., finding the k largest or smallest elements from a collection of scores, is an important model component, which is widely used in information retrieval, machine learning, and data mining. However, if the top-k operation is implemented in an algorithmic way, e.g., using bubble algorithm, the resulting model cannot be trained in an end-to-end way using prevalent gradient de… ▽ More

    Submitted 18 February, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

  27. arXiv:2001.01408  [pdf, other

    cs.LG stat.ML

    Retrosynthesis Prediction with Conditional Graph Logic Network

    Authors: Hanjun Dai, Chengtao Li, Connor W. Coley, Bo Dai, Le Song

    Abstract: Retrosynthesis is one of the fundamental problems in organic chemistry. The task is to identify reactants that can be used to synthesize a specified product molecule. Recently, computer-aided retrosynthesis is finding renewed interest from both chemistry and computer science communities. Most existing approaches rely on template-based models that define subgraph matching rules, but whether or not… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

    Comments: NeurIPS 2019

  28. arXiv:1910.12980  [pdf, other

    cs.LG stat.ML

    Learning Transferable Graph Exploration

    Authors: Hanjun Dai, Yujia Li, Chenglong Wang, Rishabh Singh, Po-Sen Huang, Pushmeet Kohli

    Abstract: This paper considers the problem of efficient exploration of unseen environments, a key challenge in AI. We propose a `learning to explore' framework where we learn a policy from a distribution of environments. At test time, presented with an unseen environment from the same distribution, the policy aims to generalize the exploration strategy to visit the maximum number of unique states in a limit… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: To appear in NeurIPS 2019

  29. arXiv:1907.03750  [pdf, other

    cs.CL cs.LG stat.ML

    Neural Aspect and Opinion Term Extraction with Mined Rules as Weak Supervision

    Authors: Hongliang Dai, Yangqiu Song

    Abstract: Lack of labeled training data is a major bottleneck for neural network based aspect and opinion term extraction on product reviews. To alleviate this problem, we first propose an algorithm to automatically mine extraction rules from existing training examples based on dependency parsing results. The mined rules are then applied to label a large amount of auxiliary data. Finally, we study training… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: ACL 2019

  30. arXiv:1906.00291  [pdf, other

    cs.LG stat.ML

    Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification

    Authors: Harsh Shrivastava, Eugene Bart, Bob Price, Hanjun Dai, Bo Dai, Srinivas Aluru

    Abstract: We propose a new approach, called cooperative neural networks (CoNN), which uses a set of cooperatively trained neural networks to capture latent representations that exploit prior given independence structure. The model is more flexible than traditional graphical models based on exponential family distributions, but incorporates more domain specific prior structure than traditional deep networks… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.

  31. arXiv:1904.12083  [pdf, other

    cs.LG stat.CO stat.ML

    Exponential Family Estimation via Adversarial Dynamics Embedding

    Authors: Bo Dai, Zhen Liu, Hanjun Dai, Niao He, Arthur Gretton, Le Song, Dale Schuurmans

    Abstract: We present an efficient algorithm for maximum likelihood estimation (MLE) of exponential family models, with a general parametrization of the energy function that includes neural networks. We exploit the primal-dual view of the MLE with a kinetics augmented model to obtain an estimate associated with an adversarial dual sampler. To represent this sampler, we introduce a novel neural architecture,… ▽ More

    Submitted 30 March, 2020; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: Appearing in NeurIPS 2019 Vancouver, Canada; a preliminary version published in NeurIPS2018 Bayesian Deep Learning Workshop

  32. arXiv:1902.10336  [pdf, ps, other

    cs.LG cs.DC stat.ML

    Distributed Byzantine Tolerant Stochastic Gradient Descent in the Era of Big Data

    Authors: Richeng **, Xiaofan He, Huaiyu Dai

    Abstract: The recent advances in sensor technologies and smart devices enable the collaborative collection of a sheer volume of data from multiple information sources. As a promising tool to efficiently extract useful information from such big data, machine learning has been pushed to the forefront and seen great success in a wide range of relevant areas such as computer vision, health care, and financial m… ▽ More

    Submitted 6 March, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

  33. arXiv:1902.00640  [pdf, other

    cs.LG stat.ML

    Particle Flow Bayes' Rule

    Authors: Xinshi Chen, Hanjun Dai, Le Song

    Abstract: We present a particle flow realization of Bayes' rule, where an ODE-based neural operator is used to transport particles from a prior to its posterior after a new observation. We prove that such an ODE operator exists. Its neural parameterization can be trained in a meta-learning framework, allowing this operator to reason about the effect of an individual observation on the posterior, and thus ge… ▽ More

    Submitted 31 December, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:1022-1031, 2019

  34. Monte Carlo Fusion

    Authors: Hongsheng Dai, Murray Pollock, Gareth Roberts

    Abstract: This paper proposes a new theory and methodology to tackle the problem of unifying distributed analyses and inferences on shared parameters from multiple sources, into a single coherent inference. This surprisingly challenging problem arises in many settings (for instance, expert elicitation, multi-view learning, distributed 'big data' problems etc.), but to-date the framework and methodology prop… ▽ More

    Submitted 1 January, 2019; originally announced January 2019.

    MSC Class: 65C05; 65C60; 62C10; 65C30

    Journal ref: J. Appl. Probab. 56 (2019) 174-191

  35. arXiv:1812.01483  [pdf, other

    stat.ML cs.LG

    CompILE: Compositional Imitation Learning and Execution

    Authors: Thomas Kipf, Yujia Li, Hanjun Dai, Vinicius Zambaldi, Alvaro Sanchez-Gonzalez, Edward Grefenstette, Pushmeet Kohli, Peter Battaglia

    Abstract: We introduce Compositional Imitation Learning and Execution (CompILE): a framework for learning reusable, variable-length segments of hierarchically-structured behavior from demonstration data. CompILE uses a novel unsupervised, fully-differentiable sequence segmentation module to learn latent encodings of sequential data that can be re-composed and executed to perform new tasks. Once trained, our… ▽ More

    Submitted 14 May, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: ICML (2019)

  36. arXiv:1811.02228  [pdf, other

    cs.LG stat.ML

    Kernel Exponential Family Estimation via Doubly Dual Embedding

    Authors: Bo Dai, Hanjun Dai, Arthur Gretton, Le Song, Dale Schuurmans, Niao He

    Abstract: We investigate penalized maximum log-likelihood estimation for exponential family distributions whose natural parameter resides in a reproducing kernel Hilbert space. Key to our approach is a novel technique, doubly dual embedding, that avoids computation of the partition function. This technique also allows the development of a flexible sampling strategy that amortizes the cost of Monte-Carlo sam… ▽ More

    Submitted 24 April, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: 22 pages, 20 figures; AISTATS 2019

  37. arXiv:1809.02727  [pdf, ps, other

    cs.LG stat.ML

    Decentralized Differentially Private Without-Replacement Stochastic Gradient Descent

    Authors: Richeng **, Xiaofan He, Huaiyu Dai

    Abstract: While machine learning has achieved remarkable results in a wide variety of domains, the training of models often requires large datasets that may need to be collected from different individuals. As sensitive information may be contained in the individual's dataset, sharing training data may lead to severe privacy concerns. Therefore, there is a compelling need to develop privacy-aware machine lea… ▽ More

    Submitted 5 February, 2023; v1 submitted 7 September, 2018; originally announced September 2018.

  38. arXiv:1806.02371  [pdf, other

    cs.LG cs.CR cs.SI stat.ML

    Adversarial Attack on Graph Structured Data

    Authors: Hanjun Dai, Hui Li, Tian Tian, Xin Huang, Lin Wang, Jun Zhu, Le Song

    Abstract: Deep learning on graph structures has shown exciting results in various applications. However, few attentions have been paid to the robustness of such models, in contrast to numerous research work for image or text adversarial attack and defense. In this paper, we focus on the adversarial attacks that fool the model by modifying the combinatorial structure of data. We first propose a reinforcement… ▽ More

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: to appear in ICML 2018

  39. arXiv:1805.12393  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    KG^2: Learning to Reason Science Exam Questions with Contextual Knowledge Graph Embeddings

    Authors: Yuyu Zhang, Hanjun Dai, Kamil Toraman, Le Song

    Abstract: The AI2 Reasoning Challenge (ARC), a new benchmark dataset for question answering (QA) has been recently released. ARC only contains natural science questions authored for human exams, which are hard to answer and require advanced logic reasoning. On the ARC Challenge Set, existing state-of-the-art QA systems fail to significantly outperform random baseline, reflecting the difficult nature of this… ▽ More

    Submitted 31 May, 2018; originally announced May 2018.

  40. arXiv:1704.01665  [pdf, other

    cs.LG stat.ML

    Learning Combinatorial Optimization Algorithms over Graphs

    Authors: Hanjun Dai, Elias B. Khalil, Yuyu Zhang, Bistra Dilkina, Le Song

    Abstract: The design of good heuristics or approximation algorithms for NP-hard combinatorial optimization problems often requires significant specialized knowledge and trial-and-error. Can we automate this challenging, tedious process, and learn the algorithms instead? In many real-world applications, it is typically the case that the same optimization problem is solved again and again on a regular basis,… ▽ More

    Submitted 21 February, 2018; v1 submitted 5 April, 2017; originally announced April 2017.

    Comments: NIPS 2017

  41. arXiv:1509.00137  [pdf, other

    cs.LG math.ST stat.ML

    Online Supervised Subspace Tracking

    Authors: Yao Xie, Ruiyang Song, Hanjun Dai, Qingbin Li, Le Song

    Abstract: We present a framework for supervised subspace tracking, when there are two time series $x_t$ and $y_t$, one being the high-dimensional predictors and the other being the response variables and the subspace tracking needs to take into consideration of both sequences. It extends the classic online subspace tracking work which can be viewed as tracking of $x_t$ only. Our online sufficient dimensiona… ▽ More

    Submitted 1 September, 2015; originally announced September 2015.

    Comments: Submitted for journal publication

  42. arXiv:1507.06032  [pdf, ps, other

    stat.ME math.PR stat.ML

    Elastic Net Procedure for Partially Linear Models

    Authors: Chunhong Li, Dengxiang Huang, Hongshuai Dai, Xinxing Wei

    Abstract: Variable selection plays an important role in the high-dimensional data analysis. However the high-dimensional data often induces the strongly correlated variables problem. In this paper, we propose Elastic Net procedure for partially linear models and prove the group effect of its estimate. By a simulation study, we show that the strongly correlated variables problem can be better handled by the… ▽ More

    Submitted 21 July, 2015; originally announced July 2015.

    Comments: arXiv admin note: text overlap with arXiv:0908.1836 by other authors

  43. arXiv:1507.01279  [pdf, other

    cs.LG math.ST stat.ML

    Scan $B$-Statistic for Kernel Change-Point Detection

    Authors: Shuang Li, Yao Xie, Hanjun Dai, Le Song

    Abstract: Detecting the emergence of an abrupt change-point is a classic problem in statistics and machine learning. Kernel-based nonparametric statistics have been used for this task which enjoy fewer assumptions on the distributions than the parametric approach and can handle high-dimensional data. In this paper we focus on the scenario when the amount of background data is large, and propose two related… ▽ More

    Submitted 12 November, 2018; v1 submitted 5 July, 2015; originally announced July 2015.

    Comments: Submitted for journal publication. Partial results appeared in NIPS 2015

  44. arXiv:1506.03101  [pdf, other

    cs.LG stat.CO stat.ML

    Provable Bayesian Inference via Particle Mirror Descent

    Authors: Bo Dai, Niao He, Hanjun Dai, Le Song

    Abstract: Bayesian methods are appealing in their flexibility in modeling complex data and ability in capturing uncertainty in parameters. However, when Bayes' rule does not result in tractable closed-form, most approximate inference algorithms lack either scalability or rigorous guarantees. To tackle this challenge, we propose a simple yet provable algorithm, \emph{Particle Mirror Descent} (PMD), to iterat… ▽ More

    Submitted 5 May, 2016; v1 submitted 9 June, 2015; originally announced June 2015.

    Comments: 38 pages, 26 figures