Showing 1–50 of 117 results for author: Lee, M

Searching in archive stat.
  1. arXiv:2406.02028  [pdf]

    stat.ME

    How should parallel cluster randomized trials with a baseline period be analyzed? A survey of estimands and common estimators

    Authors: Kenneth Menglin Lee, Fan Li

    Abstract: The parallel cluster randomized trial with baseline (PB-CRT) is a common variant of the standard parallel cluster randomized trial (P-CRT) that maintains parallel randomization but additionally allows for both within and between-cluster comparisons. We define two estimands of interest in the context of PB-CRTs, the participant-average treatment effect (pATE) and cluster-average treatment effect (c…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 77 pages, 16 figures

  2. arXiv:2405.15053  [pdf, other]

    stat.ME

    A Latent Variable Approach to Learning High-dimensional Multivariate longitudinal Data

    Authors: Sze Ming Lee, Yunxiao Chen, Tony Sit

    Abstract: High-dimensional multivariate longitudinal data, which arise when many outcome variables are measured repeatedly over time, are becoming increasingly common in social, behavioral and health sciences. We propose a latent variable model for drawing statistical inferences on covariate effects and predicting future outcomes based on high-dimensional multivariate longitudinal data. This model introduce…

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.10221  [pdf, other]

    math.OC cs.LG stat.ML

    Scalarisation-based risk concepts for robust multi-objective optimisation

    Authors: Ben Tu, Nikolas Kantas, Robert M. Lee, Behrang Shafei

    Abstract: Robust optimisation is a well-established framework for optimising functions in the presence of uncertainty. The inherent goal of this problem is to identify a collection of inputs whose outputs are both desirable for the decision maker, whilst also being robust to the underlying uncertainties in the problem. In this work, we study the multi-objective extension of this problem from a computational…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: The code is available at: https://github.com/benmltu/scalarize

  4. arXiv:2405.01404  [pdf, other]

    stat.ML cs.LG math.OC stat.ME

    Random Pareto front surfaces

    Authors: Ben Tu, Nikolas Kantas, Robert M. Lee, Behrang Shafei

    Abstract: The goal of multi-objective optimisation is to identify the Pareto front surface which is the set obtained by connecting the best trade-off points. Typically this surface is computed by evaluating the objectives at different points and then interpolating between the subset of the best evaluated trade-off points. In this work, we propose to parameterise the Pareto front surface using polar coordina…

    Submitted 21 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: The code is available at: https://github.com/benmltu/scalarize

  5. arXiv:2404.17709  [pdf, other]

    stat.ML cs.LG

    Low-rank Matrix Bandits with Heavy-tailed Rewards

    Authors: Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee

    Abstract: In stochastic low-rank matrix bandit, the expected reward of an arm is equal to the inner product between its feature matrix and some unknown $d_1$ by $d_2$ low-rank parameter matrix $\Theta^*$ with rank $r \ll d_1\wedge d_2$. While all prior studies assume the payoffs are mixed with sub-Gaussian noises, in this work we loosen this strict assumption and consider the new problem of \underline{low}-rank…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: The 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  6. arXiv:2404.08169  [pdf, other]

    stat.ME

    AutoGFI: Streamlined Generalized Fiducial Inference for Modern Inference Problems

    Authors: Wei Du, Jan Hannig, Thomas C. M. Lee, Yi Su, Chunzhe Zhang

    Abstract: The origins of fiducial inference trace back to the 1930s when R. A. Fisher first introduced the concept as a response to what he perceived as a limitation of Bayesian inference - the requirement for a subjective prior distribution on model parameters in cases where no prior information was available. However, Fisher's initial fiducial approach fell out of favor as complications arose, particularl…

    Submitted 11 April, 2024; originally announced April 2024.

  7. arXiv:2402.17834  [pdf, other]

    cs.CL stat.ML

    Stable LM 2 1.6B Technical Report

    Authors: Marco Bellagente, Jonathan Tow, Dakota Mahan, Duy Phung, Maksym Zhuravinskyi, Reshinth Adithyan, James Baicoianu, Ben Brooks, Nathan Cooper, Ashish Datta, Meng Lee, Emad Mostaque, Michael Pieler, Nikhil Pinnaparju, Paulo Rocha, Harry Saini, Hannah Teufel, Niccolo Zanichelli, Carlos Riquelme

    Abstract: We introduce StableLM 2 1.6B, the first in a new generation of our language model series. In this technical report, we present in detail the data and training procedure leading to the base and instruction-tuned versions of StableLM 2 1.6B. The weights for both models are available via Hugging Face for anyone to download and use. The report contains thorough evaluations of these models, including z…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 23 pages, 6 figures

  8. arXiv:2401.07298  [pdf, other]

    stat.ML cs.LG

    Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems

    Authors: Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee

    Abstract: In the stochastic contextual low-rank matrix bandit problem, the expected reward of an action is given by the inner product between the action's feature matrix and some fixed, but initially unknown $d_1$ by $d_2$ matrix $\Theta^*$ with rank $r \ll \{d_1, d_2\}$, and an agent sequentially takes actions based on past experience to maximize the cumulative reward. In this paper, we study the generalized lo…

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: Revision of the paper accepted by NeurIPS 2022

  9. arXiv:2312.00622  [pdf, other]

    cs.LG math.OC stat.ME

    Practical Path-based Bayesian Optimization

    Authors: Jose Pablo Folch, James Odgers, Shiqiang Zhang, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener

    Abstract: There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itse…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 main pages, 12 with references and appendix. 4 figures, 2 tables. To appear in NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World

    Journal ref: NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World

  10. Personalized Event Prediction for Electronic Health Records

    Authors: Jeong Min Lee, Milos Hauskrecht

    Abstract: Clinical event sequences consist of hundreds of clinical events that represent records of patient care in time. Developing accurate predictive models of such sequences is of a great importance for supporting a variety of models for interpreting/classifying the current patient condition, or predicting adverse clinical events and outcomes, all aimed to improve patient care. One important challenge o…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2104.01787

    Journal ref: Artificial Intelligence in Medicine, Volume 143, 2023, 102620, ISSN 0933-3657

  11. arXiv:2308.07076  [pdf, ps, other]

    stat.ME

    Subsample Least Squares Estimator for Heterogeneous Effects of Multiple Treatments with Any Outcome Variable

    Authors: Myoungjae Lee

    Abstract: For multiple treatments D=0,1,...,J, covariates X and outcome Y, the ordinary least squares estimator (OLS) of Y on (D1,...,DJ,X) is widely applied to a constant-effect linear model, where Dj is the dummy variable for D=j. However, the treatment effects are almost always X-heterogeneous in reality, or Y is noncontinuous, to invalidate such a linear model. The blind hope of practitioners is that th…

    Submitted 13 September, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  12. arXiv:2305.18543  [pdf, other]

    cs.LG stat.ML

    Robust Lipschitz Bandits to Adversarial Corruptions

    Authors: Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee

    Abstract: Lipschitz bandit is a variant of stochastic bandits that deals with a continuous arm set defined on a metric space, where the reward function is subject to a Lipschitz constraint. In this paper, we introduce a new problem of Lipschitz bandits in the presence of adversarial corruptions where an adaptive adversary corrupts the stochastic rewards up to a total budget $C$. The budget is measured by th…

    Submitted 8 October, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  13. arXiv:2305.11774  [pdf, other]

    math.OC cs.LG stat.ML

    Multi-objective optimisation via the R2 utilities

    Authors: Ben Tu, Nikolas Kantas, Robert M. Lee, Behrang Shafei

    Abstract: The goal of multi-objective optimisation is to identify a collection of points which describe the best possible trade-offs between the multiple objectives. In order to solve this vector-valued optimisation problem, practitioners often appeal to the use of scalarisation functions in order to transform the multi-objective problem into a collection of single-objective problems. This set of scalarised…

    Submitted 1 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: The code is available at: https://github.com/benmltu/scalarize

  14. arXiv:2305.05099  [pdf, ps, other]

    stat.ME stat.AP

    Dirichlet process mixture models for the Analysis of Repeated Attempt Designs

    Authors: Michael J. Daniels, Minji Lee, Wei Feng

    Abstract: In longitudinal studies, it is not uncommon to make multiple attempts to collect a measurement after baseline. Recording whether these attempts are successful provides useful information for the purposes of assessing missing data assumptions. This is because measurements from subjects who provide the data after numerous failed attempts may differ from those who provide the measurement after fewer…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 24 pages, additional 16 pages of supplementary material

  15. arXiv:2304.00149  [pdf]

    stat.OT

    Using online student focus groups in the development of new educational resources

    Authors: Gian Carlo Diluvi, Sonja Isberg, Bruce Dunham, Nancy Heckman, Melissa Lee

    Abstract: Educational resources, such as web apps and self-directed tutorials, have become popular tools for teaching and active learning. Ideally, students - the intended users of these resources - should be involved in the resource development stage. However, in practice students often only interact with fully developed resources, when it might be too late to incorporate changes. Previous work has address…

    Submitted 31 March, 2023; originally announced April 2023.

  16. arXiv:2303.17705  [pdf]

    stat.AP

    Incorporating patient-reported outcomes in dose-finding clinical trials with continuous patient enrollment

    Authors: Anaïs Andrillon, Lucie Biard, Shing M. Lee

    Abstract: Dose-finding clinical trials in oncology aim to estimate the maximum tolerated dose (MTD), based on safety traditionally obtained from the clinician's perspective. While the collection of patient-reported outcomes (PROs) has been advocated to better inform treatment tolerability, there is a lack of guidance and methods on how to use PROs for dose assignments and recommendations. The PRO continual…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 23 pages, 1 figure, 4 tables

  17. arXiv:2302.09440  [pdf, other]

    cs.LG stat.ML

    Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits

    Authors: Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee

    Abstract: In stochastic contextual bandits, an agent sequentially makes actions from a time-dependent action set based on past experience to minimize the cumulative regret. Like many other machine learning algorithms, the performance of bandits heavily depends on the values of hyperparameters, and theoretically derived parameter values may lead to unsatisfactory results in practice. Moreover, it is infeasib…

    Submitted 8 April, 2024; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  18. arXiv:2301.10419  [pdf, other]

    stat.AP

    Deconstructing Pedestrian Crossing Decision-making in Interactions with Continuous Traffic: an Anthropomorphic Model

    Authors: Kai Tian, Gustav Markkula, Chongfeng Wei, Yee Mun Lee, Ruth Madigan, Toshiya Hirose, Natasha Merat, Richard Romano

    Abstract: As safe and comfortable interactions with pedestrians could contribute to automated vehicles' (AVs) social acceptance and scale, increasing attention has been drawn to computational pedestrian behavior models. However, very limited studies characterize pedestrian crossing behavior based on specific behavioral mechanisms, as those mechanisms underpinning pedestrian road behavior are not yet clear.…

    Submitted 25 January, 2023; originally announced January 2023.

  19. arXiv:2301.04578  [pdf, other]

    stat.AP

    Precision Dose-finding Cancer Clinical Trials in the Setting of Broadened Eligibility

    Authors: Rebecca B. Silva, Bin Cheng, Richard D. Carvajal, Shing M. Lee

    Abstract: Broadening eligibility criteria in cancer trials has been advocated to represent the true patient population more accurately. While the advantages are clear in terms of generalizability and recruitment, novel dose-finding designs are needed to ensure patient safety. These designs should be able to recommend precise doses for subpopulations if such subpopulations with different toxicity profiles ex…

    Submitted 11 January, 2023; originally announced January 2023.

  20. arXiv:2211.06149  [pdf, other]

    cs.LG cs.CE stat.ML

    Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization

    Authors: Jose Pablo Folch, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener

    Abstract: Bayesian Optimization is a useful tool for experiment design. Unfortunately, the classical, sequential setting of Bayesian Optimization does not translate well into laboratory experiments, for instance battery design, where measurements may come from different sources and their evaluations may require significant waiting times. Multi-fidelity Bayesian Optimization addresses the setting with measur…

    Submitted 23 February, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 19 pages in main paper / 28 with references and appendix, 7 figures, 2 tables, accepted into Computers and Chemical Engineering

  21. arXiv:2210.13358  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Novelty Detection in Time Series via Weak Innovations Representation: A Deep Learning Approach

    Authors: Xinyi Wang, Mei-jen Lee, Qing Zhao, Lang Tong

    Abstract: We consider novelty detection in time series with unknown and nonparametric probability structures. A deep learning approach is proposed to causally extract an innovations sequence consisting of novelty samples statistically independent of all past samples of the time series. A novelty detection algorithm is developed for the online detection of novel changes in the probability structure in the in…

    Submitted 24 October, 2022; originally announced October 2022.

  22. arXiv:2209.10105  [pdf, ps, other]

    cs.LG cs.DC stat.ML

    Distributed Online Non-convex Optimization with Composite Regret

    Authors: Zhanhong Jiang, Aditya Balu, Xian Yeow Lee, Young M. Lee, Chinmay Hegde, Soumik Sarkar

    Abstract: Regret has been widely adopted as the metric of choice for evaluating the performance of online optimization algorithms for distributed, multi-agent systems. However, data/model variations associated with agents can significantly impact decisions and requires consensus among agents. Moreover, most existing works have focused on developing approaches for (either strongly or non-strongly) convex los…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 41 pages, presented in allerton conference 2022

  23. arXiv:2208.06970  [pdf, other]

    stat.ME cs.HC

    Level Set Restricted Voronoi Tessellation for Large scale Spatial Statistical Analysis

    Authors: Tyson Neuroth, Martin Rieth, Konduri Aditya, Myoungkyu Lee, Jacqueline H Chen, Kwan-Liu Ma

    Abstract: Spatial statistical analysis of multivariate volumetric data can be challenging due to scale, complexity, and occlusion. Advances in topological segmentation, feature extraction, and statistical summarization have helped overcome the challenges. This work introduces a new spatial statistical decomposition method based on level sets, connected components, and a novel variation of the restricted cen…

    Submitted 14 August, 2022; originally announced August 2022.

  24. arXiv:2207.14727  [pdf, other]

    stat.ML cs.LG econ.EM math.ST

    Tangential Wasserstein Projections

    Authors: Florian Gunsilius, Meng Hsuan Hsieh, Myung Jin Lee

    Abstract: We develop a notion of projections between sets of probability measures using the geometric properties of the 2-Wasserstein space. It is designed for general multivariate probability measures, is computationally efficient to implement, and provides a unique solution in regular settings. The idea is to work on regular tangent cones of the Wasserstein space using generalized geodesics. Its structure…

    Submitted 2 August, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

  25. arXiv:2207.00879  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces

    Authors: Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Behrang Shafei, Ruth Misener

    Abstract: Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search, as they achieve good predictive performance with little or no manual tuning, naturally handle discrete feature spaces, and are relatively insensitive to outliers in the training data. Two well-known challenges in using tree ensembles for black-box optimization are (i) effecti…

    Submitted 30 December, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: 27 pages, 9 figures, 4 tables

  26. Some performance considerations when using multi-armed bandit algorithms in the presence of missing data

    Authors: Xijin Chen, Kim May Lee, Sofia S. Villar, David S. Robertson

    Abstract: When comparing the performance of multi-armed bandit algorithms, the potential impact of missing data is often overlooked. In practice, it also affects their implementation where the simplest approach to overcome this is to continue to sample according to the original bandit algorithm, ignoring missing outcomes. We investigate the impact on performance of this approach to deal with missing data fo…

    Submitted 7 July, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: 30 pages, 6 figures

  27. arXiv:2204.10909  [pdf, other]

    cs.LG stat.ML

    Error-in-variables modelling for operator learning

    Authors: Ravi G. Patel, Indu Manickam, Myoungkyu Lee, Mamikon Gulian

    Abstract: Deep operator learning has emerged as a promising tool for reduced-order modelling and PDE model discovery. Leveraging the expressive power of deep neural networks, especially in high dimensions, such methods learn the mapping between functional state variables. While proposed methods have assumed noise only in the dependent variables, experimental and numerical data for operator learning typicall…

    Submitted 19 July, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 23 pages, 10 figures

  28. An empirical Bayes approach to estimating dynamic models of co-regulated gene expression

    Authors: Sara Venkatraman, Sumanta Basu, Andrew G. Clark, Sofie Delbare, Myung Hee Lee, Martin T. Wells

    Abstract: Time-course gene expression datasets provide insight into the dynamics of complex biological processes, such as immune response and organ development. It is of interest to identify genes with similar temporal expression patterns because such genes are often biologically related. However, this task is challenging due to the high dimensionality of these datasets and the nonlinearity of gene expressi…

    Submitted 31 December, 2021; originally announced December 2021.

  29. arXiv:2111.03140  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    Multi-Objective Constrained Optimization for Energy Applications via Tree Ensembles

    Authors: Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Tom Tranter, Ruth Misener

    Abstract: Energy systems optimization problems are complex due to strongly non-linear system behavior and multiple competing objectives, e.g. economic gain vs. environmental impact. Moreover, a large number of input variables and different variable types, e.g. continuous and categorical, are challenges commonly present in real-world applications. In some cases, proposed optimal solutions need to obey explic…

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 36 pages, 8 figures, 5 tables

  30. arXiv:2110.06504  [pdf, ps, other]

    stat.ME

    Path-Free Decomposition for Direct, Indirect and Interaction Effects in Mediation Analysis

    Authors: Myoung-jae Lee

    Abstract: Given a binary treatment and a binary mediator, mediation analysis decomposes the total effect of the treatment on an outcome variable into direct and indirect effects. However, the existing decompositions are "path-dependent", and consequently, there appeared different versions of direct and indirect effects. Differently from these, this paper proposes a "path-free" decomposition of the total eff…

    Submitted 13 October, 2021; originally announced October 2021.

  31. arXiv:2106.02979  [pdf, other]

    stat.ML cs.LG

    Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms

    Authors: Qin Ding, Yue Kang, Yi-Wei Liu, Thomas C. M. Lee, Cho-Jui Hsieh, James Sharpnack

    Abstract: The stochastic contextual bandit problem, which models the trade-off between exploration and exploitation, has many real applications, including recommender systems, online advertising and clinical trials. As many other machine learning algorithms, contextual bandit algorithms often have one or more hyper-parameters. As an example, in most optimal stochastic contextual bandit algorithms, there is…

    Submitted 11 June, 2022; v1 submitted 5 June, 2021; originally announced June 2021.

  32. arXiv:2105.08620  [pdf, other]

    stat.ML cs.CV cs.LG

    Adversarial Examples Detection with Bayesian Neural Network

    Authors: Yao Li, Tongyi Tang, Cho-Jui Hsieh, Thomas C. M. Lee

    Abstract: In this paper, we propose a new framework to detect adversarial examples motivated by the observations that random components can improve the smoothness of predictors and make it easier to simulate the output distribution of a deep neural network. With these observations, we propose a novel Bayesian adversarial example detector, short for BATer, to improve the performance of adversarial example de…

    Submitted 22 February, 2024; v1 submitted 18 May, 2021; originally announced May 2021.

  33. arXiv:2101.11202  [pdf, other]

    astro-ph.IM stat.AP stat.ME

    Change point detection and image segmentation for time series of astrophysical images

    Authors: Cong Xu, Hans Moritz Günther, Vinay L. Kashyap, Thomas C. M. Lee, Andreas Zezas

    Abstract: Many astrophysical phenomena are time-varying, in the sense that their intensity, energy spectrum, and/or the spatial distribution of the emission suddenly change. This paper develops a method for modeling a time series of images. Under the assumption that the arrival times of the photons follow a Poisson process, the data are binned into 4D grids of voxels (time, energy band, and x-y coordinates)…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 22 pages, 10 figures

  34. arXiv:2010.11166  [pdf, other]

    cs.LG cs.DC stat.ML

    Decentralized Deep Learning using Momentum-Accelerated Consensus

    Authors: Aditya Balu, Zhanhong Jiang, Sin Yong Tan, Chinmay Hegde, Young M Lee, Soumik Sarkar

    Abstract: We consider the problem of decentralized deep learning where multiple agents collaborate to learn from a distributed dataset. While there exist several decentralized deep learning approaches, the majority consider a central parameter-server topology for aggregating the model parameters from the agents. However, such a topology may be inapplicable in networked systems such as ad-hoc mobile networks…

    Submitted 28 November, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

  35. arXiv:2010.06567  [pdf, other]

    stat.AP

    Conditional Power and Friends: The Why and How of (Un)planned, Unblinded Sample Size Recalculations in Confirmatory Trials

    Authors: Kevin Kunzmann, Michael J. Grayling, Kim M. Lee, David S. Robertson, Kaspar Rufibach, James M. S. Wason

    Abstract: Adapting the final sample size of a trial to the evidence accruing during the trial is a natural way to address planning uncertainty. Designs with adaptive sample size need to account for their optional stopping to guarantee strict type-I error-rate control. A variety of different methods to maintain type-I error-rate control after unplanned changes of the initial sample size have been proposed in…

    Submitted 13 October, 2020; originally announced October 2020.

  36. arXiv:2009.13697  [pdf, ps, other]

    cs.LG cs.GT stat.ML

    A Fast Graph Neural Network-Based Method for Winner Determination in Multi-Unit Combinatorial Auctions

    Authors: Mengyuan Lee, Seyyedali Hosseinalipour, Christopher G. Brinton, Guanding Yu, Huaiyu Dai

    Abstract: The combinatorial auction (CA) is an efficient mechanism for resource allocation in different fields, including cloud computing. It can obtain high economic efficiency and user flexibility by allowing bidders to submit bids for combinations of different items instead of only for individual items. However, the problem of allocating items among the bidders to maximize the auctioneers' revenue, i.e.,…

    Submitted 21 December, 2020; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Accepted by Transactions on Cloud Computing

  37. arXiv:2008.03226   

    physics.chem-ph cs.LG stat.ML

    Data-Driven Discovery of Molecular Photoswitches with Multioutput Gaussian Processes

    Authors: Ryan-Rhys Griffiths, Jake L. Greenfield, Aditya R. Thawani, Arian R. Jamasb, Henry B. Moss, Anthony Bourached, Penelope Jones, William McCorkindale, Alexander A. Aldrick, Matthew J. Fuchter, Alpha A. Lee

    Abstract: Photoswitchable molecules display two or more isomeric forms that may be accessed using light. Separating the electronic absorption bands of these isomers is key to selectively addressing a specific isomer and achieving high photostationary states whilst overall red-shifting the absorption bands serves to limit material damage due to UV-exposure and increases penetration depth in photopharmacologi…

    Submitted 7 August, 2022; v1 submitted 28 June, 2020; originally announced August 2020.

    Comments: Authors still in discussion about authorship ordering

  38. Distributed Associative Memory Network with Memory Refreshing Loss

    Authors: Taewon Park, Inchul Choi, Minho Lee

    Abstract: Despite recent progress in memory augmented neural network (MANN) research, associative memory networks with a single external memory still show limited performance on complex relational reasoning tasks. Especially the content-based addressable memory networks often fail to encode input data into rich enough representation for relational reasoning and this limits the relation modeling performance…

    Submitted 27 August, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: Published (https://www.sciencedirect.com/science/article/pii/S0893608021003014?via%3Dihub), Code (https://github.com/taewonpark/DAM)

    Journal ref: Neural Networks 144 (2021) 33-48

  39. arXiv:2007.00334  [pdf, other]

    cs.LG stat.ML

    Estimation with Uncertainty via Conditional Generative Adversarial Networks

    Authors: Minhyeok Lee, Junhee Seok

    Abstract: Conventional predictive Artificial Neural Networks (ANNs) commonly employ deterministic weight matrices; therefore, their prediction is a point estimate. Such a deterministic nature in ANNs causes the limitations of using ANNs for medical diagnosis, law problems, and portfolio management, in which discovering not only the prediction but also the uncertainty of the prediction is essentially require…

    Submitted 1 July, 2020; originally announced July 2020.

  40. A review of Bayesian perspectives on sample size derivation for confirmatory trials

    Authors: Kevin Kunzmann, Michael J. Grayling, Kim May Lee, David S. Robertson, Kaspar Rufibach, James M. S. Wason

    Abstract: Sample size derivation is a crucial element of the planning phase of any confirmatory trial. A sample size is typically derived based on constraints on the maximal acceptable type I error rate and a minimal desired power. Here, power depends on the unknown true effect size. In practice, power is typically calculated either for the smallest relevant effect size or a likely point alternative. The fo…

    Submitted 28 June, 2020; originally announced June 2020.

    Journal ref: Am. Stat., 2021, 75(4), 424--432

  41. arXiv:2006.12246  [pdf, other]

    cs.CV cs.HC cs.LG stat.ML

    Pain Intensity Estimation from Mobile Video Using 2D and 3D Facial Keypoints

    Authors: Matthew Lee, Lyndon Kennedy, Andreas Girgensohn, Lynn Wilcox, John Song En Lee, Chin Wen Tan, Ban Leong Sng

    Abstract: Managing post-surgical pain is critical for successful surgical outcomes. One of the challenges of pain management is accurately assessing the pain level of patients. Self-reported numeric pain ratings are limited because they are subjective, can be affected by mood, and can influence the patient's perception of pain when making comparisons. In this paper, we introduce an approach that analyzes 2D…

    Submitted 16 June, 2020; originally announced June 2020.

  42. arXiv:2005.04788  [pdf]

    cs.LG stat.ML

    Distributed Fine-Grained Traffic Speed Prediction for Large-Scale Transportation Networks based on Automatic LSTM Customization and Sharing

    Authors: Ming-Chang Lee, Jia-Chun Lin, Ernst Gunnar Gran

    Abstract: Short-term traffic speed prediction has been an important research topic in the past decade, and many approaches have been introduced. However, providing fine-grained, accurate, and efficient traffic-speed prediction for large-scale transportation networks where numerous traffic detectors are deployed has not been well studied. In this paper, we propose DistPre, which is a distributed fine-grained…

    Submitted 3 June, 2020; v1 submitted 10 May, 2020; originally announced May 2020.

    Comments: 14 pages, 7 figures, 2 tables, Euro-par 2020 conference

  43. arXiv:2005.00564  [pdf, other]

    stat.ME stat.AP

    Response-adaptive randomization in clinical trials: from myths to practical considerations

    Authors: David S. Robertson, Kim May Lee, Boryana C. Lopez-Kolkovska, Sofia S. Villar

    Abstract: Response-Adaptive Randomization (RAR) is part of a wider class of data-dependent sampling algorithms, for which clinical trials are typically used as a motivating application. In that context, patient allocation to treatments is determined by randomization probabilities that change based on the accrued response data in order to achieve experimental goals. RAR has received abundant theoretical atte…

    Submitted 7 June, 2022; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: Update in response to editor comments

    MSC Class: 62-02

  44. arXiv:2004.04258  [pdf, other]

    stat.AP q-bio.NC

    Estimating Fiber Orientation Distribution through Blockwise Adaptive Thresholding with Application to HCP Young Adults Data

    Authors: Seungyong Hwang, Thomas C. M. Lee, Debashis Paul, Jie Peng

    Abstract: Due to recent technological advances, large brain imaging data sets can now be collected. Such data are highly complex so extraction of meaningful information from them remains challenging. Thus, there is an urgent need for statistical procedures that are computationally scalable and can provide accurate estimates that capture the neuronal structures and their functionalities. We propose a fast me…

    Submitted 28 June, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

  45. arXiv:2004.02401  [pdf, other]

    cs.LG cs.CL stat.ML

    Applying Cyclical Learning Rate to Neural Machine Translation

    Authors: Choon Meng Lee, Jianfeng Liu, Wei Peng

    Abstract: In training deep learning networks, the optimizer and related learning rate are often used without much thought or with minimal tuning, even though it is crucial in ensuring a fast convergence to a good quality minimum of the loss function that can also generalize well on the test dataset. Drawing inspiration from the successful application of cyclical learning rate policy for computer vision rela…

    Submitted 6 April, 2020; originally announced April 2020.

  46. ReRe: A Lightweight Real-time Ready-to-Go Anomaly Detection Approach for Time Series

    Authors: Ming-Chang Lee, Jia-Chun Lin, Ernst Gunnar Gran

    Abstract: Anomaly detection is an active research topic in many different fields such as intrusion detection, network monitoring, system health monitoring, IoT healthcare, etc. However, many existing anomaly detection approaches require either human intervention or domain knowledge, and may suffer from high computation complexity, consequently hindering their applicability in real-world scenarios. Therefore…

    Submitted 4 December, 2022; v1 submitted 5 April, 2020; originally announced April 2020.

    Comments: 10 pages, 9 figures, COMPSAC 2020

  47. arXiv:2004.02113  [pdf]

    cs.SD cs.CV cs.LG stat.ML

    Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System

    Authors: Gwenaelle Cunha Sergio, Minho Lee

    Abstract: Generating music with emotion similar to that of an input video is a very relevant issue nowadays. Video content creators and automatic movie directors benefit from maintaining their viewers engaged, which can be facilitated by producing novel material eliciting stronger emotions in them. Moreover, there's currently a demand for more empathetic computers to aid humans in applications such as augme…

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: Published (https://www.hindawi.com/journals/mpe/2020/8478527/)

    Journal ref: Mathematical Problems in Engineering 2020 (2020) 1-15

  48. arXiv:2004.00407  [pdf, other]

    cs.LG stat.ML

    Drug-disease Graph: Predicting Adverse Drug Reaction Signals via Graph Neural Network with Clinical Data

    Authors: Heeyoung Kwak, Minwoo Lee, Seunghyun Yoon, Jooyoung Chang, Sangmin Park, Kyomin Jung

    Abstract: Adverse Drug Reaction (ADR) is a significant public health concern world-wide. Numerous graph-based methods have been applied to biomedical graphs for predicting ADRs in pre-marketing phases. ADR detection in post-market surveillance is no less important than pre-marketing assessment, and ADR detection with large-scale clinical data has attracted much attention in recent years. However, there are…

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: To appear at PAKDD 2020

  49. arXiv:2003.04774  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    ENTMOOT: A Framework for Optimization over Ensemble Tree Models

    Authors: Alexander Thebelt, Jan Kronqvist, Miten Mistry, Robert M. Lee, Nathan Sudermann-Merx, Ruth Misener

    Abstract: Gradient boosted trees and other regression tree models perform well in a wide range of real-world, industrial applications. These tree models (i) offer insight into important prediction features, (ii) effectively manage sparse data, and (iii) have excellent prediction capabilities. Despite their advantages, they are generally unpopular for decision-making tasks and black-box optimization, which i…

    Submitted 18 May, 2021; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: 33 pages, 10 figures, 2 tables

  50. arXiv:2002.03808  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Vocoder-free End-to-End Voice Conversion with Transformer Network

    Authors: June-Woo Kim, Ho-Young Jung, Minho Lee

    Abstract: Mel-frequency filter bank (MFB) based approaches have the advantage of learning speech compared to raw spectrum since MFB has less feature size. However, speech generator with MFB approaches require additional vocoder that needs a huge amount of computation expense for training process. The additional pre/post processing such as MFB and vocoder is not essential to convert real human speech to othe…

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: Work in progress

    Journal ref: 2020 International Joint Conference on Neural Networks (IJCNN)