-
Three steps towards dose optimization for oncology dose finding
Authors:
Jason J. Z. Liao,
Ekaterine Asatiani,
Qingyang Liu,
Kevin Hou
Abstract:
Traditional dose selection for oncology registration trials typically employs a one- or two-step single maximum tolerated dose (MTD) approach. However, this approach may not be appropriate for molecularly targeted therapy that tends to have toxicity profiles that are markedly different to cytotoxic agents. The US Food and Drug Administration launched Project Optimus to reform dose optimization in oncology drug development and has recently released a related Guidance for Industry. In response to these initiatives, we propose a "three steps towards dose optimization" procedure and discuss the details in dose optimization designs and analyses in this manuscript. The first step is dose-escalation to identify the MTD or maximum administered dose with an efficient hybrid design, which can offer good overdose control and increases the likelihood of the recommended MTD being close to the true MTD. The second step is the selection of appropriate recommended doses for expansion (RDEs), based on all available data including emerging safety, pharmacokinetics, pharmacodynamics, and other biomarker information. The third step is dose optimization, which uses data from a randomized fractional factorial design with multiple RDEs explored in multiple tumor cohorts during the expansion phase to ensure a feasible dose is selected for registration trials, and that the tumor type most sensitive to the investigative treatment is identified. We believe using this three-step approach can increase the likelihood of selecting the optimal dose for registration trial, one that demonstrates a balanced safety profile while retaining much of the efficacy observed at the MTD.
Submitted 26 September, 2023;
originally announced September 2023.
-
A q-EW-TOPSIS model of grey correlation for supply capacity evaluation
Authors:
Jia-Ming Liao,
Yu-Jie Huang,
Ke-Ming Shen
Abstract:
The paper describes a new supply capacity evaluation model based on non-extensive statistical entropy. The traditional EW-TOPSIS model is selected as the baseline and the GRA method is used to modify it. The correction results in the non-extensive parameter q, which leads to the so-called q-EW-TOPSIS model. This new model has advantages over the traditional EW-TOPSIS model, including the ability to accurately evaluate indicator weights with smaller sample sizes and weaker rules, and a more stable and closer-to-complete structure due to the use of entropy evaluation and mutual restriction between indicators. This study provides a more reliable and universal modified EW model. It is shown to be more compatible with systems and to offer greater credibility.
Submitted 3 March, 2023;
originally announced March 2023.
-
Multifold Cross-Validation Model Averaging for Generalized Additive Partial Linear Models
Authors:
Ze Chen,
Jun Liao,
Wangli Xu,
Yuhong Yang
Abstract:
Generalized additive partial linear models (GAPLMs) are appealing for model interpretation and prediction. However, for GAPLMs, the covariates and the degree of smoothing in the nonparametric parts are often difficult to determine in practice. To address this model selection uncertainty issue, we develop a computationally feasible model averaging (MA) procedure. The model weights are data-driven and selected based on multifold cross-validation (CV) (instead of leave-one-out) for computational saving. When all the candidate models are misspecified, we show that the proposed MA estimator for GAPLMs is asymptotically optimal in the sense of achieving the lowest possible Kullback-Leibler loss. In the other scenario where the candidate model set contains at least one correct model, the weights chosen by the multifold CV are asymptotically concentrated on the correct models. As a by-product, we propose a variable importance measure to quantify the importances of the predictors in GAPLMs based on the MA weights. It is shown to be able to asymptotically identify the variables in the true model. Moreover, when the number of candidate models is very large, a model screening method is provided. Numerical experiments show the superiority of the proposed MA method over some existing model averaging and selection methods.
Submitted 5 December, 2022;
originally announced December 2022.
-
Expert Elicitation and Data Noise Learning for Material Flow Analysis using Bayesian Inference
Authors:
Jiayuan Dong,
Jiankan Liao,
Xun Huan,
Daniel Cooper
Abstract:
Bayesian inference allows the transparent communication of uncertainty in material flow analyses (MFAs), and a systematic update of uncertainty as new data become available. However, the method is undermined by the difficulty of defining proper priors for the MFA parameters and quantifying the noise in the collected data. We start to address these issues by first deriving and implementing an expert elicitation procedure suitable for generating MFA parameter priors. Second, we propose to learn the data noise concurrently with the parametric uncertainty. These methods are demonstrated using a case study on the 2012 U.S. steel flow. Eight experts are interviewed to elicit distributions on steel flow uncertainty from raw materials to intermediate goods. The experts' distributions are combined and weighted according to the expertise demonstrated in response to seeding questions. These aggregated distributions form our model parameters' prior. A sensible, weakly-informative prior is also adopted for learning the data noise. Bayesian inference is then performed to update the parametric and data noise uncertainty given MFA data collected from the United States Geological Survey (USGS) and the World Steel Association (WSA). The results show a reduction in MFA parametric uncertainty when incorporating the collected data. Only a modest reduction in data noise uncertainty was observed; however, greater reductions were achieved when using data from multiple years in the inference. These methods generate transparent MFA and data noise uncertainties learned from data rather than pre-assumed data noise levels, providing a more robust basis for decision-making that affects the system.
Submitted 12 July, 2022;
originally announced July 2022.
-
A gated group sequential design for seamless Phase II/III trial with subpopulation selection
Authors:
Guanhong Miao,
Jason J. Z. Liao,
Jing Yang,
Keaven Anderson
Abstract:
Due to the high cost and high failure rate of Phase III trials, seamless Phase II/III designs are increasingly popular for improving trial efficiency. A potential attraction of a Phase II/III design is that it allows a randomized proof-of-concept stage prior to committing to the full cost of the Phase III trial. Population selection during the trial allows a trial to adapt and focus investment where it is most likely to provide patient benefit. Motivated by a clinical trial to find the population that potentially benefits, with dual-primary endpoints of progression-free survival (PFS) and overall survival (OS), we propose a gated group sequential design for a seamless Phase II/III trial with population selection. The investigated design controls the familywise error rate and allows multiple interim analyses to enable early stopping for efficacy or futility. Simulations and an illustrative example suggest that the proposed gated group sequential design can have more power than the commonly used classical group sequential design, and reduces patients' exposure to a less effective treatment if the complementary subgroup has a less significant treatment effect. The proposed design has the potential to save drug development cost and more quickly fulfill unmet medical needs.
Submitted 24 June, 2022;
originally announced June 2022.
-
Cellular Automata Simulation of Grain Growth of Powder Metallurgy Nickel-Based Superalloy
Authors:
Shasha Liu,
Yiling Jiang,
Ronggui Lu,
Xu Cheng,
Jia Li,
Yang Chen,
Gaofeng Tian
Abstract:
Primary γ' phase, rather than carbides and borides, plays an important role in suppressing grain growth during solution treatment at 1433 K of FGH98 nickel-based polycrystalline alloys. Results illustrate that as-fabricated FGH98 has an equiaxed grain structure and that, after heat treatment, grains remain equiaxed but grow larger. In order to clarify the effects of the size and volume fraction of the primary γ' phase on grain growth during heat treatment, this paper establishes a 2D Cellular Automata (CA) model based on thermal activation and the lowest energy principle. The CA results are compared with the experimental results and show a good fit, with an error of less than 10%. Grain growth kinetics are depicted, and real-time simulations for various sizes and volume fractions of primary γ' particles agree well with the Zener relation. The coefficient n in the Zener relation is theoretically calculated, and its minimum value is 0.23 when the radius of primary γ' is 2.8 μm.
Submitted 9 June, 2021;
originally announced June 2021.
-
Three Variants of Differential Privacy: Lossless Conversion and Applications
Authors:
Shahab Asoodeh,
Jiachun Liao,
Flavio P. Calmon,
Oliver Kosut,
Lalitha Sankar
Abstract:
We consider three different variants of differential privacy (DP), namely approximate DP, Rényi DP (RDP), and hypothesis test DP. In the first part, we develop a machinery for optimally relating approximate DP to RDP based on the joint range of two $f$-divergences that underlie the approximate DP and RDP. In particular, this enables us to derive the optimal approximate DP parameters of a mechanism that satisfies a given level of RDP. As an application, we apply our result to the moments accountant framework for characterizing privacy guarantees of noisy stochastic gradient descent (SGD). When compared to the state-of-the-art, our bounds may lead to about 100 more stochastic gradient descent iterations for training deep learning models for the same privacy budget. In the second part, we establish a relationship between RDP and hypothesis test DP which allows us to translate the RDP constraint into a tradeoff between type I and type II error probabilities of a certain binary hypothesis test. We then demonstrate that for noisy SGD our result leads to tighter privacy guarantees compared to the recently proposed $f$-DP framework for some range of parameters.
Submitted 23 January, 2021; v1 submitted 14 August, 2020;
originally announced August 2020.
-
Application of Structural Similarity Analysis of Visually Salient Areas and Hierarchical Clustering in the Screening of Similar Wireless Capsule Endoscopic Images
Authors:
Rui Nie,
Huan Yang,
Hejuan Peng,
Wenbin Luo,
Weiya Fan,
Jie Zhang,
Jing Liao,
Fang Huang,
Yufeng Xiao
Abstract:
Small intestinal capsule endoscopy is the mainstream method for inspecting small intestinal lesions, but a single small intestinal capsule endoscopy produces 60,000 to 120,000 images, the majority of which are similar and have no diagnostic value. It takes doctors 2 to 3 hours to identify lesions from these images. This is time-consuming and increases the probability of misdiagnosis and missed diagnosis, since doctors are likely to experience visual fatigue while focusing on a large number of similar images for an extended period of time. To solve these problems, we propose a screening method for similar wireless capsule endoscope (WCE) images based on structural similarity analysis and the hierarchical clustering of visually salient sub-image blocks. Clusters of similar images were automatically identified by hierarchical clustering based on the hue, saturation, value (HSV) spatial color characteristics of the images, and the keyframe images were extracted based on the structural similarity of the visually salient sub-image blocks, in order to accurately identify and screen out similar small intestinal capsule endoscopic images. Subsequently, the proposed method was applied to the capsule endoscope imaging workstation. After screening out similar images in the complete data gathered by the Type I OMOM Small Intestinal Capsule Endoscope from 52 cases covering 17 common types of small intestinal lesions, we obtained a lesion recall of 100% and an average similar image reduction ratio of 76%. With similar images screened out, the average play time of the OMOM image workstation was 18 minutes, which greatly reduced the time spent by doctors viewing the images.
Submitted 1 April, 2020;
originally announced April 2020.
-
Improving Deep Learning For Airbnb Search
Authors:
Malay Haldar,
Mustafa Abdool,
Prashant Ramanathan,
Tyler Sax,
Lanbo Zhang,
Aamir Mansawala,
Shulin Yang,
Bradley Turnbull,
Junshuo Liao
Abstract:
The application of deep learning to search ranking was one of the most impactful product improvements at Airbnb. But what comes next after you launch a deep learning model? In this paper we describe the journey beyond, discussing what we refer to as the ABCs of improving search: A for architecture, B for bias and C for cold start. For architecture, we describe a new ranking neural network, focusing on the process that evolved our existing DNN beyond a fully connected two layer network. On handling positional bias in ranking, we describe a novel approach that led to one of the most significant improvements in tackling inventory that the DNN historically found challenging. To solve cold start, we describe our perspective on the problem and changes we made to improve the treatment of new listings on the platform. We hope ranking teams transitioning to deep learning will find this a practical case study of how to iterate on DNNs.
Submitted 10 February, 2020;
originally announced February 2020.
-
LoCEC: Local Community-based Edge Classification in Large Online Social Networks
Authors:
Chonggang Song,
Qian Lin,
Guohui Ling,
Zongyi Zhang,
Hongzhao Chen,
Jun Liao,
Chuan Chen
Abstract:
Relationships in online social networks often imply social connections in the real world. An accurate understanding of relationship types benefits many applications, e.g. social advertising and recommendation. Some recent attempts have been proposed to classify user relationships into predefined types with the help of pre-labeled relationships or abundant interaction features on relationships. Unfortunately, both relationship feature data and label data are very sparse in real social platforms like WeChat, rendering existing methods inapplicable. In this paper, we present an in-depth analysis of WeChat relationships to identify the major challenges for the relationship classification task. To tackle the challenges, we propose a Local Community-based Edge Classification (LoCEC) framework that classifies user relationships in a social network into real-world social connection types. LoCEC enforces a three-phase processing, namely local community detection, community classification and relationship classification, to address the sparsity issue of relationship features and relationship labels. Moreover, LoCEC is designed to handle large-scale networks by allowing parallel and distributed processing. We conduct extensive experiments on the real-world WeChat network with hundreds of billions of edges to validate the effectiveness and efficiency of LoCEC.
Submitted 20 March, 2020; v1 submitted 10 February, 2020;
originally announced February 2020.
-
A Better Bound Gives a Hundred Rounds: Enhanced Privacy Guarantees via $f$-Divergences
Authors:
Shahab Asoodeh,
Jiachun Liao,
Flavio P. Calmon,
Oliver Kosut,
Lalitha Sankar
Abstract:
We derive the optimal differential privacy (DP) parameters of a mechanism that satisfies a given level of Rényi differential privacy (RDP). Our result is based on the joint range of two $f$-divergences that underlie the approximate and the Rényi variations of differential privacy. We apply our result to the moments accountant framework for characterizing privacy guarantees of stochastic gradient descent. When compared to the state-of-the-art, our bounds may lead to about 100 more stochastic gradient descent iterations for training deep learning models for the same privacy budget.
Submitted 16 January, 2020;
originally announced January 2020.
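For context on what such a conversion computes, here is a minimal sketch of the classical RDP-to-approximate-DP conversion: a mechanism satisfying (α, ε_RDP)-RDP also satisfies (ε_RDP + log(1/δ)/(α − 1), δ)-DP. This is the well-known baseline bound, not the optimal conversion derived in the paper above. The function names, the Gaussian-mechanism example, and the grid of orders are illustrative assumptions.

```python
import math


def rdp_to_approx_dp(alpha, rdp_eps, delta):
    """Classical conversion: (alpha, rdp_eps)-RDP implies (eps, delta)-DP.

    This is the standard baseline bound, not the paper's optimal one.
    """
    if alpha <= 1:
        raise ValueError("alpha must be greater than 1")
    if not 0 < delta < 1:
        raise ValueError("delta must lie in (0, 1)")
    return rdp_eps + math.log(1.0 / delta) / (alpha - 1.0)


def gaussian_rdp(alpha, sigma):
    """RDP of the Gaussian mechanism with sensitivity 1: alpha / (2 sigma^2)."""
    return alpha / (2.0 * sigma ** 2)


def best_eps(sigma, delta, alphas):
    """Smallest eps over a (hypothetical) grid of Renyi orders."""
    return min(
        rdp_to_approx_dp(a, gaussian_rdp(a, sigma), delta) for a in alphas
    )


if __name__ == "__main__":
    orders = [1.5, 2, 3, 4, 6, 8, 16, 32, 64]
    print(best_eps(sigma=2.0, delta=1e-5, alphas=orders))
```

Searching over several orders matters because the bound trades off the RDP term, which grows with α, against the log(1/δ)/(α − 1) term, which shrinks with α.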
-
Theoretical Guarantees for Model Auditing with Finite Adversaries
Authors:
Mario Diaz,
Peter Kairouz,
Jiachun Liao,
Lalitha Sankar
Abstract:
Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a) access to some finite side information (e.g., a small auditing dataset), and (b) finite capacity (e.g., a fixed neural network architecture). Our work investigates the requirements under which an unsuccessful attempt to identify privacy violations by a finite adversary implies that no stronger adversary can succeed at such a task. We do so via parameters that quantify the capabilities of the finite adversary, including the size of the neural network employed by such an adversary and the amount of side information it has access to as well as the regularity of the (perhaps privacy-guaranteeing) audited model.
Submitted 8 November, 2019;
originally announced November 2019.
-
MLPerf Inference Benchmark
Authors:
Vijay Janapa Reddi,
Christine Cheng,
David Kanter,
Peter Mattson,
Guenther Schmuelling,
Carole-Jean Wu,
Brian Anderson,
Maximilien Breughe,
Mark Charlebois,
William Chou,
Ramesh Chukka,
Cody Coleman,
Sam Davis,
Pan Deng,
Greg Diamos,
Jared Duke,
Dave Fick,
J. Scott Gardner,
Itay Hubara,
Sachin Idgunji,
Thomas B. Jablin,
Jeff Jiao,
Tom St. John,
Pankaj Kanwar,
David Lee,
et al. (22 additional authors not shown)
Abstract:
Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark's flexibility and adaptability.
Submitted 9 May, 2020; v1 submitted 6 November, 2019;
originally announced November 2019.
-
MLPerf Training Benchmark
Authors:
Peter Mattson,
Christine Cheng,
Cody Coleman,
Greg Diamos,
Paulius Micikevicius,
David Patterson,
Hanlin Tang,
Gu-Yeon Wei,
Peter Bailis,
Victor Bittorf,
David Brooks,
Dehao Chen,
Debojyoti Dutta,
Udit Gupta,
Kim Hazelwood,
Andrew Hock,
Xinyuan Huang,
Atsushi Ike,
Bill Jia,
Daniel Kang,
David Kanter,
Naveen Kumar,
Jeffery Liao,
Guokai Ma,
Deepak Narayanan,
et al. (12 additional authors not shown)
Abstract:
Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits high variance, and software and hardware systems are so diverse that fair benchmarking with the same binary, code, and even hyperparameters is difficult. We therefore present MLPerf, an ML benchmark that overcomes these challenges. Our analysis quantitatively evaluates MLPerf's efficacy at driving performance and scalability improvements across two rounds of results from multiple vendors.
Submitted 2 March, 2020; v1 submitted 2 October, 2019;
originally announced October 2019.
-
Generating Fair Universal Representations using Adversarial Models
Authors:
Peter Kairouz,
Jiachun Liao,
Chong Huang,
Maunil Vyas,
Monica Welfert,
Lalitha Sankar
Abstract:
We present a data-driven framework for learning fair universal representations (FUR) that guarantee statistical fairness for any learning task that may not be known a priori. Our framework leverages recent advances in adversarial learning to allow a data holder to learn representations in which a set of sensitive attributes are decoupled from the rest of the dataset. We formulate this as a constrained minimax game between an encoder and an adversary where the constraint ensures a measure of usefulness (utility) of the representation. The resulting problem is that of censoring, i.e., finding a representation that is least informative about the sensitive attributes given a utility constraint. For appropriately chosen adversarial loss functions, our censoring framework precisely clarifies the optimal adversarial strategy against strong information-theoretic adversaries; it also achieves the fairness measure of demographic parity for the resulting constrained representations. We evaluate the performance of our proposed framework on both synthetic and publicly available datasets. For these datasets, we use two tradeoff measures: censoring vs. representation fidelity and fairness vs. utility for downstream tasks, to amply demonstrate that multiple sensitive features can be effectively censored even as the resulting fair representations ensure accuracy for multiple downstream tasks.
Submitted 11 May, 2022; v1 submitted 27 September, 2019;
originally announced October 2019.
-
Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis
Authors:
Ray S. Lin,
Ji Lin,
Satrajit Roychoudhury,
Keaven M. Anderson,
Tianle Hu,
Bo Huang,
Larry F Leon,
Jason JZ Liao,
Rong Liu,
Xiaodong Luo,
Pralay Mukhopadhyay,
Rui Qin,
Kay Tatsuoka,
Xuejing Wang,
Yang Wang,
Jian Zhu,
Tai-Tsang Chen,
Renee Iacona,
Cross-Pharma Non-proportional Hazards Working Group
Abstract:
The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes over time under non-PH, the overall profile of the treatment effect may not be represented comprehensively by a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.
Submitted 20 September, 2019;
originally announced September 2019.
-
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit
Authors:
Yi-Qi Hu,
Yang Yu,
Jun-Da Liao
Abstract:
An automatic machine learning (AutoML) task is to select the best algorithm and its hyper-parameters simultaneously. Previously, the hyper-parameters of all algorithms are joint as a single search space, which is not only huge but also redundant, because many dimensions of hyper-parameters are irrelevant with the selected algorithms. In this paper, we propose a cascaded approach for algorithm sele…
▽ More
An automatic machine learning (AutoML) task is to select the best algorithm and its hyper-parameters simultaneously. Previously, the hyper-parameters of all algorithms are joint as a single search space, which is not only huge but also redundant, because many dimensions of hyper-parameters are irrelevant with the selected algorithms. In this paper, we propose a cascaded approach for algorithm selection and hyper-parameter optimization. While a search procedure is employed at the level of hyper-parameter optimization, a bandit strategy runs at the level of algorithm selection to allocate the budget based on the search feedbacks. Since the bandit is required to select the algorithm with the maximum performance, instead of the average performance, we thus propose the extreme-region upper confidence bound (ER-UCB) strategy, which focuses on the extreme region of the underlying feedback distribution. We show theoretically that the ER-UCB has a regret upper bound $O\left(K \ln n\right)$ with independent feedbacks, which is as efficient as the classical UCB bandit. We also conduct experiments on a synthetic problem as well as a set of AutoML tasks. The results verify the effectiveness of the proposed method.
Submitted 31 May, 2019;
originally announced May 2019.
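For context on the baseline that the abstract above compares against, here is a minimal sketch of the classical UCB1 bandit, which ranks arms by mean reward plus an exploration bonus. This is not the ER-UCB strategy itself, whose index is not given in the abstract; the function name `ucb1` and the idea of passing each arm as a zero-argument callable are assumptions made for illustration.

```python
import math


def ucb1(reward_fns, n_rounds):
    """Classical UCB1 over K arms (baseline, not ER-UCB).

    reward_fns : list of zero-argument callables, one per arm,
                 each returning a reward in [0, 1]
    n_rounds   : total budget of pulls
    Returns (pull counts per arm, empirical mean reward per arm).
    """
    k = len(reward_fns)
    counts = [0] * k
    sums = [0.0] * k
    for t in range(1, n_rounds + 1):
        if t <= k:
            # Initialisation: pull every arm once.
            arm = t - 1
        else:
            # Empirical mean plus exploration bonus sqrt(2 ln t / n_i).
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        counts[arm] += 1
        sums[arm] += reward_fns[arm]()
    means = [s / c if c else float("nan") for s, c in zip(sums, counts)]
    return counts, means


if __name__ == "__main__":
    # Hypothetical arms standing in for two candidate algorithms.
    counts, means = ucb1([lambda: 0.2, lambda: 0.8], n_rounds=200)
    print(counts, means)
```

In the cascaded setting described above, each arm would correspond to one candidate algorithm and each pull to one step of that algorithm's hyper-parameter search; because UCB1 targets the average reward, it is the kind of strategy the abstract argues is mismatched with the goal of finding the maximum performance.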
-
swTVM: Towards Optimized Tensor Code Generation for Deep Learning on Sunway Many-Core Processor
Authors:
Mingzhen Li,
Changxi Liu,
Jian** Liao,
Xuegui Zheng,
Hailong Yang,
Rujun Sun,
Jun Xu,
Lin Gan,
Guangwen Yang,
Zhongzhi Luan,
Depei Qian
Abstract:
The flourishing of deep learning frameworks and hardware platforms demands an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. Meanwhile, the Sunway many-core processor presents itself as a competitive candidate owing to its attractive computational power in both scientific computing and deep learning workloads. This paper combines the trends in these two directions. Specifically, we propose swTVM, which extends the original TVM to support ahead-of-time compilation for architectures requiring cross-compilation, such as Sunway. In addition, we leverage architecture features during compilation, such as the core group for massive parallelism, DMA for high-bandwidth memory transfer, and local device memory for data locality, in order to generate efficient code for deep learning workloads on Sunway. The experimental results show that the code generated by swTVM achieves a 1.79x improvement on average over the state-of-the-art deep learning framework on Sunway, across six representative benchmarks. This work is the first attempt from the compiler perspective to bridge the gap between deep learning and the Sunway processor, particularly with productivity and efficiency in mind. We believe this work will encourage more people to embrace the power of deep learning and the Sunway many-core processor.
Submitted 11 July, 2022; v1 submitted 15 April, 2019;
originally announced April 2019.
-
Connecting Bayes factor and the Region of Practical Equivalence (ROPE) Procedure for testing interval null hypothesis
Authors:
J. G. Liao,
Vishal Midya,
Arthur Berg
Abstract:
There has been strong recent interest in testing interval null hypotheses for improved scientific inference. For example, Lakens et al (2018) and Lakens and Harms (2017) use this approach to study whether there is a pre-specified meaningful treatment effect in gerontology and clinical trials, which differs from the more traditional point null hypothesis that tests for any treatment effect. Two popular Bayesian approaches are available for interval null hypothesis testing. One is the standard Bayes factor and the other is the Region of Practical Equivalence (ROPE) procedure championed by Kruschke and others over many years. This paper establishes a formal connection between these two approaches, with two benefits. First, it helps to better understand and improve the ROPE procedure. Second, it leads to a simple and effective algorithm for computing the Bayes factor in a wide range of problems using draws from posterior distributions generated by standard Bayesian programs such as BUGS, JAGS and Stan. The tedious and error-prone task of coding custom-made software specific to the Bayes factor is then avoided.
Submitted 30 April, 2019; v1 submitted 7 March, 2019;
originally announced March 2019.
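To make the idea of computing a Bayes factor from posterior draws concrete, the sketch below uses a standard identity: when the interval null and its complement are both restrictions of a single encompassing prior, the Bayes factor equals the posterior odds of the interval divided by its prior odds. Whether this matches the algorithm in the paper is an assumption; the function names, the normal-normal toy model, and the ROPE limits of ±0.2 are all hypothetical choices for illustration.

```python
import math
import random


def normal_cdf(x, mu, sd):
    """CDF of a normal distribution, via the error function."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sd * math.sqrt(2.0))))


def interval_bayes_factor(post_draws, lo, hi, prior_prob):
    """Bayes factor for H0: lo <= theta <= hi versus its complement.

    post_draws : posterior draws of theta (e.g. exported from Stan or JAGS)
    prior_prob : prior probability that theta lies in [lo, hi]
    """
    post_prob = sum(lo <= d <= hi for d in post_draws) / len(post_draws)
    if post_prob in (0.0, 1.0):
        raise ValueError("all draws fall on one side; more draws are needed")
    post_odds = post_prob / (1.0 - post_prob)
    prior_odds = prior_prob / (1.0 - prior_prob)
    return post_odds / prior_odds


if __name__ == "__main__":
    # Toy conjugate model (assumed): theta ~ N(0, 1), ybar | theta ~ N(theta, se^2).
    ybar, se, prior_sd = 0.05, 0.1, 1.0
    post_var = 1.0 / (1.0 / prior_sd**2 + 1.0 / se**2)
    post_mean = post_var * (ybar / se**2)
    rng = random.Random(0)
    draws = [rng.gauss(post_mean, math.sqrt(post_var)) for _ in range(20000)]

    lo, hi = -0.2, 0.2  # hypothetical ROPE limits
    prior_prob = normal_cdf(hi, 0.0, prior_sd) - normal_cdf(lo, 0.0, prior_sd)
    print("BF in favour of the interval null:",
          interval_bayes_factor(draws, lo, hi, prior_prob))
```

The posterior probability of the interval is also the quantity that a ROPE-style analysis inspects, which is why both procedures can be fed from the same set of draws.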
-
A robustified posterior for Bayesian inference on a large number of parallel effects
Authors:
J G Liao,
Arthur Berg,
Timothy L McMurry
Abstract:
Many modern experiments, such as microarray gene expression and genome-wide association studies, present the problem of estimating a large number of parallel effects. Bayesian inference is a popular approach for analyzing such data by modeling the large number of unknown parameters as random effects from a common prior distribution. However, misspecification of the prior distribution can lead to erroneous estimates of the random effects, especially for the largest and most interesting effects. This paper has two aims. First, we propose a robustified posterior distribution for a parametric Bayesian hierarchical model that can substantially reduce the impact of a misspecified prior. Second, we conduct a systematic comparison of the standard parametric posterior, the proposed robustified parametric posterior, and a nonparametric Bayesian posterior that uses a Dirichlet process mixture prior. The proposed robustified posterior, when combined with a flexible parametric prior, can be a superior alternative to nonparametric Bayesian methods.
Submitted 25 October, 2018; v1 submitted 27 June, 2018;
originally announced June 2018.