Search | arXiv e-print repository

Three steps towards dose optimization for oncology dose finding

Authors: Jason J. Z. Liao, Ekaterine Asatiani, Qingyang Liu, Kevin Hou

Abstract: Traditional dose selection for oncology registration trials typically employs a one- or two-step single maximum tolerated dose (MTD) approach. However, this approach may not be appropriate for molecularly targeted therapy that tends to have toxicity profiles that are markedly different to cytotoxic agents. The US Food and Drug Administration launched Project Optimus to reform dose optimization in… ▽ More Traditional dose selection for oncology registration trials typically employs a one- or two-step single maximum tolerated dose (MTD) approach. However, this approach may not be appropriate for molecularly targeted therapy that tends to have toxicity profiles that are markedly different to cytotoxic agents. The US Food and Drug Administration launched Project Optimus to reform dose optimization in oncology drug development and has recently released a related Guidance for Industry. In response to these initiatives, we propose a "three steps towards dose optimization" procedure and discuss the details in dose optimization designs and analyses in this manuscript. The first step is dose-escalation to identify the MTD or maximum administered dose with an efficient hybrid design, which can offer good overdose control and increases the likelihood of the recommended MTD being close to the true MTD. The second step is the selection of appropriate recommended doses for expansion (RDEs), based on all available data including emerging safety, pharmacokinetics, pharmacodynamics, and other biomarker information. The third step is dose optimization, which uses data from a randomized fractional factorial design with multiple RDEs explored in multiple tumor cohorts during the expansion phase to ensure a feasible dose is selected for registration trials, and that the tumor type most sensitive to the investigative treatment is identified. We believe using this three-step approach can increase the likelihood of selecting the optimal dose for registration trial, one that demonstrates a balanced safety profile while retaining much of the efficacy observed at the MTD. △ Less

Submitted 26 September, 2023; originally announced September 2023.

Comments: 22 pages, 7 figures and 2 tables

arXiv:2303.12190 [pdf, other]

a q-EW-TOPSIS model of grey correlation for supply capacity evaluation

Authors: Jia-Ming Liao, Yu-Jie Huang, Ke-Ming Shen

Abstract: The paper describes a new supply capacity evaluation model based on the non-extensive statistical entropy. The traditional EW-TOPSIS model is selected as baseline and the GRA method is used to modify it. The correction results in the non-extensive parameter q which leads to the so-called q-EW-TOPSIS model. This new model has advantages over the traditional EW-TOPSIS model, including the ability to… ▽ More The paper describes a new supply capacity evaluation model based on the non-extensive statistical entropy. The traditional EW-TOPSIS model is selected as baseline and the GRA method is used to modify it. The correction results in the non-extensive parameter q which leads to the so-called q-EW-TOPSIS model. This new model has advantages over the traditional EW-TOPSIS model, including the ability to accurately evaluate indicator weights with smaller sample sizes and weaker rules, and a more stable and closer-to-complete structure due to the use of entropy evaluation and mutual restriction between indicators. This study provides a more reliable and universal modified EW model. It is proved to be a more compatible model with systems and own greater credibility. △ Less

Submitted 3 March, 2023; originally announced March 2023.

arXiv:2212.02381 [pdf, ps, other]

Multifold Cross-Validation Model Averaging for Generalized Additive Partial Linear Models

Authors: Ze Chen, Jun Liao, Wangli Xu, Yuhong Yang

Abstract: Generalized additive partial linear models (GAPLMs) are appealing for model interpretation and prediction. However, for GAPLMs, the covariates and the degree of smoothing in the nonparametric parts are often difficult to determine in practice. To address this model selection uncertainty issue, we develop a computationally feasible model averaging (MA) procedure. The model weights are data-driven a… ▽ More Generalized additive partial linear models (GAPLMs) are appealing for model interpretation and prediction. However, for GAPLMs, the covariates and the degree of smoothing in the nonparametric parts are often difficult to determine in practice. To address this model selection uncertainty issue, we develop a computationally feasible model averaging (MA) procedure. The model weights are data-driven and selected based on multifold cross-validation (CV) (instead of leave-one-out) for computational saving. When all the candidate models are misspecified, we show that the proposed MA estimator for GAPLMs is asymptotically optimal in the sense of achieving the lowest possible Kullback-Leibler loss. In the other scenario where the candidate model set contains at least one correct model, the weights chosen by the multifold CV are asymptotically concentrated on the correct models. As a by-product, we propose a variable importance measure to quantify the importances of the predictors in GAPLMs based on the MA weights. It is shown to be able to asymptotically identify the variables in the true model. Moreover, when the number of candidate models is very large, a model screening method is provided. Numerical experiments show the superiority of the proposed MA method over some existing model averaging and selection methods. △ Less

Submitted 5 December, 2022; originally announced December 2022.

arXiv:2207.09288 [pdf, other]

doi 10.1111/jiec.13399

Expert Elicitation and Data Noise Learning for Material Flow Analysis using Bayesian Inference

Authors: Jiayuan Dong, Jiankan Liao, Xun Huan, Daniel Cooper

Abstract: Bayesian inference allows the transparent communication of uncertainty in material flow analyses (MFAs), and a systematic update of uncertainty as new data become available. However, the method is undermined by the difficultly of defining proper priors for the MFA parameters and quantifying the noise in the collected data. We start to address these issues by first deriving and implementing an expe… ▽ More Bayesian inference allows the transparent communication of uncertainty in material flow analyses (MFAs), and a systematic update of uncertainty as new data become available. However, the method is undermined by the difficultly of defining proper priors for the MFA parameters and quantifying the noise in the collected data. We start to address these issues by first deriving and implementing an expert elicitation procedure suitable for generating MFA parameter priors. Second, we propose to learn the data noise concurrent with the parametric uncertainty. These methods are demonstrated using a case study on the 2012 U.S. steel flow. Eight experts are interviewed to elicit distributions on steel flow uncertainty from raw materials to intermediate goods. The experts' distributions are combined and weighted according to the expertise demonstrated in response to seeding questions. These aggregated distributions form our model parameters' prior. A sensible, weakly-informative prior is also adopted for learning the data noise. Bayesian inference is then performed to update the parametric and data noise uncertainty given MFA data collected from the United States Geological Survey (USGS) and the World Steel Association (WSA). The results show a reduction in MFA parametric uncertainty when incorporating the collected data. Only a modest reduction in data noise uncertainty was observed; however, greater reductions were achieved when using data from multiple years in the inference. These methods generate transparent MFA and data noise uncertainties learned from data rather than pre-assumed data noise levels, providing a more robust basis for decision-making that affects the system. △ Less

Submitted 12 July, 2022; originally announced July 2022.

Comments: 23 pages of main paper and 10 pages of supporting information

MSC Class: 62F15; 62P12; 62P30

Journal ref: Journal of Industrial Ecology 27(2023) 1105-1122

arXiv:2206.12536 [pdf]

A gated group sequential design for seamless Phase II/III trial with subpopulation selection

Authors: Guanhong Miao, Jason J. Z. Liao, **g Yang, Keaven Anderson

Abstract: Due to the high cost and high failure rate of Phase III trials, seamless Phase II/III designs are more and more popular to trial efficiency. A potential attraction of Phase II/III design is to allow a randomized proof-of-concept stage prior to committing to the full cost of the Phase III trial. Population selection during the trial allows a trial to adapt and focus investment where it is most like… ▽ More Due to the high cost and high failure rate of Phase III trials, seamless Phase II/III designs are more and more popular to trial efficiency. A potential attraction of Phase II/III design is to allow a randomized proof-of-concept stage prior to committing to the full cost of the Phase III trial. Population selection during the trial allows a trial to adapt and focus investment where it is most likely to provide patient benefit. Motivated by a clinical trial to find the population that potential benefits with dual-primary endpoints progression free survival (PFS) and overall survival (OS), we propose a gated group sequential design for a seamless Phase II/III trial design with population selection. The investigated design controls the familywise error rate and allows multiple interim analyses to enable early stop** for efficacy or futility. Simulations and an illustrative example suggest that the proposed gated group sequential design can have more power than the commonly used classical group sequential design, and reduces the patient's exposure to less effective treatment if the complementary sub-group has less significant treatment effect. The proposed design has the potential to save drug development cost and more quickly fulfill unmet medical needs. △ Less

Submitted 24 June, 2022; originally announced June 2022.

Comments: 3 figures, 5 tables

MSC Class: 62P10

arXiv:2106.04888 [pdf]

Cellular Automata Simulation of Grain Growth of Powder Metallurgy Nickel-Based Superalloy

Authors: Shasha Liua, Yiling Jianga, Ronggui Lua, Xu Cheng, Jia Lia, Yang Chen, Gaofeng Tian

Abstract: Primary γ' phase instead of carbides and borides plays an important role in suppressing grain growth during solution at 1433K of FGH98 nickel-based polycrystalline alloys. Results illustrate that as-fabricated FGH98 has equiaxed grain structure and after heat treatment, grains remain equiaxed but grow larger. In order to clarify the effects of the size and volume fraction of the primary γ' phase o… ▽ More Primary γ' phase instead of carbides and borides plays an important role in suppressing grain growth during solution at 1433K of FGH98 nickel-based polycrystalline alloys. Results illustrate that as-fabricated FGH98 has equiaxed grain structure and after heat treatment, grains remain equiaxed but grow larger. In order to clarify the effects of the size and volume fraction of the primary γ' phase on the grain growth during heat treatment, this paper establish a 2D Cellular Automata (CA) model based on the thermal activation and the lowest energy principle. The CA results are compared with the experimental results and show a good fit with an error less than 10%. Grain growth kinetics are depicted and simulations in real time for various sizes and volume fractions of primary γ' particles work out well with the Zener relation. The coefficient n value in Zener relation is theoretically calculated and its minimum value is 0.23 when the radius of primary γ' is 2.8μm. △ Less

Submitted 9 June, 2021; originally announced June 2021.

arXiv:2008.06529 [pdf, other]

Three Variants of Differential Privacy: Lossless Conversion and Applications

Authors: Shahab Asoodeh, Jiachun Liao, Flavio P. Calmon, Oliver Kosut, Lalitha Sankar

Abstract: We consider three different variants of differential privacy (DP), namely approximate DP, Rényi DP (RDP), and hypothesis test DP. In the first part, we develop a machinery for optimally relating approximate DP to RDP based on the joint range of two $f$-divergences that underlie the approximate DP and RDP. In particular, this enables us to derive the optimal approximate DP parameters of a mechanism… ▽ More We consider three different variants of differential privacy (DP), namely approximate DP, Rényi DP (RDP), and hypothesis test DP. In the first part, we develop a machinery for optimally relating approximate DP to RDP based on the joint range of two $f$-divergences that underlie the approximate DP and RDP. In particular, this enables us to derive the optimal approximate DP parameters of a mechanism that satisfies a given level of RDP. As an application, we apply our result to the moments accountant framework for characterizing privacy guarantees of noisy stochastic gradient descent (SGD). When compared to the state-of-the-art, our bounds may lead to about 100 more stochastic gradient descent iterations for training deep learning models for the same privacy budget. In the second part, we establish a relationship between RDP and hypothesis test DP which allows us to translate the RDP constraint into a tradeoff between type I and type II error probabilities of a certain binary hypothesis test. We then demonstrate that for noisy SGD our result leads to tighter privacy guarantees compared to the recently proposed $f$-DP framework for some range of parameters. △ Less

Submitted 23 January, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

Comments: To appear in IEEE Journal on Selected Areas in Information Theory, Special Issue on Privacy and Security of Information Systems. arXiv admin note: text overlap with arXiv:2001.05990

arXiv:2004.02805 [pdf]

Application of Structural Similarity Analysis of Visually Salient Areas and Hierarchical Clustering in the Screening of Similar Wireless Capsule Endoscopic Images

Authors: Rui Nie, Huan Yang, Hejuan Peng, Wenbin Luo, Weiya Fan, Jie Zhang, **g Liao, Fang Huang, Yufeng Xiao

Abstract: Small intestinal capsule endoscopy is the mainstream method for inspecting small intestinal lesions,but a single small intestinal capsule endoscopy will produce 60,000 - 120,000 images, the majority of which are similar and have no diagnostic value. It takes 2 - 3 hours for doctors to identify lesions from these images. This is time-consuming and increase the probability of misdiagnosis and missed… ▽ More Small intestinal capsule endoscopy is the mainstream method for inspecting small intestinal lesions,but a single small intestinal capsule endoscopy will produce 60,000 - 120,000 images, the majority of which are similar and have no diagnostic value. It takes 2 - 3 hours for doctors to identify lesions from these images. This is time-consuming and increase the probability of misdiagnosis and missed diagnosis since doctors are likely to experience visual fatigue while focusing on a large number of similar images for an extended period of time.In order to solve these problems, we proposed a similar wireless capsule endoscope (WCE) image screening method based on structural similarity analysis and the hierarchical clustering of visually salient sub-image blocks. The similarity clustering of images was automatically identified by hierarchical clustering based on the hue,saturation,value (HSV) spatial color characteristics of the images,and the keyframe images were extracted based on the structural similarity of the visually salient sub-image blocks, in order to accurately identify and screen out similar small intestinal capsule endoscopic images. Subsequently, the proposed method was applied to the capsule endoscope imaging workstation. After screening out similar images in the complete data gathered by the Type I OMOM Small Intestinal Capsule Endoscope from 52 cases covering 17 common types of small intestinal lesions, we obtained a lesion recall of 100% and an average similar image reduction ratio of 76%. With similar images screened out, the average play time of the OMOM image workstation was 18 minutes, which greatly reduced the time spent by doctors viewing the images. △ Less

Submitted 1 April, 2020; originally announced April 2020.

arXiv:2002.05515 [pdf, other]

Improving Deep Learning For Airbnb Search

Authors: Malay Haldar, Mustafa Abdool, Prashant Ramanathan, Tyler Sax, Lanbo Zhang, Aamir Mansawala, Shulin Yang, Bradley Turnbull, Junshuo Liao

Abstract: The application of deep learning to search ranking was one of the most impactful product improvements at Airbnb. But what comes next after you launch a deep learning model? In this paper we describe the journey beyond, discussing what we refer to as the ABCs of improving search: A for architecture, B for bias and C for cold start. For architecture, we describe a new ranking neural network, focusin… ▽ More The application of deep learning to search ranking was one of the most impactful product improvements at Airbnb. But what comes next after you launch a deep learning model? In this paper we describe the journey beyond, discussing what we refer to as the ABCs of improving search: A for architecture, B for bias and C for cold start. For architecture, we describe a new ranking neural network, focusing on the process that evolved our existing DNN beyond a fully connected two layer network. On handling positional bias in ranking, we describe a novel approach that led to one of the most significant improvements in tackling inventory that the DNN historically found challenging. To solve cold start, we describe our perspective on the problem and changes we made to improve the treatment of new listings on the platform. We hope ranking teams transitioning to deep learning will find this a practical case study of how to iterate on DNNs. △ Less

Submitted 10 February, 2020; originally announced February 2020.

arXiv:2002.04180 [pdf, other]

LoCEC: Local Community-based Edge Classification in Large Online Social Networks

Authors: Chonggang Song, Qian Lin, Guohui Ling, Zongyi Zhang, Hongzhao Chen, Jun Liao, Chuan Chen

Abstract: Relationships in online social networks often imply social connections in the real world. An accurate understanding of relationship types benefits many applications, e.g. social advertising and recommendation. Some recent attempts have been proposed to classify user relationships into predefined types with the help of pre-labeled relationships or abundant interaction features on relationships. Unf… ▽ More Relationships in online social networks often imply social connections in the real world. An accurate understanding of relationship types benefits many applications, e.g. social advertising and recommendation. Some recent attempts have been proposed to classify user relationships into predefined types with the help of pre-labeled relationships or abundant interaction features on relationships. Unfortunately, both relationship feature data and label data are very sparse in real social platforms like WeChat, rendering existing methods inapplicable. In this paper, we present an in-depth analysis of WeChat relationships to identify the major challenges for the relationship classification task. To tackle the challenges, we propose a Local Community-based Edge Classification (LoCEC) framework that classifies user relationships in a social network into real-world social connection types. LoCEC enforces a three-phase processing, namely local community detection, community classification and relationship classification, to address the sparsity issue of relationship features and relationship labels. Moreover, LoCEC is designed to handle large-scale networks by allowing parallel and distributed processing. We conduct extensive experiments on the real-world WeChat network with hundreds of billions of edges to validate the effectiveness and efficiency of LoCEC. △ Less

Submitted 20 March, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

arXiv:2001.05990 [pdf, other]

A Better Bound Gives a Hundred Rounds: Enhanced Privacy Guarantees via $f$-Divergences

Authors: Shahab Asoodeh, Jiachun Liao, Flavio P. Calmon, Oliver Kosut, Lalitha Sankar

Abstract: We derive the optimal differential privacy (DP) parameters of a mechanism that satisfies a given level of Rényi differential privacy (RDP). Our result is based on the joint range of two $f$-divergences that underlie the approximate and the Rényi variations of differential privacy. We apply our result to the moments accountant framework for characterizing privacy guarantees of stochastic gradient d… ▽ More We derive the optimal differential privacy (DP) parameters of a mechanism that satisfies a given level of Rényi differential privacy (RDP). Our result is based on the joint range of two $f$-divergences that underlie the approximate and the Rényi variations of differential privacy. We apply our result to the moments accountant framework for characterizing privacy guarantees of stochastic gradient descent. When compared to the state-of-the-art, our bounds may lead to about 100 more stochastic gradient descent iterations for training deep learning models for the same privacy budget. △ Less

Submitted 16 January, 2020; originally announced January 2020.

Comments: Submitted for Publication

arXiv:1911.03405 [pdf, other]

Theoretical Guarantees for Model Auditing with Finite Adversaries

Authors: Mario Diaz, Peter Kairouz, Jiachun Liao, Lalitha Sankar

Abstract: Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a)… ▽ More Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a) access to some finite side information (e.g., a small auditing dataset), and (b) finite capacity (e.g., a fixed neural network architecture). Our work investigates the requirements under which an unsuccessful attempt to identify privacy violations by a finite adversary implies that no stronger adversary can succeed at such a task. We do so via parameters that quantify the capabilities of the finite adversary, including the size of the neural network employed by such an adversary and the amount of side information it has access to as well as the regularity of the (perhaps privacy-guaranteeing) audited model. △ Less

Submitted 8 November, 2019; originally announced November 2019.

Comments: 18 pages, 1 figure

arXiv:1911.02549 [pdf, other]

MLPerf Inference Benchmark

Authors: Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee , et al. (22 additional authors not shown)

Abstract: Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devic… ▽ More Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark's flexibility and adaptability. △ Less

Submitted 9 May, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: ISCA 2020

arXiv:1910.01500 [pdf, other]

MLPerf Training Benchmark

Authors: Peter Mattson, Christine Cheng, Cody Coleman, Greg Diamos, Paulius Micikevicius, David Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debojyoti Dutta, Udit Gupta, Kim Hazelwood, Andrew Hock, Xinyuan Huang, Atsushi Ike, Bill Jia, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Guokai Ma, Deepak Narayanan , et al. (12 additional authors not shown)

Abstract: Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits h… ▽ More Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits high variance, and software and hardware systems are so diverse that fair benchmarking with the same binary, code, and even hyperparameters is difficult. We therefore present MLPerf, an ML benchmark that overcomes these challenges. Our analysis quantitatively evaluates MLPerf's efficacy at driving performance and scalability improvements across two rounds of results from multiple vendors. △ Less

Submitted 2 March, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

Comments: MLSys 2020

arXiv:1910.00411 [pdf, other]

Generating Fair Universal Representations using Adversarial Models

Authors: Peter Kairouz, Jiachun Liao, Chong Huang, Maunil Vyas, Monica Welfert, Lalitha Sankar

Abstract: We present a data-driven framework for learning fair universal representations (FUR) that guarantee statistical fairness for any learning task that may not be known a priori. Our framework leverages recent advances in adversarial learning to allow a data holder to learn representations in which a set of sensitive attributes are decoupled from the rest of the dataset. We formulate this as a constra… ▽ More We present a data-driven framework for learning fair universal representations (FUR) that guarantee statistical fairness for any learning task that may not be known a priori. Our framework leverages recent advances in adversarial learning to allow a data holder to learn representations in which a set of sensitive attributes are decoupled from the rest of the dataset. We formulate this as a constrained minimax game between an encoder and an adversary where the constraint ensures a measure of usefulness (utility) of the representation. The resulting problem is that of censoring, i.e., finding a representation that is least informative about the sensitive attributes given a utility constraint. For appropriately chosen adversarial loss functions, our censoring framework precisely clarifies the optimal adversarial strategy against strong information-theoretic adversaries; it also achieves the fairness measure of demographic parity for the resulting constrained representations. We evaluate the performance of our proposed framework on both synthetic and publicly available datasets. For these datasets, we use two tradeoff measures: censoring vs. representation fidelity and fairness vs. utility for downstream tasks, to amply demonstrate that multiple sensitive features can be effectively censored even as the resulting fair representations ensure accuracy for multiple downstream tasks. △ Less

Submitted 11 May, 2022; v1 submitted 27 September, 2019; originally announced October 2019.

Comments: Extended version of a paper accepted to TIFS

arXiv:1909.09467 [pdf]

doi 10.1080/19466315.2019.1697738

Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis

Authors: Ray S. Lin, Ji Lin, Satrajit Roychoudhury, Keaven M. Anderson, Tianle Hu, Bo Huang, Larry F Leon, Jason JZ Liao, Rong Liu, Xiaodong Luo, Pralay Mukhopadhyay, Rui Qin, Kay Tatsuoka, Xue**g Wang, Yang Wang, Jian Zhu, Tai-Tsang Chen, Renee Iacona, Cross-Pharma Non-proportional Hazards Working Group

Abstract: The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan… ▽ More The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes overtime under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data. △ Less

Submitted 20 September, 2019; originally announced September 2019.

Report number: NPH12

Journal ref: Statistics in Biopharmaceutical Statistics 2020

arXiv:1905.13703 [pdf, other]

Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit

Authors: Yi-Qi Hu, Yang Yu, Jun-Da Liao

Abstract: An automatic machine learning (AutoML) task is to select the best algorithm and its hyper-parameters simultaneously. Previously, the hyper-parameters of all algorithms are joint as a single search space, which is not only huge but also redundant, because many dimensions of hyper-parameters are irrelevant with the selected algorithms. In this paper, we propose a cascaded approach for algorithm sele… ▽ More An automatic machine learning (AutoML) task is to select the best algorithm and its hyper-parameters simultaneously. Previously, the hyper-parameters of all algorithms are joint as a single search space, which is not only huge but also redundant, because many dimensions of hyper-parameters are irrelevant with the selected algorithms. In this paper, we propose a cascaded approach for algorithm selection and hyper-parameter optimization. While a search procedure is employed at the level of hyper-parameter optimization, a bandit strategy runs at the level of algorithm selection to allocate the budget based on the search feedbacks. Since the bandit is required to select the algorithm with the maximum performance, instead of the average performance, we thus propose the extreme-region upper confidence bound (ER-UCB) strategy, which focuses on the extreme region of the underlying feedback distribution. We show theoretically that the ER-UCB has a regret upper bound $O\left(K \ln n\right)$ with independent feedbacks, which is as efficient as the classical UCB bandit. We also conduct experiments on a synthetic problem as well as a set of AutoML tasks. The results verify the effectiveness of the proposed method. △ Less

Submitted 31 May, 2019; originally announced May 2019.

Comments: Appears in IJCAI 2019

arXiv:1904.07404 [pdf, other]

swTVM: Towards Optimized Tensor Code Generation for Deep Learning on Sunway Many-Core Processor

Authors: Mingzhen Li, Changxi Liu, Jian** Liao, Xuegui Zheng, Hailong Yang, Rujun Sun, Jun Xu, Lin Gan, Guangwen Yang, Zhongzhi Luan, Depei Qian

Abstract: The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. In the meanwhile, the Sunway many-core p… ▽ More The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. In the meanwhile, the Sunway many-core processor renders itself as a competitive candidate for its attractive computational power in both scientific computing and deep learning workloads. This paper combines the trends in these two directions. Specifically, we propose swTVM that extends the original TVM to support ahead-of-time compilation for architecture requiring cross-compilation such as Sunway. In addition, we leverage the architecture features during the compilation such as core group for massive parallelism, DMA for high bandwidth memory transfer and local device memory for data locality, in order to generate efficient codes for deep learning workloads on Sunway. The experiment results show that the codes generated by swTVM achieves 1.79x on average compared to the state-of-the-art deep learning framework on Sunway, across six representative benchmarks. This work is the first attempt from the compiler perspective to bridge the gap of deep learning and Sunway processor particularly with productivity and efficiency in mind. We believe this work will encourage more people to embrace the power of deep learning and Sunway many-core processor. △ Less

Submitted 11 July, 2022; v1 submitted 15 April, 2019; originally announced April 2019.

arXiv:1903.03153 [pdf, other]

Connecting Bayes factor and the Region of Practical Equivalence (ROPE) Procedure for testing interval null hypothesis

Authors: J. G. Liao, Vishal Midya, Arthur Berg

Abstract: There has been strong recent interest in testing interval null hypothesis for improved scientific inference. For example, Lakens et al (2018) and Lakens and Harms (2017) use this approach to study if there is a pre-specified meaningful treatment effect in gerontology and clinical trials, which is different from the more traditional point null hypothesis that tests for any treatment effect. Two pop… ▽ More There has been strong recent interest in testing interval null hypothesis for improved scientific inference. For example, Lakens et al (2018) and Lakens and Harms (2017) use this approach to study if there is a pre-specified meaningful treatment effect in gerontology and clinical trials, which is different from the more traditional point null hypothesis that tests for any treatment effect. Two popular Bayesian approaches are available for interval null hypothesis testing. One is the standard Bayes factor and the other is the Region of Practical Equivalence (ROPE) procedure championed by Kruschke and others over many years. This paper establishes a formal connection between these two approaches with two benefits. First, it helps to better understand and improve the ROPE procedure. Second, it leads to a simple and effective algorithm for computing Bayes factor in a wide range of problems using draws from posterior distributions generated by standard Bayesian programs such as BUGS, JAGS and Stan. The tedious and error-prone task of coding custom-made software specific for Bayes factor is then avoided. △ Less

Submitted 30 April, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

arXiv:1806.10483 [pdf, other]

A robustified posterior for Bayesian inference on a large number of parallel effects

Authors: J G Liao, Arthur Berg, Timothy L McMurry

Abstract: Many modern experiments, such as microarray gene expression and genome-wide association studies, present the problem of estimating a large number of parallel effects. Bayesian inference is a popular approach for analyzing such data by modeling the large number of unknown parameters as random effects from a common prior distribution. However, misspecification of the prior distribution can lead to e… ▽ More Many modern experiments, such as microarray gene expression and genome-wide association studies, present the problem of estimating a large number of parallel effects. Bayesian inference is a popular approach for analyzing such data by modeling the large number of unknown parameters as random effects from a common prior distribution. However, misspecification of the prior distribution can lead to erroneous estimates of the random effects, especially for the largest and most interesting effects. This paper has two aims. First, we propose a robustified posterior distribution for a parametric Bayesian hierarchical model that can substantially reduce the impact of a misspecified prior. Second, we conduct a systematic comparison of the standard parametric posterior, the proposed robustified parametric posterior, and a nonparametric Bayesian posterior which uses a Dirichlet process mixture prior. The proposed robustifed posterior when combined with a flexible parametric prior can be a superior alternative to nonparametric Bayesian methods. △ Less

Submitted 25 October, 2018; v1 submitted 27 June, 2018; originally announced June 2018.

Showing 1–20 of 20 results for author: Liao, J