-
Performance and Interpretation of Classification Models
Authors:
Yuan-chin Ivan Chang
Abstract:
Classification is a common statistical task in many areas. In order to ameliorate the performance of the existing methods, there are always some new classification procedures proposed. These procedures, especially those raised in the machine learning and data-mining literature, are usually complicated, and therefore extra effort is required to understand them and the impacts of individual variable…
▽ More
Classification is a common statistical task in many areas. In order to ameliorate the performance of the existing methods, there are always some new classification procedures proposed. These procedures, especially those raised in the machine learning and data-mining literature, are usually complicated, and therefore extra effort is required to understand them and the impacts of individual variables in these procedures. However, in some applications, for example, pharmaceutical and medical related research, future developments and/or research plans will rely on the interpretation of the classification rule, such as the role of individual variables in a diagnostic rule/model. Hence, in these kinds of research, despite the optimal performance of the complicated models, the model with the balanced ease of interpretability and satisfactory performance is preferred. The complication of a classification rule might diminish its advantage in performance and become an obstacle to be used in those applications. In this paper, we study how to improve the classification performance, in terms of area under the receiver operating characteristic curve of a conventional logistic model, while retaining its ease of interpretation. The proposed method increases the sensitivity at the whole range of specificity and hence is especially useful when the performance in the high-specificity range of a receiver operating characteristic curve is of interest. Theoretical justification is presented, and numerical results using both simulated data and two real data sets are reported.
△ Less
Submitted 17 November, 2015; v1 submitted 15 November, 2015;
originally announced November 2015.
-
Active Learning Via Sequential Design and Uncertainty Sampling
Authors:
**g Wang,
Eunsik Park,
Yuan-chin Ivan Chang
Abstract:
Classification is an important task in many fields including biomedical research and machine learning. Traditionally, a classification rule is constructed based a bunch of labeled data. Recently, due to technological innovation and automatic data collection schemes, we easily encounter with data sets containing large amounts of unlabeled samples. Because to label each of them is usually costly and…
▽ More
Classification is an important task in many fields including biomedical research and machine learning. Traditionally, a classification rule is constructed based a bunch of labeled data. Recently, due to technological innovation and automatic data collection schemes, we easily encounter with data sets containing large amounts of unlabeled samples. Because to label each of them is usually costly and inefficient, how to utilize these unlabeled data in a classifier construction process becomes an important problem. In machine learning literature, active learning or semi-supervised learning are popular concepts discussed under this situation, where classification algorithms recruit new unlabeled subjects sequentially based on the information learned from previous stages of its learning process, and these new subjects are then labeled and included as new training samples. From a statistical aspect, these methods can be recognized as a hybrid of the sequential design and stochastic approximation procedure. In this paper, we study sequential learning procedures for building efficient and effective classifiers, where only the selected subjects are labeled and included in its learning stage. The proposed algorithm combines the ideas of Bayesian sequential optimal design and uncertainty sampling. Computational issues of the algorithm are discussed. Numerical results using both synthesized data and real examples are reported.
△ Less
Submitted 18 June, 2014;
originally announced June 2014.
-
Sequential Estimation in Item Calibration with A Two-Stage Design
Authors:
Yuan-chin Ivan Chang
Abstract:
In this paper we apply a two-stage sequential design to item calibration problems under a three-parameter logistic model assumption. The measurement errors of the estimates of the latent trait levels of examinees are considered in our procedure. Moreover, a sequential procedure is employed to guarantee that the estimates of the parameters reach a prescribed accuracy criterion when the iteration is…
▽ More
In this paper we apply a two-stage sequential design to item calibration problems under a three-parameter logistic model assumption. The measurement errors of the estimates of the latent trait levels of examinees are considered in our procedure. Moreover, a sequential procedure is employed to guarantee that the estimates of the parameters reach a prescribed accuracy criterion when the iteration is stopped, which fully takes the advantage of sequential design. Statistical properties of both the item parameter estimates and the sequential procedure are discussed. We compare the performance of the proposed method with that of the procedures based on some conventional designs using numerical studies.
△ Less
Submitted 21 May, 2013; v1 submitted 19 June, 2012;
originally announced June 2012.
-
A Bayesian measurement error model for two-channel cell-based RNAi data with replicates
Authors:
Chung-Hsing Chen,
Wen-Chi Su,
Chih-Yu Chen,
**g-Ying Huang,
Fang-Yu Tsai,
Wen-Chang Wang,
Chao A. Hsiung,
King-Song Jeng,
I-Shou Chang
Abstract:
RNA interference (RNAi) is an endogenous cellular process in which small double-stranded RNAs lead to the destruction of mRNAs with complementary nucleoside sequence. With the production of RNAi libraries, large-scale RNAi screening in human cells can be conducted to identify unknown genes involved in a biological pathway. One challenge researchers face is how to deal with the multiple testing iss…
▽ More
RNA interference (RNAi) is an endogenous cellular process in which small double-stranded RNAs lead to the destruction of mRNAs with complementary nucleoside sequence. With the production of RNAi libraries, large-scale RNAi screening in human cells can be conducted to identify unknown genes involved in a biological pathway. One challenge researchers face is how to deal with the multiple testing issue and the related false positive rate (FDR) and false negative rate (FNR). This paper proposes a Bayesian hierarchical measurement error model for the analysis of data from a two-channel RNAi high-throughput experiment with replicates, in which both the activity of a particular biological pathway and cell viability are monitored and the goal is to identify short hair-pin RNAs (shRNAs) that affect the pathway activity without affecting cell activity. Simulation studies demonstrate the flexibility and robustness of the Bayesian method and the benefits of having replicates in the experiment. This method is illustrated through analyzing the data from a RNAi high-throughput screening that searches for cellular factors affecting HCV replication without affecting cell viability; comparisons of the results from this HCV study and some of those reported in the literature are included.
△ Less
Submitted 20 March, 2012;
originally announced March 2012.
-
Sequential estimation for covariate-adjusted response-adaptive designs
Authors:
Yuan-chin Ivan Chang,
Eunsik Park
Abstract:
In clinical trials, a covariate-adjusted response-adaptive (CARA) design allows a subject newly entering a trial a better chance of being allocated to a superior treatment regimen based on cumulative information from previous subjects, and adjusts the allocation according to individual covariate information.
Since this design allocates subjects sequentially, it is natural to apply a sequential m…
▽ More
In clinical trials, a covariate-adjusted response-adaptive (CARA) design allows a subject newly entering a trial a better chance of being allocated to a superior treatment regimen based on cumulative information from previous subjects, and adjusts the allocation according to individual covariate information.
Since this design allocates subjects sequentially, it is natural to apply a sequential method for estimating the treatment effect in order to make the data analysis more efficient.
In this paper, we study the sequential estimation of treatment effect for a general CARA design. A stop** criterion is proposed such that the estimates satisfy a prescribed precision when the sampling is stopped. The properties of estimates and stop** time} are obtained under the proposed stop** rule. In addition, we show that the asymptotic properties of the allocation function, under the proposed stop** rule, are the same as those obtained in the non-sequential/fixed sample size counterpart.
We then illustrate the performance of the proposed procedure with some simulation results using logistic models. The properties, such as the coverage probability of treatment effect, correct allocation proportion and average sample size, for diverse combinations of initial sample sizes and tuning parameters in the utility function are discussed.
△ Less
Submitted 20 June, 2011;
originally announced June 2011.
-
Evaluating the diagnostic powers of variables and their linear combinations when the gold standard is continuous
Authors:
Zhanfeng Wang,
Yuan-chin Ivan Chang
Abstract:
The receiver operating characteristic (ROC) curve is a very useful tool for analyzing the diagnostic/classification power of instruments/classification schemes as long as a binary-scale gold standard is available. When the gold standard is continuous and there is no confirmative threshold, ROC curve becomes less useful. Hence, there are several extensions proposed for evaluating the diagnostic pot…
▽ More
The receiver operating characteristic (ROC) curve is a very useful tool for analyzing the diagnostic/classification power of instruments/classification schemes as long as a binary-scale gold standard is available. When the gold standard is continuous and there is no confirmative threshold, ROC curve becomes less useful. Hence, there are several extensions proposed for evaluating the diagnostic potential of variables of interest. However, due to the computational difficulties of these nonparametric based extensions, they are not easy to be used for finding the optimal combination of variables to improve the individual diagnostic power. Therefore, we propose a new measure, which extends the AUC index for identifying variables with good potential to be used in a diagnostic scheme. In addition, we propose a threshold gradient descent based algorithm for finding the best linear combination of variables that maximizes this new measure, which is applicable even when the number of variables is huge. The estimate of the proposed index and its asymptotic property are studied. The performance of the proposed method is illustrated using both synthesized and real data sets.
△ Less
Submitted 8 May, 2011;
originally announced May 2011.
-
Comment: Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies
Authors:
I-Shou Chang,
Chung-Hsing Chen,
Li-Chu Chien,
Chao A. Hsiung
Abstract:
Comment on "Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies" [arXiv:1102.2774]
Comment on "Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies" [arXiv:1102.2774]
△ Less
Submitted 15 February, 2011;
originally announced February 2011.
-
Profiling time course expression of virus genes---an illustration of Bayesian inference under shape restrictions
Authors:
Li-Chu Chien,
I-Shou Chang,
Shih Sheng Jiang,
Pramod K. Gupta,
Chi-Chung Wen,
Yuh-Jenn Wu,
Chao A. Hsiung
Abstract:
There have been several studies of the genome-wide temporal transcriptional program of viruses, based on microarray experiments, which are generally useful in the construction of gene regulation network. It seems that biological interpretations in these studies are directly based on the normalized data and some crude statistics, which provide rough estimates of limited features of the profile and…
▽ More
There have been several studies of the genome-wide temporal transcriptional program of viruses, based on microarray experiments, which are generally useful in the construction of gene regulation network. It seems that biological interpretations in these studies are directly based on the normalized data and some crude statistics, which provide rough estimates of limited features of the profile and may incur biases. This paper introduces a hierarchical Bayesian shape restricted regression method for making inference on the time course expression of virus genes. Estimates of many salient features of the expression profile like onset time, inflection point, maximum value, time to maximum value, area under curve, etc. can be obtained immediately by this method. Applying this method to a baculovirus microarray time course expression data set, we indicate that many biological questions can be formulated quantitatively and we are able to offer insights into the baculovirus biology.
△ Less
Submitted 29 September, 2010;
originally announced September 2010.
-
Shape restricted regression with random Bernstein polynomials
Authors:
I-Shou Chang,
Li-Chu Chien,
Chao A. Hsiung,
Chi-Chung Wen,
Yuh-Jenn Wu
Abstract:
Shape restricted regressions, including isotonic regression and concave regression as special cases, are studied using priors on Bernstein polynomials and Markov chain Monte Carlo methods. These priors have large supports, select only smooth functions, can easily incorporate geometric information into the prior, and can be generated without computational difficulty. Algorithms generating priors…
▽ More
Shape restricted regressions, including isotonic regression and concave regression as special cases, are studied using priors on Bernstein polynomials and Markov chain Monte Carlo methods. These priors have large supports, select only smooth functions, can easily incorporate geometric information into the prior, and can be generated without computational difficulty. Algorithms generating priors and posteriors are proposed, and simulation studies are conducted to illustrate the performance of this approach. Comparisons with the density-regression method of Dette et al. (2006) are included.
△ Less
Submitted 8 August, 2007;
originally announced August 2007.
-
Test of Universality in the Ising Spin Glass Using High Temperature Graph Expansion
Authors:
Daniel Daboul,
Iksoo Chang,
Amnon Aharony
Abstract:
We calculate high-temperature graph expansions for the Ising spin glass model with 4 symmetric random distribution functions for its nearest neighbor interaction constants J_{ij}. Series for the Edwards-Anderson susceptibility χ_EA are obtained to order 13 in the expansion variable (J/(k_B T))^2 for the general d-dimensional hyper-cubic lattice, where the parameter J determines the width of the…
▽ More
We calculate high-temperature graph expansions for the Ising spin glass model with 4 symmetric random distribution functions for its nearest neighbor interaction constants J_{ij}. Series for the Edwards-Anderson susceptibility χ_EA are obtained to order 13 in the expansion variable (J/(k_B T))^2 for the general d-dimensional hyper-cubic lattice, where the parameter J determines the width of the distributions. We explain in detail how the expansions are calculated. The analysis, using the Dlog-Padé approximation and the techniques known as M1 and M2, leads to estimates for the critical threshold (J/(k_B T_c))^2 and for the critical exponent γin dimensions 4, 5, 7 and 8 for all the distribution functions. In each dimension the values for γagree, within their uncertainty margins, with a common value for the different distributions, thus confirming universality.
△ Less
Submitted 7 August, 2004;
originally announced August 2004.
-
What one can learn from experiments about the elusive transition state?
Authors:
Iksoo Chang,
Marek Cieplak,
Jayanth R. Banavar,
Amos Maritan
Abstract:
We present the results of an exact analysis of a model energy landscape of a protein to clarify the notion of the transition state and the physical meaning of the $φ$ values determined in protein engineering experiments. We benchmark our findings to various theoretical approaches proposed in the literature for the identification and characterization of the transition state.
We present the results of an exact analysis of a model energy landscape of a protein to clarify the notion of the transition state and the physical meaning of the $φ$ values determined in protein engineering experiments. We benchmark our findings to various theoretical approaches proposed in the literature for the identification and characterization of the transition state.
△ Less
Submitted 18 May, 2004;
originally announced May 2004.
-
Protein threading by learning
Authors:
Iksoo Chang,
Marek Cieplak,
Ruxandra I. Dima,
Amos Maritan,
Jayanth R. Banavar
Abstract:
Using techniques borrowed from statistical physics and neural networks, we determine the parameters, associated with a scoring function, that are chosen optimally to ensure complete success in threading tests in a training set of proteins. These parameters provide a quantitative measure of the propensities of amino acids to be buried or exposed and to be in a given secondary structure and are a…
▽ More
Using techniques borrowed from statistical physics and neural networks, we determine the parameters, associated with a scoring function, that are chosen optimally to ensure complete success in threading tests in a training set of proteins. These parameters provide a quantitative measure of the propensities of amino acids to be buried or exposed and to be in a given secondary structure and are a good starting point for solving both the threading and design problems.
△ Less
Submitted 10 October, 2001;
originally announced October 2001.
-
Sznajd sociophysics model on a triangular lattice: ferro and antiferromagnetic opinions
Authors:
Iksoo Chang
Abstract:
The Sznajd sociophysics model is generalized on the triangular lattice with pure antiferromagnetic opinion and also with both ferromagnetic and antiferromagnetic opinions. The slogan of the trade union "united we stand, divided we fall" can be realized via the propagation of ferromagnetic opinion of adjacent people in the union, but the propagation of antiferromagnetic opinion can be observed am…
▽ More
The Sznajd sociophysics model is generalized on the triangular lattice with pure antiferromagnetic opinion and also with both ferromagnetic and antiferromagnetic opinions. The slogan of the trade union "united we stand, divided we fall" can be realized via the propagation of ferromagnetic opinion of adjacent people in the union, but the propagation of antiferromagnetic opinion can be observed among the third countries between two big super powers or among the family members of conflicting parents. Fixed points are found in both models. The distributions of relaxation time of the mixed model are disperse and become loser to log-normal as the initial concentration of down spins approaches 0.5, whereas for pure antiferromagnetic spins they are collapsed into one master curve which is roughly lognormal. We do not see the phase transition in the model.
△ Less
Submitted 11 September, 2001;
originally announced September 2001.
-
Asymmetries, Correlations and Fat Tails in Percolation Market Model
Authors:
I. Chang,
D. Stauffer,
R. B. Pandey
Abstract:
Modifications of the Cont-Bouchaud percolation model for price fluctuations give an asymmetry for time-reversal, an asymmetry between high and low prices, volatility clustering, effective multifractality, correlations between volatility and traded volume, and a power law tail with exponent near 3 for the cumulative distribution of price changes. Combining them together still gives the same power…
▽ More
Modifications of the Cont-Bouchaud percolation model for price fluctuations give an asymmetry for time-reversal, an asymmetry between high and low prices, volatility clustering, effective multifractality, correlations between volatility and traded volume, and a power law tail with exponent near 3 for the cumulative distribution of price changes. Combining them together still gives the same power law. Using Ising-correlated percolation does not change these results. Different modifications give log-periodic oscillations before a crash, arising from nonlinear feedback between random fluctuations.
△ Less
Submitted 22 August, 2001;
originally announced August 2001.
-
Time-reversal asymmetry in Cont-Bouchaud stock market model
Authors:
Iksoo Chang,
Dietrich Stauffer
Abstract:
The percolation model of stock market speculation allows an asymmetry (in the return distribution) leading to fast downward crashes and slow upward recovery. We see more small upturns and more intermediate downturns.
The percolation model of stock market speculation allows an asymmetry (in the return distribution) leading to fast downward crashes and slow upward recovery. We see more small upturns and more intermediate downturns.
△ Less
Submitted 29 May, 2001;
originally announced May 2001.
-
Anisotropic Domain Growth of ANNNI Model at Low Temperatures
Authors:
Mookyung Cheon,
Iksoo Chang
Abstract:
We investigate the ordering kinetics for axial next nearest neighbor Ising (ANNNI) model in one and two dimensions by the multi-spin heat bath dynamical simulation. This dynamics enables us to overcome the pinning effect and to observe the dynamical scaling law for domain growth in the ANNNI model at zero temperature. The domain growth exponent is 1/2 isotropically both in the ferromagnetic and…
▽ More
We investigate the ordering kinetics for axial next nearest neighbor Ising (ANNNI) model in one and two dimensions by the multi-spin heat bath dynamical simulation. This dynamics enables us to overcome the pinning effect and to observe the dynamical scaling law for domain growth in the ANNNI model at zero temperature. The domain growth exponent is 1/2 isotropically both in the ferromagnetic and the dry-(commensurate) antiphase. In the wet-(commensurate) antiphase, however, it is approximately 1/3 in the modulated direction, whereas it remains 1/2 in the non-modulated direction. We suggest that these exponent values are dictated by 3 and 4 body diffusion-reaction processes of domain walls.
△ Less
Submitted 2 April, 2001;
originally announced April 2001.
-
Enhancement of superconductive critical temperatures in almost empty or full bands in two dimensions: possible relevance to beta-HfNCl, C60 and MgB2
Authors:
Mahito Kohmoto,
Iksoo Chang,
Jacques Friedel
Abstract:
We examine possibility of enhancement of superconductive critical temperature in two-dimensions. The weak coupling BCS theory is applied, especially when the Fermi level is near the edges of the electronic bands. The attractive interaction depends on ${\bf k}$ due to screening. The density of states(DOS) does not have a peak near the bottom of the band, but $k$-dependent contribution to DOS (ele…
▽ More
We examine possibility of enhancement of superconductive critical temperature in two-dimensions. The weak coupling BCS theory is applied, especially when the Fermi level is near the edges of the electronic bands. The attractive interaction depends on ${\bf k}$ due to screening. The density of states(DOS) does not have a peak near the bottom of the band, but $k$-dependent contribution to DOS (electron density on the Fermi surface) has a diverging peak at the bottom or top. These features lead to significant enhancement of the critical temperatures. The results are qualitatively consistent with the superconductive behaviors of HfNCl ($\Tc \le 25K$) and ZrNCl($\Tc \le 15K$), C$_{60}$ with a field-effect transistor configuration ($\Tc = 52K$), and MgB$_2$ ($\Tc \approx 40K$) which have the unexpectedly high superconductive critical transition temperatures.
△ Less
Submitted 2 April, 2001; v1 submitted 16 March, 2001;
originally announced March 2001.
-
High temperature superconductivity from the two-dimensional semiconductors without magnetism
Authors:
Mahito Kohmoto,
Iksoo Chang,
Jacques Friedel
Abstract:
We examine the possibility of high temperature superconductivity from two-dimensional semiconductor without antiferromagnetic fluctuations. The weak coupling BCS theory is applied, especially where the Fermi level is near the bottom of the conduction band. Due to screening, the attractive interaction is local in k-space. The density of states(DOS) does not have a peak near the bottom of the band…
▽ More
We examine the possibility of high temperature superconductivity from two-dimensional semiconductor without antiferromagnetic fluctuations. The weak coupling BCS theory is applied, especially where the Fermi level is near the bottom of the conduction band. Due to screening, the attractive interaction is local in k-space. The density of states(DOS) does not have a peak near the bottom of the band, but k-dependent contribution to DOS has a diverging peak at the bottom. These features lead to high temperature superconductivity which may explain the possible superconductivity of WO_3.
△ Less
Submitted 29 February, 2000;
originally announced March 2000.
-
Phase transition between d-wave and anisotropic s-wave gaps in high temperature oxides superconductors
Authors:
Iksoo Chang,
Jacques Friedel,
Mahito Kohmoto
Abstract:
We study models for superconductivity with two interactions: $V^>$ due to antiferromagnetic(AF) fluctuations and $V^<$ due to phonons, in a weak coupling approach to the high temperature superconductivity. The nature of the two interactions are considerably different; $V^>$ is positive and sharply peaked at ($\pmπ$,$ \pmπ$) while $V^<$ is negative and peaked at ($0,0$) due to weak phonon screeni…
▽ More
We study models for superconductivity with two interactions: $V^>$ due to antiferromagnetic(AF) fluctuations and $V^<$ due to phonons, in a weak coupling approach to the high temperature superconductivity. The nature of the two interactions are considerably different; $V^>$ is positive and sharply peaked at ($\pmπ$,$ \pmπ$) while $V^<$ is negative and peaked at ($0,0$) due to weak phonon screening. We numerically find (a) weak BCS attraction is enough to have high critical temperature if a van Hove anomaly is at work, (b) $V^>$ (AF) is important to give d-wave superconductivity, (c) the gap order parameter $Δ({\bf k})$ is constant(s-wave) at extremely overdope region and it changes to anisotropic s-wave as do** is reduced, (d) there exists a first order phase transition between d-wave and anisotropic s-wave gaps. These results are qualitatively in agreement with preceding works; they should be modified in the strongly underdope region by the presence of antiferromagnetic fluctuations and ensuing AF pseudogap.
△ Less
Submitted 15 August, 1999;
originally announced August 1999.