-
Regularized Bayesian calibration and scoring of the WD-FAB IRT model improves predictive performance over marginal maximum likelihood
Authors:
Joshua C. Chang,
Julia Porcino,
Elizabeth K. Rasch,
Larry Tang
Abstract:
Item response theory (IRT) is the statistical paradigm underlying a dominant family of generative probabilistic models for test responses, used to quantify traits in individuals relative to target populations. The graded response model (GRM) is a particular IRT model that is used for ordered polytomous test responses. Both the development and the application of the GRM and other IRT models require…
▽ More
Item response theory (IRT) is the statistical paradigm underlying a dominant family of generative probabilistic models for test responses, used to quantify traits in individuals relative to target populations. The graded response model (GRM) is a particular IRT model that is used for ordered polytomous test responses. Both the development and the application of the GRM and other IRT models require statistical decisions. For formulating these models (calibration), one needs to decide on methodologies for item selection, inference, and regularization. For applying these models (test scoring), one needs to make similar decisions, often prioritizing computational tractability and/or interpretability. In many applications, such as in the Work Disability Functional Assessment Battery (WD-FAB), tractability implies approximating an individual's score distribution using estimates of mean and variance, and obtaining that score conditional on only point estimates of the calibrated model. In this manuscript, we evaluate the calibration and scoring of models under this common use-case using Bayesian cross-validation. Applied to the WD-FAB responses collected for the National Institutes of Health, we assess the predictive power of implementations of the GRM based on their ability to yield, on validation sets of respondents, ability estimates that are most predictive of patterns of item responses. Our main finding indicates that regularized Bayesian calibration of the GRM outperforms the regularization-free empirical Bayesian procedure of marginal maximum likelihood. We also motivate the use of compactly supported priors in test scoring.
△ Less
Submitted 1 December, 2021; v1 submitted 3 October, 2020;
originally announced October 2020.
-
Studio2Shop: from studio photo shoots to fashion articles
Authors:
Julia Lasserre,
Katharina Rasch,
Roland Vollgraf
Abstract:
Fashion is an increasingly important topic in computer vision, in particular the so-called street-to-shop task of matching street images with shop images containing similar fashion items. Solving this problem promises new means of making fashion searchable and hel** shoppers find the articles they are looking for. This paper focuses on finding pieces of clothing worn by a person in full-body or…
▽ More
Fashion is an increasingly important topic in computer vision, in particular the so-called street-to-shop task of matching street images with shop images containing similar fashion items. Solving this problem promises new means of making fashion searchable and hel** shoppers find the articles they are looking for. This paper focuses on finding pieces of clothing worn by a person in full-body or half-body images with neutral backgrounds. Such images are ubiquitous on the web and in fashion blogs, and are typically studio photos, we refer to this setting as studio-to-shop. Recent advances in computational fashion include the development of domain-specific numerical representations. Our model Studio2Shop builds on top of such representations and uses a deep convolutional network trained to match a query image to the numerical feature vectors of all the articles annotated in this image. Top-$k$ retrieval evaluation on test query images shows that the correct items are most often found within a range that is sufficiently small for building realistic visual search engines for the studio-to-shop setting.
△ Less
Submitted 2 July, 2018;
originally announced July 2018.
-
The AFLOW Fleet for Materials Discovery
Authors:
Cormac Toher,
Corey Oses,
David Hicks,
Eric Gossett,
Frisco Rose,
Pinku Nath,
Demet Usanmaz,
Denise C. Ford,
Eric Perim,
Camilo E. Calderon,
Jose J. Plata,
Yoav Lederer,
Michal Jahnátek,
Wahyu Setyawan,
Shidong Wang,
Junkai Xue,
Kevin Rasch,
Roman V. Chepulskii,
Richard H. Taylor,
Geena Gomez,
Harvey Shi,
Andrew R. Supka,
Rabih Al Rahal Al Orabi,
Priya Gopal,
Frank T. Cerasoli
, et al. (26 additional authors not shown)
Abstract:
The traditional paradigm for materials discovery has been recently expanded to incorporate substantial data driven research. With the intent to accelerate the development and the deployment of new technologies, the AFLOW Fleet for computational materials design automates high-throughput first principles calculations, and provides tools for data verification and dissemination for a broad community…
▽ More
The traditional paradigm for materials discovery has been recently expanded to incorporate substantial data driven research. With the intent to accelerate the development and the deployment of new technologies, the AFLOW Fleet for computational materials design automates high-throughput first principles calculations, and provides tools for data verification and dissemination for a broad community of users. AFLOW incorporates different computational modules to robustly determine thermodynamic stability, electronic band structures, vibrational dispersions, thermo-mechanical properties and more. The AFLOW data repository is publicly accessible online at aflow.org, with more than 1.7 million materials entries and a panoply of queryable computed properties. Tools to programmatically search and process the data, as well as to perform online machine learning predictions, are also available.
△ Less
Submitted 1 December, 2017;
originally announced December 2017.
-
Fixed-Node Diffusion Monte Carlo of Lithium Systems
Authors:
Kevin Rasch,
Lubos Mitas
Abstract:
We study lithium systems over a range of number of atoms, e.g., atomic anion, dimer, metallic cluster, and body-centered cubic crystal by the diffusion Monte Carlo method. The calculations include both core and valence electrons in order to avoid any possible impact by pseudo potentials. The focus of the study is the fixed-node errors, and for that purpose we test several orbital sets in order to…
▽ More
We study lithium systems over a range of number of atoms, e.g., atomic anion, dimer, metallic cluster, and body-centered cubic crystal by the diffusion Monte Carlo method. The calculations include both core and valence electrons in order to avoid any possible impact by pseudo potentials. The focus of the study is the fixed-node errors, and for that purpose we test several orbital sets in order to provide the most accurate nodal hyper surfaces. We compare our results to other high accuracy calculations wherever available and to experimental results so as to quantify the the fixed-node errors. The results for these Li systems show that fixed-node quantum Monte Carlo achieves remarkably high accuracy total energies and recovers 97-99 % of the correlation energy.
△ Less
Submitted 25 February, 2015;
originally announced February 2015.
-
Materials Cartography: Representing and Mining Material Space Using Structural and Electronic Fingerprints
Authors:
Olexandr Isayev,
Denis Fourches,
Eugene N. Muratov,
Corey Oses,
Kevin Rasch,
Alexander Tropscha,
Stefano Curtarolo
Abstract:
As the proliferation of high-throughput approaches in materials science is increasing the wealth of data in the field, the gap between accumulated-information and derived-knowledge widens. We address the issue of scientific discovery in materials databases by introducing novel analytical approaches based on structural and electronic materials fingerprints. The framework is employed to (i) query la…
▽ More
As the proliferation of high-throughput approaches in materials science is increasing the wealth of data in the field, the gap between accumulated-information and derived-knowledge widens. We address the issue of scientific discovery in materials databases by introducing novel analytical approaches based on structural and electronic materials fingerprints. The framework is employed to (i) query large databases of materials using similarity concepts, (ii) map the connectivity of the materials space (i.e., as a materials cartogram) for rapidly identifying regions with unique organizations/properties, and (iii) develop predictive Quantitative Materials Structure-Property Relation- ships (QMSPR) models for guiding materials design. In this study, we test these fingerprints by seeking target material properties. As a quantitative example, we model the critical temperatures of known superconductors. Our novel materials fingerprinting and materials cartography approaches contribute to the emerging field of materials informatics by enabling effective computational tools to analyze, visualize, model, and design new materials.
△ Less
Submitted 16 December, 2014; v1 submitted 9 December, 2014;
originally announced December 2014.
-
Fixed-node errors in quantum Monte Carlo: interplay of electron density and node nonlinearities
Authors:
Kevin M. Rasch,
Shuming Hu,
Lubos Mitas
Abstract:
We elucidate the origin of large differences (two-fold or more) in the fixed-node errors between the first- vs second-row systems for single-configuration trial wave functions in quantum Monte Carlo calculations. This significant difference in the fixed-node biases is studied across a set of atoms, molecules, and also Si, C solid crystals. The analysis is done over valence isoelectronic systems th…
▽ More
We elucidate the origin of large differences (two-fold or more) in the fixed-node errors between the first- vs second-row systems for single-configuration trial wave functions in quantum Monte Carlo calculations. This significant difference in the fixed-node biases is studied across a set of atoms, molecules, and also Si, C solid crystals. The analysis is done over valence isoelectronic systems that share similar correlation energies, bond patterns, geometries, ground states, and symmetries. We show that the key features which affect the fixed-node errors are the differences in electron density and the degree of node nonlinearity. The findings reveal how the accuracy of the quantum Monte Carlo varies across a variety of systems, provide new perspectives on the origins of the fixed-node biases in electronic structure calculations of molecular and condensed systems, and carry implications for pseudopotential constructions for heavy elements
△ Less
Submitted 29 October, 2013; v1 submitted 8 October, 2013;
originally announced October 2013.
-
Many-body nodal hypersurface and domain averages for correlated wave functions
Authors:
Shuming Hu,
Kevin Rasch,
Lubos Mitas
Abstract:
We outline the basic notions of nodal hypersurface and domain averages for antisymmetric wave functions. We illustrate their properties and analyze the results for a few electron explicitly solvable cases and discuss possible further developments.
We outline the basic notions of nodal hypersurface and domain averages for antisymmetric wave functions. We illustrate their properties and analyze the results for a few electron explicitly solvable cases and discuss possible further developments.
△ Less
Submitted 21 July, 2013;
originally announced July 2013.
-
Impact of the Electron Density on the Fixed-Node Errors in Quantum Monte Carlo
Authors:
Kevin Rasch,
Lubos Mitas
Abstract:
We analyze the effect of increasing charge density on the Fixed Node Errors in Diffusion Monte Carlo by comparing FN-DMC calculations of the total ground state energy on a 4 electron system done with a Hartree-Fock based trial wave function to calculations by the same method on the same system using a Configuration Interaction based trial wave function. We do this for several different values of n…
▽ More
We analyze the effect of increasing charge density on the Fixed Node Errors in Diffusion Monte Carlo by comparing FN-DMC calculations of the total ground state energy on a 4 electron system done with a Hartree-Fock based trial wave function to calculations by the same method on the same system using a Configuration Interaction based trial wave function. We do this for several different values of nuclear charge, Z. The Fixed Node Error of a Hartree-Fock trial wave function for a 4 electron system increases linearly with increasing nuclear charge.
△ Less
Submitted 18 July, 2011;
originally announced July 2011.