Showing 1–17 of 17 results for author: Apley, D

  1. arXiv:2404.15207  [pdf, other]

    cs.CE cond-mat.mtrl-sci cs.LG stat.AP

    Simulation-Free Determination of Microstructure Representative Volume Element Size via Fisher Scores

    Authors: Wei Liu, Satyajit Mojumder, Wing Kam Liu, Wei Chen, Daniel W. Apley

    Abstract: A representative volume element (RVE) is a reasonably small unit of microstructure that can be simulated to obtain the same effective properties as the entire microstructure sample. Finite element (FE) simulation of RVEs, as opposed to much larger samples, saves computational expense, especially in multiscale modeling. Therefore, it is desirable to have a framework that determines RVE size prior t…

    Submitted 7 April, 2024; originally announced April 2024.

    Journal ref: APL Mach. Learn. 2(2): 026101 (2024)

  2. arXiv:2303.03393  [pdf, other]

    cs.LG cs.HC stat.ME stat.ML

    Interpretable Architecture Neural Networks for Function Visualization

    Authors: Shengtong Zhang, Daniel W. Apley

    Abstract: In many scientific research fields, understanding and visualizing a black-box function in terms of the effects of all the input variables is of great importance. Existing visualization tools do not allow one to visualize the effects of all the input variables simultaneously. Although one can select one or two of the input variables to visualize via a 2D or 3D plot while holding other variables fix…

    Submitted 3 March, 2023; originally announced March 2023.

  3. arXiv:2211.02218  [pdf, other]

    stat.ML cs.LG

    Fully Bayesian inference for latent variable Gaussian process models

    Authors: Suraj Yerramilli, Akshay Iyer, Wei Chen, Daniel W. Apley

    Abstract: Real engineering and scientific applications often involve one or more qualitative inputs. Standard Gaussian processes (GPs), however, cannot directly accommodate qualitative inputs. The recently introduced latent variable Gaussian process (LVGP) overcomes this issue by first mapping each qualitative factor to underlying latent variables (LVs), and then uses any standard GP covariance function ove…

    Submitted 19 March, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  4. arXiv:2207.04994  [pdf, other]

    stat.ML cond-mat.mtrl-sci cs.LG

    Uncertainty-Aware Mixed-Variable Machine Learning for Materials Design

    Authors: Hengrui Zhang, Wei Wayne Chen, Akshay Iyer, Daniel W. Apley, Wei Chen

    Abstract: Data-driven design shows the promise of accelerating materials discovery but is challenging due to the prohibitive cost of searching the vast design space of chemistry, structure, and synthesis methods. Bayesian Optimization (BO) employs uncertainty-aware machine learning models to select promising designs to evaluate, hence reducing the cost. However, BO with mixed numerical and categorical varia…

    Submitted 4 October, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Journal ref: Scientific Reports 12, 19760 (2022)

  5. Diversity Subsampling: Custom Subsamples from Large Data Sets

    Authors: Boyang Shang, Daniel W. Apley, Sanjay Mehrotra

    Abstract: Subsampling from a large data set is useful in many supervised learning contexts to provide a global view of the data based on only a fraction of the observations. Diverse (or space-filling) subsampling is an appealing subsampling approach when no prior knowledge of the data is available. In this paper, we propose a diversity subsampling approach that selects a subsample from the original data suc…

    Submitted 21 June, 2022; originally announced June 2022.

  6. arXiv:2106.15356  [pdf]

    cs.LG stat.ML

    Scalable Gaussian Processes for Data-Driven Design using Big Data with Categorical Factors

    Authors: Liwei Wang, Suraj Yerramilli, Akshay Iyer, Daniel Apley, Ping Zhu, Wei Chen

    Abstract: Scientific and engineering problems often require the use of artificial intelligence to aid understanding and the search for promising designs. While Gaussian processes (GP) stand out as easy-to-use and interpretable learners, they have difficulties in accommodating big datasets, categorical inputs, and multiple responses, which has become a common challenge for a growing number of data-driven des…

    Submitted 29 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Preprint submitted to Journal of Mechanical Design

  7. arXiv:2012.11135  [pdf, other]

    stat.AP

    Nonstationarity Analysis of Materials Microstructures via Fisher Score Vectors

    Authors: Kungang Zhang, Daniel W. Apley, Wei Chen

    Abstract: Microstructures are critical to the physical properties of materials. Stochastic microstructures are commonly observed in many kinds of materials and traditional descriptor-based image analysis of them can be challenging. In this paper, we introduce a powerful and versatile score-based framework for analyzing nonstationarity in stochastic materials microstructures. The framework involves training…

    Submitted 21 December, 2020; originally announced December 2020.

  8. arXiv:2012.06916  [pdf, other]

    stat.ML cs.LG

    Concept Drift Monitoring and Diagnostics of Supervised Learning Models via Score Vectors

    Authors: Kungang Zhang, Anh T. Bui, Daniel W. Apley

    Abstract: Supervised learning models are one of the most fundamental classes of models. Viewing supervised learning from a probabilistic perspective, the set of training data to which the model is fitted is usually assumed to follow a stationary distribution. However, this stationarity assumption is often violated in a phenomenon called concept drift, which refers to changes over time in the predictive rela…

    Submitted 12 September, 2022; v1 submitted 12 December, 2020; originally announced December 2020.

  9. arXiv:2010.13306  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Database, Features, and Machine Learning Model to Identify Thermally Driven Metal-Insulator Transition Compounds

    Authors: Alexandru B. Georgescu, Peiwen Ren, Aubrey R. Toland, Shengtong Zhang, Kyle D. Miller, Daniel W. Apley, Elsa A. Olivetti, Nicholas Wagner, James M. Rondinelli

    Abstract: Metal-insulator transition (MIT) compounds are materials that may exhibit insulating or metallic behavior, depending on the physical conditions, and are of immense fundamental interest owing to their potential applications in emerging microelectronics. There is a dearth of thermally-driven MIT materials, however, which makes delineating these compounds from those that are exclusively insulating or…

    Submitted 21 July, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

    Journal ref: Chem. Mater. 33, 14, 5591-5605 (2021)

  10. arXiv:1910.01688  [pdf]

    stat.ML cond-mat.mtrl-sci cs.LG stat.AP

    Bayesian Optimization for Materials Design with Mixed Quantitative and Qualitative Variables

    Authors: Yichi Zhang, Daniel Apley, Wei Chen

    Abstract: Although Bayesian Optimization (BO) has been employed for accelerating materials design in computational materials engineering, existing works are restricted to problems with quantitative variables. However, real designs of materials systems involve both qualitative and quantitative design variables representing material compositions, microstructure morphology, and processing conditions. For mixed…

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: 29 pages, 9 figures, 3 tables

  11. arXiv:1812.01786  [pdf, other]

    stat.ME

    Density Deconvolution with Additive Measurement Errors using Quadratic Programming

    Authors: Ran Yang, Daniel Apley, Jeremy Staum, David Ruppert

    Abstract: Distribution estimation for noisy data via density deconvolution is a notoriously difficult problem for typical noise distributions like Gaussian. We develop a density deconvolution estimator based on quadratic programming (QP) that can achieve better estimation than kernel density deconvolution methods. The QP approach appears to have a more favorable regularization tradeoff between oversmoothing…

    Submitted 4 December, 2018; originally announced December 2018.

  12. arXiv:1806.07504  [pdf]

    stat.ML cs.LG

    A Latent Variable Approach to Gaussian Process Modeling with Qualitative and Quantitative Factors

    Authors: Yichi Zhang, Siyu Tao, Wei Chen, Daniel W. Apley

    Abstract: Computer simulations often involve both qualitative and numerical inputs. Existing Gaussian process (GP) methods for handling this mainly assume a different response surface for each combination of levels of the qualitative factors and relate them via a multiresponse cross-covariance matrix. We introduce a substantially different approach that maps each qualitative factor to an underlying numerica…

    Submitted 30 January, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

  13. A monitoring and diagnostic approach for stochastic textured surfaces

    Authors: Anh Tuan Bui, Daniel W. Apley

    Abstract: We develop a supervised-learning-based approach for monitoring and diagnosing texture-related defects in manufactured products characterized by stochastic textured surfaces that satisfy the locality and stationarity properties of Markov random fields. Examples of stochastic textured surface data include images of woven textiles; image or surface metrology data for machined, cast, or formed metal p…

    Submitted 21 July, 2017; v1 submitted 9 February, 2017; originally announced February 2017.

  14. arXiv:1701.06655  [pdf, other]

    cs.LG stat.ML

    Patchwork Kriging for Large-scale Gaussian Process Regression

    Authors: Chiwoo Park, Daniel Apley

    Abstract: This paper presents a new approach for Gaussian process (GP) regression for large datasets. The approach involves partitioning the regression input domain into multiple local regions with a different local GP model fitted in each region. Unlike existing local partitioned GP approaches, we introduce a technique for patching together the local GP models nearly seamlessly to ensure that the local GP…

    Submitted 7 July, 2018; v1 submitted 23 January, 2017; originally announced January 2017.

    MSC Class: 68T01 ACM Class: G.3

  15. arXiv:1612.08468  [pdf, other]

    stat.ME

    Visualizing the Effects of Predictor Variables in Black Box Supervised Learning Models

    Authors: Daniel W. Apley, Jingyu Zhu

    Abstract: When fitting black box supervised learning models (e.g., complex trees, neural networks, boosted trees, random forests, nearest neighbors, local kernel-weighted methods, etc.), visualizing the main effects of the individual predictor variables and their low-order interaction effects is often important, and partial dependence (PD) plots are the most popular approach for accomplishing this. However,…

    Submitted 19 August, 2019; v1 submitted 26 December, 2016; originally announced December 2016.

    Comments: The R package ALEPlot is available on CRAN. The new version contains refined definitions of ALE effects, a new illustrative example, theorems and proofs of asymptotic properties of ALE effects and estimators, and extra implementation details

  16. arXiv:1509.06721  [pdf]

    stat.ME stat.AP stat.OT

    Designed Sampling from Large Databases for Controlled Trials

    Authors: Liwen Ouyang, Daniel W. Apley, Sanjay Mehrotra

    Abstract: The increasing prevalence of rich sources of data and the availability of electronic medical record databases and electronic registries opens tremendous opportunities for enhancing medical research. For example, controlled trials are ubiquitously used to investigate the effect of a medical treatment, perhaps dependent on a set of patient covariates, and traditional approaches have relied primarily…

    Submitted 22 September, 2015; originally announced September 2015.

  17. arXiv:1303.0383  [pdf, other]

    stat.ME stat.CO

    Local Gaussian process approximation for large computer experiments

    Authors: Robert B. Gramacy, Daniel W. Apley

    Abstract: We provide a new approach to approximate emulation of large computer experiments. By focusing expressly on desirable properties of the predictive equations, we derive a family of local sequential design schemes that dynamically define the support of a Gaussian process predictor based on a local subset of the data. We further derive expressions for fast sequential updating of all needed quantities…

    Submitted 10 October, 2014; v1 submitted 2 March, 2013; originally announced March 2013.

    Comments: 29 pages, 5 figures, 2 tables