Skip to main content

Showing 1–15 of 15 results for author: Urbanowicz, R J

.
  1. arXiv:2401.11167  [pdf, other

    cs.NE

    Coevolving Artistic Images Using OMNIREP

    Authors: Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz

    Abstract: We have recently developed OMNIREP, a coevolutionary algorithm to discover both a representation and an interpreter that solve a particular problem of interest. Herein, we demonstrate that the OMNIREP framework can be successfully applied within the field of evolutionary art. Specifically, we coevolve representations that encode image position, alongside interpreters that transform these positions… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Journal ref: J. Romero et al. (Eds.), EvoMUSART 2020, LNCS 12103, pp. 165-178, 2020

  2. New Pathways in Coevolutionary Computation

    Authors: Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz

    Abstract: The simultaneous evolution of two or more species with coupled fitness -- coevolution -- has been put to good use in the field of evolutionary computation. Herein, we present two new forms of coevolutionary algorithms, which we have recently designed and applied with success. OMNIREP is a cooperative coevolutionary algorithm that discovers both a representation and an encoding for solving a partic… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2206.13509, arXiv:2206.15409, arXiv:2206.12707

    Journal ref: W. Banzhaf et al. (eds.), Genetic Programming Theory and Practice XVII, Genetic and Evolutionary Computation, 2020

  3. arXiv:2312.05461  [pdf, other

    cs.LG cs.AI

    STREAMLINE: An Automated Machine Learning Pipeline for Biomedicine Applied to Examine the Utility of Photography-Based Phenotypes for OSA Prediction Across International Sleep Centers

    Authors: Ryan J. Urbanowicz, Harsh Bandhey, Brendan T. Keenan, Greg Maislin, Sy Hwang, Danielle L. Mowery, Shannon M. Lynch, Diego R. Mazzotti, Fang Han, Qing Yun Li, Thomas Penzel, Sergio Tufik, Lia Bittencourt, Thorarinn Gislason, Philip de Chazal, Bhajan Singh, Nigel McArdle, Ning-Hung Chen, Allan Pack, Richard J. Schwab, Peter A. Cistulli, Ulysses J. Magalang

    Abstract: While machine learning (ML) includes a valuable array of tools for analyzing biomedical data, significant time and expertise is required to assemble effective, rigorous, and unbiased pipelines. Automated ML (AutoML) tools seek to facilitate ML application by automating a subset of analysis pipeline elements. In this study we develop and validate a Simple, Transparent, End-to-end Automated Machine… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 23 pages, 7 figures, 1 table, 1 supplemental information document (77 pages), and 7 ancillary files

  4. arXiv:2206.15409  [pdf, other

    cs.NE

    Automatically Balancing Model Accuracy and Complexity using Solution and Fitness Evolution (SAFE)

    Authors: Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz

    Abstract: When seeking a predictive model in biomedical data, one often has more than a single objective in mind, e.g., attaining both high accuracy and low complexity (to promote interpretability). We investigate herein whether multiple objectives can be dynamically tuned by our recently proposed coevolutionary algorithm, SAFE (Solution And Fitness Evolution). We find that SAFE is able to automatically tun… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

  5. arXiv:2206.13509  [pdf, other

    cs.NE

    Solution and Fitness Evolution (SAFE): A Study of Multiobjective Problems

    Authors: Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz

    Abstract: We have recently presented SAFE -- Solution And Fitness Evolution -- a commensalistic coevolutionary algorithm that maintains two coevolving populations: a population of candidate solutions and a population of candidate objective functions. We showed that SAFE was successful at evolving solutions within a robotic maze domain. Herein we present an investigation of SAFE's adaptation and application… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2206.12707

    Journal ref: Proceedings of 2019 IEEE Congress on Evolutionary Computation

  6. Solution and Fitness Evolution (SAFE): Coevolving Solutions and Their Objective Functions

    Authors: Moshe Sipper, Jason H. Moore, Ryan J. Urbanowicz

    Abstract: We recently highlighted a fundamental problem recognized to confound algorithmic optimization, namely, \textit{conflating} the objective with the objective function. Even when the former is well defined, the latter may not be obvious, e.g., in learning a strategy to navigate a maze to find a goal (objective), an effective objective function to \textit{evaluate} strategies may not be a simple funct… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

    Journal ref: EuroGP 2019, LNCS 11451, pages 1-16, 2019

  7. arXiv:2206.12002  [pdf, other

    cs.LG cs.DC cs.DS q-bio.GN

    STREAMLINE: A Simple, Transparent, End-To-End Automated Machine Learning Pipeline Facilitating Data Analysis and Algorithm Comparison

    Authors: Ryan J. Urbanowicz, Robert Zhang, Yuhan Cui, Pranshu Suri

    Abstract: Machine learning (ML) offers powerful methods for detecting and modeling associations often in data with large feature spaces and complex associations. Many useful tools/packages (e.g. scikit-learn) have been developed to make the various elements of data handling, processing, modeling, and interpretation accessible. However, it is not trivial for most investigators to assemble these elements into… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 24 pages, 15 figures, submitted for publication in Genetic Programming Theory and Practice Proceedings

    ACM Class: I.2.0; I.5.0; I.6.5; J.3; K.3.2

  8. arXiv:2104.12844  [pdf, other

    cs.LG

    LCS-DIVE: An Automated Rule-based Machine Learning Visualization Pipeline for Characterizing Complex Associations in Classification

    Authors: Robert Zhang, Rachael Stolzenberg-Solomon, Shannon M. Lynch, Ryan J. Urbanowicz

    Abstract: Machine learning (ML) research has yielded powerful tools for training accurate prediction models despite complex multivariate associations (e.g. interactions and heterogeneity). In fields such as medicine, improved interpretability of ML modeling is required for knowledge discovery, accountability, and fairness. Rule-based ML approaches such as Learning Classifier Systems (LCSs) strike a balance… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: 21 pages, 11 figures, submitted for review on 4/26/21

    ACM Class: I.6.5; I.5.1; I.5.3; I.5.4; I.5.5

  9. arXiv:2008.12829  [pdf, other

    cs.LG stat.ML

    A Rigorous Machine Learning Analysis Pipeline for Biomedical Binary Classification: Application in Pancreatic Cancer Nested Case-control Studies with Implications for Bias Assessments

    Authors: Ryan J. Urbanowicz, Pranshu Suri, Yuhan Cui, Jason H. Moore, Karen Ruth, Rachael Stolzenberg-Solomon, Shannon M. Lynch

    Abstract: Machine learning (ML) offers a collection of powerful approaches for detecting and modeling associations, often applied to data having a large number of features and/or complex associations. Currently, there are many tools to facilitate implementing custom ML analyses (e.g. scikit-learn). Interest is also increasing in automated ML packages, which can make it easier for non-experts to apply ML and… ▽ More

    Submitted 8 September, 2020; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: 22 pages, 12 figures

  10. arXiv:1711.08477  [pdf, other

    cs.LG

    Benchmarking Relief-Based Feature Selection Methods for Bioinformatics Data Mining

    Authors: Ryan J. Urbanowicz, Randal S. Olson, Peter Schmitt, Melissa Meeker, Jason H. Moore

    Abstract: Modern biomedical data mining requires feature selection methods that can (1) be applied to large scale feature spaces (e.g. `omics' data), (2) function in noisy problems, (3) detect complex patterns of association (e.g. gene-gene interactions), (4) be flexibly adapted to various problem domains and data types (e.g. genetic variants, gene expression, and clinical data) and (5) are computationally… ▽ More

    Submitted 2 April, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Revised submission to JBI

  11. arXiv:1711.08421  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Relief-Based Feature Selection: Introduction and Review

    Authors: Ryan J. Urbanowicz, Melissa Meeker, William LaCava, Randal S. Olson, Jason H. Moore

    Abstract: Feature selection plays a critical role in biomedical data mining, driven by increasing feature dimensionality in target problems and growing interest in advanced but computationally expensive methodologies able to model complex associations. Specifically, there is a need for feature selection methods that are computationally efficient, yet sensitive to complex patterns of association, e.g. intera… ▽ More

    Submitted 2 April, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Submitted revisions for publication based on reviews by the Journal of Biomedical Informatics

  12. arXiv:1705.00594  [pdf, other

    cs.AI cs.HC cs.NE

    A System for Accessible Artificial Intelligence

    Authors: Randal S. Olson, Moshe Sipper, William La Cava, Sharon Tartarone, Steven Vitale, Weixuan Fu, Patryk Orzechowski, Ryan J. Urbanowicz, John H. Holmes, Jason H. Moore

    Abstract: While artificial intelligence (AI) has become widespread, many commercial AI systems are not yet accessible to individual researchers nor the general public due to the deep knowledge of the systems required to use them. We believe that AI has matured to the point where it should be an accessible technology for everyone. We present an ongoing project whose ultimate goal is to deliver an open source… ▽ More

    Submitted 10 August, 2017; v1 submitted 1 May, 2017; originally announced May 2017.

    Comments: 14 pages, 5 figures, submitted to Genetic Programming Theory and Practice 2017 workshop

  13. arXiv:1703.00512  [pdf, other

    cs.LG cs.AI

    PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison

    Authors: Randal S. Olson, William La Cava, Patryk Orzechowski, Ryan J. Urbanowicz, Jason H. Moore

    Abstract: The selection, development, or comparison of machine learning methods in data mining can be a difficult task based on the target problem and goals of a particular study. Numerous publicly available real-world and simulated benchmark datasets have emerged from different sources, but their organization and adoption as standards have been inconsistent. As such, selecting and curating specific benchma… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

    Comments: 14 pages, 5 figures, submitted for review to JMLR

  14. arXiv:1603.06212  [pdf, other

    cs.NE cs.AI cs.LG

    Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science

    Authors: Randal S. Olson, Nathan Bartley, Ryan J. Urbanowicz, Jason H. Moore

    Abstract: As the field of data science continues to grow, there will be an ever-increasing demand for tools that make machine learning accessible to non-experts. In this paper, we introduce the concept of tree-based pipeline optimization for automating one of the most tedious parts of machine learning---pipeline design. We implement an open source Tree-based Pipeline Optimization Tool (TPOT) in Python and d… ▽ More

    Submitted 20 March, 2016; originally announced March 2016.

    Comments: 8 pages, 5 figures, preprint to appear in GECCO 2016, edits not yet made from reviewer comments

  15. arXiv:1601.07925  [pdf, other

    cs.LG cs.NE

    Automating biomedical data science through tree-based pipeline optimization

    Authors: Randal S. Olson, Ryan J. Urbanowicz, Peter C. Andrews, Nicole A. Lavender, La Creis Kidd, Jason H. Moore

    Abstract: Over the past decade, data science and machine learning has grown from a mysterious art form to a staple tool across a variety of fields in academia, business, and government. In this paper, we introduce the concept of tree-based pipeline optimization for automating one of the most tedious parts of machine learning---pipeline design. We implement a Tree-based Pipeline Optimization Tool (TPOT) and… ▽ More

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: 16 pages, 5 figures, to appear in EvoBIO 2016 proceedings