
Showing 1–21 of 21 results for author: Duncan, J

Searching in archive stat.
  1. arXiv:2405.00669  [pdf, other]

    astro-ph.CO physics.data-an stat.CO

    Euclid preparation. LensMC, weak lensing cosmic shear measurement with forward modelling and Markov Chain Monte Carlo sampling

    Authors: Euclid Collaboration, G. Congedo, L. Miller, A. N. Taylor, N. Cross, C. A. J. Duncan, T. Kitching, N. Martinet, S. Matthew, T. Schrabback, M. Tewes, N. Welikala, N. Aghanim, A. Amara, S. Andreon, N. Auricchio, M. Baldi, S. Bardelli, R. Bender, C. Bodendorf, D. Bonino, E. Branchini, M. Brescia, J. Brinchmann, S. Camera, et al. (217 additional authors not shown)

    Abstract: LensMC is a weak lensing shear measurement method developed for Euclid and Stage-IV surveys. It is based on forward modelling to deal with convolution by a point spread function with comparable size to many galaxies; sampling the posterior distribution of galaxy parameters via Markov Chain Monte Carlo; and marginalisation over nuisance parameters for each of the 1.5 billion galaxies observed by Eu…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 28 pages, 18 figures, 2 tables

  2. arXiv:2403.08971  [pdf, other]

    stat.CO

    Designing a Data Science simulation with MERITS: A Primer

    Authors: Corrine F Elliott, James Duncan, Tiffany M Tang, Merle Behr, Karl Kumbier, Bin Yu

    Abstract: Simulations play a crucial role in the modern scientific process. Yet despite (or due to) their ubiquity, the Data Science community shares neither a comprehensive definition for a "high-quality" study nor a consolidated guide to designing one. Inspired by the Predictability-Computability-Stability (PCS) framework for 'veridical' Data Science, we propose six MERITS that a Data Science simulation s…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 26 pages (main text); 1 figure; 2 tables; *Authors contributed equally to this manuscript; **Authors contributed equally to this manuscript

  3. arXiv:2210.09352  [pdf, other]

    stat.ML cs.AI cs.LG math.ST

    A Mixing Time Lower Bound for a Simplified Version of BART

    Authors: Omer Ronen, Theo Saarinen, Yan Shuo Tan, James Duncan, Bin Yu

    Abstract: Bayesian Additive Regression Trees (BART) is a popular Bayesian non-parametric regression algorithm. The posterior is a distribution over sums of decision trees, and predictions are made by averaging approximate samples from the posterior. The combination of strong predictive performance and the ability to provide uncertainty measures has led BART to be commonly used in the social sciences, bios…

    Submitted 17 October, 2022; originally announced October 2022.

  4. arXiv:2205.15135  [pdf, other]

    cs.LG cs.AI stat.AP stat.ME stat.ML

    Group Probability-Weighted Tree Sums for Interpretable Modeling of Heterogeneous Data

    Authors: Keyan Nasseri, Chandan Singh, James Duncan, Aaron Kornblith, Bin Yu

    Abstract: Machine learning in high-stakes domains, such as healthcare, faces two critical challenges: (1) generalizing to diverse data distributions given limited training data while (2) maintaining interpretability. To address these challenges, we propose an instance-weighted tree-sum method that effectively pools data across diverse groups to output a concise, rule-based model. Given distinct groups of in…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.11931

  5. arXiv:2201.11931  [pdf, other]

    cs.LG cs.AI stat.AP stat.ME stat.ML

    Fast Interpretable Greedy-Tree Sums

    Authors: Yan Shuo Tan, Chandan Singh, Keyan Nasseri, Abhineet Agarwal, James Duncan, Omer Ronen, Matthew Epland, Aaron Kornblith, Bin Yu

    Abstract: Modern machine learning has achieved impressive prediction performance, but often sacrifices interpretability, a critical consideration in high-stakes domains such as medicine. In such settings, practitioners often use highly interpretable decision tree models, but these suffer from inductive bias against additive structure. To overcome this bias, we propose Fast Interpretable Greedy-Tree Sums (FI…

    Submitted 8 July, 2023; v1 submitted 27 January, 2022; originally announced January 2022.

  6. arXiv:2112.07341  [pdf, other]

    astro-ph.CO astro-ph.IM stat.AP

    Euclid: Covariance of weak lensing pseudo-$C_\ell$ estimates. Calculation, comparison to simulations, and dependence on survey geometry

    Authors: R. E. Upham, M. L. Brown, L. Whittaker, A. Amara, N. Auricchio, D. Bonino, E. Branchini, M. Brescia, J. Brinchmann, V. Capobianco, C. Carbone, J. Carretero, M. Castellano, S. Cavuoti, A. Cimatti, R. Cledassou, G. Congedo, L. Conversi, Y. Copin, L. Corcione, M. Cropper, A. Da Silva, H. Degaudenzi, M. Douspis, F. Dubath, et al. (80 additional authors not shown)

    Abstract: An accurate covariance matrix is essential for obtaining reliable cosmological results when using a Gaussian likelihood. In this paper we study the covariance of pseudo-$C_\ell$ estimates of tomographic cosmic shear power spectra. Using two existing publicly available codes in combination, we calculate the full covariance matrix, including mode-coupling contributions arising from both partial sky…

    Submitted 17 February, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 15 pages, 8 figures; matches version accepted by A&A; code available at https://github.com/robinupham/shear_pcl_cov

    Journal ref: A&A 660, A114 (2022)

  7. arXiv:2105.02869  [pdf, other]

    q-bio.QM cs.LG eess.IV stat.AP

    Estimating Reproducible Functional Networks Associated with Task Dynamics using Unsupervised LSTMs

    Authors: Nicha C. Dvornek, Pamela Ventola, James S. Duncan

    Abstract: We propose a method for estimating more reproducible functional networks that are more strongly associated with dynamic task activity by using recurrent neural networks with long short term memory (LSTMs). The LSTM model is trained in an unsupervised manner to learn to generate the functional magnetic resonance imaging (fMRI) time-series data in regions of interest. The learned functional networks…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: IEEE International Symposium on Biomedical Imaging (ISBI) 2020

    Journal ref: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI), 2020, p. 1395-1398

  8. arXiv:2104.09270  [pdf, ps, other]

    physics.soc-ph stat.AP

    Quantifying changes in the British cattle movement network

    Authors: Andrew J Duncan, Aaron Reeves, George J Gunn, Roger W Humphry

    Abstract: The Cattle Tracing System database is an online recording system for cattle births, deaths and between-herd movements in the United Kingdom. Although it has been thoroughly examined, the most recently reported movement analysis is from 2009. This article uses the database to construct weighted directed monthly movement networks for two distinct periods of time, 2004–2006 and 2015–2017, to quant…

    Submitted 4 March, 2021; originally announced April 2021.

  9. arXiv:2104.07654  [pdf, other]

    cs.LG cs.CV eess.IV q-bio.QM stat.AP

    Demographic-Guided Attention in Recurrent Neural Networks for Modeling Neuropathophysiological Heterogeneity

    Authors: Nicha C. Dvornek, Xiaoxiao Li, Juntang Zhuang, Pamela Ventola, James S. Duncan

    Abstract: Heterogeneous presentation of a neurological disorder suggests potential differences in the underlying pathophysiological changes that occur in the brain. We propose to model heterogeneous patterns of functional network differences using a demographic-guided attention (DGA) mechanism for recurrent neural network models for prediction from functional magnetic resonance imaging (fMRI) time-series da…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: MLMI 2020 (MICCAI Workshop)

  10. arXiv:2010.07468  [pdf, other]

    cs.LG cs.CV stat.ML

    AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients

    Authors: Juntang Zhuang, Tommy Tang, Yifan Ding, Sekhar Tatikonda, Nicha Dvornek, Xenophon Papademetris, James S. Duncan

    Abstract: Most popular optimizers for deep learning can be broadly categorized as adaptive methods (e.g. Adam) and accelerated schemes (e.g. stochastic gradient descent (SGD) with momentum). For many models such as convolutional neural networks (CNNs), adaptive methods typically converge faster but generalize worse compared to SGD; for complex settings such as generative adversarial networks (GANs), adaptiv…

    Submitted 20 December, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Journal ref: NeurIPS 2020

  11. arXiv:2007.14589  [pdf, other]

    cs.CV cs.LG stat.ML

    Pooling Regularized Graph Neural Network for fMRI Biomarker Analysis

    Authors: Xiaoxiao Li, Yuan Zhou, Nicha C. Dvornek, Muhan Zhang, Juntang Zhuang, Pamela Ventola, James S Duncan

    Abstract: Understanding how certain brain regions relate to a specific neurological disorder has been an important area of neuroimaging research. A promising approach to identify the salient regions is using Graph Neural Networks (GNNs), which can be used to analyze graph structured data, e.g. brain networks constructed by functional magnetic resonance imaging (fMRI). We propose an interpretable GNN framewo…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: 11 pages, 4 figures

    Journal ref: MICCAI 2020

  12. arXiv:2006.02493  [pdf]

    stat.ML cs.LG

    Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE

    Authors: Juntang Zhuang, Nicha Dvornek, Xiaoxiao Li, Sekhar Tatikonda, Xenophon Papademetris, James Duncan

    Abstract: Neural ordinary differential equations (NODEs) have recently attracted increasing attention; however, their empirical performance on benchmark tasks (e.g. image classification) is significantly inferior to discrete-layer models. We demonstrate an explanation for their poorer performance is the inaccuracy of existing gradient estimation methods: the adjoint method has numerical errors in reverse-m…

    Submitted 3 June, 2020; originally announced June 2020.

    Journal ref: https://proceedings.icml.cc/static/paper_files/icml/2020/917-Paper.pdf

  13. Curating a COVID-19 data repository and forecasting county-level death counts in the United States

    Authors: Nick Altieri, Rebecca L. Barter, James Duncan, Raaz Dwivedi, Karl Kumbier, Xiao Li, Robert Netzorg, Briton Park, Chandan Singh, Yan Shuo Tan, Tiffany Tang, Yu Wang, Chao Zhang, Bin Yu

    Abstract: As the COVID-19 outbreak evolves, accurate forecasting continues to play an extremely important role in informing policy decisions. In this paper, we present our continuous curation of a large data repository containing COVID-19 information from a range of sources. We use this data to develop predictions and corresponding prediction intervals for the short-term trajectory of COVID-19 cumulative de…

    Submitted 9 August, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: Authors ordered alphabetically. All authors contributed significantly to this work. All collected data, modeling code, forecasts, and visualizations are updated daily and available at https://github.com/Yu-Group/covid19-severity-prediction

    Journal ref: Published in Harvard Data Science Review, 2020

  14. arXiv:1910.06950  [pdf, other]

    eess.IV cs.LG q-bio.NC stat.AP stat.ML

    Jointly Discriminative and Generative Recurrent Neural Networks for Learning from fMRI

    Authors: Nicha C. Dvornek, Xiaoxiao Li, Juntang Zhuang, James S. Duncan

    Abstract: Recurrent neural networks (RNNs) were designed for dealing with time-series data and have recently been used for creating predictive models from functional magnetic resonance imaging (fMRI) data. However, gathering large fMRI datasets for learning is a difficult task. Furthermore, network interpretability is unclear. To address these issues, we utilize multitask learning and design a novel RNN-bas…

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: 10th International Workshop on Machine Learning in Medical Imaging (MLMI 2019)

  15. arXiv:1910.00406  [pdf, other]

    cs.LG stat.ML

    Decision Explanation and Feature Importance for Invertible Networks

    Authors: Juntang Zhuang, Nicha C. Dvornek, Xiaoxiao Li, Junlin Yang, James S. Duncan

    Abstract: Deep neural networks are vulnerable to adversarial attacks and hard to interpret because of their black-box nature. The recently proposed invertible network is able to accurately reconstruct the inputs to a layer from its outputs, thus has the potential to unravel the black-box model. An invertible network classifier can be viewed as a two-stage model: (1) invertible transformation from input spac…

    Submitted 14 October, 2019; v1 submitted 29 September, 2019; originally announced October 2019.

    Comments: Correct notations

    Journal ref: ICCVW 2019

  16. arXiv:1908.04769  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Graph Embedding Using Infomax for ASD Classification and Brain Functional Difference Detection

    Authors: Xiaoxiao Li, Nicha C. Dvornek, Juntang Zhuang, Pamela Ventola, James Duncan

    Abstract: Significant progress has been made using fMRI to characterize the brain changes that occur in ASD, a complex neuro-developmental disorder. However, due to the high dimensionality and low signal-to-noise ratio of fMRI, embedding informative and robust brain regional fMRI representations for both graph-level classification and region-level functional difference detection tasks between ASD and health…

    Submitted 13 August, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

  17. arXiv:1907.01661  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Graph Neural Network for Interpreting Task-fMRI Biomarkers

    Authors: Xiaoxiao Li, Nicha C. Dvornek, Yuan Zhou, Juntang Zhuang, Pamela Ventola, James S. Duncan

    Abstract: Finding the biomarkers associated with ASD is helpful for understanding the underlying roots of the disorder and can lead to earlier diagnosis and more targeted treatment. A promising approach to identify biomarkers is using Graph Neural Networks (GNNs), which can be used to analyze graph structured data, i.e. brain networks constructed by fMRI. One way to interpret important features is through l…

    Submitted 11 July, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Journal ref: Medical Image Computing and Computer-Assisted Intervention 2019

  18. arXiv:1812.06181  [pdf, other]

    cs.CV cs.LG stat.ML

    Efficient Interpretation of Deep Learning Models Using Graph Structure and Cooperative Game Theory: Application to ASD Biomarker Discovery

    Authors: Xiaoxiao Li, Nicha C. Dvornek, Yuan Zhou, Juntang Zhuang, Pamela Ventola, James S. Duncan

    Abstract: Discovering imaging biomarkers for autism spectrum disorder (ASD) is critical to help explain ASD and predict or monitor treatment outcomes. Toward this end, deep learning classifiers have recently been used for identifying ASD from functional magnetic resonance imaging (fMRI) with higher accuracy than traditional learning strategies. However, a key challenge with deep learning models is understan…

    Submitted 13 March, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: 12 pages, 7 figures, accepted as a full paper in IPMI 2019

  19. Prediction of severity and treatment outcome for ASD from fMRI

    Authors: Juntang Zhuang, Nicha C. Dvornek, Xiaoxiao Li, Pamela Ventola, James S. Duncan

    Abstract: Autism spectrum disorder (ASD) is a complex neurodevelopmental syndrome. Early diagnosis and precise treatment are essential for ASD patients. Although researchers have built many analytical models, there has been limited progress in accurate predictive models for early diagnosis. In this project, we aim to build an accurate model to predict treatment outcome and ASD severity from early stage func…

    Submitted 28 October, 2018; originally announced October 2018.

    Journal ref: International Workshop on Predictive Intelligence In Medicine, pp 9-17, 2018, Springer

  20. arXiv:1810.07809  [pdf, other]

    q-bio.NC stat.ME

    Prediction of treatment outcome for autism from structure of the brain based on sure independence screening

    Authors: Juntang Zhuang, Nicha C. Dvornek, Qingyu Zhao, Xiaoxiao Li, Pamela Ventola, James S. Duncan

    Abstract: Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder, and behavioral treatment interventions have shown promise for young children with ASD. However, there is limited progress in understanding the effect of each type of treatment. In this project, we aim to detect structural changes in the brain after treatment and select structural features associated with treatment outcomes. T…

    Submitted 25 February, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

    Journal ref: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) 2019 Apr 8 (pp. 404-408). IEEE

  21. arXiv:1805.09799  [pdf, other]

    stat.AP cs.CV

    Prediction of Autism Treatment Response from Baseline fMRI using Random Forests and Tree Bagging

    Authors: Nicha C. Dvornek, Daniel Yang, Archana Venkataraman, Pamela Ventola, Lawrence H. Staib, Kevin A. Pelphrey, James S. Duncan

    Abstract: Treating children with autism spectrum disorders (ASD) with behavioral interventions, such as Pivotal Response Treatment (PRT), has shown promise in recent studies. However, deciding which therapy is best for a given patient is largely by trial and error, and choosing an ineffective intervention results in loss of valuable treatment time. We propose predicting patient response to PRT from baseline…

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: Multimodal Learning for Clinical Decision Support (ML-CDS) 2016