Showing 1–27 of 27 results for author: Dalmasso, N

  1. arXiv:2403.11428  [pdf, other]

    astro-ph.GA

    The rate and contribution of mergers to mass assembly from NIRCam observations of galaxy candidates up to 13.3 billion years ago

    Authors: Nicolò Dalmasso, Antonello Calabrò, Nicha Leethochawalit, Benedetta Vulcani, Kristan Boyett, Michele Trenti, Tommaso Treu, Marco Castellano, Maruša Bradač, Benjamin Metha, Paola Santini

    Abstract: We present an analysis of the galaxy merger rate in the redshift range $4.0<z<9.0$ (i.e. about 1.5 to 0.5 Gyr after the Big Bang) based on visually identified galaxy mergers from morphological parameter analysis. Our dataset is based on high-resolution NIRCam JWST data (F150W and F200W broad-band filters) in the low-to-moderate magnification ($μ<2$) regions of the Abell 2744 cluster field. From a…

    Submitted 17 March, 2024; originally announced March 2024.

  2. arXiv:2402.18052  [pdf, other]

    astro-ph.GA

    Galaxy clustering at cosmic dawn from JWST/NIRCam observations to redshift z$\sim$11

    Authors: Nicolò Dalmasso, Nicha Leethochawalit, Michele Trenti, Kristan Boyett

    Abstract: We report measurements of the galaxy two-point correlation function at cosmic dawn, using photometrically-selected sources from the JWST Advanced Deep Extragalactic Survey (JADES). The JWST/NIRCam dataset comprises approximately $N_g \simeq 7000$ photometrically-selected Lyman Break Galaxies (LBGs), spanning from $z=5.5$ up to $z=10.6$. The primary objective of this study is to extend clustering m…

    Submitted 28 February, 2024; originally announced February 2024.

  3. arXiv:2401.00081  [pdf, other]

    cs.LG q-fin.GN

    Synthetic Data Applications in Finance

    Authors: Vamsi K. Potluru, Daniel Borrajo, Andrea Coletta, Niccolò Dalmasso, Yousef El-Laham, Elizabeth Fons, Mohsen Ghassemi, Sriram Gopalakrishnan, Vikesh Gosai, Eleonora Kreačić, Ganapathy Mani, Saheed Obitayo, Deepak Paramanand, Natraj Raman, Mikhail Solonin, Srijan Sood, Svitlana Vyetrenko, Haibei Zhu, Manuela Veloso, Tucker Balch

    Abstract: Synthetic data has made tremendous strides in various commercial settings including finance, healthcare, and virtual reality. We present a broad overview of prototypical applications of synthetic data in the financial sector and in particular provide richer details for a few select ones. These cover a wide variety of data modalities including tabular, time-series, event-series, and unstructured ar…

    Submitted 20 March, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 50 pages, journal submission; updated 6 privacy levels

  4. arXiv:2312.12329  [pdf, other]

    astro-ph.GA

    Galaxy clustering measurements out to redshift z$\sim$8 from Hubble Legacy Fields

    Authors: Nicolò Dalmasso, Michele Trenti, Nicha Leethochawalit

    Abstract: We present a novel approach for measuring the two-point correlation function of galaxies in narrow pencil beam surveys with varying depths. Our methodology is utilized to expand high-redshift galaxy clustering investigations up to $z \sim 8$ by analyzing a comprehensive sample consisting of $N_g = 160$ Lyman break galaxy candidates obtained through optical and near-infrared photometric data within…

    Submitted 19 December, 2023; originally announced December 2023.

  5. arXiv:2311.05436  [pdf, other]

    stat.ML cs.CY cs.LG

    Fair Wasserstein Coresets

    Authors: Zikai Xiong, Niccolò Dalmasso, Shubham Sharma, Freddy Lecue, Daniele Magazzeni, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

    Abstract: Data distillation and coresets have emerged as popular approaches to generate a smaller representative set of samples for downstream learning tasks to handle large-scale datasets. At the same time, machine learning is being increasingly applied to decision-making processes at a societal level, making it imperative for modelers to address inherent biases towards subgroups present in the data. While…

    Submitted 4 June, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 28 pages, 7 figures, 7 tables

  6. arXiv:2311.00109  [pdf, other]

    cs.LG stat.ML

    FairWASP: Fast and Optimal Fair Wasserstein Pre-processing

    Authors: Zikai Xiong, Niccolò Dalmasso, Alan Mishler, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

    Abstract: Recent years have seen a surge of machine learning approaches aimed at reducing disparities in model outputs across different subgroups. In many settings, training data may be used in multiple downstream applications by different users, which means it may be most effective to intervene on the training data itself. In this work, we present FairWASP, a novel pre-processing approach designed to reduc…

    Submitted 8 February, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Accepted at AAAI 2024, Main Track. 15 pages, 4 figures, 1 table

  7. arXiv:2306.07235  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Deep Gaussian Mixture Ensembles

    Authors: Yousef El-Laham, Niccolò Dalmasso, Elizabeth Fons, Svitlana Vyetrenko

    Abstract: This work introduces a novel probabilistic deep learning technique called deep Gaussian mixture ensembles (DGMEs), which enables accurate quantification of both epistemic and aleatoric uncertainty. By assuming the data generating process follows that of a Gaussian mixture, DGMEs are capable of approximating complex probability distributions, such as heavy-tailed or multimodal distributions. Our co…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted at Uncertainty in Artificial Intelligence (UAI) 2023 Conference, 7 figures, 11 tables

  8. arXiv:2303.00306  [pdf, other]

    astro-ph.GA

    A massive interacting galaxy 510 million years after the Big Bang

    Authors: Kristan Boyett, Michele Trenti, Nicha Leethochawalit, Antonello Calabró, Benjamin Metha, Guido Roberts-Borsani, Nicoló Dalmasso, Lilan Yang, Paola Santini, Tommaso Treu, Tucker Jones, Alaina Henry, Charlotte A. Mason, Takahiro Morishita, Themiya Nanayakkara, Namrata Roy, Xin Wang, Adriano Fontana, Emiliano Merlin, Marco Castellano, Diego Paris, Marusa Bradac, Danilo Marchesini, Sara Mascia, Laura Pentericci, et al. (2 additional authors not shown)

    Abstract: JWST observations confirm the existence of galaxies as early as 300Myr and at a higher number density than expected based on galaxy formation models and HST observations. Yet, sources confirmed spectroscopically in the first 500Myr have estimated stellar masses $<5\times10^8M_\odot$, limiting the signal to noise ratio (SNR) for investigating substructure. We present a high-resolution spectroscopic…

    Submitted 26 February, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 52 pages, 10 figures. This version of the article has been accepted for publication, after peer review and is subject to Springer Nature's AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1038/s41550-024-02218-7

  9. arXiv:2212.06081  [pdf, other]

    cs.LG math.OC

    Fast Learning of Multidimensional Hawkes Processes via Frank-Wolfe

    Authors: Renbo Zhao, Niccolò Dalmasso, Mohsen Ghassemi, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

    Abstract: Hawkes processes have recently risen to the forefront of tools when it comes to modeling and generating sequential events data. Multidimensional Hawkes processes model both the self and cross-excitation between different types of events and have been applied successfully in various domains such as finance, epidemiology and personalized recommendations, among others. In this work we present an adapt…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Presented at the NeurIPS 2022 Workshop on Synthetic Data for Empowering ML Research. 9 pages, 3 figures, 4 tables

  10. arXiv:2208.07961  [pdf, other]

    stat.ML cs.LG cs.SI

    Online Learning for Mixture of Multivariate Hawkes Processes

    Authors: Mohsen Ghassemi, Niccolò Dalmasso, Simran Lamba, Vamsi K. Potluru, Sameena Shah, Tucker Balch, Manuela Veloso

    Abstract: Online learning of Hawkes processes has received increasing attention in the last couple of years especially for modeling a network of actors. However, these works typically either model the rich interaction between the events or the latent cluster of the actors or the network structure between the actors. We propose to model the latent structure of the network of actors as well as their rich inte…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 12 pages, 6 figures, 3 tables

    Journal ref: ICAIF 22: 3rd ACM International Conference on AI in Finance, November 2022, Pages 506-513

  11. arXiv:2207.13741  [pdf, other]

    stat.ML cs.LG

    Differentially Private Learning of Hawkes Processes

    Authors: Mohsen Ghassemi, Eleonora Kreačić, Niccolò Dalmasso, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

    Abstract: Hawkes processes have recently gained increasing attention from the machine learning community for their versatility in modeling event sequence data. While they have a rich history going back decades, some of their properties, such as sample complexity for learning the parameters and releasing differentially private versions, are yet to be thoroughly analyzed. In this work, we study standard Hawke…

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 30 pages, 4 figures

  12. Structural Forecasting for Short-term Tropical Cyclone Intensity Guidance

    Authors: Trey McNeely, Pavel Khokhlov, Niccolo Dalmasso, Kimberly M. Wood, Ann B. Lee

    Abstract: Because geostationary satellite (Geo) imagery provides a high temporal resolution window into tropical cyclone (TC) behavior, we investigate the viability of its application to short-term probabilistic forecasts of TC convective structure to subsequently predict TC intensity. Here, we present a prototype model which is trained solely on two inputs: Geo infrared imagery leading up to the synoptic t…

    Submitted 8 April, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  13. arXiv:2203.09167  [pdf, other]

    cs.CV

    Unsigned Distance Field as an Accurate 3D Scene Representation for Neural Scene Completion

    Authors: Jean Pierre Richa, Jean-Emmanuel Deschaud, François Goulette, Nicolas Dalmasso

    Abstract: Scene Completion is the task of completing missing geometry from a partial scan of a scene. Most previous methods compute an implicit representation from range data using a Truncated Signed Distance Function (T-SDF) computed on a 3D grid as input to neural networks. The truncation decreases but does not remove the border errors introduced by the sign of SDF for open surfaces. As an alternative, we…

    Submitted 2 December, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: 8 pages, 7 figures, 5 tables

  14. arXiv:2203.09155  [pdf, other]

    cs.RO

    AdaSplats: Adaptive Splatting of Point Clouds for Accurate 3D Modeling and Real-time High-Fidelity LiDAR Simulation

    Authors: Jean Pierre Richa, Jean-Emmanuel Deschaud, François Goulette, Nicolas Dalmasso

    Abstract: LiDAR sensors provide rich 3D information about their surroundings and are becoming increasingly important for autonomous vehicles tasks such as localization, semantic segmentation, object detection, and tracking. Simulation accelerates the testing, validation, and deployment of autonomous vehicles while also reducing cost and eliminating the risks of testing in real-world scenarios. We ad…

    Submitted 26 December, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: 30 pages, 14 figures, 6 tables

  15. arXiv:2202.05049  [pdf, other]

    stat.ML cs.LG

    Fair When Trained, Unfair When Deployed: Observable Fairness Measures are Unstable in Performative Prediction Settings

    Authors: Alan Mishler, Niccolò Dalmasso

    Abstract: Many popular algorithmic fairness measures depend on the joint distribution of predictions, outcomes, and a sensitive feature like race or gender. These measures are sensitive to distribution shift: a predictor which is trained to satisfy one of these fairness definitions may become unfair if the distribution changes. In performative prediction settings, however, predictors are precisely intended…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 11 pages, 3 figures. Presented at the workshop on Algorithmic Fairness through the Lens of Causality and Robustness, NeurIPS 2021

  16. arXiv:2107.03920  [pdf, other]

    stat.ML cs.LG

    Likelihood-Free Frequentist Inference: Bridging Classical Statistics and Machine Learning for Reliable Simulator-Based Inference

    Authors: Niccolò Dalmasso, Luca Masserano, David Zhao, Rafael Izbicki, Ann B. Lee

    Abstract: Many areas of science make extensive use of computer simulators that implicitly encode intractable likelihood functions of complex systems. Classical statistical methods are poorly suited for these so-called likelihood-free inference (LFI) settings, especially outside asymptotic and low-dimensional regimes. At the same time, traditional LFI methods - such as Approximate Bayesian Computation or mor…

    Submitted 19 November, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: 45 pages, 6 figures, code available at https://github.com/lee-group-cmu/lf2i, supplementary material available at https://lucamasserano.github.io/data/LF2I_supplementary_material.pdf

  17. arXiv:2104.01921  [pdf, other]

    stat.ME

    When the Oracle Misleads: Modeling the Consequences of Using Observable Rather than Potential Outcomes in Risk Assessment Instruments

    Authors: Alan Mishler, Niccolò Dalmasso

    Abstract: Risk Assessment Instruments (RAIs) are widely used to forecast adverse outcomes in domains such as healthcare and criminal justice. RAIs are commonly trained on observational data and are optimized to predict observable outcomes rather than potential outcomes, which are the outcomes that would occur absent a particular intervention. Examples of relevant potential outcomes include whether a patient…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: 6 pages, 3 figures. Presented at the workshop "'Do the right thing': machine learning and causal inference for improved decision making," NeurIPS 2019

  18. arXiv:2102.10473  [pdf, other]

    stat.ME

    Diagnostics for Conditional Density Models and Bayesian Inference Algorithms

    Authors: David Zhao, Niccolò Dalmasso, Rafael Izbicki, Ann B. Lee

    Abstract: There has been growing interest in the AI community for precise uncertainty quantification. Conditional density models f(y|x), where x represents potentially high-dimensional features, are an integral part of uncertainty quantification in prediction and Bayesian inference. However, it is challenging to assess conditional density estimates and gain insight into modes of failure. While existing diag…

    Submitted 23 July, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Appearing in 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021), Spotlight Talk; camera-ready version

  19. arXiv:2010.05783  [pdf, other]

    cs.LG stat.AP

    Structural Forecasting for Tropical Cyclone Intensity Prediction: Providing Insight with Deep Learning

    Authors: Trey McNeely, Niccolò Dalmasso, Kimberly M. Wood, Ann B. Lee

    Abstract: Tropical cyclone (TC) intensity forecasts are ultimately issued by human forecasters. The human-in-the-loop pipeline requires that any forecasting guidance must be easily digestible by TC experts if it is to be adopted at operational centers like the National Hurricane Center. Our proposed framework leverages deep learning to provide forecasters with something neither end-to-end prediction models…

    Submitted 7 December, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: To appear in the Tackling Climate Change with Machine Learning workshop at NeurIPS 2020 (Proposals Track). 3 pages, 1 figure

  20. arXiv:2010.04051  [pdf, other]

    stat.AP stat.ML

    HECT: High-Dimensional Ensemble Consistency Testing for Climate Models

    Authors: Niccolò Dalmasso, Galen Vincent, Dorit Hammerling, Ann B. Lee

    Abstract: Climate models play a crucial role in understanding the effect of environmental and man-made changes on climate to help mitigate climate risks and inform governmental decisions. Large global climate models such as the Community Earth System Model (CESM), developed by the National Center for Atmospheric Research, are very complex with millions of lines of code describing interactions of the atmosph…

    Submitted 30 November, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted at the Tackling Climate Change with Machine Learning workshop at NeurIPS 2020, 6 pages, 1 figure

  21. arXiv:2002.10399  [pdf, other]

    stat.ME cs.LG stat.ML

    Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference Setting

    Authors: Niccolò Dalmasso, Rafael Izbicki, Ann B. Lee

    Abstract: Parameter estimation, statistical tests and confidence sets are the cornerstones of classical statistics that allow scientists to make inferences about the underlying process that generated the observed data. A key question is whether one can still construct hypothesis tests and confidence sets with proper coverage and high power in a so-called likelihood-free inference (LFI) setting; that is, a s…

    Submitted 13 August, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: 20 pages, 8 figures, 6 tables, 4 algorithm boxes

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, PMLR 119:2323-2334, 2020

  22. arXiv:1912.03896  [pdf, other]

    cs.LG eess.SP stat.ML

    Explicit Group Sparse Projection with Applications to Deep Learning and NMF

    Authors: Riyasat Ohib, Nicolas Gillis, Niccolò Dalmasso, Sameena Shah, Vamsi K. Potluru, Sergey Plis

    Abstract: We design a new sparse projection method for a set of vectors that guarantees a desired average sparsity level measured leveraging the popular Hoyer measure (an affine function of the ratio of the $\ell_1$ and $\ell_2$ norms). Existing approaches either project each vector individually or require the use of a regularization parameter which implicitly maps to the average $\ell_0$-measure of sparsit…

    Submitted 18 February, 2022; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: 20 pages, 10 figures; major revisions; affiliation corrected, grant added

  23. arXiv:1910.08597  [pdf, other]

    stat.ML cs.LG math.OC stat.ME

    Robust Learning Rate Selection for Stochastic Optimization via Splitting Diagnostic

    Authors: Matteo Sordello, Niccolò Dalmasso, Hangfeng He, Weijie Su

    Abstract: This paper proposes SplitSGD, a new dynamic learning rate schedule for stochastic optimization. This method decreases the learning rate for better adaptation to the local geometry of the objective function whenever a stationary phase is detected, that is, the iterates are likely to bounce around in the vicinity of a local minimum. The detection is performed by splitting the single thread into two an…

    Submitted 16 February, 2024; v1 submitted 18 October, 2019; originally announced October 2019.

  24. arXiv:1908.11523  [pdf, other]

    astro-ph.IM stat.CO stat.ML

    Conditional Density Estimation Tools in Python and R with Applications to Photometric Redshifts and Likelihood-Free Cosmological Inference

    Authors: Niccolò Dalmasso, Taylor Pospisil, Ann B. Lee, Rafael Izbicki, Peter E. Freeman, Alex I. Malz

    Abstract: It is well known in astronomy that propagating non-Gaussian prediction uncertainty in photometric redshift estimates is key to reducing bias in downstream cosmological analyses. Similarly, likelihood-free inference approaches, which are beginning to emerge as a tool for cosmological analysis, require a characterization of the full uncertainty landscape of the parameters of interest given observed…

    Submitted 20 December, 2019; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: 27 pages, 7 figures, 4 tables

  25. arXiv:1906.08832  [pdf, other]

    stat.AP

    A Flexible Pipeline for Prediction of Tropical Cyclone Paths

    Authors: Niccolò Dalmasso, Robin Dunn, Benjamin LeRoy, Chad Schafer

    Abstract: Hurricanes and, more generally, tropical cyclones (TCs) are rare, complex natural phenomena of both scientific and public interest. The importance of understanding TCs in a changing climate has increased as recent TCs have had devastating impacts on human lives and communities. Moreover, good prediction and understanding about the complex nature of TCs can mitigate some of these human and property…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 4 pages. The first three authors contributed equally. Presented at the ICML 2019 Workshop on "Climate Change: How can AI Help?"

  26. arXiv:1905.11505  [pdf, other]

    stat.ME stat.ML

    Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations

    Authors: Niccolò Dalmasso, Ann B. Lee, Rafael Izbicki, Taylor Pospisil, Ilmun Kim, Chieh-An Lin

    Abstract: Complex phenomena in engineering and the sciences are often modeled with computationally intensive feed-forward simulations for which a tractable analytic likelihood does not exist. In these cases, it is sometimes necessary to estimate an approximate likelihood or fit a fast emulator model for efficient statistical inference; such surrogate models include Gaussian synthetic likelihoods and more re…

    Submitted 2 December, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: 22 pages, 9 Figures, 2 Tables

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108, 3349-3361, 2020

  27. Clarifying the Hubble constant tension with a Bayesian hierarchical model of the local distance ladder

    Authors: Stephen M. Feeney, Daniel J. Mortlock, Niccolò Dalmasso

    Abstract: Estimates of the Hubble constant, $H_0$, from the distance ladder and the cosmic microwave background (CMB) differ at the $\sim$3-$σ$ level, indicating a potential issue with the standard $Λ$CDM cosmology. Interpreting this tension correctly requires a model comparison calculation depending on not only the traditional `$n$-$σ$' mismatch but also the tails of the likelihoods. Determining the form o…

    Submitted 8 November, 2017; v1 submitted 30 June, 2017; originally announced July 2017.

    Comments: 24 pages, 14 figures, matches version submitted to MNRAS. The model code used in this analysis is available for download at https://github.com/sfeeney/hh0