Search | arXiv e-print repository

Magic Insert: Style-Aware Drag-and-Drop

Authors: Nataniel Ruiz, Yuanzhen Li, Neal Wadhwa, Yael Pritch, Michael Rubinstein, David E. Jacobs, Shlomi Fruchter

Abstract: We present Magic Insert, a method for dragging-and-drop** subjects from a user-provided image into a target image of a different style in a physically plausible manner while matching the style of the target image. This work formalizes the problem of style-aware drag-and-drop and presents a method for tackling it by addressing two sub-problems: style-aware personalization and realistic object ins… ▽ More We present Magic Insert, a method for dragging-and-drop** subjects from a user-provided image into a target image of a different style in a physically plausible manner while matching the style of the target image. This work formalizes the problem of style-aware drag-and-drop and presents a method for tackling it by addressing two sub-problems: style-aware personalization and realistic object insertion in stylized images. For style-aware personalization, our method first fine-tunes a pretrained text-to-image diffusion model using LoRA and learned text tokens on the subject image, and then infuses it with a CLIP representation of the target style. For object insertion, we use Bootstrapped Domain Adaption to adapt a domain-specific photorealistic object insertion model to the domain of diverse artistic styles. Overall, the method significantly outperforms traditional approaches such as inpainting. Finally, we present a dataset, SubjectPlop, to facilitate evaluation and future progress in this area. Project page: https://magicinsert.github.io/ △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: Project page: https://magicinsert.github.io/

arXiv:2406.12977 [pdf, other]

On the evolution of low-mass central galaxies in the vicinity of massive structures

Authors: Daniela Palma, Ivan Lacerna, M. Celeste Artale, Antonio D. Montero-Dorta, Andrés N. Ruiz, Sofía A. Cora, Facundo Rodriguez, Diego Pallero, Ana O'Mill, Nelvy Choque-Challapa

Abstract: We investigate low-mass central galaxies with Mstar = $10^{9.5}-10^{10}$ Msun/h, located near massive groups and galaxy clusters using the TNG300 and MDPL2-SAG simulations. We set out to study their evolution, aiming to find hints about the large-scale conformity signal they produce. We also use a control sample of low-mass central galaxies located far away from massive structures. For both sample… ▽ More We investigate low-mass central galaxies with Mstar = $10^{9.5}-10^{10}$ Msun/h, located near massive groups and galaxy clusters using the TNG300 and MDPL2-SAG simulations. We set out to study their evolution, aiming to find hints about the large-scale conformity signal they produce. We also use a control sample of low-mass central galaxies located far away from massive structures. For both samples, we find a sub-population of galaxies that were accreted by another halo in the past but are now considered central galaxies; we refer to these objects as former satellites. The fraction of former satellites is higher for quenched central galaxies near massive systems: 45% in TNG300 and 17% in MDPL2-SAG. Our results in TNG300 show that former satellites were typically hosted by massive dark matter halos (M200 $\geq 10^{13}$ Msun/h) at z$\sim$0.3, followed by a drop in halo mass at lower redshifts. In addition, we find a strong drop in the total gas mass at z$\leq$1 for quenched central galaxies near galaxy groups and clusters produced by these former satellites as well. By removing former satellites, the evolution of quenched central galaxies is fairly similar to those of the quenched control galaxies, showing small differences at low-z. For MDPL2-SAG, former satellites were hosted by less massive halos, with a mean halo mass around $10^{11}$ Msun/h, and the evolution remains equal before and after removing former satellites. We also measure the two-halo conformity, i.e., the correlation in the specific SFR between low-mass central galaxies and their neighbors at Mpc scales, and how former satellites contribute to the signal at z=0, 0.3, and 1. The conformity signal decreases from z=0 to z=1 in MDPL2-SAG but it increases in TNG300. However, after removing former satellites in TNG300, the signal is strongly reduced but almost does not change at z$\leq$0.3, and it disappears at z=1 (abridged). △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 16 pages, 13 figures, submitted to A&A

arXiv:2406.08539 [pdf, other]

Evolution map** II: describing statistics of the non-linear cosmic velocity field

Authors: Matteo Esposito, Ariel G. Sánchez, Julien Bel, Andrés N. Ruiz

Abstract: We extend the evolution map** approach, originally proposed by Sanchez (2022) to describe non-linear matter density fluctuations, to statistics of the cosmic velocity field. This framework classifies cosmological parameters into shape parameters, which determine the shape of the linear matter power spectrum, $P_L(k, z)$, and evolution parameters, which control its amplitude at any given redshift… ▽ More We extend the evolution map** approach, originally proposed by Sanchez (2022) to describe non-linear matter density fluctuations, to statistics of the cosmic velocity field. This framework classifies cosmological parameters into shape parameters, which determine the shape of the linear matter power spectrum, $P_L(k, z)$, and evolution parameters, which control its amplitude at any given redshift. Evolution map** leverages the fact that density fluctuations in cosmologies with identical shape parameters but different evolution parameters exhibit remarkably similar non-linear evolutions when expressed as a function of the clustering amplitude. We use a suite of N-body simulations sharing identical shape parameters but spanning a wide range of evolution parameters. Using an efficient method for estimating the volume-weighted velocity field based on the Voronoi tesselation of the simulation particles, we study the non-linear evolution of the power spectra of the velocity divergence, $P_{θθ}(k)$, and its cross-power spectrum with the density field, $P_{δθ}(k)$. By analysing snapshots at redshifts where the linear matter perturbations have the same amplitude, we demonstrate that evolution map** accurately applies to $P_{θθ}(k)$ and $P_{δθ}(k)$. Deviations at small scales can be modelled in terms of differences in the suppression factor, $g(a) = D(a)/a$, akin to those observed for the density field. Evolution map** simplifies the description of the cosmological dependence of non-linear density and velocity statistics, streamlining the sampling of large cosmological parameter spaces for the analysis of cosmological observables. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 10 pages, 4 figures, including an appendix. Prepared for submission to MNRAS, comments welcome

arXiv:2406.08346 [pdf, other]

LOFAR Deep Fields: Probing the sub-mJy regime of polarized extragalactic sources in ELAIS-N1. I. The catalog

Authors: S. Piras, C. Horellou, J. E. Conway, M. Thomasson, S. del Palacio, T. W. Shimwell, S. P. O'Sullivan, E. Carretti, I. Šnidaric, V. Jelic, B. Adebahr, A. Berger, P. N. Best, M. Brüggen, N. Herrera Ruiz, R. Paladino, I. Prandoni, J. Sabater, V. Vacca

Abstract: The aim of this study is to probe the sub-mJy polarized source population with LOFAR. We present the method used to stack LOFAR polarization datasets, the resulting catalog of polarized sources, and the derived polarized source counts. The ELAIS-N1 field was selected for a polarimetric study at 114.9-177.4 MHz. A total area of 25 deg2 was imaged at 6"- resolution in the Stokes Q and U parameters.… ▽ More The aim of this study is to probe the sub-mJy polarized source population with LOFAR. We present the method used to stack LOFAR polarization datasets, the resulting catalog of polarized sources, and the derived polarized source counts. The ELAIS-N1 field was selected for a polarimetric study at 114.9-177.4 MHz. A total area of 25 deg2 was imaged at 6"- resolution in the Stokes Q and U parameters. Alignment of polarization angles was done both in frequency and in Faraday space before stacking datasets from 19 eight-hour-long epochs. A search for polarized sources was carried out in the final, stacked dataset, and the properties of the detected sources were examined. The depolarization level of sources known to be polarized at 1.4 GHz was quantified. A one-sigma noise level of 19 μJy/beam was reached in the central part of the field after stacking. Twenty-five polarized sources were detected above 8σ, five of which had not been detected in polarization at any other radio frequencies before. Seven additional polarized components were found by lowering the threshold to 6σat positions corresponding to sources known to be polarized at 1.4 GHz. In two radio galaxies, polarization was detected from both radio lobes, so the final number of associated radio continuum sources is 31. The detected sources are weakly polarized, with a median degree of polarization of 1.75% for the sample of sources detected in polarized emission. The sources previously detected in polarization at 1.4 GHz are significantly depolarized at 150 MHz. The catalog is used to derive the polarized source counts at 150 MHz. This is the deepest and highest-resolution polarization study at 150 MHz to date. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 18 pages, 10 figures, accepted for publication in A&A

arXiv:2405.17401 [pdf, other]

RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control

Authors: Litu Rout, Yujia Chen, Nataniel Ruiz, Abhishek Kumar, Constantine Caramanis, Sanjay Shakkottai, Wen-Sheng Chu

Abstract: We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of styl… ▽ More We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of style and content. RB-Modulation is built on a novel stochastic optimal controller where a style descriptor encodes the desired attributes through a terminal cost. The resulting drift not only overcomes the difficulties above, but also ensures high fidelity to the reference style and adheres to the given text prompt. We also introduce a cross-attention-based feature aggregation scheme that allows RB-Modulation to decouple content and style from the reference image. With theoretical justification and empirical evidence, our framework demonstrates precise extraction and control of content and style in a training-free manner. Further, our method allows a seamless composition of content and style, which marks a departure from the dependency on external adapters or ControlNets. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: Preprint. Under review

arXiv:2405.01671 [pdf, other]

Evolution of HOD and galaxy properties in filaments and nodes of the cosmic web

Authors: Noelia R. Perez, Luis A. Pereyra, Georgina Coldwell, Ignacio G. Alfaro Facundo Rodriguez, Andrés N. Ruiz

Abstract: We study the evolution of the Halo Occupation Distribution (HOD) and galaxy properties of nodes and filamentary structures obtained by DisPerSE from the Illustris TNG300-1 hydrodynamical simulation, in the redshift range $0 \leq z \leq 2$. We compute the HOD in filaments and nodes and fit the HOD parameters to study their evolution, taking into account both faint and bright galaxies. In nodes, the… ▽ More We study the evolution of the Halo Occupation Distribution (HOD) and galaxy properties of nodes and filamentary structures obtained by DisPerSE from the Illustris TNG300-1 hydrodynamical simulation, in the redshift range $0 \leq z \leq 2$. We compute the HOD in filaments and nodes and fit the HOD parameters to study their evolution, taking into account both faint and bright galaxies. In nodes, the number of faint galaxies increases with the decreasing redshift in the low-mass halos, while no significant differences are seen in the high-mass halos. Limiting the HOD to bright galaxies shows that the halos increase in mass more than the number of bright galaxies they accrete. For filaments, no large differences in HOD are found for faint galaxies, although for brighter galaxies the behaviour is similar to that for nodes. The HOD parametrization suggests that filaments have no effect on the mass required to host a galaxy (central or satellite), whereas nodes do. The results of the study indicate that filaments do not seem to affect the stellar mass content of galaxies. In contrast, nodes appear to affect halos with masses below approximately $10^{12.5} h^{-1} M_{\odot}$ at local redshift. The analysis of the galaxy colour evolution shows a reddening towards lower redshift, although these processes seem to be more efficient in massive halos, with a strong effect on the bright galaxies. The general evolution suggests that the building of galaxy population within halos is influenced by both the accretion of faint galaxies and the mass growth of the bright ones. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: 13 pages, 5 figures

arXiv:2402.16094 [pdf]

Bistochastically private release of data streams with zero delay

Authors: Nicolas Ruiz

Abstract: Although the bulk of the research in privacy and statistical disclosure control is designed for static data, more and more data are often collected as continuous streams, and extensions of popular privacy tools and models have been proposed for this scenario. However, most of these proposals require buffers, where incoming individuals are momentarily stored, anonymized, and then released following… ▽ More Although the bulk of the research in privacy and statistical disclosure control is designed for static data, more and more data are often collected as continuous streams, and extensions of popular privacy tools and models have been proposed for this scenario. However, most of these proposals require buffers, where incoming individuals are momentarily stored, anonymized, and then released following a delay, thus considering a data stream as a succession of batches while it is by nature continuous. Having a delay unavoidably alters data freshness but also, more critically, inordinately exerts constraints on what can be achieved in terms of protection and information preservation. By considering randomized response, and specifically its recent bistochastic extension, in the context of dynamic data, this paper proposes a protocol for the anonymization of data streams that achieves zero delay while exhibiting formal privacy guarantees. Using a new tool in the privacy literature that introduces the concept of elementary plausible deniability, we show that it is feasible to achieve an atomic processing of individuals entering a stream, in-stead of proceeding by batches. We illustrate the application of the proposed approach by an empirical example. △ Less

Submitted 25 February, 2024; originally announced February 2024.

arXiv:2312.10146 [pdf, other]

Star Formation and Dust in the Cosmic Web

Authors: Massimiliano Parente, Cinthia Ragone-Figueroa, Pablo López, Héctor J. Martínez, Andrés N. Ruiz, Laura Ceccarelli, Valeria Coenda, Facundo Rodriguez, Gian Luigi Granato, Andrea Lapi, Rien van de Weygaert

Abstract: The large-scale environment of the cosmic web is believed to impact galaxy evolution, but there is still no consensus regarding the mechanisms. We use a semi-analytic model (SAM) galaxy catalog to study the star formation and dust content of local galaxies in different cosmic environments of the cosmic web, namely voids, filaments, walls, and nodes. We find a strong impact of the environment only… ▽ More The large-scale environment of the cosmic web is believed to impact galaxy evolution, but there is still no consensus regarding the mechanisms. We use a semi-analytic model (SAM) galaxy catalog to study the star formation and dust content of local galaxies in different cosmic environments of the cosmic web, namely voids, filaments, walls, and nodes. We find a strong impact of the environment only for galaxies with $M_{\rm stars}\lesssim10^{10.8}\, M_\odot$: the less dense the environment, the larger the star formation rate and dust content at fixed stellar mass. This is attributed to the fact that galaxies in less dense environments typically feature younger stellar populations, a slower evolution of their stellar mass and a delayed star formation compared to galaxies in denser environments. As for galaxies with $M_{\rm stars}\gtrsim 10^{10.8}\, M_\odot$ differences among environments are milder due to the disc instability (DI) driven supermassive black hole (SMBH) growth implemented in the SAM, which makes SMBH growth, and thus galaxy quenching, environment insensitive. We qualitatively test our predictions against observations by identifying environments in the SDSS-DR16 using dust masses derived from the GAMA survey. The agreement is encouraging, particularly at ${\rm log} \, M_{\rm stars}/M_\odot\gtrsim 10.5-11$, where sSFRs and dust masses appear quite environment-insensitive. This result confirms the importance of in situ growth channels of SMBHs. △ Less

Submitted 11 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

Comments: 21 pages, 12 figures, 3 tables. Accepted for publication in ApJ

arXiv:2311.13600 [pdf, other]

ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs

Authors: Viraj Shah, Nataniel Ruiz, Forrester Cole, Erika Lu, Svetlana Lazebnik, Yuanzhen Li, Varun Jampani

Abstract: Methods for finetuning generative models for concept-driven personalization generally achieve strong results for subject-driven or style-driven generation. Recently, low-rank adaptations (LoRA) have been proposed as a parameter-efficient way of achieving concept-driven personalization. While recent work explores the combination of separate LoRAs to achieve joint generation of learned styles and su… ▽ More Methods for finetuning generative models for concept-driven personalization generally achieve strong results for subject-driven or style-driven generation. Recently, low-rank adaptations (LoRA) have been proposed as a parameter-efficient way of achieving concept-driven personalization. While recent work explores the combination of separate LoRAs to achieve joint generation of learned styles and subjects, existing techniques do not reliably address the problem; they often compromise either subject fidelity or style fidelity. We propose ZipLoRA, a method to cheaply and effectively merge independently trained style and subject LoRAs in order to achieve generation of any user-provided subject in any user-provided style. Experiments on a wide range of subject and style combinations show that ZipLoRA can generate compelling results with meaningful improvements over baselines in subject and style fidelity while preserving the ability to recontextualize. Project page: https://ziplora.github.io △ Less

Submitted 22 November, 2023; originally announced November 2023.

Comments: Project page: https://ziplora.github.io

arXiv:2311.12928 [pdf, other]

Void Probability Function inside cosmic voids: evidence for hierarchical scaling of high-order correlations in real space

Authors: Federico Dávila-Kurbán, Andrés N. Ruiz, Dante Paz, Diego Garcia Lambas

Abstract: We compare the reduced void probability function (VPF) inside and outside of cosmic voids in the TNG300-1 simulation, both in real and simulated redshift space. The VPF is a special case of the counts-in-cells approach for extracting information of high-order clustering that is crucial for a full understanding of the distribution of galaxies. Previous studies have validated the hierarchical scalin… ▽ More We compare the reduced void probability function (VPF) inside and outside of cosmic voids in the TNG300-1 simulation, both in real and simulated redshift space. The VPF is a special case of the counts-in-cells approach for extracting information of high-order clustering that is crucial for a full understanding of the distribution of galaxies. Previous studies have validated the hierarchical scaling paradigm of galaxy clustering moments, in good agreement with the "negative binomial" model, in redshift surveys, but have also reported that this paradigm is not valid in real space. However, in this work we find that hierarchical scaling can indeed be found in real space inside cosmic voids. This is well fitted by the negative binomial model. We find this result to be robust against changes in void identification, galaxy mass, random dilutions, and redshift. We also obtain that the VPF in real space at high redshift approaches the negative binomial model, and therefore it is similar to the VPF inside voids at the present time. This study points, for the first time, towards evidence of hierarchical scaling of high-order clustering of galaxies in real space inside voids, preserving the pristine structure formation processes of the Universe. △ Less

Submitted 21 November, 2023; originally announced November 2023.

Comments: 12 pages, 8 figures. Accepted for publication by the MNRAS

arXiv:2311.06721 [pdf, other]

Compact groups from semi-analytical models of galaxy formation -- V: their assembly channels as a function of the environment

Authors: A. Taverna, E. Diaz-Gimenez, A. Zandivarez, H. J. Martinez, A. N. Ruiz

Abstract: We delved into the assembly pathways and environments of compact groups (CGs) of galaxies using mock catalogues generated from semi-analytical models (SAMs) on the Millennium simulation. We investigate the ability of SAMs to replicate the observed CG environments and whether CGs with different assembly histories tend to inhabit specific cosmic environments. We also analyse whether the environment… ▽ More We delved into the assembly pathways and environments of compact groups (CGs) of galaxies using mock catalogues generated from semi-analytical models (SAMs) on the Millennium simulation. We investigate the ability of SAMs to replicate the observed CG environments and whether CGs with different assembly histories tend to inhabit specific cosmic environments. We also analyse whether the environment or the assembly history is more important in tailoring CG properties. We find that about half of the CGs in SAMs are non-embedded systems, 40% are inhabiting loose groups or nodes of filaments, while the rest distribute evenly in filaments and voids, in agreement with observations. We observe that early-assembled CGs preferentially inhabit large galaxy systems (~ 60%), while around 30% remain non-embedded. Conversely, lately-formed CGs exhibit the opposite trend. We also obtain that lately-formed CGs have lower velocity dispersions and larger crossing times than early-formed CGs, but mainly because they are preferentially non-embedded. Those lately-formed CGs that inhabit large systems do not show the same features. Therefore, the environment plays a strong role in these properties for lately-formed CGs. Early-formed CGs are more evolved, displaying larger velocity dispersions, shorter crossing times, and more dominant first-ranked galaxies, regardless of the environment. Finally, the difference in brightness between the two brightest members of CGs is dependent only on the assembly history and not on the environment. CGs residing in diverse environments have undergone varied assembly processes, making them suitable for studying their evolution and the interplay of nature and nurture on their traits. △ Less

Submitted 14 November, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

Comments: 13 pages, 8 figures. Accepted for publication in MNRAS

arXiv:2310.19928 [pdf, ps, other]

Characterising HOD in filaments and nodes of the cosmic web

Authors: Noelia R. Perez, Luis A. Pereyra, Georgina Coldwell, Facundo Rodriguez, Ignacio G. Alfaro, Andrés N. Ruiz

Abstract: The standard paradigm for the formation of the Universe suggests that large structures are formed from hierarchical clustering by the continuous accretion of less massive galaxy systems through filaments. In this context, filamentary structures play an important role in the properties and evolution of galaxies by connecting high-density regions, such as nodes, and being surrounded by low-density r… ▽ More The standard paradigm for the formation of the Universe suggests that large structures are formed from hierarchical clustering by the continuous accretion of less massive galaxy systems through filaments. In this context, filamentary structures play an important role in the properties and evolution of galaxies by connecting high-density regions, such as nodes, and being surrounded by low-density regions, such as cosmic voids. The availability of the filament and point critic catalogues extracted by \textsc{DisPerSE} from the \textsc{Illustris} TNG300-1 hydrodynamic simulation allows a detailed analysis of these structures. The halo occupation distribution (HOD) is a powerful tool for linking galaxies and dark matter halos, allowing constrained models of galaxy formation and evolution. In this work we combine the advantage of halo occupancy with information from the filament network to analyse the HOD in filaments and nodes. In our study, we distinguish the inner regions of cosmic filaments and nodes from their surroundings. The results show that the filamentary structures have a similar trend to the total galaxy sample covering a wide range of densities. In the case of the nodes sample, an excess of faint and blue galaxies is found for the low-mass nodes suggesting that these structures are not virialised and that galaxies may be continuously falling through the filaments. Instead, the higher-mass halos could be in a more advanced stage of evolution showing features of virialised structures. △ Less

Submitted 26 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

arXiv:2309.16668 [pdf, other]

doi 10.1145/3658237

RealFill: Reference-Driven Generation for Authentic Image Completion

Authors: Luming Tang, Nataniel Ruiz, Qinghao Chu, Yuanzhen Li, Aleksander Holynski, David E. Jacobs, Bharath Hariharan, Yael Pritch, Neal Wadhwa, Kfir Aberman, Michael Rubinstein

Abstract: Recent advances in generative imagery have brought forth outpainting and inpainting models that can produce high-quality, plausible image content in unknown regions. However, the content these models hallucinate is necessarily inauthentic, since they are unaware of the true scene. In this work, we propose RealFill, a novel generative approach for image completion that fills in missing regions of a… ▽ More Recent advances in generative imagery have brought forth outpainting and inpainting models that can produce high-quality, plausible image content in unknown regions. However, the content these models hallucinate is necessarily inauthentic, since they are unaware of the true scene. In this work, we propose RealFill, a novel generative approach for image completion that fills in missing regions of an image with the content that should have been there. RealFill is a generative inpainting model that is personalized using only a few reference images of a scene. These reference images do not have to be aligned with the target image, and can be taken with drastically varying viewpoints, lighting conditions, camera apertures, or image styles. Once personalized, RealFill is able to complete a target image with visually compelling contents that are faithful to the original scene. We evaluate RealFill on a new image completion benchmark that covers a set of diverse and challenging scenarios, and find that it outperforms existing approaches by a large margin. Project page: https://realfill.github.io △ Less

Submitted 14 May, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

Comments: SIGGRAPH 2024 (Journal Track). Project page: https://realfill.github.io

arXiv:2308.07317 [pdf, other]

Platypus: Quick, Cheap, and Powerful Refinement of LLMs

Authors: Ariel N. Lee, Cole J. Hunter, Nataniel Ruiz

Abstract: We present $\textbf{Platypus}$, a family of fine-tuned and merged Large Language Models (LLMs) that achieves the strongest performance and currently stands at first place in HuggingFace's Open LLM Leaderboard as of the release date of this work. In this work we describe (1) our curated dataset $\textbf{Open-Platypus}$, that is a subset of other open datasets and which… ▽ More We present $\textbf{Platypus}$, a family of fine-tuned and merged Large Language Models (LLMs) that achieves the strongest performance and currently stands at first place in HuggingFace's Open LLM Leaderboard as of the release date of this work. In this work we describe (1) our curated dataset $\textbf{Open-Platypus}$, that is a subset of other open datasets and which $\textit{we release to the public}$ (2) our process of fine-tuning and merging LoRA modules in order to conserve the strong prior of pretrained LLMs, while bringing specific domain knowledge to the surface (3) our efforts in checking for test data leaks and contamination in the training data, which can inform future research. Specifically, the Platypus family achieves strong performance in quantitative LLM metrics across model sizes, top** the global Open LLM leaderboard while using just a fraction of the fine-tuning data and overall compute that are required for other state-of-the-art fine-tuned LLMs. In particular, a 13B Platypus model can be trained on $\textit{a single}$ A100 GPU using 25k questions in 5 hours. This is a testament of the quality of our Open-Platypus dataset, and opens opportunities for more improvements in the field. Project page: https://platypus-llm.github.io △ Less

Submitted 14 March, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

Comments: Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023

arXiv:2307.14420 [pdf, other]

doi 10.1093/mnras/stad2300

Environmental effects on associations of dwarf galaxies

Authors: C. Yamila Yaryura, Mario G. Abadi, Stefan Gottlöber, Noam I. Libeskind, Sofía A. Cora, Andrés N. Ruiz, Cristian A. Vega-Martínez, Gustavo Yepes

Abstract: We study the properties of associations of dwarf galaxies and their dependence on the environment. Associations of dwarf galaxies are extended systems composed exclusively of dwarf galaxies, considering as dwarf galaxies those galaxies less massive than $M_{\star, \rm max} = 10^{9.0}$ ${\rm M}_{\odot}\,h^{-1}$. We identify these particular systems using a semi-analytical model of galaxy formation… ▽ More We study the properties of associations of dwarf galaxies and their dependence on the environment. Associations of dwarf galaxies are extended systems composed exclusively of dwarf galaxies, considering as dwarf galaxies those galaxies less massive than $M_{\star, \rm max} = 10^{9.0}$ ${\rm M}_{\odot}\,h^{-1}$. We identify these particular systems using a semi-analytical model of galaxy formation coupled to a dark matter only simulation in the $Λ$ Cold Dark Matter cosmological model. To classify the environment, we estimate eigenvalues from the tidal field of the dark matter particle distribution of the simulation. We find that the majority, two thirds, of associations are located in filaments ($ \sim 67$ per cent), followed by walls ($ \sim 26 $ per cent), while only a small fraction of them are in knots ($ \sim 6 $ per cent) and voids ($ \sim 1 $ per cent). Associations located in more dense environments present significantly higher velocity dispersion than those located in less dense environments, evidencing that the environment plays a fundamental role in their dynamical properties. However, this connection between velocity dispersion and the environment depends exclusively on whether the systems are gravitational bound or unbound, given that it disappears when we consider associations of dwarf galaxies that are gravitationally bound. Although less than a dozen observationally detected associations of dwarf galaxies are currently known, our results are predictions on the eve of forthcoming large surveys of galaxies, which will enable these very particular systems to be identified and studied. △ Less

Submitted 26 July, 2023; originally announced July 2023.

Comments: 13 pages, 9 figures. Accepted for publication in MNRAS

arXiv:2307.13037 [pdf, other]

doi 10.1093/mnras/stad2267

Backsplash galaxies and their impact on galaxy evolution: a three-stage, four-type perspective

Authors: Andrés N. Ruiz, Héctor J. Martínez, Valeria Coenda, Hernán Muriel, Sofía A. Cora, Martín de los Rios, Cristian A. Vega-Martínez

Abstract: We study the population of backsplash galaxies at $z=0$ in the outskirts of massive, isolated clusters of galaxies taken from the MDPL2-SAG semi-analytic catalogue. We consider four types of backsplash galaxies according to whether they are forming stars or passive at three stagesin their lifetimes: before entering the cluster, during their first incursion through the cluster, and after they exit… ▽ More We study the population of backsplash galaxies at $z=0$ in the outskirts of massive, isolated clusters of galaxies taken from the MDPL2-SAG semi-analytic catalogue. We consider four types of backsplash galaxies according to whether they are forming stars or passive at three stagesin their lifetimes: before entering the cluster, during their first incursion through the cluster, and after they exit the cluster. We analyse several geometric, dynamic, and astrophysical aspects of the four types at the three stages. Galaxies that form stars at all stages account for the majority of the backsplash population ($58\%$) and have stellar masses typically below $M_\star\sim 3\times 10^{10} h^{-1}{\rm M}_\odot$ that avoid the innermost cluster's regions and are only mildly affected by it. In a similar mass range, galaxies that become passive after exiting the cluster ($26\%$) follow orbits characterised by small pericentric distance and a strong deflection by the cluster potential well while suffering a strong loss of both dark matter and gas content. Only a small fraction of our sample ($4\%$) become passive while orbiting inside the cluster. These galaxies have experienced heavy pre-processing and the cluster's tidal strip** and ram pressure provide the final blow to their star formation. Finally, galaxies that are passive before entering the cluster for the first time ($12\%$) are typically massive and are not affected significantly by the cluster. Using the bulge/total mass ratio as a proxy for morphology, we find that a single incursion through a cluster do not result in significant morphological changes in all four types. △ Less

Submitted 24 July, 2023; originally announced July 2023.

Comments: Accepted for publication in MNRAS. Comments are welcome

arXiv:2307.06949 [pdf, other]

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Authors: Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Wei Wei, Tingbo Hou, Yael Pritch, Neal Wadhwa, Michael Rubinstein, Kfir Aberman

Abstract: Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and sto… ▽ More Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and storing a personalized model per subject can be demanding in terms of storage capacity. To overcome these challenges, we propose HyperDreamBooth-a hypernetwork capable of efficiently generating a small set of personalized weights from a single image of a person. By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications. Our method achieves personalization on faces in roughly 20 seconds, 25x faster than DreamBooth and 125x faster than Textual Inversion, using as few as one reference image, with the same quality and style diversity as DreamBooth. Also our method yields a model that is 10000x smaller than a normal DreamBooth model. Project page: https://hyperdreambooth.github.io △ Less

Submitted 13 July, 2023; originally announced July 2023.

Comments: project page: https://hyperdreambooth.github.io

arXiv:2306.17848 [pdf, other]

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

Authors: Ariel N. Lee, Sarah Adel Bargal, Janavi Kasera, Stan Sclaroff, Kate Saenko, Nataniel Ruiz

Abstract: Vision transformers (ViTs) have significantly changed the computer vision landscape and have periodically exhibited superior performance in vision tasks compared to convolutional neural networks (CNNs). Although the jury is still out on which model type is superior, each has unique inductive biases that shape their learning and generalization performance. For example, ViTs have interesting propert… ▽ More Vision transformers (ViTs) have significantly changed the computer vision landscape and have periodically exhibited superior performance in vision tasks compared to convolutional neural networks (CNNs). Although the jury is still out on which model type is superior, each has unique inductive biases that shape their learning and generalization performance. For example, ViTs have interesting properties with respect to early layer non-local feature dependence, as well as self-attention mechanisms which enhance learning flexibility, enabling them to ignore out-of-context image information more effectively. We hypothesize that this power to ignore out-of-context information (which we name $\textit{patch selectivity}$), while integrating in-context information in a non-local manner in early layers, allows ViTs to more easily handle occlusion. In this study, our aim is to see whether we can have CNNs $\textit{simulate}$ this ability of patch selectivity by effectively hardwiring this inductive bias using Patch Mixing data augmentation, which consists of inserting patches from another image onto a training image and interpolating labels between the two image classes. Specifically, we use Patch Mixing to train state-of-the-art ViTs and CNNs, assessing its impact on their ability to ignore out-of-context patches and handle natural occlusions. We find that ViTs do not improve nor degrade when trained using Patch Mixing, but CNNs acquire new capabilities to ignore out-of-context information and improve on occlusion benchmarks, leaving us to conclude that this training method is a way of simulating in CNNs the abilities that ViTs already possess. We will release our Patch Mixing implementation and proposed datasets for public use. Project page: https://arielnlee.github.io/PatchMixing/ △ Less

Submitted 30 June, 2023; originally announced June 2023.

arXiv:2306.00983 [pdf, other]

StyleDrop: Text-to-Image Generation in Any Style

Authors: Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

Abstract: Pre-trained large text-to-image models synthesize impressive images with an appropriate use of text prompts. However, ambiguities inherent in natural language and out-of-distribution effects make it hard to synthesize image styles, that leverage a specific design pattern, texture or material. In this paper, we introduce StyleDrop, a method that enables the synthesis of images that faithfully follo… ▽ More Pre-trained large text-to-image models synthesize impressive images with an appropriate use of text prompts. However, ambiguities inherent in natural language and out-of-distribution effects make it hard to synthesize image styles, that leverage a specific design pattern, texture or material. In this paper, we introduce StyleDrop, a method that enables the synthesis of images that faithfully follow a specific style using a text-to-image model. The proposed method is extremely versatile and captures nuances and details of a user-provided style, such as color schemes, shading, design patterns, and local and global effects. It efficiently learns a new style by fine-tuning very few trainable parameters (less than $1\%$ of total model parameters) and improving the quality via iterative training with either human or automated feedback. Better yet, StyleDrop is able to deliver impressive results even when the user supplies only a single image that specifies the desired style. An extensive study shows that, for the task of style tuning text-to-image models, StyleDrop implemented on Muse convincingly outperforms other methods, including DreamBooth and textual inversion on Imagen or Stable Diffusion. More results are available at our project website: https://styledrop.github.io △ Less

Submitted 1 June, 2023; originally announced June 2023.

Comments: Preprint. Project page at https://styledrop.github.io

arXiv:2304.00186 [pdf, other]

Subject-driven Text-to-Image Generation via Apprenticeship Learning

Authors: Wenhu Chen, Hexiang Hu, Yandong Li, Nataniel Ruiz, Xuhui Jia, Ming-Wei Chang, William W. Cohen

Abstract: Recent text-to-image generation models like DreamBooth have made remarkable progress in generating highly customized images of a target subject, by fine-tuning an ``expert model'' for a given subject from a few examples. However, this process is expensive, since a new expert model must be learned for each subject. In this paper, we present SuTI, a Subject-driven Text-to-Image generator that replac… ▽ More Recent text-to-image generation models like DreamBooth have made remarkable progress in generating highly customized images of a target subject, by fine-tuning an ``expert model'' for a given subject from a few examples. However, this process is expensive, since a new expert model must be learned for each subject. In this paper, we present SuTI, a Subject-driven Text-to-Image generator that replaces subject-specific fine tuning with in-context learning. Given a few demonstrations of a new subject, SuTI can instantly generate novel renditions of the subject in different scenes, without any subject-specific optimization. SuTI is powered by apprenticeship learning, where a single apprentice model is learned from data generated by a massive number of subject-specific expert models. Specifically, we mine millions of image clusters from the Internet, each centered around a specific visual subject. We adopt these clusters to train a massive number of expert models, each specializing in a different subject. The apprentice model SuTI then learns to imitate the behavior of these fine-tuned experts. SuTI can generate high-quality and customized subject-specific images 20x faster than optimization-based SoTA methods. On the challenging DreamBench and DreamBench-v2, our human evaluation shows that SuTI significantly outperforms existing models like InstructPix2Pix, Textual Inversion, Imagic, Prompt2Prompt, Re-Imagen and DreamBooth, especially on the subject and text alignment aspects. △ Less

Submitted 2 October, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

Comments: Accepted at NeurIPS 2023. Model Service to be appear as Google Vertex AI - Instant Tuning (https://cloud.google.com/vertex-ai/docs/generative-ai/image/fine-tune-model). The link to demo video: https://www.youtube.com/watch?v=Q2xQ91D_dhM&t=2071s&ab_channel=GoogleCloud

arXiv:2303.13508 [pdf, other]

DreamBooth3D: Subject-Driven Text-to-3D Generation

Authors: Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan Barron, Yuanzhen Li, Varun Jampani

Abstract: We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with text-to-3D generation (DreamFusion). We find that naively combining these methods fails to yield satisfactory subject-specific 3D assets due to personalized text-to-im… ▽ More We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with text-to-3D generation (DreamFusion). We find that naively combining these methods fails to yield satisfactory subject-specific 3D assets due to personalized text-to-image models overfitting to the input viewpoints of the subject. We overcome this through a 3-stage optimization strategy where we jointly leverage the 3D consistency of neural radiance fields together with the personalization capability of text-to-image models. Our method can produce high-quality, subject-specific 3D assets with text-driven modifications such as novel poses, colors and attributes that are not seen in any of the input images of the subject. △ Less

Submitted 27 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: Project page at https://dreambooth3d.github.io/ Video Summary at https://youtu.be/kKVDrbfvOoA

arXiv:2303.09438 [pdf, other]

Trustera: A Live Conversation Redaction System

Authors: Evandro Gouvêa, Ali Dadgar, Shahab Jalalvand, Rathi Chengalvarayan, Badrinath Jayakumar, Ryan Price, Nicholas Ruiz, Jennifer McGovern, Srinivas Bangalore, Ben Stern

Abstract: Trustera, the first functional system that redacts personally identifiable information (PII) in real-time spoken conversations to remove agents' need to hear sensitive information while preserving the naturalness of live customer-agent conversations. As opposed to post-call redaction, audio masking starts as soon as the customer begins speaking to a PII entity. This significantly reduces the risk… ▽ More Trustera, the first functional system that redacts personally identifiable information (PII) in real-time spoken conversations to remove agents' need to hear sensitive information while preserving the naturalness of live customer-agent conversations. As opposed to post-call redaction, audio masking starts as soon as the customer begins speaking to a PII entity. This significantly reduces the risk of PII being intercepted or stored in insecure data storage. Trustera's architecture consists of a pipeline of automatic speech recognition, natural language understanding, and a live audio redactor module. The system's goal is three-fold: redact entities that are PII, mask the audio that goes to the agent, and at the same time capture the entity, so that the captured PII can be used for a payment transaction or caller identification. Trustera is currently being used by thousands of agents to secure customers' sensitive information. △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: 5

arXiv:2302.01884 [pdf, other]

doi 10.1093/mnras/stad416

Hickson-like compact groups inhabiting different environments

Authors: A. Taverna, J. M. Salerno, I. V. Daza-Perilla, E. Diaz-Gimenez, A. Zandivarez, H. J. Martinez, A. N. Ruiz

Abstract: Although Compact Groups of galaxies (CGs) have been envisioned as isolated extremely dense structures in the Universe, it is accepted today that many of them could be not as isolated as thought. In this work, we study Hickson-like CGs identified in the Sloan Digital Sky Survey Data Release 16 to analyse these systems and their galaxies when embedded in different cosmological structures. To achieve… ▽ More Although Compact Groups of galaxies (CGs) have been envisioned as isolated extremely dense structures in the Universe, it is accepted today that many of them could be not as isolated as thought. In this work, we study Hickson-like CGs identified in the Sloan Digital Sky Survey Data Release 16 to analyse these systems and their galaxies when embedded in different cosmological structures. To achieve this goal, we identify several cosmological structures where CGs can reside: Nodes of filaments, Loose Groups, Filaments and cosmic Voids. Our results indicate that 45 per cent of CGs do not reside in any of these structures, i.e., they can be considered non-embedded or isolated systems. Most of the embedded CGs are found inhabiting Loose Groups and Nodes, while there are almost no CGs residing well inside cosmic Voids. Some physical properties of CGs vary depending on the environment they inhabit. CGs in Nodes show the largest velocity dispersions, the brightest absolute magnitude of the first-ranked galaxy, and the smallest crossing times, while the opposite occurs in Non-Embedded CGs. When comparing galaxies in all the environments and galaxies in CGs, CGs show the highest fractions of red/early-type galaxy members in most of the absolute magnitudes ranges. The variation between galaxies in CGs inhabiting one or another environment is not as significant as the differences caused by belonging or not to a CG. Our results suggest a plausible scenario for galaxy evolution in CGs in which both, large-scale and local environments play essential roles. △ Less

Submitted 3 February, 2023; originally announced February 2023.

Comments: 16 pages, 9 figures, 1 table, accepted for publication in MNRAS

arXiv:2212.10594 [pdf, other]

doi 10.1093/mnras/stad623

Local and large-scale effects on the astrophysics of void-galaxies

Authors: Agustín M. Rodríguez-Medrano, Dante J. Paz, Federico A. Stasyszyn, Facundo Rodríguez, Andrés N. Ruiz, Manuel Merchán

Abstract: Galaxies in cosmic voids have been reported with properties related to a delayed evolution with respect to the Universe in general. These characteristics reflect the interaction of galaxies with the environment. However, it is not clear the degree of influence of the large-scale structure on the properties of void galaxies or, if these are only influenced by the low local density around them typic… ▽ More Galaxies in cosmic voids have been reported with properties related to a delayed evolution with respect to the Universe in general. These characteristics reflect the interaction of galaxies with the environment. However, it is not clear the degree of influence of the large-scale structure on the properties of void galaxies or, if these are only influenced by the low local density around them typical of these regions. In this article we identified cosmic voids in the SDSS-DR16 and studied various properties of galaxies, such as g-r colour, star formation rate, and concentration. To characterise the local environment, we have identified groups of galaxies and studied their properties as a function of their dark matter and stellar masses, analysing separately those found in voids and in the general sample. Our results show that galaxies that inhabit haloes of a given mass (below \sim 10^13.5 M_\dot ), are bluer, have a higher star formation rate and are less concentrated when the host halo is inside voids compared to other regions. For larger halo masses, the trend disappears. We also analyse whether the properties of galaxies are sensitive to the type of voids that inhabit. This is done by separating voids embedded in overdense regions (S-type) from those that asymptotically converge to the average density of the universe (R-type). We found that galaxies in R-type voids are bluer, with higher SFR and less concentration than in S-type voids. Our results indicate some degree of correlation of galaxy properties with the large-scale environment provided by voids, suggesting possible second-order mechanisms in galaxy evolution. △ Less

Submitted 23 February, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

Comments: 10 pages, 9 figures. Acccepted for publication in MNRAS

arXiv:2212.09780 [pdf, other]

doi 10.1093/mnras/stac3746

Reconstructing Orbits of Galaxies in Extreme Regions (ROGER) III: galaxy evolution patterns in projected phase space around massive X-ray clusters

Authors: Hector J. Martinez, Valeria Coenda, Hernan Muriel, Martin de los Rios, Andres N. Ruiz

Abstract: We use the ROGER code by de los Rios et al. to classify galaxies around a sample of X-ray clusters into five classes according to their positions in the projected phase space diagram: cluster galaxies, backsplash galaxies, recent infallers, infalling galaxies, and interlopers. To understand the effects of the cluster environment to the evolution of galaxies, we compare across the five classes: ste… ▽ More We use the ROGER code by de los Rios et al. to classify galaxies around a sample of X-ray clusters into five classes according to their positions in the projected phase space diagram: cluster galaxies, backsplash galaxies, recent infallers, infalling galaxies, and interlopers. To understand the effects of the cluster environment to the evolution of galaxies, we compare across the five classes: stellar mass, specific star formation rate, size, and morphology. Following the guidelines of Coenda et al., a separate analysis is carried out for red and blue galaxies. For red galaxies, cluster galaxies differ from the other classes, having a suppressed specific star formation rate, smaller sizes, and are more likely to be classified as ellipticals. Differences are smaller between the other classes, however backsplash galaxies have significantly lower specific star formation rates than early or recent infalling galaxies. For blue galaxies, we find evidence that recent infallers are smaller than infalling galaxies and interlopers, while the latter two are comparable in size. Our results provide evidence that, after a single passage, the cluster environment can diminish a galaxy's star formation, modify its morphology, and can also reduce in size blue galaxies. We find evidence that quenching occurs faster than morphological transformation from spirals to ellipticals for all classes. While quenching is evidently enhanced as soon as galaxies get into clusters, significant morphological transformations require galaxies to experience the action of the physical mechanisms of the cluster for longer timescales. △ Less

Submitted 19 December, 2022; originally announced December 2022.

Comments: Accepted in MNRAS, 11 pages, 7 figures

arXiv:2212.06849 [pdf, other]

doi 10.1093/mnras/stad1146

Guess the cheese flavour by the size of its holes: A cosmological test using the abundance of Popcorn voids

Authors: Dante J. Paz, Carlos M. Correa, Sebastián R. Gualpa, Andres N. Ruiz, Carlos S. Bederián, R. Dario Graña, Nelson D. Padilla

Abstract: We present a new definition of cosmic void and a publicly available code with the algorithm that implements it. Underdense regions are defined as free-form objects, called popcorn voids, made from the union of spheres of maximum volume with a given joint integrated underdensity contrast.The method is inspired by the excursion-set theory and consequently no rescaling processing is needed, the remov… ▽ More We present a new definition of cosmic void and a publicly available code with the algorithm that implements it. Underdense regions are defined as free-form objects, called popcorn voids, made from the union of spheres of maximum volume with a given joint integrated underdensity contrast.The method is inspired by the excursion-set theory and consequently no rescaling processing is needed, the removal of overlap** voids and objects with sizes below the shot noise threshold is inherent in the algorithm. The abundance of popcorn voids in the matter field can be fitted using the excursion-set theory provided the relationship between the linear density contrast of the barrier and the threshold used in void identification is modified relative to the spherical evolution model. We also analysed the abundance of voids in biased tracer samples in redshift space. We show how the void abundance can be used to measure the geometric distortions due to the assumed fiducial cosmology, in a test similar to an Alcock-Paczyński test. Using the formalism derived from previous works, we show how to correct the abundance of popcorn voids for redshift-space distortion effects. Using this treatment, in combination with the excursion-set theory, we demonstrate the feasibility of void abundance measurements as cosmological probes. We obtain unbiased estimates of the target parameters, albeit with large degeneracies in the parameter space. Therefore, we conclude that the proposed test in combination with other cosmological probes has potential to improve current cosmological parameter constraints. △ Less

Submitted 3 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

Comments: Updated manuscript sent to the MNRAS after referee report: 16 pages, 8 figures. Corrections were made to Fig. 4, some related conclusions were modified. The main conclusions remain unchanged

arXiv:2211.16499 [pdf, other]

Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing

Authors: Nataniel Ruiz, Sarah Adel Bargal, Cihang Xie, Kate Saenko, Stan Sclaroff

Abstract: Modern deep neural networks tend to be evaluated on static test sets. One shortcoming of this is the fact that these deep neural networks cannot be easily evaluated for robustness issues with respect to specific scene variations. For example, it is hard to study the robustness of these networks to variations of object scale, object pose, scene lighting and 3D occlusions. The main reason is that co… ▽ More Modern deep neural networks tend to be evaluated on static test sets. One shortcoming of this is the fact that these deep neural networks cannot be easily evaluated for robustness issues with respect to specific scene variations. For example, it is hard to study the robustness of these networks to variations of object scale, object pose, scene lighting and 3D occlusions. The main reason is that collecting real datasets with fine-grained naturalistic variations of sufficient scale can be extremely time-consuming and expensive. In this work, we present Counterfactual Simulation Testing, a counterfactual framework that allows us to study the robustness of neural networks with respect to some of these naturalistic variations by building realistic synthetic scenes that allow us to ask counterfactual questions to the models, ultimately providing answers to questions such as "Would your classification still be correct if the object were viewed from the top?" or "Would your classification still be correct if the object were partially occluded by another object?". Our method allows for a fair comparison of the robustness of recently released, state-of-the-art Convolutional Neural Networks and Vision Transformers, with respect to these naturalistic variations. We find evidence that ConvNext is more robust to pose and scale variations than Swin, that ConvNext generalizes better to our simulated domain and that Swin handles partial occlusion better than ConvNext. We also find that robustness for all networks improves with network scale and with data scale and variety. We release the Naturalistic Variation Object Dataset (NVD), a large simulated dataset of 272k images of everyday objects with naturalistic variations such as object pose, scale, viewpoint, lighting and occlusions. Project page: https://counterfactualsimulation.github.io △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: Published at the Conference on Neural Information Processing Systems (NeurIPS) 2022

arXiv:2210.09383 [pdf, other]

doi 10.1051/0004-6361/202140704

HALOGAS: Strong Constraints on the Neutral Gas Reservoir and Accretion Rate in Nearby Spiral Galaxies

Authors: P. Kamphuis, E. Jütte, G. H. Heald, N. Herrera Ruiz, G. I. G. Józsa, W. J. G. de Blok, P. Serra, A. Marasco, R. -J. Dettmar, N. M. **el, T. Oosterloo, R. J. Rand, R. A. M. Walterbos, J. M. van der Hulst

Abstract: Galaxies in the local Universe are thought to require ongoing replenishment of their gas reservoir in order to maintain the observed star formation rates. Cosmological simulations predict that such accretion can occur in both a dynamically hot and cold mode. However, until now observational evidence of the accretion required to match the observed star formation histories is lacking. This paper att… ▽ More Galaxies in the local Universe are thought to require ongoing replenishment of their gas reservoir in order to maintain the observed star formation rates. Cosmological simulations predict that such accretion can occur in both a dynamically hot and cold mode. However, until now observational evidence of the accretion required to match the observed star formation histories is lacking. This paper attempts to determine whether galaxies in the local Universe possess a significant reservoir of HI and what would be the accretion rates derived from such reservoirs. We search the vicinity of 22 nearby galaxies for isolated HI clouds or distinct streams in a systematic and automated manner. The HALOGAS observations represent one of the most sensitive and detailed HI surveys to date. These observations typically reach column density sensitivities of 10^19 cm^-2 over a 20 km/s width. We find 14 secure HI cloud candidates without an observed optical counterpart. These cloud candidates appear to be analogues to the most massive clouds detected around the Milky Way and M31. However, on average their numbers seem significantly reduced. We constrain upper limits for HI accretion in the local Universe. The average HI mass currently observed amounts to a rate of 0.05 Msun/yr with a stringent upper limit of 0.22 Msun/yr, confirming previous estimates. This is much lower than the average star formation rate in this sample. Our best estimate, based on GBT detection limits of several galaxies, suggests that another 0.04 Msun/yr could be accreted from undetected clouds and streams. These results show that in nearby galaxies HI is not being accreted at the same rate as stars are currently being formed. Our study can not exclude that other forms of gas accretion are at work. However, these observations also do not reveal extended neutral gas reservoirs around most nearby spiral galaxies. △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: Accepted for publication in Astronomy & Astrophysics section 4. Extragalactic astronomy. Data available at https://www.astron.nl/halogas/data.php

Journal ref: A&A 668, A182 (2022)

arXiv:2210.09300 [pdf, other]

Anisotropic infall in the outskirst of clusters

Authors: Juan Manuel Salerno, Hernán Muriel, Valeria Coenda, Sofía A. Cora, Luis Pereyra, Andrés N. Ruiz, Cristian A. Vega-Martínez

Abstract: We analyse the connection between the star formation quenching of galaxies and their location in theoutskirts of clusters in the redshift range $z=[0,2]$ by estimating the fraction of red galaxies. More specifically, we focus on galaxies that infall isotropically from those that are infalling alongside filaments. We use a sample of galaxies obtained from the semi-analytic model of galaxy formation… ▽ More We analyse the connection between the star formation quenching of galaxies and their location in theoutskirts of clusters in the redshift range $z=[0,2]$ by estimating the fraction of red galaxies. More specifically, we focus on galaxies that infall isotropically from those that are infalling alongside filaments. We use a sample of galaxies obtained from the semi-analytic model of galaxy formation SAG applied to the MultiDark simulation. {\textsc{mdpl2}}. In agreement with observational results, we find that the infall regions show levels of star formation that are intermediate between those of galaxies in clusters and in the field. Moreover, we show that, in the redshift range [0-0.85], the quenching of the star formation is stronger in the filamentary region than in the isotropic infall region. We also study the fraction of red galaxies as a function of the normalised distance to the cluster centre and find that, for radii $R/R_{200}> 3 $, the fraction of red galaxies in the filamentary region is considerably larger than in the isotropic infall region. From the analysis of properties of the main progenitors of galaxies identified at $z = 0$, we find that they have different evolutionary behaviours depending on the stellar mass and environment. Our results confirm the observational findings that suggest that the infall regions of clusters play an important role in the pre-processing of galaxies along most of the evolutionary history of galaxies. △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: 15 pages, 15 figures

arXiv:2210.05667 [pdf, other]

Human Body Measurement Estimation with Adversarial Augmentation

Authors: Nataniel Ruiz, Miriam Bellver, Timo Bolkart, Ambuj Arora, Ming C. Lin, Javier Romero, Raja Bala

Abstract: We present a Body Measurement network (BMnet) for estimating 3D anthropomorphic measurements of the human body shape from silhouette images. Training of BMnet is performed on data from real human subjects, and augmented with a novel adversarial body simulator (ABS) that finds and synthesizes challenging body shapes. ABS is based on the skinned multiperson linear (SMPL) body model, and aims to maxi… ▽ More We present a Body Measurement network (BMnet) for estimating 3D anthropomorphic measurements of the human body shape from silhouette images. Training of BMnet is performed on data from real human subjects, and augmented with a novel adversarial body simulator (ABS) that finds and synthesizes challenging body shapes. ABS is based on the skinned multiperson linear (SMPL) body model, and aims to maximize BMnet measurement prediction error with respect to latent SMPL shape parameters. ABS is fully differentiable with respect to these parameters, and trained end-to-end via backpropagation with BMnet in the loop. Experiments show that ABS effectively discovers adversarial examples, such as bodies with extreme body mass indices (BMI), consistent with the rarity of extreme-BMI bodies in BMnet's training set. Thus ABS is able to reveal gaps in training data and potential failures in predicting under-represented body shapes. Results show that training BMnet with ABS improves measurement prediction accuracy on real bodies by up to 10%, when compared to no augmentation or random body shape sampling. Furthermore, our method significantly outperforms SOTA measurement estimation methods by as much as 3x. Finally, we release BodyM, the first challenging, large-scale dataset of photo silhouettes and body measurements of real human subjects, to further promote research in this area. Project website: https://adversarialbodysim.github.io △ Less

Submitted 11 October, 2022; originally announced October 2022.

Comments: Published at the International Conference on 3D Vision (3DV) 2022

arXiv:2208.12242 [pdf, other]

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Authors: Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman

Abstract: Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts. In this work, we present a new approach for "personalization" of text-to-image… ▽ More Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts. In this work, we present a new approach for "personalization" of text-to-image diffusion models. Given as input just a few images of a subject, we fine-tune a pretrained text-to-image model such that it learns to bind a unique identifier with that specific subject. Once the subject is embedded in the output domain of the model, the unique identifier can be used to synthesize novel photorealistic images of the subject contextualized in different scenes. By leveraging the semantic prior embedded in the model with a new autogenous class-specific prior preservation loss, our technique enables synthesizing the subject in diverse scenes, poses, views and lighting conditions that do not appear in the reference images. We apply our technique to several previously-unassailable tasks, including subject recontextualization, text-guided view synthesis, and artistic rendering, all while preserving the subject's key features. We also provide a new dataset and evaluation protocol for this new task of subject-driven generation. Project page: https://dreambooth.github.io/ △ Less

Submitted 15 March, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

Comments: Published at CVPR 2023. Project page: https://dreambooth.github.io/

arXiv:2207.03940 [pdf]

Bistochastic privacy

Authors: Nicolas Ruiz, Josep Domingo-Ferrer

Abstract: We introduce a new privacy model relying on bistochastic matrices, that is, matrices whose components are nonnegative and sum to 1 both row-wise and column-wise. This class of matrices is used to both define privacy guarantees and a tool to apply protection on a data set. The bistochasticity assumption happens to connect several fields of the privacy literature, including the two most popular mode… ▽ More We introduce a new privacy model relying on bistochastic matrices, that is, matrices whose components are nonnegative and sum to 1 both row-wise and column-wise. This class of matrices is used to both define privacy guarantees and a tool to apply protection on a data set. The bistochasticity assumption happens to connect several fields of the privacy literature, including the two most popular models, k-anonymity and differential privacy. Moreover, it establishes a bridge with information theory, which simplifies the thorny issue of evaluating the utility of a protected data set. Bistochastic privacy also clarifies the trade-off between protection and utility by using bits, which can be viewed as a natural currency to comprehend and operationalize this trade-off, in the same way than bits are used in information theory to capture uncertainty. A discussion on the suitable parameterization of bistochastic matrices to achieve the privacy guarantees of this new model is also provided. △ Less

Submitted 8 July, 2022; originally announced July 2022.

Comments: To be published in Lecture Notes in Artificial Intelligence vol 13408, Modeling Decisions for Artificial Intelligence 19th International Conference MDAI 2022, Sant Cugat, Catalonia, August 30 - 2 September 2022

arXiv:2203.07526 [pdf, other]

doi 10.1051/0004-6361/202243542

How do galaxies populate halos in extreme density environments? An analysis of the Halo Occupation Distribution in SDSS

Authors: Ignacio G. Alfaro, Facundo Rodriguez, Andrés N. Ruiz, Heliana E. Luparello, Diego Garcia Lambas

Abstract: Recent works have shown that the properties of galaxy populations in dark matter halos vary with large-scale environments. These results suggest a variation in the halo occupation distribution (HOD) in extreme density environments. To analyse these effects, we identify cosmic voids and future virialised structures (FVS) in the SDSS-DR12 and estimate the HOD within these superstructures using group… ▽ More Recent works have shown that the properties of galaxy populations in dark matter halos vary with large-scale environments. These results suggest a variation in the halo occupation distribution (HOD) in extreme density environments. To analyse these effects, we identify cosmic voids and future virialised structures (FVS) in the SDSS-DR12 and estimate the HOD within these superstructures using group catalogues as dark matter halo proxies. Our goal is to use observational galaxy data to characterise the HOD within voids and FVS, explore the different properties of these galaxies populations and compare them with the general results outside these superstructures. Using a galaxy group catalogue we compute the HOD within both types of superstructures. We also study the dependence on the results on the main void and FVS properties. We also analysed the mean stellar age of the galaxies inside these regions. In all cases, we compare the results with those derived from the Field sample. Inside voids, we find a strong decrease in HOD concerning the Field results. The mean number of satellites fall to 50%. Inside FVS, the HOD shows a significant increase to the Field, with a 40% excess in the mean number of satellites. In both regions, the differences with respect to the Field increases for the extreme values of the density environments. We obtain no signs of variations related to intrinsic characteristics of voids and FVS. We find that the cumulative distribution of the mean age of stars of the central galaxy also varies in these regions. Finally, we explore the HOD for the 25% youngest (oldest) galaxies. We find that for the low-mass groups the youngest galaxies are only present inside voids. On the other hand, for the high-mass groups the FVS environments show the same increase in the HOD concerning the Field. We find that cosmic voids lack of oldest galaxies. △ Less

Submitted 14 March, 2022; originally announced March 2022.

Comments: 9 pages, 9 figures. First version. Sent to refereed in A&A

Journal ref: A&A 665, A44 (2022)

arXiv:2112.01552 [pdf, other]

doi 10.1093/mnras/stab3551

Reconstructing Orbits of Galaxies in Extreme Regions (ROGER) II: reliability of projected phase-space in our understanding of galaxy populations

Authors: Valeria Coenda, Martín de los Rios, Hernán Muriel, Sofía A. Cora, Héctor J. Martínez, Andrés N. Ruiz, Cristian A. Vega-Martínez

Abstract: We connect galaxy properties with their orbital classification by analysing a sample of galaxies with stellar mass $M_{\star} \geq 10^{8.5}h^{-1}M_\odot$ residing in and around massive and isolated galaxy clusters with mass $M_{200} > 10^{15}h^{-1}M_\odot$ at redshift $z=0$. The galaxy population is generated by applying the semi-analytic model of galaxy formation SAG on the cosmological simulatio… ▽ More We connect galaxy properties with their orbital classification by analysing a sample of galaxies with stellar mass $M_{\star} \geq 10^{8.5}h^{-1}M_\odot$ residing in and around massive and isolated galaxy clusters with mass $M_{200} > 10^{15}h^{-1}M_\odot$ at redshift $z=0$. The galaxy population is generated by applying the semi-analytic model of galaxy formation SAG on the cosmological simulation MultiDark Planck 2. We classify galaxies considering their real orbits (3D) and their projected phase-space position using the ROGER code (2D). We define five categories: cluster galaxies, galaxies that have recently fallen into a cluster, backsplash galaxies, infalling galaxies, and interloper galaxies. For each class, we analyse the $g-r$ colour, the specific star formation rate (sSFR), and the stellar age, as a function of the stellar mass. For the 3D classes, we find that cluster galaxies have the lowest sSFR, and are the reddest and the oldest, as expected from environmental effects. Backsplash galaxies have properties intermediate between the cluster and recent infaller galaxies. For each 2D class, we find an important contamination by other classes. We find it necessary to separate the galaxy populations in red and blue to perform a more realistic analysis of the 2D data. For the red population, the 2D results are in good agreement with the 3D predictions. Nevertheless, when the blue population is considered, the 2D analysis only provides reliable results for recent infallers, infalling galaxies and interloper galaxies. △ Less

Submitted 2 December, 2021; originally announced December 2021.

Comments: 5 figures, 12 pages, accepted for publication in Monthly Notices of the Royal Astronomical Society Main Journal

arXiv:2110.09536 [pdf, other]

doi 10.1093/mnras/stac1020

On the environmental influence of groups and clusters of galaxies beyond the virial radius: Galactic conformity at few Mpc scales

Authors: Ivan Lacerna, Facundo Rodriguez, Antonio D. Montero-Dorta, Ana L. O'Mill, Sofía A. Cora, M. Celeste Artale, Andrés N. Ruiz, Tomás Hough, Cristian A. Vega-Martínez

Abstract: The environment within dark matter haloes can quench the star formation of galaxies. However, environmental effects beyond the virial radius of haloes ($\gtrsim$ 1 Mpc) are less evident. An example is the debated correlation between colour or star formation in central galaxies and neighbour galaxies in adjacent haloes at large separations of several Mpc, referred to as two-halo galactic conformity… ▽ More The environment within dark matter haloes can quench the star formation of galaxies. However, environmental effects beyond the virial radius of haloes ($\gtrsim$ 1 Mpc) are less evident. An example is the debated correlation between colour or star formation in central galaxies and neighbour galaxies in adjacent haloes at large separations of several Mpc, referred to as two-halo galactic conformity. We use two galaxy catalogues generated from different versions of the semi-analytic model SAG applied to the MDPL2 cosmological simulation and the IllustrisTNG300 cosmological hydrodynamical simulation to study the two-halo conformity by measuring the quenched fraction of neighbouring galaxies as a function of the real-space distance from central galaxies. We find that low-mass central galaxies in the vicinity of massive systems ($M_{\rm 200c}$ $\geq$ 10$^{13}$ $h^{-1}~\rm M_{\odot}$) out to 5 $h^{-1}$ Mpc are preferentially quenched compared to other central galaxies at fixed stellar mass $M_{\star}$ or fixed host halo mass $M_{\rm 200c}$ at $z$ ~ 0. In all the galaxies catalogues is consistent that the low-mass ($M_{\star} < 10^{10}$ $h^{-1}~\rm M_{\odot}$ or $M_{\rm 200c} < 10^{11.8}$ $h^{-1}~\rm M_{\odot}$) central galaxies in the vicinity of clusters and, especially, groups of galaxies mostly produce the two-halo galactic conformity. On average, the quenched low-mass central galaxies are much closer to massive haloes than star-forming central galaxies of the same mass (by a factor of ~5). Our results agree with other works regarding the environmental influence of massive haloes that can extend beyond the virial radius and affect nearby low-mass central galaxies. △ Less

Submitted 11 April, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

Comments: 15 pages and 10 figures without appendix. Accepted for publication in MNRAS

arXiv:2109.13378 [pdf, other]

doi 10.1093/mnras/stac127

Imprints of the cosmic void evolution on the baryon processes inside galaxy haloes

Authors: Agustín M. Rodríguez Medrano, Dante J. Paz, Federico A. Stasyszyn, Andrés N. Ruiz

Abstract: Cosmic voids provide a unique environment to study galaxy formation and evolution. In this paper, we analyse a set of hydrodynamic zoom-in simulations of voids, to analyse in detail their inner structures. These voids were identified in a cosmological simulation and classified according to their surrounding dynamics at very large scales: whether they are in expansion or contraction at their outski… ▽ More Cosmic voids provide a unique environment to study galaxy formation and evolution. In this paper, we analyse a set of hydrodynamic zoom-in simulations of voids, to analyse in detail their inner structures. These voids were identified in a cosmological simulation and classified according to their surrounding dynamics at very large scales: whether they are in expansion or contraction at their outskirts. We study how these environments and the dynamics of voids impact the baryonic processes inside haloes and their mechanisms of formation and evolution. We find an under-abundance of processed gas within the voids that can be associated with the lack of massive haloes. By studying the dynamical phase-space diagram of haloes and the halo-particle correlation function, we find that haloes inside of contracting voids are slightly affected by the presence of bigger structures, in comparison to haloes in the inner regions of expanding voids. Consistent signals are obtained both when using dark matter and gas particles. We show that the halo assembly depends on the void dynamical state: haloes in expanding voids assemble slowly than those in contracting voids and in the general universe. This difference in the assembly impacts the baryonic evolution of haloes. Overall the redshift range analysed, haloes in voids have less baryon content than haloes in the general universe and particularly at z = 0 less stellar content. Our results suggest that the large scale void environment modulate the baryonic process occurring inside haloes according to the void dynamical state. △ Less

Submitted 14 January, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

Comments: 14 pages, 11 figures, accepted in MNRAS

arXiv:2108.12710 [pdf, other]

doi 10.1093/mnras/stac1656

Evolution map**: a new approach to describe matter clustering in the non-linear regime

Authors: Ariel G. Sanchez, Andrés N. Ruiz, Jenny Gonzalez Jara, Nelson D. Padilla

Abstract: We present a new approach to describe statistics of the non-linear matter density field that exploits a degeneracy in the impact of different cosmological parameters on the linear dimensionless matter power spectrum, $Δ^2_{\rm L}(k)$. We classify all cosmological parameters into two groups, shape parameters, which determine the shape of $Δ^2_{\rm L}(k)$, and evolution parameters, which only affect… ▽ More We present a new approach to describe statistics of the non-linear matter density field that exploits a degeneracy in the impact of different cosmological parameters on the linear dimensionless matter power spectrum, $Δ^2_{\rm L}(k)$. We classify all cosmological parameters into two groups, shape parameters, which determine the shape of $Δ^2_{\rm L}(k)$, and evolution parameters, which only affect its amplitude at any given redshift. With this definition, the time evolution of $Δ^2_{\rm L}(k)$ in models with identical shape parameters but different evolution parameters can be mapped from one to the other by relabelling the redshifts that correspond to the same clustering amplitude, which we characterize by the linear mass fluctuation in spheres of radius $12\,{\rm Mpc}$, $σ_{12}(z)$. We use N-body simulations to show that the same evolution map** relation gives a good description of the non-linear power spectrum, the halo mass function, or the full density field. The deviations from the exact degeneracy are the result of the different structure formation histories experienced by each model to reach the same clustering amplitude and can be accurately described in terms of differences in the suppression factor $g(a) = D(a)/a$. These relations can be used to drastically reduce the number of parameters required to describe the cosmology dependence of the power spectrum. We show how this can help to speed up the inference of parameter constraints from cosmological observations. We also present a new design of an emulator of the non-linear power spectrum whose predictions can be adapted to an arbitrary choice of evolution parameters and redshift. △ Less

Submitted 15 June, 2022; v1 submitted 28 August, 2021; originally announced August 2021.

Comments: 13 pages, 11 figures, replaced to match version accepted by MNRAS

arXiv:2107.09126 [pdf, other]

Examining the Human Perceptibility of Black-Box Adversarial Attacks on Face Recognition

Authors: Benjamin Spetter-Goldstein, Nataniel Ruiz, Sarah Adel Bargal

Abstract: The modern open internet contains billions of public images of human faces across the web, especially on social media websites used by half the world's population. In this context, Face Recognition (FR) systems have the potential to match faces to specific names and identities, creating glaring privacy concerns. Adversarial attacks are a promising way to grant users privacy from FR systems by disr… ▽ More The modern open internet contains billions of public images of human faces across the web, especially on social media websites used by half the world's population. In this context, Face Recognition (FR) systems have the potential to match faces to specific names and identities, creating glaring privacy concerns. Adversarial attacks are a promising way to grant users privacy from FR systems by disrupting their capability to recognize faces. Yet, such attacks can be perceptible to human observers, especially under the more challenging black-box threat model. In the literature, the justification for the imperceptibility of such attacks hinges on bounding metrics such as $\ell_p$ norms. However, there is not much research on how these norms match up with human perception. Through examining and measuring both the effectiveness of recent black-box attacks in the face recognition setting and their corresponding human perceptibility through survey data, we demonstrate the trade-offs in perceptibility that occur as attacks become more aggressive. We also show how the $\ell_2$ norm and other metrics do not correlate with human perceptibility in a linear fashion, thus making these norms suboptimal at measuring adversarial attack perceptibility. △ Less

Submitted 19 July, 2021; originally announced July 2021.

Comments: 5 pages, 5 figures, submitted to AdvML @ ICML 2021

arXiv:2107.02492 [pdf, other]

doi 10.1051/0004-6361/202040009

Faint polarised sources in the Lockman Hole field at 1.4 GHz

Authors: A. Berger, B. Adebahr, N. Herrera Ruiz, A. H. Wright, I. Prandoni, R. -J. Dettmar

Abstract: We aim to study the nature of the faint, polarised radio source population whose source composition and redshift dependence contain information about the strength, morphology, and evolution of magnetic fields over cosmic timescales. We use a 15 pointing radio continuum L-band mosaic of the Lockman Hole, observed in full polarisation, generated from archival data of the WSRT. The data were analysed… ▽ More We aim to study the nature of the faint, polarised radio source population whose source composition and redshift dependence contain information about the strength, morphology, and evolution of magnetic fields over cosmic timescales. We use a 15 pointing radio continuum L-band mosaic of the Lockman Hole, observed in full polarisation, generated from archival data of the WSRT. The data were analysed using the RM-Synthesis technique. We achieved a noise of 7 μJy/beam in polarised intensity, with a resolution of 15''. Using infrared and optical images and source catalogues, we were able to cross-identify and determine redshifts for one third of our detected polarised sources. We detected 150 polarised sources, most of which are weakly polarised with a mean fractional polarisation of 5.4 %. With a total area of 6.5 deg^2 and a detection threshold of 6.25 σ we find 23 polarised sources per deg^2. Based on our multi wavelength analysis, we find that our sample consists of AGN only. We find a discrepancy between archival number counts and those present in our data, which we attribute to sample variance. Considering the absolute radio luminosty, to distinguish weak and strong sources, we find a general trend of increased probability to detect weak sources at low redshift and strong sources at high redshift. Further, we find an anti-correlation between fractional polarisation and redshift for our strong sources sample at z{\geq}0.6. A decrease in the fractional polarisation of strong sources with increasing redshift cannot be explained by a constant magnetic field and electron density over cosmic scales, however the changing properties of cluster environments over the cosmic timemay play an important role. Disentangling these two effects requires deeper and wider polarisation observations, and better models of the morphology and strength of cosmic magnetic fields. △ Less

Submitted 6 July, 2021; originally announced July 2021.

Comments: 17 pages, 16 figures, to be published in A&A

Journal ref: A&A 653, A155 (2021)

arXiv:2107.01314 [pdf, other]

doi 10.1093/mnras/stab3070

Redshift-space effects in voids and their impact on cosmological tests. Part II: the void-galaxy cross-correlation function

Authors: Carlos M. Correa, Dante J. Paz, Nelson D. Padilla, Ariel G. Sánchez, Andrés N. Ruiz, Raúl E. Angulo

Abstract: This is the second part of a thorough investigation of the redshift-space effects that affect void properties and the impact they have on cosmological tests. Here, we focus on the void-galaxy cross-correlation function, specifically, on the projected versions that we developed in a previous work. The pillar of the analysis is the one-to-one relationship between real and redshift-space voids above… ▽ More This is the second part of a thorough investigation of the redshift-space effects that affect void properties and the impact they have on cosmological tests. Here, we focus on the void-galaxy cross-correlation function, specifically, on the projected versions that we developed in a previous work. The pillar of the analysis is the one-to-one relationship between real and redshift-space voids above the shot-noise level identified with a spherical void finder. Under this map**, void properties are affected by three effects: (i) a systematic expansion as a consequence of the distortions induced by galaxy dynamics, (ii) the Alcock-Paczynski volume effect, which manifests as an overall expansion or contraction depending on the fiducial cosmology, and (iii) a systematic off-centring along the line of sight as a consequence of the distortions induced by void dynamics. We found that correlations are also affected by an additional source of distortions: the ellipticity of voids. This is the first time that distortions due to the off-centring and ellipticity effects are detected and quantified. With a simplified test, we verified that the Gaussian streaming model is still robust provided all these effects are taken into account, laying the foundations for improvements in current models in order to obtain unbiased cosmological constraints from spectroscopic surveys. Besides this practical importance, this analysis also encodes key information about the structure and dynamics of the Universe at the largest scales. Furthermore, some of the effects constitute cosmological probes by themselves, as is the case of the void ellipticity. △ Less

Submitted 19 January, 2022; v1 submitted 2 July, 2021; originally announced July 2021.

Comments: 14 pages, 9 figures, published in MNRAS, accepted after minor comments

Report number: MNRAS, Volume 509, Issue 2, pp.1871-1884, January 2022

arXiv:2106.08989 [pdf, other]

doi 10.1051/0004-6361/202039838

Galaxy populations in haloes in high-density environments

Authors: Ignacio G. Alfaro, Andres N. Ruiz, Heliana E. Luparello, Facundo Rodriguez, Diego Garcia Lambas

Abstract: There are hints suggesting that properties of galaxy populations in dark matter haloes may depend on their large-scale environment. Recent works point out that very low-density environments influence halo occupation distribution (HOD), however there is not a similar analysis focused on high-density environments. Here we use a simulated set of future virialized superstructures (FVS) to analyse the… ▽ More There are hints suggesting that properties of galaxy populations in dark matter haloes may depend on their large-scale environment. Recent works point out that very low-density environments influence halo occupation distribution (HOD), however there is not a similar analysis focused on high-density environments. Here we use a simulated set of future virialized superstructures (FVS) to analyse the occupation of galaxies in haloes within these high globally dense regions. We use a publicly available simulated galaxy set constructed with a semi-analytical model to identify FVS in the simulation. Then, we computed the HOD within these superstructures for different absolute magnitude thresholds and make several analysis including the comparison to the global HOD results. We study the dependence on the results on properties of the FVS such as density and volume as well as consider the morphology of galaxies. We also analysed the properties of the stellar content of galaxies and the formation time of the haloes inside FVS. We find a significant increase in the HOD inside FVS. This result is present for all absolute magnitude thresholds explored. The effect is larger in the densest regions of FVS, but does not depend on the volume of the superstructure. We also find that the stellar-mass content of galaxies considerably differs inside the superstructures. Low mass haloes have their central and satellite galaxies with a higher stellar mass content (50%), and exhibit mean star ages (20%) older than average. For massive haloes in FVS we find that only the stellar mass of satellite galaxies varies considerably corresponding to a decrease of 50%. We find a significant statistical difference between the formation times of haloes in FVS and the average population. Haloes residing in superstructures formed earlier, a fact that leads to several changes in the HOD and their member galaxy properties. △ Less

Submitted 16 March, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

Comments: 12 pages, 15 figure. Published version (by the A&A)

Journal ref: A&A 654, A62 (2021)

arXiv:2106.06811 [pdf, other]

Case Study on Detecting COVID-19 Health-Related Misinformation in Social Media

Authors: Mir Mehedi A. Pritom, Rosana Montanez Rodriguez, Asad Ali Khan, Sebastian A. Nugroho, Esra'a Alrashydah, Beatrice N. Ruiz, Anthony Rios

Abstract: COVID-19 pandemic has generated what public health officials called an infodemic of misinformation. As social distancing and stay-at-home orders came into effect, many turned to social media for socializing. This increase in social media usage has made it a prime vehicle for the spreading of misinformation. This paper presents a mechanism to detect COVID-19 health-related misinformation in social… ▽ More COVID-19 pandemic has generated what public health officials called an infodemic of misinformation. As social distancing and stay-at-home orders came into effect, many turned to social media for socializing. This increase in social media usage has made it a prime vehicle for the spreading of misinformation. This paper presents a mechanism to detect COVID-19 health-related misinformation in social media following an interdisciplinary approach. Leveraging social psychology as a foundation and existing misinformation frameworks, we defined misinformation themes and associated keywords incorporated into the misinformation detection mechanism using applied machine learning techniques. Next, using the Twitter dataset, we explored the performance of the proposed methodology using multiple state-of-the-art machine learning classifiers. Our method shows promising results with at most 78% accuracy in classifying health-related misinformation versus true information using uni-gram-based NLP feature generations from tweets and the Decision Tree classifier. We also provide suggestions on alternatives for countering misinformation and ethical consideration for the study. △ Less

Submitted 12 June, 2021; originally announced June 2021.

Comments: 10 pages

arXiv:2106.04569 [pdf, other]

Simulated Adversarial Testing of Face Recognition Models

Authors: Nataniel Ruiz, Adam Kortylewski, Weichao Qiu, Cihang Xie, Sarah Adel Bargal, Alan Yuille, Stan Sclaroff

Abstract: Most machine learning models are validated and tested on fixed datasets. This can give an incomplete picture of the capabilities and weaknesses of the model. Such weaknesses can be revealed at test time in the real world. The risks involved in such failures can be loss of profits, loss of time or even loss of life in certain critical applications. In order to alleviate this issue, simulators can b… ▽ More Most machine learning models are validated and tested on fixed datasets. This can give an incomplete picture of the capabilities and weaknesses of the model. Such weaknesses can be revealed at test time in the real world. The risks involved in such failures can be loss of profits, loss of time or even loss of life in certain critical applications. In order to alleviate this issue, simulators can be controlled in a fine-grained manner using interpretable parameters to explore the semantic image manifold. In this work, we propose a framework for learning how to test machine learning algorithms using simulators in an adversarial manner in order to find weaknesses in the model before deploying it in critical scenarios. We apply this method in a face recognition setup. We show that certain weaknesses of models trained on real data can be discovered using simulated samples. Using our proposed method, we can find adversarial synthetic faces that fool contemporary face recognition models. This demonstrates the fact that these models have weaknesses that are not measured by commonly used validation datasets. We hypothesize that this type of adversarial examples are not isolated, but usually lie in connected spaces in the latent space of the simulator. We present a method to find these adversarial regions as opposed to the typical adversarial points found in the adversarial example literature. △ Less

Submitted 31 May, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

Comments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

arXiv:2103.01865 [pdf, other]

doi 10.1051/0004-6361/201937432

Overdensity of VVV galaxies behind the Galactic bulge

Authors: Daniela Galdeano, Luis Pereyra, Fernanda Duplancic, Georgina Coldwell, Sol Alonso, Andrés N. Ruiz, Sofía A. Cora, Noelia Perez, Cristian Vega-Martínez, Dante Minniti

Abstract: We studied a region of 1.636 square degrees corresponding to the VVV tile $b204$. Using SExtractor, we analysed photometric data generating a catalogue of extended sources in this area. In order to confirm these sources as galaxy candidates we visually inspected RGB images looking for typical galaxy features. Using 2MASX and GCMW catalogued sources we tested completeness and contamination of our c… ▽ More We studied a region of 1.636 square degrees corresponding to the VVV tile $b204$. Using SExtractor, we analysed photometric data generating a catalogue of extended sources in this area. In order to confirm these sources as galaxy candidates we visually inspected RGB images looking for typical galaxy features. Using 2MASX and GCMW catalogued sources we tested completeness and contamination of our catalogue and define suitable colour cuts to select galaxies. We also compared the observational results with those obtained from two semi-analytical models on Dark Matter simulations. One galaxy catalogue was constructed with the SAG semi-analytic model of galaxy formation, and the other one was constructed with the L-Galaxies semi-analytic model.By adopting CLASS-STAR$< 0.5$, $r_{1/2} > 0.7$ arcsec and specific colour cuts (J-Ks$>$0.97, J-H$>$0 and H-Ks$>$0) we generated an automatic catalogue of extended sources. After visual inspection we identified 624 sources with 10$<$Ks$<$17 as galaxy candidates. The contamination of the automatic catalogue is 28% when considering visually confirmed galaxies as reliable objects. The estimated completeness is 87% up to magnitude Ks=13.5. We analysed the spatial distribution of galaxy candidates, finding a high concentration of galaxies in a small region of 15 arcmin radius. This region has three times higher density than similar areas in the tile. We compared the number of galaxies in this small area with the mean density values obtained from a suitable sample of galaxies from semi-analytic models finding that our results are consistent with an overdensity region. Using VVV near-infrared data and mock catalogues we detect new extragalactic sources that have not been identified by other catalogues. We demonstrate the potentiality of the VVV survey studying a large number of galaxy candidates and extragalactic structures obscured by the Milky Way. △ Less

Submitted 2 March, 2021; originally announced March 2021.

arXiv:2012.05225 [pdf, other]

MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias

Authors: Nataniel Ruiz, Barry-John Theobald, Anurag Ranjan, Ahmed Hussein Abdelaziz, Nicholas Apostoloff

Abstract: To detect bias in face recognition networks, it can be useful to probe a network under test using samples in which only specific attributes vary in some controlled way. However, capturing a sufficiently large dataset with specific control over the attributes of interest is difficult. In this work, we describe a simulator that applies specific head pose and facial expression adjustments to images o… ▽ More To detect bias in face recognition networks, it can be useful to probe a network under test using samples in which only specific attributes vary in some controlled way. However, capturing a sufficiently large dataset with specific control over the attributes of interest is difficult. In this work, we describe a simulator that applies specific head pose and facial expression adjustments to images of previously unseen people. The simulator first fits a 3D morphable model to a provided image, applies the desired head pose and facial expression controls, then renders the model into an image. Next, a conditional Generative Adversarial Network (GAN) conditioned on the original image and the rendered morphable model is used to produce the image of the original person with the new facial expression and head pose. We call this conditional GAN -- MorphGAN. Images generated using MorphGAN conserve the identity of the person in the original image, and the provided control over head pose and facial expression allows test sets to be created to identify robustness issues of a facial recognition deep network with respect to pose and expression. Images generated by MorphGAN can also serve as data augmentation when training data are scarce. We show that by augmenting small datasets of faces with new poses and expressions improves the recognition performance by up to 9% depending on the augmentation and data scarcity. △ Less

Submitted 10 December, 2020; v1 submitted 9 December, 2020; originally announced December 2020.

arXiv:2011.08292 [pdf, other]

doi 10.1051/0004-6361/202038896

LOFAR Deep Fields: Probing a broader population of polarized radio galaxies in ELAIS-N1

Authors: N. Herrera Ruiz, S. P. O'Sullivan, V. Vacca, V. Jelić, B. Nikiel-Wroczyński, S. Bourke, J. Sabater, R. -J. Dettmar, G. Heald, C. Horellou, S. Piras, C. Sobey, T. W. Shimwell, C. Tasse, M. J. Hardcastle, R. Kondapally, K. T. Chyży, M. Iacobelli, P. N. Best, M. Brüggen, E. Carretti, I. Prandoni

Abstract: We present deep polarimetric observations of the European Large Area ISO Survey-North 1 (ELAIS-N1) field using the Low Frequency Array (LOFAR) at 114.9-177.4 MHz. The ELAIS-N1 field is part of the LOFAR Two-metre Sky Survey deep fields data release I. For six eight-hour observing epochs, we align the polarization angles and stack the 20"-resolution Stokes $Q$, $U$-parameter data cubes. This produc… ▽ More We present deep polarimetric observations of the European Large Area ISO Survey-North 1 (ELAIS-N1) field using the Low Frequency Array (LOFAR) at 114.9-177.4 MHz. The ELAIS-N1 field is part of the LOFAR Two-metre Sky Survey deep fields data release I. For six eight-hour observing epochs, we align the polarization angles and stack the 20"-resolution Stokes $Q$, $U$-parameter data cubes. This produces a 16 deg$^2$ image with 1$σ_{\rm QU}$ sensitivity of 26 $μ$Jy/beam in the central area. In this paper, we demonstrate the feasibility of the stacking technique, and we generate a catalog of polarized sources in ELAIS-N1 and their associated Faraday rotation measures (RMs). While in a single-epoch observation we detect three polarized sources, this number increases by a factor of about three when we consider the stacked data, with a total of ten sources. This yields a surface density of polarized sources of one per 1.6 deg$^2$. The Stokes $I$ images of three of the ten detected polarized sources have morphologies resembling those of FR I radio galaxies. This represents a greater fraction of this type of source than previously found, which suggests that more sensitive observations may help with their detection. △ Less

Submitted 16 November, 2020; originally announced November 2020.

Comments: This paper is part of the 1st data release of the LoTSS Deep Fields. Accepted for publication in A&A. 14 pages, 9 figures, 7 tables

Journal ref: A&A 648, A12 (2021)

arXiv:2011.05747 [pdf, other]

doi 10.1093/mnras/stab1622

Improved two-point correlation function estimates using glass-like distributions as a reference sample

Authors: Federico Dávila-Kurbán, Ariel G. Sanchez, Marcelo Lares, Andrés N. Ruiz

Abstract: All estimators of the two-point correlation function are based on a random catalogue, a set of points with no intrinsic clustering following the selection function of a survey. High-accuracy estimates require the use of large random catalogues, which imply a high computational cost. We propose to replace the standard random catalogues by glass-like point distributions or glass catalogues, which ar… ▽ More All estimators of the two-point correlation function are based on a random catalogue, a set of points with no intrinsic clustering following the selection function of a survey. High-accuracy estimates require the use of large random catalogues, which imply a high computational cost. We propose to replace the standard random catalogues by glass-like point distributions or glass catalogues, which are characterized by a power spectrum $P(k)\propto k^4$ and exhibit significantly less power than a Poisson distribution with the same number of points on scales larger than the mean inter-particle separation. We show that these distributions can be obtained by iteratively applying the technique of Zeldovich reconstruction commonly used in studies of baryon acoustic oscillations (BAO). We provide a modified version of the widely used Landy-Szalay estimator of the correlation function adapted to the use of glass catalogues and compare its performance with the results obtained using random samples. Our results show that glass-like samples do not add any bias with respect to the results obtained using Poisson distributions. On scales larger than the mean inter-particle separation of the glass catalogues, the modified estimator leads to a significant reduction of the variance of the Legendre multipoles $ξ_\ell(s)$ with respect to the standard Landy-Szalay results with the same number of points. The size of the glass catalogue required to achieve a given accuracy in the correlation function is significantly smaller than when using random samples. Even considering the small additional cost of constructing the glass catalogues, their use could help to drastically reduce the computational cost of configuration-space clustering analysis of future surveys while maintaining high-accuracy requirements. △ Less

Submitted 8 June, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: 9 pages, 7 figures, minor changes to match version accepted by MNRAS

arXiv:2010.11959 [pdf, other]

doi 10.1093/mnras/staa3339

ROGER: Reconstructing Orbits of Galaxies in Extreme Regions using machine learning techniques

Authors: Martín de los Rios, Héctor Julián Martínez, Valeria Coenda, Hernán Muriel, Andrés Nicolás Ruiz, Cristian Antonio Vega-Martínez, Sofia Alejandra Cora

Abstract: We present the ROGER (Reconstructing Orbits of Galaxies in Extreme Regions) code, which uses three different machine learning techniques to classify galaxies in, and around, clusters, according to their projected phase-space position. We use a sample of 34 massive, $M_{200}>10^{15} h^{-1} M_{\odot}$, galaxy clusters in the MultiDark Planck 2 (MDLP2) simulation at redshift zero. We select all galax… ▽ More We present the ROGER (Reconstructing Orbits of Galaxies in Extreme Regions) code, which uses three different machine learning techniques to classify galaxies in, and around, clusters, according to their projected phase-space position. We use a sample of 34 massive, $M_{200}>10^{15} h^{-1} M_{\odot}$, galaxy clusters in the MultiDark Planck 2 (MDLP2) simulation at redshift zero. We select all galaxies with stellar mass $M_{\star} \ge 10^{8.5} h^{-1}M_{\odot}$, as computed by the semi-analytic model of galaxy formation SAG, that are located in, and in the vicinity of, the clusters and classify them according to their orbits. We train ROGER to retrieve the original classification of the galaxies out of their projected phase-space positions. For each galaxy, ROGER gives as output the probability of being a cluster galaxy, a galaxy that has recently fallen into a cluster, a backsplash galaxy, an infalling galaxy, or an interloper. We discuss the performance of the machine learning methods and potential uses of our code. Among the different methods explored, we find the K-Nearest Neighbours algorithm achieves the best performance. △ Less

Submitted 22 October, 2020; originally announced October 2020.

Comments: Acceptep for its publication in the MNRAS Journal. Code available at github repository

arXiv:2010.05922 [pdf, other]

doi 10.1093/mnras/staa3197

Associations of dwarf galaxies in a $Λ$CDM Universe

Authors: C. Y. Yaryura, M. G. Abadi, S. Gottlober, N. I. Libeskind, S. A. Cora, A. N. Ruiz, C. A. Vega-Martínez, Gustavo Yepes, Peter Behroozi

Abstract: Associations of dwarf galaxies are loose systems composed exclusively of dwarf galaxies. These systems were identified in the Local Volume for the first time more than thirty years ago. We study these systems in the cosmological framework of the $Λ$ Cold Dark Matter ($Λ$CDM) model. We consider the Small MultiDark Planck simulation and populate its dark matter haloes by applying the semi-analytic m… ▽ More Associations of dwarf galaxies are loose systems composed exclusively of dwarf galaxies. These systems were identified in the Local Volume for the first time more than thirty years ago. We study these systems in the cosmological framework of the $Λ$ Cold Dark Matter ($Λ$CDM) model. We consider the Small MultiDark Planck simulation and populate its dark matter haloes by applying the semi-analytic model of galaxy formation SAG. We identify galaxy systems using a friends of friends algorithm with a linking length equal to $b=0.4 \,{\rm Mpc}\,h^{-1}$, to reproduce the size of dwarf galaxy associations detected in the Local Volume. Our samples of dwarf systems are built up removing those systems that have one (or more) galaxies with stellar mass larger than a maximum threshold $M_{\rm max}$. We analyse three different samples defined by ${\rm log}_{10}(M_{\rm max}[{\rm M}_{\odot}\,h^{-1}]) = 8.5, 9.0$ and $9.5$. On average, our systems have typical sizes of $\sim 0.2\,{\rm Mpc}\,h^{-1}$, velocity dispersion of $\sim 30 {\rm km\,s^{-1}} $ and estimated total mass of $\sim 10^{11} {\rm M}_{\odot}\,h^{-1}$. Such large typical sizes suggest that individual members of a given dwarf association reside in different dark matter haloes and are generally not substructures of any other halo. Indeed, in more than 90 per cent of our dwarf systems their individual members inhabit different dark matter haloes, while only in the remaining 10 per cent members do reside in the same halo. Our results indicate that the $Λ$CDM model can naturally reproduce the existence and properties of dwarf galaxies associations without much difficulty. △ Less

Submitted 12 October, 2020; originally announced October 2020.

Comments: 9 pages, 7 figures. Accepted for publication in MNRAS

arXiv:2008.02898 [pdf, other]

doi 10.1126/sciadv.abe3902

Stereo Darkfield Interferometry : a versatile localization method for subnanometer force spectroscopy of single molecules and 3D-tracking of single cells

Authors: Martin Rieu, Thibault Vieille, Gaël Radou, Raphaël Jeanneret, Nadia Ruiz, Bertrand Ducos, Jean-François Allemand, Vincent Croquette

Abstract: Super-resolutive 3D tracking, such as PSF engineering or evanescent field imaging has long been used to track microparticles and to enhance the throughput of single molecules force spectroscopic measurements. However, current methods present two drawbacks. First, they lack precision compared with optical tweezers or AFM. Second, the dependence of their signal upon the position is complex creating… ▽ More Super-resolutive 3D tracking, such as PSF engineering or evanescent field imaging has long been used to track microparticles and to enhance the throughput of single molecules force spectroscopic measurements. However, current methods present two drawbacks. First, they lack precision compared with optical tweezers or AFM. Second, the dependence of their signal upon the position is complex creating the need for a time-consuming calibration step. Here, we introduce a new optical technique that circumvents both issues and allows for a simple, versatile and efficient 3D tracking of diluted particles while offering a sub-nanometer frame-to-frame precision in all three spatial directions. The principle is to combine stereoscopy and interferometry, such that the z (axial) position is measured through the distance between two interferometric fringe patterns. The linearity of this stereoscopy technique alleviates the need for lookup tables while the structured interferometric pattern enhances precision. On the other hand, the extended spatial footprint of this PSF maximizes the number of photons detected per frame without the need of fancy cameras, nor the need for complex hardware. Hence, thanks to its simplicity and versatility, we believe that SDI (Stereo Darkfield Interferometry) technology has the potential to significantly enhance the spreading of 3D tracking. We demonstrate the efficiency of this technique on various single-molecule measurements thanks to magnetic tweezers. In particular we demonstrate the precise quantification of two-state dynamics involving axial steps as short as 1 nm. We then show that SDI can be directly embedded in a commercial objective providing a means to track multiple single cells in 3D . △ Less

Submitted 23 August, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

Comments: 15 pages main text, 3 main figures, 28 pages supplementary, 35 supplementary figures 2020/08/13 : corrected typos, merged parts 3.1 and 3.3 for more clarity

Showing 1–50 of 108 results for author: Ruiz, N