-
From clonal interference to Poissonian interacting trajectories
Authors:
Felix Hermann,
Adrián Gonzalez Casanova,
Renato Soares dos Santos,
András Tóbiás,
Anton Wakolbinger
Abstract:
We consider a population whose size $N$ is fixed over the generations, and in which random beneficial mutations arrive at a rate of order $1/\log N$ per generation. In this so-called Gerrish-Lenski regime, typically a finite number of contending mutations is present together with one resident type. These mutations compete for fixation, a phenomenon addressed as clonal interference. We study a syst…
▽ More
We consider a population whose size $N$ is fixed over the generations, and in which random beneficial mutations arrive at a rate of order $1/\log N$ per generation. In this so-called Gerrish-Lenski regime, typically a finite number of contending mutations is present together with one resident type. These mutations compete for fixation, a phenomenon addressed as clonal interference. We study a system of Poissonian interacting trajectories (PIT) which arise as a large population scaling limit of the logarithmic sizes of the contending clonal subpopulations. We prove that this system exhibits an a.s.\ positive asymptotic rate of fitness increase (speed of adaptation), which turns out to be finite if and only if fitness increments have a finite expectation. We relate this speed to heuristic predictions from the literature. Furthermore, we derive a functional central limit theorem for the fitness of the resident population in the PIT. A main result of this work is that the Poissonian interacting trajectories arise as a large-population limit of a continuous time Moran model with strong selection.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Technical Design Report of the Spin Physics Detector at NICA
Authors:
The SPD Collaboration,
V. Abazov,
V. Abramov,
L. Afanasyev,
R. Akhunzyanov,
A. Akindinov,
I. Alekseev,
A. Aleshko,
V. Alexakhin,
G. Alexeev,
L. Alimov,
A. Allakhverdieva,
A. Amoroso,
V. Andreev,
V. Andreev,
E. Andronov,
Yu. Anikin,
S. Anischenko,
A. Anisenkov,
V. Anosov,
E. Antokhin,
A. Antonov,
S. Antsupov,
A. Anufriev,
K. Asadova
, et al. (392 additional authors not shown)
Abstract:
The Spin Physics Detector collaboration proposes to install a universal detector in the second interaction point of the NICA collider under construction (JINR, Dubna) to study the spin structure of the proton and deuteron and other spin-related phenomena using a unique possibility to operate with polarized proton and deuteron beams at a collision energy up to 27 GeV and a luminosity up to…
▽ More
The Spin Physics Detector collaboration proposes to install a universal detector in the second interaction point of the NICA collider under construction (JINR, Dubna) to study the spin structure of the proton and deuteron and other spin-related phenomena using a unique possibility to operate with polarized proton and deuteron beams at a collision energy up to 27 GeV and a luminosity up to $10^{32}$ cm$^{-2}$ s$^{-1}$. As the main goal, the experiment aims to provide access to the gluon TMD PDFs in the proton and deuteron, as well as the gluon transversity distribution and tensor PDFs in the deuteron, via the measurement of specific single and double spin asymmetries using different complementary probes such as charmonia, open charm, and prompt photon production processes. Other polarized and unpolarized physics is possible, especially at the first stage of NICA operation with reduced luminosity and collision energy of the proton and ion beams. This document is dedicated exclusively to technical issues of the SPD setup construction.
△ Less
Submitted 28 May, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Performance of a modular ton-scale pixel-readout liquid argon time projection chamber
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1340 additional authors not shown)
Abstract:
The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmi…
▽ More
The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Do** Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
H. Amar Es-sghir,
P. Amedo,
J. Anderson,
D. A. Andrade,
C. Andreopoulos
, et al. (1300 additional authors not shown)
Abstract:
Do** of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first do** test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUN…
▽ More
Do** of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first do** test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon do** can substantially recover light losses due to contamination of the liquid argon by nitrogen.
△ Less
Submitted 9 February, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
The DUNE Far Detector Vertical Drift Technology, Technical Design Report
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade,
C. Andreopoulos
, et al. (1304 additional authors not shown)
Abstract:
DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precisi…
▽ More
DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model.
The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise.
In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered.
This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Authors:
Animesh Sinha,
Bo Sun,
Anmol Kalia,
Arantxa Casanova,
Elliot Blanchard,
David Yan,
Winnie Zhang,
Tony Nelli,
Jiahui Chen,
Hardik Shah,
Licheng Yu,
Mitesh Kumar Singh,
Ankit Ramchandani,
Maziar Sanjabi,
Sonal Gupta,
Amy Bearman,
Dhruv Mahajan
Abstract:
We introduce Style Tailoring, a recipe to finetune Latent Diffusion Models (LDMs) in a distinct domain with high visual quality, prompt alignment and scene diversity. We choose sticker image generation as the target domain, as the images significantly differ from photorealistic samples typically generated by large-scale LDMs. We start with a competent text-to-image model, like Emu, and show that r…
▽ More
We introduce Style Tailoring, a recipe to finetune Latent Diffusion Models (LDMs) in a distinct domain with high visual quality, prompt alignment and scene diversity. We choose sticker image generation as the target domain, as the images significantly differ from photorealistic samples typically generated by large-scale LDMs. We start with a competent text-to-image model, like Emu, and show that relying on prompt engineering with a photorealistic model to generate stickers leads to poor prompt alignment and scene diversity. To overcome these drawbacks, we first finetune Emu on millions of sticker-like images collected using weak supervision to elicit diversity. Next, we curate human-in-the-loop (HITL) Alignment and Style datasets from model generations, and finetune to improve prompt alignment and style alignment respectively. Sequential finetuning on these datasets poses a tradeoff between better style alignment and prompt alignment gains. To address this tradeoff, we propose a novel fine-tuning method called Style Tailoring, which jointly fits the content and style distribution and achieves best tradeoff. Evaluation results show our method improves visual quality by 14%, prompt alignment by 16.2% and scene diversity by 15.3%, compared to prompt engineering the base Emu model for stickers generation.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Branching fractional Brownian motion: discrete approximations and maximal displacement
Authors:
Adrián González Casanova,
Jan Lukas Igelbrink
Abstract:
We construct and study branching fractional Brownian motion with Hurst parameter $H\in(1/2,1)$. The construction relies on a generalization of the discrete approximation of fractional Brownian motion (Hammond and Sheffield, Probability Theory and Related Fields, 2013) to power law Pólya urns indexed by trees. We show that the first order of the speed of branching fractional Brownian motion with Hu…
▽ More
We construct and study branching fractional Brownian motion with Hurst parameter $H\in(1/2,1)$. The construction relies on a generalization of the discrete approximation of fractional Brownian motion (Hammond and Sheffield, Probability Theory and Related Fields, 2013) to power law Pólya urns indexed by trees. We show that the first order of the speed of branching fractional Brownian motion with Hurst parameter $H$ is $ct^{H+1/2}$ where $c$ is explicit and only depends on the Hurst parameter. A notion of "branching property" for processes with memory emerges naturally from our construction.
△ Less
Submitted 22 April, 2024; v1 submitted 6 October, 2023;
originally announced October 2023.
-
Absorption and stationary times for the $Λ$-Wright-Fisher process
Authors:
Airam Blancas,
Adrián González Casanova,
Sebastian Hummel,
Sandra Palau
Abstract:
We derive stationary and fixation times for the multi-type $Λ$-Wright-Fisher process with and without the classic linear drift that models mutations. Our method relies on a grand coupling of the process realized through the so-called lookdown-construction. A well-known process embedded in this construction is the fixation line. We generalise the process to our setup and make use of the associated…
▽ More
We derive stationary and fixation times for the multi-type $Λ$-Wright-Fisher process with and without the classic linear drift that models mutations. Our method relies on a grand coupling of the process realized through the so-called lookdown-construction. A well-known process embedded in this construction is the fixation line. We generalise the process to our setup and make use of the associated explosion times to obtain a representation of the fixation and stationary times in terms of the waiting time in a coupon collector problem.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
Some Preliminary Steps Towards Metaverse Logic
Authors:
Antonio L. Furtado,
Marco A. Casanova,
Edirlei Soares de Lima
Abstract:
Assuming that the term 'metaverse' could be understood as a computer-based implementation of multiverse applications, we started to look in the present work for a logic that would be powerful enough to handle the situations arising both in the real and in the fictional underlying application domains. Realizing that first-order logic fails to account for the unstable behavior of even the most simpl…
▽ More
Assuming that the term 'metaverse' could be understood as a computer-based implementation of multiverse applications, we started to look in the present work for a logic that would be powerful enough to handle the situations arising both in the real and in the fictional underlying application domains. Realizing that first-order logic fails to account for the unstable behavior of even the most simpleminded information system domains, we resorted to non-conventional extensions, in an attempt to sketch a minimal composite logic strategy. The discussion was kept at a rather informal level, always trying to convey the intuition behind the theoretical notions in natural language terms, and appealing to an AI agent, namely ChatGPT, in the hope that algorithmic and common-sense approaches can be usefully combined.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Non-equilibrium steady state of the symmetric exclusion process with reservoirs
Authors:
Simone Floreani,
Adrián González Casanova
Abstract:
Consider the open symmetric exclusion process on a connected graph with vertexes in $[N-1]:=\{1,\ldots, N-1\}$ where points $1$ and $N-1$ are connected, respectively, to a left reservoir and a right reservoir with densities $ρ_L,ρ_R\in(0,1)$. We prove that the non-equilibrium steady state of such system is…
▽ More
Consider the open symmetric exclusion process on a connected graph with vertexes in $[N-1]:=\{1,\ldots, N-1\}$ where points $1$ and $N-1$ are connected, respectively, to a left reservoir and a right reservoir with densities $ρ_L,ρ_R\in(0,1)$. We prove that the non-equilibrium steady state of such system is $$μ_{\text{stat}} = \sum_{I\subset \mathcal P([N-1]) }F(I)\bigg(\otimes_{x\in I}\rm{Bernoulli}(ρ_R)\otimes_{y\in [N-1]\setminus I}\rm{Bernoulli}(ρ_L) \bigg).$$ In the formula above $ \mathcal P([N-1])$ denotes the power set of $[N-1]$ while the numbers $F(I)> 0$ are such that $\sum_{I\subset \mathcal P([N-1]) }F(I)=1$ and given in terms of absorption probabilities of the absorbing stochastic dual process. Via probabilistic arguments we compute explicitly the factors $F(I)$ when the graph is a homogeneous segment.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Muller's ratchet in a near-critical regime: tournament versus fitness proportional selection
Authors:
Jan Lukas Igelbrink,
Adrián González Casanova,
Charline Smadi,
Anton Wakolbinger
Abstract:
Muller's ratchet, in its prototype version, models a haploid, asexual population whose size~$N$ is constant over the generations. Slightly deleterious mutations are acquired along the lineages at a constant rate, and individuals carrying less mutations have a selective advantage. The classical variant considers {\it fitness proportional} selection, but other fitness schemes are conceivable as well…
▽ More
Muller's ratchet, in its prototype version, models a haploid, asexual population whose size~$N$ is constant over the generations. Slightly deleterious mutations are acquired along the lineages at a constant rate, and individuals carrying less mutations have a selective advantage. The classical variant considers {\it fitness proportional} selection, but other fitness schemes are conceivable as well. Inspired by the work of Etheridge et al. ([EPW09]) we propose a parameter scaling which fits well to the ``near-critical'' regime that was in the focus of [EPW09] (and in which the mutation-selection ratio diverges logarithmically as $N\to \infty$). Using a Moran model, we investigate the``rule of thumb'' given in [EPW09] for the click rate of the ``classical ratchet'' by putting it into the context of new results on the long-time evolution of the size of the best class of the ratchet with (binary) tournament selection, which (other than that of the classical ratchet) follows an autonomous dynamics up to the time of its extinction.
In [GSW23] it was discovered that the tournament ratchet has a hierarchy of dual processes which can be constructed on top of an Ancestral Selection graph with a Poisson decoration. For a regime in which the mutation/selection-ratio remains bounded away from 1, this was used in [GSW23] to reveal the asymptotics of the click rates as well as that of the type frequency profile between clicks. We will describe how these ideas can be extended to the near-critical regime in which the mutation-selection ratio of the tournament ratchet converges to 1 as $N\to \infty$.
△ Less
Submitted 18 June, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
The ancestral selection graph for a $Λ$-asymmetric Moran model
Authors:
Adrián González Casanova,
Noemi Kurt,
José Luis Pérez
Abstract:
Motivated by the question of the impact of selective advantage in populations with skewed reproduction mechanims, we study a Moran model with selection. We assume that there are two types of individuals, where the reproductive success of one type is larger than the other. The higher reproductive success may stem from either more frequent reproduction, or from larger numbers of offspring, and is en…
▽ More
Motivated by the question of the impact of selective advantage in populations with skewed reproduction mechanims, we study a Moran model with selection. We assume that there are two types of individuals, where the reproductive success of one type is larger than the other. The higher reproductive success may stem from either more frequent reproduction, or from larger numbers of offspring, and is encoded in a measure $Λ$ for each of the two types. Our approach consists of constructing a $Λ$-asymmetric Moran model in which individuals of the two populations compete, rather than considering a Moran model for each population. Under certain conditions, that we call the "partial order of adaptation", we can couple these measures. This allows us to construct the central object of this paper, the $Λ-$asymmetric ancestral selection graph, leading to a pathwise duality of the forward in time $Λ$-asymmetric Moran model with its ancestral process. Interestingly, the construction also provides a connection to the theory of optimal transport. We apply the ancestral selection graph in order to obtain scaling limits of the forward and backward processes, and note that the frequency process converges to the solution of an SDE with discontinous paths. Finally, we derive a Griffiths representation for the generator of the SDE and use it to find a semi-explicit formula for the probability of fixation of the less beneficial of the two types.
△ Less
Submitted 5 January, 2024; v1 submitted 31 May, 2023;
originally announced June 2023.
-
Lookdown construction for a Moran seed-bank model
Authors:
Maria Clara Fittipaldi,
Adrián González Casanova,
Julio Ernesto Nava
Abstract:
We present a lookdown construction for a Moran seed-bank model with variable active and inactive population sizes and we show that the empirical measure of our model coincides with that of the Seed-Bank-Moran Model with latency of Greven, den Hollander and Oomen, 2022. Furthermore, we prove that the time to the most recent common ancestor, starting from $N$ individuals with stationary distribution…
▽ More
We present a lookdown construction for a Moran seed-bank model with variable active and inactive population sizes and we show that the empirical measure of our model coincides with that of the Seed-Bank-Moran Model with latency of Greven, den Hollander and Oomen, 2022. Furthermore, we prove that the time to the most recent common ancestor, starting from $N$ individuals with stationary distribution over its state (active or inactive), has the same asymptotic order as the largest inactivity period. We then obtain an asymptotic distribution of the TMRCA, and use this result to find the first order of the asymptotic distribution of the fixation time of a single beneficial mutant conditioned to invade the whole population, which surprisingly is of order $\ln(N)$.
△ Less
Submitted 21 May, 2023;
originally announced May 2023.
-
Controllable Image Generation via Collage Representations
Authors:
Arantxa Casanova,
Marlène Careil,
Adriana Romero-Soriano,
Christopher J. Pal,
Jakob Verbeek,
Michal Drozdzal
Abstract:
Recent advances in conditional generative image models have enabled impressive results. On the one hand, text-based conditional models have achieved remarkable generation quality, by leveraging large-scale datasets of image-text pairs. To enable fine-grained controllability, however, text-based models require long prompts, whose details may be ignored by the model. On the other hand, layout-based…
▽ More
Recent advances in conditional generative image models have enabled impressive results. On the one hand, text-based conditional models have achieved remarkable generation quality, by leveraging large-scale datasets of image-text pairs. To enable fine-grained controllability, however, text-based models require long prompts, whose details may be ignored by the model. On the other hand, layout-based conditional models have also witnessed significant advances. These models rely on bounding boxes or segmentation maps for precise spatial conditioning in combination with coarse semantic labels. The semantic labels, however, cannot be used to express detailed appearance characteristics. In this paper, we approach fine-grained scene controllability through image collages which allow a rich visual description of the desired scene as well as the appearance and location of the objects therein, without the need of class nor attribute labels. We introduce "mixing and matching scenes" (M&Ms), an approach that consists of an adversarially trained generative image model which is conditioned on appearance features and spatial positions of the different elements in a collage, and integrates these into a coherent image. We train our model on the OpenImages (OI) dataset and evaluate it on collages derived from OI and MS-COCO datasets. Our experiments on the OI dataset show that M&Ms outperforms baselines in terms of fine-grained scene controllability while being very competitive in terms of image quality and sample diversity. On the MS-COCO dataset, we highlight the generalization ability of our model by outperforming DALL-E in terms of the zero-shot FID metric, despite using two magnitudes fewer parameters and data. Collage based generative models have the potential to advance content creation in an efficient and effective way as they are intuitive to use and yield high quality generations.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Instance-Conditioned GAN Data Augmentation for Representation Learning
Authors:
Pietro Astolfi,
Arantxa Casanova,
Jakob Verbeek,
Pascal Vincent,
Adriana Romero-Soriano,
Michal Drozdzal
Abstract:
Data augmentation has become a crucial component to train state-of-the-art visual representation models. However, handcrafting combinations of transformations that lead to improved performances is a laborious task, which can result in visually unrealistic samples. To overcome these limitations, recent works have explored the use of generative models as learnable data augmentation tools, showing pr…
▽ More
Data augmentation has become a crucial component to train state-of-the-art visual representation models. However, handcrafting combinations of transformations that lead to improved performances is a laborious task, which can result in visually unrealistic samples. To overcome these limitations, recent works have explored the use of generative models as learnable data augmentation tools, showing promising results in narrow application domains, e.g., few-shot learning and low-data medical imaging. In this paper, we introduce a data augmentation module, called DA_IC-GAN, which leverages instance-conditioned GAN generations and can be used off-the-shelf in conjunction with most state-of-the-art training recipes. We showcase the benefits of DA_IC-GAN by plugging it out-of-the-box into the supervised training of ResNets and DeiT models on the ImageNet dataset, and achieving accuracy boosts up to between 1%p and 2%p with the highest capacity models. Moreover, the learnt representations are shown to be more robust than the baselines when transferred to a handful of out-of-distribution datasets, and exhibit increased invariance to variations of instance and viewpoints. We additionally couple DA_IC-GAN with a self-supervised training recipe and show that we can also achieve an improvement of 1%p in accuracy in some settings. With this work, we strengthen the evidence on the potential of learnable data augmentations to improve visual representation learning, paving the road towards non-handcrafted augmentations in model training.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Alpha-stable branching and beta-Frequency processes, beyond the IID assumption
Authors:
Adrián González Casanova,
Imanol Nuñez,
J. -L. Pérez
Abstract:
Birkner et al. obtained necessary and sufficient conditions for the frequency between two independent and identically distributed continuous-state branching processes time-changed by a functional of the total mass process to be a Markov process. Foucart et al. extended this result to continuous-state branching processes with immigration. We generalize these results by drop** the independent and…
▽ More
Birkner et al. obtained necessary and sufficient conditions for the frequency between two independent and identically distributed continuous-state branching processes time-changed by a functional of the total mass process to be a Markov process. Foucart et al. extended this result to continuous-state branching processes with immigration. We generalize these results by drop** the independent and identically distributed assumption. Our result clarifies under which conditions a multi-type $Λ$-coalescent can be constructed from a multi-type branching process by a time change using the total mass. Finally, we address a problem formulated by Griffiths, by clarifying the relation between 2-type $α$-stable continuous-state branching processes and 2-type $β$-Fleming--Viot processes with mutation and selection.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
Quasi-equilibria and click times for a variant of Muller's ratchet
Authors:
Adrian Gonzalez Casanova,
Charline Smadi,
Anton Wakolbinger
Abstract:
Consider a population of $N$ individuals, each of them carrying a type in $\mathbb N_0$. The population evolves according to a Moran dynamics with selection and mutation, where an individual of type $k$ has the same selective advantage over all individuals with type $k' > k$, and type $k$ mutates to type $k+1$ at a constant rate. This model is thus a variation of the classical Muller's ratchet: th…
▽ More
Consider a population of $N$ individuals, each of them carrying a type in $\mathbb N_0$. The population evolves according to a Moran dynamics with selection and mutation, where an individual of type $k$ has the same selective advantage over all individuals with type $k' > k$, and type $k$ mutates to type $k+1$ at a constant rate. This model is thus a variation of the classical Muller's ratchet: there the selective advantage is proportional to $k'-k$. For a regime of selection strength and mutation rates which is between the regimes of weak and strong selection/mutation, we obtain the asymptotic rate of the {\em click times} of the ratchet (i.e. the times at which the hitherto minimal (`best') type in the population is lost), and reveal the quasi-stationary type frequency profile between clicks. The large population limit of this profile is characterized as the normalized attractor of a ``dual'' hierarchical multitype logistic system, and also via the distribution of the final minimal displacement in a branching random walk with one-sided steps. An important role in the proofs is played by a graphical representation of the model, both forward and backward in time, and a central tool is the ancestral selection graph decorated by mutations.
△ Less
Submitted 5 December, 2023; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Seed bank Cannings Graphs: How dormancy smoothes random genetic drift
Authors:
Adrián González Casanova,
Lizbeth Peñaloza,
Arno Siri-Jégousse
Abstract:
In this article, we introduce a random (directed) graph model for the simultaneous forwards and backwards description of a rather broad class of Cannings models with a seed bank mechanism. This provides a simple tool to establish a sampling duality in the finite population size, and obtain a path-wise embedding of the forward frequency process and the backward ancestral process. Further, it allows…
▽ More
In this article, we introduce a random (directed) graph model for the simultaneous forwards and backwards description of a rather broad class of Cannings models with a seed bank mechanism. This provides a simple tool to establish a sampling duality in the finite population size, and obtain a path-wise embedding of the forward frequency process and the backward ancestral process. Further, it allows the derivation of limit theorems that generalize celebrated results by Möhle to models with seed banks, and where it can be seen how the effect of seed banks affects the genealogies. The explicit graphical construction is a new tool to understand the subtle interplay of seed banks, reproduction and genetic drift in population genetics.
△ Less
Submitted 19 May, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Asymptotics of the frequency spectrum for general Dirichlet Xi-coalescents
Authors:
Adrian Gonzalez Casanova,
Veronica Miro Pina,
Emmanuel Schertzer,
Arno Siri-Jegousse
Abstract:
In this work, we study general Dirichlet coalescents, which are a family of Xi-coalecents constructed from i.i.d mass partitions, and are an extension of the symmetric coalescent. This class of models is motivated by population models with recurrent demographic bottlenecks. We study the short time behavior of the multidimensional block counting process whose i-th component counts the number of blo…
▽ More
In this work, we study general Dirichlet coalescents, which are a family of Xi-coalecents constructed from i.i.d mass partitions, and are an extension of the symmetric coalescent. This class of models is motivated by population models with recurrent demographic bottlenecks. We study the short time behavior of the multidimensional block counting process whose i-th component counts the number of blocks of size i. Compared to standard coalescent models (such as the class of Lambda-coalescents coming down from infinity), our process has no deterministic speed of coming down from infinity. In particular, we prove that, under appropriate re-scaling, it converges to a stochastic process which is the unique solution of a martingale problem. We show that the multivariate Lamperti transform of this limiting process is a Markov Additive Process (MAP). This allows us to provide some asymptotics for the n-Site Frequency Spectrum, which is a statistic widely used in population genetics. In particular, the rescaled number of mutations converges to the exponential functional of a subordinator.
△ Less
Submitted 20 September, 2023; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Revisiting Hotels-50K and Hotel-ID
Authors:
Aarash Feizi,
Arantxa Casanova,
Adriana Romero-Soriano,
Reihaneh Rabbany
Abstract:
In this paper, we propose revisited versions for two recent hotel recognition datasets: Hotels50K and Hotel-ID. The revisited versions provide evaluation setups with different levels of difficulty to better align with the intended real-world application, i.e. countering human trafficking. Real-world scenarios involve hotels and locations that are not captured in the current data sets, therefore it…
▽ More
In this paper, we propose revisited versions for two recent hotel recognition datasets: Hotels50K and Hotel-ID. The revisited versions provide evaluation setups with different levels of difficulty to better align with the intended real-world application, i.e. countering human trafficking. Real-world scenarios involve hotels and locations that are not captured in the current data sets, therefore it is important to consider evaluation settings where classes are truly unseen. We test this setup using multiple state-of-the-art image retrieval models and show that as expected, the models' performances decrease as the evaluation gets closer to the real-world unseen settings. The rankings of the best performing models also change across the different evaluation settings, which further motivates using the proposed revisited datasets.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Scaling limit of an adaptive contact process
Authors:
Adrián González Casanova,
András Tóbiás,
Daniel Valesin
Abstract:
We introduce and study an interacting particle system evolving on the $d$-dimensional torus $(\mathbb Z/N\mathbb Z)^d$. Each vertex of the torus can be either empty or occupied by an individual of type $λ\in (0,\infty)$. An individual of type $λ$ dies with rate one and gives birth at each neighboring empty position with rate $λ$; moreover, when the birth takes place, the newborn individual is like…
▽ More
We introduce and study an interacting particle system evolving on the $d$-dimensional torus $(\mathbb Z/N\mathbb Z)^d$. Each vertex of the torus can be either empty or occupied by an individual of type $λ\in (0,\infty)$. An individual of type $λ$ dies with rate one and gives birth at each neighboring empty position with rate $λ$; moreover, when the birth takes place, the newborn individual is likely to have the same type as the parent, but has a small probability of being a mutant. A mutant child of an individual of type $λ$ has type chosen according to a probability kernel. We consider the asymptotic behavior of this process when $N\to \infty$ and the parameter $δ_N$ tends to zero fast enough that mutations are sufficiently separated in time, so that the amount of time spent on configurations with more than one type becomes negligible. We show that, after a suitable time scaling and deletion of the periods of time spent on configurations with more than one type, the process converges to a Markov jump process on $(0,\infty)$, whose rates we characterize.
△ Less
Submitted 20 June, 2023; v1 submitted 7 July, 2022;
originally announced July 2022.
-
A Note on Process Modelling: Combining Situation Calculus and Petri Nets
Authors:
Edirlei Soares de Lima,
Antonio L. Furtado,
Bruno Feijó,
Marco A. Casanova
Abstract:
The situation calculus logic model is convenient for modelling the actions that can occur in an information system application. The interplay of pre-conditions and post-conditions determines a semantically justified partial order of the defined actions and serves to enforce integrity constraints. This form of specification allows the use of plan-generation algorithms to investigate, before the sys…
▽ More
The situation calculus logic model is convenient for modelling the actions that can occur in an information system application. The interplay of pre-conditions and post-conditions determines a semantically justified partial order of the defined actions and serves to enforce integrity constraints. This form of specification allows the use of plan-generation algorithms to investigate, before the system is adopted, whether the proposed specification allows all desirable use cases, and effectively disallows undesirable ones. Especially for legacy applications, implemented without a prior specification, Process Mining techniques were employed to derive an implicit Petri net model from the analysis of a large number of traces registered in an execution log. However, if the system just begins to be used, and has a still empty execution log, this sort of process mining discovery would not be feasible. This paper explains how the Petri net model can be directly derived from the situation calculus specification rules. The main gist is to provide evidence that the two models are complementary, not only because the Petri net model is derivable from the situation calculus model, but also in view of the distinct advantages of the two models. While the situation calculus model leads to planning and simulated execution prior to implementation, the Petri net model can be designed to run in a restrictive mode, allowing an intuitive visualization of the workable sequences. As proof of concept, the paper describes a prototype to demonstrate the methods and applies it to two examples: a published request processing application used to introduce process mining notions; and an analogously structured trial by combat application taken from a popular movie. The prototype includes an interactive dramatization component, which enacts the second application.
△ Less
Submitted 1 July, 2022;
originally announced July 2022.
-
A Gaseous Argon-Based Near Detector to Enhance the Physics Capabilities of DUNE
Authors:
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo
, et al. (1220 additional authors not shown)
Abstract:
This document presents the concept and physics case for a magnetized gaseous argon-based detector system (ND-GAr) for the Deep Underground Neutrino Experiment (DUNE) Near Detector. This detector system is required in order for DUNE to reach its full physics potential in the measurement of CP violation and in delivering precision measurements of oscillation parameters. In addition to its critical r…
▽ More
This document presents the concept and physics case for a magnetized gaseous argon-based detector system (ND-GAr) for the Deep Underground Neutrino Experiment (DUNE) Near Detector. This detector system is required in order for DUNE to reach its full physics potential in the measurement of CP violation and in delivering precision measurements of oscillation parameters. In addition to its critical role in the long-baseline oscillation program, ND-GAr will extend the overall physics program of DUNE. The LBNF high-intensity proton beam will provide a large flux of neutrinos that is sampled by ND-GAr, enabling DUNE to discover new particles and search for new interactions and symmetries beyond those predicted in the Standard Model.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
Snowmass Neutrino Frontier: DUNE Physics Summary
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez
, et al. (1221 additional authors not shown)
Abstract:
The Deep Underground Neutrino Experiment (DUNE) is a next-generation long-baseline neutrino oscillation experiment with a primary physics goal of observing neutrino and antineutrino oscillation patterns to precisely measure the parameters governing long-baseline neutrino oscillation in a single experiment, and to test the three-flavor paradigm. DUNE's design has been developed by a large, internat…
▽ More
The Deep Underground Neutrino Experiment (DUNE) is a next-generation long-baseline neutrino oscillation experiment with a primary physics goal of observing neutrino and antineutrino oscillation patterns to precisely measure the parameters governing long-baseline neutrino oscillation in a single experiment, and to test the three-flavor paradigm. DUNE's design has been developed by a large, international collaboration of scientists and engineers to have unique capability to measure neutrino oscillation as a function of energy in a broadband beam, to resolve degeneracy among oscillation parameters, and to control systematic uncertainty using the exquisite imaging capability of massive LArTPC far detector modules and an argon-based near detector. DUNE's neutrino oscillation measurements will unambiguously resolve the neutrino mass ordering and provide the sensitivity to discover CP violation in neutrinos for a wide range of possible values of $δ_{CP}$. DUNE is also uniquely sensitive to electron neutrinos from a galactic supernova burst, and to a broad range of physics beyond the Standard Model (BSM), including nucleon decays. DUNE is anticipated to begin collecting physics data with Phase I, an initial experiment configuration consisting of two far detector modules and a minimal suite of near detector components, with a 1.2 MW proton beam. To realize its extensive, world-leading physics potential requires the full scope of DUNE be completed in Phase II. The three Phase II upgrades are all necessary to achieve DUNE's physics goals: (1) addition of far detector modules three and four for a total FD fiducial mass of at least 40 kt, (2) upgrade of the proton beam power from 1.2 MW to 2.4 MW, and (3) replacement of the near detector's temporary muon spectrometer with a magnetized, high-pressure gaseous argon TPC and calorimeter.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
Two-type branching processes with immigration, and the structured coalescents
Authors:
María Emilia Caballero,
Adrián González Casanova,
José Luis Pérez
Abstract:
We consider a population constituted by two types of individuals; each of them can produce offspring in two different islands (as a particular case the islands can be interpreted as active or dormant individuals). We model the evolution of the population of each type using a two-type Feller diffusion with immigration, and we study the frequency of one of the types, in each island, when the total p…
▽ More
We consider a population constituted by two types of individuals; each of them can produce offspring in two different islands (as a particular case the islands can be interpreted as active or dormant individuals). We model the evolution of the population of each type using a two-type Feller diffusion with immigration, and we study the frequency of one of the types, in each island, when the total population size in each island is forced to be constant at a dense set of times. This leads to the solution of a SDE which we call the asymmetric two-island frequency process. We derive properties of this process and obtain a large population limit when the total size of each island tends to infinity. Additionally, we compute the fluctuations of the process around its deterministic limit. We establish conditions under which the asymmetric two-island frequency process has a moment dual. The dual is a continuous-time two-dimensional Markov chain that can be interpreted in terms of mutation, branching, pairwise branching, coalescence, and a novel mixed selection-migration term. Also, we conduct a stability analysis of the limiting deterministic dynamical system and present some numerical results to study fixation and a new form of balancing selection. When restricting to the seedbank model, we observe that some combinations of the parameters lead to balancing selection. Besides finding yet another way in which genetic reservoirs increase the genetic variability, we find that if a population that sustains a seedbank competes with one that does not, the seed producers will have a selective advantage if they reproduce faster, but will not have a selective disadvantage if they reproduce slower: their worst case scenario is balancing selection.
△ Less
Submitted 1 May, 2024; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Instance-Conditioned GAN
Authors:
Arantxa Casanova,
Marlène Careil,
Jakob Verbeek,
Michal Drozdzal,
Adriana Romero-Soriano
Abstract:
Generative Adversarial Networks (GANs) can generate near photo realistic images in narrow domains such as human faces. Yet, modeling complex distributions of datasets such as ImageNet and COCO-Stuff remains challenging in unconditional settings. In this paper, we take inspiration from kernel density estimation techniques and introduce a non-parametric approach to modeling distributions of complex…
▽ More
Generative Adversarial Networks (GANs) can generate near photo realistic images in narrow domains such as human faces. Yet, modeling complex distributions of datasets such as ImageNet and COCO-Stuff remains challenging in unconditional settings. In this paper, we take inspiration from kernel density estimation techniques and introduce a non-parametric approach to modeling distributions of complex datasets. We partition the data manifold into a mixture of overlap** neighborhoods described by a datapoint and its nearest neighbors, and introduce a model, called instance-conditioned GAN (IC-GAN), which learns the distribution around each datapoint. Experimental results on ImageNet and COCO-Stuff show that IC-GAN significantly improves over unconditional models and unsupervised data partitioning baselines. Moreover, we show that IC-GAN can effortlessly transfer to datasets not seen during training by simply changing the conditioning instances, and still generate realistic images. Finally, we extend IC-GAN to the class-conditional case and show semantically controllable generation and competitive quantitative results on ImageNet; while improving over BigGAN on ImageNet-LT. Code and trained models to reproduce the reported results are available at https://github.com/facebookresearch/ic_gan.
△ Less
Submitted 4 November, 2021; v1 submitted 10 September, 2021;
originally announced September 2021.
-
Algebras of Sets and Coherent Sets of Gambles
Authors:
Juerg Kohlas,
Arianna Casanova,
Marco Zaffalon
Abstract:
In a recent work we have shown how to construct an information algebra of coherent sets of gambles defined on general possibility spaces. Here we analyze the connection of such an algebra with the set algebra of subsets of the possibility space on which gambles are defined and the set algebra of sets of its atoms. Set algebras are particularly important information algebras since they are their pr…
▽ More
In a recent work we have shown how to construct an information algebra of coherent sets of gambles defined on general possibility spaces. Here we analyze the connection of such an algebra with the set algebra of subsets of the possibility space on which gambles are defined and the set algebra of sets of its atoms. Set algebras are particularly important information algebras since they are their prototypical structures. Furthermore, they are the algebraic counterparts of classical propositional logic. As a consequence, this paper also details how propositional logic is naturally embedded into the theory of imprecise probabilities.
△ Less
Submitted 27 May, 2021;
originally announced May 2021.
-
Information algebras of coherent sets of gambles in general possibility spaces
Authors:
Juerg Kohlas,
Arianna Casanova,
Marco Zaffalon
Abstract:
In this paper, we show that coherent sets of gambles can be embedded into the algebraic structure of information algebra. This leads firstly, to a new perspective of the algebraic and logical structure of desirability and secondly, it connects desirability, hence imprecise probabilities, to other formalism in computer science sharing the same underlying structure. Both the domain-free and the labe…
▽ More
In this paper, we show that coherent sets of gambles can be embedded into the algebraic structure of information algebra. This leads firstly, to a new perspective of the algebraic and logical structure of desirability and secondly, it connects desirability, hence imprecise probabilities, to other formalism in computer science sharing the same underlying structure. Both the domain-free and the labeled view of the information algebra of coherent sets of gambles are presented, considering general possibility spaces.
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
Information algebras in the theory of imprecise probabilities
Authors:
Arianna Casanova,
Juerg Kohlas,
Marco Zaffalon
Abstract:
In this paper, we show that coherent sets of gambles and coherent lower and upper previsions can be embedded into the algebraic structure of information algebra. This leads firstly, to a new perspective of the algebraic and logical structure of desirability and imprecise probabilities and secondly, it connects imprecise probabilities to other formalism in computer science sharing the same underlyi…
▽ More
In this paper, we show that coherent sets of gambles and coherent lower and upper previsions can be embedded into the algebraic structure of information algebra. This leads firstly, to a new perspective of the algebraic and logical structure of desirability and imprecise probabilities and secondly, it connects imprecise probabilities to other formalism in computer science sharing the same underlying structure. Both the domain free and the labeled view of the resulting information algebras are presented, considering product possibility spaces. Moreover, it is shown that both are atomistic and therefore they can be embedded in set algebras.
△ Less
Submitted 27 April, 2021; v1 submitted 26 February, 2021;
originally announced February 2021.
-
Generating unseen complex scenes: are we there yet?
Authors:
Arantxa Casanova,
Michal Drozdzal,
Adriana Romero-Soriano
Abstract:
Although recent complex scene conditional generation models generate increasingly appealing scenes, it is very hard to assess which models perform better and why. This is often due to models being trained to fit different data splits, and defining their own experimental setups. In this paper, we propose a methodology to compare complex scene conditional generation models, and provide an in-depth a…
▽ More
Although recent complex scene conditional generation models generate increasingly appealing scenes, it is very hard to assess which models perform better and why. This is often due to models being trained to fit different data splits, and defining their own experimental setups. In this paper, we propose a methodology to compare complex scene conditional generation models, and provide an in-depth analysis that assesses the ability of each model to (1) fit the training distribution and hence perform well on seen conditionings, (2) to generalize to unseen conditionings composed of seen object combinations, and (3) generalize to unseen conditionings composed of unseen object combinations. As a result, we observe that recent methods are able to generate recognizable scenes given seen conditionings, and exploit compositionality to generalize to unseen conditionings with seen object combinations. However, all methods suffer from noticeable image quality degradation when asked to generate images from conditionings composed of unseen object combinations. Moreover, through our analysis, we identify the advantages of different pipeline components, and find that (1) encouraging compositionality through instance-wise spatial conditioning normalizations increases robustness to both types of unseen conditionings, (2) using semantically aware losses such as the scene-graph perceptual similarity helps improve some dimensions of the generation process, and (3) enhancing the quality of generated masks and the quality of the individual objects are crucial steps to improve robustness to both types of unseen conditionings.
△ Less
Submitted 7 December, 2020;
originally announced December 2020.
-
The relative frequency between two continuous-state branching processes with immigration and their genealogy
Authors:
María Emilia Caballero,
Adrián González Casanova,
José-Luis Pérez
Abstract:
When two (possibly different in distribution) continuous-state branching processes with immigration are present, we study the relative frequency of one of them when the total mass is forced to be constant at a dense set of times. This leads to a SDE whose unique strong solution will be the definition of a $Λ$-asymmetric frequency process ($Λ$-AFP). We prove that it is a Feller process and we calcu…
▽ More
When two (possibly different in distribution) continuous-state branching processes with immigration are present, we study the relative frequency of one of them when the total mass is forced to be constant at a dense set of times. This leads to a SDE whose unique strong solution will be the definition of a $Λ$-asymmetric frequency process ($Λ$-AFP). We prove that it is a Feller process and we calculate a large population limit when the total mass tends to infinity. This allows us to study the fluctuations of the process around its deterministic limit. Furthermore, we find conditions for the $Λ$-AFP to have a moment dual. The dual can be interpreted in terms of selection, (coordinated) mutation, pairwise branching (efficiency), coalescence, and a novel component that comes from the asymmetry between the reproduction mechanisms. In the particular case of a pair of equally distributed continuous-state branching processes, the associated $Λ$-AFP will be the dual of a $Λ$-coalescent. The map that sends each continuous-state branching process to its associated $Λ$-coalescent (according to the former procedure) is a homeomorphism between metric spaces.
△ Less
Submitted 11 March, 2023; v1 submitted 1 October, 2020;
originally announced October 2020.
-
$Λ$-coalescents arising in populations with dormancy
Authors:
Fernando Cordero,
Adrián González Casanova,
Jason Schweinsberg,
Maite Wilke-Berenguer
Abstract:
Consider a population evolving from year to year through three seasons: spring, summer and winter. Every spring starts with $N$ dormant individuals waking up independently of each other according to a given distribution. Once an individual is awake, it starts reproducing at a constant rate. By the end of spring, all individuals are awake and continue reproducing independently as Yule processes dur…
▽ More
Consider a population evolving from year to year through three seasons: spring, summer and winter. Every spring starts with $N$ dormant individuals waking up independently of each other according to a given distribution. Once an individual is awake, it starts reproducing at a constant rate. By the end of spring, all individuals are awake and continue reproducing independently as Yule processes during the whole summer. In the winter, $N$ individuals chosen uniformly at random go to sleep until the next spring, and the other individuals die. We show that because an individual that wakes up unusually early can have a large number of surviving descendants, for some choices of model parameters the genealogy of the population will be described by a $Λ$-coalescent. In particular, the beta coalescent can describe the genealogy when the rate at which individuals wake up increases exponentially over time. We also characterize the set of all $Λ$-coalescents that can arise in this framework.
△ Less
Submitted 20 September, 2020;
originally announced September 2020.
-
A Brief Survey on Replica Consistency in Cloud Environments
Authors:
Robson A. Campêlo,
Marco A. Casanova,
Dorgival O. Guedes,
Alberto H. F. Laender
Abstract:
Cloud computing is a general term that involves delivering hosted services over the Internet. With the accelerated growth of the volume of data used by applications, many organizations have moved their data into cloud servers to provide scalable, reliable and highly available services. A particularly challenging issue that arises in the context of cloud storage systems with geographically-distribu…
▽ More
Cloud computing is a general term that involves delivering hosted services over the Internet. With the accelerated growth of the volume of data used by applications, many organizations have moved their data into cloud servers to provide scalable, reliable and highly available services. A particularly challenging issue that arises in the context of cloud storage systems with geographically-distributed data replication is how to reach a consistent state for all replicas. This survey reviews major aspects related to consistency issues in cloud data storage systems, categorizing recently proposed methods into three categories: (1) fixed consistency methods, (2) configurable consistency methods and (3) consistency monitoring methods.
△ Less
Submitted 1 September, 2020; v1 submitted 26 August, 2020;
originally announced August 2020.
-
Haldane's formula in Cannings models: The case of moderately strong selection
Authors:
Florin Boenkost,
Adrián González Casanova,
Cornelia Pokalyuk,
Anton Wakolbinger
Abstract:
For a class of Cannings models we prove Haldane's formula, $π(s_N) \sim \frac{2s_N}{ρ^2}$, for the fixation probability of a single beneficial mutant in the limit of large population size $N$ and in the regime of moderately strong selection, i.e. for $s_N \sim N^{-b}$ and $0< b<1/2$. Here, $s_N$ is the selective advantage of an individual carrying the beneficial type, and $ρ^2$ is the (asymptotic)…
▽ More
For a class of Cannings models we prove Haldane's formula, $π(s_N) \sim \frac{2s_N}{ρ^2}$, for the fixation probability of a single beneficial mutant in the limit of large population size $N$ and in the regime of moderately strong selection, i.e. for $s_N \sim N^{-b}$ and $0< b<1/2$. Here, $s_N$ is the selective advantage of an individual carrying the beneficial type, and $ρ^2$ is the (asymptotic) offspring variance. Our assumptions on the reproduction mechanism allow for a coupling of the beneficial allele's frequency process with slightly supercritical Galton-Watson processes in the early phase of fixation.
△ Less
Submitted 21 January, 2022; v1 submitted 5 August, 2020;
originally announced August 2020.
-
Reinforced active learning for image segmentation
Authors:
Arantxa Casanova,
Pedro O. Pinheiro,
Negar Rostamzadeh,
Christopher J. Pal
Abstract:
Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and time-consuming. Second, realistic segmentation datasets are highly unbalanced: some categories are much more abundant than others, biasing the performance to the most represented ones. In this paper, we are interested in focusing human labelling effort on a small su…
▽ More
Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and time-consuming. Second, realistic segmentation datasets are highly unbalanced: some categories are much more abundant than others, biasing the performance to the most represented ones. In this paper, we are interested in focusing human labelling effort on a small subset of a larger pool of data, minimizing this effort while maximizing performance of a segmentation model on a hold-out set. We present a new active learning strategy for semantic segmentation based on deep reinforcement learning (RL). An agent learns a policy to select a subset of small informative image regions -- opposed to entire images -- to be labeled, from a pool of unlabeled data. The region selection decision is made based on predictions and uncertainties of the segmentation model being trained. Our method proposes a new modification of the deep Q-network (DQN) formulation for active learning, adapting it to the large-scale nature of semantic segmentation problems. We test the proof of concept in CamVid and provide results in the large-scale dataset Cityscapes. On Cityscapes, our deep RL region-based DQN approach requires roughly 30% less additional labeled data than our most competitive baseline to reach the same performance. Moreover, we find that our method asks for more labels of under-represented categories compared to the baselines, improving their performance and hel** to mitigate class imbalance.
△ Less
Submitted 16 February, 2020;
originally announced February 2020.
-
Particle systems with coordination
Authors:
Adrián González Casanova,
Noemi Kurt,
András Tóbiás
Abstract:
We consider a generalization of spatial branching coalescing processes in which the behaviour of individuals is not (necessarily) independent, on the contrary, individuals tend to take simultaneous actions. We show that these processes have moment duals, which happen to be multidimensional diffusions with jumps. Moment duality provides a general framework to study structural properties of the proc…
▽ More
We consider a generalization of spatial branching coalescing processes in which the behaviour of individuals is not (necessarily) independent, on the contrary, individuals tend to take simultaneous actions. We show that these processes have moment duals, which happen to be multidimensional diffusions with jumps. Moment duality provides a general framework to study structural properties of the processes in this class. We present some conditions under which the expectation of the process is not affected by coordination and comment on the effect of coordination on the variance. We analyse several examples in more detail, including the nested coalescent, the peripatric coalescent with selection and coordinated migration, and the Parabolic Anderson Model.
△ Less
Submitted 4 May, 2021; v1 submitted 16 January, 2020;
originally announced January 2020.
-
The shape of a seed bank tree
Authors:
Adrián González Casanova,
Lizbeth Peñaloza,
Arno Siri-Jégousse
Abstract:
We derive the asymptotic behavior of the total, active and inactive branch lengths of the seed bank coalescent, when the size of the initial sample grows to infinity. Those random variables have important applications for populations evolving under some seed bank effects, such as plants and bacteria, and for some cases of structured populations like metapopulations. The proof relies on the study o…
▽ More
We derive the asymptotic behavior of the total, active and inactive branch lengths of the seed bank coalescent, when the size of the initial sample grows to infinity. Those random variables have important applications for populations evolving under some seed bank effects, such as plants and bacteria, and for some cases of structured populations like metapopulations. The proof relies on the study of the tree at a stop** time corresponding to the first time that a deactivated lineage reactivates. We also give conditional sampling formulas for the random partition and we study the system at the time of the first deactivation of a lineage. All these results provide a good picture of the different regimes and behaviors of the block-counting process of the seed bank coalescent.
△ Less
Submitted 24 September, 2020; v1 submitted 13 January, 2020;
originally announced January 2020.
-
Haldane's formula in Cannings models: The case of moderately weak selection
Authors:
Florin Boenkost,
Adrián González Casanova,
Cornelia Pokalyuk,
Anton Wakolbinger
Abstract:
We introduce a Cannings model with directional selection via a paintbox construction and establish a strong duality with the line counting process of a new \emph{Cannings ancestral selection graph} in discrete time. This duality also yields a formula for the fixation probability of the beneficial type. Haldane's formula states that for a single selectively advantageous individual in a population o…
▽ More
We introduce a Cannings model with directional selection via a paintbox construction and establish a strong duality with the line counting process of a new \emph{Cannings ancestral selection graph} in discrete time. This duality also yields a formula for the fixation probability of the beneficial type. Haldane's formula states that for a single selectively advantageous individual in a population of haploid individuals of size $N$ the prob\-ability of fixation is asymptotically (as $N\to \infty$) equal to the selective advantage of haploids $s_N$ divided by half of the offspring variance. For a class of offspring distributions within Kingman attraction we prove this asymptotics for sequences $s_N$ obeying $N^{-1} \ll s_N \ll N^{-1/2} $, which is a regime of "moderately weak selection". It turns out that for $ s_N \ll N^{-2/3} $ the Cannings ancestral selection graph is so close to the ancestral selection graph of a Moran model that a suitable coupling argument allows to play the problem back asymptotically to the fixation probability in the Moran model, which can be computed explicitly.
△ Less
Submitted 17 December, 2020; v1 submitted 23 July, 2019;
originally announced July 2019.
-
The effective strength of selection in random environment
Authors:
Adrián González Casanova,
Dario Spanò,
Maite Wilke-Berenguer
Abstract:
We analyse a family of two-types Wright-Fisher models with selection in a random environment and skewed offspring distribution. We provide a calculable criterion to quantify the impact of different shapes of selection on the fate of the weakest allele, and thus compare them. The main mathematical tool is duality, which we prove to hold, also in presence of random environment (quenched and in some…
▽ More
We analyse a family of two-types Wright-Fisher models with selection in a random environment and skewed offspring distribution. We provide a calculable criterion to quantify the impact of different shapes of selection on the fate of the weakest allele, and thus compare them. The main mathematical tool is duality, which we prove to hold, also in presence of random environment (quenched and in some cases annealed), between the population's allele frequencies and genealogy, both in the case of finite population size and in the scaling limit for large size. Duality also yields new insight on properties of branching-coalescing processes in random environment, such as their long term behaviour.
△ Less
Submitted 8 February, 2023; v1 submitted 28 March, 2019;
originally announced March 2019.
-
Separation of time-scales for the seed bank diffusion and its jump-diffusion limit
Authors:
Jochen Blath,
Eugenio Buzzoni,
Adrián González Casanova,
Maite Wilke-Berenguer
Abstract:
We investigate the scaling limit of the seed bank diffusion when reproduction and migration (to and from the seed bank) happen on different time-scales. More precisely, we consider the case when migration is `slow' and reproduction is `standard' (in the original time-scale) and then switch to a new, accelerated time-scale, where migration is `standard' and reproduction is `fast'. This is motivated…
▽ More
We investigate the scaling limit of the seed bank diffusion when reproduction and migration (to and from the seed bank) happen on different time-scales. More precisely, we consider the case when migration is `slow' and reproduction is `standard' (in the original time-scale) and then switch to a new, accelerated time-scale, where migration is `standard' and reproduction is `fast'. This is motivated by models for bacterial dormancy, where periods of quiescence can be orders of magnitude larger than reproductive times, and where it is expected to find non-trivial degenerate genealogies on the evolutionary time-scale.
However, the above scaling regime is not only interesting from a biological perspective, but also from a mathematical point of view, since it provides a prototypical example where the expected scaling limit of a continuous diffusion should (and will be) a jump-diffusion. For this situation, standard convergence results often seem to fail in multiple ways. For example, since the set of continuous paths from a closed subset of the càdlàg paths in each of the classical Skorohod topologies $J_1, J_2, M_1$ and $M_2$, none of them can be employed for tightness on path-space. Further, a naïve direct rescaling of the Markov generator corresponding to the continuous diffusion immediately leads to a blow-up of the diffusion coefficient. Still, one can identify a well-defined limit via duality in a surprisingly non-technical way. Indeed, we show that a certain duality relation is in some sense stable under passage to the limit and allows an identification of the limit, avoiding all technicalities related to the blow-up in the classical generator. The result then boils down to a convergence criterion for time-continuous Markov chains in a separation of time-scales regime, which is of independent interest.
△ Less
Submitted 28 March, 2019;
originally announced March 2019.
-
Multidimensional $Λ$-Wright-Fisher processes with general frequency-dependent selection
Authors:
Adrian Gonzalez Casanova,
Charline Smadi
Abstract:
We construct a constant size population model allowing for general selective interactions and extreme reproductive events. It generalizes the idea of (Krone and Neuhauser 1997) who represented the selection by allowing individuals to sample potential parents in the previous generation before choosing the 'strongest' one, by allowing individuals to use any rule to choose their real parent. Via a la…
▽ More
We construct a constant size population model allowing for general selective interactions and extreme reproductive events. It generalizes the idea of (Krone and Neuhauser 1997) who represented the selection by allowing individuals to sample potential parents in the previous generation before choosing the 'strongest' one, by allowing individuals to use any rule to choose their real parent. Via a large population limit, we obtain a generalisation of $Λ$-Fleming Viot processes allowing for non transitive interactions between types. We provide fixation properties, and give conditions for these processes to be realised as solutions of stochastic differential equations.
△ Less
Submitted 16 April, 2020; v1 submitted 15 March, 2019;
originally announced March 2019.
-
The Symmetric Coalescent and Wright-Fisher models with bottlenecks
Authors:
Adrián González Casanova,
Verónica Miró Pina,
Arno Siri-Jégousse
Abstract:
We define a new class of $Ξ$-coalescents characterized by a possibly infinite measure over the non negative integers. We call them symmetric coalescents since they are the unique family of exchangeable coalescents satisfying a symmetry property on their coagulation rates: they are invariant under any transformation that consists in moving one element from one block to another without changing the…
▽ More
We define a new class of $Ξ$-coalescents characterized by a possibly infinite measure over the non negative integers. We call them symmetric coalescents since they are the unique family of exchangeable coalescents satisfying a symmetry property on their coagulation rates: they are invariant under any transformation that consists in moving one element from one block to another without changing the total number of blocks. We illustrate the diversity of behaviors of this family of processes by introducing and studying a one parameter subclass, the $(β,S)$-coalescents. We also embed this family in a larger class of $Ξ$-coalescents arising as the limit genealogies of Wright-Fisher models with bottlenecks. Some convergence results rely on a new Skorokhod type metric, that induces the Meyer-Zheng topology, which allows to study the scaling limit of non-markovian processes using standard techniques.
△ Less
Submitted 1 March, 2022; v1 submitted 13 March, 2019;
originally announced March 2019.
-
The Wright-Fisher model with efficiency
Authors:
Adrian Gonzalez Casanova,
Veronica Miro Pina,
Juan Carlos Pardo
Abstract:
In populations competing for resources, it is natural to ask whether consuming fewer resources provides any selective advantage. To answer this question, we propose a Wright- Fisher model with two types of individuals: the inefficient individuals, those who need more resources to reproduce and can have a higher growth rate, and the efficient individuals. In this model, the total amount of resource…
▽ More
In populations competing for resources, it is natural to ask whether consuming fewer resources provides any selective advantage. To answer this question, we propose a Wright- Fisher model with two types of individuals: the inefficient individuals, those who need more resources to reproduce and can have a higher growth rate, and the efficient individuals. In this model, the total amount of resource N, is fixed, and the population size varies randomly depending on the number of efficient individuals. We show that, as N increases, the frequency process of efficient individuals converges to a diffusion which is a generalisation of the Wright- Fisher diffusion with selection. The genealogy of this model is given by a branching-coalescing process that we call the Ancestral Selection/Efficiency Graph, and that is an extension of the Ancestral Selection Graph (Krone and Neuhauser (1997a), Krone and Neuhauser (1997b)). The main contribution of this paper is that, in evolving populations, inefficiency can arise as a promoter of selective advantage and not necessarily as a trade-off.
△ Less
Submitted 9 September, 2020; v1 submitted 19 February, 2019;
originally announced February 2019.
-
The seed bank coalescent with simultaneous switching
Authors:
Jochen Blath,
Adrián González Casanova,
Noemi Kurt,
Maite Wilke-Berenguer
Abstract:
We introduce a new Wright-Fisher type model for seed banks incorporating "simultaneous switching", which is motivated by recent work on microbial dormancy. We show that the simultaneous switching mechanism leads to a new jump-diffusion limit for the scaled frequency processes, extending the classical Wright-Fisher and seed bank diffusion limits. We further establish a new dual coalescent structure…
▽ More
We introduce a new Wright-Fisher type model for seed banks incorporating "simultaneous switching", which is motivated by recent work on microbial dormancy. We show that the simultaneous switching mechanism leads to a new jump-diffusion limit for the scaled frequency processes, extending the classical Wright-Fisher and seed bank diffusion limits. We further establish a new dual coalescent structure with multiple activation and deactivation events of lineages. While this seems reminiscent of multiple merger events in general exchangeable coalescents, it actually leads to an entirely new class of coalescent processes with unique qualitative and quantitative behaviour. To illustrate this, we provide a novel kind of condition for coming down from infinity for these coalescents using recent results of Griffiths.
△ Less
Submitted 21 December, 2018; v1 submitted 10 December, 2018;
originally announced December 2018.
-
Computing Entity Semantic Similarity by Features Ranking
Authors:
Livia Ruback,
Claudio Lucchese,
Alexander Arturo Mera Caraballo,
Grettel Monteagudo García,
Marco Antonio Casanova,
Chiara Renso
Abstract:
This article presents a novel approach to estimate semantic entity similarity using entity features available as Linked Data. The key idea is to exploit ranked lists of features, extracted from Linked Data sources, as a representation of the entities to be compared. The similarity between two entities is then estimated by comparing their ranked lists of features. The article describes experiments…
▽ More
This article presents a novel approach to estimate semantic entity similarity using entity features available as Linked Data. The key idea is to exploit ranked lists of features, extracted from Linked Data sources, as a representation of the entities to be compared. The similarity between two entities is then estimated by comparing their ranked lists of features. The article describes experiments with museum data from DBpedia, with datasets from a LOD catalog, and with computer science conferences from the DBLP repository. The experiments demonstrate that entity similarity, computed using ranked lists of features, achieves better accuracy than state-of-the-art measures.
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
Ranking RDF Instances in Degree-decoupled RDF Graphs
Authors:
Elisa S. Menendez,
Marco A. Casanova,
Mohand Boughanem,
Luiz André P. Paes Leme
Abstract:
In the last decade, RDF emerged as a new kind of standardized data model, and a sizable body of knowledge from fields such as Information Retrieval was adapted to RDF graphs. One common task in graph databases is to define an importance score for nodes based on centrality measures, such as PageRank and HITS. The majority of the strategies highly depend on the degree of the node. However, in some R…
▽ More
In the last decade, RDF emerged as a new kind of standardized data model, and a sizable body of knowledge from fields such as Information Retrieval was adapted to RDF graphs. One common task in graph databases is to define an importance score for nodes based on centrality measures, such as PageRank and HITS. The majority of the strategies highly depend on the degree of the node. However, in some RDF graphs, called degree-decoupled RDF graphs, the notion of importance is not directly related to the node degree. Therefore, this work first proposes three novel node importance measures, named InfoRank I, II and III, for degree-decoupled RDF graphs. It then compares the proposed measures with traditional PageRank and other familiar centrality measures, using with an IMDb dataset.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
An Algebra of Lightweight Ontologies
Authors:
Marco A. Casanova,
Rômulo Magalhães
Abstract:
This paper argues that certain ontology design problems are profitably addressed by treating ontologies as theories and by defining a set of operations that create new ontologies, including their constraints, out of other ontologies. The paper first shows how to use the operations in the context of ontology reuse, how to take advantage of the operations to compare different ontologies, or differen…
▽ More
This paper argues that certain ontology design problems are profitably addressed by treating ontologies as theories and by defining a set of operations that create new ontologies, including their constraints, out of other ontologies. The paper first shows how to use the operations in the context of ontology reuse, how to take advantage of the operations to compare different ontologies, or different versions of an ontology, and how the operations may help design mediated schemas in a bottom up fashion. The core of the paper discusses how to compute the operations for lightweight ontologies and addresses the question of minimizing the set of constraints of a lightweight ontology. Finally, the paper describes an implementation of the operations, as a Protégé plug-in.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
High-power all-fiber ultra-low noise laser
Authors:
Jian Zhao,
Germain Guiraud,
Christophe Pierre,
Florian Floissat,
Alexis Casanova,
Ali Hreibi,
Walid Chaibi,
Nicholas Traynor,
Johan Boullet,
Giorgio Santarelli
Abstract:
High-power ultra-low noise single-mode single-frequency lasers are in great demand for interferometric metrology. Robust, compact all-fiber lasers represent one of the most promising technologies to replace the current laser sources in use based on injection-locked ring resonators or multi-stage solid-state amplifiers. Here, a linearly-polarized high-power ultra-low noise all-fiber laser is demons…
▽ More
High-power ultra-low noise single-mode single-frequency lasers are in great demand for interferometric metrology. Robust, compact all-fiber lasers represent one of the most promising technologies to replace the current laser sources in use based on injection-locked ring resonators or multi-stage solid-state amplifiers. Here, a linearly-polarized high-power ultra-low noise all-fiber laser is demonstrated at a power level of 100 W. Special care has been taken in the study of relative intensity noise (RIN) and its reduction. Using an optimized servo actuator to directly control the driving current of the pump laser diode (LD), we obtain a large feedback bandwidth of up to 1.3 MHz. The RIN reaches-160 dBc/Hz between 3 kHz and 20 kHz.
△ Less
Submitted 23 May, 2018;
originally announced May 2018.
-
On the iterative refinement of densely connected representation levels for semantic segmentation
Authors:
Arantxa Casanova,
Guillem Cucurull,
Michal Drozdzal,
Adriana Romero,
Yoshua Bengio
Abstract:
State-of-the-art semantic segmentation approaches increase the receptive field of their models by using either a downsampling path composed of poolings/strided convolutions or successive dilated convolutions. However, it is not clear which operation leads to best results. In this paper, we systematically study the differences introduced by distinct receptive field enlargement methods and their imp…
▽ More
State-of-the-art semantic segmentation approaches increase the receptive field of their models by using either a downsampling path composed of poolings/strided convolutions or successive dilated convolutions. However, it is not clear which operation leads to best results. In this paper, we systematically study the differences introduced by distinct receptive field enlargement methods and their impact on the performance of a novel architecture, called Fully Convolutional DenseResNet (FC-DRN). FC-DRN has a densely connected backbone composed of residual networks. Following standard image segmentation architectures, receptive field enlargement operations that change the representation level are interleaved among residual networks. This allows the model to exploit the benefits of both residual and dense connectivity patterns, namely: gradient flow, iterative refinement of representations, multi-scale feature combination and deep supervision. In order to highlight the potential of our model, we test it on the challenging CamVid urban scene understanding benchmark and make the following observations: 1) downsampling operations outperform dilations when the model is trained from scratch, 2) dilations are useful during the finetuning step of the model, 3) coarser representations require less refinement steps, and 4) ResNets (by model construction) are good regularizers, since they can reduce the model capacity when needed. Finally, we compare our architecture to alternative methods and report state-of-the-art result on the Camvid dataset, with at least twice fewer parameters.
△ Less
Submitted 30 April, 2018;
originally announced April 2018.
-
Modelling and simulating Lenski's long-term evolution experiment
Authors:
Ellen Baake,
Adrián González Casanova,
Sebastian Probst,
Anton Wakolbinger
Abstract:
We revisit the model by Wiser, Ribeck, and Lenski (Science \textbf{342} (2013), 1364--1367), which describes how the mean fitness increases over time due to beneficial mutations in Lenski's long-term evolution experiment. We develop the model further both conceptually and mathematically. Conceptually, we describe the experiment with the help of a Cannings model with mutation and selection, where t…
▽ More
We revisit the model by Wiser, Ribeck, and Lenski (Science \textbf{342} (2013), 1364--1367), which describes how the mean fitness increases over time due to beneficial mutations in Lenski's long-term evolution experiment. We develop the model further both conceptually and mathematically. Conceptually, we describe the experiment with the help of a Cannings model with mutation and selection, where the latter includes diminishing returns epistasis. The analysis sheds light on the growth dynamics within every single day and reveals a runtime effect, that is, the shortening of the daily growth period with increasing fitness; and it allows to clarify the contribution of epistasis to the mean fitness curve. Mathematically, we explain rigorous results in terms of a law of large numbers (in the limit of infinite population size and for a certain asymptotic parameter regime), and present approximations based on heuristics and supported by simulations for finite populations.
△ Less
Submitted 5 April, 2019; v1 submitted 27 March, 2018;
originally announced March 2018.