-
Posterior Sampling with Denoising Oracles via Tilted Transport
Authors:
Joan Bruna,
Jiequn Han
Abstract:
Score-based diffusion models have significantly advanced high-dimensional data generation across various domains, by learning a denoising oracle (or score) from datasets. From a Bayesian perspective, they offer a realistic modeling of data priors and facilitate solving inverse problems through posterior sampling. Although many heuristic methods have been developed recently for this purpose, they lack the quantitative guarantees needed in many scientific applications.
In this work, we introduce the \textit{tilted transport} technique, which leverages the quadratic structure of the log-likelihood in linear inverse problems in combination with the prior denoising oracle to transform the original posterior sampling problem into a new `boosted' posterior that is provably easier to sample from. We quantify the conditions under which this boosted posterior is strongly log-concave, highlighting the dependencies on the condition number of the measurement matrix and the signal-to-noise ratio. The resulting posterior sampling scheme is shown to reach the computational threshold predicted for sampling Ising models [Kunisky'23] with a direct analysis, and is further validated on high-dimensional Gaussian mixture models and scalar field $\varphi^4$ models.
Submitted 30 June, 2024;
originally announced July 2024.
-
UruBots UAV -- Air Emergency Service Indoor Team Description Paper for FIRA 2024
Authors:
Hiago Sodre,
Sebastian Barcelona,
Anthony Scirgalea,
Brandon Macedo,
Gabriel Sampson,
Pablo Moraes,
William Moraes,
Victoria Saravia,
Juan Deniz,
Bruna Guterres,
Andre Kelbouscas,
Ricardo Grando
Abstract:
This document describes the UruBots team for the 2024 FIRA Air League, "Air Emergency Service (Indoor)." We introduce our team and an autonomous Unmanned Aerial Vehicle (UAV) that relies on computer vision for its flight control. This UAV can perform a wide variety of navigation tasks in indoor environments without the intervention of an external operator or any form of external processing, which significantly reduces workload and manual dependence. Additionally, our software has been designed to be compatible with the vehicle's structure and to be applied to the competition circuit. In this paper, we detail additional aspects of the mechanical structure, software, and application to the FIRA competition.
Submitted 12 June, 2024;
originally announced June 2024.
-
UruBots Autonomous Cars Team One Description Paper for FIRA 2024
Authors:
Pablo Moraes,
Christopher Peters,
Any Da Rosa,
Vinicio Melgar,
Franco Nuñez,
Maximo Retamar,
William Moraes,
Victoria Saravia,
Hiago Sodre,
Sebastian Barcelona,
Anthony Scirgalea,
Juan Deniz,
Bruna Guterres,
André Kelbouscas,
Ricardo Grando
Abstract:
This document presents the design of an autonomous car developed by the UruBots team for the 2024 FIRA Autonomous Cars Race Challenge. The project involves creating an RC-car-sized electric vehicle capable of navigating race tracks autonomously. It integrates mechanical and electronic systems alongside artificial-intelligence-based algorithms for navigation and real-time decision-making. The core of our project is an AI-based algorithm that learns from camera images and acts on the robot to perform the navigation. We show that, with a dataset of more than five thousand samples and a five-layer CNN, we achieve promising performance with our proposed hardware setup. Overall, this paper aims to demonstrate the autonomous capabilities of our car, highlighting its readiness for the 2024 FIRA challenge and helping to contribute to the field of autonomous vehicle research.
Submitted 12 June, 2024;
originally announced June 2024.
-
UruBots Autonomous Car Team Two: Team Description Paper for FIRA 2024
Authors:
William Moraes,
Juan Deniz,
Pablo Moraes,
Christopher Peters,
Vincent Sandin,
Gabriel da Silva,
Franco Nunez,
Maximo Retamar,
Victoria Saravia,
Hiago Sodre,
Sebastian Barcelona,
Anthony Scirgalea,
Bruna Guterres,
Andre Kelbouscas,
Ricardo Grando
Abstract:
This paper proposes a mini autonomous car to be used by the UruBots team for the 2024 FIRA Autonomous Cars Race Challenge. The vehicle design focuses on a low-cost, lightweight setup. Powered by a Raspberry Pi 4 and with a total weight of 1.15 kilograms, our vehicle raced a track of approximately 13 meters in 11 seconds in its best evaluation, an average speed of about 1.2 m/s. That performance was achieved after training a convolutional neural network with 1500 samples for 60 epochs. Overall, we believe that our vehicle is suited to perform at the FIRA Autonomous Cars Race Challenge 2024, helping the development of the field of study and the category in the competition.
Submitted 12 June, 2024;
originally announced June 2024.
-
How Truncating Weights Improves Reasoning in Language Models
Authors:
Lei Chen,
Joan Bruna,
Alberto Bietti
Abstract:
In addition to the ability to generate fluent text in various languages, large language models have been successful at tasks that involve basic forms of logical "reasoning" over their context. Recent work found that selectively removing certain components from weight matrices in pre-trained models can improve such reasoning capabilities. We investigate this phenomenon further by carefully studying how certain global associations tend to be stored in specific weight components or Transformer blocks, in particular feed-forward layers. Such associations may hurt predictions in reasoning tasks, and removing the corresponding components may then improve performance. We analyze how this arises during training, both empirically and theoretically, on a two-layer Transformer trained on a basic reasoning task with noise, a toy associative memory model, and on the Pythia family of pre-trained models tested on simple reasoning tasks.
Submitted 5 June, 2024;
originally announced June 2024.
-
Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning
Authors:
Ricardo B. Grando,
Raul Steinmetz,
Victor A. Kich,
Alisson H. Kolling,
Pablo M. Furik,
Junior C. de Jesus,
Bruna V. Guterres,
Daniel T. Gamarra,
Rodrigo S. Guerra,
Paulo L. J. Drews-Jr
Abstract:
Deep Reinforcement Learning (DRL) has emerged as a promising approach to enhancing motion control and decision-making through a wide range of robotic applications. While prior research has demonstrated the efficacy of DRL algorithms in facilitating autonomous mapless navigation for aerial and terrestrial mobile robots, these methods often grapple with poor generalization when faced with unknown tasks and environments. This paper explores the impact of the Delayed Policy Updates (DPU) technique on fostering generalization to new situations, and bolstering the overall performance of agents. Our analysis of DPU in aerial and terrestrial mobile robots reveals that this technique significantly curtails the lack of generalization and accelerates the learning process for agents, enhancing their efficiency across diverse tasks and unknown scenarios.
Submitted 4 June, 2024;
originally announced June 2024.
-
Tunable Surface Plasmon-Polaritons Interaction in All-Metal Pyramidal Metasurfaces: Unveiling Principles and Significance for Biosensing Applications
Authors:
Talles E. M. Marques,
Yuri H. Isayama,
Felipe M. F. Teixeira,
Fabiano C. Santana,
Rafael S. Gonçalves,
Aline Rocha,
Bruna P. Dias,
Lidia M. Andrade,
Estefânia M. N. Martins,
Ronaldo A. P. Nagem,
Clascidia A. Furtado,
Miguel A. G. Balanta,
Jorge Ricardo Mejía-Salazar,
Paulo S. S. Guimarães,
Wagner N. Rodrigues,
Jhonattan C. Ramirez
Abstract:
The strong coupling of plasmonic resonance modes in conductive pyramidal nanoparticles leads to an increase in the density of free charges on the surface. By ensuring plasmonic coupling in the pyramidal nanoparticle lattice, the achieved field intensity is enhanced. At the same time, a strong coupling between resonant modes is guaranteed, which results in the formation of new hybrid modes. In this manuscript, we demonstrate a tunable double anticrossing interaction that results from the interaction between two Localized Surface Plasmon Resonance (LSPR) modes and a Surface Plasmon Polariton (SPP) wave. The tuning is achieved by varying the angle of incidence of the input electric field. From the double anticrossing, an increase in field intensity in a blue-shifted LSPR mode located in the red wavelength region is observed. This demonstrates that at certain angles of incidence, the field intensity obtained is strongly favored, which would be beneficial for applications such as Surface-Enhanced Raman Spectroscopy (SERS). Nanoparticle-based lattices have been widely used for biosensor applications. However, one of the major limitations of this type of device is the low tolerance to high concentrations of biomolecules, which significantly affects their performance. The studies carried out for this manuscript demonstrate that the implemented geometry allows for the observation of an LSPR mode, which is responsible for the control and synchronization of other perceived resonances. This mode remains almost invariant when subjected to structural variations or changes in the angle of incidence of the electric field. These characteristics eliminate the limitation mentioned above, allowing for sensitivities 10^3 times higher than those achieved in conventional systems based on LSPR used to detect P. brasiliensis antigen.
Submitted 20 May, 2024;
originally announced May 2024.
-
The largest EEG-based BCI reproducibility study for open science: the MOABB benchmark
Authors:
Sylvain Chevallier,
Igor Carrara,
Bruno Aristimunha,
Pierre Guetschel,
Sara Sedlar,
Bruna Lopes,
Sebastien Velut,
Salim Khazem,
Thomas Moreau
Abstract:
Objective. This study conducts an extensive brain-computer interface (BCI) reproducibility analysis on open electroencephalography datasets, aiming to assess existing solutions and establish open and reproducible benchmarks for effective comparison within the field. The need for such a benchmark lies in the rapid industrial progress that has given rise to undisclosed proprietary solutions. Furthermore, the scientific literature is dense, often featuring challenging-to-reproduce evaluations, making comparisons between existing approaches arduous.
Approach. Within an open framework, 30 machine learning pipelines (separated into raw signal: 11, Riemannian: 13, deep learning: 6) are meticulously re-implemented and evaluated across 36 publicly available datasets, including motor imagery (14), P300 (15), and SSVEP (7). The analysis incorporates statistical meta-analysis techniques for results assessment, encompassing execution time and environmental impact considerations.
Main results. The study yields principled and robust results applicable to various BCI paradigms, emphasizing motor imagery, P300, and SSVEP. Notably, Riemannian approaches utilizing spatial covariance matrices exhibit superior performance, underscoring the necessity for significant data volumes to achieve competitive outcomes with deep learning techniques. The comprehensive results are openly accessible, paving the way for future research to further enhance reproducibility in the BCI domain.
Significance. The significance of this study lies in its contribution to establishing a rigorous and transparent benchmark for BCI research, offering insights into optimal methodologies and highlighting the importance of reproducibility in driving advancements within the field.
Submitted 3 April, 2024;
originally announced April 2024.
-
On some relations between the Perimeter, the Area and the Visual Angle of a Convex Set
Authors:
Joaquim Bruna,
Julià Cufí,
Agustí Reventós
Abstract:
We establish some relations between the perimeter, the area and the visual angle of a planar compact convex set. Our first result states that Crofton's formula is the unique universal formula relating the visual angle, length and area. After that we give a characterization of convex sets of constant width by means of the behaviour of its isotopic sets at infinity. Also for this class of convex sets we prove that the existence of an isotopic circle is enough to ensure that the considered set is a disc.
Submitted 12 April, 2024;
originally announced April 2024.
-
Navigating Eukaryotic Genome Annotation Pipelines: A Route Map to BRAKER, Galba, and TSEBRA
Authors:
Tomáš Brůna,
Lars Gabriel,
Katharina J. Hoff
Abstract:
Annotating the structure of protein-coding genes represents a major challenge in the analysis of eukaryotic genomes. This task sets the groundwork for subsequent genomic studies aimed at understanding the functions of individual genes. BRAKER and Galba are two fully automated and containerized pipelines designed to perform accurate genome annotation. BRAKER integrates the GeneMark-ETP and AUGUSTUS gene finders, employing the TSEBRA combiner to attain high sensitivity and precision. BRAKER is adept at handling genomes of any size, provided that it has access to both transcript expression sequencing data and an extensive protein database from the target clade. In particular, BRAKER demonstrates high accuracy even with only one type of these extrinsic evidence sources, although it should be noted that accuracy diminishes for larger genomes under such conditions. In contrast, Galba adopts a distinct methodology utilizing the outcomes of direct protein-to-genome spliced alignments using miniprot to generate training genes and evidence for gene prediction in AUGUSTUS. Galba has superior accuracy in large genomes if protein sequences are the only source of evidence. This chapter provides practical guidelines for employing both pipelines in the annotation of eukaryotic genomes, with a focus on insect genomes.
Submitted 28 March, 2024;
originally announced March 2024.
-
Ontologia para monitorar a deficiência mental em seus déficts no processamento da informação por declínio cognitivo e evitar agressões psicológicas e físicas em ambientes educacionais com ajuda da I.A*
Authors:
Bruna Araújo de Castro Oliveira
Abstract:
The intention of this article is to propose the use of artificial intelligence to detect through analysis by UFO ontology the emergence of verbal and physical aggression related to psychosocial deficiencies and their provoking agents, in an attempt to prevent catastrophic consequences within school environments.
Submitted 31 January, 2024;
originally announced March 2024.
-
Computational-Statistical Gaps in Gaussian Single-Index Models
Authors:
Alex Damian,
Loucas Pillaud-Vivien,
Jason D. Lee,
Joan Bruna
Abstract:
Single-Index Models are high-dimensional regression problems with planted structure, whereby labels depend on an unknown one-dimensional projection of the input via a generic, non-linear, and potentially non-deterministic transformation. As such, they encompass a broad class of statistical inference tasks, and provide a rich template to study statistical and computational trade-offs in the high-dimensional regime.
While the information-theoretic sample complexity to recover the hidden direction is linear in the dimension $d$, we show that computationally efficient algorithms, both within the Statistical Query (SQ) and the Low-Degree Polynomial (LDP) framework, necessarily require $\Omega(d^{k^\star/2})$ samples, where $k^\star$ is a "generative" exponent associated with the model that we explicitly characterize. Moreover, we show that this sample complexity is also sufficient, by establishing matching upper bounds using a partial-trace algorithm. Therefore, our results provide evidence of a sharp computational-to-statistical gap (under both the SQ and LDP class) whenever $k^\star>2$. To complete the study, we provide examples of smooth and Lipschitz deterministic target functions with arbitrarily large generative exponents $k^\star$.
Submitted 12 March, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems
Authors:
Ivan Sekulić,
Silvia Terragni,
Victor Guimarães,
Nghia Khau,
Bruna Guedes,
Modestas Filipavicius,
André Ferreira Manso,
Roland Mathis
Abstract:
In the realm of dialogue systems, user simulation techniques have emerged as a game-changer, redefining the evaluation and enhancement of task-oriented dialogue (TOD) systems. These methods are crucial for replicating real user interactions, enabling applications like synthetic data augmentation, error detection, and robust evaluation. However, existing approaches often rely on rigid rule-based methods or on annotated data. This paper introduces DAUS, a Domain-Aware User Simulator. Leveraging large language models, we fine-tune DAUS on real examples of task-oriented dialogues. Results on two relevant benchmarks showcase significant improvements in terms of user goal fulfillment. Notably, we have observed that fine-tuning enhances the simulator's coherence with user goals, effectively mitigating hallucinations -- a major source of inconsistencies in simulator responses.
Submitted 20 February, 2024;
originally announced February 2024.
-
Lane formation and aggregation spots in a model of ants
Authors:
Maria Bruna,
Martin Burger,
Oscar de Wit
Abstract:
We investigate an interacting particle model to simulate a foraging colony of ants, where each ant is represented as an active Brownian particle. The interactions among ants are mediated through chemotaxis, aligning their orientations with the upward gradient of the pheromone field. Unlike conventional models, our study introduces a parameter that enables the reproduction of two distinctive behaviors: the well-known Keller--Segel collapse and the formation of traveling clusters, without relying on external constraints such as food sources or nests. We consider the associated mean-field limit partial differential equation (PDE) of this system and establish the analytical and numerical foundations for understanding these particle behaviors. Remarkably, the mean-field PDE not only supports Keller--Segel collapse and lane formation but also unveils a bistable region where these two behaviors compete. The patterns associated with these phenomena are elucidated by the shape of the growing eigenfunctions derived from linear stability analysis. This study not only contributes to our understanding of complex ant colony dynamics but also introduces a novel parameter-dependent perspective on pattern formation in collective systems.
Submitted 26 January, 2024;
originally announced January 2024.
-
A Systematic Evaluation of Euclidean Alignment with Deep Learning for EEG Decoding
Authors:
Bruna Junqueira,
Bruno Aristimunha,
Sylvain Chevallier,
Raphael Y. de Camargo
Abstract:
Electroencephalography (EEG) signals are frequently used for various Brain-Computer Interface (BCI) tasks. While Deep Learning (DL) techniques have shown promising results, they are hindered by the substantial data requirements. By leveraging data from multiple subjects, transfer learning enables more effective training of DL models. A technique that is gaining popularity is Euclidean Alignment (EA) due to its ease of use, low computational complexity, and compatibility with Deep Learning models. However, few studies evaluate its impact on the training performance of shared and individual DL models. In this work, we systematically evaluate the effect of EA combined with DL for decoding BCI signals. We used EA to train shared models with data from multiple subjects and evaluated its transferability to new subjects. Our experimental results show that it improves decoding in the target subject by 4.33% and decreases convergence time by more than 70%. We also trained individual models for each subject to use as a majority-voting ensemble classifier. In this scenario, using EA improved the 3-model ensemble accuracy by 3.7%. However, when compared to the shared model with EA, the ensemble accuracy was 3.62% lower.
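The majority-voting ensemble mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `majority_vote` and `ensemble_predict`, the label strings, and the tie-breaking rule (the label seen first among tied labels wins) are assumptions made only for illustration.

```python
from collections import Counter


def majority_vote(votes):
    """Return the most frequent label among one trial's votes.

    Assumption: ties go to the label that appears first in `votes`,
    which is how Counter.most_common orders equal counts.
    """
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_model_predictions):
    """Combine predictions from several models, trial by trial.

    `per_model_predictions` holds one list of labels per model,
    all of the same length and aligned by trial index.
    """
    return [majority_vote(list(votes)) for votes in zip(*per_model_predictions)]


# Hypothetical predictions from three individual models on three trials.
model_a = ["left", "right", "left"]
model_b = ["left", "left", "right"]
model_c = ["right", "right", "right"]

print(ensemble_predict([model_a, model_b, model_c]))
```

With an odd number of models and two classes a tie cannot occur, which may be one reason a 3-model ensemble is a convenient choice.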
Submitted 22 May, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Concept Alignment
Authors:
Sunayana Rane,
Polyphony J. Bruna,
Ilia Sucholutsky,
Christopher Kello,
Thomas L. Griffiths
Abstract:
Discussion of AI alignment (alignment between humans and AI systems) has focused on value alignment, broadly referring to creating AI systems that share human values. We argue that before we can even attempt to align values, it is imperative that AI systems and humans align the concepts they use to understand the world. We integrate ideas from philosophy, cognitive science, and deep learning to explain the need for concept alignment, not just value alignment, between humans and machines. We summarize existing accounts of how humans and machines currently learn concepts, and we outline opportunities and challenges in the path towards shared concepts. Finally, we explain how we can leverage the tools already being developed in cognitive science and AI research to accelerate progress towards concept alignment.
Submitted 9 January, 2024;
originally announced January 2024.
-
Double points and image of reflection maps
Authors:
Jose Rafael Borges Zampiva,
Guillermo Penafort-Sanchis,
Bruna Orefice-Okamoto,
Joao Nivaldo Tomazella
Abstract:
A reflection mapping is a singular holomorphic mapping obtained by restricting the quotient mapping of a complex reflection group. We study the analytic structure of double point spaces of reflection mappings. In the case where the image is a hypersurface, we obtain explicit equations for the double point space and for the image as well. In the case of surfaces in $\mathbb{C}^3$, this gives a very efficient method to compute the Milnor number and delta invariant of the double point curve.
Submitted 7 July, 2024; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Stochastic Optimal Control Matching
Authors:
Carles Domingo-Enrich,
Jiequn Han,
Brandon Amos,
Joan Bruna,
Ricky T. Q. Chen
Abstract:
Stochastic optimal control, which has the goal of driving the behavior of noisy systems, is broadly applicable in science, engineering and artificial intelligence. Our work introduces Stochastic Optimal Control Matching (SOCM), a novel Iterative Diffusion Optimization (IDO) technique for stochastic optimal control that stems from the same philosophy as the conditional score matching loss for diffusion models. That is, the control is learned via a least squares problem by trying to fit a matching vector field. The training loss, which is closely connected to the cross-entropy loss, is optimized with respect to both the control function and a family of reparameterization matrices which appear in the matching vector field. The optimization with respect to the reparameterization matrices aims at minimizing the variance of the matching vector field. Experimentally, our algorithm achieves lower error than all the existing IDO techniques for stochastic optimal control for three out of four control problems, in some cases by an order of magnitude. The key idea underlying SOCM is the path-wise reparameterization trick, a novel technique that may be of independent interest. Code at https://github.com/facebookresearch/SOC-matching
Submitted 28 June, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
On Learning Gaussian Multi-index Models with Gradient Flow
Authors:
Alberto Bietti,
Joan Bruna,
Loucas Pillaud-Vivien
Abstract:
We study gradient flow on the multi-index regression problem for high-dimensional Gaussian data. Multi-index functions consist of a composition of an unknown low-rank linear projection and an arbitrary unknown, low-dimensional link function. As such, they constitute a natural template for feature learning in neural networks.
We consider a two-timescale algorithm, whereby the low-dimensional link function is learnt with a non-parametric model infinitely faster than the subspace parametrizing the low-rank projection. By appropriately exploiting the matrix semigroup structure arising over the subspace correlation matrices, we establish global convergence of the resulting Grassmannian population gradient flow dynamics, and provide a quantitative description of its associated `saddle-to-saddle' dynamics. Notably, the timescales associated with each saddle can be explicitly characterized in terms of an appropriate Hermite decomposition of the target link function. In contrast with these positive results, we also show that the related \emph{planted} problem, where the link function is known and fixed, in fact has a rough optimization landscape, in which gradient flow dynamics might get trapped with high probability.
Submitted 2 November, 2023; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Aplicacion de Robots Humanoides como Guias Interactivos en Museos: Una Simulacion con el Robot NAO
Authors:
Hiago Sodre,
Pablo Moraes,
Monica Rodriguez,
Victor Castelli,
Pamela Barboza,
Martin Mattos,
Guillermo Vivas,
Bruna de Vargas,
Tobias Dörnbach,
Ricardo Grando
Abstract:
This article presents an application that evaluates the feasibility of humanoid robots as interactive guides in art museums. The application entails programming a NAO robot and a chatbot to provide information about art pieces in a simulated museum environment. In this controlled scenario, the learning employees interact with the robot and the chatbot. The result is a skilled participation in the interactions, along with the effectiveness of the robot and chatbot that communicates the basic details of the art objects. You see natural and fluid interactions between the students and the robot. This suggests that the addition of humanoid robots to museums may provide a better experience for visitors, but also the need to continue to do more to optimize the quality of interaction. This study contributes to understanding the possibilities and requirements of applying humanoid technologies in a cultural context.
Submitted 25 October, 2023;
originally announced October 2023.
-
Predictive Maintenance Model Based on Anomaly Detection in Induction Motors: A Machine Learning Approach Using Real-Time IoT Data
Authors:
Sergio F. Chevtchenko,
Monalisa C. M. dos Santos,
Diego M. Vieira,
Ricardo L. Mota,
Elisson Rocha,
Bruna V. Cruz,
Danilo Araújo,
Ermeson Andrade
Abstract:
With the support of Internet of Things (IoT) devices, it is possible to acquire data from degradation phenomena and design data-driven models to perform anomaly detection in industrial equipment. This approach not only identifies potential anomalies but can also serve as a first step toward building predictive maintenance policies. In this work, we demonstrate a novel anomaly detection system on induction motors used in pumps, compressors, fans, and other industrial machines. This work evaluates a combination of pre-processing techniques and machine learning (ML) models with a low computational cost. We use a combination of pre-processing techniques such as Fast Fourier Transform (FFT), Wavelet Transform (WT), and binning, which are well-known approaches for extracting features from raw data. We also aim to guarantee an optimal balance between multiple conflicting parameters, such as anomaly detection rate, false positive rate, and inference speed of the solution. To this end, multiobjective optimization and analysis are performed on the evaluated models. Pareto-optimal solutions are presented to select which models have the best results regarding classification metrics and computational effort. Differently from most works in this field that use publicly available datasets to validate their models, we propose an end-to-end solution combining low-cost and readily available IoT sensors. The approach is validated by acquiring a custom dataset from induction motors. Also, we fuse vibration, temperature, and noise data from these sensors as the input to the proposed ML model. Therefore, we aim to propose a methodology general enough to be applied in different industrial contexts in the future.
Submitted 15 October, 2023;
originally announced October 2023.
-
Indefinite order in the interface of quantum mechanics and gravity
Authors:
Bruna Sahdo
Abstract:
Researchers have long been aiming to understand how the characteristics of Quantum Theory and General Relativity combine to account for regimes in their interface. One reason why this is a hard task is how differently the theories approach time and causality. For instance, causal structure in relativity is determined by the distribution of mass in spacetime while, in the quantum formalism, it is supposed to be fixed and given in advance. In this master's thesis, we discuss the notion of indefinite order, which first appears in an abstract generalization of Quantum Theory [...] where the demand for global causal structure is removed, in principle allowing cases for which the order of operations in protocols is not necessarily well defined. One epitomical example of indefinite order is the quantum switch process, which realizes a quantum superposition of orders of two operations on a target system. The quantum switch probabilities have been reproduced in experimental optical setups that are fully described in principle by quantum mechanics. Since these experiments are compatible with spacetime causal structure, this generated uncertainty about the conclusions that can be drawn from obtaining these results depending on the context. Here, we return to the initial motivations and also present how scenarios involving gravity in low energies could lead to indefinite order. This includes the formulation of a quantum switch in a quantum gravity scenario and of a quantum switch in a classical Schwarzschild metric. The switch then provides a common ground to discuss different kinds of setups. The latter proposal of a quantum switch in a classical metric is an original work that, aside from being an example of indefinite order, proposes the realization of the protocol in Earth's gravity as a test of quantum mechanics on curved spacetimes, a regime which has not yet been explored experimentally.
Submitted 2 October, 2023;
originally announced October 2023.
-
Symmetric Single Index Learning
Authors:
Aaron Zweig,
Joan Bruna
Abstract:
Few neural architectures lend themselves to provable learning with gradient based methods. One popular model is the single-index model, in which labels are produced by composing an unknown linear projection with a possibly unknown scalar link function. Learning this model with SGD is relatively well-understood, whereby the so-called information exponent of the link function governs a polynomial sample complexity rate. However, extending this analysis to deeper or more complicated architectures remains challenging.
In this work, we consider single index learning in the setting of symmetric neural networks. Under analytic assumptions on the activation and maximum degree assumptions on the link function, we prove that gradient flow recovers the hidden planted direction, represented as a finitely supported vector in the feature space of power sum polynomials. We characterize a notion of information exponent adapted to our setting that controls the efficiency of learning.
Submitted 3 October, 2023;
originally announced October 2023.
-
AI driven B-cell Immunotherapy Design
Authors:
Bruna Moreira da Silva,
David B. Ascher,
Nicholas Geard,
Douglas E. V. Pires
Abstract:
Antibodies, a prominent class of approved biologics, play a crucial role in detecting foreign antigens. The effectiveness of antigen neutralisation and elimination hinges upon the strength, sensitivity, and specificity of the paratope-epitope interaction, which demands resource-intensive experimental techniques for characterisation. In recent years, artificial intelligence and machine learning methods have made significant strides, revolutionising the prediction of protein structures and their complexes. The past decade has also witnessed the evolution of computational approaches aiming to support immunotherapy design. This review focuses on the progress of machine learning-based tools and their frameworks in the domain of B-cell immunotherapy design, encompassing linear and conformational epitope prediction, paratope prediction, and antibody design. We mapped the most commonly used data sources, evaluation metrics, and method availability and thoroughly assessed their significance and limitations, discussing the main challenges ahead.
Submitted 3 September, 2023;
originally announced September 2023.
-
On Single Index Models beyond Gaussian Data
Authors:
Joan Bruna,
Loucas Pillaud-Vivien,
Aaron Zweig
Abstract:
Sparse high-dimensional functions have arisen as a rich framework to study the behavior of gradient-descent methods using shallow neural networks, showcasing their ability to perform feature learning beyond linear models. Amongst those functions, the simplest are single-index models $f(x) = φ( x \cdot θ^*)$, where the labels are generated by an arbitrary non-linear scalar link function $φ$ applied to an unknown one-dimensional projection $θ^*$ of the input data. By focusing on Gaussian data, several recent works have built a remarkable picture, where the so-called information exponent (related to the regularity of the link function) controls the required sample complexity. In essence, these tools exploit the stability and spherical symmetry of Gaussian distributions. In this work, building from the framework of \cite{arous2020online}, we explore extensions of this picture beyond the Gaussian setting, where both stability or symmetry might be violated. Focusing on the planted setting where $φ$ is known, our main results establish that Stochastic Gradient Descent can efficiently recover the unknown direction $θ^*$ in the high-dimensional regime, under assumptions that extend previous works \cite{yehudai2020learning,wu2022learning}.
Submitted 25 October, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Broadband parametric amplification for multiplexed SiMOS quantum dot signals
Authors:
Victor Elhomsy,
Luca Planat,
David J. Niegemann,
Bruna Cardoso-Paz,
Ali Badreldin,
Bernhard Klemt,
Vivien Thiney,
Renan Lethiecq,
Eric Eyraud,
Matthieu C. Dartiailh,
Benoit Bertrand,
Heimanu Niebojewski,
Christopher Bäuerle,
Maud Vinet,
Tristan Meunier,
Nicolas Roch,
Matias Urdampilleta
Abstract:
Spins in semiconductor quantum dots hold great promise as building blocks of quantum processors. Trapping them in SiMOS transistor-like devices eases future industrial scale fabrication. Among the potentially scalable readout solutions, gate-based dispersive radiofrequency reflectometry only requires the already existing transistor gates to readout a quantum dot state, relieving the need for additional elements. In this effort towards scalability, traveling-wave superconducting parametric amplifiers significantly enhance the readout signal-to-noise ratio (SNR) by reducing the noise below typical cryogenic low-noise amplifiers, while offering a broad amplification band, essential to multiplex the readout of multiple resonators. In this work, we demonstrate a 3GHz gate-based reflectometry readout of electron charge states trapped in quantum dots formed in SiMOS multi-gate devices, with SNR enhanced thanks to a Josephson traveling-wave parametric amplifier (JTWPA). The broad, tunable 2GHz amplification bandwidth combined with more than 10dB ON/OFF SNR improvement of the JTWPA enables frequency and time division multiplexed readout of interdot transitions, and noise performance near the quantum limit. In addition, owing to a design without superconducting loops and with a metallic ground plane, the JTWPA is flux insensitive and shows stable performances up to a magnetic field of 1.2T at the quantum dot device, compatible with standard SiMOS spin qubit experiments.
Submitted 2 August, 2023; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Reliable coarse-grained turbulent simulations through combined offline learning and neural emulation
Authors:
Christian Pedersen,
Laure Zanna,
Joan Bruna,
Pavel Perezhogin
Abstract:
Integration of machine learning (ML) models of unresolved dynamics into numerical simulations of fluid dynamics has been demonstrated to improve the accuracy of coarse resolution simulations. However, when trained in a purely offline mode, integrating ML models into the numerical scheme can lead to instabilities. In the context of a 2D, quasi-geostrophic turbulent system, we demonstrate that including an additional network in the loss function, which emulates the state of the system into the future, produces offline-trained ML models that capture important subgrid processes, with improved stability properties.
Submitted 24 July, 2023;
originally announced July 2023.
-
Exact hydrodynamics and onset of phase separation for an active exclusion process
Authors:
James Mason,
Clement Erignoux,
Robert Jack,
Maria Bruna
Abstract:
We consider a lattice model of active matter with exclusion and derive its hydrodynamic description exactly. The hydrodynamic limit leads to an integro-differential equation for the density of particles with a given orientation. Volume exclusion results in nonlinear mobility dependent on spatial density. Such models of active matter can support motility-induced phase separation, which occurs despite the absence of attractive interactions. We study the onset of phase separation with linear stability analysis and numerical simulations.
Submitted 30 October, 2023; v1 submitted 21 July, 2023;
originally announced July 2023.
-
A Mapping Study of Machine Learning Methods for Remaining Useful Life Estimation of Lead-Acid Batteries
Authors:
Sérgio F Chevtchenko,
Elisson da Silva Rocha,
Bruna Cruz,
Ermeson Carneiro de Andrade,
Danilo Ricardo Barbosa de Araújo
Abstract:
Energy storage solutions play an increasingly important role in modern infrastructure and lead-acid batteries are among the most commonly used in the rechargeable category. Due to normal degradation over time, correctly determining the battery's State of Health (SoH) and Remaining Useful Life (RUL) contributes to enhancing predictive maintenance, reliability, and longevity of battery systems. Besides improving the cost savings, correct estimation of the SoH can lead to reduced pollution through reuse of retired batteries. This paper presents a mapping study of the state-of-the-art in machine learning methods for estimating the SoH and RUL of lead-acid batteries. These two indicators are critical in the battery management systems of electric vehicles, renewable energy systems, and other applications that rely heavily on this battery technology. In this study, we analyzed the types of machine learning algorithms employed for estimating SoH and RUL, and evaluated their performance in terms of accuracy and inference time. Additionally, this mapping identifies and analyzes the most commonly used combinations of sensors in specific applications, such as vehicular batteries. The mapping concludes by highlighting potential gaps and opportunities for future research, which lays the foundation for further advancements in the field.
Submitted 11 July, 2023;
originally announced July 2023.
-
A Neural Collapse Perspective on Feature Evolution in Graph Neural Networks
Authors:
Vignesh Kothapalli,
Tom Tirer,
Joan Bruna
Abstract:
Graph neural networks (GNNs) have become increasingly popular for classification tasks on graph-structured data. Yet, the interplay between graph topology and feature evolution in GNNs is not well understood. In this paper, we focus on node-wise classification, illustrated with community detection on stochastic block model graphs, and explore the feature evolution through the lens of the "Neural Collapse" (NC) phenomenon. When training instance-wise deep classifiers (e.g. for image classification) beyond the zero training error point, NC demonstrates a reduction in the deepest features' within-class variability and an increased alignment of their class means to certain symmetric structures. We start with an empirical study that shows that a decrease in within-class variability is also prevalent in the node-wise classification setting, however, not to the extent observed in the instance-wise case. Then, we theoretically study this distinction. Specifically, we show that even an "optimistic" mathematical model requires that the graphs obey a strict structural condition in order to possess a minimizer with exact collapse. Interestingly, this condition is viable also for heterophilic graphs and relates to recent empirical studies on settings with improved GNNs' generalization. Furthermore, by studying the gradient dynamics of the theoretical model, we provide reasoning for the partial collapse observed empirically. Finally, we present a study on the evolution of within- and between-class feature variability across layers of a well-trained GNN and contrast the behavior with spectral methods.
Submitted 26 October, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Gravitational quantum switch on a superposition of spherical shells
Authors:
Natália S. Móller,
Bruna Sahdo,
Nelson Yokomizo
Abstract:
In the absence of a complete theory of quantum gravity, phenomenological models built upon minimal assumptions have been explored for the analysis of possible quantum effects in gravitational systems. Implications of a superposition of geometries have been considered in such models, including the occurrence of processes with indefinite order. In a gravitational quantum switch, in particular, the order of operations applied by two agents on a target system is entangled with the state of the geometry. We consider a model describing the superposition of geometries produced by distinct arrangements of spherical mass shells, and show that a protocol for the implementation of a gravitational quantum switch can be formulated in such a system. The geometries in superposition are identical in an exterior region outside a given radius, and differ within such a radius. The exterior region provides a classical frame from which the superposition of geometries in the interior region can be probed. One of the agents crosses the interior region and becomes entangled with the geometry, which is explored as a resource for the implementation of the quantum switch. Novel features of the protocol include the superposition of nonisometric geometries, the existence of a region with a definite geometry, and the fact that the agent that experiences the superposition of geometries is in free fall, preventing information on the global geometry to be obtained from within its laboratory.
Submitted 5 February, 2024; v1 submitted 19 June, 2023;
originally announced June 2023.
-
Constraining the LyC escape fraction from LEGUS star clusters with SIGNALS HII region observations: A pilot study of NGC 628
Authors:
J. W. Teh,
K. Grasha,
M. R. Krumholz,
A. Battisti,
D. Calzetti,
L. Rousseau-Nepton,
C. Rhea,
A. Adamo,
R. C. Kennicutt,
E. K. Grebel,
D. O. Cook,
F. Combes,
M. Messa,
S. Linden,
R. S. Klessen,
J. M. Vilchez,
M. Fumagalli,
A. F. McLeod,
L. J. Smith,
L. Chemin,
J. Wang,
E. Sabbi,
E. Sacchi,
A. Petric,
L. Della Bruna
, et al. (1 additional authors not shown)
Abstract:
The ionising radiation of young and massive stars is a crucial form of stellar feedback. Most ionising (Lyman-continuum; LyC, $λ < 912$ Å) photons are absorbed close to the stars that produce them, forming compact HII regions, but some escape into the wider galaxy. Quantifying the fraction of LyC photons that escape is an open problem. In this work, we present a semi-novel method to estimate the escape fraction by combining broadband photometry of star clusters from the Legacy ExtraGalactic UV Survey (LEGUS) with HII regions observed by the Star formation, Ionized gas, and Nebular Abundances Legacy Survey (SIGNALS) in the nearby spiral galaxy NGC 628. We first assess the completeness of the combined catalogue, and find that 49\% of HII regions lack corresponding star clusters as a result of a difference in the sensitivities of the LEGUS and SIGNALS surveys. For HII regions that do have matching clusters, we infer the escape fraction from the difference between the ionising power required to produce the observed HII luminosity and the predicted ionising photon output of their host star clusters; the latter is computed using a combination of LEGUS photometric observations and a stochastic stellar population synthesis code SLUG (Stochastically Lighting Up Galaxies). Overall, we find an escape fraction of $f_{esc} = 0.09^{+0.06}_{-0.06}$ across our sample of 42 HII regions; in particular, we find HII regions with high $f_{esc}$ are predominantly regions with low H$α$-luminosity. We also report possible correlation between $f_{esc}$ and the emission lines [O ii]/[N ii] and [O ii]/H$β$.
Submitted 8 June, 2023;
originally announced June 2023.
-
In-Context Learning User Simulators for Task-Oriented Dialog Systems
Authors:
Silvia Terragni,
Modestas Filipavicius,
Nghia Khau,
Bruna Guedes,
André Manso,
Roland Mathis
Abstract:
This paper presents a novel application of large language models in user simulation for task-oriented dialog systems, specifically focusing on an in-context learning approach. By harnessing the power of these models, the proposed approach generates diverse utterances based on user goals and limited dialog examples. Unlike traditional simulators, this method eliminates the need for labor-intensive rule definition or extensive annotated data, making it more efficient and accessible. Additionally, an error analysis of the interaction between the user simulator and dialog system uncovers common mistakes, providing valuable insights into areas that require improvement. Our implementation is available at https://github.com/telepathylabsai/prompt-based-user-simulator.
Submitted 1 June, 2023;
originally announced June 2023.
-
Conditionally Strongly Log-Concave Generative Models
Authors:
Florentin Guth,
Etienne Lempereur,
Joan Bruna,
Stéphane Mallat
Abstract:
There is a growing gap between the impressive results of deep image generative models and classical algorithms that offer theoretical guarantees. The former suffer from mode collapse or memorization issues, limiting their application to scientific data. The latter require restrictive assumptions such as log-concavity to escape the curse of dimensionality. We partially bridge this gap by introducing conditionally strongly log-concave (CSLC) models, which factorize the data distribution into a product of conditional probability distributions that are strongly log-concave. This factorization is obtained with orthogonal projectors adapted to the data distribution. It leads to efficient parameter estimation and sampling algorithms, with theoretical guarantees, although the data distribution is not globally log-concave. We show that several challenging multiscale processes are conditionally log-concave using wavelet packet orthogonal projectors. Numerical results are shown for physical fields such as the $\varphi^4$ model and weak lensing convergence maps with higher resolution than in previous works.
Submitted 31 May, 2023;
originally announced June 2023.
-
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation
Authors:
David Brandfonbrener,
Ofir Nachum,
Joan Bruna
Abstract:
In recent years, domains such as natural language processing and image recognition have popularized the paradigm of using large datasets to pretrain representations that can be effectively transferred to downstream tasks. In this work we evaluate how such a paradigm should be done in imitation learning, where both pretraining and finetuning data are trajectories collected by experts interacting with an unknown environment. Namely, we consider a setting where the pretraining corpus consists of multitask demonstrations and the task for each demonstration is set by an unobserved latent context variable. The goal is to use the pretraining corpus to learn a low dimensional representation of the high dimensional (e.g., visual) observation space which can be transferred to a novel context for finetuning on a limited dataset of demonstrations. Among a variety of possible pretraining objectives, we argue that inverse dynamics modeling -- i.e., predicting an action given the observations appearing before and after it in the demonstration -- is well-suited to this setting. We provide empirical evidence of this claim through evaluations on a variety of simulated visuomotor manipulation problems. While previous work has attempted various theoretical explanations regarding the benefit of inverse dynamics modeling, we find that these arguments are insufficient to explain the empirical advantages often observed in our settings, and so we derive a novel analysis using a simple but general environment model.
Submitted 25 October, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
A Systematic Mapping Study and Practitioner Insights on the Use of Software Engineering Practices to Develop MVPs
Authors:
Silvio Alonso,
Marcos Kalinowski,
Bruna Ferreira,
Simone D. J. Barbosa,
Helio Lopes
Abstract:
[Background] The MVP concept has influenced the way in which development teams apply Software Engineering practices. However, the overall understanding of this influence of MVPs on SE practices is still poor. [Objective] Our goal is to characterize the publication landscape on practices that have been used in the context of software MVPs and to gather practitioner insights on the identified practices. [Method] We conducted a systematic mapping study and discussed its results in two focus group sessions involving twelve industry practitioners that extensively use MVPs in their projects to capture their perceptions on the findings of the mapping study. [Results] We identified 33 papers published between 2013 and 2020 and observed some trends related to MVP ideation and evaluation practices. For instance, regarding ideation, we found six different approaches and mainly informal end-user involvement practices. Regarding evaluation, there is an emphasis on end-user validations based on practices such as usability tests, A/B testing, and usage data analysis. However, there is still limited research related to MVP technical feasibility assessment and effort estimation. Practitioners of the focus group sessions reinforced the confidence in our results regarding ideation and evaluation practices, being aware of most of the identified practices. They also reported how they deal with the technical feasibility assessments and effort estimation in practice. [Conclusion] Our analysis suggests that there are opportunities for solution proposals and evaluation studies to address literature gaps concerning technical feasibility assessment and effort estimation. Overall, more effort needs to be invested into empirically evaluating the existing MVP-related practices.
Submitted 14 May, 2023;
originally announced May 2023.
-
A High-Speed Waveguide Integrated InSe Photodetector on SiN Photonics for NIR Applications
Authors:
Srinivasa Reddy Tamalampudi,
Juan Esteban Villegas,
Ghada Dushaq,
Raman Sankar,
Bruna Paredes,
Mahmoud Rasras
Abstract:
On-chip integration of two-dimensional (2D) materials offers great potential for the realization of novel optoelectronic devices in different photonic platforms. In particular, indium selenide (InSe) is a very promising 2D material due to its ultra-high carrier mobility and outstanding photo-responsivity. Here, we report a high-speed photodetector based on a 90 nm thick multilayer InSe integrated on a silicon nitride (SiN) waveguide. The device exhibits a low dark current of 10 nA at 1 V bias, a remarkable photoresponsivity of 0.38 A W$^{-1}$, and a high external quantum efficiency of 48.4% measured at 5 V bias. This performance is tested at a near-infrared (NIR) wavelength of 976 nm under ambient conditions. Furthermore, using numerical and experimental investigations, the estimated absorption coefficient per unit length is 0.11 dB/µm. To determine the dynamic response of the photodetector, its small- and large-signal frequency responses are also evaluated. A 3-dB radiofrequency (RF) bandwidth of 85 MHz is measured with an open-eye diagram observed at 1 Gbit/s data transmission. Given these outstanding optoelectronic merits, active photonic devices based on integrated multilayer InSe can be realized for a variety of applications including short-reach optical interconnects, LiDAR imaging, and biosensing.
Submitted 1 May, 2023;
originally announced May 2023.
-
An elementary multidimensional fundamental theorem of calculus
Authors:
Joaquim Bruna
Abstract:
We discuss a version of the fundamental theorem of calculus in several variables and some applications, of potential interest as a teaching material in undergraduate courses.
Submitted 15 August, 2023; v1 submitted 28 April, 2023;
originally announced April 2023.
-
Nuclear activity in $z<0.3$ QSO 2's mainly triggered by galaxy mergers
Authors:
Bruna L. C. Araujo,
Thaisa Storchi-Bergmann,
Sandro B. Rembold,
André L. P. Kaipper,
Bruno Dall'Agnol de Oliveira
Abstract:
We investigate the role of the close environment on the nuclear activity of a sample of 436 nearby ($z<0.3$) QSO 2's -- selected from SDSS-III spectra -- via comparison of their environment and interaction parameters with those of a control sample of 1308 galaxies. We have used the corresponding SDSS images to obtain the number of neighbour galaxies $N$, tidal strength parameter $Q$ and asymmetry parameters. We find a small excess of $N$ in the QSOs compared to their three controls, and no difference in $Q$. The main difference is an excess of asymmetry in the QSO hosts, which is almost twice that of the control galaxies. This difference is not due to the hosts' morphology, since there is no difference in their Galaxy Zoo classifications. HST images of two highly asymmetric QSO 2 hosts of our sample show that both sources have a close companion (at projected separations $\sim 5$ kpc), which we thus conclude is the cause of the observed asymmetry in the lower resolution SDSS images. The mean projected radius of the controls is $\langle r \rangle = 8.53 \pm 0.06$ kpc, while that of the QSO hosts is $\langle r \rangle = 9.39 \pm 0.12$ kpc, supporting the presence of interaction signatures in the outer regions of the QSO hosts. Our results favour a scenario in which nuclear activity in QSO 2's is triggered by close galaxy interactions -- when the distance between the host and companion is of the order of the galaxy radius, implying that they are already in the process of merger.
Submitted 24 April, 2023;
originally announced April 2023.
-
Data-driven multiscale modeling of subgrid parameterizations in climate models
Authors:
Karl Otness,
Laure Zanna,
Joan Bruna
Abstract:
Subgrid parameterizations, which represent physical processes occurring below the resolution of current climate models, are an important component in producing accurate, long-term predictions for the climate. A variety of approaches have been tested to design these components, including deep learning methods. In this work, we evaluate a proof of concept illustrating a multiscale approach to this prediction problem. We train neural networks to predict subgrid forcing values on a testbed model and examine improvements in prediction accuracy that can be obtained by using additional information in both fine-to-coarse and coarse-to-fine directions.
Submitted 24 March, 2023;
originally announced March 2023.
-
Electrical manipulation of a single electron spin in CMOS with micromagnet and spin-valley coupling
Authors:
Bernhard Klemt,
Victor El-Homsy,
Martin Nurizzo,
Pierre Hamonic,
Biel Martinez,
Bruna Cardoso Paz,
Cameron Spence,
Matthieu Dartiailh,
Baptiste Jadot,
Emmanuel Chanrion,
Vivien Thiney,
Renan Lethiecq,
Benoit Bertrand,
Heimanu Niebojewski,
Christopher Bäuerle,
Maud Vinet,
Yann-Michel Niquet,
Tristan Meunier,
Matias Urdampilleta
Abstract:
For semiconductor spin qubits, complementary-metal-oxide-semiconductor (CMOS) technology is the ideal candidate for reliable and scalable fabrication. Making the direct leap from academic fabrication to qubits fabricated fully by industrial CMOS standards is difficult without intermediate solutions. With a flexible back-end-of-line (BEOL), new functionalities such as micromagnets or superconducting circuits can be added in a post-CMOS process to study the physics of these devices or achieve proofs of concept. Once the process is established, it can be incorporated in the foundry-compatible process flow. Here, we study a single electron spin qubit in a CMOS device with a micromagnet integrated in the flexible BEOL. We exploit the synthetic spin-orbit coupling (SOC) to control the qubit via an electric field, and we investigate the spin-valley physics in the presence of SOC, where we show an enhancement of the Rabi frequency at the spin-valley hotspot. Finally, we probe the high-frequency noise in the system using dynamical decoupling pulse sequences and demonstrate that charge noise dominates the qubit decoherence in this range.
Submitted 8 March, 2023;
originally announced March 2023.
-
On the solid angle of a convex set
Authors:
J. Bruna,
J. Cufí,
E. Gallego,
A. Reventós
Abstract:
Here we analyze three-dimensional analogues of the classical Crofton's formula for planar compact convex sets. In this formula a fundamental role is played by the visual angle of the convex set from an exterior point. A generalization of the visual angle to convex sets in Euclidean space is the visual solid angle. This solid angle, being a spherically convex set in the unit sphere, has length, area and other geometric quantities to be considered. The main goal of this note is to express invariant quantities of the original convex set depending on volume, surface area and mean curvature integral by means of integrals of functions related to the solid angle.
Submitted 8 March, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
A Multi-layered GaGeTe Electro-Optic Device Integrated in Silicon Photonics
Authors:
Srinivasa Reddy Tamalampudi,
Ghada Dushaq,
Juan Esteban Villegas,
Bruna Paredes,
Mahmoud S. Rasras
Abstract:
Electrically tunable devices contribute significantly to key functions of photonic integrated circuits. Here, we demonstrate the tuning of the optical index of refraction based on hybrid integration of multi-layered anisotropic GaGeTe on a silicon micro-ring resonator (Si-MRR). Under static applied (DC) bias and transverse-electric (TE) polarization, the device exhibits a linear resonance shift without any amplitude modulation. However, for the transverse-magnetic (TM) polarization, both amplitude and phase modulation are observed. The corresponding wavelength shift and half-wave voltage-length product ($V_\pi \cdot l$) for the TE polarization are 1.78 pm/V and 0.9 V·cm, respectively. These values are enhanced for the TM polarization and correspond to 6.65 pm/V and 0.28 V·cm, respectively. The dynamic radio frequency (RF) response of the devices was also tested at different bias conditions. Remarkably, the device exhibits responses of 1.6 MHz and 2.1 MHz at 0 V and 7 V bias, respectively. Based on these findings, the integration of 2D GaGeTe on the silicon photonics platform has great potential for the next generation of integrated photonic applications such as switches and phase shifters.
Submitted 3 December, 2022;
originally announced December 2022.
-
Anisotropic Van der Waals 2D GeAs Integrated on Silicon Four-Waveguide Crossing
Authors:
Ghada Dushaq,
Juan Esteban Villegas,
Bruna Paredes,
Srinivasa Reddy Tamalampudi,
Mahmoud S. Rasras
Abstract:
In-plane optical anisotropy plays a critical role in manipulating light in a wide range of planar photonic devices. In this study, the strong anisotropy of multilayer 2D GeAs is leveraged to validate the technical feasibility of on-chip light management. 2D GeAs is stamped into an ultra-compact silicon waveguide four-way crossing optimized for operation in the O-optical band. The measured optical transmission spectra indicated a remarkable discrepancy between the in-plane crystal optical axes, with an attenuation ratio of ~3.5 (at 1330 nm). Additionally, the effect of GeAs crystal orientation on the electro-optic transmission performance is demonstrated on a straight waveguide. A notable 50% reduction in responsivity was recorded for devices constructed with the cross direction compared to devices with the crystal a-direction parallel to the light polarization. This extraordinary optical anisotropy, combined with the high refractive index (~4) of 2D GeAs, opens possibilities for efficient on-chip light manipulation in photonic devices.
Submitted 2 December, 2022;
originally announced December 2022.
-
Lessons Learned to Improve the UX Practices in Agile Projects Involving Data Science and Process Automation
Authors:
Bruna Ferreira,
Silvio Marques,
Marcos Kalinowski,
Helio Lopes,
Simone D. J. Barbosa
Abstract:
Context: User-Centered Design and Agile methodologies focus on human issues. Nevertheless, agile methodologies focus on contact with contracting customers and generating value for them. Usually, the communication between end users and the agile team is mediated by customers. However, customers do not know the problems end users face in their routines. Hence, UX issues are typically identified only after the implementation, during user testing and validation. Objective: Aiming to improve the understanding and definition of the problem in agile projects, this research investigates the practices and difficulties experienced by agile teams during the development of data science and process automation projects. Also, we analyze the benefits and the teams' perceptions regarding user participation in these projects. Method: We collected data from four agile teams in an academia-industry collaboration focusing on delivering data science and process automation solutions. To this end, we applied a carefully designed questionnaire answered by developers, scrum masters, and UX designers. In total, 18 subjects answered the questionnaire. Results: From the results, we identify practices used by the teams to define and understand the problem and to represent the solution. The practices most often used are prototypes and meetings with stakeholders. Another practice that helped the teams to understand the problem was using Lean Inceptions. Also, our results present some specific issues regarding data science projects. Conclusion: We observed that end-user participation can be critical to understanding and defining the problem. End users help to define elements of the domain and barriers in the implementation. We identified a need for approaches that facilitate user-team communication in data science projects and a need for more detailed requirements representations to support data science solutions.
Submitted 24 November, 2022;
originally announced November 2022.
-
A Functional-Space Mean-Field Theory of Partially-Trained Three-Layer Neural Networks
Authors:
Zhengdao Chen,
Eric Vanden-Eijnden,
Joan Bruna
Abstract:
To understand the training dynamics of neural networks (NNs), prior studies have considered the infinite-width mean-field (MF) limit of two-layer NNs, establishing theoretical guarantees of its convergence under gradient flow training as well as its approximation and generalization capabilities. In this work, we study the infinite-width limit of a type of three-layer NN model whose first layer is random and fixed. To define the limiting model rigorously, we generalize the MF theory of two-layer NNs by treating the neurons as belonging to functional spaces. Then, by writing the MF training dynamics as a kernel gradient flow with a time-varying kernel that remains positive-definite, we prove that its training loss in $L_2$ regression decays to zero at a linear rate. Furthermore, we define function spaces that include the solutions obtainable through the MF training dynamics and prove Rademacher complexity bounds for these spaces. Our theory accommodates different scaling choices of the model, resulting in two regimes of the MF limit that demonstrate distinctive behaviors while both exhibiting feature learning.
Submitted 28 October, 2022;
originally announced October 2022.
-
Learning Single-Index Models with Shallow Neural Networks
Authors:
Alberto Bietti,
Joan Bruna,
Clayton Sanford,
Min Jae Song
Abstract:
Single-index models are a class of functions given by an unknown univariate ``link'' function applied to an unknown one-dimensional projection of the input. These models are particularly relevant in high dimension, when the data might present low-dimensional structure that learning algorithms should adapt to. While several statistical aspects of this model, such as the sample complexity of recovering the relevant (one-dimensional) subspace, are well-understood, they rely on tailored algorithms that exploit the specific structure of the target function. In this work, we introduce a natural class of shallow neural networks and study its ability to learn single-index models via gradient flow. More precisely, we consider shallow networks in which biases of the neurons are frozen at random initialization. We show that the corresponding optimization landscape is benign, which in turn leads to generalization guarantees that match the near-optimal sample complexity of dedicated semi-parametric methods.
Submitted 27 October, 2022;
originally announced October 2022.
-
New insights from cross-correlation studies between Solar activity and Cosmic-ray fluxes
Authors:
Nicola Tomassetti,
Bruna Bertucci,
Emanuele Fiandrini
Abstract:
The observed variability of the cosmic-ray intensity in interplanetary space is driven by the evolution of the Sun's magnetic activity over its 11-year quasiperiodical cycle. Investigating the relationship between solar activity indices and cosmic-ray intensity measurements is therefore essential for understanding the fundamental processes of particle transport in the heliosphere. Here we have performed a global characterization of the solar modulation of cosmic rays over the solar activity cycle and for different energies of the cosmic particles. We present our cross-correlation studies using data from space experiments, neutron monitors and solar observatories collected over several solar cycles.
Submitted 11 October, 2022;
originally announced October 2022.
-
Data driven analysis of Galactic cosmic rays in the heliosphere: diffusion of cosmic protons and nuclei
Authors:
Nicola Tomassetti,
Bruna Bertucci,
Federico Donnini,
Emanuele Fiandrini,
Maura Graziani,
Behrouz Khiali,
Alejandro Reina Conde
Abstract:
Galactic cosmic rays (GCRs) inside the heliosphere are affected by magnetic turbulence and solar wind disturbances, which result in the so-called solar modulation effect. To investigate this phenomenon, we have performed a data-driven analysis of the temporal dependence of the GCR flux over the solar cycle. With a global statistical inference of GCR data collected in space by AMS-02, PAMELA, and CRIS on a monthly basis, we have determined the dependence of the GCR diffusion parameters upon time and rigidity. In this conference, we present our results for GCR protons and nuclei, and we discuss their interpretation in terms of basic processes of particle transport and their relations with the dynamics of the heliospheric plasma.
Submitted 11 October, 2022;
originally announced October 2022.
-
Temporal evolution and rigidity dependence of the solar modulation lag of Galactic cosmic rays
Authors:
Nicola Tomassetti,
Bruna Bertucci,
Emanuele Fiandrini
Abstract:
When traveling in the heliosphere, Galactic cosmic rays (GCRs) are subjected to the solar modulation effect, a quasiperiodical change of their intensity caused by the 11-year cycle of solar activity. Here we investigate the association of solar activity and cosmic radiation over five solar cycles, from 1965 to 2020, using a collection of multichannel data from neutron monitors, space missions, and solar observatories. In particular, we focus on the time lag between the monthly sunspot number and the GCR flux variations. We show that the modulation lag is subject to a 22-year periodical variation, ranging from about 2 to 14 months and following the polarity cycle of the Sun's magnetic field. We also show that the lag decreases markedly with increasing energy of the GCR particles. These results reflect the interplay of basic physics phenomena that cause the GCR modulation effect: the drift motion of charged particles in the interplanetary magnetic field, the latitudinal dependence of the solar wind, and the energy dependence of the particles' residence time in the heliosphere. Based on this interpretation, we end up with a global effective formula for the modulation lag and testable predictions for the flux evolution of cosmic particles and antiparticles over the solar cycle.
Submitted 19 November, 2022; v1 submitted 11 October, 2022;
originally announced October 2022.