-
Evidence of directional structural superlubricity and Lévy flights in a van der Waals heterostructure
Authors:
Maxime Le Ster,
Paweł Krukowski,
Maciej Rogala,
Paweł Dabrowski,
Iaroslav Lutsyk,
Klaudia Toczek,
Krzysztof Podlaski,
Tefvik O. Mendeş,
Francesca Genuzio,
Andrea Locatelli,
Guan Bian,
Tai-Chang Chiang,
Simon A. Brown,
Paweł J. Kowalczyk
Abstract:
Structural superlubricity is a special frictionless contact in which two crystals are in incommensurate arrangement such that relative in-plane translation is associated with vanishing energy barrier crossing. So far, it has been realized in multilayer graphene and other van der Waals two-dimensional crystals with hexagonal or triangular crystalline symmetries, leading to isotropic frictionless co…
▽ More
Structural superlubricity is a special frictionless contact in which two crystals are in incommensurate arrangement such that relative in-plane translation is associated with vanishing energy barrier crossing. So far, it has been realized in multilayer graphene and other van der Waals two-dimensional crystals with hexagonal or triangular crystalline symmetries, leading to isotropic frictionless contacts. Directional structural superlubricity, to date unrealized in two-dimensional systems, is possible when the reciprocal lattices of the two crystals coincide in one direction only. Here, we evidence directional structural superlubricity a $α$-bismuthene/graphite van der Waals system, manifested by spontaneous hop** of the islands over hundreds of nanometres at room temperature, resolved by low-energy electron microscopy and supported by registry simulations. Statistical analysis of individual and collective $α$-bismuthene islands populations reveal a heavy-tailed distribution of the hop** lengths and sticking times indicative of Lévy flight dynamics, largely unobserved in massive condensed-matter systems.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter
Authors:
M. Aamir,
B. Acar,
G. Adamov,
T. Adams,
C. Adloff,
S. Afanasiev,
C. Agrawal,
C. Agrawal,
A. Ahmad,
H. A. Ahmed,
S. Akbar,
N. Akchurin,
B. Akgul,
B. Akgun,
R. O. Akpinar,
E. Aktas,
A. AlKadhim,
V. Alexakhin,
J. Alimena,
J. Alison,
A. Alpana,
W. Alshehri,
P. Alvarez Dominguez,
M. Alyari,
C. Amendola
, et al. (550 additional authors not shown)
Abstract:
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr…
▽ More
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadronic section. The shower reconstruction method is based on graph neural networks and it makes use of a dynamic reduction network architecture. It is shown that the algorithm is able to capture and mitigate the main effects that normally hinder the reconstruction of hadronic showers using classical reconstruction methods, by compensating for fluctuations in the multiplicity, energy, and spatial distributions of the shower's constituents. The performance of the algorithm is evaluated using test beam data collected in 2018 prototype of the CMS HGCAL accompanied by a section of the CALICE AHCAL prototype. The capability of the method to mitigate the impact of energy leakage from the calorimeter is also demonstrated.
△ Less
Submitted 30 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Authors:
Marah Abdin,
Sam Ade Jacobs,
Ammar Ahmad Awan,
Jyoti Aneja,
Ahmed Awadallah,
Hany Awadalla,
Nguyen Bach,
Amit Bahree,
Arash Bakhtiari,
Jianmin Bao,
Harkirat Behl,
Alon Benhaim,
Misha Bilenko,
Johan Bjorck,
Sébastien Bubeck,
Qin Cai,
Martin Cai,
Caio César Teodoro Mendes,
Weizhu Chen,
Vishrav Chaudhary,
Dong Chen,
Dongdong Chen,
Yen-Chun Chen,
Yi-Ling Chen,
Parul Chopra
, et al. (90 additional authors not shown)
Abstract:
We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset…
▽ More
We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset for training, a scaled-up version of the one used for phi-2, composed of heavily filtered publicly available web data and synthetic data. The model is also further aligned for robustness, safety, and chat format. We also provide some initial parameter-scaling results with a 7B and 14B models trained for 4.8T tokens, called phi-3-small and phi-3-medium, both significantly more capable than phi-3-mini (e.g., respectively 75% and 78% on MMLU, and 8.7 and 8.9 on MT-bench). Moreover, we also introduce phi-3-vision, a 4.2 billion parameter model based on phi-3-mini with strong reasoning capabilities for image and text prompts.
△ Less
Submitted 23 May, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
A biosensor based on magnetoelastic waves for detection of antibodies in human plasma for COVID-19 serodiagnosis
Authors:
Wenderson R. F. Silva,
Larissa C. P. Monteiro,
Renato L. Senra,
Eduardo N. D. de Araujo,
Rafael O. R. R. Cunha,
Tiago A. O. Mendes,
Joaquim B. S. Mendes
Abstract:
The study proposes a new efficient wireless biosensor based on magnetoelastic waves for the detection of antibodies in human plasma, aiming at the serological diagnosis of COVID-19. The biosensor was functionalized with the N antigen - nucleocapsid phosphoprotein of the SARS-CoV-2 virus. Validation analyses, by sodium dodecyl-sulfate polyacrylamide gel electrophoresis (SDS-PAGE), Western blotting,…
▽ More
The study proposes a new efficient wireless biosensor based on magnetoelastic waves for the detection of antibodies in human plasma, aiming at the serological diagnosis of COVID-19. The biosensor was functionalized with the N antigen - nucleocapsid phosphoprotein of the SARS-CoV-2 virus. Validation analyses, by sodium dodecyl-sulfate polyacrylamide gel electrophoresis (SDS-PAGE), Western blotting, atomic force microscopy (AFM), scanning electron microscopy (SEM), micro-Raman spectroscopy, confirmed the selectivity and effective surface functionalization of the biosensor. The research successfully obtained, expressed and purified the recombinant antigen, while plasma samples from COVID-19 positive and negative patients were used to test the performance of the biosensor. A comparison of performance with the ELISA method revealed equivalent diagnostic power. These results indicate the robustness of the biosensor in reliably differentiating between positive and negative samples, highlighting its potential as an efficient and low-cost tool for the serological diagnosis of COVID-19. In addition to being fast to execute and having the potential for automation in large-scale diagnostic studies, the biosensor fills a significant gap in existing SARS-CoV-2 detection approaches.
△ Less
Submitted 9 June, 2024; v1 submitted 17 March, 2024;
originally announced March 2024.
-
Evidence and quantification of memory effects in competitive first passage events
Authors:
M. Dolgushev,
T. V. Mendes,
B. Gorin,
K. Xie,
N. Levernier,
O. Bénichou,
H. Kellay,
R. Voituriez,
T. Guérin
Abstract:
Splitting probabilities quantify the likelihood of a given outcome out of competitive events for general random processes. This key observable of random walk theory, historically introduced as the Gambler's ruin problem for a player in a casino, has a broad range of applications beyond mathematical finance in evolution genetics, physics and chemistry, such as allele fixation, polymer translocation…
▽ More
Splitting probabilities quantify the likelihood of a given outcome out of competitive events for general random processes. This key observable of random walk theory, historically introduced as the Gambler's ruin problem for a player in a casino, has a broad range of applications beyond mathematical finance in evolution genetics, physics and chemistry, such as allele fixation, polymer translocation, protein folding and more generally competitive reactions. The statistics of competitive events is well understood for memoryless (Markovian) processes. However, in complex systems such as polymer fluids, the motion of a particle should typically be described as a process with memory. Appart from scaling theories and perturbative approaches in one-dimension, the outcome of competitive events is much less characterized analytically for processes with memory. Here, we introduce an analytical approach that provides the splitting probabilities for general $d$-dimensional non-Markovian Gaussian processes. This analysis shows that splitting probabilities are critically controlled by the out of equilibrium statistics of reactive trajectories, observed after the first passage. This hallmark of non-Markovian dynamics and its quantitative impact on splitting probabilities are directly evidenced in a prototypical experimental reaction scheme in viscoelastic fluids. Altogether, these results reveal both experimentally and theoretically the importance of memory effects on competitive reactions.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
LDIC Survey 2023: Feeling Welcome in the Community
Authors:
Christopher Aubin,
Bipasha Chakraborty,
Will Detmold,
Sofie Martins,
Nilmani Mathur,
Tereza Mendes,
Finn M. Stokes
Abstract:
We review the level of welcomeness that members of the lattice field theory community feel based on the results of a survey performed in May and June 2023. While respondents reported generally high levels of feeling welcome at the lattice conference, women and people with diverse gender identities, sexual orientations, ethnic backgrounds and religious affiliations feel less included and have more…
▽ More
We review the level of welcomeness that members of the lattice field theory community feel based on the results of a survey performed in May and June 2023. While respondents reported generally high levels of feeling welcome at the lattice conference, women and people with diverse gender identities, sexual orientations, ethnic backgrounds and religious affiliations feel less included and have more negative experiences at the lattice conference than their peers. Respondents report that they are actively informing themselves about inequities in the community, however a large fraction of survey participants underestimate the severity of the problem, as was found in previous surveys. The survey data indicate that this situation can be most effectively improved by organizing talks and events about issues of diversity and inclusion within the lattice community. Respondents also reported that individual readings of scientific papers on equity and inclusion are effective in giving people agency in making a change and hence it may be helpful to collate a collection of important articles on these topics.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Textbooks Are All You Need
Authors:
Suriya Gunasekar,
Yi Zhang,
Jyoti Aneja,
Caio César Teodoro Mendes,
Allie Del Giorno,
Sivakanth Gopi,
Mojan Javaheripi,
Piero Kauffmann,
Gustavo de Rosa,
Olli Saarikivi,
Adil Salim,
Shital Shah,
Harkirat Singh Behl,
Xin Wang,
Sébastien Bubeck,
Ronen Eldan,
Adam Tauman Kalai,
Yin Tat Lee,
Yuanzhi Li
Abstract:
We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of ``textbook quality" data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens). Despite this small scale, phi-1 attains pass@1 accu…
▽ More
We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of ``textbook quality" data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens). Despite this small scale, phi-1 attains pass@1 accuracy 50.6% on HumanEval and 55.5% on MBPP. It also displays surprising emergent properties compared to phi-1-base, our model before our finetuning stage on a dataset of coding exercises, and phi-1-small, a smaller model with 350M parameters trained with the same pipeline as phi-1 that still achieves 45% on HumanEval.
△ Less
Submitted 2 October, 2023; v1 submitted 20 June, 2023;
originally announced June 2023.
-
The quark propagator and quark-gluon vertex from lattice QCD at finite temperature
Authors:
Jesuel Marques,
Gerhard Kalusche,
Tereza Mendes,
Paulo Silva,
Jon-Ivar Skullerud,
Orlando Oliveira
Abstract:
The quark-gluon vertex is an important object of QCD. Studies have shown that this quantity is relevant for the dynamical chiral symmetry breaking pattern in the vacuum. The goal of our project is to obtain the quark-gluon vertex at finite temperature around the deconfinement/chiral transition using the tools provided by lattice QCD. It will be the first time that the quark-gluon vertex at finite…
▽ More
The quark-gluon vertex is an important object of QCD. Studies have shown that this quantity is relevant for the dynamical chiral symmetry breaking pattern in the vacuum. The goal of our project is to obtain the quark-gluon vertex at finite temperature around the deconfinement/chiral transition using the tools provided by lattice QCD. It will be the first time that the quark-gluon vertex at finite temperature is determined using lattice QCD. The propagators, which are a by-product of this project, are also of interest in themselves. The configurations used were generated by the FASTSUM collaboration. In this contribution, we describe our motivations and goals, some technical details of the determination and report on the status of the calculation.
△ Less
Submitted 25 January, 2023;
originally announced January 2023.
-
Rossby numbers of fully and partially convective stars
Authors:
N. R. Landin,
L. T. S. Mendes,
L. P. R. Vaz,
S. H. P. Alencar
Abstract:
We investigate stellar magnetic activity from the theoretical point of view, by using stellar evolution models to calculate theoretical convective turnover times ($τ_{\rm c}$) and Rossby numbers (${\rm Ro}$) for pre-main-sequence and main-sequence stars. The problem is that the canonical place where $τ_{\rm c}$ is usually determined (half a mixing length above the base of the convective zone) fail…
▽ More
We investigate stellar magnetic activity from the theoretical point of view, by using stellar evolution models to calculate theoretical convective turnover times ($τ_{\rm c}$) and Rossby numbers (${\rm Ro}$) for pre-main-sequence and main-sequence stars. The problem is that the canonical place where $τ_{\rm c}$ is usually determined (half a mixing length above the base of the convective zone) fails for fully convective stars and there is no agreement on this in the literature. Our calculations were performed with the ATON stellar evolution code. We concentrated our analysis on fully and partially convective stars motivated by recent observations of slowly rotating fully convective stars, whose X-ray emissions correlate with their Rossby numbers in the same way as in solar-like stars, suggesting that the presence of a tachocline is not required for magnetic field generation. We investigate the behaviour of $τ_{\rm c}$ over the stellar radius for stars of different masses and ages. As ${\rm Ro}$ depends on $τ_{\rm c}$, which varies strongly with the stellar radius, we use our theoretical results to determine a better radial position at which to calculate it for fully convective stars. Using our alternative locations, we fit a sample of 847 stars in the rotation-activity diagram ($L_{\rm X}/L_{\rm bol}$ versus ${\rm Ro}$) with a two-part power-law function. Our fit parameters are consistent with previous work, showing that stars with ${\rm Ro}$$\leq$${\rm Ro_{sat}}$ are distributed around a saturation level in $L_{\rm X}/L_{\rm bol}$ and, for stars with ${\rm Ro}$$>$${\rm Ro_{sat}}$, $L_{\rm X}/L_{\rm bol}$ clearly decays with ${\rm Ro}$ with an exponent of $-2.4\!\pm\!0.1$.
△ Less
Submitted 28 December, 2022;
originally announced December 2022.
-
Probing singularities of Landau-gauge propagators with Padé approximants
Authors:
Cristiane Y. London,
Diogo Boito,
Attilio Cucchieri,
Tereza Mendes
Abstract:
Padé approximants are employed in order to study the analytic structure of the four-dimensional SU(2) Landau-gauge gluon and ghost propagators in the infrared regime. The approximants, which are model independent, are used as fitting functions to lattice data for the propagators, carefully propagating uncertainties due to the fit procedure and taking into account all possible correlations. Applyin…
▽ More
Padé approximants are employed in order to study the analytic structure of the four-dimensional SU(2) Landau-gauge gluon and ghost propagators in the infrared regime. The approximants, which are model independent, are used as fitting functions to lattice data for the propagators, carefully propagating uncertainties due to the fit procedure and taking into account all possible correlations. Applying this procedure systematically to the gluon-propagator data, we observe the presence of a pair of complex poles at $p^2_{\mathrm{pole}} = (-0.37 \pm 0.05_{\mathrm{stat}} \pm 0.08_{\mathrm{sys}}) \pm \, i\, (0.66 \pm 0.03_{\mathrm{stat}} \pm 0.02_{\mathrm{sys}}) \, \mathrm{GeV}^2$, where ``stat'' represents the statistical error and ``sys'' the systematic one. We also find a zero on the negative real axis of $p^2$, at $p^2_{\mathrm{zero}} = (-2.9 \pm 0.4_{\mathrm{stat}} \pm 0.9_{\mathrm{sys}}) \, \mathrm{GeV}^2$. We thus note that our procedure -- which is based on a model-independent approach and includes careful error propagation -- confirms the presence of a pair of complex poles in the gluon propagator, in agreement with previous works. For the ghost propagator, the Padés indicate the existence of the single pole at $p^2 = 0$, as expected. We also find evidence of a branch cut on the negative real axis. Through the use of the so-called D-Log Padé method, which is designed to approximate functions with cuts, we corroborate the existence of this cut for the ghost propagator.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Probing the singularities of the Landau-gauge gluon and ghost propagators with rational approximants
Authors:
Diogo Boito,
Attilio Cucchieri,
Cristiane Y. London,
Tereza Mendes
Abstract:
We employ Padé approximants in the study of the analytic structure of the four-dimensional $SU(2)$ Landau-gauge gluon and ghost propagators in the infrared regime. The approximants, which are model independent, serve as fitting functions for the lattice data. We carefully propagate the uncertainties due to the fitting procedure, taking into account all possible correlations. For the gluon-propagat…
▽ More
We employ Padé approximants in the study of the analytic structure of the four-dimensional $SU(2)$ Landau-gauge gluon and ghost propagators in the infrared regime. The approximants, which are model independent, serve as fitting functions for the lattice data. We carefully propagate the uncertainties due to the fitting procedure, taking into account all possible correlations. For the gluon-propagator data, we confirm the presence of a pair of complex poles at $p_{\rm pole}^2 = \left[(-0.37 \,\pm\, 0.05_{\rm stat}\,\pm\, 0.08_{\rm sys}) \pm i\,(0.66\, \pm\, 0.03_{\rm stat}\, \pm\, 0.02_{\rm sys})\right]\, \mathrm{GeV}^2$, where the first error is statistical and the second systematic. The existence of this pair of complex poles, already hinted upon in previous works, is thus put onto a firmer basis, thanks to the model independence and to the careful error propagation of our analysis. For the ghost propagator, the Padés indicate the existence of a single pole at $p^2 = 0$, as expected. In this case, our results also show evidence of a branch cut along the negative real axis of $p^2$. This is corroborated with another type of approximant, the D-Log Padés, which are better suited to studying functions with a branch cut and are applied here for the first time in this context. Due to particular features and limited statistics of the gluon-propagator data, our analysis is inconclusive regarding the presence of a branch cut in the gluon case.
△ Less
Submitted 25 January, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Authors:
Ganesh Jawahar,
Subhabrata Mukherjee,
Debadeepta Dey,
Muhammad Abdul-Mageed,
Laks V. S. Lakshmanan,
Caio Cesar Teodoro Mendes,
Gustavo Henrique de Rosa,
Shital Shah
Abstract:
Autocomplete is a task where the user inputs a piece of text, termed prompt, which is conditioned by the model to generate semantically coherent continuation. Existing works for this task have primarily focused on datasets (e.g., email, chat) with high frequency user prompt patterns (or focused prompts) where word-based language models have been quite effective. In this work, we study the more cha…
▽ More
Autocomplete is a task where the user inputs a piece of text, termed prompt, which is conditioned by the model to generate semantically coherent continuation. Existing works for this task have primarily focused on datasets (e.g., email, chat) with high frequency user prompt patterns (or focused prompts) where word-based language models have been quite effective. In this work, we study the more challenging open-domain setting consisting of low frequency user prompt patterns (or broad prompts, e.g., prompt about 93rd academy awards) and demonstrate the effectiveness of character-based language models. We study this problem under memory-constrained settings (e.g., edge devices and smartphones), where character-based representation is effective in reducing the overall model size (in terms of parameters). We use WikiText-103 benchmark to simulate broad prompts and demonstrate that character models rival word models in exact match accuracy for the autocomplete task, when controlled for the model size. For instance, we show that a 20M parameter character model performs similar to an 80M parameter word model in the vanilla setting. We further propose novel methods to improve character models by incorporating inductive bias in the form of compositional information and representation transfer from large word models. Datasets and code used in this work are available at https://github.com/UBC-NLP/char_autocomplete.
△ Less
Submitted 7 June, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
A model of membrane deformations driven by a surface pH gradient
Authors:
Toni V. Mendes,
Jonas Ranft,
Hélène Berthoumieux
Abstract:
Many cellular organelles are membrane-bound structures with complex membrane composition and shape. Their shapes have been observed to depend on the metabolic state of the organelle, and the mechanisms that couple biochemical pathways and membrane shape are still actively investigated. Here, we study a model coupling inhomogeneities in the lipid composition and membrane geometry via a generalized…
▽ More
Many cellular organelles are membrane-bound structures with complex membrane composition and shape. Their shapes have been observed to depend on the metabolic state of the organelle, and the mechanisms that couple biochemical pathways and membrane shape are still actively investigated. Here, we study a model coupling inhomogeneities in the lipid composition and membrane geometry via a generalized Helfrich free energy. We derive the resulting stress tensor, the Green's function for a tubular membrane and compute the phase diagram of the induced deformations. We then apply this model to study the deformation of mitochondria cristae described as membrane tubes supporting a pH gradient at its surface. This gradient in turn controls the lipid composition of the membrane via the protonation/deprotonation of cardiolipins, which are acid-based lipids known to be crucial for mitochondria shape and functioning. Our model predicts the appearance of tube deformations resembling the observed shape changes of cristea when submitted to a proton gradient.
△ Less
Submitted 23 January, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
One Network Doesn't Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning
Authors:
Sharath Girish,
Debadeepta Dey,
Neel Joshi,
Vibhav Vineet,
Shital Shah,
Caio Cesar Teodoro Mendes,
Abhinav Shrivastava,
Yale Song
Abstract:
The current literature on self-supervised learning (SSL) focuses on develo** learning objectives to train neural networks more effectively on unlabeled data. The typical development process involves taking well-established architectures, e.g., ResNet demonstrated on ImageNet, and using them to evaluate newly developed objectives on downstream scenarios. While convenient, this does not take into…
▽ More
The current literature on self-supervised learning (SSL) focuses on develo** learning objectives to train neural networks more effectively on unlabeled data. The typical development process involves taking well-established architectures, e.g., ResNet demonstrated on ImageNet, and using them to evaluate newly developed objectives on downstream scenarios. While convenient, this does not take into account the role of architectures which has been shown to be crucial in the supervised learning literature. In this work, we establish extensive empirical evidence showing that a network architecture plays a significant role in SSL. We conduct a large-scale study with over 100 variants of ResNet and MobileNet architectures and evaluate them across 11 downstream scenarios in the SSL setting. We show that there is no one network that performs consistently well across the scenarios. Based on this, we propose to learn not only network weights but also architecture topologies in the SSL regime. We show that "self-supervised architectures" outperform popular handcrafted architectures (ResNet18 and MobileNetV2) while performing competitively with the larger and computationally heavy ResNet50 on major image classification benchmarks (ImageNet-1K, iNat2021, and more). Our results suggest that it is time to consider moving beyond handcrafted architectures in SSL and start thinking about incorporating architecture search into self-supervised learning objectives.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models
Authors:
Mojan Javaheripi,
Gustavo H. de Rosa,
Subhabrata Mukherjee,
Shital Shah,
Tomasz L. Religa,
Caio C. T. Mendes,
Sebastien Bubeck,
Farinaz Koushanfar,
Debadeepta Dey
Abstract:
The Transformer architecture is ubiquitously used as the building block of large-scale autoregressive language models. However, finding architectures with the optimal trade-off between task performance (perplexity) and hardware constraints like peak memory utilization and latency is non-trivial. This is exacerbated by the proliferation of various hardware. We leverage the somewhat surprising empir…
▽ More
The Transformer architecture is ubiquitously used as the building block of large-scale autoregressive language models. However, finding architectures with the optimal trade-off between task performance (perplexity) and hardware constraints like peak memory utilization and latency is non-trivial. This is exacerbated by the proliferation of various hardware. We leverage the somewhat surprising empirical observation that the number of decoder parameters in autoregressive Transformers has a high rank correlation with task performance, irrespective of the architecture topology. This observation organically induces a simple Neural Architecture Search (NAS) algorithm that uses decoder parameters as a proxy for perplexity without need for any model training. The search phase of our training-free algorithm, dubbed Lightweight Transformer Search (LTS), can be run directly on target devices since it does not require GPUs. Using on-target-device measurements, LTS extracts the Pareto-frontier of perplexity versus any hardware performance cost. We evaluate LTS on diverse devices from ARM CPUs to NVIDIA GPUs and two popular autoregressive Transformer backbones: GPT-2 and Transformer-XL. Results show that the perplexity of 16-layer GPT-2 and Transformer-XL can be achieved with up to 1.5x, 2.5x faster runtime and 1.2x, 2.0x lower peak memory utilization. When evaluated in zero and one-shot settings, LTS Pareto-frontier models achieve higher average accuracy compared to the 350M parameter OPT across 14 tasks, with up to 1.6x lower latency. LTS extracts the Pareto-frontier in under 3 hours while running on a commodity laptop. We effectively remove the carbon footprint of hundreds of GPU hours of training during search, offering a strong simple baseline for future NAS methods in autoregressive language modeling.
△ Less
Submitted 17 October, 2022; v1 submitted 3 March, 2022;
originally announced March 2022.
-
Everlasting impact of initial perturbations on first-passage times of non-Markovian random walks
Authors:
N. Levernier,
T. V. Mendes,
O. Bénichou,
R. Voituriez,
T. Guérin
Abstract:
Persistence, defined as the probability that a fluctuating signal has not reached a threshold up to a given observation time, plays a crucial role in the theory of random processes. It quantifies the kinetics of processes as varied as phase ordering, reaction diffusion or interface relaxation dynamics. The fact that persistence can decay algebraically with time with non trivial exponents has trigg…
▽ More
Persistence, defined as the probability that a fluctuating signal has not reached a threshold up to a given observation time, plays a crucial role in the theory of random processes. It quantifies the kinetics of processes as varied as phase ordering, reaction diffusion or interface relaxation dynamics. The fact that persistence can decay algebraically with time with non trivial exponents has triggered a number of experimental and theoretical studies. However, general analytical methods to calculate persistence exponents cannot be applied to the ubiquitous case of non-Markovian systems relaxing transiently after an imposed initial perturbation. Here, we introduce a theoretical framework that enables the non perturbative determination of persistence exponents of $d$-dimensional Gaussian non-Markovian processes with general non stationary dynamics relaxing to a steady state after an initial perturbation. Two prototypical classes of situations are analyzed: either the system is subjected to a temperature quench at initial time, or its past trajectory is assumed to have been observed and thus known. Altogether, our results reveal and quantify, on the basis of Gaussian processes, the deep impact of initial perturbations on first-passage statistics of non-Markovian processes. Our theory covers the case of spatial dimension higher than one, opening the way to characterize non-trivial reaction kinetics for complex systems with non-equilibrium initial conditions.
△ Less
Submitted 10 January, 2022;
originally announced January 2022.
-
Technical debt and agile software development practices and processes: An industry practitioner survey
Authors:
Johannes Holvitie,
Sherlock A. Licorish,
Rodrigo O. Spínola,
Sami Hyrynsalmi,
Stephen G. MacDonell,
Thiago S. Mendes,
Jim Buchan,
Ville Leppänen
Abstract:
Context: Contemporary software development is typically conducted in dynamic, resource-scarce environments that are prone to the accumulation of technical debt. While this general phenomenon is acknowledged, what remains unknown is how technical debt specifically manifests in and affects software processes, and how the software development techniques employed accommodate or mitigate the presence o…
▽ More
Context: Contemporary software development is typically conducted in dynamic, resource-scarce environments that are prone to the accumulation of technical debt. While this general phenomenon is acknowledged, what remains unknown is how technical debt specifically manifests in and affects software processes, and how the software development techniques employed accommodate or mitigate the presence of this debt. Objectives: We sought to draw on practitioner insights and experiences in order to classify the effects of agile method use on technical debt management. We explore the breadth of practitioners' knowledge about technical debt; how technical debt is manifested across the software process; and the perceived effects of common agile software development practices and processes on technical debt. In doing so, we address a research gap in technical debt knowledge and provide novel and actionable managerial recommendations. Method: We designed, tested and executed a multi-national survey questionnaire to address our objectives, receiving 184 responses from practitioners in Brazil, Finland, and New Zealand. Results: Our findings indicate that: 1) Practitioners are aware of technical debt, although, there was under utilization of the concept, 2) Technical debt commonly resides in legacy systems, however, concrete instances of technical debt are hard to conceptualize which makes it problematic to manage, 3) Queried agile practices and processes help to reduce technical debt; particularly, techniques that verify and maintain the structure and clarity of implemented artifacts. Conclusions: The fact that technical debt instances tend to have characteristics in common means that a systematic approach to its management is feasible. However, notwithstanding the positive effects of some agile practices on technical debt management, competing stakeholders' interests remain a concern.(Abridged)
△ Less
Submitted 30 April, 2021;
originally announced April 2021.
-
Adoption and Suitability of Software Development Methods and Practices
Authors:
Sherlock A. Licorish,
Johannes Holvitie,
Sami Hyrynsalmi,
Ville Leppänen,
Rodrigo O. Spínola,
Thiago S. Mendes,
Stephen G. MacDonell,
Jim Buchan
Abstract:
In seeking to complement consultants' and tool vendors' reports, there has been an increasing academic focus on understanding the adoption and use of software development methods and practices. We surveyed practitioners working in Brazil, Finland, and New Zealand in a transnational study to contribute to these efforts. Among our findings we observed that most of the 184 practitioners in our sample…
▽ More
In seeking to complement consultants' and tool vendors' reports, there has been an increasing academic focus on understanding the adoption and use of software development methods and practices. We surveyed practitioners working in Brazil, Finland, and New Zealand in a transnational study to contribute to these efforts. Among our findings we observed that most of the 184 practitioners in our sample focused on a small portfolio of projects that were of short duration. In addition, Scrum and Kanban were used most; however, some practitioners also used conventional methods. Coding Standards, Simple Design and Refactoring were used most by practitioners, and these practices were held to be largely suitable for project and process management. Our evidence points to the need to properly understand and support a wide range of software methods.
△ Less
Submitted 19 March, 2021;
originally announced March 2021.
-
ADAM30 Downregulates APP-Linked Defects Through Cathepsin D Activation in Alzheimer's Disease
Authors:
Florent Letronne,
Geoffroy Laumet,
Anne-Marie Ayral,
Julien Chapuis,
Florie Demiautte,
Mathias Laga,
Michel Vandenberghe,
Nicolas Malmanche,
Florence Leroux,
Fanny Eysert,
Yoann Sottejeau,
Linda Chami,
Amandine Flaig,
Charlotte Bauer,
Pierre Dourlen,
Marie Lesaffre,
Charlotte Delay,
Ludovic Huot,
Julie Dumont,
Elisabeth Werkmeister,
Franck Lafont,
Tiago Mendes,
Franck Hansmannel,
Bart Dermaut,
Benoit Deprez
, et al. (16 additional authors not shown)
Abstract:
Although several ADAMs (A disintegrin-like and metalloproteases) have been shown to contribute to the amy-loid precursor protein (APP) metabolism, the full spectrum of metalloproteases involved in this metabolism remains to be established. Transcriptomic analyses centred on metalloprotease genes unraveled a 50% decrease in ADAM30 expression that inversely correlates with amyloid load in Alzheimer'…
▽ More
Although several ADAMs (A disintegrin-like and metalloproteases) have been shown to contribute to the amy-loid precursor protein (APP) metabolism, the full spectrum of metalloproteases involved in this metabolism remains to be established. Transcriptomic analyses centred on metalloprotease genes unraveled a 50% decrease in ADAM30 expression that inversely correlates with amyloid load in Alzheimer's disease brains. Accordingly, in vitro down-or up-regulation of ADAM30 expression triggered an increase/decrease in A$β$ peptides levels whereas expression of a biologically inactive ADAM30 (ADAM30 mut) did not affect A$β$ secretion. Proteomics/cell-based experiments showed that ADAM30-dependent regulation of APP metabolism required both cathepsin D (CTSD) activation and APP sorting to lysosomes. Accordingly, in Alzheimer-like transgenic mice, neuronal ADAM30 over-expression lowered A$β$42 secretion in neuron primary cultures, soluble A$β$42 and amyloid plaque load levels in the brain and concomitantly enhanced CTSD activity and finally rescued long term potentiation.
△ Less
Submitted 18 June, 2019;
originally announced June 2019.
-
Ghost Sector in Minimal Linear Covariant Gauge
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Orlando Oliveira,
Martin Roelfs,
Paulo J. Silva
Abstract:
We discuss possible definitions of the Faddeev-Popov matrix for the minimal linear covariant gauge on the lattice and present preliminary results for the ghost propagator.
We discuss possible definitions of the Faddeev-Popov matrix for the minimal linear covariant gauge on the lattice and present preliminary results for the ghost propagator.
△ Less
Submitted 2 December, 2018;
originally announced December 2018.
-
Lattice Computation of the Ghost Propagator in Linear Covariant Gauges
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Orlando Oliveira,
Martin Roelfs,
Paulo J. Silva
Abstract:
We discuss the subtleties concerning the lattice computation of the ghost propagator in linear covariant gauges, and present preliminary numerical results.
We discuss the subtleties concerning the lattice computation of the ghost propagator in linear covariant gauges, and present preliminary numerical results.
△ Less
Submitted 28 November, 2018;
originally announced November 2018.
-
Faddeev-Popov Matrix in Linear Covariant Gauge: First Results
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Orlando Oliveira,
Martin Roelfs,
Paulo J. Silva
Abstract:
We discuss a possible definition of the Faddeev-Popov matrix for the minimal linear covariant gauge on the lattice and present first results for the ghost propagator. We consider Yang-Mills theory in four space-time dimensions, for SU(2) and SU(3) gauge groups.
We discuss a possible definition of the Faddeev-Popov matrix for the minimal linear covariant gauge on the lattice and present first results for the ghost propagator. We consider Yang-Mills theory in four space-time dimensions, for SU(2) and SU(3) gauge groups.
△ Less
Submitted 21 September, 2018;
originally announced September 2018.
-
Probing the tensor structure of lattice three-gluon vertex in Landau gauge
Authors:
Milan Vu**ovic,
Tereza Mendes
Abstract:
In this paper we test an approximate method that is often used in lattice studies of the Landau gauge three-gluon vertex. The approximation consists in describing the lattice correlator with tensor bases from the continuum theory. With the help of vertex reconstruction, we show that this "continuum" approach may lead, for general kinematics, to significant errors in vertex tensor representations.…
▽ More
In this paper we test an approximate method that is often used in lattice studies of the Landau gauge three-gluon vertex. The approximation consists in describing the lattice correlator with tensor bases from the continuum theory. With the help of vertex reconstruction, we show that this "continuum" approach may lead, for general kinematics, to significant errors in vertex tensor representations. Such errors are highly unwelcome, as they can lead to wrong quantitative estimates for vertex form factors and related quantities of interest, like the three-gluon running coupling. As a possible solution, we demonstrate numerically and analytically that there exist special kinematic configurations for which the vertex tensor structures can be described exactly on the lattice. For these kinematics, the dimensionless tensor elements are equal to the continuum ones, regardless of the details of the lattice implementation. We ran our simulations for an $SU(2)$ gauge theory in two and three spacetime dimensions, with Wilson and $\mathcal{O}(a^2)$ tree-level improved gauge actions. Our results and conclusions can be straightforwardly generalised to higher dimensions and, with some precautions, to other lattice correlators, like the ghost-gluon, quark-gluon and four-gluon vertices.
△ Less
Submitted 10 January, 2019; v1 submitted 10 July, 2018;
originally announced July 2018.
-
Region-Based Classification of PolSAR Data Using Radial Basis Kernel Functions With Stochastic Distances
Authors:
R. G. Negri,
A. C. Frery,
W. B. Silva,
T. S. G. Mendes,
L. V. Dutra
Abstract:
Region-based classification of PolSAR data can be effectively performed by seeking for the assignment that minimizes a distance between prototypes and segments. Silva et al (2013) used stochastic distances between complex multivariate Wishart models which, differently from other measures, are computationally tractable. In this work we assess the robustness of such approach with respect to errors i…
▽ More
Region-based classification of PolSAR data can be effectively performed by seeking for the assignment that minimizes a distance between prototypes and segments. Silva et al (2013) used stochastic distances between complex multivariate Wishart models which, differently from other measures, are computationally tractable. In this work we assess the robustness of such approach with respect to errors in the training stage, and propose an extension that alleviates such problems. We introduce robustness in the process by incorporating a combination of radial basis kernel functions and stochastic distances with Support Vector Machines (SVM). We consider several stochastic distances between Wishart: Bhatacharyya, Kullback-Leibler, Chi-Square, Rényi, and Hellinger. We perform two case studies with PolSAR images, both simulated and from actual sensors, and different classification scenarios to compare the performance of Minimum Distance and SVM classification frameworks. With this, we model the situation of imperfect training samples. We show that SVM with the proposed kernel functions achieves better performance with respect to Minimum Distance, at the expense of more computational resources and the need of parameter tuning. Code and data are provided for reproducibility.
△ Less
Submitted 7 May, 2018;
originally announced May 2018.
-
Lattice Gluon Propagator and One-Gluon-Exchange Potential
Authors:
Attilio Cucchieri,
Tereza Mendes,
Willian M. Serenone
Abstract:
We consider the interquark potential in the one-gluon-exchange (OGE) approximation, using a fully nonperturbative gluon propagator from large-volume lattice simulations. The resulting VLGP potential is non-confining, showing that the OGE approximation is not sufficient to describe the infrared sector of QCD. Nevertheless, it represents an improvement over the perturbative (Coulomb-like) potential,…
▽ More
We consider the interquark potential in the one-gluon-exchange (OGE) approximation, using a fully nonperturbative gluon propagator from large-volume lattice simulations. The resulting VLGP potential is non-confining, showing that the OGE approximation is not sufficient to describe the infrared sector of QCD. Nevertheless, it represents an improvement over the perturbative (Coulomb-like) potential, since it allows the description of a few low-lying bound states of charmonium and bottomonium. In order to achieve a better description of these spectra, we add to VLGP a linearly growing term. The obtained results are comparable to the corresponding ones in the Cornell-potential case. As a byproduct of our study, we estimate the interquark distance for the considered charmonium and bottomonium states.
△ Less
Submitted 26 April, 2017;
originally announced April 2017.
-
Bloch Waves in Minimal Landau Gauge and the Infinite-Volume Limit of Lattice Gauge Theory
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
By exploiting the similarity between Bloch's theorem for electrons in crystalline solids and the problem of Landau gauge-fixing in Yang-Mills theory on a "replicated" lattice, one is able to obtain essentially infinite-volume results from numerical simulations performed on a relatively small lattice. This approach, proposed by D. Zwanziger in \cite{Zwanziger:1993dh}, corresponds to taking the infi…
▽ More
By exploiting the similarity between Bloch's theorem for electrons in crystalline solids and the problem of Landau gauge-fixing in Yang-Mills theory on a "replicated" lattice, one is able to obtain essentially infinite-volume results from numerical simulations performed on a relatively small lattice. This approach, proposed by D. Zwanziger in \cite{Zwanziger:1993dh}, corresponds to taking the infinite-volume limit for Landau-gauge field configurations in two steps: firstly for the gauge transformation alone, while kee** the lattice volume finite, and secondly for the gauge-field configuration itself. The solutions to the gauge-fixing condition are then given in terms of Bloch waves. Applying the method to data from Monte Carlo simulations of pure SU(2) gauge theory in two and three space-time dimensions, we are able to evaluate the Landau-gauge gluon propagator for lattices of linear extent up to sixteen times larger than that of the simulated lattice. The approach is reminiscent of Fisher and Ruelle's construction of the thermodynamic limit in classical statistical mechanics.
△ Less
Submitted 5 December, 2016;
originally announced December 2016.
-
Further Study of BRST-Symmetry Breaking on the Lattice
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We evaluate the so-called Bose-ghost propagator Q(p^2) for SU(2) gauge theory in minimal Landau gauge, considering lattice volumes up to 120^4 and physical lattice extents up to 13.5 f. In particular, we investigate discretization effects, as well as the infinite-volume and continuum limits. We recall that a nonzero value for this quantity provides direct evidence of BRST-symmetry breaking, relate…
▽ More
We evaluate the so-called Bose-ghost propagator Q(p^2) for SU(2) gauge theory in minimal Landau gauge, considering lattice volumes up to 120^4 and physical lattice extents up to 13.5 f. In particular, we investigate discretization effects, as well as the infinite-volume and continuum limits. We recall that a nonzero value for this quantity provides direct evidence of BRST-symmetry breaking, related to the restriction of the functional measure to the first Gribov region. Our results show that the prediction (from cluster decomposition) for Q(p^2) in terms of gluon and ghost propagators is better satisfied as the continuum limit is approached.
△ Less
Submitted 15 November, 2016;
originally announced November 2016.
-
Impurities near an Antiferromagnetic-Singlet Quantum Critical Point
Authors:
T. Mendes,
N. Costa,
G. Batrouni,
N. Curro,
R. R. dos Santos,
T. Paiva,
R. T. Scalettar
Abstract:
Heavy fermion systems, and other strongly correlated electron materials, often exhibit a competition between antiferromagnetic (AF) and singlet ground states. Using exact Quantum Monte Carlo (QMC) simulations, we examine the effect of impurities in the vicinity of such AF- singlet quantum critical points, through an appropriately defined impurity susceptibility, $χ_{imp}$. Our key finding is a con…
▽ More
Heavy fermion systems, and other strongly correlated electron materials, often exhibit a competition between antiferromagnetic (AF) and singlet ground states. Using exact Quantum Monte Carlo (QMC) simulations, we examine the effect of impurities in the vicinity of such AF- singlet quantum critical points, through an appropriately defined impurity susceptibility, $χ_{imp}$. Our key finding is a connection, within a single calculational framework, between AF domains induced on the singlet side of the transition, and the behavior of the nuclear magnetic resonance (NMR) relaxation rate $1/T_1$. We show that local NMR measurements provide a diagnostic for the location of the QCP which agrees remarkably well with the vanishing of the AF order parameter and large values of $χ_{imp}$. We connect our results with experiments on Cd-doped CeCoIn$_5$.
△ Less
Submitted 20 July, 2016;
originally announced July 2016.
-
Numerical Evaluation of the Bose-Ghost Propagator in Minimal Landau Gauge on the Lattice
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We present numerical details of the evaluation of the so-called Bose-ghost propagator in lattice minimal Landau gauge, for the SU(2) case in four Euclidean dimensions. This quantity has been proposed as a carrier of the confining force in the Gribov-Zwanziger approach and, as such, its infrared behavior could be relevant for the understanding of color confinement in Yang-Mills theories. Also, its…
▽ More
We present numerical details of the evaluation of the so-called Bose-ghost propagator in lattice minimal Landau gauge, for the SU(2) case in four Euclidean dimensions. This quantity has been proposed as a carrier of the confining force in the Gribov-Zwanziger approach and, as such, its infrared behavior could be relevant for the understanding of color confinement in Yang-Mills theories. Also, its nonzero value can be interpreted as direct evidence of BRST-symmetry breaking, which is induced when restricting the functional measure to the first Gribov region Omega. Our simulations are done for lattice volumes up to 120^4 and for physical lattice extents up to 13.5 fm. We investigate the infinite-volume and continuum limits.
△ Less
Submitted 24 April, 2016;
originally announced April 2016.
-
Modeling the Landau-Gauge Ghost Propagator in 2, 3 and 4 Space-Time Dimensions
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Nele Vandersickel
Abstract:
We present an analytic description of numerical results for the ghost propagator G(p^2) in minimal Landau gauge on the lattice. The data were produced in the SU(2) case using the largest lattice volumes to date, for d = 2, 3 and 4 space-time dimensions. Our proposed form for G(p^2) is derived from the one-loop relation between ghost and gluon propagators, considering a tree-level ghost-gluon verte…
▽ More
We present an analytic description of numerical results for the ghost propagator G(p^2) in minimal Landau gauge on the lattice. The data were produced in the SU(2) case using the largest lattice volumes to date, for d = 2, 3 and 4 space-time dimensions. Our proposed form for G(p^2) is derived from the one-loop relation between ghost and gluon propagators, considering a tree-level ghost-gluon vertex and our previously obtained gluon-propagator results \cite{Cucchieri:2011ig}. Although this one-loop expression is not a good description of the data, it leads to a one-parameter fit of our ghost-propagator data with a generally good value of χ^2/dof, comparable to other fitting forms used in the literature. At the same time, we present a simple parametrization of the difference between the lattice data and the one-loop predictions.
△ Less
Submitted 4 February, 2016;
originally announced February 2016.
-
Stellar models simulating the disk-locking mechanism and the evolutionary history of the Orion Nebula cluster and NGC2264
Authors:
N. R. Landin,
L. T. S. Mendes,
L. P. R. Vaz,
S. H. P. Alencar
Abstract:
Rotational evolution in young stars is described by pMS evolutionary tracks including rotation, conservation of angular momentum (AM), and simulations of disk-locking (DL). By assuming that DL is the regulation mechanism for the stellar angular velocity during the early stages of pMS, we use our models and observational data to constrain disk lifetimes (Tdisk) of a sample of low-mass stars in the…
▽ More
Rotational evolution in young stars is described by pMS evolutionary tracks including rotation, conservation of angular momentum (AM), and simulations of disk-locking (DL). By assuming that DL is the regulation mechanism for the stellar angular velocity during the early stages of pMS, we use our models and observational data to constrain disk lifetimes (Tdisk) of a sample of low-mass stars in the ONC and NGC2264. The period distributions of the ONC and NGC2264 are bimodal and depend on the stellar mass. To follow the rotational evolution of these two clusters' stars, we generated some sets of evolutionary tracks. We assumed that the evolution of fast rotators can be modeled by considering conservation of AM during all stages and of moderate rotators by considering conservation of angular velocity during the first stages of evolution. With these models we estimate a mass and an age for all stars. For the ONC, we assume that the secondary peak in the period distribution is due to high-mass objects locked in their disks, with a locking period (Plock) of ~8 days. For NGC2264 we make two hypotheses: (1) the stars in the secondary peak are locked with Plock=5 days, and (2) NGC2264 is in a later stage in the rotational evolution (this implies in a DL scenario with Plock=8 days, a Tdisk of 1 Myr and, after that, constant AM evolution). We simulated the period distribution of NGC2264 when its mean age was 1 Myr. Dichotomy and bimodality appear in the simulated distribution, presenting one peak at 2 days and another one at 5-7 days, indicating that the assumption of Plock=8 days is plausible. Our hypotheses are compared with observational disk diagnoses available in the literature. DL models with Plock=8 days and 0.2 Myr<=Tdisk<=3 Myr are consistent with observed periods of moderate rotators of the ONC. For NGC2264, hyphotesis 2 is the more promising explanation for its period distribution.
△ Less
Submitted 7 December, 2015;
originally announced December 2015.
-
Vision-Based Road Detection using Contextual Blocks
Authors:
Caio César Teodoro Mendes,
Vincent Frémont,
Denis Fernando Wolf
Abstract:
Road detection is a fundamental task in autonomous navigation systems. In this paper, we consider the case of monocular road detection, where images are segmented into road and non-road regions. Our starting point is the well-known machine learning approach, in which a classifier is trained to distinguish road and non-road regions based on hand-labeled images. We proceed by introducing the use of…
▽ More
Road detection is a fundamental task in autonomous navigation systems. In this paper, we consider the case of monocular road detection, where images are segmented into road and non-road regions. Our starting point is the well-known machine learning approach, in which a classifier is trained to distinguish road and non-road regions based on hand-labeled images. We proceed by introducing the use of "contextual blocks" as an efficient way of providing contextual information to the classifier. Overall, the proposed methodology, including its image feature selection and classifier, was conceived with computational cost in mind, leaving room for optimized implementations. Regarding experiments, we perform a sensible evaluation of each phase and feature subset that composes our system. The results show a great benefit from using contextual blocks and demonstrate their computational efficiency. Finally, we submit our results to the KITTI road detection benchmark achieving scores comparable with state of the art methods.
△ Less
Submitted 3 September, 2015;
originally announced September 2015.
-
Heavy-Quarkonium Potential from the Lattice Gluon Propagator
Authors:
Willian M. Serenone,
Attilio Cucchieri,
Tereza Mendes
Abstract:
We consider the potential-model approach for obtaining the spectrum of charmonium and bottomonium, replacing the usual gluon propagator by one obtained from lattice simulations. The resulting spectra are compared to the corresponding ones in the Cornell-potential case. We also estimate the interquark distance in both cases.
We consider the potential-model approach for obtaining the spectrum of charmonium and bottomonium, replacing the usual gluon propagator by one obtained from lattice simulations. The resulting spectra are compared to the corresponding ones in the Cornell-potential case. We also estimate the interquark distance in both cases.
△ Less
Submitted 25 May, 2015;
originally announced May 2015.
-
Evidence of BRST-Symmetry Breaking in Lattice Minimal Landau Gauge
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Nele Vandersickel
Abstract:
By evaluating the so-called Bose-ghost propagator, we present the first numerical evidence of BRST-symmetry breaking for Yang-Mills theory in minimal Landau gauge, i.e. due to the restriction of the functional integration to the first Gribov region in the Gribov-Zwanziger approach. Our data are well described by a simple fitting function, which can be related to a massive gluon propagator in combi…
▽ More
By evaluating the so-called Bose-ghost propagator, we present the first numerical evidence of BRST-symmetry breaking for Yang-Mills theory in minimal Landau gauge, i.e. due to the restriction of the functional integration to the first Gribov region in the Gribov-Zwanziger approach. Our data are well described by a simple fitting function, which can be related to a massive gluon propagator in combination with an infrared-free (Faddeev-Popov) ghost propagator. As a consequence, the Bose-ghost propagator, which has been proposed as a carrier of the confining force in minimal Landau gauge, displays a 1/p^4 singularity in the infrared limit.
△ Less
Submitted 30 October, 2014;
originally announced October 2014.
-
BRST-Symmetry Breaking and Bose-Ghost Propagator in Lattice Minimal Landau Gauge
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Nele Vandersickel
Abstract:
The Bose-ghost propagator has been proposed as a carrier of the confining force in Yang-Mills theories in minimal Landau gauge. We present the first numerical evaluation of this propagator, using lattice simulations for the SU(2) gauge group in the scaling region. Our data are well described by a simple fitting function, which is compatible with an infrared-enhanced Bose-ghost propagator. This fun…
▽ More
The Bose-ghost propagator has been proposed as a carrier of the confining force in Yang-Mills theories in minimal Landau gauge. We present the first numerical evaluation of this propagator, using lattice simulations for the SU(2) gauge group in the scaling region. Our data are well described by a simple fitting function, which is compatible with an infrared-enhanced Bose-ghost propagator. This function can also be related to a massive gluon propagator in combination with an infrared-free (Faddeev-Popov) ghost propagator. Since the Bose-ghost propagator can be written as the vacuum expectation value of a BRST-exact quantity and should therefore vanish in a BRST-invariant theory, our results provide the first numerical manifestation of BRST-symmetry breaking due to restriction of gauge-configuration space to the Gribov region.
△ Less
Submitted 7 May, 2014;
originally announced May 2014.
-
SU(2) Lattice Gluon Propagator and Potential Models
Authors:
Willian M. Serenone,
Attilio Cucchieri,
Tereza Mendes
Abstract:
We study the bottomonium spectrum using a potential model. Our potential incorporates lattice results for the gluon propagator, obtained from simulations of pure SU(2) gauge theory in Landau gauge. The mass of the bottom quark is left as a free parameter. The resulting spectrum is compared to the case of the Coulomb plus Linear (or Cornell) Potential.
We study the bottomonium spectrum using a potential model. Our potential incorporates lattice results for the gluon propagator, obtained from simulations of pure SU(2) gauge theory in Landau gauge. The mass of the bottom quark is left as a free parameter. The resulting spectrum is compared to the case of the Coulomb plus Linear (or Cornell) Potential.
△ Less
Submitted 29 April, 2014;
originally announced April 2014.
-
Systematic Effects at Criticality for the SU(2)-Landau-Gauge Gluon Propagator
Authors:
Tereza Mendes,
Attilio Cucchieri
Abstract:
We analyze data from finite-temperature simulations of the gluon propagator in SU(2) Landau gauge on large lattices. We argue that the singular behavior of this quantity around the deconfinement transition, seen in several previous studies, is a lattice artifact.
We analyze data from finite-temperature simulations of the gluon propagator in SU(2) Landau gauge on large lattices. We argue that the singular behavior of this quantity around the deconfinement transition, seen in several previous studies, is a lattice artifact.
△ Less
Submitted 27 January, 2014;
originally announced January 2014.
-
Crossing the Gribov horizon: an unconventional study of geometric properties of gauge-configuration space in Landau gauge
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We prove a lower bound for the smallest nonzero eigenvalue of the Landau-gauge Faddeev-Popov matrix in Yang-Mills theories. The bound is written in terms of the smallest nonzero momentum on the lattice and of a parameter characterizing the geometry of the first Gribov region. This allows a simple and intuitive description of the infinite-volume limit in the ghost sector. In particular, we show how…
▽ More
We prove a lower bound for the smallest nonzero eigenvalue of the Landau-gauge Faddeev-Popov matrix in Yang-Mills theories. The bound is written in terms of the smallest nonzero momentum on the lattice and of a parameter characterizing the geometry of the first Gribov region. This allows a simple and intuitive description of the infinite-volume limit in the ghost sector. In particular, we show how nonperturbative effects may be quantified by the rate at which typical thermalized and gauge-fixed configurations approach the Gribov horizon. Our analytic results are verified numerically in the SU(2) case through an informal, free and easy, approach. This analysis provides the first concrete explanation of why the so-called scaling solution of the Dyson-Schwinger equations is not observed in lattice studies.
△ Less
Submitted 19 November, 2013;
originally announced November 2013.
-
Using river locks to teach hydrodynamic concepts
Authors:
Vagson L. Carvalho-Santos,
Thales C. Mendes,
Enisvaldo C. Silva,
Márcio L. Rios,
Anderson A P Silva
Abstract:
In this work, the use of a river lock as a non-formal setting for teaching Q2 hydrodynamical concepts is proposed. In particular, we describe the operation of a river lock situated at the Sobradinho dam, on the São Francisco River (Brazil). A model to represent and to analyse the dynamics of river lock operation is presented and we derive the dynamical equations for the rising of the water column…
▽ More
In this work, the use of a river lock as a non-formal setting for teaching Q2 hydrodynamical concepts is proposed. In particular, we describe the operation of a river lock situated at the Sobradinho dam, on the São Francisco River (Brazil). A model to represent and to analyse the dynamics of river lock operation is presented and we derive the dynamical equations for the rising of the water column as an example to understand the Euler equation. Furthermore, with this activity, we enable the integration of content initially introduced in the classroom with practical applications, thereby allowing the association of physical themes to content relevant in disciplines such as history and geography. In addition, experiences of this kind enable teachers to talk about the environmental and social impacts caused by the construction of a dam and, consequently, a crossover of concepts has been made possible, leading to more meaningful learning for the students.
△ Less
Submitted 26 September, 2013;
originally announced September 2013.
-
Ghost sector and geometry in minimal Landau gauge: further constraining the infinite-volume limit
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We present improved upper and lower bounds for the momentum-space ghost propagator of Yang-Mills theories in terms of the two smallest nonzero eigenvalues (and their corresponding eigenvectors) of the Faddeev-Popov matrix. These results are verified using data from four-dimensional numerical simulations of SU(2) lattice gauge theory in minimal Landau gauge at beta = 2.2, for lattice sides N = 16,…
▽ More
We present improved upper and lower bounds for the momentum-space ghost propagator of Yang-Mills theories in terms of the two smallest nonzero eigenvalues (and their corresponding eigenvectors) of the Faddeev-Popov matrix. These results are verified using data from four-dimensional numerical simulations of SU(2) lattice gauge theory in minimal Landau gauge at beta = 2.2, for lattice sides N = 16, 32, 48 and 64. Gribov-copy effects are discussed by considering four different sets of numerical minima. We then present a lower bound for the smallest nonzero eigenvalue of the Faddeev-Popov matrix in terms of the smallest nonzero momentum on the lattice and of a parameter characterizing the geometry of the first Gribov region $Ω$. This allows a simple and intuitive description of the infinite-volume limit in the ghost sector. In particular, we show how nonperturbative effects may be quantified by the rate at which typical thermalized and gauge-fixed configurations approach the boundary of Omega, known as the first Gribov horizon. As a result, a simple and concrete explanation emerges for why lattice studies do not observe an enhanced ghost propagator in the deep infrared limit. Most of the simulations have been performed on the Blue Gene/P--IBM supercomputer shared by Rice University and São Paulo University.
△ Less
Submitted 19 November, 2013; v1 submitted 6 August, 2013;
originally announced August 2013.
-
The Minimal Landau Background Gauge on the Lattice
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We present the first numerical implementation of the minimal Landau background gauge for Yang-Mills theory on the lattice. Our approach is a simple generalization of the usual minimal Landau gauge and is formulated for general SU(N) gauge group. We also report on preliminary tests of the method in the four-dimensional SU(2) case, using different background fields. Our tests show that the convergen…
▽ More
We present the first numerical implementation of the minimal Landau background gauge for Yang-Mills theory on the lattice. Our approach is a simple generalization of the usual minimal Landau gauge and is formulated for general SU(N) gauge group. We also report on preliminary tests of the method in the four-dimensional SU(2) case, using different background fields. Our tests show that the convergence of the numerical minimization process is comparable to the case of a null background. The uniqueness of the minimizing functional employed is briefly discussed.
△ Less
Submitted 1 April, 2012;
originally announced April 2012.
-
Ghost dissection
Authors:
David Dudal,
Nele Vandersickel,
Attilio Cucchieri,
Tereza Mendes
Abstract:
We show that a necessary condition to have a positive Landau-gauge ghost propagator in d=2 Yang-Mills theories is a vanishing zero-momentum gluon propagator. Our proof is based on a careful scrutinizing of the ghost Dyson-Schwinger equation. Said otherwise, the Gribov no-pole condition forbids the occurrence of the "decoupling/massive" gluon propagator solution in d=2, in sharp contrast with d=3 a…
▽ More
We show that a necessary condition to have a positive Landau-gauge ghost propagator in d=2 Yang-Mills theories is a vanishing zero-momentum gluon propagator. Our proof is based on a careful scrutinizing of the ghost Dyson-Schwinger equation. Said otherwise, the Gribov no-pole condition forbids the occurrence of the "decoupling/massive" gluon propagator solution in d=2, in sharp contrast with d=3 and 4, but consistent with state-of-the-art lattice data.
△ Less
Submitted 10 February, 2012;
originally announced February 2012.
-
Massive gluon propagator at zero and finite temperature
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Nele Vandersickel
Abstract:
We report on our study of the infrared gluon propagator for SU(2) lattice gauge theory using large lattice volumes. The observed massive behavior is discussed from the point of view of analytic predictions for the zero-temperature case. Such a behavior is still present as the temperature is switched on, but manifests itself differently in the electric and magnetic channels.
We report on our study of the infrared gluon propagator for SU(2) lattice gauge theory using large lattice volumes. The observed massive behavior is discussed from the point of view of analytic predictions for the zero-temperature case. Such a behavior is still present as the temperature is switched on, but manifests itself differently in the electric and magnetic channels.
△ Less
Submitted 3 February, 2012;
originally announced February 2012.
-
Electric and Magnetic Screening Masses around the Deconfinement Transition
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We report on the status of our study of gluon propagators and screening masses around the de- confining transition for pure SU(2) gauge theory in Landau gauge.
We report on the status of our study of gluon propagators and screening masses around the de- confining transition for pure SU(2) gauge theory in Landau gauge.
△ Less
Submitted 29 January, 2012;
originally announced January 2012.
-
Modeling the Gluon Propagator in Landau Gauge: Lattice Estimates of Pole Masses and Dimension-Two Condensates
Authors:
Attilio Cucchieri,
David Dudal,
Tereza Mendes,
Nele Vandersickel
Abstract:
We present an analytic description of numerical results for the Landau-gauge SU(2) gluon propagator D(p^2), obtained from lattice simulations (in the scaling region) for the largest lattice sizes to date, in d = 2, 3 and 4 space-time dimensions. Fits to the gluon data in 3d and in 4d show very good agreement with the tree-level prediction of the Refined Gribov-Zwanziger (RGZ) framework, supporting…
▽ More
We present an analytic description of numerical results for the Landau-gauge SU(2) gluon propagator D(p^2), obtained from lattice simulations (in the scaling region) for the largest lattice sizes to date, in d = 2, 3 and 4 space-time dimensions. Fits to the gluon data in 3d and in 4d show very good agreement with the tree-level prediction of the Refined Gribov-Zwanziger (RGZ) framework, supporting a massive behavior for D(p^2) in the infrared limit. In particular, we investigate the propagator's pole structure and provide estimates of the dynamical mass scales that can be associated with dimension-two condensates in the theory. In the 2d case, fitting the data requires a non-integer power of the momentum p in the numerator of the expression for D(p^2). In this case, an infinite-volume-limit extrapolation gives D(0) = 0. Our analysis suggests that this result is related to a particular symmetry in the complex-pole structure of the propagator and not to purely imaginary poles, as would be expected in the original Gribov-Zwanziger scenario.
△ Less
Submitted 9 November, 2011;
originally announced November 2011.
-
Yang-Mills Theory in lambda-Gauges
Authors:
Axel Maas,
Tereza Mendes,
Stefan Olejnik
Abstract:
The gauge-independent phenomenon of color confinement in Yang-Mills theory manifests itself differently in different gauges. Therefore, the gauge dependence of quantities related to the infrared structure of the theory becomes important for understanding the confinement mechanism. Particularly useful are classes of gauges that are controlled by a single gauge parameter. We present results on propa…
▽ More
The gauge-independent phenomenon of color confinement in Yang-Mills theory manifests itself differently in different gauges. Therefore, the gauge dependence of quantities related to the infrared structure of the theory becomes important for understanding the confinement mechanism. Particularly useful are classes of gauges that are controlled by a single gauge parameter. We present results on propagators and the color-Coulomb potential for the so-called lambda-gauges, which interpolate between the (minimal) Landau gauge and the (minimal complete) Coulomb gauge. Results are reported for the SU(2) lattice gauge theory in three and four space-time dimensions. We investigate especially intermediate and low momenta. We find a continuous evolution of all quantities with the gauge parameter, except at zero four-momentum.
△ Less
Submitted 12 August, 2011;
originally announced August 2011.
-
Electric and magnetic Landau-gauge gluon propagators in finite-temperature SU(2) gauge theory
Authors:
Attilio Cucchieri,
Tereza Mendes
Abstract:
We perform lattice simulations in pure-SU(2) Yang-Mills theory to investigate how the infrared behavior of electric and magnetic gluon propagators in Landau gauge is affected by temperature. We consider the largest lattices to date, in an attempt to keep systematic errors under control. Electric and magnetic screening masses are calculated through an Ansatz from the zero-temperature case, based on…
▽ More
We perform lattice simulations in pure-SU(2) Yang-Mills theory to investigate how the infrared behavior of electric and magnetic gluon propagators in Landau gauge is affected by temperature. We consider the largest lattices to date, in an attempt to keep systematic errors under control. Electric and magnetic screening masses are calculated through an Ansatz from the zero-temperature case, based on complex-conjugate poles for the momentum-space propagators. As recently reported in [1], we find good fits to the proposed form at all temperatures considered, with different ratios of real to imaginary part of the pole masses for the longitudinal (electric) and transverse (magnetic) propagators. The behavior of the magnetic propagator D_T(p) is in agreement with the dimensional-reduction picture, showing infrared suppression (with a turnover in momentum) and violation of spectral positivity at all nonzero temperatures considered. The longitudinal propagator D_L(p) appears to reach a plateau at small momenta and is subject to severe finite-Nt effects around the critical temperature Tc. As a consequence, only lattices with temporal extent Nt > 8 seem to be free from systematic errors. After these errors are removed, the infrared-plateau value is considerably reduced around the transition and the sharp peak observed previously for this quantity at Tc is no longer present. The resulting infrared behavior for D_L(p) at Tc is essentially the same as for 0.5Tc . An investigation of the temperature range between 0.5Tc and Tc reveals that a less pronounced (finite) peak may occur at smaller temperatures, e.g. T ~ 0.9Tc.
△ Less
Submitted 1 May, 2011;
originally announced May 2011.
-
Gluon Propagators in Linear Covariant Gauge
Authors:
Attilio Cucchieri,
Tereza Mendes,
Gilberto M. Nakamura,
Elton M. S. Santos
Abstract:
The implementation of the linear covariant gauge on the lattice faces a conceptual problem: using the standard compact discretization, the gluon field is bounded, while the four-divergence of the gluon field satisfies a Gaussian distribution, i.e. it is unbounded. This can give rise to convergence problems when a numerical implementation is attempted. In order to overcome this problem, one can use…
▽ More
The implementation of the linear covariant gauge on the lattice faces a conceptual problem: using the standard compact discretization, the gluon field is bounded, while the four-divergence of the gluon field satisfies a Gaussian distribution, i.e. it is unbounded. This can give rise to convergence problems when a numerical implementation is attempted. In order to overcome this problem, one can use different discretizations for the gluon field or consider an SU(N_c) group with sufficiently large N_c. One can also consider small values of the gauge parameter xi and study numerically the limiting case of xi \to 0, i.e. the Landau gauge. These different approaches will be discussed here.
△ Less
Submitted 25 February, 2011;
originally announced February 2011.
-
Nonperturbative HQET at Order $1/m$
Authors:
Benoit Blossier,
Georg von Hippel,
Nicolas Garron,
Tereza Mendes
Abstract:
We summarize first results for masses and decay constants of bottom-strange (pseudo-scalar and vector) mesons from nonperturbatively renormalized heavy-quark effective theory (HQET), using lattice-QCD simulations in the quenched approximation.
We summarize first results for masses and decay constants of bottom-strange (pseudo-scalar and vector) mesons from nonperturbatively renormalized heavy-quark effective theory (HQET), using lattice-QCD simulations in the quenched approximation.
△ Less
Submitted 30 January, 2011;
originally announced January 2011.
-
Handling Excited States on the Lattice: The GEVP Method
Authors:
Tereza Mendes
Abstract:
High-precision calculations of hadron spectroscopy are a crucial task for Lattice QCD. State-of-the-art techniques are needed to disentangle the contributions from different energy states, such as solving the generalized eigenvalue problem (GEVP) for zero-momentum hadron correlators in an efficient way. We review the method and discuss its application in the determination of the $B_s$-meson spectr…
▽ More
High-precision calculations of hadron spectroscopy are a crucial task for Lattice QCD. State-of-the-art techniques are needed to disentangle the contributions from different energy states, such as solving the generalized eigenvalue problem (GEVP) for zero-momentum hadron correlators in an efficient way. We review the method and discuss its application in the determination of the $B_s$-meson spectrum using (quenched) nonperturbative HQET at order $1/m_b$.
△ Less
Submitted 30 January, 2011;
originally announced January 2011.