Search | arXiv e-print repository

arXiv:2402.19088 [pdf, other]

Survey in Characterization of Semantic Change

Authors: Jader Martins Camboim de Sá, Marcos Da Silveira, Cédric Pruski

Abstract: Live languages continuously evolve to integrate the cultural change of human societies. This evolution manifests through neologisms (new words) or \textbf{semantic changes} of words (new meaning to existing words). Understanding the meaning of words is vital for interpreting texts coming from different cultures (regionalism or slang), domains (e.g., technical terms), or periods. In computer scienc… ▽ More Live languages continuously evolve to integrate the cultural change of human societies. This evolution manifests through neologisms (new words) or \textbf{semantic changes} of words (new meaning to existing words). Understanding the meaning of words is vital for interpreting texts coming from different cultures (regionalism or slang), domains (e.g., technical terms), or periods. In computer science, these words are relevant to computational linguistics algorithms such as translation, information retrieval, question answering, etc. Semantic changes can potentially impact the quality of the outcomes of these algorithms. Therefore, it is important to understand and characterize these changes formally. The study of this impact is a recent problem that has attracted the attention of the computational linguistics community. Several approaches propose methods to detect semantic changes with good precision, but more effort is needed to characterize how the meaning of words changes and to reason about how to reduce the impact of semantic change. This survey provides an understandable overview of existing approaches to the \textit{characterization of semantic changes} and also formally defines three classes of characterizations: if the meaning of a word becomes more general or narrow (change in dimension) if the word is used in a more pejorative or positive/ameliorated sense (change in orientation), and if there is a trend to use the word in a, for instance, metaphoric or metonymic context (change in relation). We summarized the main aspects of the selected publications in a table and discussed the needs and trends in the research activities on semantic change characterization. △ Less

Submitted 11 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

arXiv:2312.13909 [pdf, ps, other]

Zig-zag deformations of toric quiver gauge theories. Part I: reflexive polytopes

Authors: Stefano Cremonesi, José Sá

Abstract: We study one-parameter families of $U(1)^2$ preserving deformations relating pairs of toric quiver gauge theories on D-branes probing local toric (pseudo) del Pezzo surfaces. The superpotential deformations are defined by zig-zag paths in the brane tiling and are non-trivial in the chiral ring if the geometry has a non-isolated singularity. In the dual $(p,q)$ web, the deformation is realized as a… ▽ More We study one-parameter families of $U(1)^2$ preserving deformations relating pairs of toric quiver gauge theories on D-branes probing local toric (pseudo) del Pezzo surfaces. The superpotential deformations are defined by zig-zag paths in the brane tiling and are non-trivial in the chiral ring if the geometry has a non-isolated singularity. In the dual $(p,q)$ web, the deformation is realized as a Hanany-Witten move that reverses a semi-infinite fivebrane. We use these deformations to find RG flows between 4d $\mathcal{N}=1$ SCFTs on D3-branes probing local toric (pseudo) del Pezzo surfaces of the same degree, and briefly comment on the interpretation for BPS quivers of rank one 5d SCFTs on $S^1$. △ Less

Submitted 14 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: 47 pages + 2 appendices, 23 figures; v2: typos corrected, references added, sections 1 and 3 reworded for clarity and reference to prior work; v3: minor changes, version accepted by JHEP

arXiv:2308.01849 [pdf, other]

Curricular Transfer Learning for Sentence Encoded Tasks

Authors: Jader Martins Camboim de Sá, Matheus Ferraroni Sanches, Rafael Roque de Souza, Júlio Cesar dos Reis, Leandro Aparecido Villas

Abstract: Fine-tuning language models in a downstream task is the standard approach for many state-of-the-art methodologies in the field of NLP. However, when the distribution between the source task and target task drifts, \textit{e.g.}, conversational environments, these gains tend to be diminished. This article proposes a sequence of pre-training steps (a curriculum) guided by "data hacking" and grammar… ▽ More Fine-tuning language models in a downstream task is the standard approach for many state-of-the-art methodologies in the field of NLP. However, when the distribution between the source task and target task drifts, \textit{e.g.}, conversational environments, these gains tend to be diminished. This article proposes a sequence of pre-training steps (a curriculum) guided by "data hacking" and grammar analysis that allows further gradual adaptation between pre-training distributions. In our experiments, we acquire a considerable improvement from our method compared to other known pre-training approaches for the MultiWoZ task. △ Less

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2202.08970 [pdf, other]

doi 10.1140/epja/s10050-022-00750-6

Status and initial physics performance studies of the MPD experiment at NICA

Authors: MPD Collaboration, V. Abgaryan, R. Acevedo Kado, S. V. Afanasyev, G. N. Agakishiev, E. Alpatov, G. Altsybeev, M. Alvarado Hernández, S. V. Andreeva, T. V. Andreeva, E. V. Andronov, N. V. Anfimov, A. A. Aparin, V. I. Astakhov, E. Atkin, T. Aushev, G. S. Averichev, A. V. Averyanov, A. Ayala, V. A. Babkin, T. Babutsidze, I. A. Balashov, A. Bancer, M. Yu. Barabanov, D. A. Baranov , et al. (454 additional authors not shown)

Abstract: The Nuclotron-base Ion Collider fAcility (NICA) is under construction at the Joint Institute for Nuclear Research (JINR), with commissioning of the facility expected in late 2022. The Multi-Purpose Detector (MPD) has been designed to operate at NICA and its components are currently in production. The detector is expected to be ready for data taking with the first beams from NICA. This document pro… ▽ More The Nuclotron-base Ion Collider fAcility (NICA) is under construction at the Joint Institute for Nuclear Research (JINR), with commissioning of the facility expected in late 2022. The Multi-Purpose Detector (MPD) has been designed to operate at NICA and its components are currently in production. The detector is expected to be ready for data taking with the first beams from NICA. This document provides an overview of the landscape of the investigation of the QCD phase diagram in the region of maximum baryonic density, where NICA and MPD will be able to provide significant and unique input. It also provides a detailed description of the MPD set-up, including its various subsystems as well as its support and computing infrastructures. Selected performance studies for particular physics measurements at MPD are presented and discussed in the context of existing data and theoretical expectations. △ Less

Submitted 16 February, 2022; originally announced February 2022.

Comments: 53 pages, 68 figures, submitted as a Review article to EPJA

Journal ref: Eur. Phys. J. A 58, 140 (2022)

arXiv:2101.07158 [pdf]

DECT-2020 New Radio: The Next Step Towards 5G Massive Machine-Type Communications

Authors: Roman Kovalchukov, Dmitri Moltchanov, Juho Pirskanen, Joonas Sae, Jussi Numminen, Yevgeni Koucheryavy, Mikko Valkama

Abstract: Massive machine type communications (mMTC) is one of the cornerstone services that have to be supported by 5G systems. 3GPP has already introduced LTE-M and NB-IoT, often referred to as cellular IoT, in 3GPP Releases 13, 14, and 15 and submitted these technologies as part of 3GPP IMT-2020 (i.e., 5G) technology submission to ITU-R. Even though NB-IoT and LTE-M have shown to satisfy 5G mMTC requirem… ▽ More Massive machine type communications (mMTC) is one of the cornerstone services that have to be supported by 5G systems. 3GPP has already introduced LTE-M and NB-IoT, often referred to as cellular IoT, in 3GPP Releases 13, 14, and 15 and submitted these technologies as part of 3GPP IMT-2020 (i.e., 5G) technology submission to ITU-R. Even though NB-IoT and LTE-M have shown to satisfy 5G mMTC requirements defined by ITU-R, it is expected that these cellular IoT solutions will not address all aspects of IoT and ongoing digitalization, including the support for direct communication between "things" with flexible deployments, different business models, as well as support for even higher node densities and enhanced coverage. In this paper, we introduce the DECT-2020 standard recently published by ETSI for mMTC communications. We evaluate its performance and compare it to the existing LPWAN solutions showing that it outperforms those in terms of supported density of nodes while still kee** delay and loss guarantees at the required level. △ Less

Submitted 13 May, 2022; v1 submitted 18 January, 2021; originally announced January 2021.

Comments: Author-Submitted manuscript accepted for publication in the IEEE Communications Magazine, 7 pages, 5 figures, 1 table

arXiv:2011.04749 [pdf, other]

Longitudinal modeling of MS patient trajectories improves predictions of disability progression

Authors: Edward De Brouwer, Thijs Becker, Yves Moreau, Eva Kubala Havrdova, Maria Trojano, Sara Eichau, Serkan Ozakbas, Marco Onofrj, Pierre Grammond, Jens Kuhle, Ludwig Kappos, Patrizia Sola, Elisabetta Cartechini, Jeannette Lechner-Scott, Raed Alroughani, Oliver Gerlach, Tomas Kalincik, Franco Granella, Francois GrandMaison, Roberto Bergamaschi, Maria Jose Sa, Bart Van Wijmeersch, Aysun Soysal, Jose Luis Sanchez-Menoyo, Claudio Solaro , et al. (16 additional authors not shown)

Abstract: Research in Multiple Sclerosis (MS) has recently focused on extracting knowledge from real-world clinical data sources. This type of data is more abundant than data produced during clinical trials and potentially more informative about real-world clinical practice. However, this comes at the cost of less curated and controlled data sets. In this work, we address the task of optimally extracting in… ▽ More Research in Multiple Sclerosis (MS) has recently focused on extracting knowledge from real-world clinical data sources. This type of data is more abundant than data produced during clinical trials and potentially more informative about real-world clinical practice. However, this comes at the cost of less curated and controlled data sets. In this work, we address the task of optimally extracting information from longitudinal patient data in the real-world setting with a special focus on the sporadic sampling problem. Using the MSBase registry, we show that with machine learning methods suited for patient trajectories modeling, such as recurrent neural networks and tensor factorization, we can predict disability progression of patients in a two-year horizon with an ROC-AUC of 0.86, which represents a 33% decrease in the ranking pair error (1-AUC) compared to reference methods using static clinical features. Compared to the models available in the literature, this work uses the most complete patient history for MS disease progression prediction. △ Less

Submitted 9 November, 2020; originally announced November 2020.

arXiv:2002.08500 [pdf, other]

Processing topical queries on images of historical newspaper pages

Authors: José E. B. Maia, Gildácio J. de A. Sá

Abstract: Historical newspapers are a source of research for the human and social sciences. However, these image collections are difficult to read by machine due to the low quality of the print, the lack of standardization of the pages in addition to the low quality photograph of some files. This paper presents the processing model of a topic navigation system in historical newspaper page images. The genera… ▽ More Historical newspapers are a source of research for the human and social sciences. However, these image collections are difficult to read by machine due to the low quality of the print, the lack of standardization of the pages in addition to the low quality photograph of some files. This paper presents the processing model of a topic navigation system in historical newspaper page images. The general procedure consists of four modules which are: segmentation of text sub-images and text extraction, preprocessing and representation, induced topic extraction and representation, and document viewing and retrieval interface. The algorithmic and technological approaches of each module are described and the initial test results about a collection covering a range of 28 years are presented. △ Less

Submitted 19 February, 2020; originally announced February 2020.

arXiv:1810.04238 [pdf]

doi 10.1038/s41563-020-0737-1

Ultrafast Studies of Hot-Hole Dynamics in Au/p-GaN Heterostructures

Authors: Giulia Tagliabue, Joseph S. DuChene, Mohamed Abdellah, Adela Habib, Yocefu Hattori, Kaibo Zheng, Sophie E. Canton, David J. Gosztola, Wen-Hui Cheng, Ravishankar Sundararaman, Jacinto Sa, Harry A. Atwater

Abstract: Harvesting non-equilibrium hot carriers from photo-excited metal nanoparticles has enabled plasmon-driven photochemical transformations and tunable photodetection with resonant nanoantennas. Despite numerous studies on the ultrafast dynamics of hot electrons, to date, the temporal evolution of hot holes in metal-semiconductor heterostructures remains unknown. An improved understanding of the carri… ▽ More Harvesting non-equilibrium hot carriers from photo-excited metal nanoparticles has enabled plasmon-driven photochemical transformations and tunable photodetection with resonant nanoantennas. Despite numerous studies on the ultrafast dynamics of hot electrons, to date, the temporal evolution of hot holes in metal-semiconductor heterostructures remains unknown. An improved understanding of the carrier dynamics in hot-hole-driven systems is needed to help expand the scope of hot-carrier optoelectronics beyond hot-electron-based devices. Here, using ultrafast transient absorption spectroscopy, we show that plasmon-induced hot-hole injection from gold (Au) nanoparticles into the valence band of p-type gallium nitride (p-GaN) occurs within 200 fs, placing hot-hole transfer on a similar timescale as hot-electron transfer. We further observed that the removal of hot holes from below the Au Fermi level exerts a discernible influence on the thermalization of hot electrons above it, reducing the peak electronic temperature and decreasing the electron-phonon coupling time relative to Au samples without a pathway for hot-hole collection. First principles calculations corroborate these experimental observations, suggesting that hot-hole injection modifies the relaxation dynamics of hot electrons in Au nanoparticles through ultrafast modulation of the d-band electronic structure. Taken together, these ultrafast studies substantially advance our understanding of the temporal evolution of hot holes in metal-semiconductor heterostructures and suggest new strategies for manipulating and controlling the energy distributions of hot carriers on ultrafast timescales. △ Less

Submitted 9 October, 2018; originally announced October 2018.

Comments: 12 pages, 4 figures

arXiv:1712.02824 [pdf, ps, other]

Stacked Denoising Autoencoders and Transfer Learning for Immunogold Particles Detection and Recognition

Authors: Ricardo Gamelas Sousa, Jorge M. Santos, Luís M. Silva, Luís A. Alexandre, Tiago Esteves, Sara Rocha, Paulo Monjardino, Joaquim Marques de Sá, Francisco Figueiredo, Pedro Quelhas

Abstract: In this paper we present a system for the detection of immunogold particles and a Transfer Learning (TL) framework for the recognition of these immunogold particles. Immunogold particles are part of a high-magnification method for the selective localization of biological molecules at the subcellular level only visible through Electron Microscopy. The number of immunogold particles in the cell wall… ▽ More In this paper we present a system for the detection of immunogold particles and a Transfer Learning (TL) framework for the recognition of these immunogold particles. Immunogold particles are part of a high-magnification method for the selective localization of biological molecules at the subcellular level only visible through Electron Microscopy. The number of immunogold particles in the cell walls allows the assessment of the differences in their compositions providing a tool to analise the quality of different plants. For its quantization one requires a laborious manual labeling (or annotation) of images containing hundreds of particles. The system that is proposed in this paper can leverage significantly the burden of this manual task. For particle detection we use a LoG filter coupled with a SDA. In order to improve the recognition, we also study the applicability of TL settings for immunogold recognition. TL reuses the learning model of a source problem on other datasets (target problems) containing particles of different sizes. The proposed system was developed to solve a particular problem on maize cells, namely to determine the composition of cell wall ingrowths in endosperm transfer cells. This novel dataset as well as the code for reproducing our experiments is made publicly available. We determined that the LoG detector alone attained more than 84\% of accuracy with the F-measure. Develo** immunogold recognition with TL also provided superior performance when compared with the baseline models augmenting the accuracy rates by 10\%. △ Less

Submitted 7 December, 2017; originally announced December 2017.

arXiv:1712.02159 [pdf, ps, other]

Distribution-Based Categorization of Classifier Transfer Learning

Authors: Ricardo Gamelas Sousa, Luís A. Alexandre, Jorge M. Santos, Luís M. Silva, Joaquim Marques de Sá

Abstract: Transfer Learning (TL) aims to transfer knowledge acquired in one problem, the source problem, onto another problem, the target problem, dispensing with the bottom-up construction of the target model. Due to its relevance, TL has gained significant interest in the Machine Learning community since it paves the way to devise intelligent learning models that can easily be tailored to many different a… ▽ More Transfer Learning (TL) aims to transfer knowledge acquired in one problem, the source problem, onto another problem, the target problem, dispensing with the bottom-up construction of the target model. Due to its relevance, TL has gained significant interest in the Machine Learning community since it paves the way to devise intelligent learning models that can easily be tailored to many different applications. As it is natural in a fast evolving area, a wide variety of TL methods, settings and nomenclature have been proposed so far. However, a wide range of works have been reporting different names for the same concepts. This concept and terminology mixture contribute however to obscure the TL field, hindering its proper consideration. In this paper we present a review of the literature on the majority of classification TL methods, and also a distribution-based categorization of TL with a common nomenclature suitable to classification problems. Under this perspective three main TL categories are presented, discussed and illustrated with examples. △ Less

Submitted 6 December, 2017; originally announced December 2017.

arXiv:1612.08642 [pdf, other]

Bayesian Nonparametric Models for Synchronous Brain-Computer Interfaces

Authors: Jaime Fernando Delgado Saa, Mujdat Cetin

Abstract: A brain-computer interface (BCI) is a system that aims for establishing a non-muscular communication path for subjects who had suffer from a neurodegenerative disease. Many BCI systems make use of the phenomena of event-related synchronization and de-synchronization of brain waves as a main feature for classification of different cognitive tasks. However, the temporal dynamics of the electroenceph… ▽ More A brain-computer interface (BCI) is a system that aims for establishing a non-muscular communication path for subjects who had suffer from a neurodegenerative disease. Many BCI systems make use of the phenomena of event-related synchronization and de-synchronization of brain waves as a main feature for classification of different cognitive tasks. However, the temporal dynamics of the electroencephalographic (EEG) signals contain additional information that can be incorporated into the inference engine in order to improve the performance of the BCIs. This information about the dynamics of the signals have been exploited previously in BCIs by means of generative and discriminative methods. In particular, hidden Markov models (HMMs) have been used in previous works. These methods have the disadvantage that the model parameters such as the number of hidden states and the number of Gaussian mixtures need to be fix "a priori". In this work, we propose a Bayesian nonparametric model for brain signal classification that does not require "a priori" selection of the number of hidden states and the number of Gaussian mixtures of a HMM. The results show that the proposed model outperform other methods based on HMM as well as the winner algorithm of the BCI competition IV. △ Less

Submitted 27 December, 2016; originally announced December 2016.

arXiv:1611.08547 [pdf, other]

The G-ACM Tool: using the Drools Rule Engine for Access Control Management

Authors: João Sá, Sandra Alves, Sabine Broda

Abstract: In this paper we explore the usage of rule engines in a graphical framework for visualising dynamic access control policies. We use the Drools rule engine to dynamically compute permissions, following the Category-Based Access Control metamodel. In this paper we explore the usage of rule engines in a graphical framework for visualising dynamic access control policies. We use the Drools rule engine to dynamically compute permissions, following the Category-Based Access Control metamodel. △ Less

Submitted 25 November, 2016; originally announced November 2016.

arXiv:physics/0506207 [pdf, ps, other]

Frequent Errors in Special Relativity

Authors: Diego J. Saa

Abstract: Some reasons are given to suggest that the interpretation of the Lorentz' transformations as if they referred to coordinates instead of to intervals could be incorrect. Besides, the usual form of such transformations, by using variables that represent finite values instead of differentials, could be another error. Later it is shown that the Lorentz contraction factor must not have the form curre… ▽ More Some reasons are given to suggest that the interpretation of the Lorentz' transformations as if they referred to coordinates instead of to intervals could be incorrect. Besides, the usual form of such transformations, by using variables that represent finite values instead of differentials, could be another error. Later it is shown that the Lorentz contraction factor must not have the form currently accepted for it if the Lorentz contraction factor is assumed to be equal to the quotient between time differentials. △ Less

Submitted 26 September, 2006; v1 submitted 28 June, 2005; originally announced June 2005.

Comments: 17 pages, 1 figure. This is a slightly different version from the one presented at the NPA meeting at U. of Connecticut, Storrs. Contents changed (Arxiv moderators recommended merging with other paper). Corrected misprints Sept 2006

arXiv:hep-ex/9906022 [pdf, ps, other]

doi 10.1103/PhysRevD.61.032005

Search for flavor-changing neutral currents and lepton-family-number violation in two-body D0 decays

Authors: D. Pripstein, C. N. Brown, T. A. Carey, Y. C. Chen, R. L. Childers, W. E. Cooper, C. W. Darden, G. Gidal, K. N. Gounder, P. M. Ho, L. D. Isenhower, D. M. Jansen, R. G. Jeppesen, D. M. Kaplan, J. S. Kapustinsky, G. C. Kiang, M. S. Kowitt, D. W. Lane, L. M. Lederman, M. J. Leitch, J. W. Lillberg, W. R. Luebke, K. B. Luk, P. L. McGaughey, C. S. Mishra , et al. (10 additional authors not shown)

Abstract: Results of a search for the three neutral charm decays, D0 -> mu e, D0 -> mu mu, and D0 -> e e, are presented. This study was based on data collected in Experiment 789 at the Fermi National Accelerator Laboratory using 800 GeV/c proton-Au and proton-Be interactions. No evidence is found for any of the decays. Upper limits on the branching ratios, at the 90% confidence level, are obtained. Results of a search for the three neutral charm decays, D0 -> mu e, D0 -> mu mu, and D0 -> e e, are presented. This study was based on data collected in Experiment 789 at the Fermi National Accelerator Laboratory using 800 GeV/c proton-Au and proton-Be interactions. No evidence is found for any of the decays. Upper limits on the branching ratios, at the 90% confidence level, are obtained. △ Less

Submitted 5 October, 1999; v1 submitted 11 June, 1999; originally announced June 1999.

Comments: 28 pages, 18 figures. Submitted to Physical Review D

Report number: LBNL-43414, FERMILAB-Pub-99/152-E, LA-UR-99-2892

Journal ref: Phys.Rev.D61:032005,2000

Showing 1–14 of 14 results for author: Sae, J