Search | arXiv e-print repository

doi 10.1109/TCSI.2008.924885

Clifford Theory: A Geometrical Interpretation of Multivectorial Apparent Power

Authors: M. Castilla, Juan Carlos Bravo, M. Ordoñez, Juan Carlos Montaño

Abstract: In this paper, a generalization of the concept of electrical power for periodic current and voltage waveforms based on a new generalized complex geometric algebra (GCGA) is proposed. This powerful tool permits, in n-sinusoidal/nonlinear situations, representing and calculating the voltage, current, and apparent power in a single-port electrical network in terms of multivectors. The new expressions… ▽ More In this paper, a generalization of the concept of electrical power for periodic current and voltage waveforms based on a new generalized complex geometric algebra (GCGA) is proposed. This powerful tool permits, in n-sinusoidal/nonlinear situations, representing and calculating the voltage, current, and apparent power in a single-port electrical network in terms of multivectors. The new expressions result in a novel representation of the apparent power, similar to the Steinmetz's phasor model, based on complex numbers, but limited to the purely sinusoidal case. The multivectorial approach presented is based on the frequency-domain decomposition of the apparent power into three components: the real part and the imaginary part of the complex-scalar associated to active and reactive power respectively, and distortion power, associated to the complex-bivector. A geometrical interpretation of the multivectorial components of apparent power is discussed. Numerical examples illustrate the clear advantages of the suggested approach. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: 10 pages, 7 figures

Journal ref: IEEE Transactions on Circuits and Systems I-Regular Papers, ( Volume: 55, Issue: 10, November 2008)

arXiv:2402.11668 [pdf]

doi 10.1109/TIE.2016.2521615

Disturbance Ratio for Optimal Multi-Event Classification in Power Distribution Networks

Authors: M. D. Borrás, J. C. Bravo, J. C. Montaño

Abstract: This paper presents an effective approach to identify power quality events based on IEEE Std 1159-2009 caused by intermittent power sources like those of renewable energy. An efficient characterization of these disturbances is granted by the use of two useful wavelet based indices. For this purpose, a wavelet-based Global Disturbance Ratio index (GDR), defined through its instantaneous precursor (… ▽ More This paper presents an effective approach to identify power quality events based on IEEE Std 1159-2009 caused by intermittent power sources like those of renewable energy. An efficient characterization of these disturbances is granted by the use of two useful wavelet based indices. For this purpose, a wavelet-based Global Disturbance Ratio index (GDR), defined through its instantaneous precursor (Instantaneous Transient Disturbance index ITD(t)), is used in power distribution networks (PDN) under steady-state and/or transient conditions. An intelligent disturbance classification is done using a Support Vector Machine (SVM) with a minimum input vector based on the GDR index. The effectiveness of the proposed technique is validated using a real-time experimental system with single events and multi-events signals. △ Less

Submitted 18 February, 2024; originally announced February 2024.

Comments: 8 pages, 6 figures

arXiv:2402.03966 [pdf, other]

On dimensionality of feature vectors in MPNNs

Authors: César Bravo, Alexander Kozachinskiy, Cristóbal Rojas

Abstract: We revisit the classical result of Morris et al.~(AAAI'19) that message-passing graphs neural networks (MPNNs) are equal in their distinguishing power to the Weisfeiler--Leman (WL) isomorphism test. Morris et al.~show their simulation result with ReLU activation function and $O(n)$-dimensional feature vectors, where $n$ is the number of nodes of the graph. By introducing randomness into the arch… ▽ More We revisit the classical result of Morris et al.~(AAAI'19) that message-passing graphs neural networks (MPNNs) are equal in their distinguishing power to the Weisfeiler--Leman (WL) isomorphism test. Morris et al.~show their simulation result with ReLU activation function and $O(n)$-dimensional feature vectors, where $n$ is the number of nodes of the graph. By introducing randomness into the architecture, Aamand et al.~(NeurIPS'22) were able to improve this bound to $O(\log n)$-dimensional feature vectors, again for ReLU activation, although at the expense of guaranteeing perfect simulation only with high probability. Recently, Amir et al.~(NeurIPS'23) have shown that for any non-polynomial analytic activation function, it is enough to use just 1-dimensional feature vectors. In this paper, we give a simple proof of the result of Amit et al.~and provide an independent experimental validation of it. △ Less

Submitted 14 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: 15 pages, 2 figures. Changes to the previous version: added reference to Amir et al.~(NeurIPS'23)

arXiv:2402.00299 [pdf, other]

Attention-based Dynamic Multilayer Graph Neural Networks for Loan Default Prediction

Authors: Sahab Zandi, Kamesh Korangi, María Óskarsdóttir, Christophe Mues, Cristián Bravo

Abstract: Whereas traditional credit scoring tends to employ only individual borrower- or loan-level predictors, it has been acknowledged for some time that connections between borrowers may result in default risk propagating over a network. In this paper, we present a model for credit risk assessment leveraging a dynamic multilayer network built from a Graph Neural Network and a Recurrent Neural Network, e… ▽ More Whereas traditional credit scoring tends to employ only individual borrower- or loan-level predictors, it has been acknowledged for some time that connections between borrowers may result in default risk propagating over a network. In this paper, we present a model for credit risk assessment leveraging a dynamic multilayer network built from a Graph Neural Network and a Recurrent Neural Network, each layer reflecting a different source of network connection. We test our methodology in a behavioural credit scoring context using a dataset provided by U.S. mortgage financier Freddie Mac, in which different types of connections arise from the geographical location of the borrower and their choice of mortgage provider. The proposed model considers both types of connections and the evolution of these connections over time. We enhance the model by using a custom attention mechanism that weights the different time snapshots according to their importance. After testing multiple configurations, a model with GAT, LSTM, and the attention mechanism provides the best results. Empirical results demonstrate that, when it comes to predicting probability of default for the borrowers, our proposed model brings both better results and novel insights for the analysis of the importance of connections and timestamps, compared to traditional methods. △ Less

Submitted 24 June, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

arXiv:2401.05169 [pdf, ps, other]

Diophantine Approximation in local function fields via Bruhat-Tit trees

Authors: Luis Arenas-Carmona, Claudio Bravo

Abstract: We use the theory of arithmetic quotients of the Bruhat-Tits tree developed by Serre and others to obtain Dirichlet-style theorems for Diophantine approximation on global function fields. This approach allows us to find sharp values for the constants involved and, occasionally, explicit examples of badly approximable quadratic irrationals. Additionally, we can use this method to easily compute the… ▽ More We use the theory of arithmetic quotients of the Bruhat-Tits tree developed by Serre and others to obtain Dirichlet-style theorems for Diophantine approximation on global function fields. This approach allows us to find sharp values for the constants involved and, occasionally, explicit examples of badly approximable quadratic irrationals. Additionally, we can use this method to easily compute the measure of the set of elements that can be written as the limit of a sequence of ``better than expected'' approximants. All these results can be easily obtained via continued fractions when they are available, so that quotient graphs can be seen as a partial replacement of them when this fails to be the case. △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: 19 pages and 6 figures. Comments are welcome

MSC Class: 11J61; 14H05 (primary) 11J70; 11K60; 20E08 (secondary)

arXiv:2311.07407 [pdf, other]

Towards Automatic Honey Bee Flower-Patch Assays with Paint Marking Re-Identification

Authors: Luke Meyers, Josué Rodríguez Cordero, Carlos Corrada Bravo, Fanfan Noel, José Agosto-Rivera, Tugrul Giray, Rémi Mégret

Abstract: In this paper, we show that paint markings are a feasible approach to automatize the analysis of behavioral assays involving honey bees in the field where marking has to be as lightweight as possible. We contribute a novel dataset for bees re-identification with paint-markings with 4392 images and 27 identities. Contrastive learning with a ResNet backbone and triplet loss led to identity represent… ▽ More In this paper, we show that paint markings are a feasible approach to automatize the analysis of behavioral assays involving honey bees in the field where marking has to be as lightweight as possible. We contribute a novel dataset for bees re-identification with paint-markings with 4392 images and 27 identities. Contrastive learning with a ResNet backbone and triplet loss led to identity representation features with almost perfect recognition in closed setting where identities are known in advance. Diverse experiments evaluate the capability to generalize to separate IDs, and show the impact of using different body parts for identification, such as using the unmarked abdomen only. In addition, we show the potential to fully automate the visit detection and provide preliminary results of compute time for future real-time deployment in the field on an edge device. △ Less

Submitted 13 November, 2023; originally announced November 2023.

Comments: Paper 17, workshop "CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling", in conjunction with Computer Vision and Pattern Recognition (CVPR 2023), June 18, 2023, Vancouver, Canada

ACM Class: I.4.8; I.4.9; J.3

arXiv:2308.15173 [pdf, other]

Photon-rejection Power of the Light Dark Matter eXperiment in an 8 GeV Beam

Authors: Torsten Åkesson, Cameron Bravo, Liam Brennan, Lene Kristian Bryngemark, Pierfrancesco Butti, E. Craig Dukes, Valentina Dutta, Bertrand Echenard, Thomas Eichlersmith, Jonathan Eisch, Einar Elén, Ralf Ehrlich, Cooper Froemming, Andrew Furmanski, Niramay Gogate, Chiara Grieco, Craig Group, Hannah Herde, Christian Herwig, David G. Hitlin, Tyler Horoho, Joseph Incandela, Wesley Ketchum, Gordan Krnjaic, Amina Li , et al. (22 additional authors not shown)

Abstract: The Light Dark Matter eXperiment (LDMX) is an electron-beam fixed-target experiment designed to achieve comprehensive model independent sensitivity to dark matter particles in the sub-GeV mass region. An upgrade to the LCLS-II accelerator will increase the beam energy available to LDMX from 4 to 8 GeV. Using detailed GEANT4-based simulations, we investigate the effect of the increased beam energy… ▽ More The Light Dark Matter eXperiment (LDMX) is an electron-beam fixed-target experiment designed to achieve comprehensive model independent sensitivity to dark matter particles in the sub-GeV mass region. An upgrade to the LCLS-II accelerator will increase the beam energy available to LDMX from 4 to 8 GeV. Using detailed GEANT4-based simulations, we investigate the effect of the increased beam energy on the capabilities to separate signal and background, and demonstrate that the veto methodology developed for 4 GeV successfully rejects photon-induced backgrounds for at least $2\times10^{14}$ electrons on target at 8 GeV. △ Less

Submitted 4 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

Comments: 28 pages, 20 figures; corrected author list

Report number: FERMILAB-PUB-23-433-PPD-T, SLAC-PUB-17550

arXiv:2307.14321 [pdf, ps, other]

Polyhedral joins and graph complexes

Authors: Andrés Carnero Bravo

Abstract: We give the homotopy type of the suspension of a polyhedral join in terms of the polyhedral smash product for the same family of pairs and show that the polyhedral join of pairs $\left(\bigvee\mathbb{S}^0,\emptyset\right)$ over a skeleton of $Δ^n$ has the homotopy type of some wedge of spheres. We use these results to study the homotopy type of the forest filtration for some lexicographic products… ▽ More We give the homotopy type of the suspension of a polyhedral join in terms of the polyhedral smash product for the same family of pairs and show that the polyhedral join of pairs $\left(\bigvee\mathbb{S}^0,\emptyset\right)$ over a skeleton of $Δ^n$ has the homotopy type of some wedge of spheres. We use these results to study the homotopy type of the forest filtration for some lexicographic products of graphs. △ Less

Submitted 1 September, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

MSC Class: 05E45; 55P10; 55P15; 05C76

arXiv:2307.13271 [pdf, ps, other]

The Forest Filtration of a Graph

Authors: Andrés Carnero Bravo

Abstract: Given a graph $G$, we define a filtration of simplicial complexes associated to $G$, $\mathcal{F}_0(G)\subseteq\mathcal{F}_1(G)\subseteq\cdots\subseteq\mathcal{F}_\infty(G)$ where the first complex is the independence complex and the last the complex is formed by the acyclic sets of vertices. We prove some properties of this filtration and we calculate the homotopy type for various families of gra… ▽ More Given a graph $G$, we define a filtration of simplicial complexes associated to $G$, $\mathcal{F}_0(G)\subseteq\mathcal{F}_1(G)\subseteq\cdots\subseteq\mathcal{F}_\infty(G)$ where the first complex is the independence complex and the last the complex is formed by the acyclic sets of vertices. We prove some properties of this filtration and we calculate the homotopy type for various families of graphs. △ Less

Submitted 1 September, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

arXiv:2307.12401 [pdf, ps, other]

Homotopy type of the independence complex of some categorical products of graphs

Authors: Omar Antolín Camarena, Andrés Carnero Bravo

Abstract: It was conjectured by Goyal, Shukla and Singh that the independence complex of the categorical product $K_2\times K_3\times K_n$ has the homotopy type of a wedge of $(n-1)(3n-2)$ spheres of dimension $3$. Here we prove this conjecture by calculating the homotopy type of the independence complex of the graphs $C_{3r}\times K_n$ and $K_2\times K_m\times K_n$. For $C_m \times K_n$ when $m$ is not a m… ▽ More It was conjectured by Goyal, Shukla and Singh that the independence complex of the categorical product $K_2\times K_3\times K_n$ has the homotopy type of a wedge of $(n-1)(3n-2)$ spheres of dimension $3$. Here we prove this conjecture by calculating the homotopy type of the independence complex of the graphs $C_{3r}\times K_n$ and $K_2\times K_m\times K_n$. For $C_m \times K_n$ when $m$ is not a multiple of $3$, we calculate the homotopy type for $m = 4, 5$ and show that for other values it has to have the homotopy type of a wedge of spheres of at most $2$ consecutive dimensions and maybe some Moore spaces. △ Less

Submitted 1 September, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

arXiv:2307.11193 [pdf, ps, other]

Arithmetic subgroups of Chevalley group schemes over function fields II: Conjugacy classes of maximal unipotent subgroups

Authors: Claudio Bravo, Benoit Loisel

Abstract: Let $\mathcal{C}$ be a smooth, projective, geometrically integral curve defined over a perfect field $\mathbb{F}$. Let $k=\mathbb{F}(\mathcal{C})$ be the function field of $\mathcal{C}$. Let $\mathbf{G}$ be a split simply connected semisimple $\mathbb{Z}$-group scheme. Let $\mathcal{S}$ be a finite set of places of $\mathcal{C}$. In this paper, we investigate on the conjugacy classes of maximal un… ▽ More Let $\mathcal{C}$ be a smooth, projective, geometrically integral curve defined over a perfect field $\mathbb{F}$. Let $k=\mathbb{F}(\mathcal{C})$ be the function field of $\mathcal{C}$. Let $\mathbf{G}$ be a split simply connected semisimple $\mathbb{Z}$-group scheme. Let $\mathcal{S}$ be a finite set of places of $\mathcal{C}$. In this paper, we investigate on the conjugacy classes of maximal unipotents subgroups of $\mathcal{S}$-arithmetic subgroups. These are parameterized thanks to the Picard group of $\mathcal{O}_{\mathcal{S}}$ and the rank of $\mathbf{G}$. Furthermore, these maximal unipotent subgroups can be realized as the unipotent part of natural stabilizer, that are the stabilizers of sectors of the associated Bruhat-Tits building. We decompose these natural stabilizers in terms of their diagonalisable part and unipotent part, and we precise the group structure of the diagonalisable part. △ Less

Submitted 20 July, 2023; originally announced July 2023.

Comments: Comments are welcome

MSC Class: 20G30; 20E45 (primary) 11R58; 20E42 (secondary)

arXiv:2307.08131 [pdf, other]

INFLECT-DGNN: Influencer Prediction with Dynamic Graph Neural Networks

Authors: Elena Tiukhova, Emiliano Penaloza, María Óskarsdóttir, Bart Baesens, Monique Snoeck, Cristián Bravo

Abstract: Leveraging network information for predictive modeling has become widespread in many domains. Within the realm of referral and targeted marketing, influencer detection stands out as an area that could greatly benefit from the incorporation of dynamic network representation due to the ongoing development of customer-brand relationships. To elaborate this idea, we introduce INFLECT-DGNN, a new frame… ▽ More Leveraging network information for predictive modeling has become widespread in many domains. Within the realm of referral and targeted marketing, influencer detection stands out as an area that could greatly benefit from the incorporation of dynamic network representation due to the ongoing development of customer-brand relationships. To elaborate this idea, we introduce INFLECT-DGNN, a new framework for INFLuencer prEdiCTion with Dynamic Graph Neural Networks that combines Graph Neural Networks (GNN) and Recurrent Neural Networks (RNN) with weighted loss functions, the Synthetic Minority Oversampling TEchnique (SMOTE) adapted for graph data, and a carefully crafted rolling-window strategy. To evaluate predictive performance, we utilize a unique corporate data set with networks of three cities and derive a profit-driven evaluation methodology for influencer prediction. Our results show how using RNN to encode temporal attributes alongside GNNs significantly improves predictive performance. We compare the results of various models to demonstrate the importance of capturing graph representation, temporal dependencies, and using a profit-driven methodology for evaluation. △ Less

Submitted 12 December, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

Comments: 26 pages, 10 figures

arXiv:2306.15585 [pdf, ps, other]

doi 10.1016/j.ejor.2023.12.025

Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning

Authors: Sherly Alfonso-Sánchez, Jesús Solano, Alejandro Correa-Bahnsen, Kristina P. Sendova, Cristián Bravo

Abstract: Reinforcement learning has been explored for many problems, from video games with deterministic environments to portfolio and operations management in which scenarios are stochastic; however, there have been few attempts to test these methods in banking problems. In this study, we sought to find and automatize an optimal credit card limit adjustment policy by employing reinforcement learning techn… ▽ More Reinforcement learning has been explored for many problems, from video games with deterministic environments to portfolio and operations management in which scenarios are stochastic; however, there have been few attempts to test these methods in banking problems. In this study, we sought to find and automatize an optimal credit card limit adjustment policy by employing reinforcement learning techniques. Because of the historical data available, we considered two possible actions per customer, namely increasing or maintaining an individual's current credit limit. To find this policy, we first formulated this decision-making question as an optimization problem in which the expected profit was maximized; therefore, we balanced two adversarial goals: maximizing the portfolio's revenue and minimizing the portfolio's provisions. Second, given the particularities of our problem, we used an offline learning strategy to simulate the impact of the action based on historical data from a super-app in Latin America to train our reinforcement learning agent. Our results, based on the proposed methodology involving synthetic experimentation, show that a Double Q-learning agent with optimized hyperparameters can outperform other strategies and generate a non-trivial optimal policy not only reflecting the complex nature of this decision but offering an incentive to explore reinforcement learning in real-world banking scenarios. Our research establishes a conceptual structure for applying reinforcement learning framework to credit limit adjustment, presenting an objective technique to make these decisions primarily based on data-driven methods rather than relying only on expert-driven systems. We also study the use of alternative data for the problem of balance prediction, as the latter is a requirement of our proposed model. We find the use of such data does not always bring prediction gains. △ Less

Submitted 16 February, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

Comments: 29 pages, 16 figures

Journal ref: Alfonso-Sanchez, S., Solano, J., Correa-Bahnsen, A., Sendova, K. P., & Bravo, C. (2024). Optimizing credit limit adjustments under adversarial goals using reinforcement learning. European Journal of Operational Research 315(2): 802-817

arXiv:2304.10740 [pdf, other]

Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams

Authors: Mahsa Tavakoli, Rohitash Chandra, Fengrui Tian, Cristián Bravo

Abstract: Knowing which factors are significant in credit rating assignment leads to better decision-making. However, the focus of the literature thus far has been mostly on structured data, and fewer studies have addressed unstructured or multi-modal datasets. In this paper, we present an analysis of the most effective architectures for the fusion of deep learning models for the prediction of company credi… ▽ More Knowing which factors are significant in credit rating assignment leads to better decision-making. However, the focus of the literature thus far has been mostly on structured data, and fewer studies have addressed unstructured or multi-modal datasets. In this paper, we present an analysis of the most effective architectures for the fusion of deep learning models for the prediction of company credit rating classes, by using structured and unstructured datasets of different types. In these models, we tested different combinations of fusion strategies with different deep learning models, including CNN, LSTM, GRU, and BERT. We studied data fusion strategies in terms of level (including early and intermediate fusion) and techniques (including concatenation and cross-attention). Our results show that a CNN-based multi-modal model with two fusion strategies outperformed other multi-modal techniques. In addition, by comparing simple architectures with more complex ones, we found that more sophisticated deep learning models do not necessarily produce the highest performance; however, if attention-based models are producing the best results, cross-attention is necessary as a fusion strategy. Finally, our comparison of rating agencies on short-, medium-, and long-term performance shows that Moody's credit ratings outperform those of other agencies like Standard & Poor's and Fitch Ratings. △ Less

Submitted 22 September, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

arXiv:2304.00505 [pdf, ps, other]

Relative homology of arithmetic subgroups of $\mathrm{SU}(3)$

Authors: Claudio Bravo

Abstract: Let $\mathcal{C}$ be a smooth, projective and geometrically integral curve defined over a finite field $\mathbb{F}$. Let $A$ be the ring of function of $\mathcal{C}$ that are regular outside a closed point $P$ and let $k=\mathrm{Quot}(A)$. Let $\mathcal{G}=\mathrm{SU}(3)$ be the non-split group-scheme defined from an (isotropic) hermitian form in three variables. In this work, we describe, in term… ▽ More Let $\mathcal{C}$ be a smooth, projective and geometrically integral curve defined over a finite field $\mathbb{F}$. Let $A$ be the ring of function of $\mathcal{C}$ that are regular outside a closed point $P$ and let $k=\mathrm{Quot}(A)$. Let $\mathcal{G}=\mathrm{SU}(3)$ be the non-split group-scheme defined from an (isotropic) hermitian form in three variables. In this work, we describe, in terms of the Euler-Poincaré characteristic, the relative homology groups of certain arithmetic subgroups $G$ of $\mathcal{G}(A)$ modulo a representative system $\mathfrak{U}$ of the conjugacy classes of their maximal unipotent subgroups. In other words, we measure how far are the homology groups of $G$ from being the coproducts of the corresponding homology groups of the subgroups $U \in \mathfrak{U}$. △ Less

Submitted 2 April, 2023; originally announced April 2023.

Comments: Comments are welcome

MSC Class: 55N10; 20G30; 11R58 (primary); 20E08; 20E08; 20E45 (secondary)

arXiv:2301.01212 [pdf, ps, other]

doi 10.1007/978-3-031-15471-3_32

Assessment of creditworthiness models privacy-preserving training with synthetic data

Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

Abstract: Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate… ▽ More Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate the performance of models trained with synthetic data when they are applied to real-world data. Our results show that synthetic data quality is increasingly poor when the number of attributes increases. However, creditworthiness assessment models trained with synthetic data show a reduction of 3\% of AUC and 6\% of KS when compared with models trained with real data. These results have a significant impact since they encourage credit risk investigation from synthetic data, making it possible to maintain borrowers' privacy and to address problems that until now have been hampered by the availability of information. △ Less

Submitted 31 December, 2022; originally announced January 2023.

Journal ref: Hybrid Artificial Intelligent Systems. HAIS 2022. Lecture Notes in Computer Science(), vol 13469

arXiv:2212.10629 [pdf, other]

doi 10.1103/PhysRevD.108.012015

Searching for Prompt and Long-Lived Dark Photons in Electro-Produced $e^+e^-$ Pairs with the Heavy Photon Search Experiment at JLab

Authors: P. H. Adrian, N. A. Baltzell, M. Battaglieri, M. Bondi, S. Boyarinov, C. Bravo, S. Bueltmann, P. Butti, V. D. Burkert, D. Calvo, T. Cao, M. Carpinelli, A. Celentano, G. Charles, L. Colaneri, W. Cooper, C. Cuevas, A. D'Angelo, N. Dashyan, M. De Napoli, R. De Vita, A. Deur, M. Diamond, R. Dupre, H. Egiyan , et al. (59 additional authors not shown)

Abstract: The Heavy Photon Search experiment (HPS) at the Thomas Jefferson National Accelerator Facility searches for electro-produced dark photons. We report results from the 2016 Engineering Run consisting of 10608/nb of data for both the prompt and displaced vertex searches. A search for a prompt resonance in the $e^+e^-$ invariant mass distribution between 39 and 179 MeV showed no evidence of dark photo… ▽ More The Heavy Photon Search experiment (HPS) at the Thomas Jefferson National Accelerator Facility searches for electro-produced dark photons. We report results from the 2016 Engineering Run consisting of 10608/nb of data for both the prompt and displaced vertex searches. A search for a prompt resonance in the $e^+e^-$ invariant mass distribution between 39 and 179 MeV showed no evidence of dark photons above the large QED background, limiting the coupling of ε^2 {\geq} 10^-5, in agreement with previous searches. The search for displaced vertices showed no evidence of excess signal over background in the masses between 60 and 150 MeV, but had insufficient luminosity to limit canonical heavy photon production. This is the first displaced vertex search result published by HPS. HPS has taken high-luminosity data runs in 2019 and 2021 that will explore new dark photon phase space. △ Less

Submitted 12 July, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

Comments: 28 pages, 46 figures

Report number: JLAB-PHY-23-3738

arXiv:2212.07509 [pdf, ps, other]

Homotopy type through homology groups

Authors: Omar Antolín Camarena, Andrés Carnero Bravo

Abstract: We show that if a complex has free finitely generated reduced homology groups for two consecutive dimensions and trivial homology for all other dimensions, then it must have the homotopy type of a wedge of spheres of two consecutive dimensions. We also show other pairs of dimensions for which the last result can be generalized. We show that if a complex has free finitely generated reduced homology groups for two consecutive dimensions and trivial homology for all other dimensions, then it must have the homotopy type of a wedge of spheres of two consecutive dimensions. We also show other pairs of dimensions for which the last result can be generalized. △ Less

Submitted 13 January, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

arXiv:2211.09664 [pdf, other]

Influencer Detection with Dynamic Graph Neural Networks

Authors: Elena Tiukhova, Emiliano Penaloza, María Óskarsdóttir, Hernan Garcia, Alejandro Correa Bahnsen, Bart Baesens, Monique Snoeck, Cristián Bravo

Abstract: Leveraging network information for prediction tasks has become a common practice in many domains. Being an important part of targeted marketing, influencer detection can potentially benefit from incorporating dynamic network representation. In this work, we investigate different dynamic Graph Neural Networks (GNNs) configurations for influencer detection and evaluate their prediction performance u… ▽ More Leveraging network information for prediction tasks has become a common practice in many domains. Being an important part of targeted marketing, influencer detection can potentially benefit from incorporating dynamic network representation. In this work, we investigate different dynamic Graph Neural Networks (GNNs) configurations for influencer detection and evaluate their prediction performance using a unique corporate data set. We show that using deep multi-head attention in GNN and encoding temporal attributes significantly improves performance. Furthermore, our empirical evaluation illustrates that capturing neighborhood representation is more beneficial that using network centrality measures. △ Less

Submitted 15 November, 2022; originally announced November 2022.

Comments: Conference workshop camera-ready paper - accepted at NeurIPS TGL 2022. 8 pages, 4 figures

arXiv:2207.06546 [pdf, ps, other]

Arithmetic subgroups of Chevalley group schemes over function fields I: quotients of the Bruhat-Tits building by $\{P\}$-arithmetic subgroups

Authors: Claudio Bravo, Benoit Loisel

Abstract: Let $\mathbf{G}$ be a reductive Chevalley group scheme (defined over $\mathbb{Z}$). Let $\mathcal{C}$ be a smooth, projective, geometrically integral curve over a field $\mathbb{F}$. Let $P$ be a closed point on $\mathcal{C}$. Let $A$ be the ring of functions that are regular outside $\lbrace P \rbrace$. The fraction field $k$ of $A$ has a discrete valuation… ▽ More Let $\mathbf{G}$ be a reductive Chevalley group scheme (defined over $\mathbb{Z}$). Let $\mathcal{C}$ be a smooth, projective, geometrically integral curve over a field $\mathbb{F}$. Let $P$ be a closed point on $\mathcal{C}$. Let $A$ be the ring of functions that are regular outside $\lbrace P \rbrace$. The fraction field $k$ of $A$ has a discrete valuation $ν=ν_{P}: k^{\times} \rightarrow \mathbb{Z}$ associated to $P$. In this work, we study the action of the group $ \textbf{G}(A)$ of $A$-points of $\mathbf{G}$ on the Bruhat-Tits building $\mathcal{X}=\mathcal{X}(\textbf{G},k,ν_{P})$ in order to describe the structure of the orbit space $ \textbf{G}(A)\backslash \mathcal{X}$. We obtain that this orbit space is the ``gluing'' of a closed connected CW-complex with some sector chambers. The latter are parametrized by a set depending on the Picard group of $\mathcal{C} \smallsetminus \{P\}$ and on the rank of $\mathbf{G}$. Moreover, we observe that any rational sector face whose tip is a special vertex contains a subsector face that embeds into this orbit space. △ Less

Submitted 5 July, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

Comments: This preprint contains the sections from 1 to 9 of arXiv:2207.06546v3. Section 10 will be extended to a finite set of places in a work in progress

MSC Class: 20G30; 11R58; 20E42 (primary) 14H05; 20H25 (secondary)

arXiv:2206.10074 [pdf, ps, other]

Statistical network isomorphism

Authors: Pierre Miasnikof, Alexander Y. Shestopaloff, Cristián Bravo, Yuri Lawryshyn

Abstract: Graph isomorphism is a problem for which there is no known polynomial-time solution. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. Moreover, questions of similarity are typically more general and their answers more widely applicable than the more restrictive isomorphism… ▽ More Graph isomorphism is a problem for which there is no known polynomial-time solution. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. Moreover, questions of similarity are typically more general and their answers more widely applicable than the more restrictive isomorphism question. In this article, we offer a statistical answer to the following questions: a) {\it ``Are networks $G_1$ and $G_2$ similar?''}, b) {\it ``How different are the networks $G_1$ and $G_2$?''} and c) {\it ``Is $G_3$ more similar to $G_1$ or $G_2$?''}. Our comparisons begin with the transformation of each graph into an all-pairs distance matrix. Our node-node distance, Jaccard distance, has been shown to offer a good reflection of the graph's connectivity structure. We then model these distances as probability distributions. Finally, we use well-established statistical tools to gauge the (dis)similarities in terms of probability distribution (dis)similarity. This comparison procedure aims to detect (dis)similarities in connectivity structure, not in easily observable graph characteristics, such as degrees, edge counts or density. We validate our hypothesis that graphs can be meaningfully summarized and compared via their node-node distance distributions, using several synthetic and real-world graphs. Empirical results demonstrate its validity and the accuracy of our comparison technique. △ Less

Submitted 20 June, 2022; originally announced June 2022.

Comments: 11 pages, two figures

arXiv:2205.07328 [pdf, ps, other]

Quotients of the Bruhat-Tits tree by function field analogs of the Hecke congruence subgroups

Authors: Claudio Bravo

Abstract: Let C be a smooth, projective and geometrically integral curve defined over a finite field F. For each closed point P of C, let R be the ring of functions that are regular outside P, and let K be the completion at P of the function field of C. In order to study groups of the form GL2(R), Serre describes the quotient graph GL2(R)\t, where t is the Bruhat-Tits tree defined from SL2(K). In particular… ▽ More Let C be a smooth, projective and geometrically integral curve defined over a finite field F. For each closed point P of C, let R be the ring of functions that are regular outside P, and let K be the completion at P of the function field of C. In order to study groups of the form GL2(R), Serre describes the quotient graph GL2(R)\t, where t is the Bruhat-Tits tree defined from SL2(K). In particular, Serre shows that GL2(R)\t is the union of a finite graph and a finite number of ray shaped subgraphs, which are called cusps. It is not hard to see that finite index subgroups inherit this property. In this work we describe the associated quotient graph H\t for the action on t of the group H of matrices in GL2(R) that are upper triangular modulo a certain ideal I of R. More specifically, we give a explicit formula for the cusp number of H\t. Then, by using Bass-Serre Theory, we describe the combinatorial structure of H. These groups play, in the function field context, the same role as the Hecke congruence subgroups of SL2(Z). △ Less

Submitted 15 May, 2022; originally announced May 2022.

MSC Class: 20G30-20E08 (primary); 11R58-14H60 (secondary)

arXiv:2204.06122 [pdf, other]

On the dynamics of credit history and social interaction features, and their impact on creditworthiness assessment performance

Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

Abstract: For more than a half-century, credit risk management has used credit scoring models in each of its well-defined stages to manage credit risk. Application scoring is used to decide whether to grant a credit or not, while behavioral scoring is used mainly for portfolio management and to take preventive actions in case of default signals. In both cases, network data has recently been shown to be valu… ▽ More For more than a half-century, credit risk management has used credit scoring models in each of its well-defined stages to manage credit risk. Application scoring is used to decide whether to grant a credit or not, while behavioral scoring is used mainly for portfolio management and to take preventive actions in case of default signals. In both cases, network data has recently been shown to be valuable to increase the predictive power of these models, especially when the borrower's historical data is scarce or not available. This study aims to understand the creditworthiness assessment performance dynamics and how it is influenced by the credit history, repayment behavior, and social network features. To accomplish this, we introduced a machine learning classification framework to analyze 97.000 individuals and companies from the moment they obtained their first loan to 12 months afterward. Our novel and massive dataset allow us to characterize each borrower according to their credit behavior, and social and economic relationships. Our research shows that borrowers' history increases performance at a decreasing rate during the first six months and then stabilizes. The most notable effect on perfomance of social networks features occurs at loan application; in personal scoring, this effect prevails a few months, while in business scoring adds value throughout the study period. These findings are of great value to improve credit risk management and optimize the use of traditional information and alternative data sources. △ Less

Submitted 12 April, 2022; originally announced April 2022.

arXiv:2203.08324 [pdf, other]

The Heavy Photon Search Experiment

Authors: Nathan Baltzell, Marco Battaglieri, Mariangela Bondi, Sergei Boyarinov, Cameron Bravo, Stephen Bueltmann, Volker Burkert, Pierfrancesco Butti, Tongtong Cao, Massimo Carpinelli, Andrea Celentano, Gabriel Charles, Chris Cuevas, Annalisa D'Angelo, Domenico D'Urso, Natalia Dashyan, Marzio De Napoli, Raffaella De Vita, Alexandre Deur, Miriam Diamond, Raphael Dupre, Rouven Essig, Vitaliy Fadeyev, R. Clive Field, Alessandra Filippi , et al. (37 additional authors not shown)

Abstract: The Heavy Photon Search (HPS) experiment is designed to search for a new vector boson $A^\prime$ in the mass range of 20 MeV/$c^2$ to 220 MeV/$c^2$ that kinetically mixes with the Standard Model photon with couplings $ε^2 >10^{-10}$. In addition to the general importance of exploring light, weakly coupled physics that is difficult to probe with high-energy colliders, a prime motivation for this se… ▽ More The Heavy Photon Search (HPS) experiment is designed to search for a new vector boson $A^\prime$ in the mass range of 20 MeV/$c^2$ to 220 MeV/$c^2$ that kinetically mixes with the Standard Model photon with couplings $ε^2 >10^{-10}$. In addition to the general importance of exploring light, weakly coupled physics that is difficult to probe with high-energy colliders, a prime motivation for this search is the possibility that sub-GeV thermal relics constitute dark matter, a scenario that requires a new comparably light mediator, where models with a hidden $U(1)$ gauge symmetry, a "dark", "hidden sector", or "heavy" photon, are particularly attractive. HPS searches for visible signatures of these heavy photons, taking advantage of their small coupling to electric charge to produce them via a process analogous to bremsstrahlung in a fixed target and detect their subsequent decay to $\mathrm{e}^+ \mathrm{e}^-$ pairs in a compact spectrometer. In addition to searching for $\mathrm{e}^+ \mathrm{e}^-$ resonances atop large QED backgrounds, HPS has the ability to precisely measure decay lengths, resulting in unique sensitivity to dark photons, as well as other long-lived new physics. After completion of the experiment and operation of engineering runs in 2015 and 2016 at the JLab CEBAF, physics runs in 2019 and 2021 have provided datasets that are now being analyzed to search for dark photons and other new phenomena. △ Less

Submitted 15 March, 2022; originally announced March 2022.

Comments: Submitted to the Proceedings of the US Community Study on the Future of Particle Physics (Snowmass 2021)

arXiv:2203.08192 [pdf, other]

Current Status and Future Prospects for the Light Dark Matter eXperiment

Authors: Torsten Åkesson, Nikita Blinov, Lukas Brand-Baugher, Cameron Bravo, Lene Kristian Bryngemark, Pierfrancesco Butti, Caterina Doglioni, Craig Dukes, Valentina Dutta, Bertrand Echenard, Ralf Ehrlich, Thomas Eichlersmith, Andrew Furmanski, Chloe Greenstein, Craig Group, Niramay Gogate, Vinay Hegde, Christian Herwig, David G. Hitlin, Duc Hoang, Tyler Horoho, Joseph Incandela, Wesley Ketchum, Gordan Krnjaic, Amina Li , et al. (23 additional authors not shown)

Abstract: The constituents of dark matter are still unknown, and the viable possibilities span a vast range of masses. The physics community has established searching for sub-GeV dark matter as a high priority and identified accelerator-based experiments as an essential facet of this search strategy. A key goal of the accelerator-based dark matter program is testing the broad idea of thermally produced sub-… ▽ More The constituents of dark matter are still unknown, and the viable possibilities span a vast range of masses. The physics community has established searching for sub-GeV dark matter as a high priority and identified accelerator-based experiments as an essential facet of this search strategy. A key goal of the accelerator-based dark matter program is testing the broad idea of thermally produced sub-GeV dark matter through experiments designed to directly produce dark matter particles. The most sensitive way to search for the production of light dark matter is to use a primary electron beam to produce it in fixed-target collisions. The Light Dark Matter eXperiment (LDMX) is an electron-beam fixed-target missing-momentum experiment that realizes this approach and provides unique sensitivity to light dark matter in the sub-GeV range. This contribution provides an overview of the theoretical motivation, the main experimental challenges, how LDMX addresses these challenges, and projected sensitivities. We further describe the capabilities of LDMX to explore other interesting new and standard physics, such as visibly-decaying axion and vector mediators or rare meson decays, and to provide timely electronuclear scattering measurements that will inform the modeling of neutrino-nucleus scattering for DUNE. △ Less

Submitted 21 August, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: 26 pages, 17 figures. Contribution to Snowmass 2021

arXiv:2112.01421 [pdf, other]

doi 10.1016/j.isprsjprs.2022.03.015

Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction

Authors: Matthew Stevenson, Christophe Mues, Cristián Bravo

Abstract: LiDAR (short for "Light Detection And Ranging" or "Laser Imaging, Detection, And Ranging") technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes. To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains. However, the geographically granular and open-source nature of this data also lends itsel… ▽ More LiDAR (short for "Light Detection And Ranging" or "Laser Imaging, Detection, And Ranging") technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes. To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains. However, the geographically granular and open-source nature of this data also lends itself to an array of societal, organizational and business applications where geo-demographic type data is utilised. Arguably, the complexity involved in processing this multi-dimensional data has thus far restricted its broader adoption. In this paper, we propose a series of convenient task-agnostic tile elevation embeddings to address this challenge, using recent advances from unsupervised Deep Learning. We test the potential of our embeddings by predicting seven English indices of deprivation (2019) for small geographies in the Greater London area. These indices cover a range of socio-economic outcomes and serve as a proxy for a wide variety of downstream tasks to which the embeddings can be applied. We consider the suitability of this data not just on its own but also as an auxiliary source of data in combination with demographic features, thus providing a realistic use case for the embeddings. Having trialled various model/embedding configurations, we find that our best performing embeddings lead to Root-Mean-Squared-Error (RMSE) improvements of up to 21% over using standard demographic features alone. We also demonstrate how our embedding pipeline, using Deep Learning combined with K-means clustering, produces coherent tile segments which allow the latent embedding features to be interpreted. △ Less

Submitted 1 August, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

Comments: 29 pages, 13 figures. V2 - Published

Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, 187, 378-392 (2022)

arXiv:2111.14338 [pdf, other]

Improving Deep Learning Interpretability by Saliency Guided Training

Authors: Aya Abdelsalam Ismail, Héctor Corrada Bravo, Soheil Feizi

Abstract: Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a {\it saliency guided training}procedure for neural networks to reduce noisy gra… ▽ More Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a {\it saliency guided training}procedure for neural networks to reduce noisy gradients used in predictions while retaining the predictive performance of the model. Our saliency guided training procedure iteratively masks features with small and potentially noisy gradients while maximizing the similarity of model outputs for both masked and unmasked inputs. We apply the saliency guided training procedure to various synthetic and real data sets from computer vision, natural language processing, and time series across diverse neural architectures, including Recurrent Neural Networks, Convolutional Networks, and Transformers. Through qualitative and quantitative evaluations, we show that saliency guided training procedure significantly improves model interpretability across various domains while preserving its predictive performance. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Journal ref: Thirty-fifth Conference on Neural Information Processing Systems 2021

arXiv:2111.13666 [pdf, other]

doi 10.1016/j.eswa.2022.118809

On the combination of graph data for assessing thin-file borrowers' creditworthiness

Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

Abstract: The thin-file borrowers are customers for whom a creditworthiness assessment is uncertain due to their lack of credit history; many researchers have used borrowers' relationships and interactions networks in the form of graphs as an alternative data source to address this. Incorporating network data is traditionally made by hand-crafted feature engineering, and lately, the graph neural network has… ▽ More The thin-file borrowers are customers for whom a creditworthiness assessment is uncertain due to their lack of credit history; many researchers have used borrowers' relationships and interactions networks in the form of graphs as an alternative data source to address this. Incorporating network data is traditionally made by hand-crafted feature engineering, and lately, the graph neural network has emerged as an alternative, but it still does not improve over the traditional method's performance. Here we introduce a framework to improve credit scoring models by blending several Graph Representation Learning methods: feature engineering, graph embeddings, and graph neural networks. We stacked their outputs to produce a single score in this approach. We validated this framework using a unique multi-source dataset that characterizes the relationships and credit history for the entire population of a Latin American country, applying it to credit risk models, application, and behavior, targeting both individuals and companies. Our results show that the graph representation learning methods should be used as complements, and these should not be seen as self-sufficient methods as is currently done. In terms of AUC and KS, we enhance the statistical performance, outperforming traditional methods. In Corporate lending, where the gain is much higher, it confirms that evaluating an unbanked company cannot solely consider its features. The business ecosystem where these firms interact with their owners, suppliers, customers, and other companies provides novel knowledge that enables financial institutions to enhance their creditworthiness assessment. Our results let us know when and which group to use graph data and what effects on performance to expect. They also show the enormous value of graph data on the unbanked credit scoring problem, principally to help companies' banking. △ Less

Submitted 16 September, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

Journal ref: Expert Systems with Applications, 2022, 118809

arXiv:2111.09902 [pdf, other]

doi 10.1016/j.ejor.2022.10.032

A transformer-based model for default prediction in mid-cap corporate markets

Authors: Kamesh Korangi, Christophe Mues, Cristián Bravo

Abstract: In this paper, we study mid-cap companies, i.e. publicly traded companies with less than US $10 billion in market capitalisation. Using a large dataset of US mid-cap companies observed over 30 years, we look to predict the default probability term structure over the medium term and understand which data sources (i.e. fundamental, market or pricing data) contribute most to the default risk. Whereas… ▽ More In this paper, we study mid-cap companies, i.e. publicly traded companies with less than US $10 billion in market capitalisation. Using a large dataset of US mid-cap companies observed over 30 years, we look to predict the default probability term structure over the medium term and understand which data sources (i.e. fundamental, market or pricing data) contribute most to the default risk. Whereas existing methods typically require that data from different time periods are first aggregated and turned into cross-sectional features, we frame the problem as a multi-label time-series classification problem. We adapt transformer models, a state-of-the-art deep learning model emanating from the natural language processing domain, to the credit risk modelling setting. We also interpret the predictions of these models using attention heat maps. To optimise the model further, we present a custom loss function for multi-label classification and a novel multi-channel architecture with differential training that gives the model the ability to use all input data efficiently. Our results show the proposed deep learning architecture's superior performance, resulting in a 13% improvement in AUC (Area Under the receiver operating characteristic Curve) over traditional models. We also demonstrate how to produce an importance ranking for the different data sources and the temporal relationships using a Shapley approach specific to these models. △ Less

Submitted 20 April, 2023; v1 submitted 18 November, 2021; originally announced November 2021.

Comments: 38 pages, 6 figures, V4 published

Journal ref: European Journal of Operational Research, 308, 306-320 (2023)

arXiv:2107.00589 [pdf, ps, other]

doi 10.1016/j.jpaa.2021.106996

Quotients of the Bruhat-Tits tree by arithmetic subgroups of special unitary groups

Authors: Luis Arenas-Carmona, Claudio Bravo, Benoit Loisel, Giancarlo Lucchini Arteche

Abstract: Let $K$ be the function field of a curve $C$ over a field $\mathbb{F}$ of either odd or zero characteristic. Following the work by Serre and Mason on $\mathrm{SL}_2$, we study the action of arithmetic subgroups of $\mathrm{SU}(3)$ on its corresponding Bruhat-Tits tree associated to a suitable completion of $K$. More precisely, we prove that the quotient graph "looks like a spider", in the sense th… ▽ More Let $K$ be the function field of a curve $C$ over a field $\mathbb{F}$ of either odd or zero characteristic. Following the work by Serre and Mason on $\mathrm{SL}_2$, we study the action of arithmetic subgroups of $\mathrm{SU}(3)$ on its corresponding Bruhat-Tits tree associated to a suitable completion of $K$. More precisely, we prove that the quotient graph "looks like a spider", in the sense that it is the union of a set of cuspidal rays (the "legs"), parametrized by an explicit Picard group, that are attached to a connected graph (the "body"). We use this description in order to describe these arithmetic subgroups as amalgamated products and study their homology. In the case where $\mathbb{F}$ is a finite field, we use a result by Bux, Köhl and Witzel in order to prove that the "body" is a finite graph, which allows us to get even more precise applications. △ Less

Submitted 23 November, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

Comments: 36 pages. Final version

MSC Class: primary 20E08; 20H25; 14H05; secondary 20J06; 20G30; 11F75

Journal ref: J. Pure Appl. Algebra 226 (2022), 106996

arXiv:2012.05724 [pdf]

doi 10.1016/j.dss.2020.113398

Improving healthcare access management by predicting patient no-show behaviour

Authors: David Barrera Ferro, Sally Brailsford, Cristián Bravo, Honora Smith

Abstract: Low attendance levels in medical appointments have been associated with poor health outcomes and efficiency problems for service providers. To address this problem, healthcare managers could aim at improving attendance levels or minimizing the operational impact of no-shows by adapting resource allocation policies. However, given the uncertainty of patient behaviour, generating relevant informatio… ▽ More Low attendance levels in medical appointments have been associated with poor health outcomes and efficiency problems for service providers. To address this problem, healthcare managers could aim at improving attendance levels or minimizing the operational impact of no-shows by adapting resource allocation policies. However, given the uncertainty of patient behaviour, generating relevant information regarding no-show probabilities could support the decision-making process for both approaches. In this context many researchers have used multiple regression models to identify patient and appointment characteristics than can be used as good predictors for no-show probabilities. This work develops a Decision Support System (DSS) to support the implementation of strategies to encourage attendance, for a preventive care program targeted at underserved communities in Bogotá, Colombia. Our contribution to literature is threefold. Firstly, we assess the effectiveness of different machine learning approaches to improve the accuracy of regression models. In particular, Random Forest and Neural Networks are used to model the problem accounting for non-linearity and variable interactions. Secondly, we propose a novel use of Layer-wise Relevance Propagation in order to improve the explainability of neural network predictions and obtain insights from the modelling step. Thirdly, we identify variables explaining no-show probabilities in a develo** context and study its policy implications and potential for improving healthcare access. In addition to quantifying relationships reported in previous studies, we find that income and neighbourhood crime statistics affect no-show probabilities. Our results will support patient prioritization in a pilot behavioural intervention and will inform appointment planning decisions. △ Less

Submitted 10 December, 2020; originally announced December 2020.

Comments: v4 - 26 pages

Journal ref: Decision Support Systems 138: 113398 (2020)

arXiv:2010.15247 [pdf, other]

doi 10.1051/epjconf/202124709025

Fast Rossi-alpha Measurements of Plutonium using Organic Scintillators

Authors: M. Y. Hua, C. A. Bravo, A. T. MacDonald, J. D. Hutchinson, G. E. McKenzie, T. J. Grove, J. M. Goda, A. T. McSpaden, S. D. Clarke, S. A. Pozzi

Abstract: In this work, Rossi-alpha measurements were simultaneously performed with a $^3$He-based detection system and an organic scintillator-based detection system. The assembly is 15 kg of plutonium (93 wt$\%$ $^{239}$Pu) reflected by copper and moderated by lead. The goal of Rossi-alpha measurements is to estimate the prompt neutron decay constant, alpha. Simulations estimate $k_\text{eff}$ = 0.624 and… ▽ More In this work, Rossi-alpha measurements were simultaneously performed with a $^3$He-based detection system and an organic scintillator-based detection system. The assembly is 15 kg of plutonium (93 wt$\%$ $^{239}$Pu) reflected by copper and moderated by lead. The goal of Rossi-alpha measurements is to estimate the prompt neutron decay constant, alpha. Simulations estimate $k_\text{eff}$ = 0.624 and $α$ = 52.3 $\pm$ 2.5 ns for the measured assembly. The organic scintillator system estimated $α$ = 47.4 $\pm$ 2.0 ns, having a 9.37$\%$ error (though the 1.09 standard deviation confidence intervals overlapped). The $^3$He system estimated $α$ = 37 $μ$s. The known slowing down time of the $^3$He system is 35-40 $μ$s, which means the slowing down time dominates and obscures the prompt neutron decay constant. Subsequently, the organic scintillator system should be used for assemblies with alpha much less than 35 $μ$s. △ Less

Submitted 13 October, 2020; originally announced October 2020.

Comments: PHYSOR 2020: Transition to a Scalable Nuclear Future Cambridge, United Kingdom, March 29th-April 2nd, 2020

arXiv:2010.13924 [pdf, other]

Benchmarking Deep Learning Interpretability in Time Series Predictions

Authors: Aya Abdelsalam Ismail, Mohamed Gunady, Héctor Corrada Bravo, Soheil Feizi

Abstract: Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their applications to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Re… ▽ More Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their applications to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Recurrent Neural Network, Temporal Convolutional Networks, and Transformers in a new benchmark of synthetic time series data. We propose and report multiple metrics to empirically evaluate the performance of saliency methods for detecting feature importance over time using both precision (i.e., whether identified features contain meaningful signals) and recall (i.e., the number of features with signal identified as important). Through several experiments, we show that (i) in general, network architectures and saliency methods fail to reliably and accurately identify feature importance over time in time series data, (ii) this failure is mainly due to the conflation of time and feature domains, and (iii) the quality of saliency maps can be improved substantially by using our proposed two-step temporal saliency rescaling (TSR) approach that first calculates the importance of each time step before calculating the importance of each feature at a time step. △ Less

Submitted 26 October, 2020; originally announced October 2020.

Journal ref: NeurIPS 2020

arXiv:2010.09559 [pdf, other]

doi 10.1016/j.omega.2021.102520

Multilayer Network Analysis for Improved Credit Risk Prediction

Authors: María Óskarsdóttir, Cristián Bravo

Abstract: We present a multilayer network model for credit risk assessment. Our model accounts for multiple connections between borrowers (such as their geographic location and their economic activity) and allows for explicitly modelling the interaction between connected borrowers. We develop a multilayer personalized PageRank algorithm that allows quantifying the strength of the default exposure of any bor… ▽ More We present a multilayer network model for credit risk assessment. Our model accounts for multiple connections between borrowers (such as their geographic location and their economic activity) and allows for explicitly modelling the interaction between connected borrowers. We develop a multilayer personalized PageRank algorithm that allows quantifying the strength of the default exposure of any borrower in the network. We test our methodology in an agricultural lending framework, where it has been suspected for a long time default correlates between borrowers when they are subject to the same structural risks. Our results show there are significant predictive gains just by including centrality multilayer network information in the model, and these gains are increased by more complex information such as the multilayer PageRank variables. The results suggest default risk is highest when an individual is connected to many defaulters, but this risk is mitigated by the size of the neighbourhood of the individual, showing both default risk and financial stability propagate throughout the network. △ Less

Submitted 26 July, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

Comments: 24 pages, 15 figures. v4 - accepted

Journal ref: Omega 105: 102520 (2021)

arXiv:2010.07085 [pdf, other]

Rossi-alpha Uncertainty Quantification by Analytic, Bootstrap, and Sample Methods to Inform Fitting Best Practices

Authors: M. Y. Hua, C. A. Bravo, R. M. Marchie, J. D. Hutchinson, G. E. McKenzie, S. A. Pozzi

Abstract: The prompt neutron period (the negative reciprocal of the prompt neutron decay constant) can be estimated using the Rossi-alpha technique that is predicated on fitting Rossi-alpha histograms and of interest in nuclear criticality safety and nonproliferation [1, 2, 3]. The histograms are traditionally fit with a one-exponential model; however, recent work has proposed a two-exponential model to acc… ▽ More The prompt neutron period (the negative reciprocal of the prompt neutron decay constant) can be estimated using the Rossi-alpha technique that is predicated on fitting Rossi-alpha histograms and of interest in nuclear criticality safety and nonproliferation [1, 2, 3]. The histograms are traditionally fit with a one-exponential model; however, recent work has proposed a two-exponential model to account for reflector-induced phenomenon [4, 5, 6]. Until recently, the uncertainty quantification for either model was inadequate (inaccurate and demanded large measurement times). Measurement uncertainty quantification by sample and analytic methods was developed and validated in Ref. [7]. The purpose of this transaction is to (i) validate a new bootstrap method by comparing bin-by-bin error bar estimates and (ii) demonstrate how to choose bin widths and reset times to optimize precision and accuracy. △ Less

Submitted 13 October, 2020; originally announced October 2020.

Comments: ANS Winter Meeting 2020

arXiv:2007.06821 [pdf, ps, other]

Branches in the Bruhat-Tits tree for local fields of even characteristic

Authors: Luis Arenas-Carmona, Claudio Bravo

Abstract: We extend our previous computations for the relative positions of branches of quaternions to the case of local fields of even characteristic. This is a key step to understand the set of maximal orders containing a given suborder, which is useful, for instance, to compute relative spinor images, thus solving the selectivity problem. In our previous work, the results where given in terms of the quad… ▽ More We extend our previous computations for the relative positions of branches of quaternions to the case of local fields of even characteristic. This is a key step to understand the set of maximal orders containing a given suborder, which is useful, for instance, to compute relative spinor images, thus solving the selectivity problem. In our previous work, the results where given in terms of the quadratic defect. In the present context, we introduce and characterize an analogous concept for Artin-Schreier extensions. It is no longer useful to restrict our attention to orders generated by pure quaternions, as a separable quadratic extension contains no non-trivial element of null trace. In this work we state our result for an arbitrary pair of generators, for which we discuss a more general version of the Hilbert symbol in this context. △ Less

Submitted 14 July, 2020; originally announced July 2020.

MSC Class: 11S45

arXiv:2005.14658 [pdf, other]

doi 10.1016/j.eswa.2020.114486

Super-App Behavioral Patterns in Credit Risk Models: Financial, Statistical and Regulatory Implications

Authors: Luisa Roa, Alejandro Correa-Bahnsen, Gabriel Suarez, Fernando Cortés-Tejada, María A. Luque, Cristián Bravo

Abstract: In this paper we present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models. These alternative data sources have shown themselves to be immensely powerful in predicting borrower behavior in segments traditionally underserved by banks and financial institutions. Our results, validated across two countries,… ▽ More In this paper we present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models. These alternative data sources have shown themselves to be immensely powerful in predicting borrower behavior in segments traditionally underserved by banks and financial institutions. Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals, who are also the most likely to engage with alternative lenders. Furthermore, using the TreeSHAP method for Stochastic Gradient Boosting interpretation, our results also revealed interesting non-linear trends in the variables originating from the app, which would not normally be available to traditional banks. Our results represent an opportunity for technology companies to disrupt traditional banking by correctly identifying alternative data sources and handling this new information properly. At the same time alternative data must be carefully validated to overcome regulatory hurdles across diverse jurisdictions. △ Less

Submitted 4 January, 2021; v1 submitted 8 May, 2020; originally announced May 2020.

Comments: Accepted - v2. 25 pages

Journal ref: Expert Systems with Applications: 114486 (2020)

arXiv:2005.12418 [pdf, other]

Evolution of Credit Risk Using a Personalized Pagerank Algorithm for Multilayer Networks

Authors: Cristián Bravo, María Óskarsdóttir

Abstract: In this paper we present a novel algorithm to study the evolution of credit risk across complex multilayer networks. Pagerank-like algorithms allow for the propagation of an influence variable across single networks, and allow quantifying the risk single entities (nodes) are subject to given the connection they have to other nodes in the network. Multilayer networks, on the other hand, are network… ▽ More In this paper we present a novel algorithm to study the evolution of credit risk across complex multilayer networks. Pagerank-like algorithms allow for the propagation of an influence variable across single networks, and allow quantifying the risk single entities (nodes) are subject to given the connection they have to other nodes in the network. Multilayer networks, on the other hand, are networks where subset of nodes can be associated to a unique set (layer), and where edges connect elements either intra or inter networks. Our personalized PageRank algorithm for multilayer networks allows for quantifying how credit risk evolves across time and propagates through these networks. By using bipartite networks in each layer, we can quantify the risk of various components, not only the loans. We test our method in an agricultural lending dataset, and our results show how default risk is a challenging phenomenon that propagates and evolves through the network across time. △ Less

Submitted 10 August, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

Comments: Conference camera-ready paper - accepted at KDD MLF 2020. 15 pages, 10 figures

Journal ref: Proceedings of the Third KDD Workshop on Machine Learning in Finance, joint with 26th ACM SIGKDD Conference on Knowledge Discovery in Databases (KDD MLF 2020). ACM, New York, NY, USA, 8 pages

arXiv:2003.08964 [pdf, other]

doi 10.1016/j.ejor.2021.03.008

The value of text for small business default prediction: A deep learning approach

Authors: Matthew Stevenson, Christophe Mues, Cristián Bravo

Abstract: Compared to consumer lending, Micro, Small and Medium Enterprise (mSME) credit risk modelling is particularly challenging, as, often, the same sources of information are not available. Therefore, it is standard policy for a loan officer to provide a textual loan assessment to mitigate limited data availability. In turn, this statement is analysed by a credit expert alongside any available standard… ▽ More Compared to consumer lending, Micro, Small and Medium Enterprise (mSME) credit risk modelling is particularly challenging, as, often, the same sources of information are not available. Therefore, it is standard policy for a loan officer to provide a textual loan assessment to mitigate limited data availability. In turn, this statement is analysed by a credit expert alongside any available standard credit data. In our paper, we exploit recent advances from the field of Deep Learning and Natural Language Processing (NLP), including the BERT (Bidirectional Encoder Representations from Transformers) model, to extract information from 60 000 textual assessments provided by a lender. We consider the performance in terms of the AUC (Area Under the receiver operating characteristic Curve) and Brier Score metrics and find that the text alone is surprisingly effective for predicting default. However, when combined with traditional data, it yields no additional predictive capability, with performance dependent on the text's length. Our proposed deep learning model does, however, appear to be robust to the quality of the text and therefore suitable for partly automating the mSME lending process. We also demonstrate how the content of loan assessments influences performance, leading us to a series of recommendations on a new strategy for collecting future mSME loan assessments. △ Less

Submitted 7 July, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

Comments: 25 pages, 12 figures. v4 - Accepted

Journal ref: European Journal of Operational Research 295 (2): 758-771 (2021)

arXiv:2002.09931 [pdf, other]

doi 10.1016/j.asoc.2018.10.004

The Value of Big Data for Credit Scoring: Enhancing Financial Inclusion using Mobile Phone Data and Social Network Analytics

Authors: María Óskarsdóttir, Cristián Bravo, Carlos Sarraute, Jan Vanthienen, Bart Baesens

Abstract: Credit scoring is without a doubt one of the oldest applications of analytics. In recent years, a multitude of sophisticated classification techniques have been developed to improve the statistical performance of credit scoring models. Instead of focusing on the techniques themselves, this paper leverages alternative data sources to enhance both statistical and economic model performance. The stud… ▽ More Credit scoring is without a doubt one of the oldest applications of analytics. In recent years, a multitude of sophisticated classification techniques have been developed to improve the statistical performance of credit scoring models. Instead of focusing on the techniques themselves, this paper leverages alternative data sources to enhance both statistical and economic model performance. The study demonstrates how including call networks, in the context of positive credit information, as a new Big Data source has added value in terms of profit by applying a profit measure and profit-based feature selection. A unique combination of datasets, including call-detail records, credit and debit account information of customers is used to create scorecards for credit card applicants. Call-detail records are used to build call networks and advanced social network analytics techniques are applied to propagate influence from prior defaulters throughout the network to produce influence scores. The results show that combining call-detail records with traditional data in credit scoring models significantly increases their performance when measured in AUC. In terms of profit, the best model is the one built with only calling behavior features. In addition, the calling behavior features are the most predictive in other models, both in terms of statistical and economic performance. The results have an impact in terms of ethical use of call-detail records, regulatory implications, financial inclusion, as well as data sharing and privacy. △ Less

Submitted 23 February, 2020; originally announced February 2020.

Journal ref: Applied Soft Computing, Volume 74, January 2019, Pages 26-39

arXiv:2002.03419 [pdf, other]

The Alzheimer's Disease Prediction Of Longitudinal Evolution (TADPOLE) Challenge: Results after 1 Year Follow-up

Authors: Razvan V. Marinescu, Neil P. Oxtoby, Alexandra L. Young, Esther E. Bron, Arthur W. Toga, Michael W. Weiner, Frederik Barkhof, Nick C. Fox, Arman Eshaghi, Tina Toni, Marcin Salaterski, Veronika Lunina, Manon Ansart, Stanley Durrleman, Pascal Lu, Samuel Iddi, Dan Li, Wesley K. Thompson, Michael C. Donohue, Aviv Nahon, Yarden Levy, Dan Halbersberg, Mariya Cohen, Huiling Liao, Tengfei Li , et al. (71 additional authors not shown)

Abstract: We present the findings of "The Alzheimer's Disease Prediction Of Longitudinal Evolution" (TADPOLE) Challenge, which compared the performance of 92 algorithms from 33 international teams at predicting the future trajectory of 219 individuals at risk of Alzheimer's disease. Challenge participants were required to make a prediction, for each month of a 5-year future time period, of three key outcome… ▽ More We present the findings of "The Alzheimer's Disease Prediction Of Longitudinal Evolution" (TADPOLE) Challenge, which compared the performance of 92 algorithms from 33 international teams at predicting the future trajectory of 219 individuals at risk of Alzheimer's disease. Challenge participants were required to make a prediction, for each month of a 5-year future time period, of three key outcomes: clinical diagnosis, Alzheimer's Disease Assessment Scale Cognitive Subdomain (ADAS-Cog13), and total volume of the ventricles. The methods used by challenge participants included multivariate linear regression, machine learning methods such as support vector machines and deep neural networks, as well as disease progression models. No single submission was best at predicting all three outcomes. For clinical diagnosis and ventricle volume prediction, the best algorithms strongly outperform simple baselines in predictive ability. However, for ADAS-Cog13 no single submitted prediction method was significantly better than random guesswork. Two ensemble methods based on taking the mean and median over all predictions, obtained top scores on almost all tasks. Better than average performance at diagnosis prediction was generally associated with the additional inclusion of features from cerebrospinal fluid (CSF) samples and diffusion tensor imaging (DTI). On the other hand, better performance at ventricle volume prediction was associated with inclusion of summary statistics, such as the slope or maxima/minima of biomarkers. TADPOLE's unique results suggest that current prediction algorithms provide sufficient accuracy to exploit biomarkers related to clinical diagnosis and ventricle volume, for cohort refinement in clinical trials for Alzheimer's disease. However, results call into question the usage of cognitive test scores for patient selection and as a primary endpoint in clinical trials. △ Less

Submitted 27 December, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

Comments: Presents final results of the TADPOLE competition. 60 pages, 7 tables, 14 figures

Journal ref: Machine Learning for Biomedical Imaging (MELBA), Dec 2021

arXiv:2001.10994 [pdf, other]

Credit Scoring for Good: Enhancing Financial Inclusion with Smartphone-Based Microlending

Authors: María Óskarsdóttir, Cristián Bravo, Carlos Sarraute, Bart Baesens, Jan Vanthienen

Abstract: Globally, two billion people and more than half of the poorest adults do not use formal financial services. Consequently, there is increased emphasis on develo** financial technology that can facilitate access to financial products for the unbanked. In this regard, smartphone-based microlending has emerged as a potential solution to enhance financial inclusion. We propose a methodology to impr… ▽ More Globally, two billion people and more than half of the poorest adults do not use formal financial services. Consequently, there is increased emphasis on develo** financial technology that can facilitate access to financial products for the unbanked. In this regard, smartphone-based microlending has emerged as a potential solution to enhance financial inclusion. We propose a methodology to improve the predictive performance of credit scoring models used by these applications. Our approach is composed of several steps, where we mostly focus on engineering appropriate features from the user data. Thereby, we construct pseudo-social networks to identify similar people and combine complex network analysis with representation learning. Subsequently we build credit scoring models using advanced machine learning techniques with the goal of obtaining the most accurate credit scores, while also taking into consideration ethical and privacy regulations to avoid unfair discrimination. A successful deployment of our proposed methodology could improve the performance of microlending smartphone applications and help enhance financial wellbeing worldwide. △ Less

Submitted 29 January, 2020; originally announced January 2020.

Comments: Thirty Ninth International Conference on Information Systems (ICIS), December 14, 2018, San Francisco, USA

arXiv:2001.06701 [pdf, other]

doi 10.1016/j.eswa.2017.05.028

Social Network Analytics for Churn Prediction in Telco: Model Building, Evaluation and Network Architecture

Authors: María Óskarsdóttir, Cristián Bravo, Wouter Verbeke, Carlos Sarraute, Bart Baesens, Jan Vanthienen

Abstract: Social network analytics methods are being used in the telecommunication industry to predict customer churn with great success. In particular it has been shown that relational learners adapted to this specific problem enhance the performance of predictive models. In the current study we benchmark different strategies for constructing a relational learner by applying them to a total of eight dist… ▽ More Social network analytics methods are being used in the telecommunication industry to predict customer churn with great success. In particular it has been shown that relational learners adapted to this specific problem enhance the performance of predictive models. In the current study we benchmark different strategies for constructing a relational learner by applying them to a total of eight distinct call-detail record datasets, originating from telecommunication organizations across the world. We statistically evaluate the effect of relational classifiers and collective inference methods on the predictive power of relational learners, as well as the performance of models where relational learners are combined with traditional methods of predicting customer churn in the telecommunication industry. Finally we investigate the effect of network construction on model performance; our findings imply that the definition of edges and weights in the network does have an impact on the results of the predictive models. As a result of the study, the best configuration is a non-relational learner enriched with network variables, without collective inference, using binary weights and undirected networks. In addition, we provide guidelines on how to apply social networks analytics for churn prediction in the telecommunication industry in an optimal way, ranging from network architecture to model building and evaluation. △ Less

Submitted 18 January, 2020; originally announced January 2020.

Journal ref: Expert Systems with Applications, Volume 85, 1 November 2017, Pages 204-220

arXiv:2001.06700 [pdf, other]

doi 10.1109/ASONAM.2016.7752384

A Comparative Study of Social Network Classifiers for Predicting Churn in the Telecommunication Industry

Authors: Maria Óskarsdóttir, Cristián Bravo, Wouter Verbeke, Carlos Sarraute, Bart Baesens, Jan Vanthienen

Abstract: Relational learning in networked data has been shown to be effective in a number of studies. Relational learners, composed of relational classifiers and collective inference methods, enable the inference of nodes in a network given the existence and strength of links to other nodes. These methods have been adapted to predict customer churn in telecommunication companies showing that incorporating… ▽ More Relational learning in networked data has been shown to be effective in a number of studies. Relational learners, composed of relational classifiers and collective inference methods, enable the inference of nodes in a network given the existence and strength of links to other nodes. These methods have been adapted to predict customer churn in telecommunication companies showing that incorporating them may give more accurate predictions. In this research, the performance of a variety of relational learners is compared by applying them to a number of CDR datasets originating from the telecommunication industry, with the goal to rank them as a whole and investigate the effects of relational classifiers and collective inference methods separately. Our results show that collective inference methods do not improve the performance of relational classifiers and the best performing relational classifier is the network-only link-based classifier, which builds a logistic model using link-based measures for the nodes in the network. △ Less

Submitted 18 January, 2020; originally announced January 2020.

Comments: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

arXiv:1910.12370 [pdf, other]

Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks

Authors: Aya Abdelsalam Ismail, Mohamed Gunady, Luiz Pessoa, Héctor Corrada Bravo, Soheil Feizi

Abstract: Recent efforts to improve the interpretability of deep neural networks use saliency to characterize the importance of input features to predictions made by models. Work on interpretability using saliency-based methods on Recurrent Neural Networks (RNNs) has mostly targeted language tasks, and their applicability to time series data is less understood. In this work we analyze saliency-based methods… ▽ More Recent efforts to improve the interpretability of deep neural networks use saliency to characterize the importance of input features to predictions made by models. Work on interpretability using saliency-based methods on Recurrent Neural Networks (RNNs) has mostly targeted language tasks, and their applicability to time series data is less understood. In this work we analyze saliency-based methods for RNNs, both classical and gated cell architectures. We show that RNN saliency vanishes over time, biasing detection of salient features only to later time steps and are, therefore, incapable of reliably detecting important features at arbitrary time intervals. To address this vanishing saliency problem, we propose a novel RNN cell structure (input-cell attention), which can extend any RNN cell architecture. At each time step, instead of only looking at the current input vector, input-cell attention uses a fixed-size matrix embedding, each row of the matrix attending to different inputs from current or previous time steps. Using synthetic data, we show that the saliency map produced by the input-cell attention RNN is able to faithfully detect important features regardless of their occurrence in time. We also apply the input-cell attention RNN on a neuroscience task analyzing functional Magnetic Resonance Imaging (fMRI) data for human subjects performing a variety of tasks. In this case, we use saliency to characterize brain regions (input features) for which activity is important to distinguish between tasks. We show that standard RNN architectures are only capable of detecting important brain regions in the last few time steps of the fMRI data, while the input-cell attention model is able to detect important brain region activity across time without latter time step biases. △ Less

Submitted 27 October, 2019; originally announced October 2019.

Journal ref: Neurips 2019

arXiv:1910.04886 [pdf, other]

The Heavy Photon Search Experiment

Authors: Cameron Bravo

Abstract: The Heavy Photon Search (HPS) experiment searches for an electro-produced dark photon using an electron beam provided by the CEBAF accelerator at the Thomas Jefferson National Accelerator Facility. HPS has successfully completed two engineering runs. In 2015 using a 1.056 GeV, 50 nA electron beam, 1.7 days (10 mC) of data was obtained and 5.4 days (92.5 mC) of data was collected in 2016 using a 2.… ▽ More The Heavy Photon Search (HPS) experiment searches for an electro-produced dark photon using an electron beam provided by the CEBAF accelerator at the Thomas Jefferson National Accelerator Facility. HPS has successfully completed two engineering runs. In 2015 using a 1.056 GeV, 50 nA electron beam, 1.7 days (10 mC) of data was obtained and 5.4 days (92.5 mC) of data was collected in 2016 using a 2.3 GeV, 200 nA electron beam. In addition, HPS will complete its first physics run in the summer of 2019. HPS looks for dark photons through two distinct methods, a resonance search in the $e^{+}e^{-}$ invariant mass distribution above the large QED background (large dark photon-SM particles coupling region) and a displaced vertex search for long-lived dark photons (small coupling region). HPS employs a compact spectrometer, matched to the forward kinematic characteristics of A$^\prime$ electro-production. The detector consists of a silicon tracker for momentum analysis and vertexing and a lead tungstate (PbWO$_4$) electromagnetic calorimeter for particle ID and triggering. Both analyses are complete for the 2015 engineering run and demonstrate the full functionality of the experiment that will probe hitherto unexplored parameter space with higher luminosity runs. Results from the 2015 dataset will be presented as well as an update on 2016 analysis and the status of the 2019 physics run. △ Less

Submitted 10 October, 2019; originally announced October 2019.

Comments: Talk presented at the 2019 Meeting of the Division of Particles and Fields of the American Physical Society (DPF2019), July 29 - August 2, 2019, Northeastern University, Boston, C1907293

arXiv:1905.08244 [pdf, ps, other]

On genera containing non-split Eichler orders over function fields

Authors: Luis Arenas-Carmona, Claudio Bravo

Abstract: Grothendieck-Birkhoff Theorem states that every finite dimensional vector bundle over the projective line P1 splits as the sum of one dimensional vector bundles. This can be rephrased, in terms of orders, as stating that all maximal orders over the projective line in a matrix algebra split. In this work we study the extent to which this result can be generalized to Eichler orders when the base fie… ▽ More Grothendieck-Birkhoff Theorem states that every finite dimensional vector bundle over the projective line P1 splits as the sum of one dimensional vector bundles. This can be rephrased, in terms of orders, as stating that all maximal orders over the projective line in a matrix algebra split. In this work we study the extent to which this result can be generalized to Eichler orders when the base field F is finite. To be precise, we characterize both the genera of Eichler orders containing only split orders and the genera containing only a finite number of non-split conjugacy classes. The latter characterization is given for arbitrary projective curves over F. The method developed here also allows us to compute quotient graphs for some subgroups of $PGL_2(F[t])$ of arithmetical interest. This paper includes material from the unpublished work "Simultaneous diagonalization of vector bundles". △ Less

Submitted 22 May, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1610.07716 It includes material from that (unpublished) paper

MSC Class: 14H60-11R58 (primary); 14G15-20E08 (secondary)

arXiv:1805.02786 [pdf, other]

doi 10.1007/JHEP11(2018)041

BaryoGEN, a Monte Carlo Generator for Sphaleron-Like Transitions in Proton-Proton Collisions

Authors: Cameron Bravo, Jay Hauser

Abstract: Sphaleron and instanton solutions of the Standard Model provide violation of baryon and lepton numbers and could lead to spectacular events at the LHC or future colliders. Certain models of new physics can also lead to sphaleron-like vacuum transitions. This nonperturbative physics could be relevant to the generation of the matter-antimatter asymmetry of the universe. We have developed BaryoGEN, a… ▽ More Sphaleron and instanton solutions of the Standard Model provide violation of baryon and lepton numbers and could lead to spectacular events at the LHC or future colliders. Certain models of new physics can also lead to sphaleron-like vacuum transitions. This nonperturbative physics could be relevant to the generation of the matter-antimatter asymmetry of the universe. We have developed BaryoGEN, an event generator that facilitates the exploration of sphaleron-like transitions in proton-proton collisions with minimal assumptions. BaryoGEN outputs standard Les Houches Event files that can be processed by PYTHIA, and the code is publicly available. We also discuss various approaches to experimental searches for such transitions in proton-proton collisions. △ Less

Submitted 20 July, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

arXiv:1804.06776 [pdf, other]

Improving Long-Horizon Forecasts with Expectation-Biased LSTM Networks

Authors: Aya Abdelsalam Ismail, Timothy Wood, Héctor Corrada Bravo

Abstract: State-of-the-art forecasting methods using Recurrent Neural Net- works (RNN) based on Long-Short Term Memory (LSTM) cells have shown exceptional performance targeting short-horizon forecasts, e.g given a set of predictor features, forecast a target value for the next few time steps in the future. However, in many applica- tions, the performance of these methods decays as the forecasting horizon ex… ▽ More State-of-the-art forecasting methods using Recurrent Neural Net- works (RNN) based on Long-Short Term Memory (LSTM) cells have shown exceptional performance targeting short-horizon forecasts, e.g given a set of predictor features, forecast a target value for the next few time steps in the future. However, in many applica- tions, the performance of these methods decays as the forecasting horizon extends beyond these few time steps. This paper aims to explore the challenges of long-horizon forecasting using LSTM networks. Here, we illustrate the long-horizon forecasting problem in datasets from neuroscience and energy supply management. We then propose expectation-biasing, an approach motivated by the literature of Dynamic Belief Networks, as a solution to improve long-horizon forecasting using LSTMs. We propose two LSTM ar- chitectures along with two methods for expectation biasing that significantly outperforms standard practice. △ Less

Submitted 18 April, 2018; originally announced April 2018.

arXiv:1712.01463 [pdf, ps, other]

On the missing branches of the Bruhat-Tits tree

Authors: Luis Arenas-Carmona, Claudio Bravo

Abstract: Let k be a local field and let A be the two-by-two matrix algebra over k. In our previous work we developed a theory that allows the computation of the set of maximal orders in A containing a given suborder. This set is given as a sub-tree of the Bruhat-Tits tree that is called the branch of the order. Branches have been used to study the global selectivity problem and also to compute local embedd… ▽ More Let k be a local field and let A be the two-by-two matrix algebra over k. In our previous work we developed a theory that allows the computation of the set of maximal orders in A containing a given suborder. This set is given as a sub-tree of the Bruhat-Tits tree that is called the branch of the order. Branches have been used to study the global selectivity problem and also to compute local embedding numbers. They can usually be described in terms of two invariants. To compute these invariants explicitly, the strategy in our past work has been visualizing branches through the explicit representation of the Bruhat-Tits tree in terms of balls in k. This is easier for orders spanning a split commutative sub-algebra, i.e., an algebra isomorphic to (k x k). In the present work, we develop a theory of branches over field extension that can be used to extend our previous computations to orders spanning a field. We use the same idea to compute branches for orders generated by arbitrary pairs of non-nilpotent pure quaternions. In fact, the hypotheses on the generators are not essential. △ Less

Submitted 22 May, 2019; v1 submitted 4 December, 2017; originally announced December 2017.

Comments: This article will be published under the title "Computing embedding numbers and branches of orders via extensions of the Bruhat-Tits tree"

MSC Class: 11S45; 11R52; 16G30

Showing 1–50 of 52 results for author: Bravo, C