Search | arXiv e-print repository

Towards Enhanced Local Explainability of Random Forests: a Proximity-Based Approach

Authors: Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

Abstract: We initiate a novel approach to explain the out of sample performance of random forest (RF) models by exploiting the fact that any RF can be formulated as an adaptive weighted K nearest-neighbors model. Specifically, we use the proximity between points in the feature space learned by the RF to re-write random forest predictions exactly as a weighted average of the target labels of training data po… ▽ More We initiate a novel approach to explain the out of sample performance of random forest (RF) models by exploiting the fact that any RF can be formulated as an adaptive weighted K nearest-neighbors model. Specifically, we use the proximity between points in the feature space learned by the RF to re-write random forest predictions exactly as a weighted average of the target labels of training data points. This linearity facilitates a local notion of explainability of RF predictions that generates attributions for any model prediction across observations in the training set, and thereby complements established methods like SHAP, which instead generates attributions for a model prediction across dimensions of the feature space. We demonstrate this approach in the context of a bond pricing model trained on US corporate bond trades, and compare our approach to various existing approaches to model explainability. △ Less

Submitted 18 October, 2023; originally announced October 2023.

Comments: 5 pages, 6 figures

arXiv:2310.10760 [pdf, other]

Towards reducing hallucination in extracting information from financial reports using Large Language Models

Authors: Bhaskarjit Sarmah, Tianjie Zhu, Dhagash Mehta, Stefano Pasquali

Abstract: For a financial analyst, the question and answer (Q\&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions. However, extracting valuable insights from the Q\&A section has posed considerable challenges as the conventional methods such as detailed reading and note-taking lack scalability and are susceptible to human errors, and Op… ▽ More For a financial analyst, the question and answer (Q\&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions. However, extracting valuable insights from the Q\&A section has posed considerable challenges as the conventional methods such as detailed reading and note-taking lack scalability and are susceptible to human errors, and Optical Character Recognition (OCR) and similar techniques encounter difficulties in accurately processing unstructured transcript text, often missing subtle linguistic nuances that drive investor decisions. Here, we demonstrate the utilization of Large Language Models (LLMs) to efficiently and rapidly extract information from earnings report transcripts while ensuring high accuracy transforming the extraction process as well as reducing hallucination by combining retrieval-augmented generation technique as well as metadata. We evaluate the outcomes of various LLMs with and without using our proposed approach based on various objective metrics for evaluating Q\&A systems, and empirically demonstrate superiority of our method. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: 4 pages + references. Accepted for publication in Workshop on Generative AI at the 3rd International Conference on AI-ML Systems 2023, Bengaluru, India

arXiv:2207.07183 [pdf, other]

Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning

Authors: Bhaskarjit Sarmah, Nayana Nair, Dhagash Mehta, Stefano Pasquali

Abstract: Understanding non-linear relationships among financial instruments has various applications in investment processes ranging from risk management, portfolio construction and trading strategies. Here, we focus on interconnectedness among stocks based on their correlation matrix which we represent as a network with the nodes representing individual stocks and the weighted links between pairs of nodes… ▽ More Understanding non-linear relationships among financial instruments has various applications in investment processes ranging from risk management, portfolio construction and trading strategies. Here, we focus on interconnectedness among stocks based on their correlation matrix which we represent as a network with the nodes representing individual stocks and the weighted links between pairs of nodes representing the corresponding pair-wise correlation coefficients. The traditional network science techniques, which are extensively utilized in financial literature, require handcrafted features such as centrality measures to understand such correlation networks. However, manually enlisting all such handcrafted features may quickly turn out to be a daunting task. Instead, we propose a new approach for studying nuances and relationships within the correlation network in an algorithmic way using a graph machine learning algorithm called Node2Vec. In particular, the algorithm compresses the network into a lower dimensional continuous space, called an embedding, where pairs of nodes that are identified as similar by the algorithm are placed closer to each other. By using log returns of S&P 500 stock data, we show that our proposed algorithm can learn such an embedding from its correlation network. We define various domain specific quantitative (and objective) and qualitative metrics that are inspired by metrics used in the field of Natural Language Processing (NLP) to evaluate the embeddings in order to identify the optimal one. Further, we discuss various applications of the embeddings in investment management. △ Less

Submitted 14 July, 2022; originally announced July 2022.

Comments: 8 pages, 2 column format, 3 figure, 7 tables

arXiv:1011.3738 [pdf, ps, other]

Gravitational Waves from the $r$-mode instability of neutron stars: effect of magnetic field

Authors: Bhim Prasad Sarmah, H. L. Duorah

Abstract: Studies have shown that emission of gravitational wave drives an instability in the $r$-modes of young rapidly rotating neutron stars carrying away most of the angular momentum through gravitational wave emission in the first year or so after their formation. Magnetic field plays a crucial role in the evolution of these $r$-modes and hence the evolution of the neutron star itself. An attempt is ma… ▽ More Studies have shown that emission of gravitational wave drives an instability in the $r$-modes of young rapidly rotating neutron stars carrying away most of the angular momentum through gravitational wave emission in the first year or so after their formation. Magnetic field plays a crucial role in the evolution of these $r$-modes and hence the evolution of the neutron star itself. An attempt is made here to investigate the role of magnetic field in the evolution of $r$-mode instability and detectibility of gravitational waves emitted by a newly born, hot and rapidly and differentially rotating neutron star. It is found that magnetic field tend to suppress the $r$-mode amplitude. The {\it signal-to-noise ratio} analysis shows that gravitational waves emitted from the $r$-mode instability from neutron stars with magnetic fields upto the order of $10^{14}$ gauss may be detectable by the Advanced LIGO at 20 Mpc. △ Less

Submitted 16 November, 2010; originally announced November 2010.

Comments: 16 pages, 27 figures

arXiv:gr-qc/0510018 [pdf, ps, other]

doi 10.1111/j.1365-2966.2006.10262.x

On searches for gravitational waves from mini creation event by laser interferometric detectors

Authors: Bhim Prasad Sarmah, S. K. Banerjee, S. V. Dhurandhar, J. V. Narlikar

Abstract: As an alternative view to the standard big bang cosmology the quasi-steady state cosmology(QSSC) argues that the universe was not created in a single great explosion; it neither had a beginning nor will it ever come to an end. The creation of new matter in the universe is a regular feature occurring through finite explosive events. Each creation event is called a mini-bang or, a mini creation ev… ▽ More As an alternative view to the standard big bang cosmology the quasi-steady state cosmology(QSSC) argues that the universe was not created in a single great explosion; it neither had a beginning nor will it ever come to an end. The creation of new matter in the universe is a regular feature occurring through finite explosive events. Each creation event is called a mini-bang or, a mini creation event(MCE). Gravitational waves are expected to be generated due to any anisotropy present in this process of creation. Mini creation event ejecting matter in two oppositely directed jets is thus a source of gravitational waves which can in principle be detected by laser interferometric detectors. In the present work we consider the gravitational waveforms propagated by linear jets and then estimate the response of laser interferometric detectors like LIGO and LISA. △ Less

Submitted 5 October, 2005; originally announced October 2005.

Journal ref: Mon.Not.Roy.Astron.Soc.369:89-96,2006

Showing 1–5 of 5 results for author: Sarmah, B