Skip to main content

Showing 1–10 of 10 results for author: Mehta, D

Searching in archive q-fin. Search in all archives.
.
  1. arXiv:2310.12428  [pdf, other

    stat.ML cs.AI cs.LG q-fin.ST stat.ME

    Towards Enhanced Local Explainability of Random Forests: a Proximity-Based Approach

    Authors: Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

    Abstract: We initiate a novel approach to explain the out of sample performance of random forest (RF) models by exploiting the fact that any RF can be formulated as an adaptive weighted K nearest-neighbors model. Specifically, we use the proximity between points in the feature space learned by the RF to re-write random forest predictions exactly as a weighted average of the target labels of training data po… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 5 pages, 6 figures

  2. arXiv:2310.10760  [pdf, other

    cs.CL q-fin.PM q-fin.ST stat.AP

    Towards reducing hallucination in extracting information from financial reports using Large Language Models

    Authors: Bhaskarjit Sarmah, Tianjie Zhu, Dhagash Mehta, Stefano Pasquali

    Abstract: For a financial analyst, the question and answer (Q\&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions. However, extracting valuable insights from the Q\&A section has posed considerable challenges as the conventional methods such as detailed reading and note-taking lack scalability and are susceptible to human errors, and Op… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 4 pages + references. Accepted for publication in Workshop on Generative AI at the 3rd International Conference on AI-ML Systems 2023, Bengaluru, India

  3. arXiv:2308.08031  [pdf, other

    q-fin.ST q-fin.CP stat.AP

    Company Similarity using Large Language Models

    Authors: Dimitrios Vamvourellis, Máté Toth, Snigdha Bhagat, Dhruv Desai, Dhagash Mehta, Stefano Pasquali

    Abstract: Identifying companies with similar profiles is a core task in finance with a wide range of applications in portfolio construction, asset pricing and risk attribution. When a rigorous definition of similarity is lacking, financial analysts usually resort to 'traditional' industry classifications such as Global Industry Classification System (GICS) which assign a unique category to each company at d… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 8 pages, 2 figures, 2 tables

  4. arXiv:2308.06882  [pdf, other

    q-fin.ST cs.LG q-fin.CP stat.AP

    Quantifying Outlierness of Funds from their Categories using Supervised Similarity

    Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

    Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. H… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 tables, 8 figures

  5. arXiv:2207.07183  [pdf, other

    q-fin.CP q-fin.ST stat.AP

    Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning

    Authors: Bhaskarjit Sarmah, Nayana Nair, Dhagash Mehta, Stefano Pasquali

    Abstract: Understanding non-linear relationships among financial instruments has various applications in investment processes ranging from risk management, portfolio construction and trading strategies. Here, we focus on interconnectedness among stocks based on their correlation matrix which we represent as a network with the nodes representing individual stocks and the weighted links between pairs of nodes… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: 8 pages, 2 column format, 3 figure, 7 tables

  6. arXiv:2207.04959  [pdf, other

    q-fin.CP q-fin.ST stat.ML

    Learning Mutual Fund Categorization using Natural Language Processing

    Authors: Dimitrios Vamvourellis, Mate Attila Toth, Dhruv Desai, Dhagash Mehta, Stefano Pasquali

    Abstract: Categorization of mutual funds or Exchange-Traded-funds (ETFs) have long served the financial analysts to perform peer analysis for various purposes starting from competitor analysis, to quantifying portfolio diversification. The categorization methodology usually relies on fund composition data in the structured format extracted from the Form N-1A. Here, we initiate a study to learn the categoriz… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: 8 pages, 5 figures, 2-column format

  7. arXiv:2207.04368  [pdf, other

    q-fin.CP q-fin.ST q-fin.TR

    Supervised similarity learning for corporate bonds using Random Forest proximities

    Authors: Jerinsh Jeyapaulraj, Dhruv Desai, Peter Chu, Dhagash Mehta, Stefano Pasquali, Philip Sommer

    Abstract: Financial literature consists of ample research on similarity and comparison of financial assets and securities such as stocks, bonds, mutual funds, etc. However, going beyond correlations or aggregate statistics has been arduous since financial datasets are noisy, lack useful features, have missing data and often lack ground truth or annotated labels. However, though similarity extrapolated from… ▽ More

    Submitted 25 October, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

    Comments: A few minor typos corrected, 1 figure added. Conclusions unchanged. Matching with the accepted version

  8. arXiv:2107.05592  [pdf, other

    q-fin.ST q-fin.CP stat.AP

    Investor Behavior Modeling by Analyzing Financial Advisor Notes: A Machine Learning Perspective

    Authors: Cynthia Pagliaro, Dhagash Mehta, Han-Tai Shiao, Shaofei Wang, Luwei Xiong

    Abstract: Modeling investor behavior is crucial to identifying behavioral coaching opportunities for financial advisors. With the help of natural language processing (NLP) we analyze an unstructured (textual) dataset of financial advisors' summary notes, taken after every investor conversation, to gain first ever insights into advisor-investor interactions. These insights are used to predict investor needs… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: 8 pages, 2 column format, 7 figures+5 tables

  9. arXiv:2106.12987  [pdf, other

    q-fin.ST cs.LG q-fin.CP stat.AP

    Fund2Vec: Mutual Funds Similarity using Graph Learning

    Authors: Vipul Satone, Dhruv Desai, Dhagash Mehta

    Abstract: Identifying similar mutual funds with respect to the underlying portfolios has found many applications in financial services ranging from fund recommender systems, competitors analysis, portfolio analytics, marketing and sales, etc. The traditional methods are either qualitative, and hence prone to biases and often not reproducible, or, are known not to capture all the nuances (non-linearities) am… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 2 column format, 8 pages, 8 figures, 5 tables

  10. arXiv:2006.00123  [pdf, other

    q-fin.ST cs.LG q-fin.CP stat.ML

    Machine Learning Fund Categorizations

    Authors: Dhagash Mehta, Dhruv Desai, Jithin Pradeep

    Abstract: Given the surge in popularity of mutual funds (including exchange-traded funds (ETFs)) as a diversified financial investment, a vast variety of mutual funds from various investment management firms and diversification strategies have become available in the market. Identifying similar mutual funds among such a wide landscape of mutual funds has become more important than ever because of many appli… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: 8 pages, 2-column format, 5 figures