-
Managers versus Machines: Do Algorithms Replicate Human Intuition in Credit Ratings?
Authors:
Matthew Harding,
Gabriel F. R. Vasconcelos
Abstract:
We use machine learning techniques to investigate whether it is possible to replicate the behavior of bank managers who assess the risk of commercial loans made by a large commercial US bank. Even though a typical bank already relies on an algorithmic scorecard process to evaluate risk, bank managers are given significant latitude in adjusting the risk score in order to account for other holistic…
▽ More
We use machine learning techniques to investigate whether it is possible to replicate the behavior of bank managers who assess the risk of commercial loans made by a large commercial US bank. Even though a typical bank already relies on an algorithmic scorecard process to evaluate risk, bank managers are given significant latitude in adjusting the risk score in order to account for other holistic factors based on their intuition and experience. We show that it is possible to find machine learning algorithms that can replicate the behavior of the bank managers. The input to the algorithms consists of a combination of standard financials and soft information available to bank managers as part of the typical loan review process. We also document the presence of significant heterogeneity in the adjustment process that can be traced to differences across managers and industries. Our results highlight the effectiveness of machine learning based analytic approaches to banking and the potential challenges to high-skill jobs in the financial sector.
△ Less
Submitted 8 February, 2022;
originally announced February 2022.
-
Predicting Mortality from Credit Reports
Authors:
Giacomo De Giorgi,
Matthew Harding,
Gabriel Vasconcelos
Abstract:
Data on hundreds of variables related to individual consumer finance behavior (such as credit card and loan activity) is routinely collected in many countries and plays an important role in lending decisions. We postulate that the detailed nature of this data may be used to predict outcomes in seemingly unrelated domains such as individual health. We build a series of machine learning models to de…
▽ More
Data on hundreds of variables related to individual consumer finance behavior (such as credit card and loan activity) is routinely collected in many countries and plays an important role in lending decisions. We postulate that the detailed nature of this data may be used to predict outcomes in seemingly unrelated domains such as individual health. We build a series of machine learning models to demonstrate that credit report data can be used to predict individual mortality. Variable groups related to credit cards and various loans, mostly unsecured loans, are shown to carry significant predictive power. Lags of these variables are also significant thus indicating that dynamics also matters. Improved mortality predictions based on consumer finance data can have important economic implications in insurance markets but may also raise privacy concerns.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
Deep learning for lithological classification of carbonate rock micro-CT images
Authors:
Carlos E. M. dos Anjos,
Manuel R. V. Avila,
Adna G. P. Vasconcelos,
Aurea M. P. Neta,
Lizianne C. Medeiros,
Alexandre G. Evsukoff,
Rodrigo Surmas
Abstract:
In addition to the ongoing development, pre-salt carbonate reservoir characterization remains a challenge, primarily due to inherent geological particularities. These challenges stimulate the use of well-established technologies, such as artificial intelligence algorithms, for image classification tasks. Therefore, this work intends to present an application of deep learning techniques to identify…
▽ More
In addition to the ongoing development, pre-salt carbonate reservoir characterization remains a challenge, primarily due to inherent geological particularities. These challenges stimulate the use of well-established technologies, such as artificial intelligence algorithms, for image classification tasks. Therefore, this work intends to present an application of deep learning techniques to identify patterns in Brazilian pre-salt carbonate rock microtomographic images, thus making possible lithological classification. Four convolutional neural network models were proposed. The first model includes three convolutional layers followed by fully connected layers and is used as a base model for the following proposals. In the next two models, we replace the max pooling layer with a spatial pyramid pooling and a global average pooling layer. The last model uses a combination of spatial pyramid pooling followed by global average pooling in place of the last pooling layer. All models are compared using original images, when possible, as well as resized images. The dataset consists of 6,000 images from three different classes. The model performances were evaluated by each image individually, as well as by the most frequently predicted class for each sample. According to accuracy, Model 2 trained on resized images achieved the best results, reaching an average of 75.54% for the first evaluation approach and an average of 81.33% for the second. We developed a workflow to automate and accelerate the lithology classification of Brazilian pre-salt carbonate samples by categorizing microtomographic images using deep learning algorithms in a non-destructive way.
△ Less
Submitted 30 July, 2020;
originally announced July 2020.
-
A Multiple Source Hourglass Deep Network for Multi-Focus Image Fusion
Authors:
Fidel Alejandro Guerrero Peña,
Pedro Diamel Marrero Fernández,
Tsang Ing Ren,
Germano Crispim Vasconcelos,
Alexandre Cunha
Abstract:
Multi-Focus Image Fusion seeks to improve the quality of an acquired burst of images with different focus planes. For solving the task, an activity level measurement and a fusion rule are typically established to select and fuse the most relevant information from the sources. However, the design of this kind of method by hand is really hard and sometimes restricted to solution spaces where the opt…
▽ More
Multi-Focus Image Fusion seeks to improve the quality of an acquired burst of images with different focus planes. For solving the task, an activity level measurement and a fusion rule are typically established to select and fuse the most relevant information from the sources. However, the design of this kind of method by hand is really hard and sometimes restricted to solution spaces where the optimal all-in-focus images are not contained. Then, we propose here two fast and straightforward approaches for image fusion based on deep neural networks. Our solution uses a multiple source Hourglass architecture trained in an end-to-end fashion. Models are data-driven and can be easily generalized for other kinds of fusion problems. A segmentation approach is used for recognition of the focus map, while the weighted average rule is used for fusion. We designed a training loss function for our regression-based fusion function, which allows the network to learn both the activity level measurement and the fusion rule. Experimental results show our approach has comparable results to the state-of-the-art methods with a 60X increase of computational efficiency for 520X520 resolution images.
△ Less
Submitted 28 August, 2019;
originally announced August 2019.
-
BooST: Boosting Smooth Trees for Partial Effect Estimation in Nonlinear Regressions
Authors:
Yuri Fonseca,
Marcelo Medeiros,
Gabriel Vasconcelos,
Alvaro Veiga
Abstract:
In this paper, we introduce a new machine learning (ML) model for nonlinear regression called the Boosted Smooth Transition Regression Trees (BooST), which is a combination of boosting algorithms with smooth transition regression trees. The main advantage of the BooST model is the estimation of the derivatives (partial effects) of very general nonlinear models. Therefore, the model can provide mor…
▽ More
In this paper, we introduce a new machine learning (ML) model for nonlinear regression called the Boosted Smooth Transition Regression Trees (BooST), which is a combination of boosting algorithms with smooth transition regression trees. The main advantage of the BooST model is the estimation of the derivatives (partial effects) of very general nonlinear models. Therefore, the model can provide more interpretation about the map** between the covariates and the dependent variable than other tree-based models, such as Random Forests. We present several examples with both simulated and real data.
△ Less
Submitted 27 July, 2020; v1 submitted 10 August, 2018;
originally announced August 2018.
-
Using UWB for Human Trajectory Extraction
Authors:
Gonçalo Vasconcelos,
Marcelo Petry,
João Emílio Almeida,
Rosaldo J. F. Rossetti,
António Leça Coelho
Abstract:
In this paper we report on a methodology to model pedestrian behaviours whilst aggregate variables are concerned, with potential applications to different situations, such as evacuating a building in emergency events. The approach consists of using UWB (ultra-wide band) based data collection to characterise behaviour in specific scenarios. From a number of experiments carried out, we detail the si…
▽ More
In this paper we report on a methodology to model pedestrian behaviours whilst aggregate variables are concerned, with potential applications to different situations, such as evacuating a building in emergency events. The approach consists of using UWB (ultra-wide band) based data collection to characterise behaviour in specific scenarios. From a number of experiments carried out, we detail the single-file scenario to demonstrate the ability of this approach to represent macroscopic characteristics of the pedestrian flow. Results are discussed and we can conclude that UWB-based data collection shows great potential and suitability for human trajectory extraction, when compared to other traditional approaches.
△ Less
Submitted 15 March, 2013;
originally announced March 2013.