Skip to main content

Showing 1–35 of 35 results for author: Utkin, L V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19897  [pdf, other

    cs.LG stat.ML

    FI-CBL: A Probabilistic Method for Concept-Based Learning with Expert Rules

    Authors: Lev V. Utkin, Andrei V. Konstantinov, Stanislav R. Kirpichenko

    Abstract: A method for solving concept-based learning (CBL) problem is proposed. The main idea behind the method is to divide each concept-annotated image into patches, to transform the patches into embeddings by using an autoencoder, and to cluster the embeddings assuming that each cluster will mainly contain embeddings of patches with certain concepts. To find concepts of a new image, the method implement… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2405.17483  [pdf, other

    eess.IV cs.AI cs.LG

    Concept-based Explainable Malignancy Scoring on Pulmonary Nodules in CT Images

    Authors: Rinat I. Dumaev, Sergei A. Molodyakov, Lev V. Utkin

    Abstract: To increase the transparency of modern computer-aided diagnosis (CAD) systems for assessing the malignancy of lung nodules, an interpretable model based on applying the generalized additive models and the concept-based learning is proposed. The model detects a set of clinically significant attributes in addition to the final malignancy regression score and learns the association between the lung n… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2402.14726  [pdf, other

    cs.LG cs.AI stat.ML

    Incorporating Expert Rules into Neural Networks in the Framework of Concept-Based Learning

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A problem of incorporating the expert rules into machine learning models for extending the concept-based learning is formulated in the paper. It is proposed how to combine logical rules and neural networks predicting the concept probabilities. The first idea behind the combination is to form constraints for a joint probability distribution over all combinations of concept values to satisfy the exp… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  4. arXiv:2402.12331  [pdf, other

    cs.LG cs.AI stat.ML

    Generating Survival Interpretable Trajectories and Data

    Authors: Andrei V. Konstantinov, Stanislav R. Kirpichenko, Lev V. Utkin

    Abstract: A new model for generating survival trajectories and data based on applying an autoencoder of a specific structure is proposed. It solves three tasks. First, it provides predictions in the form of the expected event time and the survival function for a new generated feature vector on the basis of the Beran estimator. Second, the model generates additional data based on a given training set that wo… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  5. arXiv:2401.16294  [pdf, other

    cs.LG cs.AI stat.ML

    Dual feature-based and example-based explanation methods

    Authors: Andrei V. Konstantinov, Boris V. Kozlov, Stanislav R. Kirpichenko, Lev V. Utkin

    Abstract: A new approach to the local and global explanation is proposed. It is based on selecting a convex hull constructed for the finite number of points around an explained instance. The convex hull allows us to consider a dual representation of instances in the form of convex combinations of extreme points of a produced polytope. Instead of perturbing new instances in the Euclidean feature space, vecto… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  6. arXiv:2312.06638  [pdf, other

    cs.LG cs.AI stat.ML

    SurvBeNIM: The Beran-Based Neural Importance Model for Explaining the Survival Models

    Authors: Lev V. Utkin, Danila Y. Eremenko, Andrei V. Konstantinov

    Abstract: A new method called the Survival Beran-based Neural Importance Model (SurvBeNIM) is proposed. It aims to explain predictions of machine learning survival models, which are in the form of survival or cumulative hazard functions. The main idea behind SurvBeNIM is to extend the Beran estimator by incorporating the importance functions into its kernels and by implementing these importance functions as… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  7. arXiv:2308.03730  [pdf, other

    cs.LG cs.AI stat.ML

    SurvBeX: An explanation method of the machine learning survival models based on the Beran estimator

    Authors: Lev V. Utkin, Danila Y. Eremenko, Andrei V. Konstantinov

    Abstract: An explanation method called SurvBeX is proposed to interpret predictions of the machine learning survival black-box models. The main idea behind the method is to use the modified Beran estimator as the surrogate explanation model. Coefficients, incorporated into Beran estimator, can be regarded as values of the feature impacts on the black-box model prediction. Following the well-known LIME metho… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  8. arXiv:2307.10459  [pdf, other

    cs.LG cs.AI stat.ML

    A New Computationally Simple Approach for Implementing Neural Networks with Output Hard Constraints

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A new computationally simple method of imposing hard convex constraints on the neural network output values is proposed. The key idea behind the method is to map a vector of hidden parameters of the network to a point that is guaranteed to be inside the feasible set defined by a set of constraints. The map** is implemented by the additional neural network layer with constraints for output. The p… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  9. arXiv:2304.05980  [pdf, other

    cs.LG cs.AI

    Neural Attention Forests: Transformer-Based Forest Improvement

    Authors: Andrei V. Konstantinov, Lev V. Utkin, Alexey A. Lukashin, Vladimir A. Muliukha

    Abstract: A new approach called NAF (the Neural Attention Forest) for solving regression and classification tasks under tabular training data is proposed. The main idea behind the proposed NAF model is to introduce the attention mechanism into the random forest by assigning attention weights calculated by neural networks of a specific form to data in leaves of decision trees and to the random forest itself… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Submitted for the 7th International Scientific Conference "Intelligent Information Technologies for Industry" in St. Petersburg

  10. arXiv:2303.08625  [pdf, other

    cs.LG cs.AI stat.ML

    Interpretable Ensembles of Hyper-Rectangles as Base Models

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A new extremely simple ensemble-based model with the uniformly generated axis-parallel hyper-rectangles as base models (HRBM) is proposed. Two types of HRBMs are studied: closed rectangles and corners. The main idea behind HRBM is to consider and count training examples inside and outside each rectangle. It is proposed to incorporate HRBMs into the gradient boosting machine (GBM). Despite simplici… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  11. arXiv:2302.06601  [pdf, other

    cs.LG

    Multiple Instance Learning with Trainable Decision Tree Ensembles

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A new random forest based model for solving the Multiple Instance Learning (MIL) problem under small tabular data, called Soft Tree Ensemble MIL (STE-MIL), is proposed. A new type of soft decision trees is considered, which is similar to the well-known soft oblique trees, but with a smaller number of trainable parameters. In order to train the trees, it is proposed to convert them into neural netw… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  12. arXiv:2211.10793  [pdf, other

    cs.LG stat.ML

    BENK: The Beran Estimator with Neural Kernels for Estimating the Heterogeneous Treatment Effect

    Authors: Stanislav R. Kirpichenko, Lev V. Utkin, Andrei V. Konstantinov

    Abstract: A method for estimating the conditional average treatment effect under condition of censored time-to-event data called BENK (the Beran Estimator with Neural Kernels) is proposed. The main idea behind the method is to apply the Beran estimator for estimating the survival functions of controls and treatments. Instead of typical kernel functions in the Beran estimator, it is proposed to implement ker… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

  13. arXiv:2210.05168  [pdf, other

    cs.LG cs.AI stat.ML

    LARF: Two-level Attention-based Random Forests with a Mixture of Contamination Models

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: New models of the attention-based random forests called LARF (Leaf Attention-based Random Forest) are proposed. The first idea behind the models is to introduce a two-level attention, where one of the levels is the "leaf" attention and the attention mechanism is applied to every leaf of trees. The second level is the tree attention depending on the "leaf" attention. The second idea is to replace t… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  14. arXiv:2210.02558  [pdf, other

    cs.LG stat.ML

    Improved Anomaly Detection by Using the Attention-Based Isolation Forest

    Authors: Lev V. Utkin, Andrey Y. Ageev, Andrei V. Konstantinov

    Abstract: A new modification of Isolation Forest called Attention-Based Isolation Forest (ABIForest) for solving the anomaly detection problem is proposed. It incorporates the attention mechanism in the form of the Nadaraya-Watson regression into the Isolation Forest for improving solution of the anomaly detection problem. The main idea underlying the modification is to assign attention weights to each path… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  15. arXiv:2207.09139  [pdf, other

    cs.LG stat.ML

    Heterogeneous Treatment Effect with Trained Kernels of the Nadaraya-Watson Regression

    Authors: Andrei V. Konstantinov, Stanislav R. Kirpichenko, Lev V. Utkin

    Abstract: A new method for estimating the conditional average treatment effect is proposed in the paper. It is called TNW-CATE (the Trainable Nadaraya-Watson regression for CATE) and based on the assumption that the number of controls is rather large whereas the number of treatments is small. TNW-CATE uses the Nadaraya-Watson regression for predicting outcomes of patients from the control and treatment grou… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

  16. arXiv:2207.04293  [pdf, other

    cs.LG stat.ML

    Attention and Self-Attention in Random Forests

    Authors: Lev V. Utkin, Andrei V. Konstantinov

    Abstract: New models of random forests jointly using the attention and self-attention mechanisms are proposed for solving the regression problem. The models can be regarded as extensions of the attention-based random forest whose idea stems from applying a combination of the Nadaraya-Watson kernel regression and the Huber's contamination model to random forests. The self-attention aims to capture dependenci… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2201.02880

  17. arXiv:2201.02880  [pdf, other

    cs.LG cs.AI stat.ML

    Attention-based Random Forest and Contamination Model

    Authors: Lev V. Utkin, Andrei V. Konstantinov

    Abstract: A new approach called ABRF (the attention-based random forest) and its modifications for applying the attention mechanism to the random forest (RF) for regression and classification are proposed. The main idea behind the proposed ABRF models is to assign attention weights with trainable parameters to decision trees in a specific way. The weights depend on the distance between an instance, which fa… ▽ More

    Submitted 8 January, 2022; originally announced January 2022.

  18. arXiv:2112.06071  [pdf, other

    cs.LG

    Multi-Attention Multiple Instance Learning

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A new multi-attention based method for solving the MIL problem (MAMIL), which takes into account the neighboring patches or instances of each analyzed patch in a bag, is proposed. In the method, one of the attention modules takes into account adjacent patches or instances, several attention modules are used to get a diverse feature representation of patches, and one attention module is used to uni… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

  19. arXiv:2108.04855  [pdf, other

    cs.LG cs.AI

    Attention-like feature explanation for tabular data

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A new method for local and global explanation of the machine learning black-box model predictions by tabular data is proposed. It is implemented as a system called AFEX (Attention-like Feature EXplanation) and consisting of two main parts. The first part is a set of the one-feature neural subnetworks which aim to get a specific representation for every feature in the form of a basis of shape funct… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

  20. arXiv:2106.09111  [pdf, other

    cs.LG stat.ML

    An Imprecise SHAP as a Tool for Explaining the Class Probability Distributions under Limited Training Data

    Authors: Lev V. Utkin, Andrei V. Konstantinov, Kirill A. Vishniakov

    Abstract: One of the most popular methods of the machine learning prediction explanation is the SHapley Additive exPlanations method (SHAP). An imprecise SHAP as a modification of the original SHAP is proposed for cases when the class probability distributions are imprecise and represented by sets of distributions. The first idea behind the imprecise SHAP is a new approach for computing the marginal contrib… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  21. arXiv:2104.08903  [pdf, other

    cs.LG stat.ML

    SurvNAM: The machine learning survival model explanation

    Authors: Lev V. Utkin, Egor D. Satyukov, Andrei V. Konstantinov

    Abstract: A new modification of the Neural Additive Model (NAM) called SurvNAM and its modifications are proposed to explain predictions of the black-box machine learning survival model. The method is based on applying the original NAM to solving the explanation problem in the framework of survival analysis. The basic idea behind SurvNAM is to train the network by means of a specific expected loss function… ▽ More

    Submitted 18 April, 2021; originally announced April 2021.

  22. arXiv:2103.03302  [pdf, other

    cs.LG stat.ML

    Ensembles of Random SHAPs

    Authors: Lev V. Utkin, Andrei V. Konstantinov

    Abstract: Ensemble-based modifications of the well-known SHapley Additive exPlanations (SHAP) method for the local explanation of a black-box model are proposed. The modifications aim to simplify SHAP which is computationally expensive when there is a large number of features. The main idea behind the proposed modifications is to approximate SHAP by an ensemble of SHAPs with a smaller number of features. Ac… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  23. arXiv:2010.07388  [pdf, other

    cs.LG stat.ML

    Interpretable Machine Learning with an Ensemble of Gradient Boosting Machines

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: A method for the local and global interpretation of a black-box model on the basis of the well-known generalized additive models is proposed. It can be viewed as an extension or a modification of the algorithm using the neural additive model. The method is based on using an ensemble of gradient boosting machines (GBMs) such that each GBM is learned on a single feature and produces a shape function… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

  24. arXiv:2010.06026  [pdf, ps, other

    cs.LG stat.ML

    A Generalized Stacking for Implementing Ensembles of Gradient Boosting Machines

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: The gradient boosting machine is one of the powerful tools for solving regression problems. In order to cope with its shortcomings, an approach for constructing ensembles of gradient boosting models is proposed. The main idea behind the approach is to use the stacking algorithm in order to learn a second-level meta-model which can be regarded as a model for implementing various ensembles of gradie… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  25. arXiv:2006.16793  [pdf, other

    cs.LG stat.ML

    Counterfactual explanation of machine learning survival models

    Authors: Maxim S. Kovalev, Lev V. Utkin

    Abstract: A method for counterfactual explanation of machine learning survival models is proposed. One of the difficulties of solving the counterfactual explanation problem is that the classes of examples are implicitly defined through outcomes of a machine learning survival model in the form of survival functions. A condition that establishes the difference between survival functions of the original exampl… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:2005.02249

  26. arXiv:2006.11014  [pdf, other

    cs.LG stat.ML

    Gradient boosting machine with partially randomized decision trees

    Authors: Andrei V. Konstantinov, Lev V. Utkin

    Abstract: The gradient boosting machine is a powerful ensemble-based machine learning method for solving regression problems. However, one of the difficulties of its using is a possible discontinuity of the regression function, which arises when regions of training data are not densely covered by training points. In order to overcome this difficulty and to reduce the computational complexity of the gradient… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  27. arXiv:2005.02387  [pdf, other

    cs.LG stat.ML

    SurvLIME-Inf: A simplified modification of SurvLIME for explanation of machine learning survival models

    Authors: Lev V. Utkin, Maxim S. Kovalev, Ernest M. Kasimov

    Abstract: A new modification of the explanation method SurvLIME called SurvLIME-Inf for explaining machine learning survival models is proposed. The basic idea behind SurvLIME as well as SurvLIME-Inf is to apply the Cox proportional hazards model to approximate the black-box survival model at the local area around a test example. The Cox model is used due to the linear relationship of covariates. In contras… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2003.08371, arXiv:2005.02249

  28. arXiv:2005.02249  [pdf, other

    cs.LG stat.ML

    A robust algorithm for explaining unreliable machine learning survival models using the Kolmogorov-Smirnov bounds

    Authors: Maxim S. Kovalev, Lev V. Utkin

    Abstract: A new robust algorithm based of the explanation method SurvLIME called SurvLIME-KS is proposed for explaining machine learning survival models. The algorithm is developed to ensure robustness to cases of a small amount of training data or outliers of survival data. The first idea behind SurvLIME-KS is to apply the Cox proportional hazards model to approximate the black-box survival model at the lo… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

  29. arXiv:2003.08371  [pdf, other

    cs.LG stat.ML

    SurvLIME: A method for explaining machine learning survival models

    Authors: Maxim S. Kovalev, Lev V. Utkin, Ernest M. Kasimov

    Abstract: A new method called SurvLIME for explaining machine learning survival models is proposed. It can be viewed as an extension or modification of the well-known method LIME. The main idea behind the proposed method is to apply the Cox proportional hazards model to approximate the survival model at the local area around a test example. The Cox model is used because it considers a linear combination of… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  30. arXiv:1911.07702  [pdf

    cs.LG stat.ML

    An explanation method for Siamese neural networks

    Authors: Lev V. Utkin, Maxim S. Kovalev, Ernest M. Kasimov

    Abstract: A new method for explaining the Siamese neural network is proposed. It uses the following main ideas. First, the explained feature vector is compared with the prototype of the corresponding class computed at the embedding level (the Siamese neural network output). The important features at this level are determined as features which are close to the same features of the prototype. Second, an autoe… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: International Scientific Conference Telecommunications, Computing and Control (TELECCON-2019)

  31. arXiv:1909.03894  [pdf, other

    stat.ML cs.LG

    Estimation of Personalized Heterogeneous Treatment Effects Using Concatenation and Augmentation of Feature Vectors

    Authors: Lev V. Utkin, Mikhail V. Kots, Viacheslav S. Chukanov

    Abstract: A new meta-algorithm for estimating the conditional average treatment effects is proposed in the paper. The main idea underlying the algorithm is to consider a new dataset consisting of feature vectors produced by means of concatenation of examples from control and treatment groups, which are close to each other. Outcomes of new data are defined as the difference between outcomes of the correspond… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

  32. arXiv:1901.01334  [pdf, other

    stat.ML cs.LG

    An Adaptive Weighted Deep Forest Classifier

    Authors: Lev V. Utkin, Andrei V. Konstantinov, Viacheslav S. Chukanov, Mikhail V. Kots, Anna A. Meldo

    Abstract: A modification of the confidence screening mechanism based on adaptive weighing of every training instance at each cascade level of the Deep Forest is proposed. The idea underlying the modification is very simple and stems from the confidence screening mechanism idea proposed by Pang et al. to simplify the Deep Forest classifier by means of updating the training set at each level in accordance wit… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

  33. arXiv:1901.00213  [pdf, other

    stat.ML cs.LG

    A weighted random survival forest

    Authors: Lev V. Utkin, Andrei V. Konstantinov, Viacheslav S. Chukanov, Mikhail V. Kots, Mikhail A. Ryabinin, Anna A. Meldo

    Abstract: A weighted random survival forest is presented in the paper. It can be regarded as a modification of the random forest improving its performance. The main idea underlying the proposed model is to replace the standard procedure of averaging used for estimation of the random survival forest hazard function by weighted avaraging where the weights are assigned to every tree and can be veiwed as traini… ▽ More

    Submitted 1 January, 2019; originally announced January 2019.

  34. arXiv:1705.09620  [pdf, other

    stat.ML cs.LG

    Discriminative Metric Learning with Deep Forest

    Authors: Lev V. Utkin, Mikhail A. Ryabinin

    Abstract: A Discriminative Deep Forest (DisDF) as a metric learning algorithm is proposed in the paper. It is based on the Deep Forest or gcForest proposed by Zhou and Feng and can be viewed as a gcForest modification. The case of the fully supervised learning is studied when the class labels of individual training examples are known. The main idea underlying the algorithm is to assign weights to decision t… ▽ More

    Submitted 25 May, 2017; originally announced May 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1704.08715

    MSC Class: 68T10

  35. arXiv:1704.08715  [pdf, other

    stat.ML cs.LG

    A Siamese Deep Forest

    Authors: Lev V. Utkin, Mikhail A. Ryabinin

    Abstract: A Siamese Deep Forest (SDF) is proposed in the paper. It is based on the Deep Forest or gcForest proposed by Zhou and Feng and can be viewed as a gcForest modification. It can be also regarded as an alternative to the well-known Siamese neural networks. The SDF uses a modified training set consisting of concatenated pairs of vectors. Moreover, it defines the class distributions in the deep forest… ▽ More

    Submitted 27 April, 2017; originally announced April 2017.

    MSC Class: 68T10