
Showing 1–39 of 39 results for author: Panov, M

  1. arXiv:2407.01794  [pdf, other]

    stat.ML cs.LG math.PR math.ST stat.ME

    Conditionally valid Probabilistic Conformal Prediction

    Authors: Vincent Plassier, Alexander Fishkov, Maxim Panov, Eric Moulines

    Abstract: We develop a new method for creating prediction sets that combines the flexibility of conformal methods with an estimate of the conditional distribution $P_{Y \mid X}$. Most existing methods, such as conformalized quantile regression and probabilistic conformal prediction, only offer marginal coverage guarantees. Our approach extends these methods to achieve conditional coverage, which is essentia…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 23 pages

  2. arXiv:2406.15627  [pdf, other]

    cs.CL cs.LG

    Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph

    Authors: Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Akim Tsvigun, Daniil Vasilev, Rui Xing, Abdelrahman Boda Sadallah, Lyudmila Rvanova, Sergey Petrakov, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov, Artem Shelmanov

    Abstract: Uncertainty quantification (UQ) is becoming increasingly recognized as a critical component of applications that rely on machine learning (ML). The rapid proliferation of large language models (LLMs) has stimulated researchers to seek efficient and effective approaches to UQ in text generation tasks, as in addition to their emerging capabilities, these models have introduced new challenges for bui…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev contributed equally

  3. arXiv:2403.11696  [pdf, other]

    cs.LG stat.ML

    Generalization error of spectral algorithms

    Authors: Maksim Velikanov, Maxim Panov, Dmitry Yarotsky

    Abstract: The asymptotically precise estimation of the generalization of kernel methods has recently received attention due to the parallels between neural networks and their associated kernels. However, prior works derive such estimates for training by kernel ridge regression (KRR), whereas neural networks are typically trained with gradient descent (GD). In the present work, we consider the training of ke…

    Submitted 18 March, 2024; originally announced March 2024.

  4. arXiv:2403.04696  [pdf, other]

    cs.CL cs.AI cs.LG

    Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

    Authors: Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov

    Abstract: Large language models (LLMs) are notorious for hallucinating, i.e., producing erroneous claims in their output. Such hallucinations can be dangerous, as occasional factual inaccuracies in the generated text might be obscured by the rest of the output being generally factually correct, making it extremely hard for the users to spot them. Current services that leverage LLMs usually do not provide an…

    Submitted 6 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted to ACL-2024 (Findings). Ekaterina Fadeeva, Aleksandr Rubashevskii, and Artem Shelmanov contributed equally

  5. arXiv:2402.10727  [pdf, other]

    stat.ML cs.LG

    Predictive Uncertainty Quantification via Risk Decompositions for Strictly Proper Scoring Rules

    Authors: Nikita Kotelevskii, Maxim Panov

    Abstract: Uncertainty quantification in predictive modeling often relies on ad hoc methods as there is no universally accepted formal framework for that. This paper introduces a theoretical approach to understanding uncertainty through statistical risks, distinguishing between aleatoric (data-related) and epistemic (model-related) uncertainties. We explain how to split pointwise risk into Bayes risk and exc…

    Submitted 6 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2312.15799  [pdf, other]

    stat.ML cs.LG

    Efficient Conformal Prediction under Data Heterogeneity

    Authors: Vincent Plassier, Nikita Kotelevskii, Aleksandr Rubashevskii, Fedor Noskov, Maksim Velikanov, Alexander Fishkov, Samuel Horvath, Martin Takac, Eric Moulines, Maxim Panov

    Abstract: Conformal Prediction (CP) stands out as a robust framework for uncertainty quantification, which is crucial for ensuring the reliability of predictions. However, common CP methods heavily rely on data exchangeability, a condition often violated in practice. Existing approaches for tackling non-exchangeability lead to methods that are not computable beyond the simplest examples. This work introduce…

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 28 pages

  7. arXiv:2312.11230  [pdf, other]

    stat.ML cs.LG

    Dirichlet-based Uncertainty Quantification for Personalized Federated Learning with Improved Posterior Networks

    Authors: Nikita Kotelevskii, Samuel Horváth, Karthik Nandakumar, Martin Takáč, Maxim Panov

    Abstract: In modern federated learning, one of the main challenges is to account for inherent heterogeneity and the diverse nature of data distributions for different clients. This problem is often addressed by introducing personalization of the models towards the data distribution of the particular client. However, a personalized model might be unreliable when applied to the data that is not typical for th…

    Submitted 18 December, 2023; originally announced December 2023.

  8. arXiv:2311.07383  [pdf, other]

    cs.CL cs.LG

    LM-Polygraph: Uncertainty Estimation for Language Models

    Authors: Ekaterina Fadeeva, Roman Vashurin, Akim Tsvigun, Artem Vazhentsev, Sergey Petrakov, Kirill Fedyanin, Daniil Vasilev, Elizaveta Goncharova, Alexander Panchenko, Maxim Panov, Timothy Baldwin, Artem Shelmanov

    Abstract: Recent advancements in the capabilities of large language models (LLMs) have paved the way for a myriad of groundbreaking applications in various fields. However, a significant challenge arises as these models often "hallucinate", i.e., fabricate facts without providing users an apparent means to discern the veracity of their statements. Uncertainty estimation (UE) methods are one path to safer, m…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP-2023

  9. arXiv:2310.12587  [pdf]

    cond-mat.mtrl-sci

    Accurate FTIR determination of boron concentration in CVD homoepitaxial diamond layers

    Authors: Mikhail Panov, Vasily Zubkov, Anna Solomnikova, Igor Klepikov

    Abstract: The intensive development of technology for fabricating semiconducting CVD diamond layers poses an important task of developing a precise and non-destructive method for estimating the boron content in thin epitaxial layers. For bulk and uniformly doped diamond samples, the infrared optical spectroscopy successfully performs such a role. Here we propose a correct method to determine the boron conce…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 18 pages, 7 figures, 1 table

    MSC Class: 78-11

  10. arXiv:2309.16412  [pdf, other]

    stat.ML cs.LG

    Selective Nonparametric Regression via Testing

    Authors: Fedor Noskov, Alexander Fishkov, Maxim Panov

    Abstract: Prediction with the possibility of abstention (or selective prediction) is an important problem for error-critical machine learning applications. While well-studied in the classification setup, selective approaches to regression are much less developed. In this work, we consider the nonparametric heteroskedastic regression problem and develop an abstention procedure via testing the hypothesis on t…

    Submitted 28 September, 2023; originally announced September 2023.

  11. arXiv:2307.14530  [pdf, other]

    stat.ML cs.LG cs.SI

    Optimal Estimation in Mixed-Membership Stochastic Block Models

    Authors: Fedor Noskov, Maxim Panov

    Abstract: Community detection is one of the most critical problems in modern network science. Its applications can be found in various fields, from protein modeling to social network analysis. Recently, many papers appeared studying the problem of overlapping community detection, where each node of a network may belong to several communities. In this work, we consider Mixed-Membership Stochastic Block Model…

    Submitted 26 July, 2023; originally announced July 2023.

  12. arXiv:2306.05131  [pdf, other]

    stat.ML cs.LG

    Conformal Prediction for Federated Uncertainty Quantification Under Label Shift

    Authors: Vincent Plassier, Mehdi Makni, Aleksandr Rubashevskii, Eric Moulines, Maxim Panov

    Abstract: Federated Learning (FL) is a machine learning framework where many clients collaboratively train models while keeping the training data decentralized. Despite recent advances in FL, the uncertainty quantification topic (UQ) remains partially addressed. Among UQ methods, conformal prediction (CP) approaches provide distribution-free guarantees under minimal assumptions. We develop a new federated…

    Submitted 24 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  13. arXiv:2301.05490  [pdf, other]

    cs.LG stat.ML

    Scalable Batch Acquisition for Deep Bayesian Active Learning

    Authors: Aleksandr Rubashevskii, Daria Kotova, Maxim Panov

    Abstract: In deep active learning, it is especially important to choose multiple examples to markup at each step to work efficiently, especially on large datasets. At the same time, existing solutions to this problem in the Bayesian setup, such as BatchBALD, have significant limitations in selecting a large number of examples, associated with the exponential complexity of computing mutual information for jo…

    Submitted 16 February, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: Accepted to SIAM International Conference on Data Mining 2023

  14. arXiv:2301.03252  [pdf, other]

    cs.CL

    Active Learning for Abstractive Text Summarization

    Authors: Akim Tsvigun, Ivan Lysenko, Danila Sedashov, Ivan Lazichny, Eldar Damirov, Vladimir Karlov, Artemy Belousov, Leonid Sanochkin, Maxim Panov, Alexander Panchenko, Mikhail Burtsev, Artem Shelmanov

    Abstract: Construction of human-curated annotated datasets for abstractive text summarization (ATS) is very time-consuming and expensive because creating each instance requires a human annotator to read a long document and compose a shorter summary that would preserve the key information relayed by the original document. Active Learning (AL) is a technique developed to reduce the amount of annotation requir…

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted at EMNLP-2022 Findings

  15. arXiv:2301.00524  [pdf, other]

    cs.CV cs.HC cs.LG

    Learning Confident Classifiers in the Presence of Label Noise

    Authors: Asma Ahmed Hashmi, Aigerim Zhumabayeva, Nikita Kotelevskii, Artem Agafonov, Mohammad Yaqub, Maxim Panov, Martin Takáč

    Abstract: The success of Deep Neural Network (DNN) models significantly depends on the quality of provided annotations. In medical image segmentation, for example, having multiple expert annotations for each data point is common to minimize subjective annotation bias. Then, the goal of estimation is to filter out the label noise and recover the ground-truth masks, which are not explicitly given. This paper…

    Submitted 9 December, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

  16. arXiv:2209.01880  [pdf, other]

    cs.CV cs.AI cs.LG

    ScaleFace: Uncertainty-aware Deep Metric Learning

    Authors: Roman Kail, Kirill Fedyanin, Nikita Muravev, Alexey Zaytsev, Maxim Panov

    Abstract: The performance of modern deep learning-based systems dramatically depends on the quality of input objects. For example, face recognition quality would be lower for blurry or corrupted inputs. However, it is hard to predict the influence of input quality on the resulting accuracy in more complex scenarios. We propose an approach for deep metric learning that allows direct estimation of the uncerta…

    Submitted 12 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

  17. arXiv:2206.10691  [pdf, other]

    cs.LG

    Towards OOD Detection in Graph Classification from Uncertainty Estimation Perspective

    Authors: Gleb Bazhenov, Sergei Ivanov, Maxim Panov, Alexey Zaytsev, Evgeny Burnaev

    Abstract: The problem of out-of-distribution detection for graph classification is far from being solved. The existing models tend to be overconfident about OOD examples or completely ignore the detection task. In this work, we consider this problem from the uncertainty estimation perspective and perform the comparison of several recently proposed methods. In our experiment, we find that there is no univers…

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: ICML 2022 PODS Workshop

  18. arXiv:2205.03194  [pdf, ps, other]

    stat.ML cs.LG

    Scalable computation of prediction intervals for neural networks via matrix sketching

    Authors: Alexander Fishkov, Maxim Panov

    Abstract: Accounting for the uncertainty in the predictions of modern neural networks is a challenging and important task in many domains. Existing algorithms for uncertainty estimation require modifying the model architecture and training procedure (e.g., Bayesian neural networks) or dramatically increase the computational cost of predictions such as approaches based on ensembling. This work proposes a new…

    Submitted 6 May, 2022; originally announced May 2022.

  19. arXiv:2202.12297  [pdf, other]

    stat.ML cs.LG

    Embedded Ensembles: Infinite Width Limit and Operating Regimes

    Authors: Maksim Velikanov, Roman Kail, Ivan Anokhin, Roman Vashurin, Maxim Panov, Alexey Zaytsev, Dmitry Yarotsky

    Abstract: A memory efficient approach to ensembling neural networks is to share most weights among the ensembled models by means of a single reference network. We refer to this strategy as Embedded Ensembling (EE); its particular examples are BatchEnsembles and Monte-Carlo dropout ensembles. In this paper we perform a systematic theoretical and empirical analysis of embedded ensembles with different number…

    Submitted 24 February, 2022; originally announced February 2022.

  20. arXiv:2202.03101  [pdf, other]

    stat.ML cs.LG

    Nonparametric Uncertainty Quantification for Single Deterministic Neural Network

    Authors: Nikita Kotelevskii, Aleksandr Artemenkov, Kirill Fedyanin, Fedor Noskov, Alexander Fishkov, Artem Shelmanov, Artem Vazhentsev, Aleksandr Petiushko, Maxim Panov

    Abstract: This paper proposes a fast and scalable method for uncertainty quantification of machine learning models' predictions. First, we show the principled way to measure the uncertainty of predictions for a classifier based on Nadaraya-Watson's nonparametric estimate of the conditional label distribution. Importantly, the proposed approach allows to disentangle explicitly aleatoric and epistemic uncerta…

    Submitted 27 October, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022 paper

  21. arXiv:2108.00089  [pdf, other]

    cs.LG cs.AI stat.ML

    Tensor-Train Density Estimation

    Authors: Georgii S. Novikov, Maxim E. Panov, Ivan V. Oseledets

    Abstract: Estimation of probability density function from samples is one of the central problems in statistics and machine learning. Modern neural network-based models can learn high dimensional distributions but have problems with hyperparameter selection and are often prone to instabilities during training and inference. We propose a new efficient tensor train-based model for density estimation (TTDE). Su…

    Submitted 25 February, 2022; v1 submitted 30 July, 2021; originally announced August 2021.

    Comments: Accepted for the 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021)

    ACM Class: G.3

  22. arXiv:2107.03684  [pdf, other]

    math.ST

    Assigning Topics to Documents by Successive Projections

    Authors: Olga Klopp, Maxim Panov, Suzanne Sigalla, Alexandre Tsybakov

    Abstract: Topic models provide a useful tool to organize and understand the structure of large corpora of text documents, in particular, to discover hidden thematic structure. Clustering documents from big unstructured corpora into topics is an important task in various areas, such as image analysis, e-commerce, social networks, population genetics. A common approach to topic modeling is to associate each t…

    Submitted 8 July, 2021; originally announced July 2021.

  23. arXiv:2106.15921  [pdf, other]

    stat.ML cs.LG

    Monte Carlo Variational Auto-Encoders

    Authors: Achille Thin, Nikita Kotelevskii, Arnaud Doucet, Alain Durmus, Eric Moulines, Maxim Panov

    Abstract: Variational auto-encoders (VAE) are popular deep latent variable models which are trained by maximizing an Evidence Lower Bound (ELBO). To obtain tighter ELBO and hence better variational approximations, it has been proposed to use importance sampling to get a lower variance estimate of the evidence. However, importance sampling is known to perform poorly in high dimensions. While it has been sugg…

    Submitted 30 June, 2021; originally announced June 2021.

  24. arXiv:2012.15550  [pdf, ps, other]

    stat.CO stat.ML

    Nonreversible MCMC from conditional invertible transforms: a complete recipe with convergence guarantees

    Authors: Achille Thin, Nikita Kotelevskii, Christophe Andrieu, Alain Durmus, Eric Moulines, Maxim Panov

    Abstract: Markov Chain Monte Carlo (MCMC) is a class of algorithms to sample complex and high-dimensional probability distributions. The Metropolis-Hastings (MH) algorithm, the workhorse of MCMC, provides a simple recipe to construct reversible Markov kernels. Reversibility is a tractable property that implies a less tractable but essential property here, invariance. Reversibility is however not necessarily…

    Submitted 29 March, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

  25. arXiv:2009.14588  [pdf, other]

    stat.ML cs.LG

    EWS-GCN: Edge Weight-Shared Graph Convolutional Network for Transactional Banking Data

    Authors: Ivan Sukharev, Valentina Shumovskaia, Kirill Fedyanin, Maxim Panov, Dmitry Berestnev

    Abstract: In this paper, we discuss how modern deep learning approaches can be applied to the credit scoring of bank clients. We show that information about connections between clients based on money transfers between them allows us to significantly improve the quality of credit scoring compared to the approaches using information about the target client solely. As a final solution, we develop a new graph n…

    Submitted 30 September, 2020; originally announced September 2020.

  26. arXiv:2003.03274  [pdf, other]

    cs.LG stat.ML

    Dropout Strikes Back: Improved Uncertainty Estimation via Diversity Sampling

    Authors: Kirill Fedyanin, Evgenii Tsymbalov, Maxim Panov

    Abstract: Uncertainty estimation for machine learning models is of high importance in many scenarios such as constructing the confidence intervals for model predictions and detection of out-of-distribution or adversarially generated points. In this work, we show that modifying the sampling distributions for dropout layers in neural networks improves the quality of uncertainty estimation. Our main idea consi…

    Submitted 4 May, 2022; v1 submitted 6 March, 2020; originally announced March 2020.

  27. arXiv:2002.12253  [pdf, other]

    stat.ML cs.LG stat.CO

    MetFlow: A New Efficient Method for Bridging the Gap between Markov Chain Monte Carlo and Variational Inference

    Authors: Achille Thin, Nikita Kotelevskii, Jean-Stanislas Denain, Leo Grinsztajn, Alain Durmus, Maxim Panov, Eric Moulines

    Abstract: In this contribution, we propose a new computationally efficient method to combine Variational Inference (VI) with Markov Chain Monte Carlo (MCMC). This approach can be used with generic MCMC kernels, but is especially well suited to \textit{MetFlow}, a novel family of MCMC algorithms we introduce, in which proposals are obtained using Normalizing Flows. The marginal distribution produced by such…

    Submitted 27 February, 2020; originally announced February 2020.

  28. arXiv:2001.11411  [pdf, other]

    stat.ML cs.LG

    NCVis: Noise Contrastive Approach for Scalable Visualization

    Authors: Aleksandr Artemenkov, Maxim Panov

    Abstract: Modern methods for data visualization via dimensionality reduction, such as t-SNE, usually have performance issues that prohibit their application to large amounts of high-dimensional data. In this work, we propose NCVis -- a high-performance dimensionality reduction method built on a sound statistical basis of noise contrastive estimation. We show that NCVis outperforms state-of-the-art technique…

    Submitted 30 January, 2020; originally announced January 2020.

  29. arXiv:2001.08427  [pdf, other]

    stat.ML cs.LG

    Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data

    Authors: Valentina Shumovskaia, Kirill Fedyanin, Ivan Sukharev, Dmitry Berestnev, Maxim Panov

    Abstract: Financial institutions obtain enormous amounts of data about user transactions and money transfers, which can be considered as a large graph dynamically changing in time. In this work, we focus on the task of predicting new interactions in the network of bank clients and treat it as a link prediction problem. We propose a new graph neural network model, which uses not only the topological structur…

    Submitted 23 January, 2020; originally announced January 2020.

  30. arXiv:1910.06028  [pdf, ps, other]

    math.ST

    Accuracy of Gaussian approximation in nonparametric Bernstein -- von Mises Theorem

    Authors: Vladimir Spokoiny, Maxim Panov

    Abstract: The prominent Bernstein -- von Mises (BvM) result claims that the posterior distribution after centering by the efficient estimator and standardizing by the square root of the total Fisher information is nearly standard normal. In particular, the prior completely washes out from the asymptotic posterior distribution. This fact is fundamental and justifies the Bayes approach from the frequentist vi…

    Submitted 1 June, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    MSC Class: 62F15; 62F25

  31. arXiv:1904.06151  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Geometry-Aware Maximum Likelihood Estimation of Intrinsic Dimension

    Authors: Marina Gomtsyan, Nikita Mokrov, Maxim Panov, Yury Yanovich

    Abstract: The existing approaches to intrinsic dimension estimation usually are not reliable when the data are nonlinearly embedded in the high dimensional space. In this work, we show that the explicit accounting to geometric properties of unknown support leads to the polynomial correction to the standard maximum likelihood estimate of intrinsic dimension for flat manifolds. The proposed algorithm (GeoMLE)…

    Submitted 12 April, 2019; originally announced April 2019.

  32. Deeper Connections between Neural Networks and Gaussian Processes Speed-up Active Learning

    Authors: Evgenii Tsymbalov, Sergei Makarychev, Alexander Shapeev, Maxim Panov

    Abstract: Active learning methods for neural networks are usually based on greedy criteria which ultimately give a single new design point for the evaluation. Such an approach requires either some heuristics to sample a batch of design points at one active learning iteration, or retraining the neural network after adding each data point, which is computationally inefficient. Moreover, uncertainty estimates…

    Submitted 27 February, 2019; originally announced February 2019.

    Journal ref: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19), 2019

  33. arXiv:1810.03032  [pdf, other]

    stat.ML cs.LG cs.SI

    Constructing Graph Node Embeddings via Discrimination of Similarity Distributions

    Authors: Stanislav Tsepa, Maxim Panov

    Abstract: The problem of unsupervised learning node embeddings in graphs is one of the important directions in modern network science. In this work we propose a novel framework, which is aimed to find embeddings by \textit{discriminating distributions of similarities (DDoS)} between nodes in the graph. The general idea is implemented by maximizing the \textit{earth mover distance} between distributions of d…

    Submitted 6 October, 2018; originally announced October 2018.

    Journal ref: In 2018 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 1050-1053

  34. Dropout-based Active Learning for Regression

    Authors: Evgenii Tsymbalov, Maxim Panov, Alexander Shapeev

    Abstract: Active learning is relevant and challenging for high-dimensional regression models when the annotation of the samples is expensive. Yet most of the existing sampling methods cannot be applied to large-scale problems, consuming too much time for data processing. In this paper, we propose a fast active learning algorithm for regression, tailored for neural network models. It is based on uncertainty…

    Submitted 5 July, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

    Comments: Report on AIST 2018; will be published in Springer LNCS series (Analysis of Images, Social Networks and Texts - 7th International Conference, AIST 2018)

    Journal ref: Analysis of Images, Social Networks and Texts - 7th International Conference, AIST 2018, Lecture Notes in Computer Science book series (LNCS), volume 11179, pp. 247-258

  35. arXiv:1804.10653  [pdf, other]

    stat.ML cs.LG

    Sparse Group Inductive Matrix Completion

    Authors: Ivan Nazarov, Boris Shirokikh, Maria Burkina, Gennady Fedonin, Maxim Panov

    Abstract: We consider the problem of matrix completion with side information (\textit{inductive matrix completion}). In real-world applications many side-channel features are typically non-informative making feature selection an important part of the problem. We incorporate feature selection into inductive matrix completion by proposing a matrix factorization framework with group-lasso regularization on sid…

    Submitted 6 October, 2018; v1 submitted 27 April, 2018; originally announced April 2018.

  36. Simultaneous Matrix Diagonalization for Structural Brain Networks Classification

    Authors: Nikita Mokrov, Maxim Panov, Boris A. Gutman, Joshua I. Faskowitz, Neda Jahanshad, Paul M. Thompson

    Abstract: This paper considers the problem of brain disease classification based on connectome data. A connectome is a network representation of a human brain. The typical connectome classification problem is very challenging because of the small sample size and high dimensionality of the data. We propose to use simultaneous approximate diagonalization of adjacency matrices in order to compute their eigenst…

    Submitted 14 October, 2017; originally announced October 2017.

    Journal ref: Complex Networks & Their Applications VI. COMPLEX NETWORKS 2017. Studies in Computational Intelligence, vol 689

  37. Consistent Estimation of Mixed Memberships with Successive Projections

    Authors: Maxim Panov, Konstantin Slavnov, Roman Ushakov

    Abstract: This paper considers the parameter estimation problem in Mixed Membership Stochastic Block Model (MMSB), which is a quite general instance of random graph model allowing for overlapping community structure. We present the new algorithm successive projection overlapping clustering (SPOC) which combines the ideas of spectral clustering and geometric approach for separable non-negative matrix factori…

    Submitted 14 October, 2017; v1 submitted 5 July, 2017; originally announced July 2017.

    Journal ref: Complex Networks & Their Applications VI. COMPLEX NETWORKS 2017. Studies in Computational Intelligence, vol 689

  38. arXiv:1609.01088  [pdf, other]

    cs.MS cs.CE stat.ML

    GTApprox: surrogate modeling for industrial design

    Authors: Mikhail Belyaev, Evgeny Burnaev, Ermek Kapushev, Maxim Panov, Pavel Prikhodko, Dmitry Vetrov, Dmitry Yarotsky

    Abstract: We describe GTApprox - a new tool for medium-scale surrogate modeling in industrial design. Compared to existing software, GTApprox brings several innovations: a few novel approximation algorithms, several advanced methods of automated model selection, novel options in the form of hints. We demonstrate the efficiency of GTApprox on a large collection of test problems. In addition, we describe seve…

    Submitted 5 September, 2016; originally announced September 2016.

    Comments: 31 pages, 11 figures

  39. Finite Sample Bernstein -- von Mises Theorem for Semiparametric Problems

    Authors: Maxim Panov, Vladimir Spokoiny

    Abstract: The classical parametric and semiparametric Bernstein -- von Mises (BvM) results are reconsidered in a non-classical setup allowing finite samples and model misspecification. In the case of a finite dimensional nuisance parameter we obtain an upper bound on the error of Gaussian approximation of the posterior distribution for the target parameter which is explicit in the dimension of the nuisance…

    Submitted 15 June, 2014; v1 submitted 29 October, 2013; originally announced October 2013.

    Journal ref: Bayesian Analysis, 10(3), 665-710, 2015