Skip to main content

Showing 1–12 of 12 results for author: Novikov, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16718  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Features Fusion for Dual-View Mammography Mass Detection

    Authors: Arina Varlamova, Valery Belotsky, Grigory Novikov, Anton Konushin, Evgeny Sidorov

    Abstract: Detection of malignant lesions on mammography images is extremely important for early breast cancer diagnosis. In clinical practice, images are acquired from two different angles, and radiologists can fully utilize information from both views, simultaneously locating the same lesion. However, for automatic detection approaches such information fusion remains a challenge. In this paper, we propose… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at ISBI 2024 (21st IEEE International Symposium on Biomedical Imaging)

  2. arXiv:2306.02697  [pdf, other

    cs.AI

    Efficient GPT Model Pre-training using Tensor Train Matrix Representation

    Authors: Viktoriia Chekalina, Georgii Novikov, Julia Gusak, Ivan Oseledets, Alexander Panchenko

    Abstract: Large-scale transformer models have shown remarkable performance in language modelling tasks. However, such models feature billions of parameters, leading to difficulties in their deployment and prohibitive training costs from scratch. To reduce the number of the parameters in the GPT-2 architecture, we replace the matrices of fully-connected layers with the corresponding Tensor Train Matrix~(TTM)… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  3. arXiv:2302.10844  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Robust Mean Estimation Without Moments for Symmetric Distributions

    Authors: Gleb Novikov, David Steurer, Stefan Tiegel

    Abstract: We study the problem of robustly estimating the mean or location parameter without moment assumptions. We show that for a large class of symmetric distributions, the same error as in the Gaussian setting can be achieved efficiently. The distributions we study include products of arbitrary symmetric one-dimensional distributions, such as product Cauchy distributions, as well as elliptical distribut… ▽ More

    Submitted 8 November, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted at NeurIPS 2023

  4. arXiv:2302.10158  [pdf, ps, other

    cs.LG stat.ML

    Sparse PCA Beyond Covariance Thresholding

    Authors: Gleb Novikov

    Abstract: In the Wishart model for sparse PCA we are given $n$ samples $Y_1,\ldots, Y_n$ drawn independently from a $d$-dimensional Gaussian distribution $N({0, Id + βvv^\top})$, where $β> 0$ and $v\in \mathbb{R}^d$ is a $k$-sparse unit vector, and we wish to recover $v$ (up to sign). We show that if $n \ge Ω(d)$, then for every $t \ll k$ there exists an algorithm running in time $n\cdot d^{O(t)}$ that so… ▽ More

    Submitted 8 November, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: COLT 2023

  5. arXiv:2211.07327  [pdf, ps, other

    cs.LG stat.ML

    Higher degree sum-of-squares relaxations robust against oblivious outliers

    Authors: Tommaso d'Orsi, Rajai Nasser, Gleb Novikov, David Steurer

    Abstract: We consider estimation models of the form $Y=X^*+N$, where $X^*$ is some $m$-dimensional signal we wish to recover, and $N$ is symmetrically distributed noise that may be unbounded in all but a small $α$ fraction of the entries. We introduce a family of algorithms that under mild assumptions recover the signal $X^*$ in all estimation problems for which there exists a sum-of-squares algorithm that… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: To appear in SODA 2023

  6. arXiv:2202.00441  [pdf, other

    cs.LG cs.AI

    Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction

    Authors: Georgii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets

    Abstract: Memory footprint is one of the main limiting factors for large neural network training. In backpropagation, one needs to store the input to each operation in the computational graph. Every modern neural network model has quite a few pointwise nonlinearities in its architecture, and such operation induces additional memory costs which -- as we show -- can be significantly reduced by quantization of… ▽ More

    Submitted 2 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted

  7. arXiv:2111.02966  [pdf, ps, other

    cs.LG stat.ML

    Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

    Authors: Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser, Gleb Novikov, David Steurer, Stefan Tiegel

    Abstract: We develop machinery to design efficiently computable and consistent estimators, achieving estimation error approaching zero as the number of observations grows, when facing an oblivious adversary that may corrupt responses in all but an $α$ fraction of the samples. As concrete examples, we investigate two problems: sparse regression and principal component analysis (PCA). For sparse regression, w… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: To appear in NeurIPS 2021

  8. arXiv:2108.00089  [pdf, other

    cs.LG cs.AI stat.ML

    Tensor-Train Density Estimation

    Authors: Georgii S. Novikov, Maxim E. Panov, Ivan V. Oseledets

    Abstract: Estimation of probability density function from samples is one of the central problems in statistics and machine learning. Modern neural network-based models can learn high dimensional distributions but have problems with hyperparameter selection and are often prone to instabilities during training and inference. We propose a new efficient tensor train-based model for density estimation (TTDE). Su… ▽ More

    Submitted 25 February, 2022; v1 submitted 30 July, 2021; originally announced August 2021.

    Comments: Accepted for the 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021)

    ACM Class: G.3

  9. arXiv:2011.06585  [pdf, ps, other

    cs.LG cs.DS

    Sparse PCA: Algorithms, Adversarial Perturbations and Certificates

    Authors: Tommaso d'Orsi, Pravesh K. Kothari, Gleb Novikov, David Steurer

    Abstract: We study efficient algorithms for Sparse PCA in standard statistical models (spiked covariance in its Wishart form). Our goal is to achieve optimal recovery guarantees while being resilient to small perturbations. Despite a long history of prior works, including explicit studies of perturbation resilience, the best known algorithmic guarantees for Sparse PCA are fragile and break down under small… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  10. arXiv:2009.14774  [pdf, ps, other

    cs.LG stat.ML

    Consistent regression when oblivious outliers overwhelm

    Authors: Tommaso d'Orsi, Gleb Novikov, David Steurer

    Abstract: We consider a robust linear regression model $y=Xβ^* + η$, where an adversary oblivious to the design $X\in \mathbb{R}^{n\times d}$ may choose $η$ to corrupt all but an $α$ fraction of the observations $y$ in an arbitrary way. Prior to our work, even for Gaussian $X$, no estimator for $β^*$ was known to be consistent in this model except for quadratic sample size $n \gtrsim (d/α)^2$ or for logarit… ▽ More

    Submitted 25 May, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: To appear in ICML 2021

  11. arXiv:1803.00397  [pdf, other

    cs.CV cs.LG

    Satellite imagery analysis for operational damage assessment in Emergency situations

    Authors: Alexey Trekin, German Novikov, Georgy Potapov, Vladimir Ignatiev, Evgeny Burnaev

    Abstract: When major disaster occurs the questions are raised how to estimate the damage in time to support the decision making process and relief efforts by local authorities or humanitarian teams. In this paper we consider the use of Machine Learning and Computer Vision on remote sensing imagery to improve time efficiency of assessment of damaged buildings in disaster affected area. We propose a general w… ▽ More

    Submitted 19 February, 2018; originally announced March 2018.

    Comments: 12 pages, 10 figures

  12. arXiv:1609.08209  [pdf, other

    cs.CV cs.LG stat.ML

    Automatic Construction of a Recurrent Neural Network based Classifier for Vehicle Passage Detection

    Authors: Evgeny Burnaev, Ivan Koptelov, German Novikov, Timur Khanipov

    Abstract: Recurrent Neural Networks (RNNs) are extensively used for time-series modeling and prediction. We propose an approach for automatic construction of a binary classifier based on Long Short-Term Memory RNNs (LSTM-RNNs) for detection of a vehicle passage through a checkpoint. As an input to the classifier we use multidimensional signals of various sensors that are installed on the checkpoint. Obtaine… ▽ More

    Submitted 26 September, 2016; originally announced September 2016.

    Comments: 6 pages, 2 figures, 5 tables