Skip to main content

Showing 1–15 of 15 results for author: Savchenko, A V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03299  [pdf, other

    cs.AI cs.CL

    The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games

    Authors: Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu, Maria Glushanina, Mikhail Baklashkin, Andrey V. Savchenko, Ilya Makarov

    Abstract: Behavior study experiments are an important part of society modeling and understanding human interactions. In practice, many behavioral experiments encounter challenges related to internal and external validity, reproducibility, and social bias due to the complexity of social interactions and cooperation in human user studies. Recent advances in Large Language Models (LLMs) have provided researche… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; J.4

  2. arXiv:2403.11590  [pdf, other

    cs.CV

    HSEmotion Team at the 6th ABAW Competition: Facial Expressions, Valence-Arousal and Emotion Intensity Prediction

    Authors: Andrey V. Savchenko

    Abstract: This article presents our results for the sixth Affective Behavior Analysis in-the-wild (ABAW) competition. To improve the trustworthiness of facial analysis, we study the possibility of using pre-trained deep models that extract reliable emotional features without the need to fine-tune the neural networks for a downstream task. In particular, we introduce several lightweight models based on Mobil… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 10 pages, 1 figure, 8 tables

    MSC Class: 68T10 ACM Class: I.4.9

  3. arXiv:2303.09162  [pdf, other

    cs.CV

    EmotiEffNet Facial Features in Uni-task Emotion Recognition in Video at ABAW-5 competition

    Authors: Andrey V. Savchenko

    Abstract: In this article, the results of our team for the fifth Affective Behavior Analysis in-the-wild (ABAW) competition are presented. The usage of the pre-trained convolutional networks from the EmotiEffNet family for frame-level feature extraction is studied. In particular, we propose an ensemble of a multi-layered perceptron and the LightAutoML-based classifier. The post-processing by smoothing the r… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 7 pages; 5 figures; 3 tables

    MSC Class: 68T10 ACM Class: I.4.9

  4. arXiv:2207.09508  [pdf, other

    cs.CV

    HSE-NN Team at the 4th ABAW Competition: Multi-task Emotion Recognition and Learning from Synthetic Images

    Authors: Andrey V. Savchenko

    Abstract: In this paper, we present the results of the HSE-NN team in the 4th competition on Affective Behavior Analysis in-the-wild (ABAW). The novel multi-task EfficientNet model is trained for simultaneous recognition of facial expressions and prediction of valence and arousal on static photos. The resulting MT-EmotiEffNet extracts visual features that are fed into simple feed-forward neural networks in… ▽ More

    Submitted 20 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: accepted at ECCV Workshop ABAW4; 14 pages, 3 figures, 8 tables

    MSC Class: 68T10 ACM Class: I.4.9

  5. arXiv:2203.13436  [pdf, other

    cs.CV

    Frame-level Prediction of Facial Expressions, Valence, Arousal and Action Units for Mobile Devices

    Authors: Andrey V. Savchenko

    Abstract: In this paper, we consider the problem of real-time video-based facial emotion analytics, namely, facial expression recognition, prediction of valence and arousal and detection of action unit points. We propose the novel frame-level emotion recognition algorithm by extracting facial features with the single EfficientNet model pre-trained on AffectNet. As a result, our approach may be implemented e… ▽ More

    Submitted 24 May, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: accepted at CVPR Workshop ABAW3, 8 pages, 2 figures, 6 tables

    MSC Class: 68T10 ACM Class: I.4.9

  6. Facial expression and attributes recognition based on multi-task learning of lightweight neural networks

    Authors: Andrey V. Savchenko

    Abstract: In this paper, the multi-task learning of lightweight convolutional neural networks is studied for face identification and classification of facial attributes (age, gender, ethnicity) trained on cropped faces without margins. The necessity to fine-tune these networks to predict facial expressions is highlighted. Several models are presented based on MobileNet, EfficientNet and RexNet architectures… ▽ More

    Submitted 4 October, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: 14 pages, 3 figures, accepted at IEEE SISY 2021

    MSC Class: 68T10

  7. arXiv:2010.04224  [pdf

    eess.AS cs.SD

    Gender domain adaptation for automatic speech recognition task

    Authors: Sokolov Artem, Andrey V. Savchenko

    Abstract: This paper is focused on the finetuning of acoustic models for speaker adaptation goals on a given gender. We pretrained the Transformer baseline model on Librispeech-960 and conduct experiments with finetuning on the gender-specific test subsets and. In general, we do not obtain essential WER reduction by finetuning techniques by this approach. We achieved up to ~5% lower word error rate on the m… ▽ More

    Submitted 17 November, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Draft of paper for SAMI conference

  8. arXiv:1911.11010  [pdf, other

    cs.CV cs.LG

    Event Recognition with Automatic Album Detection based on Sequential Processing, Neural Attention and Image Captioning

    Authors: Andrey V. Savchenko

    Abstract: In this paper a new formulation of event recognition task is examined: it is required to predict event categories in a gallery of images, for which albums (groups of photos corresponding to a single event) are unknown. We propose the novel two-stage approach. At first, features are extracted in each photo using the pre-trained convolutional neural network. These features are classified individuall… ▽ More

    Submitted 15 January, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 11 pages, 5 figures

    MSC Class: 68T10 (Primary)

  9. Preferences Prediction using a Gallery of Mobile Device based on Scene Recognition and Object Detection

    Authors: A. V. Savchenko, K. V. Demochkin, I. S. Grechikhin

    Abstract: In this paper user modeling task is examined by processing a gallery of photos and videos on a mobile device. We propose novel engine for user preference prediction based on scene recognition, object detection and facial analysis. At first, all faces in a gallery are clustered and all private photos and videos with faces from large clusters are processed on the embedded system in offline mode. Oth… ▽ More

    Submitted 18 April, 2021; v1 submitted 10 July, 2019; originally announced July 2019.

    Comments: 19 pages; 9 figures, preprint submitter to Pattern Recognition journal

    MSC Class: 68T10

  10. Compression of Recurrent Neural Networks for Efficient Language Modeling

    Authors: Artem M. Grachev, Dmitry I. Ignatov, Andrey V. Savchenko

    Abstract: Recurrent neural networks have proved to be an effective method for statistical language modeling. However, in practice their memory and run-time complexity are usually too large to be implemented in real-time offline mobile applications. In this paper we consider several compression techniques for recurrent neural networks including Long-Short Term Memory models. We make particular attention to t… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: 25 pages, 3 tables, 4 figures

  11. Efficient Facial Representations for Age, Gender and Identity Recognition in Organizing Photo Albums using Multi-output CNN

    Authors: Andrey V. Savchenko

    Abstract: This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. We propose the two-stage approach, in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. We modified the MobileNet, which is preli… ▽ More

    Submitted 13 June, 2019; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: 19 pages, 2 figures, 8 tables

    MSC Class: 68T10

    Journal ref: PeerJ Computer Science 5:e197 (2019)

  12. Organizing Multimedia Data in Video Surveillance Systems Based on Face Verification with Convolutional Neural Networks

    Authors: Anastasiia D. Sokolova, Angelina S. Kharchevnikova, Andrey V. Savchenko

    Abstract: In this paper we propose the two-stage approach of organizing information in video surveillance systems. At first, the faces are detected in each frame and a video stream is split into sequences of frames with face region of one person. Secondly, these sequences (tracks) that contain identical faces are grouped using face verification algorithms and hierarchical agglomerative clustering. Gender an… ▽ More

    Submitted 17 September, 2017; originally announced September 2017.

    Comments: 8 pages; 1 figure, accepted for publication at AIST17

    MSC Class: 68T10; 68T45 ACM Class: I.4.8; I.5.4

    Journal ref: Proceedings of the International Conference on Analysis of Images, Social Networks and Texts (AIST), 2018, pp. 223-230

  13. Group-level Emotion Recognition using Transfer Learning from Face Identification

    Authors: Alexandr G. Rassadin, Alexey S. Gruzdev, Andrey V. Savchenko

    Abstract: In this paper, we describe our algorithmic approach, which was used for submissions in the fifth Emotion Recognition in the Wild (EmotiW 2017) group-level emotion recognition sub-challenge. We extracted feature vectors of detected faces using the Convolutional Neural Network trained for face identification task, rather than traditional pre-training on emotion recognition problems. In the final pip… ▽ More

    Submitted 30 October, 2017; v1 submitted 6 September, 2017; originally announced September 2017.

    Comments: 5 pages, 3 figures, accepted for publication at ICMI17 (EmotiW Grand Challenge)

    MSC Class: 68T10; 68T45 ACM Class: I.4.8; I.5.4

    Journal ref: Proceedings of the 19th ACM International Conference on Multimodal Interaction (ICMI), 2017, pp. 544-548

  14. Maximum A Posteriori Estimation of Distances Between Deep Features in Still-to-Video Face Recognition

    Authors: Andrey V. Savchenko, Natalya S. Belova

    Abstract: The paper deals with the still-to-video face recognition for the small sample size problem based on computation of distances between high-dimensional deep bottleneck features. We present the novel statistical recognition method, in which the still-to-video recognition task is casted into Maximum A Posteriori estimation. In this method we maximize the joint probabilistic density of the distances to… ▽ More

    Submitted 26 August, 2017; originally announced August 2017.

    Comments: 20 pages, 5 figures, 40 references

    MSC Class: 68T10

  15. arXiv:1708.05963  [pdf, ps, other

    stat.ML cs.CL cs.LG cs.NE

    Neural Networks Compression for Language Modeling

    Authors: Artem M. Grachev, Dmitry I. Ignatov, Andrey V. Savchenko

    Abstract: In this paper, we consider several compression techniques for the language modeling problem based on recurrent neural networks (RNNs). It is known that conventional RNNs, e.g, LSTM-based networks in language modeling, are characterized with either high space complexity or substantial inference time. This problem is especially crucial for mobile applications, in which the constant interaction with… ▽ More

    Submitted 20 August, 2017; originally announced August 2017.

    Comments: Keywords: LSTM, RNN, language modeling, low-rank factorization, pruning, quantization. Published by Springer in the LNCS series, 7th International Conference on Pattern Recognition and Machine Intelligence, 2017

    MSC Class: 62M45; 68T50 ACM Class: I.2.7, I.2.6, I.5.1, I.5.4