Skip to main content

Showing 1–12 of 12 results for author: Popov, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.16728  [pdf, other

    cs.AI

    Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss

    Authors: Artem Khrapov, Vadim Popov, Tasnima Sadekova, Assel Yermekova, Mikhail Kudinov

    Abstract: Diffusion models are known to be vulnerable to outliers in training data. In this paper we study an alternative diffusion loss function, which can preserve the high quality of generated data like the original squared $L_{2}$ loss while at the same time being robust to outliers. We propose to use pseudo-Huber loss function with a time-dependent parameter to allow for the trade-off between robustnes… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 13 pages, 16 figures

  2. Looking Together $\neq$ Seeing the Same Thing: Understanding Surgeons' Visual Needs During Intra-operative Coordination and Instruction

    Authors: Vitaliy Popov, Xinyue Chen, **gying Wang, Michael Kemp, Gurjit Sandhu, Taylor Kantor, Natalie Mateju, Xu Wang

    Abstract: Shared gaze visualizations have been found to enhance collaboration and communication outcomes in diverse HCI scenarios including computer supported collaborative work and learning contexts. Given the importance of gaze in surgery operations, especially when a surgeon trainer and trainee need to coordinate their actions, research on the use of gaze to facilitate intra-operative coordination and in… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Journal ref: CHI'2024

  3. Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning

    Authors: **gying Wang, Haoran Tang, Taylor Kantor, Tandis Soltani, Vitaliy Popov, Xu Wang

    Abstract: Videos are prominent learning materials to prepare surgical trainees before they enter the operating room (OR). In this work, we explore techniques to enrich the video-based surgery learning experience. We propose Surgment, a system that helps expert surgeons create exercises with feedback based on surgery recordings. Surgment is powered by a few-shot-learning-based pipeline (SegGPT+SAM) to segmen… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Journal ref: CHI'2024

  4. arXiv:2312.03759  [pdf, ps, other

    cs.CL cs.AI cs.CY cs.DL

    How should the advent of large language models affect the practice of science?

    Authors: Marcel Binz, Stephan Alaniz, Adina Roskies, Balazs Aczel, Carl T. Bergstrom, Colin Allen, Daniel Schad, Dirk Wulff, Jevin D. West, Qiong Zhang, Richard M. Shiffrin, Samuel J. Gershman, Ven Popov, Emily M. Bender, Marco Marelli, Matthew M. Botvinick, Zeynep Akata, Eric Schulz

    Abstract: Large language models (LLMs) are being increasingly incorporated into scientific workflows. However, we have yet to fully grasp the implications of this integration. How should the advent of large language models affect the practice of science? For this opinion piece, we have invited four diverse groups of scientists to reflect on this query, sharing their perspectives and engaging in debate. Schu… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2109.13821  [pdf, other

    cs.SD cs.LG stat.ML

    Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme

    Authors: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei

    Abstract: Voice conversion is a common speech synthesis task which can be solved in different ways depending on a particular real-world scenario. The most challenging one often referred to as one-shot many-to-many voice conversion consists in copying the target voice from only one reference utterance in the most general case when both source and target speakers do not belong to the training dataset. We pres… ▽ More

    Submitted 4 August, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

  6. arXiv:2105.06337  [pdf, other

    cs.LG cs.CL stat.ML

    Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech

    Authors: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov

    Abstract: Recently, denoising diffusion probabilistic models and generative score matching have shown high potential in modelling complex data distributions while stochastic calculus has provided a unified point of view on these techniques allowing for flexible inference schemes. In this paper we introduce Grad-TTS, a novel text-to-speech model with score-based decoder producing mel-spectrograms by graduall… ▽ More

    Submitted 5 August, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

  7. arXiv:1811.04623  [pdf, ps, other

    cs.CL

    Fine-tuning of Language Models with Discriminator

    Authors: Vadim Popov, Mikhail Kudinov

    Abstract: Cross-entropy loss is a common choice when it comes to multiclass classification tasks and language modeling in particular. Minimizing this loss results in language models of very good quality. We show that it is possible to fine-tune these models and make them perform even better if they are fine-tuned with sum of cross-entropy loss and reverse Kullback-Leibler divergence. The latter is estimated… ▽ More

    Submitted 15 January, 2019; v1 submitted 12 November, 2018; originally announced November 2018.

  8. arXiv:1712.07473  [pdf, ps, other

    cs.CL cs.CR cs.LG

    Differentially Private Distributed Learning for Language Modeling Tasks

    Authors: Vadim Popov, Mikhail Kudinov, Irina Piontkovskaya, Petr Vytovtov, Alex Nevidomsky

    Abstract: One of the big challenges in machine learning applications is that training data can be different from the real-world data faced by the algorithm. In language modeling, users' language (e.g. in private messaging) could change in a year and be completely different from what we observe in publicly available data. At the same time, public data can be used for obtaining general knowledge (i.e. general… ▽ More

    Submitted 6 March, 2018; v1 submitted 20 December, 2017; originally announced December 2017.

  9. arXiv:1412.4316  [pdf

    cs.CR

    Hasq Hash Chains

    Authors: Oleg Mazonka, Vlad Popov

    Abstract: This paper describes a particular hash-based records linking chain scheme. This scheme is simple conceptually and easy to implement in software. It allows for a simple and secure way to transfer ownership of digital objects between peers.

    Submitted 14 December, 2014; originally announced December 2014.

  10. arXiv:1309.4507  [pdf

    cs.DC

    Faster Fair Solution for the Reader-Writer Problem

    Authors: Vlad Popov, Oleg Mazonka

    Abstract: A fast fair solution for Reader-Writer Problem is presented.

    Submitted 17 September, 2013; originally announced September 2013.

  11. arXiv:1104.4433  [pdf, ps, other

    cs.CC

    Arc-preserving subsequences of arc-annotated sequences

    Authors: Vladimir Yu. Popov

    Abstract: Arc-annotated sequences are useful in representing the structural information of RNA and protein sequences. The longest arc-preserving common subsequence problem has been introduced as a framework for studying the similarity of arc-annotated sequences. In this paper, we consider arc-annotated sequences with various arc structures. We consider the longest arc preserving common subsequence problem.… ▽ More

    Submitted 22 April, 2011; originally announced April 2011.

    MSC Class: 68Q15 ACM Class: F.1.3

    Journal ref: Acta Univ. Sapientiae, Informatica 3, 1 (2011) 35--47

  12. arXiv:1011.3257  [pdf

    cs.HC cs.AI

    Integration of Flexible Web Based GUI in I-SOAS

    Authors: Zeeshan Ahmed, Vasil Popov

    Abstract: It is necessary to improve the concepts of the present web based graphical user interface for the development of more flexible and intelligent interface to provide ease and increase the level of comfort at user end like most of the desktop based applications. This research is conducted targeting the goal of implementing flexible GUI consisting of a visual component manager with different component… ▽ More

    Submitted 14 November, 2010; originally announced November 2010.

    Comments: In the proceedings of 6th I*PROMS Virtual International Conference on Innovative Production Machines and Systems (IPROMS 2010), Session Production Organisation and Management, Cardiff University, Whittles Publishing, Scotland UK, 15-26 November, 2010