
Showing 1–14 of 14 results for author: Arkhipkin, V

  1. arXiv:2312.03511  [pdf, other]

    cs.CV cs.LG cs.MM

    Kandinsky 3.0 Technical Report

    Authors: Vladimir Arkhipkin, Andrei Filatov, Viacheslav Vasilev, Anastasia Maltseva, Said Azizov, Igor Pavlov, Julia Agafonova, Andrey Kuznetsov, Denis Dimitrov

    Abstract: We present Kandinsky 3.0, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation. In this report we describe the architecture of the model, the data collection procedure, the training technique, and the production system for user interaction…

    Submitted 28 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://ai-forever.github.io/Kandinsky-3

  2. arXiv:2311.13073  [pdf, other]

    cs.CV cs.LG cs.MM

    FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

    Authors: Vladimir Arkhipkin, Zein Shaheen, Viacheslav Vasilev, Elizaveta Dakhova, Andrey Kuznetsov, Denis Dimitrov

    Abstract: Multimedia generation approaches occupy a prominent place in artificial intelligence research. Text-to-image models achieved high-quality results over the last few years. However, video synthesis methods recently started to develop. This paper presents a new two-stage latent diffusion text-to-video generation architecture based on the text-to-image diffusion model. The first stage concerns keyfram…

    Submitted 20 December, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Project page: https://ai-forever.github.io/kandinsky-video/

  3. arXiv:2310.03502  [pdf, other]

    cs.CV

    Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion

    Authors: Anton Razzhigaev, Arseniy Shakhmatov, Anastasia Maltseva, Vladimir Arkhipkin, Igor Pavlov, Ilya Ryabov, Angelina Kuts, Alexander Panchenko, Andrey Kuznetsov, Denis Dimitrov

    Abstract: Text-to-image generation is a significant domain in modern computer vision and has achieved substantial improvements through the evolution of generative architectures. Among these, there are diffusion-based models that have demonstrated essential quality enhancements. These models are generally split into two categories: pixel-level and latent-level approaches. We present Kandinsky1, a novel explo…

    Submitted 5 October, 2023; originally announced October 2023.

  4. arXiv:2302.05259  [pdf, other]

    stat.ML cs.LG

    Star-Shaped Denoising Diffusion Probabilistic Models

    Authors: Andrey Okhotin, Dmitry Molchanov, Vladimir Arkhipkin, Grigory Bartosh, Viktor Ohanesian, Aibek Alanov, Dmitry Vetrov

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) provide the foundation for the recent breakthroughs in generative modeling. Their Markovian structure makes it difficult to define DDPMs with distributions other than Gaussian or discrete. In this paper, we introduce Star-Shaped DDPM (SS-DDPM). Its star-shaped diffusion process allows us to bypass the need to define the transition probabilities or c…

    Submitted 28 October, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: Accepted at NeurIPS 2023

  5. arXiv:2208.00406  [pdf, other]

    cs.LG cs.AI cs.CE cs.CY

    Eco2AI: carbon emissions tracking of machine learning models as the first step towards sustainable AI

    Authors: Semen Budennyy, Vladimir Lazarev, Nikita Zakharenko, Alexey Korovin, Olga Plosskaya, Denis Dimitrov, Vladimir Arkhipkin, Ivan Oseledets, Ivan Barsola, Ilya Egorov, Aleksandra Kosterina, Leonid Zhukov

    Abstract: The size and complexity of deep neural networks continue to grow exponentially, significantly increasing energy consumption for training and inference by these models. We introduce an open-source package eco2AI to help data scientists and researchers to track energy consumption and equivalent CO2 emissions of their models in a straightforward way. In eco2AI we put emphasis on accuracy of energy co…

    Submitted 3 August, 2022; v1 submitted 31 July, 2022; originally announced August 2022.

    Comments: Source code for eco2AI package (energy consumption and carbon emission tracker of code in python) is available at: https://github.com/sb-ai-lab/Eco2AI , the package is also available at PyPi: https://pypi.org/project/eco2ai/

  6. arXiv:2111.10974  [pdf, other]

    cs.CV cs.AI cs.CL

    Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture

    Authors: Daria Bakshandaeva, Denis Dimitrov, Vladimir Arkhipkin, Alex Shonenkov, Mark Potanin, Denis Karachev, Andrey Kuznetsov, Anton Voronov, Vera Davydova, Elena Tutubalina, Aleksandr Petiushko

    Abstract: Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called Fusion Brain, the first competition which is targeted to make the universal architecture which could process different modalities (in this case, images, texts, and code) and solve multiple tasks for vision and language. The Fusion Brain Challenge combines the following specific tasks: Code2code Transl…

    Submitted 28 December, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

  7. arXiv:1703.00310  [pdf, other]

    physics.optics

    Chiral Optical Tamm States: Temporal Coupled-Mode Theory

    Authors: Ivan V. Timofeev, Pavel S. Pankin, Stepan Ya. Vetrov, Vasily G. Arkhipkin, Wei Lee, Victor Ya. Zyryanov

    Abstract: The chiral optical Tamm state (COTS) is a special localized state at the interface of a handedness-preserving mirror and a structurally chiral medium such as a cholesteric liquid crystal or a chiral sculptured thin film. The spectral behavior of COTS, observed as reflection resonances, is described by the temporal coupled-mode theory. Mode coupling is different for two circular light polarizations…

    Submitted 30 March, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

    Comments: the text is available both in English and in Russian, reported at ACLC-2017 (http://aclc2017.conf.tw/)

    Journal ref: Crystals 7, 113 (2017)

  8. arXiv:1612.03167  [pdf, ps, other]

    quant-ph physics.optics

    Quantum properties of a parametric four-wave mixing in a Raman-type atomic system

    Authors: A. V. Sharypov, Bing He, V. G. Arkhipkin, S. A. Myslivets

    Abstract: We present a study of the quantum properties of two light fields used to parametric four-wave mixing in a Raman-type atomic system. The system realizes an effective Hamiltonian of beamsplitter type coupling between the light fields, which allows to control squeezing and amplitude distribution of the light fields, as well as realizing their entanglement. The scheme can be feasibly applied to engine…

    Submitted 9 December, 2016; originally announced December 2016.

    Comments: 6 pages, 4 figures

    Journal ref: Phys. Rev. A 95, 053812 (2017)

  9. Coherently controlling Raman-induced grating in atomic media

    Authors: V. G. Arkhipkin, S. A. Myslivets, I. V. Timofeev

    Abstract: We consider dynamically controllable periodic structures, called Raman induced gratings, in three- and four-level atomic media, resulting from Raman interaction in a standing-wave pump. These gratings are due to periodic spatial modulation of the Raman nonlinearity and fundamentally differ from the ones based on electromagnetically induced transparency. The transmission and reflection spectra of s…

    Submitted 26 November, 2015; originally announced November 2015.

    Comments: 7 pages, 12 figures

  10. Geometric phase and o-mode blue shift in a chiral anisotropic medium inside a Fabry-Pérot cavity

    Authors: I. V. Timofeev, V. A. Gunyakov, V. S. Sutormin, S. A. Myslivets, V. G. Arkhipkin, S. Ya. Vetrov, W. Lee, V. Ya. Zyryanov

    Abstract: Anomalous spectral shift of transmission peaks is observed in a Fabry-Pérot cavity filled with a chiral anisotropic medium. The effective refractive index value resides out of the interval between the ordinary and the extraordinary refractive indices. The spectral shift is explained by contribution of a geometric phase. The problem is solved analytically using the approximate Jones matrix method,…

    Submitted 19 September, 2015; originally announced September 2015.

    Comments: the text is available both in English (Timofeev2015en.tex) and in Russian (download: other formats - source - Timofeev2015ru.tex, Timofeev2015rus.pdf)

    Journal ref: Phys. Rev. E 92, 052504 (2015)

  11. arXiv:1110.4725  [pdf]

    physics.optics physics.chem-ph physics.comp-ph

    Voltage-induced defect mode interaction in a one-dimensional photonic crystal with a twisted-nematic defect layer

    Authors: Ivan V. Timofeev, Yu-Ting Lin, Vladimir A. Gunyakov, Sergey A. Myslivets, Vasily G. Arkhipkin, Stepan Ya. Vetrov, Wei Lee, Victor Ya. Zyryanov

    Abstract: Defect modes are investigated in a band gap of an electrically tunable one-dimensional photonic crystal infiltrated with a twisted-nematic liquid crystal (1D PC/TN). Their frequency shift and interference under applied voltage are studied both experimentally and theoretically. We deal with the case where the defect layer thickness is much larger than the wavelength (Mauguin condition). It is shown…

    Submitted 21 October, 2011; originally announced October 2011.

    Comments: 14 pages, 8 figures, sent to PRE

    Journal ref: Phys. Rev. E 85, 011705 (2012)

  12. arXiv:0909.0092  [pdf, ps, other]

    quant-ph physics.optics

    Ultranarrow resonance peaks in the transmission and reflection spectra of a photonic crystal cavity with Raman gain

    Authors: V. G. Arkhipkin, S. A. Myslivets

    Abstract: The Raman gain of a probe light in a three-state $Λ$-scheme placed into a defect of a one-dimensional photonic crystal is studied theoretically. We show that there exists a pump intensity range, where the transmission and reflection spectra of the probe field exhibit \textit{simultaneously} occurring narrow peaks (resonances) whose position is determined by the Raman resonance. Transmission and…

    Submitted 1 September, 2009; originally announced September 2009.

    Comments: 9 pages, 3 figures

  13. Temporal shape manipulation of adiabatons

    Authors: V. G. Arkhipkin, I. V. Timofeev

    Abstract: We describe how to control the temporal shape of adiabaton using peculiarities of propagation dynamics under coherent population trapping. Temporal compression is demonstrated as a special case of pulse shaping. The general case of unequal oscillator strengths of two optical transitions in atom is considered.

    Submitted 6 August, 2005; v1 submitted 23 June, 2005; originally announced June 2005.

    Comments: 5 pages, 7 figures, LaTeX, sent to Phys. Rev. A, correct indices added in fig 1

    Journal ref: Phys. Rev. A 73, 025803 (2006)

  14. Spatial evolution of short pulses under coherent population trapping

    Authors: V. G. Arkhipkin, I. V. Timofeev

    Abstract: Spatial and temporal evolution is studied of two powerful short laser pulses having different wavelengths and interacting with a dense three-level Lambda-type optical medium under coherent population trapping. A general case of unequal oscillator strengths of the transitions is considered. Durations of the probe pulse and the coupling pulse $T_{1,2}$ ($T_2>T_1$) are assumed to be shorter than an…

    Submitted 30 July, 2001; v1 submitted 23 March, 2001; originally announced March 2001.

    Comments: 16 pages revtex style, 7 EPS figures, accepted to Physical Review A

    Journal ref: Phys. Rev. A 64, 053811 (2001)