Skip to main content

Showing 1–30 of 30 results for author: Ustyuzhanin, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.05666  [pdf, other

    cs.CV

    YaART: Yet Another ART Rendering Technology

    Authors: Sergey Kastryulin, Artem Konev, Alexander Shishenya, Eugene Lyapustin, Artem Khurshudov, Alexander Tselousov, Nikita Vinokurov, Denis Kuznedelev, Alexander Markovich, Grigoriy Livshits, Alexey Kirillov, Anastasiia Tabisheva, Liubov Chubarova, Marina Kaminskaia, Alexander Ustyuzhanin, Artemii Shvetsov, Daniil Shlenskii, Valerii Startsev, Dmitrii Kornilov, Mikhail Romanov, Artem Babenko, Sergei Ovcharenko, Valentin Khrulkov

    Abstract: In the rapidly progressing field of generative models, the development of efficient and high-fidelity text-to-image diffusion systems represents a significant frontier. This study introduces YaART, a novel production-grade text-to-image cascaded diffusion model aligned to human preferences using Reinforcement Learning from Human Feedback (RLHF). During the development of YaART, we especially focus… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Prompts and additional information are available on the project page, see https://ya.ru/ai/art/paper-yaart-v1

  2. arXiv:2403.11585  [pdf, other

    cs.LG cs.AI cs.CL cs.PL cs.SE

    Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines

    Authors: Ekaterina Trofimova, Emil Sataev, Andrey E. Ustyuzhanin

    Abstract: In the ever-evolving landscape of machine learning, seamless translation of natural language descriptions into executable code remains a formidable challenge. This paper introduces Linguacodus, an innovative framework designed to tackle this challenge by deploying a dynamic pipeline that iteratively transforms natural language task descriptions into code through high-level data-sha** instruction… ▽ More

    Submitted 30 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2312.05185  [pdf, ps, other

    cs.LG cs.AI

    AI Competitions and Benchmarks: Competition platforms

    Authors: Andrey Ustyuzhanin, Harald Carlens

    Abstract: The ecosystem of artificial intelligence competitions is a diverse and multifaceted landscape, encompassing a variety of platforms that each host numerous competitions annually, alongside a plethora of specialized websites dedicated to singular contests. These platforms adeptly manage the overarching administrative responsibilities inherent in orchestrating competitions, thus affording organizers… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  4. arXiv:2301.06064  [pdf, other

    cs.LG cs.AI cs.SC

    Symbolic expression generation via Variational Auto-Encoder

    Authors: Sergei Popov, Mikhail Lazarev, Vladislav Belavin, Denis Derkach, Andrey Ustyuzhanin

    Abstract: There are many problems in physics, biology, and other natural sciences in which symbolic regression can provide valuable insights and discover new laws of nature. A widespread Deep Neural Networks do not provide interpretable solutions. Meanwhile, symbolic expressions give us a clear relation between observations and the target variable. However, at the moment, there is no dominant solution for t… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

  5. arXiv:2210.16018  [pdf, other

    cs.SE

    Code4ML: a Large-scale Dataset of annotated Machine Learning Code

    Authors: Anastasia Drozdova, Polina Guseva, Ekaterina Trofimova, Anna Scherbakova, Andrey Ustyuzhanin

    Abstract: Program code as a data source is gaining popularity in the data science community. Possible applications for models trained on such assets range from classification for data dimensionality reduction to automatic code generation. However, without annotation number of methods that could be applied is somewhat limited. To address the lack of annotated datasets, we present the Code4ML corpus. It conta… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Under review

  6. arXiv:2110.08626  [pdf, other

    cs.LG cs.SD eess.AS

    Learning velocity model for complex media with deep convolutional neural networks

    Authors: A. Stankevich, I. Nechepurenko, A. Shevchenko, L. Gremyachikh, A. Ustyuzhanin, A. Vasyukov

    Abstract: The paper considers the problem of velocity model acquisition for a complex media based on boundary measurements. The acoustic model is used to describe the media. We used an open-source dataset of velocity distributions to compare the presented results with the previous works directly. Forward modeling is performed using the grid-characteristic numerical method. The inverse problem is solved usin… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: 14 pages, 6 figures, 6 tables

    MSC Class: 86-10; 86A22 ACM Class: I.2.6

  7. arXiv:2105.01160  [pdf, other

    cs.LG hep-ex

    The Tracking Machine Learning challenge : Throughput phase

    Authors: Sabrina Amrouche, Laurent Basara, Paolo Calafiura, Dmitry Emeliyanov, Victor Estrade, Steven Farrell, Cécile Germain, Vladimir Vava Gligorov, Tobias Golling, Sergey Gorbunov, Heather Gray, Isabelle Guyon, Mikhail Hushchyn, Vincenzo Innocente, Moritz Kiehn, Marcel Kunze, Edward Moyse, David Rousseau, Andreas Salzburger, Andrey Ustyuzhanin, Jean-Roch Vlimant

    Abstract: This paper reports on the second "Throughput" phase of the Tracking Machine Learning (TrackML) challenge on the Codalab platform. As in the first "Accuracy" phase, the participants had to solve a difficult experimental problem linked to tracking accurately the trajectory of particles as e.g. created at the Large Hadron Collider (LHC): given O($10^5$) points, the participants had to connect them in… ▽ More

    Submitted 14 May, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: submitted to Computing and Software for Big Science

  8. Segmentation of EM showers for neutrino experiments with deep graph neural networks

    Authors: Vladislav Belavin, Ekaterina Trofimova, Andrey Ustyuzhanin

    Abstract: We introduce a first-ever algorithm for the reconstruction of multiple showers from the data collected with electromagnetic (EM) sampling calorimeters. Such detectors are widely used in High Energy Physics to measure the energy and kinematics of in-going particles. In this work, we consider the case when many electrons pass through an Emulsion Cloud Chamber (ECC) brick, initiating electron-induced… ▽ More

    Submitted 9 December, 2021; v1 submitted 5 April, 2021; originally announced April 2021.

    Comments: 29 pages, 27 figures

  9. arXiv:2101.07100  [pdf, other

    cs.LG

    Online detection of failures generated by storage simulator

    Authors: Kenenbek Arzymatov, Mikhail Hushchyn, Andrey Sapronov, Vladislav Belavin, Leonid Gremyachikh, Maksim Karpov, Andrey Ustyuzhanin

    Abstract: Modern large-scale data-farms consist of hundreds of thousands of storage devices that span distributed infrastructure. Devices used in modern data centers (such as controllers, links, SSD- and HDD-disks) can fail due to hardware as well as software problems. Such failures or anomalies can be detected by monitoring the activity of components using machine learning techniques. In order to use these… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  10. arXiv:2007.04295  [pdf, other

    cs.CV astro-ph.HE

    A study of Neural networks point source extraction on simulated Fermi/LAT Telescope images

    Authors: Mariia Drozdova, Anton Broilovskiy, Andrey Ustyuzhanin, Denys Malyshev

    Abstract: Astrophysical images in the GeV band are challenging to analyze due to the strong contribution of the background and foreground astrophysical diffuse emission and relatively broad point spread function of modern space-based instruments. In certain cases, even finding of point sources on the image becomes a non-trivial task. We present a method for point sources extraction using a convolution neura… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: Accepted to Astronomische Nachrichten

  11. arXiv:2002.04632  [pdf, other

    cs.LG hep-ex physics.data-an stat.ML

    Black-Box Optimization with Local Generative Surrogates

    Authors: Sergey Shirobokov, Vladislav Belavin, Michael Kagan, Andrey Ustyuzhanin, Atılım Güneş Baydin

    Abstract: We propose a novel method for gradient-based optimization of black-box simulators using differentiable local surrogate models. In fields such as physics and engineering, many processes are modeled with non-differentiable simulators with intractable likelihoods. Optimization of these forward models is particularly challenging, especially when the simulator is stochastic. To address such cases, we i… ▽ More

    Submitted 15 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Journal ref: In Advances in Neural Information Processing Systems 34 (NeurIPS), 2020

  12. Generalization of Change-Point Detection in Time Series Data Based on Direct Density Ratio Estimation

    Authors: Mikhail Hushchyn, Andrey Ustyuzhanin

    Abstract: The goal of the change-point detection is to discover changes of time series distribution. One of the state of the art approaches of the change-point detection are based on direct density ratio estimation. In this work we show how existing algorithms can be generalized using various binary classification and regression models. In particular, we show that the Gradient Boosting over Decision Trees a… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

  13. arXiv:1912.09323  [pdf

    cs.LG stat.ML

    NFAD: Fixing anomaly detection using normalizing flows

    Authors: Artem Ryzhikov, Maxim Borisyak, Andrey Ustyuzhanin, Denis Derkach

    Abstract: Anomaly detection is a challenging task that frequently arises in practically all areas of industry and science, from fraud detection and data quality monitoring to finding rare cases of diseases and searching for new physics. Most of the conventional approaches to anomaly detection, such as one-class SVM and Robust Auto-Encoder, are one-class classification methods, i.e. focus on separating norma… ▽ More

    Submitted 19 November, 2021; v1 submitted 19 December, 2019; originally announced December 2019.

    Journal ref: PeerJ Computer Science 7:e757 (2021)

  14. Adaptive Divergence for Rapid Adversarial Optimization

    Authors: Maxim Borisyak, Tatiana Gaintseva, Andrey Ustyuzhanin

    Abstract: Adversarial Optimization (AO) provides a reliable, practical way to match two implicitly defined distributions, one of which is usually represented by a sample of real data, and the other is defined by a generator. Typically, AO involves training of a high-capacity model on each step of the optimization. In this work, we consider computationally heavy generators, for which training of high-capacit… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Journal ref: PeerJ Computer Science. 2020 May;6:e274

  15. arXiv:1906.06096  [pdf, other

    stat.ML cs.LG

    $(1 + \varepsilon)$-class Classification: an Anomaly Detection Method for Highly Imbalanced or Incomplete Data Sets

    Authors: Maxim Borisyak, Artem Ryzhikov, Andrey Ustyuzhanin, Denis Derkach, Fedor Ratnikov, Olga Mineeva

    Abstract: Anomaly detection is not an easy problem since distribution of anomalous samples is unknown a priori. We explore a novel method that gives a trade-off possibility between one-class and two-class approaches, and leads to a better performance on anomaly detection problems with small or non-representative anomalous samples. The method is evaluated using several data sets and compared to a set of conv… ▽ More

    Submitted 14 June, 2019; originally announced June 2019.

    Journal ref: Journal of Machine Learning Research. 2020;21(72):1-22

  16. arXiv:1905.11825  [pdf, other

    physics.ins-det cs.LG hep-ex

    Fast Data-Driven Simulation of Cherenkov Detectors Using Generative Adversarial Networks

    Authors: Artem Maevskiy, Denis Derkach, Nikita Kazeev, Andrey Ustyuzhanin, Maksim Artemev, Lucio Anderlini

    Abstract: The increasing luminosities of future Large Hadron Collider runs and next generation of collider experiments will require an unprecedented amount of simulated events to be produced. Such large scale productions are extremely demanding in terms of computing resources. Thus new approaches to event generation and simulation of detector responses are needed. In LHCb, the accurate simulation of Cherenk… ▽ More

    Submitted 26 September, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Proceedings for 19th International Workshop on Advanced Computing and Analysis Techniques in Physics Research. (Fixed typos and added one missing reference in the revised version.)

    Journal ref: J. Phys.: Conf. Ser. 1525 012097 (2020)

  17. arXiv:1903.11788  [pdf, other

    hep-ex cs.LG physics.ins-det

    Cherenkov Detectors Fast Simulation Using Neural Networks

    Authors: Denis Derkach, Nikita Kazeev, Fedor Ratnikov, Andrey Ustyuzhanin, Alexandra Volokhova

    Abstract: We propose a way to simulate Cherenkov detector response using a generative adversarial neural network to bypass low-level details. This network is trained to reproduce high level features of the simulated detector events based on input observables of incident particles. This allows the dramatic increase of simulation speed. We demonstrate that this approach provides simulation precision which is… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: In proceedings of 10th International Workshop on Ring Imaging Cherenkov Detectors

  18. arXiv:1902.02095  [pdf, other

    eess.SY cs.LG

    Space Navigator: a Tool for the Optimization of Collision Avoidance Maneuvers

    Authors: Leonid Gremyachikh, Dmitrii Dubov, Nikita Kazeev, Andrey Kulibaba, Andrey Skuratov, Anton Tereshkin, Andrey Ustyuzhanin, Lubov Shiryaeva, Sergej Shishkin

    Abstract: The number of space objects will grow several times in a few years due to the planned launches of constellations of thousands microsatellites. It leads to a significant increase in the threat of satellite collisions. Spacecraft must undertake collision avoidance maneuvers to mitigate the risk. According to publicly available information, conjunction events are now manually handled by operators on… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: Submitted to AAS Advances in the Astronautical Sciences, presented at IAA SciTech Forum 2018

    Journal ref: Advances in the Astronautical Sciences 2020 First IAA/AAS SciTech Forum on Space Flight Mechanics and Space Structures and Materials Conference, volume 170

  19. arXiv:1812.01319  [pdf, other

    physics.data-an cs.LG

    Generative Models for Fast Calorimeter Simulation.LHCb case

    Authors: Viktoria Chekalina, Elena Orlova, Fedor Ratnikov, Dmitry Ulyanov, Andrey Ustyuzhanin, Egor Zakharov

    Abstract: Simulation is one of the key components in high energy physics. Historically it relies on the Monte Carlo methods which require a tremendous amount of computation resources. These methods may have difficulties with the expected High Luminosity Large Hadron Collider (HL LHC) need, so the experiment is in urgent need of new fast simulation techniques. We introduce a new Deep Learning framework based… ▽ More

    Submitted 6 April, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: Proceedings of the presentation at CHEP 2018 Conference

  20. arXiv:1807.02876  [pdf, other

    physics.comp-ph cs.LG hep-ex stat.ML

    Machine Learning in High Energy Physics Community White Paper

    Authors: Kim Albertsson, Piero Altoe, Dustin Anderson, John Anderson, Michael Andrews, Juan Pedro Araque Espinosa, Adam Aurisano, Laurent Basara, Adrian Bevan, Wahid Bhimji, Daniele Bonacorsi, Bjorn Burkle, Paolo Calafiura, Mario Campanelli, Louis Capps, Federico Carminati, Stefano Carrazza, Yi-fan Chen, Taylor Childers, Yann Coadou, Elias Coniavitis, Kyle Cranmer, Claire David, Douglas Davis, Andrea De Simone , et al. (103 additional authors not shown)

    Abstract: Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We d… ▽ More

    Submitted 16 May, 2019; v1 submitted 8 July, 2018; originally announced July 2018.

    Comments: Editors: Sergei Gleyzer, Paul Seyfert and Steven Schramm

  21. arXiv:1711.07051  [pdf, other

    physics.data-an cs.LG hep-ex

    Deep learning for inferring cause of data anomalies

    Authors: V. Azzolini, M. Borisyak, G. Cerminara, D. Derkach, G. Franzoni, F. De Guio, O. Koval, M. Pierini, A. Pol, F. Ratnikov, F. Siroky, A. Ustyuzhanin, J-R. Vlimant

    Abstract: Daily operation of a large-scale experiment is a resource consuming task, particularly from perspectives of routine data quality monitoring. Typically, data comes from different sub-detectors and the global quality of data depends on the combinatorial performance of each of them. In this paper, the problem of identifying channels in which anomalies occurred is considered. We introduce a generic de… ▽ More

    Submitted 19 November, 2017; originally announced November 2017.

    Comments: Presented at ACAT 2017 conference, Seattle, USA

  22. arXiv:1709.08610  [pdf, other

    cs.CV hep-ex physics.data-an

    Numerical optimization for Artificial Retina Algorithm

    Authors: Maxim Borisyak, Andrey Ustyuzhanin, Denis Derkach, Mikhail Belous

    Abstract: High-energy physics experiments rely on reconstruction of the trajectories of particles produced at the interaction point. This is a challenging task, especially in the high track multiplicity environment generated by p-p collisions at the LHC energies. A typical event includes hundreds of signal examples (interesting decays) and a significant amount of noise (uninteresting examples). This work… ▽ More

    Submitted 1 October, 2017; v1 submitted 25 September, 2017; originally announced September 2017.

  23. arXiv:1709.08607  [pdf, other

    physics.data-an cs.AI cs.LG hep-ex

    Towards automation of data quality system for CERN CMS experiment

    Authors: Maxim Borisyak, Fedor Ratnikov, Denis Derkach, Andrey Ustyuzhanin

    Abstract: Daily operation of a large-scale experiment is a challenging task, particularly from perspectives of routine monitoring of quality for data being taken. We describe an approach that uses Machine Learning for the automated system to monitor data quality, which is based on partial use of data qualified manually by detector experts. The system automatically classifies marginal cases: both of good an… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

  24. arXiv:1709.08605  [pdf, other

    cs.CV astro-ph.IM physics.ins-det

    Muon Trigger for Mobile Phones

    Authors: Maxim Borisyak, Michail Usvyatsov, Michael Mulhearn, Chase Shimmin, Andrey Ustyuzhanin

    Abstract: The CRAYFIS experiment proposes to use privately owned mobile phones as a ground detector array for Ultra High Energy Cosmic Rays. Upon interacting with Earth's atmosphere, these events produce extensive particle showers which can be detected by cameras on mobile phones. A typical shower contains minimally-ionizing particles such as muons. As these particles interact with CMOS image sensors, they… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

  25. GRID Storage Optimization in Transparent and User-Friendly Way for LHCb Datasets

    Authors: Mikhail Hushchyn, Andrey Ustyuzhanin, Philippe Charpentier, Christophe Haen

    Abstract: The LHCb collaboration is one of the four major experiments at the Large Hadron Collider at CERN. Many petabytes of data are produced by the detectors and Monte-Carlo simulations. The LHCb Grid interware LHCbDIRAC is used to make data available to all collaboration members around the world. The data is replicated to the Grid sites in different locations. However the Grid disk storage is limited an… ▽ More

    Submitted 12 May, 2017; originally announced May 2017.

  26. Everware toolkit. Supporting reproducible science and challenge-driven education

    Authors: Andrey Ustyuzhanin, Timothy Daniel Head, Igor Babuschkin, Alexander Tiunov

    Abstract: Modern science clearly demands for a higher level of reproducibility and collaboration. To make research fully reproducible one has to take care of several aspects: research protocol description, data access, environment preservation, workflow pipeline, and analysis script preservation. Version control systems like git help with the workflow and analysis scripts part. Virtualization techniques lik… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

  27. LHCb trigger streams optimization

    Authors: D. Derkach, N. Kazeev, R. Neychev, A. Panin, I. Trofimov, A. Ustyuzhanin, M. Vesterinen

    Abstract: The LHCb experiment stores around $10^{11}$ collision events per year. A typical physics analysis deals with a final sample of up to $10^7$ events. Event preselection algorithms (lines) are used for data reduction. Since the data are stored in a format that requires sequential access, the lines are grouped into several output file streams, in order to increase the efficiency of user analysis jobs… ▽ More

    Submitted 6 June, 2017; v1 submitted 17 February, 2017; originally announced February 2017.

    Comments: Submitted to CHEP-2016 proceedings

    Journal ref: Journal of Physics: Conference Series. Vol. 898. No. 6. IOP Publishing, 2017

  28. arXiv:1510.00132  [pdf, other

    cs.DC cs.LG physics.data-an

    Disk storage management for LHCb based on Data Popularity estimator

    Authors: Mikhail Hushchyn, Philippe Charpentier, Andrey Ustyuzhanin

    Abstract: This paper presents an algorithm providing recommendations for optimizing the LHCb data storage. The LHCb data storage system is a hybrid system. All datasets are kept as archives on magnetic tapes. The most popular datasets are kept on disks. The algorithm takes the dataset usage history and metadata (size, type, configuration etc.) to generate a recommendation report. This article presents how w… ▽ More

    Submitted 1 October, 2015; originally announced October 2015.

  29. arXiv:1507.07374  [pdf, other

    cs.LG cs.AI cs.NE

    A genetic algorithm for autonomous navigation in partially observable domain

    Authors: Maxim Borisyak, Andrey Ustyuzhanin

    Abstract: The problem of autonomous navigation is one of the basic problems for robotics. Although, in general, it may be challenging when an autonomous vehicle is placed into partially observable domain. In this paper we consider simplistic environment model and introduce a navigation algorithm based on Learning Classifier System.

    Submitted 27 July, 2015; originally announced July 2015.

    MSC Class: 68T05

  30. Event Index - an LHCb Event Search System

    Authors: Andrey Ustyuzhanin, Alexey Artemov, Nikita Kazeev, Artem Redkin

    Abstract: During LHC Run 1, the LHCb experiment recorded around $10^{11}$ collision events. This paper describes Event Index - an event search system. Its primary function is to quickly select subsets of events from a combination of conditions, such as the estimated decay channel or number of hits in a subdetector. Event Index is essentially Apache Lucene optimized for read-only indexes distributed over ind… ▽ More

    Submitted 26 October, 2015; v1 submitted 27 May, 2015; originally announced May 2015.

    Comments: Report for the proceedings of the CHEP-2015 conference

    Journal ref: Journal of Physics: Conference Series, vol. 664, num 3, pages 032019, 2015