
Showing 1–18 of 18 results for author: Volkovs, M

Searching in archive cs.
  1. arXiv:2406.05207

    cs.LG

    Retrieval & Fine-Tuning for In-Context Tabular Models

    Authors: Valentin Thomas, Junwei Ma, Rasa Hosseinzadeh, Keyvan Golestan, Guangwei Yu, Maksims Volkovs, Anthony Caterini

    Abstract: Tabular data is a pervasive modality spanning a wide range of domains, and the inherent diversity poses a considerable challenge for deep learning. Recent advancements using transformer-based in-context learning have shown promise on smaller and less complex datasets, but have struggled to scale to larger and more complex ones. To address this limitation, we propose a combination of retrieval and…

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2312.10144

    cs.LG cs.AI cs.CV

    Data-Efficient Multimodal Fusion on a Single GPU

    Authors: Noël Vouitsis, Zhaoyan Liu, Satya Krishna Gorti, Valentin Villecroze, Jesse C. Cresswell, Guangwei Yu, Gabriel Loaiza-Ganem, Maksims Volkovs

    Abstract: The goal of multimodal alignment is to learn a single latent space that is shared between multimodal inputs. The most powerful models in this space have been trained using massive datasets of paired inputs and large-scale computational resources, making them prohibitively expensive to train in many practical scenarios. We surmise that existing unimodal encoders pre-trained on large amounts of unim…

    Submitted 10 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 (Highlight)

  3. arXiv:2311.18780

    cs.LG

    MultiResFormer: Transformer with Adaptive Multi-Resolution Modeling for General Time Series Forecasting

    Authors: Linfeng Du, Ji Xin, Alex Labach, Saba Zuberi, Maksims Volkovs, Rahul G. Krishnan

    Abstract: Transformer-based models have greatly pushed the boundaries of time series forecasting recently. Existing methods typically encode time series data into $\textit{patches}$ using one or a fixed set of patch lengths. This, however, could result in a lack of ability to capture the variety of intricate temporal dependencies present in real-world multi-periodic time series. In this paper, we propose Mu…

    Submitted 8 February, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  4. arXiv:2310.07756

    cs.LG

    Self-supervised Representation Learning From Random Data Projectors

    Authors: Yi Sui, Tongzi Wu, Jesse C. Cresswell, Ga Wu, George Stein, Xiao Shi Huang, Xiaochen Zhang, Maksims Volkovs

    Abstract: Self-supervised representation learning (SSRL) has advanced considerably by exploiting the transformation invariance assumption under artificially designed data augmentations. While augmentation-based SSRL algorithms push the boundaries of performance in computer vision and natural language processing, they are often not directly applicable to other data modalities, and can conflict with applicati…

    Submitted 20 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper of ICLR 2024. https://openreview.net/pdf?id=EpYnZpDpsQ

  5. arXiv:2304.13017

    cs.LG

    DuETT: Dual Event Time Transformer for Electronic Health Records

    Authors: Alex Labach, Aslesha Pokhrel, Xiao Shi Huang, Saba Zuberi, Seung Eun Yi, Maksims Volkovs, Tomi Poutanen, Rahul G. Krishnan

    Abstract: Electronic health records (EHRs) recorded in hospital settings typically contain a wide range of numeric time series data that is characterized by high sparsity and irregular observations. Effective modelling for such data must exploit its time series nature, the semantic relationship between different types of observations, and information in the sparsity structure of the data. Self-supervised Tr…

    Submitted 15 August, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at MLHC 2023, camera-ready version

  6. arXiv:2206.02999

    cs.CL

    DiMS: Distilling Multiple Steps of Iterative Non-Autoregressive Transformers for Machine Translation

    Authors: Sajad Norouzi, Rasa Hosseinzadeh, Felipe Perez, Maksims Volkovs

    Abstract: The computational benefits of iterative non-autoregressive transformers decrease as the number of decoding steps increases. As a remedy, we introduce Distill Multiple Steps (DiMS), a simple yet effective distillation technique to decrease the number of required steps to reach a certain translation quality. The distilled model enjoys the computational benefits of early iterations while preserving t…

    Submitted 9 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Code available at https://github.com/layer6ai-labs/DiMS

  7. arXiv:2203.15086

    cs.CV

    X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval

    Authors: Satya Krishna Gorti, Noel Vouitsis, Junwei Ma, Keyvan Golestan, Maksims Volkovs, Animesh Garg, Guangwei Yu

    Abstract: In text-video retrieval, the objective is to learn a cross-modal similarity function between a text and a video that ranks relevant text-video pairs higher than irrelevant pairs. However, videos inherently express a much wider gamut of information than texts. Instead, texts often capture sub-regions of entire videos and are most semantically similar to certain frames within videos. Therefore, for…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  8. Decentralized Federated Learning through Proxy Model Sharing

    Authors: Shivam Kalra, Junfeng Wen, Jesse C. Cresswell, Maksims Volkovs, Hamid R. Tizhoosh

    Abstract: Institutions in highly regulated domains such as finance and healthcare often have restrictive rules around data sharing. Federated learning is a distributed learning framework that enables multi-institutional collaborations on decentralized data with improved protection for each collaborator's data privacy. In this paper, we propose a communication-efficient scheme for decentralized federated lea…

    Submitted 22 May, 2023; v1 submitted 22 November, 2021; originally announced November 2021.

    Journal ref: Nature Communications 14, 2899 (2023)

  9. arXiv:2107.14317

    cs.LG

    Temporal Dependencies in Feature Importance for Time Series Predictions

    Authors: Kin Kwan Leung, Clayton Rooke, Jonathan Smith, Saba Zuberi, Maksims Volkovs

    Abstract: Time series data introduces two key challenges for explainability methods: firstly, observations of the same feature over subsequent time steps are not independent, and secondly, the same feature can have varying importance to model predictions over time. In this paper, we propose Windowed Feature Importance in Time (WinIT), a feature removal based explainability approach to address these issues.…

    Submitted 6 March, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: International Conference on Learning Representations 2023 (ICLR'23)

  10. arXiv:2105.02439

    cs.CV

    Weakly Supervised Action Selection Learning in Video

    Authors: Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu

    Abstract: Localizing actions in video is a core task in computer vision. The weakly supervised temporal localization problem investigates whether this task can be adequately solved with only video-level labels, significantly reducing the amount of expensive and error-prone annotation that is required. A common approach is to train a frame-level classifier where frames with the highest class probability are…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: CVPR 2021

  11. arXiv:1912.06185

    cs.CV

    Learning Effective Visual Relationship Detector on 1 GPU

    Authors: Yichao Lu, Cheng Chang, Himanshu Rai, Guangwei Yu, Maksims Volkovs

    Abstract: We present our winning solution to the Open Images 2019 Visual Relationship challenge. This is the largest challenge of its kind to date with nearly 9 million training images. Challenge task consists of detecting objects and identifying relationships between them in complex scenes. Our solution has three stages, first object detection model is fine-tuned for the challenge classes using a novel wei…

    Submitted 12 December, 2019; originally announced December 2019.

  12. arXiv:1911.08548

    cs.CV

    Cross-Class Relevance Learning for Temporal Concept Localization

    Authors: Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Ilya Stanevich, Guangwei Yu

    Abstract: We present a novel Cross-Class Relevance Learning approach for the task of temporal concept localization. Most localization architectures rely on feature extraction layers followed by a classification layer which outputs class probabilities for each segment. However, in many real-world applications classes can exhibit complex relationships that are difficult to model with this architecture. In con…

    Submitted 19 November, 2019; originally announced November 2019.

  13. arXiv:1906.04944

    cs.CV

    Semi-Supervised Exploration in Image Retrieval

    Authors: Cheng Chang, Himanshu Rai, Satya Krishna Gorti, Junwei Ma, Chundi Liu, Guangwei Yu, Maksims Volkovs

    Abstract: We present our solution to Landmark Image Retrieval Challenge 2019. This challenge was based on the large Google Landmarks Dataset V2[9]. The goal was to retrieve all database images containing the same landmark for every provided query image. Our solution is a combination of global and local models to form an initial KNN graph. We then use a novel extension of the recently proposed graph traversa…

    Submitted 12 June, 2019; originally announced June 2019.

  14. arXiv:1904.04137

    stat.AP cs.LG

    Diabetes Mellitus Forecasting Using Population Health Data in Ontario, Canada

    Authors: Mathieu Ravaut, Hamed Sadeghi, Kin Kwan Leung, Maksims Volkovs, Laura C. Rosella

    Abstract: Leveraging health administrative data (HAD) datasets for predicting the risk of chronic diseases including diabetes has gained a lot of attention in the machine learning community recently. In this paper, we use the largest health records datasets of patients in Ontario, Canada. Provided by the Institute of Clinical Evaluative Sciences (ICES), this database is age, gender and ethnicity-diverse. The…

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 18 pages, 3 figures, 8 Tables, Submitted to 2019 ML for Healthcare conference

  15. arXiv:1811.00697

    cs.IR

    Noise Contrastive Estimation for Scalable Linear Models for One-Class Collaborative Filtering

    Authors: Ga Wu, Maksims Volkovs, Chee Loong Soon, Scott Sanner, Himanshu Rai

    Abstract: Previous highly scalable one-class collaborative filtering methods such as Projected Linear Recommendation (PLRec) have advocated using fast randomized SVD to embed items into a latent space, followed by linear regression methods to learn personalized recommendation models per user. Unfortunately, naive SVD embedding methods often exhibit a popularity bias that skews the ability to accurately embe…

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: 8 pages

  16. arXiv:1711.04168

    cs.CL cs.LG stat.ML

    Unsupervised Document Embedding With CNNs

    Authors: Chundi Liu, Shunan Zhao, Maksims Volkovs

    Abstract: We propose a new model for unsupervised document embedding. Leading existing approaches either require complex inference or use recurrent neural networks (RNN) that are difficult to parallelize. We take a different route and develop a convolutional neural network (CNN) embedding model. Our CNN architecture is fully parallelizable resulting in over 10x speedup in inference time over RNN models. Par…

    Submitted 19 February, 2018; v1 submitted 11 November, 2017; originally announced November 2017.

    Comments: Major revision with additional experiments and model description

  17. arXiv:1502.00527

    cs.IR

    Context Models For Web Search Personalization

    Authors: Maksims Volkovs

    Abstract: We present our solution to the Yandex Personalized Web Search Challenge. The aim of this challenge was to use the historical search logs to personalize top-N document rankings for a set of test users. We used over 100 features extracted from user- and query-dependent contexts to train neural net and tree-based learning-to-rank and regression models. Our final submission, which was a blend of severa…

    Submitted 2 February, 2015; originally announced February 2015.

  18. arXiv:1107.1805

    stat.ML cs.AI

    Loss-sensitive Training of Probabilistic Conditional Random Fields

    Authors: Maksims N. Volkovs, Hugo Larochelle, Richard S. Zemel

    Abstract: We consider the problem of training probabilistic conditional random fields (CRFs) in the context of a task where performance is measured using a specific loss function. While maximum likelihood is the most common approach to training CRFs, it ignores the inherent structure of the task's loss function. We describe alternatives to maximum likelihood which take that loss into account. These include…

    Submitted 9 July, 2011; originally announced July 2011.