Search | arXiv e-print repository

doi 10.1109/CVPR52688.2022.01207

Multi-modal Extreme Classification

Authors: Anshul Mittal, Kunal Dahiya, Shreya Malani, Janani Ramaswamy, Seba Kuruvilla, Jitendra Ajmera, Keng-hao Chang, Sumeet Agarwal, Purushottam Kar, Manik Varma

Abstract: This paper develops the MUFIN technique for extreme classification (XC) tasks with millions of labels where datapoints and labels are endowed with visual and textual descriptors. Applications of MUFIN to product-to-product recommendation and bid query prediction over several millions of products are presented. Contemporary multi-modal methods frequently rely on purely embedding-based methods. On t… ▽ More This paper develops the MUFIN technique for extreme classification (XC) tasks with millions of labels where datapoints and labels are endowed with visual and textual descriptors. Applications of MUFIN to product-to-product recommendation and bid query prediction over several millions of products are presented. Contemporary multi-modal methods frequently rely on purely embedding-based methods. On the other hand, XC methods utilize classifier architectures to offer superior accuracies than embedding only methods but mostly focus on text-based categorization tasks. MUFIN bridges this gap by reformulating multi-modal categorization as an XC problem with several millions of labels. This presents the twin challenges of develo** multi-modal architectures that can offer embeddings sufficiently expressive to allow accurate categorization over millions of labels; and training and inference routines that scale logarithmically in the number of labels. MUFIN develops an architecture based on cross-modal attention and trains it in a modular fashion using pre-training and positive and negative mining. A novel product-to-product recommendation dataset MM-AmazonTitles-300K containing over 300K products was curated from publicly available amazon.com listings with each product endowed with a title and multiple images. On the all datasets MUFIN offered at least 3% higher accuracy than leading text-based, image-based and multi-modal techniques. Code for MUFIN is available at https://github.com/Extreme-classification/MUFIN △ Less

Submitted 10 September, 2023; originally announced September 2023.

ACM Class: H.3.3

Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022

arXiv:2304.14214 [pdf, other]

Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information

Authors: Saurabh Malani, Tom S. Bertalan, Tianqi Cui, Jose L. Avalos, Michael Betenbaugh, Ioannis G. Kevrekidis

Abstract: Experimental data is often comprised of variables measured independently, at different sampling rates (non-uniform $Δ$t between successive measurements); and at a specific time point only a subset of all variables may be sampled. Approaches to identifying dynamical systems from such data typically use interpolation, imputation or subsampling to reorganize or modify the training data… ▽ More Experimental data is often comprised of variables measured independently, at different sampling rates (non-uniform $Δ$t between successive measurements); and at a specific time point only a subset of all variables may be sampled. Approaches to identifying dynamical systems from such data typically use interpolation, imputation or subsampling to reorganize or modify the training data $\textit{prior}$ to learning. Partial physical knowledge may also be available $\textit{a priori}$ (accurately or approximately), and data-driven techniques can complement this knowledge. Here we exploit neural network architectures based on numerical integration methods and $\textit{a priori}$ physical knowledge to identify the right-hand side of the underlying governing differential equations. Iterates of such neural-network models allow for learning from data sampled at arbitrary time points $\textit{without}$ data modification. Importantly, we integrate the network with available partial physical knowledge in "physics informed gray-boxes"; this enables learning unknown kinetic rates or microbial growth functions while simultaneously estimating experimental parameters. △ Less

Submitted 27 April, 2023; originally announced April 2023.

Comments: 25 pages, 15 figures

arXiv:2104.13101 [pdf, other]

doi 10.1063/5.0055371

Initializing LSTM internal states via manifold learning

Authors: Felix P. Kemeth, Tom Bertalan, Nikolaos Evangelou, Tianqi Cui, Saurabh Malani, Ioannis G. Kevrekidis

Abstract: We present an approach, based on learning an intrinsic data manifold, for the initialization of the internal state values of LSTM recurrent neural networks, ensuring consistency with the initial observed input data. Exploiting the generalized synchronization concept, we argue that the converged, "mature" internal states constitute a function on this learned manifold. The dimension of this manifold… ▽ More We present an approach, based on learning an intrinsic data manifold, for the initialization of the internal state values of LSTM recurrent neural networks, ensuring consistency with the initial observed input data. Exploiting the generalized synchronization concept, we argue that the converged, "mature" internal states constitute a function on this learned manifold. The dimension of this manifold then dictates the length of observed input time series data required for consistent initialization. We illustrate our approach through a partially observed chemical model system, where initializing the internal LSTM states in this fashion yields visibly improved performance. Finally, we show that learning this data manifold enables the transformation of partially observed dynamics into fully observed ones, facilitating alternative identification paths for nonlinear dynamical systems. △ Less

Submitted 12 May, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

arXiv:2012.00056 [pdf, other]

Diversifying Relevant Phrases

Authors: Shreya Malani, Dinesh Gaurav, Anoop Vallabhajosyula, Rahul Agrawal

Abstract: Diverse keyword suggestions for a given landing page or matching queries to diverse documents is an active research area in online advertising. Modern search engines provide advertisers with products like Dynamic Search Ads and Smart Campaigns where they extract meaningful keywords/phrases from the advertiser's product inventory. These keywords/phrases are representative of a diverse spectrum of a… ▽ More Diverse keyword suggestions for a given landing page or matching queries to diverse documents is an active research area in online advertising. Modern search engines provide advertisers with products like Dynamic Search Ads and Smart Campaigns where they extract meaningful keywords/phrases from the advertiser's product inventory. These keywords/phrases are representative of a diverse spectrum of advertiser's interests. In this paper, we address the problem of obtaining relevant yet diverse keywords/phrases for any given document. We formulate this as an optimization problem, maximizing the parameterized trade-off between diversity and relevance constrained over number of possible keywords/phrases. We show that this is a combinatorial NP-hard optimization problem. We propose two approaches based on convex relaxations varying in complexity and performance. In the first approach, we show that the optimization problem reduces to an eigen value problem. In the second approach, we show that the optimization problem reduces to minimizing a quadratic form over an l1-ball. Subsequently, we show that this is equivalent to a semi-definite optimization problem. To prove the efficacy of our proposed formulation, we evaluate it on various real-world datasets and compare it to the state-of-the-art heuristic approaches. △ Less

Submitted 30 November, 2020; originally announced December 2020.

Showing 1–4 of 4 results for author: Malani, S