Showing 1–17 of 17 results for author: Madan, V

  1. arXiv:2406.04391  [pdf, other]

    cs.LG cs.AI cs.CL

    Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

    Authors: Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo

    Abstract: Predictable behavior from scaling advanced AI systems is an extremely desirable property. Although a well-established literature exists on how pretraining performance scales, the literature on how particular downstream capabilities scale is significantly muddier. In this work, we take a step back and ask: why has predicting specific downstream capabilities with scale remained elusive? While many f…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2205.11603  [pdf, other]

    cs.CL

    Representation Projection Invariance Mitigates Representation Collapse

    Authors: Anastasia Razdaibiedina, Ashish Khetan, Zohar Karnin, Daniel Khashabi, Vishaal Kapoor, Vivek Madan

    Abstract: Fine-tuning contextualized representations learned by pre-trained language models remains a prevalent practice in NLP. However, fine-tuning can lead to representation degradation (also known as representation collapse), which may result in instability, sub-optimal performance, and weak generalization. In this paper, we propose Representation Projection Invariance (REPINA), a novel regularization…

    Submitted 21 November, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: 41 pages, 6 figures

  3. arXiv:2107.11094  [pdf, other]

    cs.CL

    Improving Early Sepsis Prediction with Multi Modal Learning

    Authors: Fred Qin, Vivek Madan, Ujjwal Ratan, Zohar Karnin, Vishaal Kapoor, Parminder Bhatia, Taha Kass-Hout

    Abstract: Sepsis is a life-threatening disease with high morbidity, mortality and healthcare costs. The early prediction and administration of antibiotics and intravenous fluids is considered crucial for the treatment of sepsis and can save potentially millions of lives and billions in health care costs. Professional clinical care practitioners have proposed clinical criteria which aid in early detection o…

    Submitted 23 July, 2021; originally announced July 2021.

  4. arXiv:2101.08587  [pdf, other]

    cs.LG cs.AI

    Stress Testing of Meta-learning Approaches for Few-shot Learning

    Authors: Aroof Aimen, Sahil Sidheekh, Vineet Madan, Narayanan C. Krishnan

    Abstract: Meta-learning (ML) has emerged as a promising learning method under resource constraints such as few-shot learning. ML approaches typically propose a methodology to learn generalizable models. In this work-in-progress paper, we put the recent ML approaches to a stress test to discover their limitations. Precisely, we measure the performance of ML approaches for few-shot learning against increasing…

    Submitted 21 January, 2021; originally announced January 2021.

  5. arXiv:2012.06723  [pdf, other]

    cs.LG cs.AI

    On Duality Gap as a Measure for Monitoring GAN Training

    Authors: Sahil Sidheekh, Aroof Aimen, Vineet Madan, Narayanan C. Krishnan

    Abstract: Generative adversarial network (GAN) is among the most popular deep learning models for learning complex data distributions. However, training a GAN is known to be a challenging task. This is often attributed to the lack of correlation between the training progress and the trajectory of the generator and discriminator losses and the need for the GAN's subjective evaluation. A recently proposed mea…

    Submitted 11 December, 2020; originally announced December 2020.

  6. arXiv:2006.11742  [pdf, ps, other]

    math.CV

    Estimates for initial coefficients of certain bi-univalent functions

    Authors: Vibha Madaan, Ajay Kumar, V. Ravichandran

    Abstract: Estimates are obtained for the initial coefficients of a normalized analytic function $f$ in the unit disk $\mathbb{D}$ such that $f$ and the analytic extension of $f^{-1}$ to $\mathbb{D}$ belong to certain subclasses of univalent functions. The bounds obtained improve some existing known bounds.

    Submitted 21 June, 2020; originally announced June 2020.

    MSC Class: 30C45; 30C80

  7. arXiv:2004.07886  [pdf, ps, other]

    cs.DS cs.DM math.CO math.OC stat.ML

    Maximizing Determinants under Matroid Constraints

    Authors: Vivek Madan, Aleksandar Nikolov, Mohit Singh, Uthaipon Tantipongpipat

    Abstract: Given vectors $v_1,\dots,v_n\in\mathbb{R}^d$ and a matroid $M=([n],I)$, we study the problem of finding a basis $S$ of $M$ such that $\det(\sum_{i \in S}v_i v_i^\top)$ is maximized. This problem appears in a diverse set of areas such as experimental design, fair allocation of goods, network design, and machine learning. The current best results include an $e^{2k}$-estimation for any matroid of ran…

    Submitted 16 April, 2020; originally announced April 2020.

  8. arXiv:1910.07686  [pdf, ps, other]

    math.CO

    Critical group structure from the parameters of a strongly regular graph

    Authors: Joshua E. Ducey, David L. Duncan, Wesley J. Engelbrecht, Jawahar V. Madan, Eric Piato, Christina S. Shatford, Angela Vichitbandha

    Abstract: We give simple arithmetic conditions that force the Sylow $p$-subgroup of the critical group of a strongly regular graph to take a specific form. These conditions depend only on the parameters $(v, k, λ, μ)$ of the strongly regular graph under consideration. We give many examples, including how the theory can be used to compute the critical group of Conway's $99$-graph and to give an elementary ar…

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: 20 pages

    MSC Class: 05C50

  9. arXiv:1906.05547  [pdf, ps, other]

    math.CV

    Radii of Starlikeness and Convexity of Bessel Functions

    Authors: Vibha Madaan, Ajay Kumar, V. Ravichandran

    Abstract: The radii of starlikeness and convexity associated with lemniscate of Bernoulli and the Janowski function, $(1+Az)/(1+Bz)$ for $-1\leq B<A\leq 1$, have been determined for normalizations of $q$-Bessel function, Bessel function of first kind of order $ν$, Lommel function of first kind and Legendre polynomial of odd degree.

    Submitted 13 June, 2019; originally announced June 2019.

    MSC Class: 30C10; 30C15; 30C45

  10. arXiv:1902.04277  [pdf, ps, other]

    math.CV

    Lemniscate Convexity and Other Properties of Generalized Bessel Functions

    Authors: Vibha Madaan, Ajay Kumar, V. Ravichandran

    Abstract: Sufficient conditions on associated parameters $p,b$ and $c$ are obtained so that the generalized and "normalized" Bessel function $u_p(z)=u_{p,b,c}(z)$ satisfies $|(1+(zu''_p(z)/u'_p(z)))^2-1|<1$ or $|((zu_p(z))'/u_p(z))^2-1|<1$. We also determine the condition on these parameters so that $-(4(p+(b+1)/2)/c)u'_p(z)\prec\sqrt{1+z}$. Relations between the paramet…

    Submitted 12 February, 2019; originally announced February 2019.

    MSC Class: 30C10; 30C45

  11. arXiv:1807.09735  [pdf, other]

    cs.DS math.OC

    Improving the Integrality Gap for Multiway Cut

    Authors: Kristóf Bérczi, Karthekeyan Chandrasekaran, Tamás Király, Vivek Madan

    Abstract: In the multiway cut problem, we are given an undirected graph with non-negative edge weights and a collection of $k$ terminal nodes, and the goal is to partition the node set of the graph into $k$ non-empty parts each containing exactly one terminal so that the total weight of the edges crossing the partition is minimized. The multiway cut problem for $k\ge 3$ is APX-hard. For arbitrary $k$, the b…

    Submitted 21 November, 2018; v1 submitted 25 July, 2018; originally announced July 2018.

    Comments: 28 pages

  12. arXiv:1806.05136  [pdf, ps, other]

    math.CV

    Starlikeness associated with lemniscate of Bernoulli

    Authors: Vibha Madaan, Ajay Kumar, V. Ravichandran

    Abstract: For an analytic function $f$ on the unit disk $\mathbb{D}=\{z:|z|<1\}$ satisfying $f(0)=0=f'(0)-1,$ we obtain sufficient conditions so that $f$ satisfies $|(zf'(z)/f(z))^2-1|<1.$ The technique of differential subordination of first or second order is used. The admissibility conditions for lemniscate of Bernoulli are derived and employed in order to prove the main results.

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: 20 pages

  13. arXiv:1805.00181  [pdf, ps, other]

    cs.DS

    Spectrally Robust Graph Isomorphism

    Authors: Alexandra Kolla, Ioannis Koutis, Vivek Madan, Ali Kemal Sinop

    Abstract: We initiate the study of spectral generalizations of the graph isomorphism problem. (a) The Spectral Graph Dominance (SGD) problem: On input of two graphs $G$ and $H$ does there exist a permutation $π$ such that $G\preceq π(H)$? (b) The Spectrally Robust Graph Isomorphism (SRGI) problem: On input of two graphs $G$ and $H$, find the smallest number $κ$ over all permutations $π$ such that…

    Submitted 1 May, 2018; originally announced May 2018.

    Comments: Extended version of a paper appearing in the proceedings of ICALP 2018

  14. arXiv:1607.07200  [pdf, other]

    cs.DM cs.CC cs.DS

    Approximating Multicut and the Demand Graph

    Authors: Chandra Chekuri, Vivek Madan

    Abstract: In the minimum Multicut problem, the input is an edge-weighted supply graph $G=(V,E)$ and a simple demand graph $H=(V,F)$. Either $G$ and $H$ are directed (DMulC) or both are undirected (UMulC). The goal is to remove a minimum weight set of edges in $G$ such that there is no path from $s$ to $t$ in the remaining graph for any $(s,t) \in F$. UMulC admits an $O(\log k)$-approximation where $k$ is th…

    Submitted 25 July, 2016; originally announced July 2016.

  15. arXiv:1507.04674  [pdf, other]

    cs.DS

    Simple and Fast Rounding Algorithms for Directed and Node-weighted Multiway Cut

    Authors: Chandra Chekuri, Vivek Madan

    Abstract: In Directed Multiway Cut (Dir-MC) the input is an edge-weighted directed graph $G=(V,E)$ and a set of $k$ terminal nodes $\{s_1,s_2,\ldots,s_k\} \subseteq V$; the goal is to find a min-weight subset of edges whose removal ensures that there is no path from $s_i$ to $s_j$ for any $i \neq j$. In Node-weighted Multiway Cut (Node-MC) the input is a node-weighted undirected graph $G$ and a set of $k$ ter…

    Submitted 16 July, 2015; originally announced July 2015.

  16. arXiv:1311.3268  [pdf, ps, other]

    cs.DM math.CO

    On the Expansion of Group-Based Lifts

    Authors: Naman Agarwal, Karthekeyan Chandrasekaran, Alexandra Kolla, Vivek Madan

    Abstract: A $k$-lift of an $n$-vertex base graph $G$ is a graph $H$ on $n\times k$ vertices, where each vertex $v$ of $G$ is replaced by $k$ vertices $v_1,\cdots{},v_k$ and each edge $(u,v)$ in $G$ is replaced by a matching representing a bijection $π_{uv}$ so that the edges of $H$ are of the form $(u_i,v_{π_{uv}(i)})$. Lifts have been studied as a means to efficiently construct expanders. In this work, we…

    Submitted 17 December, 2016; v1 submitted 13 November, 2013; originally announced November 2013.

  17. arXiv:1205.1358  [pdf, ps, other]

    cs.LO math.LO

    Preservation under Substructures modulo Bounded Cores

    Authors: Abhisekh Sankaran, Bharat Adsul, Vivek Madan, Pritish Kamath, Supratik Chakraborty

    Abstract: We investigate a model-theoretic property that generalizes the classical notion of "preservation under substructures". We call this property \emph{preservation under substructures modulo bounded cores}, and present a syntactic characterization via $Σ_2^0$ sentences for properties of arbitrary structures definable by FO sentences. As a sharper characterization, we further show that the count of exi…

    Submitted 12 July, 2012; v1 submitted 7 May, 2012; originally announced May 2012.

    Comments: From v2 to v3: Corrected typos, edited sentences for better readability; Conjecture 1 of v2 is now resolved so it is now Theorem 4, its proof is included in a new section (Section 7), Thm i in v2 is now Thm i+1 for i >= 4; everything else remains the same. From v1 to v2: Thm i is now Thm i-1 for i >= 7, Corrected the proof of Theorem 10 (now Theorem 9) for B > 2 (statement is still correct)