Skip to main content

Showing 1–41 of 41 results for author: Mehta, H

.
  1. arXiv:2406.10916  [pdf, other

    cs.RO cs.DC

    M-SET: Multi-Drone Swarm Intelligence Experimentation with Collision Avoidance Realism

    Authors: Chuhao Qin, Alexander Robins, Callum Lillywhite-Roake, Adam Pearce, Hritik Mehta, Scott James, Tsz Ho Wong, Evangelos Pournaras

    Abstract: Distributed sensing by cooperative drone swarms is crucial for several Smart City applications, such as traffic monitoring and disaster response. Using an indoor lab with inexpensive drones, a testbed supports complex and ambitious studies on these systems while maintaining low cost, rigor, and external validity. This paper introduces the Multi-drone Sensing Experimentation Testbed (M-SET), a nove… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 7 pages, 7 figures. This work has been submitted to the IEEE conferenece

  2. arXiv:2405.15682  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    The Road Less Scheduled

    Authors: Aaron Defazio, Xingyu, Yang, Harsh Mehta, Konstantin Mishchenko, Ahmed Khaled, Ashok Cutkosky

    Abstract: Existing learning rate schedules that do not require specification of the optimization stop** step T are greatly out-performed by learning rate schedules that depend on T. We propose an approach that avoids the need for this stop** time by eschewing the use of schedules entirely, while exhibiting state-of-the-art performance compared to schedules across a wide family of problems ranging from c… ▽ More

    Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2405.08879  [pdf, other

    astro-ph.CO hep-ph hep-th

    A Diffused Background from Axion-like Particles in the Microwave Sky

    Authors: Harsh Mehta, Suvodip Mukherjee

    Abstract: The nature of dark matter is an unsolved cosmological problem and axions are one of the weakly interacting cold dark matter candidates. Axions or ALPs (Axion-like particles) are pseudo-scalar bosons predicted by beyond-standard model theories. The weak coupling of ALPs with photons leads to the conversion of CMB photons to ALPs in the presence of a transverse magnetic field. If they have the same… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 33 pages, 20 figures, To be submitted to JCAP

  4. arXiv:2405.08878  [pdf, other

    astro-ph.CO hep-ph hep-th

    A power spectrum approach to search for Axion-like Particles from resolved galaxy clusters using CMB as a backlight

    Authors: Harsh Mehta, Suvodip Mukherjee

    Abstract: Axions or ALPs are hypothetical particles predicted by BSM theories, which make one of the dark matter candidates. These particles can convert into photons and vice-versa in the presence of magnetic field, with a probability decided by its coupling strength $\mathrm{g_{aγ}}$. One of the ways to detect these particles is using the CMB as a backlight. As the CMB photons pass through a galaxy cluster… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 31 pages, 17 figures, To be submitted to JCAP

  5. arXiv:2404.07523  [pdf, other

    cs.AI cs.LG

    GNN-based Probabilistic Supply and Inventory Predictions in Supply Chain Networks

    Authors: Hyung-il Ahn, Young Chol Song, Santiago Olivar, Hershel Mehta, Naveen Tewari

    Abstract: Successful supply chain optimization must mitigate imbalances between supply and demand over time. While accurate demand prediction is essential for supply planning, it alone does not suffice. The key to successful supply planning for optimal and viable execution lies in maximizing predictability for both demand and supply throughout an execution horizon. Therefore, enhancing the accuracy of suppl… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  6. arXiv:2404.07511  [pdf

    cs.AI cs.LG

    Generative Probabilistic Planning for Optimizing Supply Chain Networks

    Authors: Hyung-il Ahn, Santiago Olivar, Hershel Mehta, Young Chol Song

    Abstract: Supply chain networks in enterprises are typically composed of complex topological graphs involving various types of nodes and edges, accommodating numerous products with considerable demand and supply variability. However, as supply chain networks expand in size and complexity, traditional supply chain planning methods (e.g., those found in heuristic rule-based and operations research-based syste… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  7. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  8. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2310.07831  [pdf, other

    cs.LG cs.AI stat.ML

    When, Why and How Much? Adaptive Learning Rate Scheduling by Refinement

    Authors: Aaron Defazio, Ashok Cutkosky, Harsh Mehta, Konstantin Mishchenko

    Abstract: Learning rate schedules used in practice bear little resemblance to those recommended by theory. We close much of this theory/practice gap, and as a consequence are able to derive new problem-adaptive learning rate schedules. Our key technical contribution is a refined analysis of learning rate schedules for a wide class of optimization algorithms (including SGD). In contrast to most prior works t… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  10. arXiv:2310.01258  [pdf, other

    eess.IV cs.CV cs.LG

    MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device

    Authors: Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautiere, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke Wiggers

    Abstract: Neural video codecs have recently become competitive with standard codecs such as HEVC in the low-delay setting. However, most neural codecs are large floating-point networks that use pixel-dense war** operations for temporal modeling, making them too computationally expensive for deployment on mobile devices. Recent work has demonstrated that running a neural decoder in real time on mobile is f… ▽ More

    Submitted 15 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Matches version published at WACV 2024

  11. arXiv:2306.00144  [pdf, other

    cs.LG

    Mechanic: A Learning Rate Tuner

    Authors: Ashok Cutkosky, Aaron Defazio, Harsh Mehta

    Abstract: We introduce a technique for tuning the learning rate scale factor of any base optimization algorithm and schedule automatically, which we call \textsc{mechanic}. Our method provides a practical realization of recent theoretical reductions for accomplishing a similar goal in online convex optimization. We rigorously evaluate \textsc{mechanic} on a range of large scale deep learning tasks with vary… ▽ More

    Submitted 1 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

  12. arXiv:2302.03775  [pdf, ps, other

    cs.LG math.OC stat.ML

    Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion

    Authors: Ashok Cutkosky, Harsh Mehta, Francesco Orabona

    Abstract: We present new algorithms for optimizing non-smooth, non-convex stochastic objectives based on a novel analysis technique. This improves the current best-known complexity for finding a $(δ,ε)$-stationary point from $O(ε^{-4}δ^{-1})$ stochastic gradient queries to $O(ε^{-3}δ^{-1})$, which we also show to be optimal. Our primary technique is a reduction from non-smooth non-convex optimization to onl… ▽ More

    Submitted 11 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  13. arXiv:2212.00768  [pdf, other

    cs.LG cs.CL

    Simplifying and Understanding State Space Models with Diagonal Linear RNNs

    Authors: Ankit Gupta, Harsh Mehta, Jonathan Berant

    Abstract: Sequence models based on linear state spaces (SSMs) have recently emerged as a promising choice of architecture for modeling long range dependencies across various modalities. However, they invariably rely on discretization of a continuous state space, which complicates their presentation and understanding. In this work, we dispose of the discretization step, and propose a model based on vanilla D… ▽ More

    Submitted 14 November, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: added Long Range Arena, language modeling with mixture of experts

  14. arXiv:2211.13403  [pdf, other

    cs.LG cs.CR cs.CV

    Differentially Private Image Classification from Features

    Authors: Harsh Mehta, Walid Krichene, Abhradeep Thakurta, Alexey Kurakin, Ashok Cutkosky

    Abstract: Leveraging transfer learning has recently been shown to be an effective strategy for training large models with Differential Privacy (DP). Moreover, somewhat surprisingly, recent works have found that privately training just the last layer of a pre-trained model provides the best utility with DP. While past studies largely rely on algorithms like DP-SGD for training large models, in the specific c… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  15. arXiv:2211.11052  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Convexifying Transformers: Improving optimization and understanding of transformer networks

    Authors: Tolga Ergen, Behnam Neyshabur, Harsh Mehta

    Abstract: Understanding the fundamental mechanism behind the success of transformer networks is still an open problem in the deep learning literature. Although their remarkable performance has been mostly attributed to the self-attention mechanism, the literature still lacks a solid analysis of these networks and interpretation of the functions learned by them. To this end, we study the training problem of… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

  16. arXiv:2211.06389  [pdf

    stat.AP

    What does it mean to be "representative"?

    Authors: Jacqueline E. Rudolph, Yongqi Zhong, Priya Duggal, Shruti H. Mehta, Bryan Lau

    Abstract: Medical and population health science researchers frequently make ambiguous statements about whether they believe their study sample or results are "representative" of some (implicit or explicit) target population. Here, we provide a comprehensive definition of representativeness, with the goal of capturing the different ways in which a study can be representative of a target population. We propos… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 15 pages, 0 figures

  17. arXiv:2206.13947  [pdf, other

    cs.LG cs.CL

    Long Range Language Modeling via Gated State Spaces

    Authors: Harsh Mehta, Ankit Gupta, Ashok Cutkosky, Behnam Neyshabur

    Abstract: State space models have shown to be effective at modeling long range dependencies, specially on sequence classification tasks. In this work we focus on autoregressive sequence modeling over English books, Github source code and ArXiv mathematics articles. Based on recent developments around the effectiveness of gated activation functions, we propose a new layer named Gated State Space (GSS) and sh… ▽ More

    Submitted 2 July, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

  18. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  19. arXiv:2205.02973  [pdf, other

    cs.LG cs.CR cs.CV

    Large Scale Transfer Learning for Differentially Private Image Classification

    Authors: Harsh Mehta, Abhradeep Thakurta, Alexey Kurakin, Ashok Cutkosky

    Abstract: Differential Privacy (DP) provides a formal framework for training machine learning models with individual example level privacy. In the field of deep learning, Differentially Private Stochastic Gradient Descent (DP-SGD) has emerged as a popular private training algorithm. Unfortunately, the computational cost of training large-scale models with DP-SGD is substantially higher than non-private trai… ▽ More

    Submitted 20 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

  20. arXiv:2204.07827  [pdf, other

    cs.DS cs.DM

    Local treewidth of random and noisy graphs with applications to stop** contagion in networks

    Authors: Hermish Mehta, Daniel Reichman

    Abstract: We study the notion of local treewidth in sparse random graphs: the maximum treewidth over all $k$-vertex subgraphs of an $n$-vertex graph. When $k$ is not too large, we give nearly tight bounds for this local treewidth parameter; we also derive tight bounds for the local treewidth of noisy trees, trees where every non-edge is added independently with small probability. We apply our upper bounds o… ▽ More

    Submitted 15 July, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: Accepted to RANDOM 2022

  21. AI for Next Generation Computing: Emerging Trends and Future Directions

    Authors: Sukhpal Singh Gill, Minxian Xu, Carlo Ottaviani, Panos Patros, Rami Bahsoon, Arash Shaghaghi, Muhammed Golec, Vlado Stankovski, Huaming Wu, Ajith Abraham, Manmeet Singh, Harshit Mehta, Soumya K. Ghosh, Thar Baker, Ajith Kumar Parlikad, Hanan Lutfiyya, Salil S. Kanhere, Rizos Sakellariou, Schahram Dustdar, Omer Rana, Ivona Brandic, Steve Uhlig

    Abstract: Autonomic computing investigates how systems can achieve (user) specified control outcomes on their own, without the intervention of a human operator. Autonomic computing fundamentals have been substantially influenced by those of control theory for closed and open-loop systems. In practice, complex systems may exhibit a number of concurrent and inter-dependent control loops. Despite research into… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: Accepted for Publication in Elsevier IoT Journal, 2022

  22. arXiv:2202.06991  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Transformer Memory as a Differentiable Search Index

    Authors: Yi Tay, Vinh Q. Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Gupta, Tal Schuster, William W. Cohen, Donald Metzler

    Abstract: In this paper, we demonstrate that information retrieval can be accomplished with a single Transformer, in which all information about the corpus is encoded in the parameters of the model. To this end, we introduce the Differentiable Search Index (DSI), a new paradigm that learns a text-to-text model that maps string queries directly to relevant docids; in other words, a DSI model answers queries… ▽ More

    Submitted 21 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022

  23. arXiv:2112.02194  [pdf, other

    cs.LG cs.DC

    ALX: Large Scale Matrix Factorization on TPUs

    Authors: Harsh Mehta, Steffen Rendle, Walid Krichene, Li Zhang

    Abstract: We present ALX, an open-source library for distributed matrix factorization using Alternating Least Squares, written in JAX. Our design allows for efficient use of the TPU architecture and scales well to matrix factorization problems of O(B) rows/columns by scaling the number of available TPU cores. In order to spur future research on large scale matrix factorization methods and to illustrate the… ▽ More

    Submitted 29 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

  24. arXiv:2110.05603  [pdf, other

    cs.CL cs.RO

    Generalizing to New Domains by Map** Natural Language to Lifted LTL

    Authors: Eric Hsiung, Hiloni Mehta, Junchi Chu, Xinyu Liu, Roma Patel, Stefanie Tellex, George Konidaris

    Abstract: Recent work on using natural language to specify commands to robots has grounded that language to LTL. However, map** natural language task specifications to LTL task specifications using language models require probability distributions over finite vocabulary. Existing state-of-the-art methods have extended this finite vocabulary to include unseen terms from the input sequence to improve output… ▽ More

    Submitted 9 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 7 pages (6 + 1 references page), 3 figures, 2 tables. Accepted to ICRA 2022. To appear in Proceedings of the 2022 International Conference on Robotics and Automation, May 2022

  25. arXiv:2106.14343  [pdf, other

    cs.LG math.OC stat.ML

    High-probability Bounds for Non-Convex Stochastic Optimization with Heavy Tails

    Authors: Ashok Cutkosky, Harsh Mehta

    Abstract: We consider non-convex stochastic optimization using first-order algorithms for which the gradient estimates may have heavy tails. We show that a combination of gradient clip**, momentum, and normalized gradient descent yields convergence to critical points in high-probability with best-known rates for smooth losses when the gradients only have bounded $\mathfrak{p}$th moments for some… ▽ More

    Submitted 9 November, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

  26. arXiv:2008.13363  [pdf, other

    cs.LG cs.CV stat.ML

    Extreme Memorization via Scale of Initialization

    Authors: Harsh Mehta, Ashok Cutkosky, Behnam Neyshabur

    Abstract: We construct an experimental setup in which changing the scale of initialization strongly impacts the implicit regularization induced by SGD, interpolating from good generalization performance to completely memorizing the training set while making little progress on the test set. Moreover, we find that the extent and manner in which generalization ability is affected depends on the activation and… ▽ More

    Submitted 1 May, 2021; v1 submitted 31 August, 2020; originally announced August 2020.

  27. arXiv:2006.00342  [pdf, other

    cs.DC

    WattsApp: Power-Aware Container Scheduling

    Authors: Hemant Mehta, Paul Harvey, Omer Rana, Rajkumar Buyya, Blesson Varghese

    Abstract: Containers are becoming a popular workload deployment mechanism in modern distributed systems. However, there are limited software-based methods (hardware-based methods are expensive requiring hardware level changes) for obtaining the power consumed by containers for facilitating power-aware container scheduling, an essential activity for efficient management of distributed systems. This paper pre… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

  28. arXiv:2002.03305  [pdf, other

    cs.LG math.OC stat.ML

    Momentum Improves Normalized SGD

    Authors: Ashok Cutkosky, Harsh Mehta

    Abstract: We provide an improved analysis of normalized SGD showing that adding momentum provably removes the need for large batch sizes on non-convex objectives. Then, we consider the case of objectives with bounded second derivative and show that in this case a small tweak to the momentum formula allows normalized SGD with momentum to find an $ε$-critical point in $O(1/ε^{3.5})$ iterations, matching the b… ▽ More

    Submitted 16 May, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

  29. arXiv:2001.03671  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Retouchdown: Adding Touchdown to StreetLearn as a Shareable Resource for Language Grounding Tasks in Street View

    Authors: Harsh Mehta, Yoav Artzi, Jason Baldridge, Eugene Ie, Piotr Mirowski

    Abstract: The Touchdown dataset (Chen et al., 2019) provides instructions by human annotators for navigation through New York City streets and for resolving spatial descriptions at a given location. To enable the wider research community to work effectively with the Touchdown tasks, we are publicly releasing the 29k raw Street View panoramas needed for Touchdown. We follow the process used for the StreetLea… ▽ More

    Submitted 10 January, 2020; originally announced January 2020.

  30. arXiv:1912.03241  [pdf, other

    cs.LG stat.ML

    VALAN: Vision and Language Agent Navigation

    Authors: Larry Lansing, Vihan Jain, Harsh Mehta, Haoshuo Huang, Eugene Ie

    Abstract: VALAN is a lightweight and scalable software framework for deep reinforcement learning based on the SEED RL architecture. The framework facilitates the development and evaluation of embodied agents for solving grounded language understanding tasks, such as Vision-and-Language Navigation and Vision-and-Dialog Navigation, in photo-realistic environments, such as Matterport3D and Google StreetView. W… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

  31. Transformative effects of IoT, Blockchain and Artificial Intelligence on cloud computing: Evolution, vision, trends and open challenges

    Authors: Sukhpal Singh Gill, Shreshth Tuli, Minxian Xu, Inderpreet Singh, Karan Vijay Singh, Dominic Lindsay, Shikhar Tuli, Daria Smirnova, Manmeet Singh, Udit Jain, Haris Pervaiz, Bhanu Sehgal, Sukhwinder Singh Kaila, Sanjay Misra, Mohammad Sadegh Aslanpour, Harshit Mehta, Vlado Stankovski, Peter Garraghan

    Abstract: Cloud computing plays a critical role in modern society and enables a range of applications from infrastructure to social media. Such system must cope with varying load and evolving usage reflecting societies interaction and dependency on automated computing systems whilst satisfying Quality of Service (QoS) guarantees. Enabling these systems are a cohort of conceptual technologies, synthesized to… ▽ More

    Submitted 21 October, 2019; originally announced November 2019.

    Comments: 30 Pages, 4 Figures and Preprint version - Published in Elsevier's Internet of Things Journal

  32. arXiv:1911.00121  [pdf, ps, other

    math.NT

    Counting extensions of number fields with Frobenius Galois group

    Authors: Harsh Mehta

    Abstract: Let $G$ be a Frobenius group with an abelian Frobenius kernel $F$ and let $k$ be a finite extension of $\mathbb{Q}$. We obtain an upper bound for the number of degree $|F|$ algebraic extensions $K/k$ with Galois group $G$ with the norm of the discriminant $\mathcal{N}_{k/\mathbb{Q}}(d_{K/k})$ bounded above by $X$. We extend this method for any group $G$ that has an abelian normal subgroup. If $G$… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

    Comments: This is a preliminary version

  33. arXiv:1908.03409  [pdf, other

    cs.CV cs.CL cs.LG cs.RO

    Transferable Representation Learning in Vision-and-Language Navigation

    Authors: Haoshuo Huang, Vihan Jain, Harsh Mehta, Alexander Ku, Gabriel Magalhaes, Jason Baldridge, Eugene Ie

    Abstract: Vision-and-Language Navigation (VLN) tasks such as Room-to-Room (R2R) require machine agents to interpret natural language instructions and learn to act in visually realistic environments to achieve navigation goals. The overall task requires competence in several perception problems: successful agents combine spatio-temporal, vision and language understanding to produce appropriate action sequenc… ▽ More

    Submitted 12 August, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

    Comments: To appear in ICCV 2019

  34. arXiv:1905.13358  [pdf, other

    cs.CL cs.CV

    Multi-modal Discriminative Model for Vision-and-Language Navigation

    Authors: Haoshuo Huang, Vihan Jain, Harsh Mehta, Jason Baldridge, Eugene Ie

    Abstract: Vision-and-Language Navigation (VLN) is a natural language grounding task where agents have to interpret natural language instructions in the context of visual scenes in a dynamic environment to achieve prescribed navigation goals. Successful agents must have the ability to parse natural language of varying linguistic styles, ground them in potentially unfamiliar scenes, plan and react with ambigu… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: Accepted at SpLU-RoboNLP 2019 (workshop at NAACL)

  35. arXiv:1808.03633  [pdf, ps, other

    cs.DS

    A New Algorithm for the Robust Semi-random Independent Set Problem

    Authors: Theo McKenzie, Hermish Mehta, Luca Trevisan

    Abstract: In this paper, we study a general semi-random version of the planted independent set problem in a model initially proposed by Feige and Kilian, which has a large proportion of adversarial edges. We give a new deterministic algorithm that finds a list of independent sets, one of which, with high probability, is the planted one, provided that the planted set has size $k=Ω(n^{2/3})$. This improves… ▽ More

    Submitted 30 October, 2019; v1 submitted 10 August, 2018; originally announced August 2018.

  36. arXiv:1712.06957  [pdf, other

    physics.med-ph cs.AI

    MURA: Large Dataset for Abnormality Detection in Musculoskeletal Radiographs

    Authors: Pranav Rajpurkar, Jeremy Irvin, Aarti Bagul, Daisy Ding, Tony Duan, Hershel Mehta, Brandon Yang, Kaylie Zhu, Dillon Laird, Robyn L. Ball, Curtis Langlotz, Katie Shpanskaya, Matthew P. Lungren, Andrew Y. Ng

    Abstract: We introduce MURA, a large dataset of musculoskeletal radiographs containing 40,561 images from 14,863 studies, where each study is manually labeled by radiologists as either normal or abnormal. To evaluate models robustly and to get an estimate of radiologist performance, we collect additional labels from six board-certified Stanford radiologists on the test set, consisting of 207 musculoskeletal… ▽ More

    Submitted 22 May, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

    Comments: 1st Conference on Medical Imaging with Deep Learning (MIDL 2018)

  37. arXiv:1711.05225  [pdf, other

    cs.CV cs.LG stat.ML

    CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning

    Authors: Pranav Rajpurkar, Jeremy Irvin, Kaylie Zhu, Brandon Yang, Hershel Mehta, Tony Duan, Daisy Ding, Aarti Bagul, Curtis Langlotz, Katie Shpanskaya, Matthew P. Lungren, Andrew Y. Ng

    Abstract: We develop an algorithm that can detect pneumonia from chest X-rays at a level exceeding practicing radiologists. Our algorithm, CheXNet, is a 121-layer convolutional neural network trained on ChestX-ray14, currently the largest publicly available chest X-ray dataset, containing over 100,000 frontal-view X-ray images with 14 diseases. Four practicing academic radiologists annotate a test set, on w… ▽ More

    Submitted 25 December, 2017; v1 submitted 14 November, 2017; originally announced November 2017.

  38. Products of Farey Fractions

    Authors: Jeffery Lagarias, Harsh Mehta

    Abstract: The {Farey fractions} $F_n$ of order $n$ consist of all fractions $\frac{h}{k}$ in lowest terms lying in the closed unit interval and having denominator at most $n$. This paper considers the products $F_n$ of all nonzero Farey fractions of order $n$. It studies their growth measured by $\log(F_n)$ and their divisibility properties by powers of a fixed prime, given by $ord_p(F_n)$, as a function of… ▽ More

    Submitted 9 May, 2017; v1 submitted 28 February, 2015; originally announced March 2015.

    Comments: 32 pages, 10 figures

    Journal ref: Experimental Mathematics 26, No. 1, 1--21 (2017)

  39. Products of binomial coefficients and unreduced Farey fractions

    Authors: Jeffrey C. Lagarias, Harsh Mehta

    Abstract: This paper studies the product $\bar{G}_n$ of the binomial coefficients in the n-th row of Pascal's triangle, which equals the reciprocal of the product of all the reduced and unreduced Farey fractions of order n. It studies its size as a real number, measured by its logarithm $log(\bar{G}_n)$, and its prime factorization, measured by the order of divisibility by a fixed prime p, each viewed as a… ▽ More

    Submitted 15 September, 2015; v1 submitted 14 September, 2014; originally announced September 2014.

    Comments: 30 pages, 3 figures, two Appendices. ; v2 is 31 pages, Appendices moved before reference list; v3 is 31 pages,corrections to match journal version

    MSC Class: Primary 11B65; Secondary: 05A10; 11B57; 11N05; 11N64

    Journal ref: International J. of Number Theory 12 (2016), no.1, 57--91

  40. arXiv:1311.1407  [pdf, other

    math.CA

    The L1 norm of the generalized de la Vallee Poussin kernel

    Authors: Harsh Mehta

    Abstract: Charles de la Vall'ee Poussin defined two different kernels that bear his name. This paper considers the one are a linear combinations of two Fej'er kernels, which are known as the delayed means. We show that the $L^1$ norms are constant in families of delayed means, and determine the exact value

    Submitted 5 November, 2013; originally announced November 2013.

    Comments: 12 pages, 4 figures

    MSC Class: 42A16 ACM Class: F.2.1

  41. arXiv:1201.4210  [pdf

    cs.IR cs.AI

    Collaborative Personalized Web Recommender System using Entropy based Similarity Measure

    Authors: Harita Mehta, Shveta Kundra Bhatia, Punam Bedi, V. S. Dixit

    Abstract: On the internet, web surfers, in the search of information, always strive for recommendations. The solutions for generating recommendations become more difficult because of exponential increase in information domain day by day. In this paper, we have calculated entropy based similarity between users to achieve solution for scalability problem. Using this concept, we have implemented an online user… ▽ More

    Submitted 20 January, 2012; originally announced January 2012.

    Comments: 10 pages

    Journal ref: IJCSI, Vol 8, Issue 6, No 3, Nov 2011