Skip to main content

Showing 1–15 of 15 results for author: Parikh, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15551  [pdf, other

    cs.LG

    Thinking Forward: Memory-Efficient Federated Finetuning of Language Models

    Authors: Kunjal Panchal, Nisarg Parikh, Sunav Choudhary, Lijun Zhang, Yuriy Brun, Hui Guan

    Abstract: Finetuning large language models (LLMs) in federated learning (FL) settings has become important as it allows resource-constrained devices to finetune a model using private data. However, finetuning LLMs using backpropagation requires excessive memory (especially from intermediate activations) for resource-constrained devices. While Forward-mode Auto-Differentiation (AD) can reduce memory footprin… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2312.15064  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Joint Self-Supervised and Supervised Contrastive Learning for Multimodal MRI Data: Towards Predicting Abnormal Neurodevelopment

    Authors: Zhiyuan Li, Hailong Li, Anca L. Ralescu, Jonathan R. Dillman, Mekibib Altaye, Kim M. Cecil, Nehal A. Parikh, Lili He

    Abstract: The integration of different imaging modalities, such as structural, diffusion tensor, and functional magnetic resonance imaging, with deep learning models has yielded promising outcomes in discerning phenotypic characteristics and enhancing disease diagnosis. The development of such a technique hinges on the efficient fusion of heterogeneous multimodal features, which initially reside within dist… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 35 pages. Submitted to journal

  3. arXiv:2306.04605  [pdf

    cs.SE cs.AI

    Empowering Business Transformation: The Positive Impact and Ethical Considerations of Generative AI in Software Product Management -- A Systematic Literature Review

    Authors: Nishant A. Parikh

    Abstract: Generative Artificial Intelligence (GAI) has made outstanding strides in recent years, with a good-sized impact on software product management. Drawing on pertinent articles from 2016 to 2023, this systematic literature evaluation reveals generative AI's potential applications, benefits, and constraints in this area. The study shows that technology can assist in idea generation, market research, c… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 24 pages, 4 figures

  4. arXiv:2303.04353  [pdf, other

    cs.MS

    Cascading GEMM: High Precision from Low Precision

    Authors: Devangi N. Parikh, Robert A. van de Geijn, Greg M. Henry

    Abstract: This paper lays out insights and opportunities for implementing higher-precision matrix-matrix multiplication (GEMM) from (in terms of) lower-precision high-performance GEMM. The driving case study approximates double-double precision (FP64x2) GEMM in terms of double precision (FP64) GEMM, leveraging how the BLAS-like Library Instantiation Software (BLIS) framework refactors the Goto Algorithm. Wi… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: 26 pages, 9 figures

    ACM Class: G.4

  5. arXiv:2302.09807  [pdf, other

    eess.IV cs.AI cs.CV cs.LG stat.ML

    A Novel Collaborative Self-Supervised Learning Method for Radiomic Data

    Authors: Zhiyuan Li, Hailong Li, Anca L. Ralescu, Jonathan R. Dillman, Nehal A. Parikh, Lili He

    Abstract: The computer-aided disease diagnosis from radiomic data is important in many medical applications. However, develo** such a technique relies on annotating radiological images, which is a time-consuming, labor-intensive, and expensive process. In this work, we present the first novel collaborative self-supervised learning method to solve the challenge of insufficient labeled radiomic data, whose… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 14 pages, 7 figures

    Journal ref: Neuroimage. 2023;120229

  6. arXiv:2211.15281  [pdf, other

    cs.LG

    Flow: Per-Instance Personalized Federated Learning Through Dynamic Routing

    Authors: Kunjal Panchal, Sunav Choudhary, Nisarg Parikh, Lijun Zhang, Hui Guan

    Abstract: Personalization in Federated Learning (FL) aims to modify a collaboratively trained global model according to each client. Current approaches to personalization in FL are at a coarse granularity, i.e. all the input instances of a client use the same personalized model. This ignores the fact that some instances are more accurately handled by the global model due to better generalizability. To addre… ▽ More

    Submitted 10 February, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 37th Annual Conference on Neural Information Processing Systems (NeurIPS), 2023

  7. A Novel Ontology-guided Attribute Partitioning Ensemble Learning Model for Early Prediction of Cognitive Deficits using Quantitative Structural MRI in Very Preterm Infants

    Authors: Zhiyuan Li, Hailong Li, Adebayo Braimah, Jonathan R. Dillman, Nehal A. Parikh, Lili He

    Abstract: Structural magnetic resonance imaging studies have shown that brain anatomical abnormalities are associated with cognitive deficits in preterm infants. Brain maturation and geometric features can be used with machine learning models for predicting later neurodevelopmental deficits. However, traditional machine learning models would suffer from a large feature-to-instance ratio (i.e., a large numbe… ▽ More

    Submitted 9 August, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Latest Version, published at NeuroImage. PMID: 35850161 DOI: 10.1016/j.neuroimage.2022.119484

    Journal ref: NeuroImage 260 (2022): 119484

  8. arXiv:2106.04379  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Markov State Abstractions for Deep Reinforcement Learning

    Authors: Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

    Abstract: A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state representation, and such representations are not guaranteed to preserve the Markov property. We introduce a novel set of conditions and prove that they are suff… ▽ More

    Submitted 14 March, 2024; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Fixed typo (see Errata). Code available at https://github.com/camall3n/markov-state-abstractions

  9. arXiv:2012.07729  [pdf

    cs.SI cs.LG stat.ML

    "Thought I'd Share First" and Other Conspiracy Theory Tweets from the COVID-19 Infodemic: Exploratory Study

    Authors: Dax Gerts, Courtney D. Shelley, Nidhi Parikh, Travis Pitts, Chrysm Watson Ross, Geoffrey Fairchild, Nidia Yadria Vaquera Chavez, Ashlynn R. Daughton

    Abstract: Background: The COVID-19 outbreak has left many people isolated within their homes; these people are turning to social media for news and social connection, which leaves them vulnerable to believing and sharing misinformation. Health-related misinformation threatens adherence to public health messaging, and monitoring its spread on social media is critical to understanding the evolution of ideas t… ▽ More

    Submitted 15 April, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

    Report number: LA-UR-20-28305

    Journal ref: JMIR Pub Hlth Surv 2021 7(4)

  10. arXiv:2002.01883  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Radial-Basis Value Functions for Continuous Control

    Authors: Kavosh Asadi, Neev Parikh, Ronald E. Parr, George D. Konidaris, Michael L. Littman

    Abstract: A core operation in reinforcement learning (RL) is finding an action that is optimal with respect to a learned value function. This operation is often challenging when the learned value function takes continuous actions as input. We introduce deep radial-basis value functions (RBVFs): value functions learned using a deep network with a radial-basis function (RBF) output layer. We show that the max… ▽ More

    Submitted 13 March, 2021; v1 submitted 5 February, 2020; originally announced February 2020.

    Comments: In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI)

  11. arXiv:1901.06015  [pdf, other

    cs.MS

    Supporting mixed-datatype matrix multiplication within the BLIS framework

    Authors: Field G. Van Zee, Devangi N. Parikh, Robert A. van de Geijn

    Abstract: We approach the problem of implementing mixed-datatype support within the general matrix multiplication (GEMM) operation of the BLIS framework, whereby each matrix operand A, B, and C may be stored as single- or double-precision real or complex values. Another factor of complexity, whereby the computation is allowed to take place in a precision different from the storage precisions of either A or… ▽ More

    Submitted 1 May, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    Report number: FLAME Working Note #89, The University of Texas at Austin, Department of Computer Science, Technical Report TR-19-01

  12. arXiv:1901.01331  [pdf, other

    cs.DC cs.LG

    The ISTI Rapid Response on Exploring Cloud Computing 2018

    Authors: Carleton Coffrin, James Arnold, Stephan Eidenbenz, Derek Aberle, John Ambrosiano, Zachary Baker, Sara Brambilla, Michael Brown, K. Nolan Carter, **han Chu, Patrick Conry, Keeley Costigan, Ariane Eberhardt, David M. Fobes, Adam Gausmann, Sean Harris, Donovan Heimer, Marlin Holmes, Bill Junor, Csaba Kiss, Steve Linger, Rodman Linn, Li-Ta Lo, Jonathan MacCarthy, Omar Marcillo , et al. (23 additional authors not shown)

    Abstract: This report describes eighteen projects that explored how commercial cloud computing services can be utilized for scientific computation at national laboratories. These demonstrations ranged from deploying proprietary software in a cloud environment to leveraging established cloud-based analytics workflows for processing scientific datasets. By and large, the projects were successful and collectiv… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

    Report number: LA-UR-18-31581

  13. arXiv:1808.07832  [pdf, ps, other

    cs.PL cs.LO cs.MS

    A Simple Methodology for Computing Families of Algorithms

    Authors: Devangi N. Parikh, Margaret E. Myers, Richard Vuduc, Robert A. van de Geijn

    Abstract: Discovering "good" algorithms for an operation is often considered an art best left to experts. What if there is a simple methodology, an algorithm, for systematically deriving a family of algorithms as well as their cost analyses, so that the best algorithm can be chosen? We discuss such an approach for deriving loop-based algorithms. The example used to illustrate this methodology, evaluation of… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Report number: FLAME Working Note #87, The University of Texas at Austin, Department of Computer Science, Technical Report TR-18-06

  14. arXiv:1807.08000  [pdf, ps, other

    cs.CL

    Abstractive and Extractive Text Summarization using Document Context Vector and Recurrent Neural Networks

    Authors: Chandra Khatri, Gyanit Singh, Nish Parikh

    Abstract: Sequence to sequence (Seq2Seq) learning has recently been used for abstractive and extractive summarization. In current study, Seq2Seq models have been used for eBay product description summarization. We propose a novel Document-Context based Seq2Seq models using RNNs for abstractive and extractive summarizations. Intuitively, this is similar to humans reading the title, abstract or any other cont… ▽ More

    Submitted 29 July, 2018; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: ACM KDD 2018 Deep Learning Day

  15. arXiv:1710.04286  [pdf, ps, other

    cs.MS

    Deriving Correct High-Performance Algorithms

    Authors: Devangi N. Parikh, Maggie E. Myers, Robert A. van de Geijn

    Abstract: Dijkstra observed that verifying correctness of a program is difficult and conjectured that derivation of a program hand-in-hand with its proof of correctness was the answer. We illustrate this goal-oriented approach by applying it to the domain of dense linear algebra libraries for distributed memory parallel computers. We show that algorithms that underlie the implementation of most functionalit… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

    Report number: FLAME Working Note #86, The University of Texas at Austin, Department of Computer Science, Technical Report TR-17-07