Skip to main content

Showing 1–50 of 65 results for author: Patel, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17261  [pdf, other

    cs.CL

    TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

    Authors: Yiran Luo, Het Patel, Yu Fu, Dawon Ahn, Jia Chen, Yue Dong, Evangelos E. Papalexakis

    Abstract: Large language models (LLMs) have fundamentally transformed artificial intelligence, catalyzing recent advancements while imposing substantial environmental and computational burdens. We introduce TRAWL (Tensor Reduced and Approximated Weights for Large Language Models), a novel methodology for optimizing LLMs through tensor decomposition. TRAWL leverages diverse strategies to exploit matrices wit… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 8 pages, 5 figures. Submitted to EMNLP 2024 and under review

    MSC Class: 68T50 (Primary); 65F55 (Secondary) ACM Class: I.2.7

  2. arXiv:2406.02844  [pdf, other

    cs.IR cs.CL

    Item-Language Model for Conversational Recommendation

    Authors: Li Yang, Anushya Subbiah, Hardik Patel, Judith Yue Li, Yanwei Song, Reza Mirghaderi, Vikram Aggarwal

    Abstract: Large-language Models (LLMs) have been extremely successful at tasks like complex dialogue understanding, reasoning and coding due to their emergent abilities. These emergent abilities have been extended with multi-modality to include image, audio, and video capabilities. Recommender systems, on the other hand, have been critical for information seeking and item discovery needs. Recently, there ha… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 15 pages, 3 figures

  3. arXiv:2405.19338  [pdf, other

    eess.SP cs.AI cs.CV

    Accurate Patient Alignment without Unnecessary Imaging Dose via Synthesizing Patient-specific 3D CT Images from 2D kV Images

    Authors: Yuzhen Ding, Jason M. Holmes, Hongying Feng, Baoxin Li, Lisa A. McGee, Jean-Claude M. Rwigema, Sujay A. Vora, Daniel J. Ma, Robert L. Foote, Samir H. Patel, Wei Liu

    Abstract: In radiotherapy, 2D orthogonally projected kV images are used for patient alignment when 3D-on-board imaging(OBI) unavailable. But tumor visibility is constrained due to the projection of patient's anatomy onto a 2D plane, potentially leading to substantial setup errors. In treatment room with 3D-OBI such as cone beam CT(CBCT), the field of view(FOV) of CBCT is limited with unnecessarily high imag… ▽ More

    Submitted 1 April, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures and tables

  4. arXiv:2405.16021  [pdf, other

    cs.RO

    VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

    Authors: Michael Ahn, Montserrat Gonzalez Arenas, Matthew Bennice, Noah Brown, Christine Chan, Byron David, Anthony Francis, Gavin Gonzalez, Rainer Hessmer, Tomas Jackson, Nikhil J Joshi, Daniel Lam, Tsang-Wei Edward Lee, Alex Luong, Sharath Maddineni, Harsh Patel, Jodilyn Peralta, Jornell Quiambao, Diego Reyes, Rosario M Jauregui Ruano, Dorsa Sadigh, Pannag Sanketi, Leila Takayama, Pavel Vodenski, Fei Xia

    Abstract: Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon ta… ▽ More

    Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  5. arXiv:2405.14341  [pdf, other

    cs.HC

    How do Observable Users Decompose D3 Code? An Exploratory Study

    Authors: Melissa Lin, Heer Patel, Medina Lamkin, Tukey Tu, Hannah Bako, Soham Raut, Leilani Battle

    Abstract: Users often struggle to program visualizations using complex toolkits like D3. Before we can design effective code assistants to support them, we must first understand how D3 users reason about their code. In this work, we explore users' understanding of D3 using an important gauge of code comprehension in CS education: code decomposition. We qualitatively analyze 560 D3 programs published on Obse… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2405.06835  [pdf, other

    cs.LG cs.AI cs.SE

    Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

    Authors: Harsh Patel, Buvaneswari A. Ramanan, Manzoor A. Khan, Thomas Williams, Brian Friedman, Lawrence Drabeck

    Abstract: This paper explores the possibilities of the current generation of Large Language Models for incorporating Machine Learning Operations (MLOps) functionalities into ML training code bases. We evaluate the performance of OpenAI (gpt-3.5-turbo) and WizardCoder (open-source, 15B parameters) models on the automated accomplishment of various MLOps functionalities in different settings. We perform a benc… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: The work was completed during 2Q, 3Q of Year 2023, when WizardCoder was the top performing Open source LLM for coding. Newer and better models have emerged since then. The processes and methodologies utilized for this benchmarking can still be utilized for evaluating the current SoTA models

  7. arXiv:2405.05618  [pdf, other

    cs.LG cs.CL

    An Automatic Prompt Generation System for Tabular Data Tasks

    Authors: Ashlesha Akella, Abhijit Manatkar, Brij Chavda, Hima Patel

    Abstract: Efficient processing of tabular data is important in various industries, especially when working with datasets containing a large number of columns. Large language models (LLMs) have demonstrated their ability on several tasks through carefully crafted prompts. However, creating effective prompts for tabular datasets is challenging due to the structured nature of the data and the need to manage nu… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024 Industry Track

  8. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  9. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  10. arXiv:2404.15485  [pdf

    cs.CL cs.AI

    Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts

    Authors: Het Patel, Umair Rehman, Farkhund Iqbal

    Abstract: Phishing, a prevalent cybercrime tactic for decades, remains a significant threat in today's digital world. By leveraging clever social engineering elements and modern technology, cybercrime targets many individuals, businesses, and organizations to exploit trust and security. These cyber-attackers are often disguised in many trustworthy forms to appear as legitimate sources. By cleverly using psy… ▽ More

    Submitted 6 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 7 pages, 3 figures

  11. arXiv:2404.01897  [pdf, other

    cs.NE cs.AI cs.LG

    Continuous Spiking Graph Neural Networks

    Authors: Nan Yin, Mengzhu Wan, Li Shen, Hitesh Laxmichand Patel, Baopu Li, Bin Gu, Huan Xiong

    Abstract: Continuous graph neural networks (CGNNs) have garnered significant attention due to their ability to generalize existing discrete graph neural networks (GNNs) by introducing continuous dynamics. They typically draw inspiration from diffusion-based methods to introduce a novel propagation scheme, which is analyzed using ordinary differential equations (ODE). However, the implementation of CGNNs req… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  12. arXiv:2403.18958  [pdf, other

    cs.SE cs.AI

    A State-of-the-practice Release-readiness Checklist for Generative AI-based Software Products

    Authors: Harsh Patel, Dominique Boucher, Emad Fallahzadeh, Ahmed E. Hassan, Bram Adams

    Abstract: This paper investigates the complexities of integrating Large Language Models (LLMs) into software products, with a focus on the challenges encountered for determining their readiness for release. Our systematic review of grey literature identifies common challenges in deploying LLMs, ranging from pre-training and fine-tuning to user experience considerations. The study introduces a comprehensive… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  13. arXiv:2403.09806  [pdf, other

    cs.AI

    xLP: Explainable Link Prediction for Master Data Management

    Authors: Balaji Ganesan, Matheen Ahmed Pasha, Srinivasa Parkala, Neeraj R Singh, Gayatri Mishra, Sumit Bhatia, Hima Patel, Somashekar Naganna, Sameep Mehta

    Abstract: Explaining neural model predictions to users requires creativity. Especially in enterprise applications, where there are costs associated with users' time, and their trust in the model predictions is critical for adoption. For link prediction in master data management, we have built a number of explainability solutions drawing from research in interpretability, fact verification, path ranking, neu… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures, NeurIPS 2020 Competition and Demonstration Track. arXiv admin note: text overlap with arXiv:2012.05516

  14. arXiv:2403.02054  [pdf, other

    cs.AI

    Large Language Model-Based Evolutionary Optimizer: Reasoning with elitism

    Authors: Shuvayan Brahmachary, Subodh M. Joshi, Aniruddha Panda, Kaushik Koneripalli, Arun Kumar Sagotra, Harshil Patel, Ankush Sharma, Ameya D. Jagtap, Kaushic Kalyanaraman

    Abstract: Large Language Models (LLMs) have demonstrated remarkable reasoning abilities, prompting interest in their application as black-box optimizers. This paper asserts that LLMs possess the capability for zero-shot optimization across diverse scenarios, including multi-objective and high-dimensional problems. We introduce a novel population-based method for numerical optimization using LLMs called Lang… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  15. arXiv:2401.08993  [pdf, other

    cs.CY cs.IR

    Estimating Gender Completeness in Wikipedia

    Authors: Hrishikesh Patel, Tianwa Chen, Ivano Bongiovanni, Gianluca Demartini

    Abstract: Gender imbalance in Wikipedia content is a known challenge which the editor community is actively addressing. The aim of this paper is to provide the Wikipedia community with instruments to estimate the magnitude of the problem for different entity types (also known as classes) in Wikipedia. To this end, we apply class completeness estimation methods based on the gender attribute. Our results show… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  16. arXiv:2311.15138  [pdf, other

    cs.CV

    Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture

    Authors: Rutuja Gurav, Het Patel, Zhuocheng Shang, Ahmed Eldawy, Jia Chen, Elia Scudiero, Evangelos Papalexakis

    Abstract: Climate change is increasingly disrupting worldwide agriculture, making global food production less reliable. To tackle the growing challenges in feeding the planet, cutting-edge management strategies, such as precision agriculture, empower farmers and decision-makers with rich and actionable information to increase the efficiency and sustainability of their farming practices. Crop-type maps are k… ▽ More

    Submitted 4 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 AI for Science Workshop

  17. arXiv:2311.10456  [pdf, other

    cs.LG cs.AI physics.chem-ph physics.comp-ph

    Accurate and Fast Fischer-Tropsch Reaction Microkinetics using PINNs

    Authors: Harshil Patel, Aniruddha Panda, Tymofii Nikolaienko, Stanislav Jaso, Alejandro Lopez, Kaushic Kalyanaraman

    Abstract: Microkinetics allows detailed modelling of chemical transformations occurring in many industrially relevant reactions. Traditional way of solving the microkinetics model for Fischer-Tropsch synthesis (FTS) becomes inefficient when it comes to more advanced real-time applications. In this work, we address these challenges by using physics-informed neural networks(PINNs) for modelling FTS microkinet… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  18. arXiv:2310.09412  [pdf, other

    cs.AI cs.LG

    Hybrid Reinforcement Learning for Optimizing Pump Sustainability in Real-World Water Distribution Networks

    Authors: Harsh Patel, Yuan Zhou, Alexander P Lamb, Shu Wang, Jieliang Luo

    Abstract: This article addresses the pump-scheduling optimization problem to enhance real-time control of real-world water distribution networks (WDNs). Our primary objectives are to adhere to physical operational constraints while reducing energy consumption and operational costs. Traditional optimization techniques, such as evolution-based and genetic algorithms, often fall short due to their lack of conv… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  19. arXiv:2309.10160  [pdf, other

    physics.med-ph cs.AI

    RadOnc-GPT: A Large Language Model for Radiation Oncology

    Authors: Zhengliang Liu, Peilong Wang, Yiwei Li, Jason Holmes, Peng Shu, Lian Zhang, Chenbin Liu, Ninghao Liu, Dajiang Zhu, Xiang Li, Quanzheng Li, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

    Abstract: This paper presents RadOnc-GPT, a large language model specialized for radiation oncology through advanced tuning methods. RadOnc-GPT was finetuned on a large dataset of radiation oncology patient records from the Mayo Clinic in Arizona. The model employs instruction tuning on three key tasks - generating radiotherapy treatment regimens, determining optimal radiation modalities, and providing diag… ▽ More

    Submitted 5 November, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  20. arXiv:2308.04982  [pdf, other

    cs.CL cs.AI

    Exploring Multilingual Text Data Distillation

    Authors: Shivam Sahni, Harsh Patel

    Abstract: With the rise of deep learning, large datasets and complex models have become common, requiring significant computing power. To address this, data distillation has emerged as a technique to quickly train models with lower memory and time requirements. However, data distillation on text-based datasets hasn't been explored much because of the challenges rising due to its discrete nature. Additionall… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    ACM Class: F.2.2, I.2.7

  21. arXiv:2307.03966  [pdf, other

    cs.AI cs.SE

    Multi-Intent Detection in User Provided Annotations for Programming by Examples Systems

    Authors: Nischal Ashok Kumar, Nitin Gupta, Shanmukha Guttula, Hima Patel

    Abstract: In map** enterprise applications, data map** remains a fundamental part of integration development, but its time consuming. An increasing number of applications lack naming standards, and nested field structures further add complexity for the integration developers. Once the map** is done, data transformation is the next challenge for the users since each application expects data to be in a… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  22. arXiv:2306.13931  [pdf

    cs.LG cs.AI

    Comparative Study of Predicting Stock Index Using Deep Learning Models

    Authors: Harshal Patel, Bharath Kumar Bolla, Sabeesh E, Dinesh Reddy

    Abstract: Time series forecasting has seen many methods attempted over the past few decades, including traditional technical analysis, algorithmic statistical models, and more recent machine learning and artificial intelligence approaches. Recently, neural networks have been incorporated into the forecasting scenario, such as the LSTM and conventional RNN approaches, which utilize short-term and long-term d… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  23. arXiv:2306.09489  [pdf, other

    cs.CV cs.AI cs.MM

    The 2023 Video Similarity Dataset and Challenge

    Authors: Ed Pizzi, Giorgos Kordopatis-Zilos, Hiral Patel, Gheorghe Postelnicu, Sugosh Nagavara Ravindra, Akshay Gupta, Symeon Papadopoulos, Giorgos Tolias, Matthijs Douze

    Abstract: This work introduces a dataset, benchmark, and challenge for the problem of video copy detection and localization. The problem comprises two distinct but related tasks: determining whether a query video shares content with a reference video ("detection"), and additionally temporally localizing the shared content within each video ("localization"). The benchmark is designed to evaluate methods on t… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  24. arXiv:2304.05295  [pdf

    cs.CV cs.LG

    A Comprehensive Study on Object Detection Techniques in Unconstrained Environments

    Authors: Hrishitva Patel

    Abstract: Object detection is a crucial task in computer vision that aims to identify and localize objects in images or videos. The recent advancements in deep learning and Convolutional Neural Networks (CNNs) have significantly improved the performance of object detection techniques. This paper presents a comprehensive study of object detection techniques in unconstrained environments, including various ch… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 9 pages, 3 Figures, 2 Tables

  25. arXiv:2302.06155  [pdf, other

    cs.CL cs.AI

    Identifying Semantically Difficult Samples to Improve Text Classification

    Authors: Shashank Mujumdar, Stuti Mehta, Hima Patel, Suman Mitra

    Abstract: In this paper, we investigate the effect of addressing difficult samples from a given text dataset on the downstream text classification task. We define difficult samples as being non-obvious cases for text classification by analysing them in the semantic embedding space; specifically - (i) semantically similar samples that belong to different classes and (ii) semantically dissimilar samples that… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  26. arXiv:2211.05965  [pdf, other

    cs.HC cs.IR

    Using dynamic circles and squares to visualize spatio-temporal variation

    Authors: Harsh Patel, Nicole Schneider, Hanan Samet

    Abstract: Visualizations such as bar charts, scatter plots, and objects on geographical maps often convey critical information, including exact and relative numeric values, using shapes. The choice of shape and method of encoding information is often arbitrarily, or based on convention. However, past studies have shown that the human eye can be fooled by visual representations. The Ebbinghaus illusion demon… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  27. arXiv:2211.05823  [pdf, other

    cs.HC cs.IR

    CoronaViz: Visualizing Multilayer Spatiotemporal COVID-19 Data with Animated Geocircles

    Authors: Brian Ondov, Harsh B. Patel, Ai-Te Kuo, Hanan Samet, John Kastner, Yunheng Han, Hong Wei, Niklas Elmqvist

    Abstract: While many dashboards for visualizing COVID-19 data exist, most separate geospatial and temporal data into discrete visualizations or tables. Further, the common use of choropleth maps or space-filling map overlays supports only a single geospatial variable at once, making it difficult to compare the temporal and geospatial trends of multiple, potentially interacting variables, such as active case… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  28. arXiv:2211.01770  [pdf, other

    cs.LG cs.CV

    Exploring Explainability Methods for Graph Neural Networks

    Authors: Harsh Patel, Shivam Sahni

    Abstract: With the growing use of deep learning methods, particularly graph neural networks, which encode intricate interconnectedness information, for a variety of real tasks, there is a necessity for explainability in such settings. In this paper, we demonstrate the applicability of popular explainability approaches on Graph Attention Networks (GAT) for a graph-based super-pixel image classification task.… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  29. Deploying a Steered Query Optimizer in Production at Microsoft

    Authors: Wangda Zhang, Matteo Interlandi, Paul Mineiro, Shi Qiao, Nasim Ghazanfari Karlen Lie, Marc Friedman, Rafah Hosn, Hiren Patel, Alekh **dal

    Abstract: Modern analytical workloads are highly heterogeneous and massively complex, making generic query optimizers untenable for many customers and scenarios. As a result, it is important to specialize these optimizers to instances of the workloads. In this paper, we continue a recent line of work in steering a query optimizer towards better plans for a given workload, and make major strides in pushing p… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of the 2022 International Conference on Management of Data 2022 Jun 10 (pp. 2299-2311)

  30. arXiv:2207.13500  [pdf, other

    cs.SI cs.CL cs.IR

    Modelling Social Context for Fake News Detection: A Graph Neural Network Based Approach

    Authors: Pallabi Saikia, Kshitij Gundale, Ankit Jain, Dev Jadeja, Harvi Patel, Mohendra Roy

    Abstract: Detection of fake news is crucial to ensure the authenticity of information and maintain the news ecosystems reliability. Recently, there has been an increase in fake news content due to the recent proliferation of social media and fake content generation techniques such as Deep Fake. The majority of the existing modalities of fake news detection focus on content based approaches. However, most of… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Journal ref: copyright with IEEE, Paper No: 834, IJCNN, 2022 IEEE World Congress on Computational Intelligence

  31. arXiv:2207.11181  [pdf, other

    cs.CR eess.SY

    Secure and Lightweight Strong PUF Challenge Obfuscation with Keyed Non-linear FSR

    Authors: Kleber Stangherlin, Zhuanhao Wu, Hiren Patel, Manoj Sachdev

    Abstract: We propose a secure and lightweight key based challenge obfuscation for strong PUFs. Our architecture is designed to be resilient against learning attacks. Our obfuscation mechanism uses non-linear feedback shift registers (NLFSRs). Responses are directly provided to the user, without error correction or extra post-processing steps. We also discuss the cost of protecting our architecture against p… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  32. arXiv:2206.11840  [pdf, other

    cs.CR eess.SY

    Design Exploration and Security Assessment of PUF-on-PUF Implementations

    Authors: Kleber Stangherlin, Zhuanhao Wu, Hiren Patel, Manoj Sachdev

    Abstract: We design, implement, and assess the security of several variations of the PUF-on-PUF (POP) architecture. We perform extensive experiments with deep neural networks (DNNs), showing results that endorse its resilience to learning attacks when using APUFs with 6, or more, stages in the first layer. Compositions using APUFs with 2, and 4 stages are shown vulnerable to DNN attacks. We reflect on such… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  33. arXiv:2206.03440  [pdf, other

    eess.SY cs.CR

    Enhancing Strong PUF Security with Non-monotonic Response Quantization

    Authors: Kleber Stangherlin, Zhuanhao Wu, Hiren Patel, Manoj Sachdev

    Abstract: Strong physical unclonable functions (PUFs) provide a low-cost authentication primitive for resource constrained devices. However, most strong PUF architectures can be modeled through learning algorithms with a limited number of CRPs. In this paper, we introduce the concept of non-monotonic response quantization for strong PUFs. Responses depend not only on which path is faster, but also on the di… ▽ More

    Submitted 11 June, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

  34. arXiv:2205.08109  [pdf

    cs.LG cs.AI eess.SP

    Forecasting Solar Power Generation on the basis of Predictive and Corrective Maintenance Activities

    Authors: Soham Vyas, Yuvraj Goyal, Neel Bhatt, Sanskar Bhuwania, Hardik Patel, Shakti Mishra, Brijesh Tripathi

    Abstract: Solar energy forecasting has seen tremendous growth in the last decade using historical time series collected from a weather station, such as weather variables wind speed and direction, solar radiance, and temperature. It helps in the overall management of solar power plants. However, the solar power plant regularly requires preventive and corrective maintenance activities that further impact ener… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  35. arXiv:2204.01679  [pdf, other

    cs.AR

    Predictable Sharing of Last-level Cache Partitions for Multi-core Safety-critical Systems

    Authors: Zhuanhao Wu, Hiren Patel

    Abstract: Last-level cache (LLC) partitioning is a technique to provide temporal isolation and low worst-case latency (WCL) bounds when cores access the shared LLC in multicore safety-critical systems. A typical approach to cache partitioning involves allocating a separate partition to a distinct core. A central criticism of this approach is its poor utilization of cache storage. Today's trend of integratin… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  36. arXiv:2203.03704  [pdf, other

    cs.RO

    Mid-Air Helicopter Delivery at Mars Using a Jetpack

    Authors: Jeff Delaune, Jacob Izraelevitz, Samuel Sirlin, David Sternberg, Louis Giersch, L. Phillipe Tosi, Evgeniy Skliyanskiy, Larry Young, Michael Mischna, Shannah Withrow-Maser, Juergen Mueller, Joshua Bowman, Mark S Wallace, Havard F. Grip, Larry Matthies, Wayne Johnson, Matthew Keennon, Benjamin Pipenberg, Harsh Patel, Christopher Lim, Aaron Schutte, Marcel Veismann, Haley Cummings, Sarah Conley, Jonathan Bapst , et al. (10 additional authors not shown)

    Abstract: Mid-Air Helicopter Delivery (MAHD) is a new Entry, Descent and Landing (EDL) architecture to enable in situ mobility for Mars science at lower cost than previous missions. It uses a jetpack to slow down a Mars Science Helicopter (MSH) after separation from the backshell, and reach aerodynamic conditions suitable for helicopter take-off in mid air. For given aeroshell dimensions, only MAHD's lander… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted in 2022 IEEE Aerospace Conference

  37. arXiv:2112.14407  [pdf

    cs.HC cs.CY cs.LG

    The impacts of various parameters on learning process and machine learning based performance prediction in online coding competitions

    Authors: Hardik Patel, Purvi Koringa

    Abstract: Various parameters affect the performance of students in online coding competitions. Students' behavior, approach, emotions, and problem difficulty levels significantly impact their performance in online coding competitions. We have organized two coding competitions to understand the effects of the above parameters. We have done the online survey at the end of each coding competition, and it conta… ▽ More

    Submitted 26 September, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

  38. arXiv:2110.02313  [pdf, other

    cs.DB cs.AI cs.LG

    Phoebe: A Learning-based Checkpoint Optimizer

    Authors: Yiwen Zhu, Matteo Interlandi, Abhishek Roy, Krishnadhan Das, Hiren Patel, Malay Bag, Hitesh Sharma, Alekh **dal

    Abstract: Easy-to-use programming interfaces paired with cloud-scale processing engines have enabled big data system users to author arbitrarily complex analytical jobs over massive volumes of data. However, as the complexity and scale of analytical jobs increase, they encounter a number of unforeseen problems, hotspots with large intermediate data on temporary storage, longer job recovery time after failur… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the VLDB Endowment 14 (11), 2505-2518, 2021

  39. arXiv:2110.01395  [pdf

    cs.LG

    Prediction of IPL Match Outcome Using Machine Learning Techniques

    Authors: Srikantaiah K C, Aryan Khetan, Baibhav Kumar, Divy Tolani, Harshal Patel

    Abstract: India's most popular sport is cricket and is played across all over the nation in different formats like T20, ODI, and Test. The Indian Premier League (IPL) is a national cricket match where players are drawn from regional teams of India, National Team and also from international team. Many factors like live streaming, radio, TV broadcast made this league as popular among cricket fans. The predict… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: 8 pages. Atlantis Highlights in Computer Sciences, Proceedings of the 3rd International Conference on Integrated Intelligent Computing Communication & Security ICIIC 2021

  40. arXiv:2108.08791  [pdf, other

    cs.CV

    Image Inpainting using Partial Convolution

    Authors: Harsh Patel, Amey Kulkarni, Shivam Sahni, Udit Vyas

    Abstract: Image Inpainting is one of the very popular tasks in the field of image processing with broad applications in computer vision. In various practical applications, images are often deteriorated by noise due to the presence of corrupted, lost, or undesirable information. There have been various restoration techniques used in the past with both classical and deep learning approaches for handling such… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  41. arXiv:2108.08665  [pdf, other

    cs.SI

    Trust as a Metric for Resiliency in Signed Social Networks

    Authors: Harsh Patel, Shivam Sahni, Pushkar Mujumdar

    Abstract: Recent technological advancements have resulted in a surge in online trading, raising severe concerns about theft and fraud, especially on platforms like Bitcoin OTC (over-the-counter), where users' identities remain anonymous. To mitigate the risk, it has become essential to capture the reputation of users based on their trade histories. The who-trusts-whom signed network of people has the capabi… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  42. arXiv:2108.05935  [pdf, other

    cs.LG

    Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets

    Authors: Nitin Gupta, Hima Patel, Shazia Afzal, Naveen Panwar, Ruhi Sharma Mittal, Shanmukha Guttula, Abhinav Jain, Lokesh Nagalapatti, Sameep Mehta, Sandeep Hans, Pranay Lohia, Aniya Aggarwal, Diptikalyan Saha

    Abstract: The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks. However these techniques are not applicable to detect data issues in the context of machine learning tasks, like noisy labels, existence of overlap** classes… ▽ More

    Submitted 5 September, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

  43. arXiv:2107.08594  [pdf, other

    cs.DB cs.LG

    Optimal Resource Allocation for Serverless Queries

    Authors: Anish Pimpley, Shuo Li, Anubha Srivastava, Vishal Rohra, Yi Zhu, Soundararajan Srinivasan, Alekh **dal, Hiren Patel, Shi Qiao, Rathijit Sen

    Abstract: Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-data services. At the same time, it is incredibly hard for users to allocate resources per query in serverless processing systems, and they frequently misallocate by orders of magnitude. Unfortunately, prior work focused on predicting peak allocation while ignoring aggressive trade-offs between resource al… ▽ More

    Submitted 18 July, 2021; originally announced July 2021.

  44. arXiv:2105.07809  [pdf, other

    eess.IV cs.CV cs.LG

    Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Cheng-Ming Chiang, Hsien-Kai Kuo, Anastasia Sycheva, Radu Timofte, Min-Hung Chen, Man-Yu Lee, Yu-Syuan Xu, Yu Tseng, Shusong Xu, ** Guo, Chao-Hung Chen, Ming-Chun Hsyu, Wen-Chia Tsai, Chao-Wei Chen, Grigory Malivenko, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Zheng Shaolong, Hao Dejun, Xie Fen, Feng Zhuang , et al. (16 additional authors not shown)

    Abstract: As the quality of mobile cameras starts to play a crucial role in modern smartphones, more and more attention is now being paid to ISP algorithms used to improve various perceptual aspects of mobile photos. In this Mobile AI challenge, the target was to develop an end-to-end deep learning-based image signal processing (ISP) pipeline that can replace classical hand-crafted ISPs and achieve nearly r… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/

  45. arXiv:2102.11916  [pdf, other

    cs.CV

    Event Camera Based Real-Time Detection and Tracking of Indoor Ground Robots

    Authors: Himanshu Patel, Craig Iaboni, Deepan Lobo, Ji-won Choi, Pramod Abichandani

    Abstract: This paper presents a real-time method to detect and track multiple mobile ground robots using event cameras. The method uses density-based spatial clustering of applications with noise (DBSCAN) to detect the robots and a single k-dimensional ($k - d$) tree to accurately keep track of them as they move in an indoor arena. Robust detections and tracks are maintained in the face of event camera nois… ▽ More

    Submitted 2 August, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

  46. arXiv:2101.06949  [pdf, other

    cs.CL

    HinFlair: pre-trained contextual string embeddings for pos tagging and text classification in the Hindi language

    Authors: Harsh Patel

    Abstract: Recent advancements in language models based on recurrent neural networks and transformers architecture have achieved state-of-the-art results on a wide range of natural language processing tasks such as pos tagging, named entity recognition, and text classification. However, most of these language models are pre-trained in high resource languages like English, German, Spanish. Multi-lingual langu… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  47. arXiv:2012.05516  [pdf, other

    cs.CR cs.AI cs.LG cs.SI

    Explainable Link Prediction for Privacy-Preserving Contact Tracing

    Authors: Balaji Ganesan, Hima Patel, Sameep Mehta

    Abstract: Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contract tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contract tracing applications, which make people reluctant to install or update t… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: 8 pages, 7 figures, SpicyFL 2020 Workshop at NeurIPS 2020

  48. arXiv:2011.07313  [pdf

    cs.SE cs.LG

    Classification of Reverse-Engineered Class Diagram and Forward-Engineered Class Diagram using Machine Learning

    Authors: Kaushil Mangaroliya, Het Patel

    Abstract: UML Class diagram is very important to visualize the whole software we are working on and helps understand the whole system in the easiest way possible by showing the system classes, its attributes, methods, and relations with other objects. In the real world, there are two types of Class diagram engineers work with namely 1) Forward Engineered Class Diagram (FwCD) which are hand-made as part of t… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

  49. arXiv:2011.01504  [pdf, other

    cs.CL cs.IR

    BioNerFlair: biomedical named entity recognition using flair embedding and sequence tagger

    Authors: Harsh Patel

    Abstract: Motivation: The proliferation of Biomedical research articles has made the task of information retrieval more important than ever. Scientists and Researchers are having difficulty in finding articles that contain information relevant to them. Proper extraction of biomedical entities like Disease, Drug/chem, Species, Gene/protein, can considerably improve the filtering of articles resulting in bett… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

  50. arXiv:2010.07213  [pdf, other

    cs.DB cs.AI

    Data Readiness Report

    Authors: Shazia Afzal, Rajmohan C, Manish Kesarwani, Sameep Mehta, Hima Patel

    Abstract: Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which limits their reuse and results in loss of productivity. We introduce the concept of a Data Readiness Report as an accompanying documentation to a dataset that allow… ▽ More

    Submitted 15 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.