Search | arXiv e-print repository

TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

Authors: Yiran Luo, Het Patel, Yu Fu, Dawon Ahn, Jia Chen, Yue Dong, Evangelos E. Papalexakis

Abstract: Large language models (LLMs) have fundamentally transformed artificial intelligence, catalyzing recent advancements while imposing substantial environmental and computational burdens. We introduce TRAWL (Tensor Reduced and Approximated Weights for Large Language Models), a novel methodology for optimizing LLMs through tensor decomposition. TRAWL leverages diverse strategies to exploit matrices within transformer-based architectures, realizing notable performance enhancements without necessitating retraining. The most significant improvements were observed through a layer-by-layer intervention strategy, particularly when applied to fully connected weights of the final layers, yielding up to 16% enhancement in accuracy without the need for additional data or fine-tuning. These results underscore the importance of targeted and adaptive techniques in increasing the efficiency and effectiveness of large language model optimization, thereby promoting the development of more sustainable and accessible AI systems.

Submitted 25 June, 2024; originally announced June 2024.

Comments: 8 pages, 5 figures. Submitted to EMNLP 2024 and under review

MSC Class: 68T50 (Primary); 65F55 (Secondary) ACM Class: I.2.7

arXiv:2406.02844

Item-Language Model for Conversational Recommendation

Authors: Li Yang, Anushya Subbiah, Hardik Patel, Judith Yue Li, Yanwei Song, Reza Mirghaderi, Vikram Aggarwal

Abstract: Large language models (LLMs) have been extremely successful at tasks like complex dialogue understanding, reasoning and coding due to their emergent abilities. These emergent abilities have been extended with multi-modality to include image, audio, and video capabilities. Recommender systems, on the other hand, have been critical for information seeking and item discovery needs. Recently, there have been attempts to apply LLMs for recommendations. One difficulty of current attempts is that the underlying LLM is usually not trained on the recommender system data, which largely contains user interaction signals and is often not publicly available. Another difficulty is that user interaction signals often have a different pattern from natural language text, and it is currently unclear if the LLM training setup can learn more non-trivial knowledge from interaction signals compared with traditional recommender system methods. Finally, it is difficult to train multiple LLMs for different use-cases, and to retain the original language and reasoning abilities when learning from recommender system data. To address these three limitations, we propose an Item-Language Model (ILM), which is composed of an item encoder to produce text-aligned item representations that encode user interaction signals, and a frozen LLM that can understand those item representations with preserved pretrained knowledge. We conduct extensive experiments which demonstrate the importance of both language alignment and user interaction knowledge in the item encoder.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 15 pages, 3 figures

arXiv:2405.19338

Accurate Patient Alignment without Unnecessary Imaging Dose via Synthesizing Patient-specific 3D CT Images from 2D kV Images

Authors: Yuzhen Ding, Jason M. Holmes, Hongying Feng, Baoxin Li, Lisa A. McGee, Jean-Claude M. Rwigema, Sujay A. Vora, Daniel J. Ma, Robert L. Foote, Samir H. Patel, Wei Liu

Abstract: In radiotherapy, 2D orthogonally projected kV images are used for patient alignment when 3D on-board imaging (OBI) is unavailable. However, tumor visibility is constrained due to the projection of the patient's anatomy onto a 2D plane, potentially leading to substantial setup errors. In treatment rooms with 3D-OBI such as cone beam CT (CBCT), the field of view (FOV) of CBCT is limited and the imaging dose is unnecessarily high, making it unfavorable for pediatric patients. A solution to this dilemma is to reconstruct 3D CT from kV images obtained at the treatment position. Here, we propose a dual-model framework built with hierarchical ViT blocks. Unlike a proof-of-concept approach, our framework takes kV images as the sole input and can synthesize accurate, full-size 3D CT in real time (within milliseconds). We demonstrate the feasibility of the proposed approach on 10 patients with head and neck (H&N) cancer using image quality (MAE: <45 HU), dosimetric accuracy (Gamma passing rate (2%/2mm/10%) >97%) and patient position uncertainty (shift error: <0.4 mm). The proposed framework can generate accurate 3D CT faithfully mirroring real-time patient position, thus significantly improving patient setup accuracy, keeping imaging dose to a minimum, and maintaining treatment veracity.

Submitted 1 April, 2024; originally announced May 2024.

Comments: 17 pages, 8 figures and tables

arXiv:2405.16021

VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

Authors: Michael Ahn, Montserrat Gonzalez Arenas, Matthew Bennice, Noah Brown, Christine Chan, Byron David, Anthony Francis, Gavin Gonzalez, Rainer Hessmer, Tomas Jackson, Nikhil J Joshi, Daniel Lam, Tsang-Wei Edward Lee, Alex Luong, Sharath Maddineni, Harsh Patel, Jodilyn Peralta, Jornell Quiambao, Diego Reyes, Rosario M Jauregui Ruano, Dorsa Sadigh, Pannag Sanketi, Leila Takayama, Pavel Vodenski, Fei Xia

Abstract: Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon tasks with the help of humans or other robots. VADER leverages visual question answering (VQA) modules to detect visual affordances and recognize execution errors. It then generates prompts for a language model planner (LMP) which decides when to seek help from another robot or human to recover from errors in long-horizon task execution. We show the effectiveness of VADER with two long-horizon robotic tasks. Our pilot study showed that VADER is capable of performing complex long-horizon tasks by asking for help from another robot to clear a table. Our user study showed that VADER is capable of performing complex long-horizon tasks by asking for help from a human to clear a path. We gathered feedback from people (N=19) about the performance of VADER vs. a robot that did not ask for help. https://google-vader.github.io/

Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: 9 pages, 4 figures

arXiv:2405.14341

How do Observable Users Decompose D3 Code? An Exploratory Study

Authors: Melissa Lin, Heer Patel, Medina Lamkin, Tukey Tu, Hannah Bako, Soham Raut, Leilani Battle

Abstract: Users often struggle to program visualizations using complex toolkits like D3. Before we can design effective code assistants to support them, we must first understand how D3 users reason about their code. In this work, we explore users' understanding of D3 using an important gauge of code comprehension in CS education: code decomposition. We qualitatively analyze 560 D3 programs published on Observable and identify three distinct strategies for decomposing D3 programs: segmenting code into layers of functionality, keeping everything all in one cell, or creating reusable visualization functions. We also observe how users inherit decomposition methods from copied examples and reorganize copied code to suit their needs. We corroborate our findings for decomposition preferences through interviews with D3 and Observable users. Based on our findings, we suggest strategies for generating more intuitive D3 code recommendations using decomposition preferences and highlight new research opportunities for visualization code assistants. All supplemental materials are available at https://osf.io/sudb8/?view_only=302fc5c8d397412aac35c6e094ae7dd6.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.06835

Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Authors: Harsh Patel, Buvaneswari A. Ramanan, Manzoor A. Khan, Thomas Williams, Brian Friedman, Lawrence Drabeck

Abstract: This paper explores the possibilities of the current generation of Large Language Models for incorporating Machine Learning Operations (MLOps) functionalities into ML training code bases. We evaluate the performance of OpenAI (gpt-3.5-turbo) and WizardCoder (open-source, 15B parameters) models on the automated accomplishment of various MLOps functionalities in different settings. We perform a benchmarking study that assesses the ability of these models to: (1) adapt existing code samples (Inlining) with component-specific MLOps functionality such as MLflow and Weights & Biases for experiment tracking, Optuna for hyperparameter optimization, etc., and (2) perform the task of Translation from one component of an MLOps functionality to another, e.g., translating existing version control code based on the GitPython library to code based on the Data Version Control library. We also propose three different approaches that involve teaching LLMs to comprehend the API documentation of the components as a reference while accomplishing the Translation tasks. In our evaluations, the gpt-3.5-turbo model significantly outperforms WizardCoder by achieving impressive Pass@3 accuracy in model optimization (55% compared to 0% by WizardCoder), experiment tracking (100%, compared to 62.5% by WizardCoder), model registration (92% compared to 42% by WizardCoder) and hyperparameter optimization (83% compared to 58% by WizardCoder) on average, in their best possible settings, showcasing its superior code adaptability performance in complex MLOps tasks.

Submitted 10 May, 2024; originally announced May 2024.

Comments: The work was completed during 2Q, 3Q of Year 2023, when WizardCoder was the top performing Open source LLM for coding. Newer and better models have emerged since then. The processes and methodologies utilized for this benchmarking can still be utilized for evaluating the current SoTA models

arXiv:2405.05618

An Automatic Prompt Generation System for Tabular Data Tasks

Authors: Ashlesha Akella, Abhijit Manatkar, Brij Chavda, Hima Patel

Abstract: Efficient processing of tabular data is important in various industries, especially when working with datasets containing a large number of columns. Large language models (LLMs) have demonstrated their ability on several tasks through carefully crafted prompts. However, creating effective prompts for tabular datasets is challenging due to the structured nature of the data and the need to manage numerous columns. This paper presents an innovative auto-prompt generation system suitable for multiple LLMs, with minimal training. It proposes two novel methods: (1) a Reinforcement Learning-based algorithm for identifying and sequencing task-relevant columns, and (2) a cell-level similarity-based approach for enhancing few-shot example selection. Our approach has been extensively tested across 66 datasets, demonstrating improved performance in three downstream tasks (data imputation, error detection, and entity matching) using two distinct LLMs: Google flan-t5-xxl and Mixtral 8x7B.

Submitted 9 May, 2024; originally announced May 2024.

Comments: Accepted to NAACL 2024 Industry Track

arXiv:2405.04324

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabilities, including code generation, fixing bugs, explaining and documenting code, maintaining repositories, and more. In this work, we introduce the Granite series of decoder-only code models for code generative tasks, trained with code written in 116 programming languages. The Granite Code model family consists of models ranging in size from 3 to 34 billion parameters, suitable for applications ranging from complex application modernization tasks to on-device memory-constrained use cases. Evaluation on a comprehensive set of tasks demonstrates that Granite Code models consistently reach state-of-the-art performance among available open-source code LLMs. The Granite Code model family was optimized for enterprise software development workflows and performs well across a range of coding tasks (e.g. code generation, fixing and explanation), making it a versatile all-around code model. We release all our Granite Code models under an Apache 2.0 license for both research and commercial use.

Submitted 7 May, 2024; originally announced May 2024.

Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

arXiv:2405.01813

doi 10.1145/3555041.3589674

Towards Building Autonomous Data Services on Azure

Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to gain the most value from them. For cloud providers, managing every aspect of an ever-increasing set of data services, while meeting customer SLAs and minimizing operational cost is becoming more challenging. Cloud technology enables the collection of significant amounts of workload traces and system telemetry. With the progress in data science (DS) and machine learning (ML), it is feasible and desirable to utilize a data-driven, ML-based approach to automate various aspects of data services, resulting in the creation of autonomous data services. This paper presents our perspectives and insights on creating autonomous data services on Azure. It also covers the future endeavors we plan to undertake and unresolved issues that still need attention.

Submitted 2 May, 2024; originally announced May 2024.

Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

arXiv:2404.15485

Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts

Authors: Het Patel, Umair Rehman, Farkhund Iqbal

Abstract: Phishing, a prevalent cybercrime tactic for decades, remains a significant threat in today's digital world. By leveraging clever social engineering elements and modern technology, cybercrime targets many individuals, businesses, and organizations to exploit trust and security. These cyber-attackers are often disguised in many trustworthy forms to appear as legitimate sources. By cleverly using psychological elements like urgency, fear, social proof, and other manipulative strategies, phishers can lure individuals into revealing sensitive and personalized information. Building on this pervasive issue within modern technology, this paper aims to analyze the effectiveness of 15 Large Language Models (LLMs) in detecting phishing attempts, specifically focusing on a randomized set of "419 Scam" emails. The objective is to determine which LLMs can accurately detect phishing emails by analyzing a text file containing email metadata based on predefined criteria. The experiment concluded that the following models, ChatGPT 3.5, GPT-3.5-Turbo-Instruct, and ChatGPT, were the most effective in detecting phishing emails.

Submitted 6 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

Comments: 7 pages, 3 figures

arXiv:2404.01897

Continuous Spiking Graph Neural Networks

Authors: Nan Yin, Mengzhu Wan, Li Shen, Hitesh Laxmichand Patel, Baopu Li, Bin Gu, Huan Xiong

Abstract: Continuous graph neural networks (CGNNs) have garnered significant attention due to their ability to generalize existing discrete graph neural networks (GNNs) by introducing continuous dynamics. They typically draw inspiration from diffusion-based methods to introduce a novel propagation scheme, which is analyzed using ordinary differential equations (ODE). However, the implementation of CGNNs requires significant computational power, making them challenging to deploy on battery-powered devices. Inspired by recent spiking neural networks (SNNs), which emulate a biological inference process and provide an energy-efficient neural architecture, we incorporate the SNNs with CGNNs in a unified framework, named Continuous Spiking Graph Neural Networks (COS-GNN). We employ SNNs for graph node representation at each time step, which are further integrated into the ODE process along with time. To enhance information preservation and mitigate information loss in SNNs, we introduce the high-order structure of COS-GNN, which utilizes the second-order ODE for spiking representation and continuous propagation. Moreover, we provide the theoretical proof that COS-GNN effectively mitigates the issues of exploding and vanishing gradients, enabling us to capture long-range dependencies between nodes. Experimental results on graph-based learning tasks demonstrate the effectiveness of the proposed COS-GNN over competitive baselines.

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2403.18958

A State-of-the-practice Release-readiness Checklist for Generative AI-based Software Products

Authors: Harsh Patel, Dominique Boucher, Emad Fallahzadeh, Ahmed E. Hassan, Bram Adams

Abstract: This paper investigates the complexities of integrating Large Language Models (LLMs) into software products, with a focus on the challenges encountered for determining their readiness for release. Our systematic review of grey literature identifies common challenges in deploying LLMs, ranging from pre-training and fine-tuning to user experience considerations. The study introduces a comprehensive checklist designed to guide practitioners in evaluating key release readiness aspects such as performance, monitoring, and deployment strategies, aiming to enhance the reliability and effectiveness of LLM-based applications in real-world settings.

Submitted 27 March, 2024; originally announced March 2024.

arXiv:2403.09806

xLP: Explainable Link Prediction for Master Data Management

Authors: Balaji Ganesan, Matheen Ahmed Pasha, Srinivasa Parkala, Neeraj R Singh, Gayatri Mishra, Sumit Bhatia, Hima Patel, Somashekar Naganna, Sameep Mehta

Abstract: Explaining neural model predictions to users requires creativity, especially in enterprise applications, where there are costs associated with users' time and their trust in the model predictions is critical for adoption. For link prediction in master data management, we have built a number of explainability solutions drawing from research in interpretability, fact verification, path ranking, neuro-symbolic reasoning and self-explaining AI. In this demo, we present explanations for link prediction in a creative way, to allow users to choose explanations they are more comfortable with.

Submitted 14 March, 2024; originally announced March 2024.

Comments: 8 pages, 4 figures, NeurIPS 2020 Competition and Demonstration Track. arXiv admin note: text overlap with arXiv:2012.05516

arXiv:2403.02054

Large Language Model-Based Evolutionary Optimizer: Reasoning with elitism

Authors: Shuvayan Brahmachary, Subodh M. Joshi, Aniruddha Panda, Kaushik Koneripalli, Arun Kumar Sagotra, Harshil Patel, Ankush Sharma, Ameya D. Jagtap, Kaushic Kalyanaraman

Abstract: Large Language Models (LLMs) have demonstrated remarkable reasoning abilities, prompting interest in their application as black-box optimizers. This paper asserts that LLMs possess the capability for zero-shot optimization across diverse scenarios, including multi-objective and high-dimensional problems. We introduce a novel population-based method for numerical optimization using LLMs called Language-Model-Based Evolutionary Optimizer (LEO). Our hypothesis is supported through numerical examples, spanning benchmark and industrial engineering problems such as supersonic nozzle shape optimization, heat transfer, and windfarm layout optimization. We compare our method to several gradient-based and gradient-free optimization approaches. While LLMs yield comparable results to state-of-the-art methods, their imaginative nature and propensity to hallucinate demand careful handling. We provide practical guidelines for obtaining reliable answers from LLMs and discuss method limitations and potential research directions.

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2401.08993

Estimating Gender Completeness in Wikipedia

Authors: Hrishikesh Patel, Tianwa Chen, Ivano Bongiovanni, Gianluca Demartini

Abstract: Gender imbalance in Wikipedia content is a known challenge which the editor community is actively addressing. The aim of this paper is to provide the Wikipedia community with instruments to estimate the magnitude of the problem for different entity types (also known as classes) in Wikipedia. To this end, we apply class completeness estimation methods based on the gender attribute. Our results show not only which gender is more prevalent in Wikipedia for different sub-classes of Person, but also give an idea of how complete the coverage is for different genders and sub-classes of Person.

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2312.03462

Modelling of shattered pellet injection experiments on the ASDEX Upgrade tokamak

Authors: Anshkumar Himanshu Patel

Abstract: A disruption mitigation system (DMS) is necessary for fusion-grade tokamaks like ITER in order to ensure the preservation of machine components throughout their designated operational lifespan. To address the intense heat and electromagnetic loads that occur during a disruption, a shattered pellet injection (SPI) system will be employed. The penetration and assimilation (ionized material that stays inside the plasma volume) of the injected material is influenced by various SPI parameters, including the fragment sizes, speeds, and composition of the shattered fragments. An SPI system was installed on the ASDEX Upgrade tokamak to study the effect of the aforementioned parameters. In this thesis, 1.5D simulations with the INDEX code have been utilised to conduct parametric scans, thus examining the influence of fragment sizes, velocities, and pellet composition on the efficacy of disruption mitigation. When injecting only deuterium, I found material assimilation to be limited to the edge of the plasma with larger and faster fragments leading to higher assimilation. For mixed deuterium/neon injections, again, larger and faster fragments enabled higher assimilation. The amount of assimilated neon increased with increasing injected neon amounts but saturated for larger neon fraction pellets. I also carried out comparisons with previous experimental results of penetration, material assimilation and pre-TQ duration.
Previous experimental results for pure deuterium injections indicated that larger and faster fragments exhibit greater penetration, aligning with findings from the simulations. Simulated material assimilation trends for pure deuterium injections were also found to be qualitatively similar to the experiments. Nonetheless, a major difference in the quantitative assimilation values was identified, likely associated with the experimental assimilation criterion.

Submitted 6 December, 2023; originally announced December 2023.

Comments: 43 pages, 47 figures, master's thesis

arXiv:2311.15138

Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture

Authors: Rutuja Gurav, Het Patel, Zhuocheng Shang, Ahmed Eldawy, Jia Chen, Elia Scudiero, Evangelos Papalexakis

Abstract: Climate change is increasingly disrupting worldwide agriculture, making global food production less reliable. To tackle the growing challenges in feeding the planet, cutting-edge management strategies, such as precision agriculture, empower farmers and decision-makers with rich and actionable information to increase the efficiency and sustainability of their farming practices. Crop-type maps are key information for decision-support tools but are challenging and costly to generate. We investigate the capabilities of Meta AI's Segment Anything Model (SAM) for crop-map prediction task, acknowledging its recent successes at zero-shot image segmentation. However, SAM being limited to up-to 3 channel inputs and its zero-shot usage being class-agnostic in nature pose unique challenges in using it directly for crop-type mapping. We propose using clustering consensus metrics to assess SAM's zero-shot performance in segmenting satellite imagery and producing crop-type maps. Although direct crop-type mapping is challenging using SAM in zero-shot setting, experiments reveal SAM's potential for swiftly and accurately outlining fields in satellite images, serving as a foundation for subsequent crop classification. This paper attempts to highlight a use-case of state-of-the-art image segmentation models like SAM for crop-type mapping and related specific needs of the agriculture industry, offering a potential avenue for automatic, efficient, and cost-effective data products for precision agriculture practices.

Submitted 4 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

Comments: Accepted at NeurIPS 2023 AI for Science Workshop

arXiv:2311.10456 [pdf, other]

Accurate and Fast Fischer-Tropsch Reaction Microkinetics using PINNs

Authors: Harshil Patel, Aniruddha Panda, Tymofii Nikolaienko, Stanislav Jaso, Alejandro Lopez, Kaushic Kalyanaraman

Abstract: Microkinetics allows detailed modelling of chemical transformations occurring in many industrially relevant reactions. Traditional way of solving the microkinetics model for Fischer-Tropsch synthesis (FTS) becomes inefficient when it comes to more advanced real-time applications. In this work, we address these challenges by using physics-informed neural networks (PINNs) for modelling FTS microkinetics. We propose a computationally efficient and accurate method, enabling the ultra-fast solution of the existing microkinetics models in realistic process conditions. The proposed PINN model computes the fraction of vacant catalytic sites, a key quantity in FTS microkinetics, with median relative error (MRE) of 0.03%, and the FTS product formation rates with MRE of 0.1%. Compared to conventional equation solvers, the model achieves up to 1E+06 times speed-up when running on GPUs, thus being fast enough for multi-scale and multi-physics reactor modelling and enabling its applications in real-time process control and optimization.

Submitted 17 November, 2023; originally announced November 2023.

arXiv:2310.09412 [pdf, other]

Hybrid Reinforcement Learning for Optimizing Pump Sustainability in Real-World Water Distribution Networks

Authors: Harsh Patel, Yuan Zhou, Alexander P Lamb, Shu Wang, Jieliang Luo

Abstract: This article addresses the pump-scheduling optimization problem to enhance real-time control of real-world water distribution networks (WDNs). Our primary objectives are to adhere to physical operational constraints while reducing energy consumption and operational costs. Traditional optimization techniques, such as evolution-based and genetic algorithms, often fall short due to their lack of convergence guarantees. Conversely, reinforcement learning (RL) stands out for its adaptability to uncertainties and reduced inference time, enabling real-time responsiveness. However, the effective implementation of RL is contingent on building accurate simulation models for WDNs, and prior applications have been limited by errors in simulation training data. These errors can potentially cause the RL agent to learn misleading patterns and actions and recommend suboptimal operational strategies. To overcome these challenges, we present an improved "hybrid RL" methodology. This method integrates the benefits of RL while anchoring it in historical data, which serves as a baseline to incrementally introduce optimal control recommendations. By leveraging operational data as a foundation for the agent's actions, we enhance the explainability of the agent's actions, foster more robust recommendations, and minimize error. Our findings demonstrate that the hybrid RL agent can significantly improve sustainability, operational efficiency, and dynamically adapt to emerging scenarios in real-world WDNs.

Submitted 13 October, 2023; originally announced October 2023.

arXiv:2309.10160 [pdf, other]

RadOnc-GPT: A Large Language Model for Radiation Oncology

Authors: Zhengliang Liu, Peilong Wang, Yiwei Li, Jason Holmes, Peng Shu, Lian Zhang, Chenbin Liu, Ninghao Liu, Dajiang Zhu, Xiang Li, Quanzheng Li, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

Abstract: This paper presents RadOnc-GPT, a large language model specialized for radiation oncology through advanced tuning methods. RadOnc-GPT was finetuned on a large dataset of radiation oncology patient records from the Mayo Clinic in Arizona. The model employs instruction tuning on three key tasks - generating radiotherapy treatment regimens, determining optimal radiation modalities, and providing diagnostic descriptions/ICD codes based on patient diagnostic details. Evaluations conducted by comparing RadOnc-GPT outputs to general large language model outputs showed higher ROUGE scores in these three tasks. The study demonstrated the potential of using large language models fine-tuned using domain-specific knowledge like RadOnc-GPT to achieve transformational capabilities in highly specialized healthcare fields such as radiation oncology. However, our model's clinical relevance requires confirmation, and it specializes in only the aforementioned three specific tasks and lacks broader applicability. Furthermore, its evaluation through ROUGE scores might not reflect the true semantic and clinical accuracy - challenges we intend to address in future research.

Submitted 5 November, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

arXiv:2308.14515 [pdf, other]

Spreading of a viscoelastic drop on a solid substrate

Authors: Peyman Rostami, Mathis Fricke, Simon Schubotz, Himanshu Patel, Reza Azizmalayeri, Günter K. Auernhammer

Abstract: We study the spreading of viscous and viscoelastic drops on solid substrates with different wettability. In the early stages of spreading, we find that the viscoelastic drop spreads faster and with a different power law than the Newtonian drop (i.e. aqueous glycerine solution) for the same zero shear rate viscosity. We argue that the effect of viscoelasticity is only observable for experimental time scales in the order of the internal relaxation time of the polymer solution or longer times. Near the contact line, the effective viscosity is lower for the viscoelastic drop than for the Newtonian drop. Together with its shear rate dependency, this difference in effective viscosity can explain the different spreading dynamics. We support our experimental findings with a simple perturbation model that qualitatively agrees with our findings.

Submitted 29 August, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

arXiv:2308.04982 [pdf, other]

Exploring Multilingual Text Data Distillation

Authors: Shivam Sahni, Harsh Patel

Abstract: With the rise of deep learning, large datasets and complex models have become common, requiring significant computing power. To address this, data distillation has emerged as a technique to quickly train models with lower memory and time requirements. However, data distillation on text-based datasets hasn't been explored much because of the challenges arising from its discrete nature. Additionally, existing dataset distillation methods often struggle to generalize to new architectures. In the paper, we propose several data distillation techniques for multilingual text classification datasets using language-model-based learning methods. We conduct experiments to analyze their performance in terms of classification strength, and cross-architecture generalization. Furthermore, we investigate the language-specific fairness of the data summaries generated by these methods. Our approach builds upon existing techniques, enhancing cross-architecture generalization in the text data distillation domain.

Submitted 9 August, 2023; originally announced August 2023.

ACM Class: F.2.2, I.2.7

arXiv:2308.02547 [pdf, other]

$d$-mon: transmon with strong anharmonicity

Authors: Hrishikesh Patel, Vedangi Pathak, Oguzhan Can, Andrew C. Potter, Marcel Franz

Abstract: We propose a novel qubit architecture based on a planar $c$-axis Josephson junction between a thin flake $d$-wave superconductor ($d$SC), such as a high-$T_c$ cuprate Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$, and a conventional $s$-wave superconductor. When operated in the transmon regime the device -- that we call "$d$-mon" -- becomes insensitive to offset charge fluctuations and, importantly, exhibits at the same time energy level spectrum with strong anharmonicity that is widely tunable through the device geometry and applied magnetic flux. Crucially, unlike previous qubit designs based on $d$-wave superconductors the proposed device operates in a regime where quasiparticles are fully gapped and can be therefore expected to achieve long coherence times.

Submitted 9 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 4 pages main text + 6 pages SM; V2: corrected typos and updated references

arXiv:2307.03966 [pdf, other]

Multi-Intent Detection in User Provided Annotations for Programming by Examples Systems

Authors: Nischal Ashok Kumar, Nitin Gupta, Shanmukha Guttula, Hima Patel

Abstract: In mapping enterprise applications, data mapping remains a fundamental part of integration development, but it is time-consuming. An increasing number of applications lack naming standards, and nested field structures further add complexity for the integration developers. Once the mapping is done, data transformation is the next challenge for the users since each application expects data to be in a certain format. Also, while building integration flow, developers need to understand the format of the source and target data field and come up with transformation program that can change data from source to target format. The problem of automatic generation of a transformation program through program synthesis paradigm from some specifications has been studied since the early days of Artificial Intelligence (AI). Programming by Example (PBE) is one such kind of technique that targets automatic inferencing of a computer program to accomplish a format or string conversion task from user-provided input and output samples. To learn the correct intent, a diverse set of samples from the user is required. However, there is a possibility that the user fails to provide a diverse set of samples. This can lead to multiple intents or ambiguity in the input and output samples. Hence, PBE systems can get confused in generating the correct intent program. In this paper, we propose a deep neural network based ambiguity prediction model, which analyzes the input-output strings and maps them to a different set of properties responsible for multiple intent. Users can analyze these properties and accordingly can provide new samples or modify existing samples which can help in building a better PBE system for mapping enterprise applications.

Submitted 8 July, 2023; originally announced July 2023.

arXiv:2307.01416 [pdf]

Modelling small block aperture in an in-house developed GPU-accelerated Monte Carlo-based dose engine for pencil beam scanning proton therapy

Authors: Hongying Feng, Jason M. Holmes, Sujay A. Vora, Joshua B. Stoker, Martin Bues, William W. Wong, Terence S. Sio, Robert L. Foote, Samir H. Patel, Jiajian Shen, Wei Liu

Abstract: Purpose: To enhance an in-house graphic-processing-unit (GPU) accelerated virtual particle (VP)-based Monte Carlo (MC) proton dose engine (VPMC) to model aperture blocks in both dose calculation and optimization for pencil beam scanning proton therapy (PBSPT)-based stereotactic radiosurgery (SRS). Methods and Materials: A block aperture module was integrated into VPMC. VPMC was validated by an opensource code, MCsquare, in eight water phantom simulations with 3cm thick brass apertures: four were with aperture openings of 1, 2, 3, and 4cm without a range shifter, while the other four were with same aperture opening configurations with a range shifter of 45mm water equivalent thickness. VPMC was benchmarked with MCsquare and RayStation MC for 10 patients with small targets (average volume 8.4 cc). Finally, 3 patients were selected for robust optimization with aperture blocks using VPMC. Results: In the water phantoms, 3D gamma passing rate (2%/2mm/10%) between VPMC and MCsquare were 99.71$\pm$0.23%. In the patient geometries, 3D gamma passing rates (3%/2mm/10%) between VPMC/MCsquare and RayStation MC were 97.79$\pm$2.21%/97.78$\pm$1.97%, respectively. The calculation time was greatly decreased from 112.45$\pm$114.08 seconds (MCsquare) to 8.20$\pm$6.42 seconds (VPMC), both having statistical uncertainties of about 0.5%. The robustly optimized plans met all the dose-volume-constraints (DVCs) for the targets and OARs per our institutional protocols. The mean calculation time for 13 influence matrices in robust optimization by VPMC was 41.6 seconds. Conclusion: VPMC has been successfully enhanced to model aperture blocks in dose calculation and optimization for the PBSPT-based SRS.

Submitted 3 July, 2023; originally announced July 2023.

Comments: 3 tables, 3 figures

arXiv:2306.13931 [pdf]

Comparative Study of Predicting Stock Index Using Deep Learning Models

Authors: Harshal Patel, Bharath Kumar Bolla, Sabeesh E, Dinesh Reddy

Abstract: Time series forecasting has seen many methods attempted over the past few decades, including traditional technical analysis, algorithmic statistical models, and more recent machine learning and artificial intelligence approaches. Recently, neural networks have been incorporated into the forecasting scenario, such as the LSTM and conventional RNN approaches, which utilize short-term and long-term dependencies. This study evaluates traditional forecasting methods, such as ARIMA, SARIMA, and SARIMAX, and newer neural network approaches, such as DF-RNN, DSSM, and Deep AR, built using RNNs. The standard NIFTY-50 dataset from Kaggle is used to assess these models using metrics such as MSE, RMSE, MAPE, POCID, and Theil's U. Results show that Deep AR outperformed all other conventional deep learning and traditional approaches, with the lowest MAPE of 0.01 and RMSE of 189. Additionally, the performance of Deep AR and GRU did not degrade when the amount of training data was reduced, suggesting that these models may not require a large amount of data to achieve consistent and reliable performance. The study demonstrates that incorporating deep learning approaches in a forecasting scenario significantly outperforms conventional approaches and can handle complex datasets, with potential applications in various domains, such as weather predictions and other time series applications in a real-world scenario.

Submitted 24 June, 2023; originally announced June 2023.

arXiv:2306.09489 [pdf, other]

The 2023 Video Similarity Dataset and Challenge

Authors: Ed Pizzi, Giorgos Kordopatis-Zilos, Hiral Patel, Gheorghe Postelnicu, Sugosh Nagavara Ravindra, Akshay Gupta, Symeon Papadopoulos, Giorgos Tolias, Matthijs Douze

Abstract: This work introduces a dataset, benchmark, and challenge for the problem of video copy detection and localization. The problem comprises two distinct but related tasks: determining whether a query video shares content with a reference video ("detection"), and additionally temporally localizing the shared content within each video ("localization"). The benchmark is designed to evaluate methods on these two tasks, and simulates a realistic needle-in-haystack setting, where the majority of both query and reference videos are "distractors" containing no copied content. We propose a metric that reflects both detection and localization accuracy. The associated challenge consists of two corresponding tracks, each with restrictions that reflect real-world settings. We provide implementation code for evaluation and baselines. We also analyze the results and methods of the top submissions to the challenge. The dataset, baseline methods and evaluation code is publicly available and will be discussed at a dedicated CVPR'23 workshop.

Submitted 15 June, 2023; originally announced June 2023.

arXiv:2304.05295 [pdf]

A Comprehensive Study on Object Detection Techniques in Unconstrained Environments

Authors: Hrishitva Patel

Abstract: Object detection is a crucial task in computer vision that aims to identify and localize objects in images or videos. The recent advancements in deep learning and Convolutional Neural Networks (CNNs) have significantly improved the performance of object detection techniques. This paper presents a comprehensive study of object detection techniques in unconstrained environments, including various challenges, datasets, and state-of-the-art approaches. Additionally, we present a comparative analysis of the methods and highlight their strengths and weaknesses. Finally, we provide some future research directions to further improve object detection in unconstrained environments.

Submitted 11 April, 2023; originally announced April 2023.

Comments: 9 pages, 3 Figures, 2 Tables

arXiv:2304.04777 [pdf]

Green synthesis of silver nanoparticles using Curcuma longa flower extract and antibacterial activity

Authors: Kamal Kishor Rajak, Pavan Pahilani, Harsh Patel, Bhavtosh Kikani, Rucha Desai, Hemant Kumar

Abstract: Silver nanoparticles (AgNP's) possess inherent biological potentials that have obliged an alternative, eco-friendly, sustainable approach to "Green Synthesis." In the present study, we synthesized Green Silver Nanoparticles (GAgNP's) using Curcuma longa L. (C. longa) flower extract as a reducing and capping agent. The synthesized GAgNP's were characterized using UV-Visible spectroscopy, X-ray diffraction (XRD), and High-resolution transmission electron microscopy (HR-TEM), which confirmed their homogeneity and physical characteristics. The GAgNP's were found to contain crystalline silver through XRD, and the particles were confirmed to be homogeneous and spherical with a size of approximately 5 nm, as evidenced by UV-Visible spectroscopy, XRD, and HR-TEM. In addition, the biological potential of GAgNP's was evaluated for their antibacterial activities. GAgNP's showed significant activity and formed different sizes of inhibition zones against all selected bacteria: Mycobacterium smegmatis (M. smegmatis) (26 mm), Mycobacterium phlei (M. phlei), and Staphylococcus aureus (S. aureus) (22 mm), Staphylococcus epidermidis (S. epidermidis) and Klebsiella pneumoniae (K. pneumoniae) (18 mm), and Escherichia coli (E. coli) (13 mm). The MIC value of GAgNP's was found to be between 625 ug/mL-39.06 ug/mL for different microbes tested. With further research, the green synthesis of GAgNP's using C. longa flower extracts could lead to the development of effective antibacterial treatments in the medical field.

Submitted 10 April, 2023; originally announced April 2023.

arXiv:2302.06155 [pdf, other]

Identifying Semantically Difficult Samples to Improve Text Classification

Authors: Shashank Mujumdar, Stuti Mehta, Hima Patel, Suman Mitra

Abstract: In this paper, we investigate the effect of addressing difficult samples from a given text dataset on the downstream text classification task. We define difficult samples as being non-obvious cases for text classification by analysing them in the semantic embedding space; specifically - (i) semantically similar samples that belong to different classes and (ii) semantically dissimilar samples that belong to the same class. We propose a penalty function to measure the overall difficulty score of every sample in the dataset. We conduct exhaustive experiments on 13 standard datasets to show a consistent improvement of up to 9% and discuss qualitative results to show effectiveness of our approach in identifying difficult samples for a text classification model.

Submitted 13 February, 2023; originally announced February 2023.

arXiv:2212.08188 [pdf, other]

Ab initio in-medium similarity renormalization group for open-shell atomic systems

Authors: G. Tenkila, V. Chand, T. Miyagi, H. Patel, S. R. Stroberg, R. F. Garcia Ruiz, J. D. Holt

Abstract: Precise theoretical calculations of open-shell atomic systems are critical for extracting fundamental physics parameters from precision experiments. Here we present proof-of-principle calculations illustrating the effectiveness of the valence-space formulation of the ab initio in-medium similarity renormalization group, widely used in nuclear theory, as a new ab initio method for atomic systems. We adapt this approach to study properties of closed- and open-shell many-electron systems from helium to calcium. Ground-state energies, excitation spectra, and ionization energies are obtained for selected atoms, and reasonable agreement is found with benchmark coupled-cluster and many-body perturbation theory calculations, where available.

Submitted 15 December, 2022; originally announced December 2022.

Comments: 6 pages, 8 figures

arXiv:2211.05965 [pdf, other]

Using dynamic circles and squares to visualize spatio-temporal variation

Authors: Harsh Patel, Nicole Schneider, Hanan Samet

Abstract: Visualizations such as bar charts, scatter plots, and objects on geographical maps often convey critical information, including exact and relative numeric values, using shapes. The choice of shape and method of encoding information is often arbitrary, or based on convention. However, past studies have shown that the human eye can be fooled by visual representations. The Ebbinghaus illusion demonstrates that the perceived relative sizes of shapes depend on their configuration, which in turn can affect judgements, especially in visualizations like proportional symbol maps. In this study we evaluate the effects of varying the type of shapes and metrics for encoding data in visual representations on a spatio-temporal map interface. We find that some combinations of shape and metric are more conducive to accurate human judgements than others, and provide recommendations for applying these findings in future visualization designs.

Submitted 10 November, 2022; originally announced November 2022.

arXiv:2211.05823 [pdf, other]

CoronaViz: Visualizing Multilayer Spatiotemporal COVID-19 Data with Animated Geocircles

Authors: Brian Ondov, Harsh B. Patel, Ai-Te Kuo, Hanan Samet, John Kastner, Yunheng Han, Hong Wei, Niklas Elmqvist

Abstract: While many dashboards for visualizing COVID-19 data exist, most separate geospatial and temporal data into discrete visualizations or tables. Further, the common use of choropleth maps or space-filling map overlays supports only a single geospatial variable at once, making it difficult to compare the temporal and geospatial trends of multiple, potentially interacting variables, such as active cases, deaths, and vaccinations. We present CoronaViz, a COVID-19 visualization system that conveys multilayer, spatiotemporal data in a single, interactive display. CoronaViz encodes variables with concentric, hollow circles, termed geocircles, allowing multiple variables via color encoding and avoiding occlusion problems. The radii of geocircles relate to the values of the variables they represent via the psychophysically determined Flannery formula. The time dimension of spatiotemporal variables is encoded with sequential rendering. Animation controls allow the user to seek through time manually or to view the pandemic unfolding in accelerated time. An adjustable time window allows aggregation at any granularity, from single days to cumulative values for the entire available range. In addition to describing the CoronaViz system, we report findings from a user study comparing CoronaViz with multi-view dashboards from the New York Times and Johns Hopkins University. While participants preferred using the latter two dashboards to perform queries with only a geospatial component or only a temporal component, participants uniformly preferred CoronaViz for queries with both spatial and temporal components, highlighting the utility of a unified spatiotemporal encoding. CoronaViz is open-source and freely available at http://coronaviz.umiacs.io.

Submitted 10 November, 2022; originally announced November 2022.

arXiv:2211.01770 [pdf, other]

Exploring Explainability Methods for Graph Neural Networks

Authors: Harsh Patel, Shivam Sahni

Abstract: With the growing use of deep learning methods, particularly graph neural networks, which encode intricate interconnectedness information, for a variety of real tasks, there is a necessity for explainability in such settings. In this paper, we demonstrate the applicability of popular explainability approaches on Graph Attention Networks (GAT) for a graph-based super-pixel image classification task. We assess the qualitative and quantitative performance of these techniques on three different datasets and describe our findings. The results shed a fresh light on the notion of explainability in GNNs, particularly GATs.

Submitted 3 November, 2022; originally announced November 2022.

arXiv:2210.13625 [pdf, other]

doi 10.1145/3514221.3526052

Deploying a Steered Query Optimizer in Production at Microsoft

Authors: Wangda Zhang, Matteo Interlandi, Paul Mineiro, Shi Qiao, Nasim Ghazanfari, Karlen Lie, Marc Friedman, Rafah Hosn, Hiren Patel, Alekh Jindal

Abstract: Modern analytical workloads are highly heterogeneous and massively complex, making generic query optimizers untenable for many customers and scenarios. As a result, it is important to specialize these optimizers to instances of the workloads. In this paper, we continue a recent line of work in steering a query optimizer towards better plans for a given workload, and make major strides in pushing previous research ideas to production deployment. Along the way we solve several operational challenges, including making steering actions more manageable, keeping the costs of steering within budget, and avoiding unexpected performance regressions in production. Our resulting system, QQ-advisor, essentially externalizes the query planner to a massive offline pipeline for better exploration and specialization. We discuss various aspects of our design and show detailed results over production SCOPE workloads at Microsoft, where the system is currently enabled by default.

Submitted 24 October, 2022; originally announced October 2022.

Journal ref: Proceedings of the 2022 International Conference on Management of Data 2022 Jun 10 (pp. 2299-2311)

arXiv:2208.08481 [pdf, other]

doi 10.1016/j.physletb.2022.137361

Single neutron transfer on 23Ne and its relevance for the pathway of nucleosynthesis in astrophysical X-ray bursts

Authors: G. Lotay, J. Henderson, W. N. Catford, F. A. Ali, J. Berean, N. Bernier, S. S. Bhattacharjee, M. Bowry, R. Caballero-Folch, B. Davids, T. E. Drake, A. B. Garnsworthy, F. GhaziMoradi, S. A. Gillespie, B. Greaves, G. Hackman, S. Hallam, D. Hymers, E. Kasanda, D. Levy, B. K. Luna, A. Mathews, Z. Meisel, M. Moukaddam, D. Muecher, et al. (10 additional authors not shown)

Abstract: We present new experimental measurements of resonance strengths in the astrophysical 23Al(p, γ)24Si reaction, constraining the pathway of nucleosynthesis beyond 22Mg in X-ray burster scenarios. Specifically, we have performed the first measurement of the (d, p) reaction using a radioactive beam of 23Ne to explore levels in 24Ne, the mirror analog of 24Si. Four strong single-particle states were observed and corresponding neutron spectroscopic factors were extracted with a precision of $\sim$20%. Using these spectroscopic factors, together with mirror state identifications, we have reduced uncertainties in the strength of the key $\ell = 0$ resonance at $E_r = 157$ keV, in the astrophysical 23Al(p, γ) reaction, by a factor of 4. Our results show that the 22Mg(p, γ)23Al(p, γ) pathway dominates over the competing 22Mg(α, p) reaction in all but the most energetic X-ray burster events ($T > 0.85$ GK), significantly affecting energy production and the preservation of hydrogen fuel.

Submitted 17 August, 2022; originally announced August 2022.

Comments: 5 pages, 3 figures

arXiv:2208.01645 [pdf, other]

doi 10.1088/1402-4896/ad3178

Novel setup for detecting short-range anisotropic corrections to gravity

Authors: Jake S. Bobowski, Hrishikesh Patel, Mir Faizal

Abstract: In this paper we argue that, even though there are strong theoretical and empirical reasons to expect a violation of spatial isotropy at short distances, contemporary setups for probing gravitational interactions at short distances have not been configured to measure such spatial anisotropies. We propose a simple modification to the state-of-the-art torsion pendulum design and numerically demonstrate that it suppresses signals due to the large spatially-isotropic component of the gravitational force while maintaining a high sensitivity to short-range spatial anisotropies. We incorporate anisotropy using both Yukawa-type and power-law-type short-distance corrections to gravity. The proposed differential torsion pendulum is shown to be capable of making sensitive measurements of small gravitational anisotropies and the resulting anisotropic torques are largely independent of the details of the underlying short-distance modification to gravity. Thus, if there is an anisotropic modification to gravity, from any theory, in any form of the modified potential, the proposed setup provides a practical means of detecting it.

Submitted 1 April, 2024; v1 submitted 2 August, 2022; originally announced August 2022.

Comments: 14 pages, 9 figures

Journal ref: Phys. Scr. 99 045017 (2024)

arXiv:2207.13500 [pdf, other]

Modelling Social Context for Fake News Detection: A Graph Neural Network Based Approach

Authors: Pallabi Saikia, Kshitij Gundale, Ankit Jain, Dev Jadeja, Harvi Patel, Mohendra Roy

Abstract: Detection of fake news is crucial to ensure the authenticity of information and maintain the news ecosystem's reliability. Recently, there has been an increase in fake news content due to the proliferation of social media and fake content generation techniques such as Deep Fake. The majority of the existing modalities of fake news detection focus on content-based approaches. However, most of these techniques fail to deal with ultra-realistic synthesized media produced by generative models. Our recent studies find that the propagation characteristics of authentic and fake news are distinguishable, irrespective of their modalities. In this regard, we have investigated the auxiliary information based on social context to detect fake news. This paper analyzes the social context of fake news detection with a hybrid graph neural network based approach. This hybrid model integrates a graph neural network on the propagation of news with a Bidirectional Encoder Representations from Transformers (BERT) model on news content to learn the text features. The proposed approach thus learns both content and context features and is hence able to outperform the baseline models, with F1 scores of 0.91 on the PolitiFact dataset and 0.93 on the Gossipcop dataset.

Submitted 27 July, 2022; originally announced July 2022.

Journal ref: copyright with IEEE, Paper No: 834, IJCNN, 2022 IEEE World Congress on Computational Intelligence

arXiv:2207.12856 [pdf, other]

doi 10.1119/5.0110405

Low-cost Quadrature Optical Interferometer

Authors: Tanner M. Melody, Krishna H. Patel, Peter K. Nguyen, Christopher L. Smallwood

Abstract: We report on the construction and characterization of a low-cost Mach-Zehnder optical interferometer in which quadrature signal detection is achieved by means of polarization control. The device incorporates a generic green laser pointer, home-built photodetectors, 3D-printed optical mounts, a circular polarizer extracted from a pair of 3D movie glasses, and a Python-enabled microcontroller for analog-to-digital data acquisition. Components fit inside of a 12"x6" space and can be assembled on a budget of less than US$500. The device has the potential to make quadrature interferometry accessible and affordable for instructors, students, and enthusiasts alike.

Submitted 23 January, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

Comments: 11 pages, 7 figures

Journal ref: American Journal of Physics 91, 132-141 (2023)

arXiv:2207.11181 [pdf, other]

Secure and Lightweight Strong PUF Challenge Obfuscation with Keyed Non-linear FSR

Authors: Kleber Stangherlin, Zhuanhao Wu, Hiren Patel, Manoj Sachdev

Abstract: We propose a secure and lightweight key-based challenge obfuscation for strong PUFs. Our architecture is designed to be resilient against learning attacks. Our obfuscation mechanism uses non-linear feedback shift registers (NLFSRs). Responses are directly provided to the user, without error correction or extra post-processing steps. We also discuss the cost of protecting our architecture against power analysis attacks with clock randomization and Boolean masking. Security against learning attacks is assessed using the avalanche criterion and deep neural network attacks. We designed a testchip in 65 nm CMOS. When compared to the baseline arbiter PUF implementation, the cost increase of our proposed architecture is 1.27x and 2.2x when using clock randomization and Boolean masking, respectively.

Submitted 22 July, 2022; originally announced July 2022.

arXiv:2206.11840 [pdf, other]

Design Exploration and Security Assessment of PUF-on-PUF Implementations

Authors: Kleber Stangherlin, Zhuanhao Wu, Hiren Patel, Manoj Sachdev

Abstract: We design, implement, and assess the security of several variations of the PUF-on-PUF (POP) architecture. We perform extensive experiments with deep neural networks (DNNs), showing results that endorse its resilience to learning attacks when using APUFs with 6 or more stages in the first layer. Compositions using APUFs with 2 and 4 stages are shown to be vulnerable to DNN attacks. We reflect on such results, extending previous techniques of influential bits to assess stage bias in APUF instances. Our data shows that compositions do not always preserve the security properties of PUFs; the size of the PUFs used plays a crucial role. We implemented a testchip in 65 nm CMOS to obtain accurate measurements of uniformity, uniqueness, and response stability for our POP implementations. Measurement results show that the minimum bit error rate is obtained when using APUFs with 8 stages in the first layer, while fewer APUF stages lead to a large spread of bit error rate across different chips.

Submitted 23 June, 2022; originally announced June 2022.

arXiv:2206.11801 [pdf, other]

A Database for Reduced-Complexity Modeling of Fluid Flows

Authors: Aaron Towne, Scott T. M. Dawson, Guillaume A. Brès, Adrián Lozano-Durán, Theresa Saxton-Fox, Aadhy Parthasarathy, Anya R. Jones, Hulya Biler, Chi-An Yeh, Het D. Patel, Kunihiko Taira

Abstract: We present a publicly accessible database designed to aid in the conception, training, demonstration, evaluation, and comparison of reduced-complexity models for fluid mechanics. Availability of high-quality flow data is essential for all of these aspects of model development for both data-driven and physics-based methods. The database contains time-resolved data for six distinct datasets: a large eddy simulation of a turbulent jet, direct numerical simulations of a zero-pressure-gradient turbulent boundary layer, particle-image-velocimetry measurements for the same boundary layer at several Reynolds numbers, direct numerical simulations of laminar stationary and pitching flat-plate airfoils, particle-image-velocimetry and force measurements of an airfoil encountering a gust, and a large eddy simulation of the separated, turbulent flow over an airfoil. These six cases span several key flow categories: laminar and turbulent, statistically stationary and transient, tonal and broadband spectral content, canonical and application-oriented, wall-bounded and free-shear flow, and simulation and experimental measurements. For each dataset, we describe the flow setup and computational/experimental methods, catalog the data available in the database, and provide examples of how these data can be used for reduced-complexity modeling. All data can be downloaded using a browser interface or Globus. Our vision is that the common testbed provided by this database will aid the fluid mechanics community in clarifying the distinct capabilities of new and existing methods.

Submitted 23 June, 2022; originally announced June 2022.

arXiv:2206.03440 [pdf, other]

Enhancing Strong PUF Security with Non-monotonic Response Quantization

Authors: Kleber Stangherlin, Zhuanhao Wu, Hiren Patel, Manoj Sachdev

Abstract: Strong physical unclonable functions (PUFs) provide a low-cost authentication primitive for resource constrained devices. However, most strong PUF architectures can be modeled through learning algorithms with a limited number of CRPs. In this paper, we introduce the concept of non-monotonic response quantization for strong PUFs. Responses depend not only on which path is faster, but also on the distance between the arriving signals. Our experiments show that the resulting PUF has increased security against learning attacks. To demonstrate, we designed and implemented a non-monotonically quantized ring-oscillator based PUF in 65 nm technology. Measurement results show nearly ideal uniformity and uniqueness, with bit error rate of 13.4% over the temperature range from 0 C to 50 C.

Submitted 11 June, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

arXiv:2205.08109 [pdf]

Forecasting Solar Power Generation on the basis of Predictive and Corrective Maintenance Activities

Authors: Soham Vyas, Yuvraj Goyal, Neel Bhatt, Sanskar Bhuwania, Hardik Patel, Shakti Mishra, Brijesh Tripathi

Abstract: Solar energy forecasting has seen tremendous growth in the last decade using historical time series collected from a weather station, such as the weather variables wind speed and direction, solar radiance, and temperature. It helps in the overall management of solar power plants. However, a solar power plant regularly requires preventive and corrective maintenance activities that further impact energy production. This paper presents a novel work for forecasting solar power energy production based on maintenance activities, problems observed at a power plant, and weather data. The results are obtained on a dataset from the 1 MW solar power plant of PDEU (our university), which contains 13 columns of daily entries from 2012 to 2020. There are 12 structured columns and one unstructured column with manual text entries about different maintenance activities, problems observed, and weather conditions daily. The unstructured column is used to create a new feature column vector using a hash map, flag words, and stop words. The final dataset comprises five important feature vector columns based on correlation and causality analysis.

Submitted 17 May, 2022; originally announced May 2022.

arXiv:2204.07144 [pdf, other]

doi 10.1016/j.physletb.2022.137371

On the $W$-mass and New Higgs Bosons

Authors: Pavel Fileviez Perez, Hiren H. Patel, Alexis D. Plascencia

Abstract: We discuss the prediction of the $W$ boson mass in a simple extension of the Standard Model ($\Sigma{\rm SM}$) with a real scalar triplet. A shift in the $W$ mass as reported by the CDF II collaboration can naturally be accommodated by the model without modifying the Standard Model value for the $Z$ mass. We discuss the main implications and the properties of the new Higgs bosons. Namely, the partial decay widths of the new charged Higgs are predicted. Furthermore, the neutral Higgs has suppressed couplings to fermions and decays predominantly into a pair of $W$ gauge bosons.

Submitted 5 August, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

Comments: 5 pages, 2 figures. v2: Added discussion on the scalar mixing angle, version to appear in Physics Letters B

Journal ref: Physics Letters B 833 (2022) 137371

arXiv:2204.01679 [pdf, other]

Predictable Sharing of Last-level Cache Partitions for Multi-core Safety-critical Systems

Authors: Zhuanhao Wu, Hiren Patel

Abstract: Last-level cache (LLC) partitioning is a technique to provide temporal isolation and low worst-case latency (WCL) bounds when cores access the shared LLC in multicore safety-critical systems. A typical approach to cache partitioning involves allocating a separate partition to a distinct core. A central criticism of this approach is its poor utilization of cache storage. Today's trend of integrating a larger number of cores exacerbates this issue such that we are forced to consider shared LLC partitions for effective deployments. This work presents an approach to share LLC partitions among multiple cores while being able to provide low WCL bounds.

Submitted 4 April, 2022; originally announced April 2022.

arXiv:2203.03704 [pdf, other]

Mid-Air Helicopter Delivery at Mars Using a Jetpack

Authors: Jeff Delaune, Jacob Izraelevitz, Samuel Sirlin, David Sternberg, Louis Giersch, L. Phillipe Tosi, Evgeniy Skliyanskiy, Larry Young, Michael Mischna, Shannah Withrow-Maser, Juergen Mueller, Joshua Bowman, Mark S Wallace, Havard F. Grip, Larry Matthies, Wayne Johnson, Matthew Keennon, Benjamin Pipenberg, Harsh Patel, Christopher Lim, Aaron Schutte, Marcel Veismann, Haley Cummings, Sarah Conley, Jonathan Bapst, et al. (10 additional authors not shown)

Abstract: Mid-Air Helicopter Delivery (MAHD) is a new Entry, Descent and Landing (EDL) architecture to enable in situ mobility for Mars science at lower cost than previous missions. It uses a jetpack to slow down a Mars Science Helicopter (MSH) after separation from the backshell, and reach aerodynamic conditions suitable for helicopter take-off in mid air. For given aeroshell dimensions, only MAHD's lander-free approach leaves enough room in the aeroshell to accommodate the largest rotor option for MSH. This drastically improves flight performance, notably allowing a 150% increase in science payload mass. Compared to heritage EDL approaches, the simpler MAHD architecture is also likely to reduce cost, and enables access to more hazardous and higher-elevation terrains on Mars. This paper introduces a design for the MAHD system architecture and operations. We present a mechanical configuration that fits both MSH and the jetpack within the 2.65-m Mars heritage aeroshell, and a jetpack control architecture which fully leverages the available helicopter avionics. We discuss preliminary numerical models of the flow dynamics resulting from the interaction between the jets, the rotors and the side winds. We define a force-torque sensing architecture capable of handling the wind and trimming the rotors to prepare for safe take-off. Finally, we analyze the dynamic environment and closed-loop control simulation results to demonstrate the preliminary feasibility of MAHD.

Submitted 7 March, 2022; originally announced March 2022.

Comments: Accepted in 2022 IEEE Aerospace Conference

arXiv:2201.00051 [pdf]

doi 10.1016/j.ecresq.2020.04.004

Identifying the preschool home learning experiences that predict early number skills: Evidence from a longitudinal study

Authors: Elena Soto-Calvo, Fiona R. Simmons, Anne-Marie Adams, Hannah N. Francis, Hannah Patel, David Giofrè

Abstract: This study examines the longitudinal relationships between home learning experiences and early number skills. The counting, number transcoding and calculation skills of 274 children were assessed in the penultimate term of preschool ($M_{\rm age}$ = 4:0). Prior to these assessments, parents completed questionnaires that surveyed the frequency of the children's home learning experiences. Three types of experiences were indexed: code-focused home literacy experiences that focus on the phonological and orthographic features of language, meaning-focused home literacy experiences that focus on sharing the meaning of language and text, and home number experiences. The children's language abilities (phonological awareness and vocabulary) and nonverbal abilities (inhibitory control and nonverbal reasoning) were assessed in the final term of preschool ($M_{\rm age}$ = 4:3). Their number skills were reassessed in the final term of the first year of primary school ($M_{\rm age}$ = 5:3). Home letter-sound interaction experiences (interactive code-focused literacy experiences) had significant longitudinal relationships with counting and number transcoding that were independent of language and nonverbal abilities. The relationship between letter-sound interaction experiences and later counting was also independent of the autoregressive influence of baseline counting ability. We extend previous findings by demonstrating that interactive code-focused home literacy experiences in the preschool period predict growth in counting skills even when a broad range of language and cognitive abilities are controlled.

Submitted 31 December, 2021; originally announced January 2022.

Journal ref: Early Childhood Research Quarterly, 53, 314-328 (2020)

arXiv:2112.14407 [pdf]

The impacts of various parameters on learning process and machine learning based performance prediction in online coding competitions

Authors: Hardik Patel, Purvi Koringa

Abstract: Various parameters affect the performance of students in online coding competitions. Students' behavior, approach, emotions, and problem difficulty levels significantly impact their performance in online coding competitions. We organized two coding competitions to understand the effects of the above parameters. We conducted an online survey at the end of each coding competition, containing questions related to the behavior, approach, and emotions of students during online coding competitions. Students are evaluated based on the time and status of the submissions. We have carried out a detailed analysis to address the impact of students' approach, behavior, and emotions on the learning process in online coding competitions. Two difficulty levels are proposed based on the time and status of submissions. The impact of difficulty levels on machine learning-based performance prediction is presented in this research work. Based on time, the coding solution submissions fall into two classes, "Less than 15 minutes" and "More than 15 minutes". Based on submission status, there are three classes: "Complete solution", "Partial solution", and "Not submitted at all". The appropriate approaches for submitting the solution within 15 minutes are identified for both coding competitions. Machine learning classifiers are trained and evaluated for the above classification problems. The impacts of mood, emotions, and difficulty levels on the learning process are also assessed by comparing the results of machine learning models for both coding competitions.

Submitted 26 September, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

arXiv:2111.14657 [pdf, ps, other]

Orthosymplectic Cauchy identities

Authors: Aalekh Patel, Harsh Patel, Anna Stokke

Abstract: We give bijective proofs of orthosymplectic analogues of the Cauchy identity and dual Cauchy identity for orthosymplectic Schur functions. To do so, we present two insertion algorithms; these are orthosymplectic versions of Berele's symplectic insertion algorithms, which were used by Sundaram to give bijective proofs of Cauchy identities for symplectic Schur functions.

Submitted 23 December, 2021; v1 submitted 29 November, 2021; originally announced November 2021.

MSC Class: 05E05; 05E10

Showing 1–50 of 135 results for author: Patel, H