Search | arXiv e-print repository

Surface-Functionalization of Oleate-Capped Nano-Emitters for Stable Dispersion in 3D-Printable Polymers

Authors: Akhilesh Kumar Pathak, Sachin Prashant Kulkarni, Rachel R. Chan, Chad A. Mirkin, Koray Aydin, Sridhar Krishnaswamy

Abstract: Two-photon polymerization (2PP) 3D printing is a well-known technique for fabricating passive micro/nanoscale structures, such as microlenses and inversely designed polarization splitters. The integration of light emitting nanoparticle (NP) dopants, such as quantum dots (QDs) and rare-earth doped nanoparticles (RENPs), into a polymer resist would enable 3D printing of active polymer micro-photonic… ▽ More Two-photon polymerization (2PP) 3D printing is a well-known technique for fabricating passive micro/nanoscale structures, such as microlenses and inversely designed polarization splitters. The integration of light emitting nanoparticle (NP) dopants, such as quantum dots (QDs) and rare-earth doped nanoparticles (RENPs), into a polymer resist would enable 3D printing of active polymer micro-photonic devices, including sensors, lasers, and solid-state displays. Many NPs are stabilized with oleic acid ligands to prevent degradation, but oleate-capped NPs (oc-NPs) tend to agglomerate in nonpolar media despite the hydrophobicity of the ligand. This results in an uneven distribution of NPs in polymers and increased optical extinction properties. In this work, we propose a general approach for dispersing various oc-NPs in commercial 3D printable polymers. We achieve controlled growth of small carbon chains around the oc-NPs by functionalizing the NPs with methyl-methacrylate monomers. The proposed approach is validated on RENPs (~65 nm) and CdSe/ZnS quantum dots (~12 nm) using different commercial polymer resists (IP-Dip and IP-Visio). Dispersions of functionalized NPs (f-NPs) have improved NP density by an order of magnitude and are shown to be stable for several weeks with minimal impact on printing quality. Our approach is generalizable to a variety of oc-NPs and ultimately leads to higher quality polymer-based optical and electronic devices. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: 25 Pages, 12 Figures

arXiv:2406.16932 [pdf, other]

doi 10.17023/pn92-d609

Xi-Net: Transformer Based Seismic Waveform Reconstructor

Authors: Anshuman Gaharwar, Parth Parag Kulkarni, Joshua Dickey, Mubarak Shah

Abstract: Missing/erroneous data is a major problem in today's world. Collected seismic data sometimes contain gaps due to multitude of reasons like interference and sensor malfunction. Gaps in seismic waveforms hamper further signal processing to gain valuable information. Plethora of techniques are used for data reconstruction in other domains like image, video, audio, but translation of those methods to… ▽ More Missing/erroneous data is a major problem in today's world. Collected seismic data sometimes contain gaps due to multitude of reasons like interference and sensor malfunction. Gaps in seismic waveforms hamper further signal processing to gain valuable information. Plethora of techniques are used for data reconstruction in other domains like image, video, audio, but translation of those methods to address seismic waveforms demands adapting them to lengthy sequence inputs, which is practically complex. Even if that is accomplished, high computational costs and inefficiency would still persist in these predominantly convolution-based reconstruction models. In this paper, we present a transformer-based deep learning model, Xi-Net, which utilizes multi-faceted time and frequency domain inputs for accurate waveform reconstruction. Xi-Net converts the input waveform to frequency domain, employs separate encoders for time and frequency domains, and one decoder for getting reconstructed output waveform from the fused features. 1D shifted-window transformer blocks form the elementary units of all parts of the model. To the best of our knowledge, this is the first transformer-based deep learning model for seismic waveform reconstruction. We demonstrate this model's prowess by filling 0.5-1s random gaps in 120s waveforms, resembling the original waveform quite closely. The code, models can be found at: https://github.com/Anshuman04/waveformReconstructor. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: Oral Presentation at IEEE International Conference on Image Processing(ICIP) 2023 (Multidimensional Signal Processing Track)

arXiv:2405.00156 [pdf, other]

Expanding the Horizon: Enabling Hybrid Quantum Transfer Learning for Long-Tailed Chest X-Ray Classification

Authors: Skylar Chan, Pranav Kulkarni, Paul H. Yi, Vishwa S. Parekh

Abstract: Quantum machine learning (QML) has the potential for improving the multi-label classification of rare, albeit critical, diseases in large-scale chest x-ray (CXR) datasets due to theoretical quantum advantages over classical machine learning (CML) in sample efficiency and generalizability. While prior literature has explored QML with CXRs, it has focused on binary classification tasks with small da… ▽ More Quantum machine learning (QML) has the potential for improving the multi-label classification of rare, albeit critical, diseases in large-scale chest x-ray (CXR) datasets due to theoretical quantum advantages over classical machine learning (CML) in sample efficiency and generalizability. While prior literature has explored QML with CXRs, it has focused on binary classification tasks with small datasets due to limited access to quantum hardware and computationally expensive simulations. To that end, we implemented a Jax-based framework that enables the simulation of medium-sized qubit architectures with significant improvements in wall-clock time over current software offerings. We evaluated the performance of our Jax-based framework in terms of efficiency and performance for hybrid quantum transfer learning for long-tailed classification across 8, 14, and 19 disease labels using large-scale CXR datasets. The Jax-based framework resulted in up to a 58% and 95% speed-up compared to PyTorch and TensorFlow implementations, respectively. However, compared to CML, QML demonstrated slower convergence and an average AUROC of 0.70, 0.73, and 0.74 for the classification of 8, 14, and 19 CXR disease labels. In comparison, the CML models had an average AUROC of 0.77, 0.78, and 0.80 respectively. In conclusion, our work presents an accessible implementation of hybrid quantum transfer learning for long-tailed CXR classification with a computationally efficient Jax-based framework. △ Less

Submitted 30 April, 2024; originally announced May 2024.

Comments: 11 pages, 13 figures, 3 tables

arXiv:2404.15656 [pdf, other]

MISLEAD: Manipulating Importance of Selected features for Learning Epsilon in Evasion Attack Deception

Authors: Vidit Khazanchi, Pavan Kulkarni, Yuvaraj Govindarajulu, Manojkumar Parmar

Abstract: Emerging vulnerabilities in machine learning (ML) models due to adversarial attacks raise concerns about their reliability. Specifically, evasion attacks manipulate models by introducing precise perturbations to input data, causing erroneous predictions. To address this, we propose a methodology combining SHapley Additive exPlanations (SHAP) for feature importance analysis with an innovative Optim… ▽ More Emerging vulnerabilities in machine learning (ML) models due to adversarial attacks raise concerns about their reliability. Specifically, evasion attacks manipulate models by introducing precise perturbations to input data, causing erroneous predictions. To address this, we propose a methodology combining SHapley Additive exPlanations (SHAP) for feature importance analysis with an innovative Optimal Epsilon technique for conducting evasion attacks. Our approach begins with SHAP-based analysis to understand model vulnerabilities, crucial for devising targeted evasion strategies. The Optimal Epsilon technique, employing a Binary Search algorithm, efficiently determines the minimum epsilon needed for successful evasion. Evaluation across diverse machine learning architectures demonstrates the technique's precision in generating adversarial samples, underscoring its efficacy in manipulating model outcomes. This study emphasizes the critical importance of continuous assessment and monitoring to identify and mitigate potential security risks in machine learning systems. △ Less

Submitted 2 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.08311 [pdf, other]

emucxl: an emulation framework for CXL-based disaggregated memory applications

Authors: Raja Gond, Purushottam Kulkarni

Abstract: The emergence of CXL (Compute Express Link) promises to transform the status of interconnects between host and devices and in turn impact the design of all software layers. With its low overhead, low latency, and memory coherency capabilities, CXL has the potential to improve the performance of existing devices while making viable new operational use cases (e.g., disaggregated memory pools, cache… ▽ More The emergence of CXL (Compute Express Link) promises to transform the status of interconnects between host and devices and in turn impact the design of all software layers. With its low overhead, low latency, and memory coherency capabilities, CXL has the potential to improve the performance of existing devices while making viable new operational use cases (e.g., disaggregated memory pools, cache coherent memory across devices etc.). The focus of this work is design of applications and middleware with use of CXL for supporting disaggregated memory. A vital building block for solutions in this space is the availability of a standard CXL hardware and software platform. Currently, CXL devices are not commercially available, and researchers often rely on custom-built hardware or emulation techniques and/or use customized software interfaces and abstractions. These techniques do not provide a standard usage model and abstraction layer for CXL usage, and developers and researchers have to reinvent the CXL setup to design and test their solutions, our work aims to provide a standardized view of the CXL emulation platform and the software interfaces and abstractions for disaggregated memory. This standardization is designed and implemented as a user space library, emucxl and is available as a virtual appliance. The library provides a user space API and is coupled with a NUMA-based CXL emulation backend. Further, we demonstrate usage of the standardized API for different use cases relying on disaggregated memory and show that generalized functionality can be built using the open source emucxl library. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2404.07374 [pdf, other]

Improving Multi-Center Generalizability of GAN-Based Fat Suppression using Federated Learning

Authors: Pranav Kulkarni, Adway Kanhere, Harshita Kukreja, Vivian Zhang, Paul H. Yi, Vishwa S. Parekh

Abstract: Generative Adversarial Network (GAN)-based synthesis of fat suppressed (FS) MRIs from non-FS proton density sequences has the potential to accelerate acquisition of knee MRIs. However, GANs trained on single-site data have poor generalizability to external data. We show that federated learning can improve multi-center generalizability of GANs for synthesizing FS MRIs, while facilitating privacy-pr… ▽ More Generative Adversarial Network (GAN)-based synthesis of fat suppressed (FS) MRIs from non-FS proton density sequences has the potential to accelerate acquisition of knee MRIs. However, GANs trained on single-site data have poor generalizability to external data. We show that federated learning can improve multi-center generalizability of GANs for synthesizing FS MRIs, while facilitating privacy-preserving multi-institutional collaborations. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 5 pages, 2 figures

arXiv:2403.20147 [pdf, other]

IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context

Authors: Nihar Ranjan Sahoo, Pranamya Prashant Kulkarni, Narjis Asad, Arif Ahmad, Tanu Goyal, Aparna Garimella, Pushpak Bhattacharyya

Abstract: The pervasive influence of social biases in language data has sparked the need for benchmark datasets that capture and evaluate these biases in Large Language Models (LLMs). Existing efforts predominantly focus on English language and the Western context, leaving a void for a reliable dataset that encapsulates India's unique socio-cultural nuances. To bridge this gap, we introduce IndiBias, a comp… ▽ More The pervasive influence of social biases in language data has sparked the need for benchmark datasets that capture and evaluate these biases in Large Language Models (LLMs). Existing efforts predominantly focus on English language and the Western context, leaving a void for a reliable dataset that encapsulates India's unique socio-cultural nuances. To bridge this gap, we introduce IndiBias, a comprehensive benchmarking dataset designed specifically for evaluating social biases in the Indian context. We filter and translate the existing CrowS-Pairs dataset to create a benchmark dataset suited to the Indian context in Hindi language. Additionally, we leverage LLMs including ChatGPT and InstructGPT to augment our dataset with diverse societal biases and stereotypes prevalent in India. The included bias dimensions encompass gender, religion, caste, age, region, physical appearance, and occupation. We also build a resource to address intersectional biases along three intersectional dimensions. Our dataset contains 800 sentence pairs and 300 tuples for bias measurement across different demographics. The dataset is available in English and Hindi, providing a size comparable to existing benchmark datasets. Furthermore, using IndiBias we compare ten different language models on multiple bias measurement metrics. We observed that the language models exhibit more bias across a majority of the intersectional groups. △ Less

Submitted 3 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

arXiv:2403.15218 [pdf, other]

Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations

Authors: Pranav Kulkarni, Adway Kanhere, Dharmam Savani, Andrew Chan, Devina Chatterjee, Paul H. Yi, Vishwa S. Parekh

Abstract: Curating annotations for medical image segmentation is a labor-intensive and time-consuming task that requires domain expertise, resulting in "narrowly" focused deep learning (DL) models with limited translational utility. Recently, foundation models like the Segment Anything Model (SAM) have revolutionized semantic segmentation with exceptional zero-shot generalizability across various domains, i… ▽ More Curating annotations for medical image segmentation is a labor-intensive and time-consuming task that requires domain expertise, resulting in "narrowly" focused deep learning (DL) models with limited translational utility. Recently, foundation models like the Segment Anything Model (SAM) have revolutionized semantic segmentation with exceptional zero-shot generalizability across various domains, including medical imaging, and hold a lot of promise for streamlining the annotation process. However, SAM has yet to be evaluated in a crowd-sourced setting to curate annotations for training 3D DL segmentation models. In this work, we explore the potential of SAM for crowd-sourcing "sparse" annotations from non-experts to generate "dense" segmentation masks for training 3D nnU-Net models, a state-of-the-art DL segmentation model. Our results indicate that while SAM-generated annotations exhibit high mean Dice scores compared to ground-truth annotations, nnU-Net models trained on SAM-generated annotations perform significantly worse than nnU-Net models trained on ground-truth annotations ($p<0.001$, all). △ Less

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2402.05713 [pdf, other]

Hidden in Plain Sight: Undetectable Adversarial Bias Attacks on Vulnerable Patient Populations

Authors: Pranav Kulkarni, Andrew Chan, Nithya Navarathna, Skylar Chan, Paul H. Yi, Vishwa S. Parekh

Abstract: The proliferation of artificial intelligence (AI) in radiology has shed light on the risk of deep learning (DL) models exacerbating clinical biases towards vulnerable patient populations. While prior literature has focused on quantifying biases exhibited by trained DL models, demographically targeted adversarial bias attacks on DL models and its implication in the clinical environment remains an u… ▽ More The proliferation of artificial intelligence (AI) in radiology has shed light on the risk of deep learning (DL) models exacerbating clinical biases towards vulnerable patient populations. While prior literature has focused on quantifying biases exhibited by trained DL models, demographically targeted adversarial bias attacks on DL models and its implication in the clinical environment remains an underexplored field of research in medical imaging. In this work, we demonstrate that demographically targeted label poisoning attacks can introduce undetectable underdiagnosis bias in DL models. Our results across multiple performance metrics and demographic groups like sex, age, and their intersectional subgroups show that adversarial bias attacks demonstrate high-selectivity for bias in the targeted group by degrading group model performance without impacting overall model performance. Furthermore, our results indicate that adversarial bias attacks result in biased DL models that propagate prediction bias even when evaluated with external datasets. △ Less

Submitted 7 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 29 pages, 4 figures

arXiv:2401.07148 [pdf, other]

Assessing the Effectiveness of Binary-Level CFI Techniques

Authors: Ruturaj K. Vaidya, Prasad A. Kulkarni

Abstract: Memory corruption is an important class of vulnerability that can be leveraged to craft control flow hijacking attacks. Control Flow Integrity (CFI) provides protection against such attacks. Application of type-based CFI policies requires information regarding the number and type of function arguments. Binary-level type recovery is inherently speculative, which motivates the need for an evaluation… ▽ More Memory corruption is an important class of vulnerability that can be leveraged to craft control flow hijacking attacks. Control Flow Integrity (CFI) provides protection against such attacks. Application of type-based CFI policies requires information regarding the number and type of function arguments. Binary-level type recovery is inherently speculative, which motivates the need for an evaluation framework to assess the effectiveness of binary-level CFI techniques compared with their source-level counterparts, where such type information is fully and accurately accessible. In this work, we develop a novel, generalized and extensible framework to assess how the program analysis information we get from state-of-the-art binary analysis tools affects the efficacy of type-based CFI techniques. We introduce new and insightful metrics to quantitatively compare source independent CFI policies with their ground truth source aware counterparts. We leverage our framework to evaluate binary-level CFI policies implemented using program analysis information extracted from the IDA Pro binary analyzer and compared with the ground truth information obtained from the LLVM compiler, and present our observations. △ Less

Submitted 13 January, 2024; originally announced January 2024.

Comments: 14 pages, 9 figures, 9 tables, Part of this work is to be published in 16th International Symposium on Foundations & Practice of Security (FPS - 2023)

arXiv:2312.15977 [pdf]

doi 10.1063/5.0191974

Perspective on nanoscale magnetic sensors using giant anomalous Hall effect in topological magnetic materials for read head application in magnetic recording

Authors: Tomoya Nakatani, Prabhanjan D. Kulkarni, Hirofumi Suto, Keisuke Masuda, Hitoshi Iwasaki, Yuya Sakuraba

Abstract: Recent advances in the study of materials with topological electronic band structures have revealed magnetic materials exhibiting giant anomalous Hall effects (AHE). The giant AHE has not only attracted the research interest in its mechanism but also opened up the possibility of practical application in magnetic sensors. In this article, we describe simulation-based investigations of AHE magnetic… ▽ More Recent advances in the study of materials with topological electronic band structures have revealed magnetic materials exhibiting giant anomalous Hall effects (AHE). The giant AHE has not only attracted the research interest in its mechanism but also opened up the possibility of practical application in magnetic sensors. In this article, we describe simulation-based investigations of AHE magnetic sensors for the applications to read head sensors (readers) of hard disk drives. With the shrinking of magnetic recording patterns, the reader technology, which currently uses multilayer-based tunnel magnetoresistance (TMR) devices, is associated with fundamental challenges, such as insufficient spatial resolution and signal-to-noise ratio (SNR) in sensors with dimensions below 20 nm. The structure of an AHE-based device composed of a single ferromagnetic material is advantageous for magnetic sensors with nanoscale dimensions. We found that AHE readers using topological ferromagnets with giant AHE, such as Co2MnGa, can achieve a higher SNR than current TMR readers. The higher SNR originates from the large output signal of the giant AHE as well as from the reduced thermal magnetic noise, which is the dominant noise in TMR readers. We highlight a major challenge in the development of AHE readers: the reduction in the output signal due to the shunting of the bias current and the leakage of the Hall voltage through the soft magnetic shields surrounding the AHE reader. We propose reader structures that overcome this challenge. Finally, we discuss the scope for future research to realize AHE readers. △ Less

Submitted 16 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

Comments: Revised version after peer-review

Journal ref: Appl. Phys. Lett. 124, 070501 (2024)

arXiv:2312.08509 [pdf, ps, other]

Approximating APS under Submodular and XOS valuations with Binary Marginals

Authors: Pooja Kulkarni, Rucha Kulkarni, Ruta Mehta

Abstract: We study the problem of fairly dividing indivisible goods among a set of agents under the fairness notion of Any Price Share (APS). APS is known to dominate the widely studied Maximin share (MMS). Since an exact APS allocation may not exist, the focus has traditionally been on the computation of approximate APS allocations. Babaioff et al. studied the problem under additive valuations, and asked (… ▽ More We study the problem of fairly dividing indivisible goods among a set of agents under the fairness notion of Any Price Share (APS). APS is known to dominate the widely studied Maximin share (MMS). Since an exact APS allocation may not exist, the focus has traditionally been on the computation of approximate APS allocations. Babaioff et al. studied the problem under additive valuations, and asked (i) how large can the APS value be compared to the MMS value? and (ii) what guarantees can one achieve beyond additive functions. We partly answer these questions by considering valuations beyond additive, namely submodular and XOS functions, with binary marginals. For the submodular functions with binary marginals, also known as matroid rank functions (MRFs), we show that APS is exactly equal to MMS. Consequently, we get that an exact APS allocation exists and can be computed efficiently while maximizing the social welfare. Complementing this result, we show that it is NP-hard to compute the APS value within a factor of 5/6 for submodular valuations with three distinct marginals of {0, 1/2, 1}. We then consider binary XOS functions, which are immediate generalizations of binary submodular functions in the complement free hierarchy. In contrast to the MRFs setting, MMS and APS values are not equal under this case. Nevertheless, we show that under binary XOS valuations, $MMS \leq APS \leq 2 \cdot MMS + 1$. Further, we show that this is almost the tightest bound we can get using MMS, by giving an instance where $APS \geq 2 \cdot MMS$. The upper bound on APS, implies a ~0.1222-approximation for APS under binary XOS valuations. And the lower bound implies the non-existence of better than 0.5-APS even when agents have identical valuations, which is in sharp contrast to the guaranteed existence of exact MMS allocation when agent valuations are identical. △ Less

Submitted 13 December, 2023; originally announced December 2023.

arXiv:2312.08504 [pdf, ps, other]

1/2 Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations

Authors: Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni, Ruta Mehta

Abstract: We study fair distribution of a collection of m indivisible goods among a group of n agents, using the widely recognized fairness principles of Maximin Share (MMS) and Any Price Share (APS). These principles have undergone thorough investigation within the context of additive valuations. We explore these notions for valuations that extend beyond additivity. First, we study approximate MMS under… ▽ More We study fair distribution of a collection of m indivisible goods among a group of n agents, using the widely recognized fairness principles of Maximin Share (MMS) and Any Price Share (APS). These principles have undergone thorough investigation within the context of additive valuations. We explore these notions for valuations that extend beyond additivity. First, we study approximate MMS under the separable (piecewise-linear) concave (SPLC) valuations, an important class generalizing additive, where the best known factor was 1/3-MMS. We show that 1/2-MMS allocation exists and can be computed in polynomial time, significantly improving the state-of-the-art. We note that SPLC valuations introduce an elevated level of intricacy in contrast to additive. For instance, the MMS value of an agent can be as high as her value for the entire set of items. Further, the equilibrium computation problem, which is polynomial-time for additive valuations, becomes intractable for SPLC. We use a relax-and-round paradigm that goes through competitive equilibrium and LP relaxation. Our result extends to give (symmetric) 1/2-APS, a stronger guarantee than MMS. APS is a stronger notion that generalizes MMS by allowing agents with arbitrary entitlements. We study the approximation of APS under submodular valuation functions. We design and analyze a simple greedy algorithm using concave extensions of submodular functions. We prove that the algorithm gives a 1/3-APS allocation which matches the current best-known factor. Concave extensions are hard to compute in polynomial time and are, therefore, generally not used in approximation algorithms. Our approach shows a way to utilize it within analysis (while bypassing its computation), and might be of independent interest. △ Less

Submitted 13 December, 2023; originally announced December 2023.

Comments: To appear in AAAI Conference on Artificial Intelligence, 2024

arXiv:2312.06979 [pdf, ps, other]

On the notion of Hallucinations from the lens of Bias and Validity in Synthetic CXR Images

Authors: Gauri Bhardwaj, Yuvaraj Govindarajulu, Sundaraparipurnan Narayanan, Pavan Kulkarni, Manojkumar Parmar

Abstract: Medical imaging has revolutionized disease diagnosis, yet the potential is hampered by limited access to diverse and privacy-conscious datasets. Open-source medical datasets, while valuable, suffer from data quality and clinical information disparities. Generative models, such as diffusion models, aim to mitigate these challenges. At Stanford, researchers explored the utility of a fine-tuned Stabl… ▽ More Medical imaging has revolutionized disease diagnosis, yet the potential is hampered by limited access to diverse and privacy-conscious datasets. Open-source medical datasets, while valuable, suffer from data quality and clinical information disparities. Generative models, such as diffusion models, aim to mitigate these challenges. At Stanford, researchers explored the utility of a fine-tuned Stable Diffusion model (RoentGen) for medical imaging data augmentation. Our work examines specific considerations to expand the Stanford research question, Could Stable Diffusion Solve a Gap in Medical Imaging Data? from the lens of bias and validity of the generated outcomes. We leveraged RoentGen to produce synthetic Chest-XRay (CXR) images and conducted assessments on bias, validity, and hallucinations. Diagnostic accuracy was evaluated by a disease classifier, while a COVID classifier uncovered latent hallucinations. The bias analysis unveiled disparities in classification performance among various subgroups, with a pronounced impact on the Female Hispanic subgroup. Furthermore, incorporating race and gender into input prompts exacerbated fairness issues in the generated images. The quality of synthetic images exhibited variability, particularly in certain disease classes, where there was more significant uncertainty compared to the original images. Additionally, we observed latent hallucinations, with approximately 42% of the images incorrectly indicating COVID, hinting at the presence of hallucinatory elements. These identifications provide new research directions towards interpretability of synthetic CXR images, for further understanding of associated risks and patient safety in medical applications. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023) - "Medical Imaging Meets NeurIPS" Workshop

arXiv:2308.05127 [pdf, other]

Data-Free Model Extraction Attacks in the Context of Object Detection

Authors: Harshit Shah, Aravindhan G, Pavan Kulkarni, Yuvaraj Govidarajulu, Manojkumar Parmar

Abstract: A significant number of machine learning models are vulnerable to model extraction attacks, which focus on stealing the models by using specially curated queries against the target model. This task is well accomplished by using part of the training data or a surrogate dataset to train a new model that mimics a target model in a white-box environment. In pragmatic situations, however, the target mo… ▽ More A significant number of machine learning models are vulnerable to model extraction attacks, which focus on stealing the models by using specially curated queries against the target model. This task is well accomplished by using part of the training data or a surrogate dataset to train a new model that mimics a target model in a white-box environment. In pragmatic situations, however, the target models are trained on private datasets that are inaccessible to the adversary. The data-free model extraction technique replaces this problem when it comes to using queries artificially curated by a generator similar to that used in Generative Adversarial Nets. We propose for the first time, to the best of our knowledge, an adversary black box attack extending to a regression problem for predicting bounding box coordinates in object detection. As part of our study, we found that defining a loss function and using a novel generator setup is one of the key aspects in extracting the target model. We find that the proposed model extraction method achieves significant results by using reasonable queries. The discovery of this object detection vulnerability will support future prospects for securing such models. △ Less

Submitted 9 August, 2023; originally announced August 2023.

Comments: Submitted to The 14th International Conference on Computer Vision Systems (ICVS 2023), to be published in Springer, Lecture Notes in Computer Science

arXiv:2307.12679 [pdf, other]

An Estimator for the Sensitivity to Perturbations of Deep Neural Networks

Authors: Naman Maheshwari, Nicholas Malaya, Scott Moe, Jaydeep P. Kulkarni, Sudhanva Gurumurthi

Abstract: For Deep Neural Networks (DNNs) to become useful in safety-critical applications, such as self-driving cars and disease diagnosis, they must be stable to perturbations in input and model parameters. Characterizing the sensitivity of a DNN to perturbations is necessary to determine minimal bit-width precision that may be used to safely represent the network. However, no general result exists that i… ▽ More For Deep Neural Networks (DNNs) to become useful in safety-critical applications, such as self-driving cars and disease diagnosis, they must be stable to perturbations in input and model parameters. Characterizing the sensitivity of a DNN to perturbations is necessary to determine minimal bit-width precision that may be used to safely represent the network. However, no general result exists that is capable of predicting the sensitivity of a given DNN to round-off error, noise, or other perturbations in input. This paper derives an estimator that can predict such quantities. The estimator is derived via inequalities and matrix norms, and the resulting quantity is roughly analogous to a condition number for the entire neural network. An approximation of the estimator is tested on two Convolutional Neural Networks, AlexNet and VGG-19, using the ImageNet dataset. For each of these networks, the tightness of the estimator is explored via random perturbations and adversarial attacks. △ Less

Submitted 24 July, 2023; originally announced July 2023.

Comments: Actual work and paper concluded in January 2019

arXiv:2307.11721 [pdf, other]

doi 10.1093/mnras/stad2134

MUSE-ALMA Haloes IX: Morphologies and Stellar Properties of Gas-rich Galaxies

Authors: Arjun Karki, Varsha P. Kulkarni, Simon Weng, Céline Péroux, Ramona Augustin, Matthew Hayes, Mohammadreza Ayromlou, Glenn G. Kacprzak, J. Christopher Howk, Roland Szakacs, Anne Klitsch, Aleksandra Hamanowicz, Alejandra Fresco, Martin A. Zwaan, Andrew D. Biggs, Andrew J. Fox, Susan Kassin, Harald Kuntschner

Abstract: Understanding how galaxies interact with the circumgalactic medium (CGM) requires determining how galaxies morphological and stellar properties correlate with their CGM properties. We report an analysis of 66 well-imaged galaxies detected in HST and VLT MUSE observations and determined to be within $\pm$500 km s$^{-1}$ of the redshifts of strong intervening quasar absorbers at… ▽ More Understanding how galaxies interact with the circumgalactic medium (CGM) requires determining how galaxies morphological and stellar properties correlate with their CGM properties. We report an analysis of 66 well-imaged galaxies detected in HST and VLT MUSE observations and determined to be within $\pm$500 km s$^{-1}$ of the redshifts of strong intervening quasar absorbers at $0.2 \lesssim z \lesssim 1.4$ with H I column densities $N_{\rm H I}$ $>$ $10^{18}$ $\rm cm^{-2}$. We present the geometrical properties (Sérsic indices, effective radii, axis ratios, and position angles) of these galaxies determined using GALFIT. Using these properties along with star formation rates (SFRs, estimated using the H$α$ or [O II] luminosity) and stellar masses ($M_{*}$ estimated from spectral energy distribution fits), we examine correlations among various stellar and CGM properties. Our main findings are as follows: (1) SFR correlates well with $M_{*}$, and most absorption-selected galaxies are consistent with the star formation main sequence (SFMS) of the global population. (2) More massive absorber counterparts are more centrally concentrated and are larger in size. (3) Galaxy sizes and normalized impact parameters correlate negatively with $N_{\rm H I}$, consistent with higher $N_{\rm H I}$ absorption arising in smaller galaxies, and closer to galaxy centers. (4) Absorption and emission metallicities correlate with $M_{*}$ and sSFR, implying metal-poor absorbers arise in galaxies with low past star formation and faster current gas consumption rates. (5) SFR surface densities of absorption-selected galaxies are higher than predicted by the Kennicutt-Schmidt relation for local galaxies, suggesting a higher star formation efficiency in the absorption-selected galaxies. △ Less

Submitted 21 July, 2023; originally announced July 2023.

Comments: Accepted for publication in MNRAS, 25 pages, 19 figures

MSC Class: newtxmath

arXiv:2307.03692 [pdf, other]

Becoming self-instruct: introducing early stop** criteria for minimal instruct tuning

Authors: Waseem AlShikh, Manhal Daaboul, Kirk Goddard, Brock Imel, Kiran Kamble, Parikshith Kulkarni, Melisa Russak

Abstract: In this paper, we introduce the Instruction Following Score (IFS), a metric that detects language models' ability to follow instructions. The metric has a dual purpose. First, IFS can be used to distinguish between base and instruct models. We benchmark publicly available base and instruct models, and show that the ratio of well formatted responses to partial and full sentences can be an effective… ▽ More In this paper, we introduce the Instruction Following Score (IFS), a metric that detects language models' ability to follow instructions. The metric has a dual purpose. First, IFS can be used to distinguish between base and instruct models. We benchmark publicly available base and instruct models, and show that the ratio of well formatted responses to partial and full sentences can be an effective measure between those two model classes. Secondly, the metric can be used as an early stop** criteria for instruct tuning. We compute IFS for Supervised Fine-Tuning (SFT) of 7B and 13B LLaMA models, showing that models learn to follow instructions relatively early in the training process, and the further finetuning can result in changes in the underlying base model semantics. As an example of semantics change we show the objectivity of model predictions, as defined by an auxiliary metric ObjecQA. We show that in this particular case, semantic changes are the steepest when the IFS tends to plateau. We hope that decomposing instruct tuning into IFS and semantic factors starts a new trend in better controllable instruct tuning and opens possibilities for designing minimal instruct interfaces querying foundation models. △ Less

Submitted 5 July, 2023; originally announced July 2023.

arXiv:2307.00438 [pdf, other]

One Copy Is All You Need: Resource-Efficient Streaming of Medical Imaging Data at Scale

Authors: Pranav Kulkarni, Adway Kanhere, Eliot Siegel, Paul H. Yi, Vishwa S. Parekh

Abstract: Large-scale medical imaging datasets have accelerated development of artificial intelligence tools for clinical decision support. However, the large size of these datasets is a bottleneck for users with limited storage and bandwidth. Many users may not even require such large datasets as AI models are often trained on lower resolution images. If users could directly download at their desired resol… ▽ More Large-scale medical imaging datasets have accelerated development of artificial intelligence tools for clinical decision support. However, the large size of these datasets is a bottleneck for users with limited storage and bandwidth. Many users may not even require such large datasets as AI models are often trained on lower resolution images. If users could directly download at their desired resolution, storage and bandwidth requirements would significantly decrease. However, it is impossible to anticipate every users' requirements and impractical to store the data at multiple resolutions. What if we could store images at a single resolution but send them at different ones? We propose MIST, an open-source framework to operationalize progressive resolution for streaming medical images at multiple resolutions from a single high-resolution copy. We demonstrate that MIST can dramatically reduce imaging infrastructure inefficiencies for hosting and streaming medical images by >90%, while maintaining diagnostic quality for deep learning applications. △ Less

Submitted 1 July, 2023; originally announced July 2023.

Comments: 13 pages, 4 figures, 2 tables

arXiv:2305.15617 [pdf, other]

ISLE: An Intelligent Streaming Framework for High-Throughput AI Inference in Medical Imaging

Authors: Pranav Kulkarni, Sean Garin, Adway Kanhere, Eliot Siegel, Paul H. Yi, Vishwa S. Parekh

Abstract: As the adoption of Artificial Intelligence (AI) systems within the clinical environment grows, limitations in bandwidth and compute can create communication bottlenecks when streaming imaging data, leading to delays in patient care and increased cost. As such, healthcare providers and AI vendors will require greater computational infrastructure, therefore dramatically increasing costs. To that end… ▽ More As the adoption of Artificial Intelligence (AI) systems within the clinical environment grows, limitations in bandwidth and compute can create communication bottlenecks when streaming imaging data, leading to delays in patient care and increased cost. As such, healthcare providers and AI vendors will require greater computational infrastructure, therefore dramatically increasing costs. To that end, we developed ISLE, an intelligent streaming framework for high-throughput, compute- and bandwidth- optimized, and cost effective AI inference for clinical decision making at scale. In our experiments, ISLE on average reduced data transmission by 98.02% and decoding time by 98.09%, while increasing throughput by 2,730%. We show that ISLE results in faster turnaround times, and reduced overall cost of data, transmission, and compute, without negatively impacting clinical decision making using AI systems. △ Less

Submitted 25 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: 5 pages, 3 figures, 3 tables

arXiv:2305.11219 [pdf, other]

doi 10.1093/mnras/stad1462

MUSE-ALMA Halos XI: Gas flows in the circumgalactic medium

Authors: Simon Weng, Céline Péroux, Arjun Karki, Ramona Augustin, Varsha P. Kulkarni, Aleksandra Hamanowicz, Martin Zwaan, Elaine M. Sadler, Dylan Nelson, Matthew J. Hayes, Glenn G. Kacprzak, Andrew J. Fox, Victoria Bollo, Benedetta Casavecchia, Roland Szakacs

Abstract: The flow of gas into and out of galaxies leaves traces in the circumgalactic medium which can then be studied using absorption lines towards background quasars. We analyse 27 log(N_HI) > 18.0 HI absorbers at z = 0.2 to 1.4 from the MUSE-ALMA Halos survey with at least one galaxy counterpart within a line of sight velocity of +/-500 km s^{-1}. We perform 3D kinematic forward modelling of these asso… ▽ More The flow of gas into and out of galaxies leaves traces in the circumgalactic medium which can then be studied using absorption lines towards background quasars. We analyse 27 log(N_HI) > 18.0 HI absorbers at z = 0.2 to 1.4 from the MUSE-ALMA Halos survey with at least one galaxy counterpart within a line of sight velocity of +/-500 km s^{-1}. We perform 3D kinematic forward modelling of these associated galaxies to examine the flow of dense, neutral gas in the circumgalactic medium. From the VLT/MUSE, HST broadband imaging and VLT/UVES and Keck/HIRES high-resolution UV quasar spectroscopy observations, we compare the impact parameters, star-formation rates and stellar masses of the associated galaxies with the absorber properties. We find marginal evidence for a bimodal distribution in azimuthal angles for strong HI absorbers, similar to previous studies of the MgII and OVI absorption lines. There is no clear metallicity dependence on azimuthal angle and we suggest a larger sample of absorbers are required to fully test the relationship predicted by cosmological hydrodynamical simulations. A case-by-case study of the absorbers reveals that ten per cent of absorbers are consistent with gas accretion, up to 30 per cent trace outflows while the remainder trace gas in the galaxy disk, the intragroup medium and low-mass galaxies below the MUSE detection limit. Our results highlight that the baryon cycle directly affects the dense neutral gas required for star-formation and plays a critical role in galaxy evolution. △ Less

Submitted 18 May, 2023; originally announced May 2023.

Comments: 13 pages, 6 figures, 12 pages of appendix. Accepted for publication in MNRAS

arXiv:2305.07637 [pdf, other]

Text2Cohort: Facilitating Intuitive Access to Biomedical Data with Natural Language Cohort Discovery

Authors: Pranav Kulkarni, Adway Kanhere, Paul H. Yi, Vishwa S. Parekh

Abstract: The Imaging Data Commons (IDC) is a cloud-based database that provides researchers with open access to cancer imaging data, with the goal of facilitating collaboration. However, cohort discovery within the IDC database has a significant technical learning curve. Recently, large language models (LLM) have demonstrated exceptional utility for natural language processing tasks. We developed Text2Coho… ▽ More The Imaging Data Commons (IDC) is a cloud-based database that provides researchers with open access to cancer imaging data, with the goal of facilitating collaboration. However, cohort discovery within the IDC database has a significant technical learning curve. Recently, large language models (LLM) have demonstrated exceptional utility for natural language processing tasks. We developed Text2Cohort, a LLM-powered toolkit to facilitate user-friendly natural language cohort discovery in the IDC. Our method translates user input into IDC queries using grounding techniques and returns the query's response. We evaluate Text2Cohort on 50 natural language inputs, from information extraction to cohort discovery. Our toolkit successfully generated responses with an 88% accuracy and 0.94 F1 score. We demonstrate that Text2Cohort can enable researchers to discover and curate cohorts on IDC with high levels of accuracy using natural language in a more intuitive and user-friendly way. △ Less

Submitted 25 November, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: 5 pages, 3 figures, 2 tables

arXiv:2304.01423 [pdf, other]

Thematic context vector association based on event uncertainty for Twitter

Authors: Vaibhav Khatavkar, Swapnil Mane, Parag Kulkarni

Abstract: Keyword extraction is a crucial process in text mining. The extraction of keywords with respective contextual events in Twitter data is a big challenge. The challenging issues are mainly because of the informality in the language used. The use of misspelled words, acronyms, and ambiguous terms causes informality. The extraction of keywords with informal language in current systems is pattern based… ▽ More Keyword extraction is a crucial process in text mining. The extraction of keywords with respective contextual events in Twitter data is a big challenge. The challenging issues are mainly because of the informality in the language used. The use of misspelled words, acronyms, and ambiguous terms causes informality. The extraction of keywords with informal language in current systems is pattern based or event based. In this paper, contextual keywords are extracted using thematic events with the help of data association. The thematic context for events is identified using the uncertainty principle in the proposed system. The thematic contexts are weighed with the help of vectors called thematic context vectors which signifies the event as certain or uncertain. The system is tested on the Twitter COVID-19 dataset and proves to be effective. The system extracts event-specific thematic context vectors from the test dataset and ranks them. The extracted thematic context vectors are used for the clustering of contextual thematic vectors which improves the silhouette coefficient by 0.5% than state of art methods namely TF and TF-IDF. The thematic context vector can be used in other applications like Cyberbullying, sarcasm detection, figurative language detection, etc. △ Less

Submitted 3 April, 2023; originally announced April 2023.

Comments: 6 pages

arXiv:2303.06180 [pdf, other]

Optimizing Federated Learning for Medical Image Classification on Distributed Non-iid Datasets with Partial Labels

Authors: Pranav Kulkarni, Adway Kanhere, Paul H. Yi, Vishwa S. Parekh

Abstract: Numerous large-scale chest x-ray datasets have spearheaded expert-level detection of abnormalities using deep learning. However, these datasets focus on detecting a subset of disease labels that could be present, thus making them distributed and non-iid with partial labels. Recent literature has indicated the impact of batch normalization layers on the convergence of federated learning due to doma… ▽ More Numerous large-scale chest x-ray datasets have spearheaded expert-level detection of abnormalities using deep learning. However, these datasets focus on detecting a subset of disease labels that could be present, thus making them distributed and non-iid with partial labels. Recent literature has indicated the impact of batch normalization layers on the convergence of federated learning due to domain shift associated with non-iid data with partial labels. To that end, we propose FedFBN, a federated learning framework that draws inspiration from transfer learning by using pretrained networks as the model backend and freezing the batch normalization layers throughout the training process. We evaluate FedFBN with current FL strategies using synthetic iid toy datasets and large-scale non-iid datasets across scenarios with partial and complete labels. Our results demonstrate that FedFBN outperforms current aggregation strategies for training global models using distributed and non-iid data with partial labels. △ Less

Submitted 10 March, 2023; originally announced March 2023.

Comments: 10 pages, 1 algorithm, 4 tables

arXiv:2303.04249 [pdf, other]

Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes

Authors: Brandon Clark, Alec Kerrigan, Parth Parag Kulkarni, Vicente Vivanco Cepeda, Mubarak Shah

Abstract: Determining the exact latitude and longitude that a photo was taken is a useful and widely applicable task, yet it remains exceptionally difficult despite the accelerated progress of other computer vision tasks. Most previous approaches have opted to learn a single representation of query images, which are then classified at different levels of geographic granularity. These approaches fail to expl… ▽ More Determining the exact latitude and longitude that a photo was taken is a useful and widely applicable task, yet it remains exceptionally difficult despite the accelerated progress of other computer vision tasks. Most previous approaches have opted to learn a single representation of query images, which are then classified at different levels of geographic granularity. These approaches fail to exploit the different visual cues that give context to different hierarchies, such as the country, state, and city level. To this end, we introduce an end-to-end transformer-based architecture that exploits the relationship between different geographic levels (which we refer to as hierarchies) and the corresponding visual scene information in an image through hierarchical cross-attention. We achieve this by learning a query for each geographic hierarchy and scene type. Furthermore, we learn a separate representation for different environmental scenes, as different scenes in the same location are often defined by completely different visual features. We achieve state of the art street level accuracy on 4 standard geo-localization datasets : Im2GPS, Im2GPS3k, YFCC4k, and YFCC26k, as well as qualitatively demonstrate how our method learns different representations for different visual hierarchies and scenes, which has not been demonstrated in the previous methods. These previous testing datasets mostly consist of iconic landmarks or images taken from social media, which makes them either a memorization task, or biased towards certain places. To address this issue we introduce a much harder testing dataset, Google-World-Streets-15k, comprised of images taken from Google Streetview covering the whole planet and present state of the art results. Our code will be made available in the camera-ready version. △ Less

Submitted 7 March, 2023; originally announced March 2023.

Comments: CVPR 2023

arXiv:2302.00509 [pdf, other]

Exploring Semantic Perturbations on Grover

Authors: Pranav Kulkarni, Ziqing Ji, Yan Xu, Marko Neskovic, Kevin Nolan

Abstract: With news and information being as easy to access as they currently are, it is more important than ever to ensure that people are not mislead by what they read. Recently, the rise of neural fake news (AI-generated fake news) and its demonstrated effectiveness at fooling humans has prompted the development of models to detect it. One such model is the Grover model, which can both detect neural fake… ▽ More With news and information being as easy to access as they currently are, it is more important than ever to ensure that people are not mislead by what they read. Recently, the rise of neural fake news (AI-generated fake news) and its demonstrated effectiveness at fooling humans has prompted the development of models to detect it. One such model is the Grover model, which can both detect neural fake news to prevent it, and generate it to demonstrate how a model could be misused to fool human readers. In this work we explore the Grover model's fake news detection capabilities by performing targeted attacks through perturbations on input news articles. Through this we test Grover's resilience to these adversarial attacks and expose some potential vulnerabilities which should be addressed in further iterations to ensure it can detect all types of fake news accurately. △ Less

Submitted 1 February, 2023; originally announced February 2023.

Comments: 15 pages, 12 figures, 1 table, capstone research in machine learning

arXiv:2301.11544 [pdf, other]

Targeted Attacks on Timeseries Forecasting

Authors: Yuvaraj Govindarajulu, Avinash Amballa, Pavan Kulkarni, Manojkumar Parmar

Abstract: Real-world deep learning models developed for Time Series Forecasting are used in several critical applications ranging from medical devices to the security domain. Many previous works have shown how deep learning models are prone to adversarial attacks and studied their vulnerabilities. However, the vulnerabilities of time series models for forecasting due to adversarial inputs are not extensivel… ▽ More Real-world deep learning models developed for Time Series Forecasting are used in several critical applications ranging from medical devices to the security domain. Many previous works have shown how deep learning models are prone to adversarial attacks and studied their vulnerabilities. However, the vulnerabilities of time series models for forecasting due to adversarial inputs are not extensively explored. While the attack on a forecasting model might aim to deteriorate the performance of the model, it is more effective, if the attack is focused on a specific impact on the model's output. In this paper, we propose a novel formulation of Directional, Amplitudinal, and Temporal targeted adversarial attacks on time series forecasting models. These targeted attacks create a specific impact on the amplitude and direction of the output prediction. We use the existing adversarial attack techniques from the computer vision domain and adapt them for time series. Additionally, we propose a modified version of the Auto Projected Gradient Descent attack for targeted attacks. We examine the impact of the proposed targeted attacks versus untargeted attacks. We use KS-Tests to statistically demonstrate the impact of the attack. Our experimental results show how targeted attacks on time series models are viable and are more powerful in terms of statistical similarity. It is, hence difficult to detect through statistical methods. We believe that this work opens a new paradigm in the time series forecasting domain and represents an important consideration for develo** better defenses. △ Less

Submitted 27 January, 2023; originally announced January 2023.

arXiv:2301.07074 [pdf, other]

SegViz: A federated-learning based framework for multi-organ segmentation on heterogeneous data sets with partial annotations

Authors: Adway U. Kanhere, Pranav Kulkarni, Paul H. Yi, Vishwa S. Parekh

Abstract: Segmentation is one of the most primary tasks in deep learning for medical imaging, owing to its multiple downstream clinical applications. However, generating manual annotations for medical images is time-consuming, requires high skill, and is an expensive effort, especially for 3D images. One potential solution is to aggregate knowledge from partially annotated datasets from multiple groups to c… ▽ More Segmentation is one of the most primary tasks in deep learning for medical imaging, owing to its multiple downstream clinical applications. However, generating manual annotations for medical images is time-consuming, requires high skill, and is an expensive effort, especially for 3D images. One potential solution is to aggregate knowledge from partially annotated datasets from multiple groups to collaboratively train global models using Federated Learning. To this end, we propose SegViz, a federated learning-based framework to train a segmentation model from distributed non-i.i.d datasets with partial annotations. The performance of SegViz was compared against training individual models separately on each dataset as well as centrally aggregating all the datasets in one place and training a single model. The SegViz framework using FedBN as the aggregation strategy demonstrated excellent performance on the external BTCV set with dice scores of 0.93, 0.83, 0.55, and 0.75 for segmentation of liver, spleen, pancreas, and kidneys, respectively, significantly ($p<0.05$) better (except spleen) than the dice scores of 0.87, 0.83, 0.42, and 0.48 for the baseline models. In contrast, the central aggregation model significantly ($p<0.05$) performed poorly on the test dataset with dice scores of 0.65, 0, 0.55, and 0.68. Our results demonstrate the potential of the SegViz framework to train multi-task models from distributed datasets with partial labels. All our implementations are open-source and available at https://anonymous.4open.science/r/SegViz-B746 △ Less

Submitted 13 March, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

arXiv:2301.06683 [pdf, other]

Surgical Aggregation: Federated Class-Heterogeneous Learning

Authors: Pranav Kulkarni, Adway Kanhere, Paul H. Yi, Vishwa S. Parekh

Abstract: The release of numerous chest x-ray datasets has spearheaded the development of deep learning models with expert-level performance. However, they have limited interoperability due to class-heterogeneity -- a result of inconsistent labeling schemes and partial annotations. Therefore, it is challenging to leverage these datasets in aggregate to train models with a complete representation of abnormal… ▽ More The release of numerous chest x-ray datasets has spearheaded the development of deep learning models with expert-level performance. However, they have limited interoperability due to class-heterogeneity -- a result of inconsistent labeling schemes and partial annotations. Therefore, it is challenging to leverage these datasets in aggregate to train models with a complete representation of abnormalities that may occur within the thorax. In this work, we propose surgical aggregation, a federated learning framework for aggregating knowledge from class-heterogeneous datasets and learn a model that can simultaneously predict the presence of all disease labels present across the datasets. We evaluate our method using simulated and real-world class-heterogeneous datasets across both independent and identically distributed (iid) and non-iid settings. Our results show that surgical aggregation outperforms current methods, has better generalizability, and is a crucial first step towards tackling class-heterogeneity in federated learning to facilitate the development of clinically-useful models using previously non-interoperable chest x-ray datasets. △ Less

Submitted 5 January, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

Comments: 9 pages, 7 figures, 4 tables

arXiv:2212.01395 [pdf, other]

doi 10.1093/mnras/stac3497

MUSE-ALMA Haloes VIII: Statistical Study of Circumgalactic Medium Gas

Authors: Simon Weng, Céline Péroux, Arjun Karki, Ramona Augustin, Varsha P. Kulkarni, Roland Szakacs, Martin A. Zwaan, Anne Klitsch, Aleksandra Hamanowicz, Elaine M. Sadler, Andrew Biggs, Alejandra Y. Fresco, Mattjew Hayes, J. Christopher Howk, Glenn G. Kacprzak, Harald Kuntschner, Dylan Nelson, Max Pettini

Abstract: The distribution of gas and metals in the circumgalactic medium (CGM) plays a critical role in how galaxies evolve. The MUSE-ALMA Halos survey combines MUSE, ALMA and HST observations to constrain the properties of the multi-phase gas in the CGM and the galaxies associated with the gas probed in absorption. In this paper, we analyse the properties of galaxies associated with 32 strong \ion{H}{i} L… ▽ More The distribution of gas and metals in the circumgalactic medium (CGM) plays a critical role in how galaxies evolve. The MUSE-ALMA Halos survey combines MUSE, ALMA and HST observations to constrain the properties of the multi-phase gas in the CGM and the galaxies associated with the gas probed in absorption. In this paper, we analyse the properties of galaxies associated with 32 strong \ion{H}{i} Ly-$α$ absorbers at redshift $0.2 \lesssim z \lesssim 1.4$. We detect 79 galaxies within $\pm 500$ \kms \!of the absorbers in our 19 MUSE fields. These associated galaxies are found at physical distances from 5.7 kpc and reach star-formation rates as low as $0.1$ \Moyr. The significant number of associated galaxies allows us to map their physical distribution on the $Δv$ and $b$ plane. Building on previous studies, we examine the physical and nebular properties of these associated galaxies and find the following: i) 27/32 absorbers have galaxy counterparts and more than 50 per cent of the absorbers have two or more associated galaxies, ii) the \ion{H}{i} column density of absorbers is anti-correlated with the impact parameter (scaled by virial radius) of the nearest galaxy as expected from simulations, iii) the metallicity of associated galaxies is typically larger than the absorber metallicity which decreases at larger impact parameters. It becomes clear that while strong \ion{H}{i} absorbers are typically associated with more than a single galaxy, we can use them to statistically map the gas and metal distribution in the CGM. △ Less

Submitted 2 December, 2022; originally announced December 2022.

Comments: 17 pages, 14 figures and 10 pages of appendices. The associated galaxy catalogue will be made available online. Accepted for publication in MNRAS

arXiv:2211.16517 [pdf, other]

doi 10.1093/mnras/stac2546

MUSE-ALMA Haloes VII: Survey Science Goals & Design, Data Processing and Final Catalogues

Authors: Céline Péroux, Simon Weng, Arjun Karki, Ramona Augustin, Varsha P. Kulkarni, Roland Szakacs, Anne Klitsch, Aleksandra Hamanowicz, Alejandra Y. Fresco, Martin A. Zwaan, Andrew Biggs, Andrew J. Fox, Mattjew Hayes, J. Christopher Howk, Glenn G. Kacprzak, Susan Kassin, Harald Kuntschner, Dylan Nelson, Max Pettini

Abstract: The gas cycling in the circumgalactic regions of galaxies is known to be multi-phase. The MUSE-ALMA Haloes survey gathers a large multi-wavelength observational sample of absorption and emission data with the goal to significantly advance our understanding of the physical properties of such CGM gas. A key component of the MUSE-ALMA Haloes survey is the multi-facility observational campaign conduct… ▽ More The gas cycling in the circumgalactic regions of galaxies is known to be multi-phase. The MUSE-ALMA Haloes survey gathers a large multi-wavelength observational sample of absorption and emission data with the goal to significantly advance our understanding of the physical properties of such CGM gas. A key component of the MUSE-ALMA Haloes survey is the multi-facility observational campaign conducted with VLT/MUSE, ALMA and HST. MUSE-ALMA Haloes targets comprise 19 VLT/MUSE IFS quasar fields, including 32 $z_{\rm abs}<$0.85 strong absorbers with measured N$_{HI}$ $\geq 10^{18}$ cm$^{\rm -2}$ from UV-spectroscopy. We additionally use a new complementary HST medium program to characterise the stellar content of the galaxies through a 40-orbit three-band UVIS and IR WFC3 imaging. Beyond the absorber-selected targets, we detect 3658 sources all fields combined, including 703 objects with spectroscopic redshifts. This galaxy-selected sample constitutes the main focus of the current paper. We have secured millimeter ALMA observations of some of the fields to probe the molecular gas properties of these objects. Here, we present the overall survey science goals, target selection, observational strategy, data processing and source identification of the full sample. Furthermore, we provide catalogues of magnitude measurements for all objects detected in VLT/MUSE, ALMA and HST broad-band images and associated spectroscopic redshifts derived from VLT/MUSE observations. Together, this data set provides robust characterisation of the neutral atomic gas, molecular gas and stars in the same objects resulting in the baryon census of condensed matter in complex galaxy structures. △ Less

Submitted 1 December, 2022; v1 submitted 29 November, 2022; originally announced November 2022.

Comments: 19 pages, 4 figures. This is the final (proof-corrected) version, published in MNRAS. Galaxy catalogues are available online

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 516, Issue 4, November 2022, Pages 5618-5636 November 2022

arXiv:2211.06212 [pdf, other]

From Competition to Collaboration: Making Toy Datasets on Kaggle Clinically Useful for Chest X-Ray Diagnosis Using Federated Learning

Authors: Pranav Kulkarni, Adway Kanhere, Paul H. Yi, Vishwa S. Parekh

Abstract: Chest X-ray (CXR) datasets hosted on Kaggle, though useful from a data science competition standpoint, have limited utility in clinical use because of their narrow focus on diagnosing one specific disease. In real-world clinical use, multiple diseases need to be considered since they can co-exist in the same patient. In this work, we demonstrate how federated learning (FL) can be used to make thes… ▽ More Chest X-ray (CXR) datasets hosted on Kaggle, though useful from a data science competition standpoint, have limited utility in clinical use because of their narrow focus on diagnosing one specific disease. In real-world clinical use, multiple diseases need to be considered since they can co-exist in the same patient. In this work, we demonstrate how federated learning (FL) can be used to make these toy CXR datasets from Kaggle clinically useful. Specifically, we train a single FL classification model (`global`) using two separate CXR datasets -- one annotated for presence of pneumonia and the other for presence of pneumothorax (two common and life-threatening conditions) -- capable of diagnosing both. We compare the performance of the global FL model with models trained separately on both datasets (`baseline`) for two different model architectures. On a standard, naive 3-layer CNN architecture, the global FL model achieved AUROC of 0.84 and 0.81 for pneumonia and pneumothorax, respectively, compared to 0.85 and 0.82, respectively, for both baseline models (p>0.05). Similarly, on a pretrained DenseNet121 architecture, the global FL model achieved AUROC of 0.88 and 0.91 for pneumonia and pneumothorax, respectively, compared to 0.89 and 0.91, respectively, for both baseline models (p>0.05). Our results suggest that FL can be used to create global `meta` models to make toy datasets from Kaggle clinically useful, a step forward towards bridging the gap from bench to bedside. △ Less

Submitted 11 November, 2022; originally announced November 2022.

Comments: Accepted paper for Medical Imaging meet NeurIPS (MedNeurIPS) Workshop 2022

arXiv:2208.08670 [pdf, other]

Approximation Algorithms for Envy-Free Cake Division with Connected Pieces

Authors: Siddharth Barman, Pooja Kulkarni

Abstract: Cake cutting is a classic model for studying fair division of a heterogeneous, divisible resource among agents with individual preferences. Addressing cake division under a typical requirement that each agent must receive a connected piece of the cake, we develop approximation algorithms for finding envy-free (fair) cake divisions. In particular, this work improves the state-of-the-art additive ap… ▽ More Cake cutting is a classic model for studying fair division of a heterogeneous, divisible resource among agents with individual preferences. Addressing cake division under a typical requirement that each agent must receive a connected piece of the cake, we develop approximation algorithms for finding envy-free (fair) cake divisions. In particular, this work improves the state-of-the-art additive approximation bound for this fundamental problem. Our results hold for general cake division instances in which the agents' valuations satisfy basic assumptions and are normalized (to have value $1$ for the cake). Furthermore, the developed algorithms execute in polynomial time under the standard Robertson-Webb query model. Prior work has shown that one can efficiently compute a cake division (with connected pieces) in which the additive envy of any agent is at most $1/3$. An efficient algorithm is also known for finding connected cake divisions that are (almost) $1/2$-multiplicatively envy-free. Improving the additive approximation guarantee and maintaining the multiplicative one, we develop a polynomial-time algorithm that computes a connected cake division that is both $\left(\frac{1}{4} +o(1) \right)$-additively envy-free and $\left(\frac{1}{2} - o(1) \right)$-multiplicatively envy-free. Our algorithm is based on the ideas of interval growing and envy-cycle-elimination. In addition, we study cake division instances in which the number of distinct valuations across the agents is parametrically bounded. We show that such cake division instances admit a fully polynomial-time approximation scheme for connected envy-free cake division. △ Less

Submitted 27 April, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

Comments: 20 pages

arXiv:2208.04973 [pdf, other]

doi 10.3847/1538-4357/ac7b88

On the Kinematics of Cold, Metal-enriched Galactic Fountain Flows in Nearby Star-forming Galaxies

Authors: Kate H. R. Rubin, Christian Juarez, Kathy L. Cooksey, Jessica K. Werk, J. Xavier Prochaska, John M. O'Meara, Joseph N. Burchett, Ryan J. Rickards Vaught, Varsha P. Kulkarni, Lorrie A. Straka

Abstract: We use medium-resolution Keck/Echellette Spectrograph and Imager spectroscopy of bright quasars to study cool gas traced by CaII 3934,3969 and NaI 5891,5897 absorption in the interstellar/circumgalactic media of 21 foreground star-forming galaxies at redshifts 0.03 < z < 0.20 with stellar masses 7.4 < log M_*/M_sun < 10.6. The quasar-galaxy pairs were drawn from a unique sample of Sloan Digital Sk… ▽ More We use medium-resolution Keck/Echellette Spectrograph and Imager spectroscopy of bright quasars to study cool gas traced by CaII 3934,3969 and NaI 5891,5897 absorption in the interstellar/circumgalactic media of 21 foreground star-forming galaxies at redshifts 0.03 < z < 0.20 with stellar masses 7.4 < log M_*/M_sun < 10.6. The quasar-galaxy pairs were drawn from a unique sample of Sloan Digital Sky Survey quasar spectra with intervening nebular emission, and thus have exceptionally close impact parameters (R_perp < 13 kpc). The strength of this line emission implies that the galaxies' star formation rates (SFRs) span a broad range, with several lying well above the star-forming sequence. We use Voigt profile modeling to derive column densities and component velocities for each absorber, finding that column densities N(CaII) > 10^12.5 cm^-2 (N(NaI) > 10^12.0 cm^-2) occur with an incidence f_C(CaII) = 0.63^+0.10_-0.11 (f_C(NaI) = 0.57^+0.10_-0.11). We find no evidence for a dependence of f_C or the rest-frame equivalent widths W_r(CaII K) or W_r(NaI 5891) on R_perp or M_*. Instead, W_r(CaII K) is correlated with local SFR at >3sigma significance, suggesting that CaII traces star formation-driven outflows. While most of the absorbers have velocities within +/-50 km/s of the host redshift, their velocity widths (characterized by Delta v_90) are universally 30-177 km/s larger than that implied by tilted-ring modeling of the velocities of interstellar material. These kinematics must trace galactic fountain flows and demonstrate that they persist at R_perp > 5 kpc. Finally, we assess the relationship between dust reddening and W_r(CaII K) (W_r(NaI 5891)), finding that 33% (24%) of the absorbers are inconsistent with the best-fit Milky Way E(B-V)-W_r relations at >3sigma significance. △ Less

Submitted 9 August, 2022; originally announced August 2022.

Comments: 38 pages, 16 figures, 4 tables. Accepted to ApJ

arXiv:2204.10271 [pdf, ps, other]

A co-kurtosis based dimensionality reduction method for combustion datasets

Authors: Anirudh Jonnalagadda, Shubham P. Kulkarni, Akash Rodhiya, Hemanth Kolla, Konduri Aditya

Abstract: Principal Component Analysis (PCA) is a dimensionality reduction technique widely used to reduce the computational cost associated with numerical simulations of combustion phenomena. However, PCA, which transforms the thermo-chemical state space based on eigenvectors of co-variance of the data, could fail to capture information regarding important localized chemical dynamics, such as the formation… ▽ More Principal Component Analysis (PCA) is a dimensionality reduction technique widely used to reduce the computational cost associated with numerical simulations of combustion phenomena. However, PCA, which transforms the thermo-chemical state space based on eigenvectors of co-variance of the data, could fail to capture information regarding important localized chemical dynamics, such as the formation of ignition kernels, appearing as \rev{extreme-valued} samples in a dataset. In this paper, we propose an alternate dimensionality reduction procedure, co-kurtosis PCA (CoK-PCA), wherein the required principal vectors are computed from a high-order joint statistical moment, namely the co-kurtosis tensor, which may better identify directions in the state space that represent stiff dynamics. We first demonstrate the potential of the proposed CoK-PCA method using a synthetically generated dataset that is representative of typical combustion simulations. Thereafter, we characterize and contrast the accuracy of CoK-PCA against PCA for datasets representing spontaneous ignition of premixed ethylene-air in a simple homogeneous reactor and ethanol-fueled homogeneous charged compression ignition (HCCI) engine. Specifically, we compare the low-dimensional manifolds in terms of reconstruction errors of the original thermo-chemical state, and species production and heat release rates computed from the reconstructed state. \rev{The latter -- a comparison of species production and heat release rates -- is a more rigorous assessment of the accuracy of dimensionality reduction.} We find that, even using a simplistic linear reconstruction, the co-kurtosis based reduced manifold represents the original thermo-chemical state more accurately than PCA, especially in the regions where chemical reactions are important. △ Less

Submitted 28 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

arXiv:2203.14349 [pdf, other]

Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection

Authors: Rajkumar Pujari, Erik Oveson, Priyanka Kulkarni, Elnaz Nouri

Abstract: As large Pre-trained Language Models (PLMs) trained on large amounts of data in an unsupervised manner become more ubiquitous, identifying various types of bias in the text has come into sharp focus. Existing "Stereotype Detection" datasets mainly adopt a diagnostic approach toward large PLMs. Blodgett et. al (2021a) show that there are significant reliability issues with the existing benchmark da… ▽ More As large Pre-trained Language Models (PLMs) trained on large amounts of data in an unsupervised manner become more ubiquitous, identifying various types of bias in the text has come into sharp focus. Existing "Stereotype Detection" datasets mainly adopt a diagnostic approach toward large PLMs. Blodgett et. al (2021a) show that there are significant reliability issues with the existing benchmark datasets. Annotating a reliable dataset requires a precise understanding of the subtle nuances of how stereotypes manifest in text. In this paper, we annotate a focused evaluation set for "Stereotype Detection" that addresses those pitfalls by de-constructing various ways in which stereotypes manifest in text. Further, we present a multi-task model that leverages the abundance of data-rich neighboring tasks such as hate speech detection, offensive language detection, misogyny detection, etc., to improve the empirical performance on "Stereotype Detection". We then propose a reinforcement-learning agent that guides the multi-task learning model by learning to identify the training examples from the neighboring tasks that help the target task the most. We show that the proposed models achieve significant empirical gains over existing baselines on all the tasks. △ Less

Submitted 27 March, 2022; originally announced March 2022.

Comments: Long paper at ACL 2022 main conference

arXiv:2203.13442 [pdf, other]

Dualband and Tripleband Metamaterial absorber in WR1 band and Lower TeraHertz frequencies

Authors: Sarvesh Gharat, Prutha Kulkarni, Shriganesh Prabhu, Chandrashekhar Garde

Abstract: Frequency Band of 0.5 THz to 1.1 THz (WR1.5 and WR1) is one of the promising bands when it comes to 6G. In this paper, we propose a novel metamaterial absorber suitable to be used at lower WR1 frequencies in TE mode. Alternatively, the design can be used as a strong absorber at lower Terahertz frequencies. In addition, when used in TM mode the absorber works as a perfect absorber at higher WR1 ban… ▽ More Frequency Band of 0.5 THz to 1.1 THz (WR1.5 and WR1) is one of the promising bands when it comes to 6G. In this paper, we propose a novel metamaterial absorber suitable to be used at lower WR1 frequencies in TE mode. Alternatively, the design can be used as a strong absorber at lower Terahertz frequencies. In addition, when used in TM mode the absorber works as a perfect absorber at higher WR1 band. △ Less

Submitted 25 March, 2022; originally announced March 2022.

arXiv:2202.12337 [pdf]

Time Efficient Training of Progressive Generative Adversarial Network using Depthwise Separable Convolution and Super Resolution Generative Adversarial Network

Authors: Atharva Karwande, Pranesh Kulkarni, Tejas Kolhe, Akshay Joshi, Soham Kamble

Abstract: Generative Adversarial Networks have been employed successfully to generate high-resolution augmented images of size 1024^2. Although the augmented images generated are unprecedented, the training time of the model is exceptionally high. Conventional GAN requires training of both Discriminator as well as the Generator. In Progressive GAN, which is the current state-of-the-art GAN for image augment… ▽ More Generative Adversarial Networks have been employed successfully to generate high-resolution augmented images of size 1024^2. Although the augmented images generated are unprecedented, the training time of the model is exceptionally high. Conventional GAN requires training of both Discriminator as well as the Generator. In Progressive GAN, which is the current state-of-the-art GAN for image augmentation, instead of training the GAN all at once, a new concept of progressing growing of Discriminator and Generator simultaneously, was proposed. Although the lower stages such as 4x4 and 8x8 train rather quickly, the later stages consume a tremendous amount of time which could take days to finish the model training. In our paper, we propose a novel pipeline that combines Progressive GAN with slight modifications and Super Resolution GAN. Super Resolution GAN up samples low-resolution images to high-resolution images which can prove to be a useful resource to reduce the training time exponentially. △ Less

Submitted 24 February, 2022; originally announced February 2022.

arXiv:2112.00870 [pdf]

doi 10.3847/1538-4357/ac5fab

Damped Ly-alpha Absorbers in Star-forming Galaxies at z < 0.15 Detected with the Hubble Space Telescope and Implications for Galaxy Evolution

Authors: Varsha P. Kulkarni, David V. Bowen, Lorrie A. Straka, Donald G. York, Neeraj Gupta, Pasquier Noterdaeme, Raghunathan Srianand

Abstract: We report {\it HST} COS spectroscopy of 10 quasars with foreground star-forming galaxies at 0.02$<$$z$$<$ 0.14 within impact parameters of $\sim$1-7 kpc. We detect damped/sub-damped Ly$α$ absorption in 100$\%$ of cases where no higher-redshift Lyman-limit systems extinguish the flux at the expected wavelength of Ly$α$ absorption, obtaining the largest targeted sample of DLA/sub-DLAs in low-redshif… ▽ More We report {\it HST} COS spectroscopy of 10 quasars with foreground star-forming galaxies at 0.02$<$$z$$<$ 0.14 within impact parameters of $\sim$1-7 kpc. We detect damped/sub-damped Ly$α$ absorption in 100$\%$ of cases where no higher-redshift Lyman-limit systems extinguish the flux at the expected wavelength of Ly$α$ absorption, obtaining the largest targeted sample of DLA/sub-DLAs in low-redshift galaxies. We present absorption measurements of neutral hydrogen and metals. Additionally, we present GBT 21-cm emission measurements for 5 of the galaxies (including 2 detections). Combining our sample with the literature, we construct a sample of 115 galaxies associated with DLA/sub-DLAs spanning 0$<$$z$$<$4.4, and examine trends between gas and stellar properties, and with redshift. The H~I column density is anti-correlated with impact parameter and stellar mass. More massive galaxies appear to have gas-rich regions out to larger distances. The specific SFR (sSFR) of absorbing galaxies increases with redshift and decreases with $M^{\ast}$, consistent with evolution of the star-formation main sequence (SFMS). However, $\sim$20$\%$ of absorbing galaxies lie below the SFMS, indicating that some DLA/sub-DLAs trace galaxies with longer-than-typical gas-depletion time-scales. Most DLA/sub-DLA galaxies with 21-cm emission have higher H I masses than typical galaxies with comparable $M^{\ast}$. High $M_{\rm H I}/M^{\ast}$ ratios and high sSFRs in DLA/sub-DLA galaxies with $M^{\ast}$$<$$10^{9}$$M_{\odot}$ suggest these galaxies may be gas-rich because of recent gas accretion rather than inefficient star formation. Our study demonstrates the power of absorption and emission studies of DLA/sub-DLA galaxies for extending galaxy-evolution studies to previously under-explored regimes of low $M^{\ast}$ and low SFR. △ Less

Submitted 1 December, 2021; originally announced December 2021.

Comments: 51 pages, 12 figures. Submitted to the Astrophysical Journal

arXiv:2110.00767 [pdf, other]

Sublinear Approximation Algorithm for Nash Social Welfare with XOS Valuations

Authors: Siddharth Barman, Anand Krishna, Pooja Kulkarni, Shivika Narang

Abstract: We study the problem of allocating indivisible goods among $n$ agents with the objective of maximizing Nash social welfare (NSW). This welfare function is defined as the geometric mean of the agents' valuations and, hence, it strikes a balance between the extremes of social welfare (arithmetic mean) and egalitarian welfare (max-min value). Nash social welfare has been extensively studied in recent… ▽ More We study the problem of allocating indivisible goods among $n$ agents with the objective of maximizing Nash social welfare (NSW). This welfare function is defined as the geometric mean of the agents' valuations and, hence, it strikes a balance between the extremes of social welfare (arithmetic mean) and egalitarian welfare (max-min value). Nash social welfare has been extensively studied in recent years for various valuation classes. In particular, a notable negative result is known when the agents' valuations are complement-free and are specified via value queries: for XOS valuations, one necessarily requires exponentially many value queries to find any sublinear (in $n$) approximation for NSW. Indeed, this lower bound implies that stronger query models are needed for finding better approximations. Towards this, we utilize demand oracles and XOS oracles; both of these query models are standard and have been used in prior work on social welfare maximization with XOS valuations. We develop the first sublinear approximation algorithm for maximizing Nash social welfare under XOS valuations, specified via demand and XOS oracles. Hence, this work breaks the $O(n)$-approximation barrier for NSW maximization under XOS valuations. We obtain this result by develo** a novel connection between NSW and social welfare under a capped version of the agents' valuations. In addition to this insight, which might be of independent interest, this work relies on an intricate combination of multiple technical ideas, including the use of repeated matchings and the discrete moving knife method. In addition, we partially complement the algorithmic result by showing that, under XOS valuations, an exponential number of demand and XOS queries are necessarily required to approximate NSW within a factor of $\left(1 - \frac{1}{e}\right)$. △ Less

Submitted 15 July, 2022; v1 submitted 2 October, 2021; originally announced October 2021.

Comments: 41 pages

arXiv:2108.06947 [pdf]

Contextual Mood Analysis with Knowledge Graph Representation for Hindi Song Lyrics in Devanagari Script

Authors: Makarand Velankar, Rachita Kotian, Parag Kulkarni

Abstract: Lyrics play a significant role in conveying the song's mood and are information to understand and interpret music communication. Conventional natural language processing approaches use translation of the Hindi text into English for analysis. This approach is not suitable for lyrics as it is likely to lose the inherent intended contextual meaning. Thus, the need was identified to develop a system f… ▽ More Lyrics play a significant role in conveying the song's mood and are information to understand and interpret music communication. Conventional natural language processing approaches use translation of the Hindi text into English for analysis. This approach is not suitable for lyrics as it is likely to lose the inherent intended contextual meaning. Thus, the need was identified to develop a system for Devanagari text analysis. The data set of 300 song lyrics with equal distribution in five different moods is used for the experimentation. The proposed system performs contextual mood analysis of Hindi song lyrics in Devanagari text format. The contextual analysis is stored as a knowledge base, updated using an incremental learning approach with new data. Contextual knowledge graph with moods and associated important contextual terms provides the graphical representation of the lyric data set used. The testing results show 64% accuracy for the mood prediction. This work can be easily extended to applications related to Hindi literary work such as summarization, indexing, contextual retrieval, context-based classification and grou** of documents. △ Less

Submitted 16 August, 2021; originally announced August 2021.

Comments: 16 pages

arXiv:2107.09871 [pdf, ps, other]

On Fair and Efficient Allocations of Indivisible Public Goods

Authors: Jugal Garg, Pooja Kulkarni, Aniket Murhekar

Abstract: We study fair allocation of indivisible public goods subject to cardinality (budget) constraints. In this model, we have n agents and m available public goods, and we want to select $k \leq m$ goods in a fair and efficient manner. We first establish fundamental connections between the models of private goods, public goods, and public decision making by presenting polynomial-time reductions for the… ▽ More We study fair allocation of indivisible public goods subject to cardinality (budget) constraints. In this model, we have n agents and m available public goods, and we want to select $k \leq m$ goods in a fair and efficient manner. We first establish fundamental connections between the models of private goods, public goods, and public decision making by presenting polynomial-time reductions for the popular solution concepts of maximum Nash welfare (MNW) and leximin. These mechanisms are known to provide remarkable fairness and efficiency guarantees in private goods and public decision making settings. We show that they retain these desirable properties even in the public goods case. We prove that MNW allocations provide fairness guarantees of Proportionality up to one good (Prop1), $1/n$ approximation to Round Robin Share (RRS), and the efficiency guarantee of Pareto Optimality (PO). Further, we show that the problems of finding MNW or leximin-optimal allocations are NP-hard, even in the case of constantly many agents, or binary valuations. This is in sharp contrast to the private goods setting that admits polynomial-time algorithms under binary valuations. We also design pseudo-polynomial time algorithms for computing an exact MNW or leximin-optimal allocation for the cases of (i) constantly many agents, and (ii) constantly many goods with additive valuations. We also present an O(n)-factor approximation algorithm for MNW which also satisfies RRS, Prop1, and 1/2-Prop. △ Less

Submitted 21 July, 2021; originally announced July 2021.

Comments: 25 pages

arXiv:2106.10698 [pdf]

Plant Disease Detection Using Image Processing and Machine Learning

Authors: Pranesh Kulkarni, Atharva Karwande, Tejas Kolhe, Soham Kamble, Akshay Joshi, Medha Wyawahare

Abstract: One of the important and tedious task in agricultural practices is the detection of the disease on crops. It requires huge time as well as skilled labor. This paper proposes a smart and efficient technique for detection of crop disease which uses computer vision and machine learning techniques. The proposed system is able to detect 20 different diseases of 5 common plants with 93% accuracy. One of the important and tedious task in agricultural practices is the detection of the disease on crops. It requires huge time as well as skilled labor. This paper proposes a smart and efficient technique for detection of crop disease which uses computer vision and machine learning techniques. The proposed system is able to detect 20 different diseases of 5 common plants with 93% accuracy. △ Less

Submitted 22 November, 2021; v1 submitted 20 June, 2021; originally announced June 2021.

arXiv:2106.08938 [pdf, other]

Memory Leak Detection Algorithms in the Cloud-based Infrastructure

Authors: Anshul **dal, Paul Staab, Pooja Kulkarni, Jorge Cardoso, Michael Gerndt, Vladimir Podolskiy

Abstract: A memory leak in an application deployed on the cloud can affect the availability and reliability of the application. Therefore, identifying and ultimately resolve it quickly is highly important. However, in the production environment running on the cloud, memory leak detection is a challenge without the knowledge of the application or its internal object allocation details. This paper addresses… ▽ More A memory leak in an application deployed on the cloud can affect the availability and reliability of the application. Therefore, identifying and ultimately resolve it quickly is highly important. However, in the production environment running on the cloud, memory leak detection is a challenge without the knowledge of the application or its internal object allocation details. This paper addresses this challenge of detection of memory leaks in cloud-based infrastructure without having any internal knowledge by introducing two novel machine learning-based algorithms: Linear Backward Regression (LBR) and Precog and, their two variants: Linear Backward Regression with Change Points Detection (LBRCPD) and Precog with Maximum Filteration (PrecogMF). These algorithms only use one metric i.e the system's memory utilization on which the application is deployed for detection of a memory leak. The developed algorithm's accuracy was tested on 60 virtual machines manually labeled memory utilization data and it was found that the proposed PrecogMF algorithm achieves the highest accuracy score of 85%. The same algorithm also achieves this by decreasing the overall compute time by 80% when compared to LBR's compute time. The paper also presents the different memory leak patterns found in the various memory leak applications and are further classified into different classes based on their visual representation. △ Less

Submitted 16 June, 2021; originally announced June 2021.

Comments: 10. pages. arXiv admin note: substantial text overlap with arXiv:2101.09799

arXiv:2103.01272 [pdf, other]

Geometry-Based Gras** of Vine Tomatoes

Authors: Taeke de Haan, Padmaja Kulkarni, Robert Babuska

Abstract: We propose a geometry-based gras** method for vine tomatoes. It relies on a computer-vision pipeline to identify the required geometric features of the tomatoes and of the truss stem. The gras** method then uses a geometric model of the robotic hand and the truss to determine a suitable gras** location on the stem. This approach allows for gras** tomato trusses without requiring delicate c… ▽ More We propose a geometry-based gras** method for vine tomatoes. It relies on a computer-vision pipeline to identify the required geometric features of the tomatoes and of the truss stem. The gras** method then uses a geometric model of the robotic hand and the truss to determine a suitable gras** location on the stem. This approach allows for gras** tomato trusses without requiring delicate contact sensors or complex mechanistic models and under minimal risk of damaging the tomatoes. Lab experiments were conducted to validate the proposed methods, using an RGB-D camera and a low-cost robotic manipulator. The success rate was 83% to 92%, depending on the type of truss. △ Less

Submitted 1 March, 2021; originally announced March 2021.

Comments: 8 pages, 12 figures. This work has been submitted to the IEEE for possible publication (IROS + RAL). Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2102.13523 [pdf, other]

doi 10.3847/1538-3881/abd2b0

Significant H I and Metal Differences around the z = 0.83 Lens Galaxy Towards the Doubly Lensed Quasar SBS 0909+532

Authors: Frances H. Cashman, Varsha P. Kulkarni, Sebastian Lopez

Abstract: We report a large difference in neutral hydrogen (H I) and metal column densities between the two sight lines probing opposite sides of the lensing galaxy at $z_\mathrm{lens}$ = 0.83 toward the doubly lensed quasar SBS 0909+532. Using archival HST-STIS and Keck HIRES spectra of the lensed quasar images, we measure log $N_\mathrm{H\;I}$ = 18.77 $\pm$ 0.12 cm$^{-2}$ toward the brighter image ($A$) a… ▽ More We report a large difference in neutral hydrogen (H I) and metal column densities between the two sight lines probing opposite sides of the lensing galaxy at $z_\mathrm{lens}$ = 0.83 toward the doubly lensed quasar SBS 0909+532. Using archival HST-STIS and Keck HIRES spectra of the lensed quasar images, we measure log $N_\mathrm{H\;I}$ = 18.77 $\pm$ 0.12 cm$^{-2}$ toward the brighter image ($A$) at an impact parameter of $r_A$ = 3.15 kpc and log $N_\mathrm{H\;I}$ = 20.38 $\pm$ 0.20 cm$^{-2}$ toward the fainter image ($B$) at an impact parameter of $r_B$ = 5.74 kpc. This difference by a factor of $\sim$41 is the highest difference between sight lines for a lens galaxy in which H I has been measured, suggesting patchiness and/or anisotropy on these scales. We estimate an average Fe abundance gradient between the sight lines to be $\geq$ +0.35 dex kpc$^{-1}$. The $N_\mathrm{Fe\;II}$/$N_\mathrm{Mg\;II}$ ratios for the individual components detected in the Keck HIRES spectra have supersolar values for all components in sight line $A$ and for 11 out of 18 components in sight line $B$, suggesting that Type Ia supernovae may have contributed to the chemical enrichment of the galaxy's environment. Additionally, these observations provide complementary information to detections of cold gas in early-type galaxies and the tension between these and some models of cloud survival. △ Less

Submitted 9 March, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

Comments: 20 pages, 6 tables, 11 figures

Journal ref: AJ 161 90 (2021)

arXiv:2102.10117 [pdf, other]

doi 10.3847/1538-4357/abef6a

The Geometry of Cold, Metal-Enriched Gas Around Galaxies at $z\sim1.2$

Authors: Britt F. Lundgren, Samantha Creech, Gabriel Brammer, Nathan Kirse, Matthew Peek, David Wake, Donald G. York, John Chisholm, Dawn K. Erb, Varsha P. Kulkarni, Lorrie Straka, Christy Tremonti, Pieter van Dokkum

Abstract: We present the first results from a Hubble Space Telescope WFC3/IR program, which obtained direct imaging and grism observations of galaxies near quasar sightlines with a high frequency of uncorrelated foreground Mg II absorption. These highly efficient observations targeted 54 Mg II absorbers along the line of sight to nine quasars at $z_{qso}\sim2$. We find that 89% of the absorbers in the range… ▽ More We present the first results from a Hubble Space Telescope WFC3/IR program, which obtained direct imaging and grism observations of galaxies near quasar sightlines with a high frequency of uncorrelated foreground Mg II absorption. These highly efficient observations targeted 54 Mg II absorbers along the line of sight to nine quasars at $z_{qso}\sim2$. We find that 89% of the absorbers in the range $0.64< z < 1.6$ can be spectroscopically matched to at least one galaxy with an impact parameter less than 200 kpc and $|Δz|/(1+z)<0.006$. We have estimated the star formation rates and measured structural parameters for all detected galaxies with impact parameters in the range 7-200 kpc and star formation rates greater than 1.3 M$_{\odot}$ yr$^{-1}$. We find that galaxies associated with Mg II absorption have significantly higher mean star formation rates and marginally higher mean star formation rate surface densities compared to galaxies with no detected Mg II. Nearly half of the Mg II absorbers match to more than one galaxy, and the mean equivalent width of the Mg II absorption is found to be greater for groups, compared to isolated galaxies. Additionally, we observe a significant redshift evolution in the physical extent of Mg II-absorbing gas around galaxies and evidence of an enhancement of Mg II within 50 degrees of the minor axis, characteristic of outflows, which persists to 80 kpc around the galaxies, in agreement with recent predictions from simulations. △ Less

Submitted 19 February, 2021; originally announced February 2021.

Comments: 21 pages, 20 figures, Submitted to ApJ

arXiv:2101.00188 [pdf, other]

doi 10.1051/0004-6361/202040167

PKS1830-211: OH and HI at z=0.89 and the first MeerKAT UHF spectrum

Authors: F. Combes, N. Gupta, S. Muller, S. Balashev, G. I. G. Jozsa, R. Srianand, E. Momjian, P. Noterdaeme, H. -R. Kloeckner, A. J. Baker, E. Boettcher, A. Bosma, H. -W. Chen, R. Dutta, P. Jagannathan, J. Jose, K. Knowles, J-. K. Krogager, V. P. Kulkarni, K. Moodley, S. Pandey, P. Petitjean, S. Sekhar

Abstract: The Large Survey Project (LSP) "MeerKAT Absorption Line Survey" (MALS) is a blind HI 21-cm and OH 18-cm absorption line survey in the L- and UHF-bands, with the primary goal to better determine the occurrence of atomic and molecular gas in the circum-galactic and inter-galactic medium, and its redshift evolution. Here we present the first results using the UHF-band, obtained towards the strongly l… ▽ More The Large Survey Project (LSP) "MeerKAT Absorption Line Survey" (MALS) is a blind HI 21-cm and OH 18-cm absorption line survey in the L- and UHF-bands, with the primary goal to better determine the occurrence of atomic and molecular gas in the circum-galactic and inter-galactic medium, and its redshift evolution. Here we present the first results using the UHF-band, obtained towards the strongly lensed radio source PKS1830, detecting absorption in the lens galaxy. With merely 90min of data acquired on-source for science verification and processed using the Automated Radio Telescope Imaging Pipeline (ARTIP), we detect in absorption the known HI 21-cm and OH 18-cm main lines at z=0.89 at an unprecedented signal-to-noise ratio (4000 in the continuum, with 6km/s channels). For the first time we report the detection at z=0.89 of OH satellite lines, so far not detected at z $>$ 0.25. We decompose the OH lines into a thermal and a stimulated contribution, where the 1612 and 1720MHz lines are conjugate. The total OH 1720MHz emission line luminosity is 6100Lsun. This is the most luminous known 1720MHz maser line. The absorption components of the different images of the background source sample different light paths in the lensing galaxy, and their weights in the total absorption spectrum are expected to vary in time, on daily and monthly time scales. We compare our normalized spectra with those obtained more than 20 yrs ago, and find no variation. We interpret the absorption spectra with the help of a lens galaxy model, derived from an N-body hydro-dynamical simulation, with a morphology similar to its optical HST image. It is possible to reproduce the observations without invoking any central gas outflows. There are, however, distinct and faint high-velocity features, most likely high-velocity clouds. These clouds may contribute to broaden the HI and OH spectra. △ Less

Submitted 22 February, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

Comments: 13 pages, 11 figures, accepted in Astronomy and Astrophysics

Journal ref: A&A 648, A116 (2021)

arXiv:2010.02600 [pdf, other]

Converting the Point of View of Messages Spoken to Virtual Assistants

Authors: Isabelle G. Lee, Vera Zu, Sai Srujana Buddi, Dennis Liang, Purva Kulkarni, Jack G. M. Fitzgerald

Abstract: Virtual Assistants can be quite literal at times. If the user says "tell Bob I love him," most virtual assistants will extract the message "I love him" and send it to the user's contact named Bob, rather than properly converting the message to "I love you." We designed a system to allow virtual assistants to take a voice message from one user, convert the point of view of the message, and then del… ▽ More Virtual Assistants can be quite literal at times. If the user says "tell Bob I love him," most virtual assistants will extract the message "I love him" and send it to the user's contact named Bob, rather than properly converting the message to "I love you." We designed a system to allow virtual assistants to take a voice message from one user, convert the point of view of the message, and then deliver the result to its target user. We developed a rule-based model, which integrates a linear text classification model, part-of-speech tagging, and constituency parsing with rule-based transformation methods. We also investigated Neural Machine Translation (NMT) approaches, including LSTMs, CopyNet, and T5. We explored 5 metrics to gauge both naturalness and faithfulness automatically, and we chose to use BLEU plus METEOR for faithfulness and relative perplexity using a separately trained language model (GPT) for naturalness. Transformer-Copynet and T5 performed similarly on faithfulness metrics, with T5 achieving slight edge, a BLEU score of 63.8 and a METEOR score of 83.0. CopyNet was the most natural, with a relative perplexity of 1.59. CopyNet also has 37 times fewer parameters than T5. We have publicly released our dataset, which is composed of 46,565 crowd-sourced samples. △ Less

Submitted 7 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

Comments: 10 pages, 11 figures, Findings of EMNLP 2020

arXiv:2008.10207 [pdf, ps, other]

doi 10.1093/mnras/stab926

Metals and a Search for Molecules in the Distant Universe: Magellan MIKE Observations of sub-DLAs at 2<z<3

Authors: Suraj Poudel, Varsha P. Kulkarni, Debopam Som, Celine Peroux

Abstract: We present abundance measurements of the elements Zn, S, O, C, Si and Fe for four sub-DLAs at redshifts ranging from z=2.173 to z=2.635 using observations from the MIKE spectrograph on the Magellan telescope to constrain the chemical enrichment and star formation of gas-rich galaxies. Using weakly depleted elements O, S, and or Zn, we find the metallicities after the photoionization corrections to… ▽ More We present abundance measurements of the elements Zn, S, O, C, Si and Fe for four sub-DLAs at redshifts ranging from z=2.173 to z=2.635 using observations from the MIKE spectrograph on the Magellan telescope to constrain the chemical enrichment and star formation of gas-rich galaxies. Using weakly depleted elements O, S, and or Zn, we find the metallicities after the photoionization corrections to be [S/H]=-0.50\pm0.11, [O/H]>-0.84, [O/H]=-1.27\pm0.12, and [Zn/H]=+0.40\pm0.12 for the absorbers at z=2.173, 2.236, 2.539, and 2.635, respectively. Moreover, we are able to put constraints on the electron densities using the fine structure lines of C II* and Si II* for two of the sub-DLAs. We find that these values are much higher than the median values found in DLAs in the literature. Furthermore, we estimate the cooling rate lc=1.20\times10-26 erg s-1 per H atom for an absorber at z=2.173, suggesting higher star formation rate density in this sub-DLA than the typical star formation rate density for DLAs at similar redshifts. We also study the metallicity versus velocity dispersion relation for our absorbers. Most of the absorbers follow the trend one can expect from the mass versus metallicity relation for sub-DLAs in the literature. Finally, we are able to put limits on the molecular column density from the non detections of various strong lines of CO molecules. We estimate 3σupper limits of log N(CO,J=0)<13.87, log N(CO,J=0)<13.17, and log N(CO,J=0)<13.08, respectively, from the non-detections of absorption from the J=0 level in the CO AX 0-0, 1-0, and 2-0 bands near 1544, 1510, and 1478Å. △ Less

Submitted 29 March, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

Comments: 14 pages, 20 figures, accepted for publication in MNRAS

Showing 1–50 of 165 results for author: Kulkarni, P