Search | arXiv e-print repository

Understanding the approach to thermalization from the eigenspectrum of non-Abelian gauge theories

Authors: Harshit Pandey, Ravi Shanker, Sayantan Sharma

Abstract: We study the spectral properties of SU(3) gauge theory with and without dynamical quarks (QCD) at thermal equilibrium using lattice gauge theory techniques. By measuring eigenstates of a massless overlap Dirac operator on the gauge configurations, we provide a gauge invariant method to study spectral properties of non-Abelian gauge theories. Whereas the majority of these eigenstates below the magn… ▽ More We study the spectral properties of SU(3) gauge theory with and without dynamical quarks (QCD) at thermal equilibrium using lattice gauge theory techniques. By measuring eigenstates of a massless overlap Dirac operator on the gauge configurations, we provide a gauge invariant method to study spectral properties of non-Abelian gauge theories. Whereas the majority of these eigenstates below the magnetic scale have universal nearest-neighbor level spacing fluctuations consistent with certain class of random matrix theories at temperatures away from the chiral crossover transition in QCD, a few among them start to become prominent just above the crossover forming clusters percolating over the entire volume. By matching the non-perturbative magnetic scales in a high temperature thermal state and a particular non-equilibrium chaotic state of QCD, we provide an estimate of thermalization time $\sim 1.44$ fm/c. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 7 pages, 4 figures

arXiv:2407.08769 [pdf, other]

AuNR-SMA: Automated Gold Nanorod Spectral Morphology Analysis Pipeline

Authors: Samuel P. Gleason, Jakob C. Dahl, Mahmoud Elzouka, Xingzhi Wang, Dana O. Byrne, Mumtaz Gababa, Hannah Cho, Ravi Prasher, Sean Lubner, Emory Chan, A. Paul Alivisatos

Abstract: The development of a colloidal synthesis procedure to produce nanomaterials of a specific size with high shape and size purity is often a time consuming, iterative process. This is often due to the time, resource and expertise intensive characterization methods required for quantitative determination of nanomaterial size and shape. Absorption spectroscopy is often the easiest method of colloidal n… ▽ More The development of a colloidal synthesis procedure to produce nanomaterials of a specific size with high shape and size purity is often a time consuming, iterative process. This is often due to the time, resource and expertise intensive characterization methods required for quantitative determination of nanomaterial size and shape. Absorption spectroscopy is often the easiest method of colloidal nanomaterial characterization, however, due to the lack of a reliable method to extract nanoparticle shapes from absorption spectroscopy, it is generally treated as a more qualitative measure for metal nanoparticles. This work demonstrates a gold nanorod (AuNR) spectral morphology analysis (SMA) tool, AuNR-SMA, which is a fast and accurate method to extract quantitative information about an AuNR sample's structural parameters from its absorption spectra. We apply AuNR-SMA in three distinct applications. First, we demonstrate its utility as an automated analysis tool in a high throughput AuNR synthesis procedure by generating quantitative size information from optical spectra. Second, we use the predictions generated by this model to train a machine learning model capable of predicting the resulting AuNR size distributions from the reaction conditions used to synthesize them. Third, we turn this model to spectra extracted from the literature where no size distributions are reported to impute unreported quantitative information of AuNR synthesis. This approach can potentially be extended to any other nanocrystal system where the absorption spectra are size dependent and accurate numerical simulation of the absorption spectra is possible. In addition, this pipeline could be integrated into automated synthesis apparatuses to provide interpretable data from simple measurements and help explore the synthesis science of nanoparticles in a rational manner or facilitate closed-loop workflows. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08747 [pdf, ps, other]

The stacky concentration theorem

Authors: Dhyan Aranha, Adeel A. Khan, Alexei Latyntsev, Hyeonjun Park, Charanya Ravi

Abstract: We give a sufficient criterion for the Chow or algebraic bordism groups of an algebraic stack, localized at a set of Chern classes of line bundles, to be concentrated in some closed substack. This is a vast generalization of the torus fixed-point localization theorem in equivariant intersection theory, which is the special case of the stack quotient of a scheme $X$ by an action of a torus $T$. Tak… ▽ More We give a sufficient criterion for the Chow or algebraic bordism groups of an algebraic stack, localized at a set of Chern classes of line bundles, to be concentrated in some closed substack. This is a vast generalization of the torus fixed-point localization theorem in equivariant intersection theory, which is the special case of the stack quotient of a scheme $X$ by an action of a torus $T$. Taking on the one hand an algebraic stack in place of $X$, we deduce a generalization of torus localization to algebraic stacks. Taking on the other hand any algebraic group $G$ instead of $T$, we obtain a localization theorem in $G$-equivariant intersection theory. △ Less

Submitted 13 June, 2024; originally announced July 2024.

Comments: 32 pages; split off from arXiv:2207.01652 and revised exposition

arXiv:2407.08488 [pdf, other]

Lynx: An Open Source Hallucination Evaluation Model

Authors: Selvan Sunitha Ravi, Bartosz Mielczarek, Anand Kannappan, Douwe Kiela, Rebecca Qian

Abstract: Retrieval Augmented Generation (RAG) techniques aim to mitigate hallucinations in Large Language Models (LLMs). However, LLMs can still produce information that is unsupported or contradictory to the retrieved contexts. We introduce LYNX, a SOTA hallucination detection LLM that is capable of advanced reasoning on challenging real-world hallucination scenarios. To evaluate LYNX, we present HaluBenc… ▽ More Retrieval Augmented Generation (RAG) techniques aim to mitigate hallucinations in Large Language Models (LLMs). However, LLMs can still produce information that is unsupported or contradictory to the retrieved contexts. We introduce LYNX, a SOTA hallucination detection LLM that is capable of advanced reasoning on challenging real-world hallucination scenarios. To evaluate LYNX, we present HaluBench, a comprehensive hallucination evaluation benchmark, consisting of 15k samples sourced from various real-world domains. Our experiment results show that LYNX outperforms GPT-4o, Claude-3-Sonnet, and closed and open-source LLM-as-a-judge models on HaluBench. We release LYNX, HaluBench and our evaluation code for public access. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.07254 [pdf, other]

HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment

Authors: K M Arefeen Sultan, Md Hasibul Husain Hisham, Benjamin Orkild, Alan Morris, Eugene Kholmovski, Erik Bieging, Eugene Kwan, Ravi Ranjan, Ed DiBella, Shireen Elhabian

Abstract: The accurate evaluation of left atrial fibrosis via high-quality 3D Late Gadolinium Enhancement (LGE) MRI is crucial for atrial fibrillation management but is hindered by factors like patient movement and imaging variability. The pursuit of automated LGE MRI quality assessment is critical for enhancing diagnostic accuracy, standardizing evaluations, and improving patient outcomes. The deep learnin… ▽ More The accurate evaluation of left atrial fibrosis via high-quality 3D Late Gadolinium Enhancement (LGE) MRI is crucial for atrial fibrillation management but is hindered by factors like patient movement and imaging variability. The pursuit of automated LGE MRI quality assessment is critical for enhancing diagnostic accuracy, standardizing evaluations, and improving patient outcomes. The deep learning models aimed at automating this process face significant challenges due to the scarcity of expert annotations, high computational costs, and the need to capture subtle diagnostic details in highly variable images. This study introduces HAMIL-QA, a multiple instance learning (MIL) framework, designed to overcome these obstacles. HAMIL-QA employs a hierarchical bag and sub-bag structure that allows for targeted analysis within sub-bags and aggregates insights at the volume level. This hierarchical MIL approach reduces reliance on extensive annotations, lessens computational load, and ensures clinically relevant quality predictions by focusing on diagnostically critical image features. Our experiments show that HAMIL-QA surpasses existing MIL methods and traditional supervised approaches in accuracy, AUROC, and F1-Score on an LGE MRI scan dataset, demonstrating its potential as a scalable solution for LGE MRI quality assessment automation. The code is available at: $\href{https://github.com/arf111/HAMIL-QA}{\text{this https URL}}$ △ Less

Submitted 9 July, 2024; originally announced July 2024.

Comments: Accepted to MICCAI2024, 10 pages, 2 figures

arXiv:2407.06910 [pdf, other]

Fine-grained large-scale content recommendations for MSX sellers

Authors: Manpreet Singh, Ravdeep Pasricha, Ravi Prasad Kondapalli, Kiran R, Nitish Singh, Akshita Agarwalla, Manoj R, Manish Prabhakar, Laurent Boué

Abstract: One of the most critical tasks of Microsoft sellers is to meticulously track and nurture potential business opportunities through proactive engagement and tailored solutions. Recommender systems play a central role to help sellers achieve their goals. In this paper, we present a content recommendation model which surfaces various types of content (technical documentation, comparison with competito… ▽ More One of the most critical tasks of Microsoft sellers is to meticulously track and nurture potential business opportunities through proactive engagement and tailored solutions. Recommender systems play a central role to help sellers achieve their goals. In this paper, we present a content recommendation model which surfaces various types of content (technical documentation, comparison with competitor products, customer success stories etc.) that sellers can share with their customers or use for their own self-learning. The model operates at the opportunity level which is the lowest possible granularity and the most relevant one for sellers. It is based on semantic matching between metadata from the contents and carefully selected attributes of the opportunities. Considering the volume of seller-managed opportunities in organizations such as Microsoft, we show how to perform efficient semantic matching over a very large number of opportunity-content combinations. The main challenge is to ensure that the top-5 relevant contents for each opportunity are recommended out of a total of $\approx 40,000$ published contents. We achieve this target through an extensive comparison of different model architectures and feature selection. Finally, we further examine the quality of the recommendations in a quantitative manner using a combination of human domain experts as well as by using the recently proposed "LLM as a judge" framework. △ Less

Submitted 9 July, 2024; originally announced July 2024.

Journal ref: Microsoft Journal of Applied Research, Volume 21, 2024

arXiv:2407.06093 [pdf, other]

Artificial Intuition: Efficient Classification of Scientific Abstracts

Authors: Harsh Sakhrani, Naseela Pervez, Anirudh Ravi Kumar, Fred Morstatter, Alexandra Graddy Reed, Andrea Belz

Abstract: It is desirable to coarsely classify short scientific texts, such as grant or publication abstracts, for strategic insight or research portfolio management. These texts efficiently transmit dense information to experts possessing a rich body of knowledge to aid interpretation. Yet this task is remarkably difficult to automate because of brevity and the absence of context. To address this gap, we h… ▽ More It is desirable to coarsely classify short scientific texts, such as grant or publication abstracts, for strategic insight or research portfolio management. These texts efficiently transmit dense information to experts possessing a rich body of knowledge to aid interpretation. Yet this task is remarkably difficult to automate because of brevity and the absence of context. To address this gap, we have developed a novel approach to generate and appropriately assign coarse domain-specific labels. We show that a Large Language Model (LLM) can provide metadata essential to the task, in a process akin to the augmentation of supplemental knowledge representing human intuition, and propose a workflow. As a pilot study, we use a corpus of award abstracts from the National Aeronautics and Space Administration (NASA). We develop new assessment tools in concert with established performance metrics. △ Less

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2407.05653 [pdf, other]

On the spectrum of closed neighborhood corona product of graph and its application

Authors: Bishal Sonar, Ravi Srivastava

Abstract: This paper introduces the concept of the closed neighborhood corona product of the graph. We explore the mathematical features of this product graph, specifically in terms of its spectral characteristics. We have calculated the characteristic polynomials of the adjacency, Laplacian, and signless Laplacian matrices. Moreover, we investigate the conditions under which two graphs are cospectral regar… ▽ More This paper introduces the concept of the closed neighborhood corona product of the graph. We explore the mathematical features of this product graph, specifically in terms of its spectral characteristics. We have calculated the characteristic polynomials of the adjacency, Laplacian, and signless Laplacian matrices. Moreover, we investigate the conditions under which two graphs are cospectral regarding this product. A significant portion of our study is dedicated to computing the Kirchhoff index, the number of spanning trees and the sequence of non-cospectral equienergetic product graphs. We also outline specific criteria that determine when the product graph is integral. △ Less

Submitted 8 July, 2024; originally announced July 2024.

MSC Class: 05C22; 05C50; 05C76

arXiv:2407.05544 [pdf, other]

Recovering a Message from an Incomplete Set of Noisy Fragments

Authors: Aditya Narayan Ravi, Alireza Vahid, Ilan Shomorony

Abstract: We consider the problem of communicating over a channel that breaks the message block into fragments of random lengths, shuffles them out of order, and deletes a random fraction of the fragments. Such a channel is motivated by applications in molecular data storage and forensics, and we refer to it as the torn-paper channel. We characterize the capacity of this channel under arbitrary fragment len… ▽ More We consider the problem of communicating over a channel that breaks the message block into fragments of random lengths, shuffles them out of order, and deletes a random fraction of the fragments. Such a channel is motivated by applications in molecular data storage and forensics, and we refer to it as the torn-paper channel. We characterize the capacity of this channel under arbitrary fragment length distributions and deletion probabilities. Precisely, we show that the capacity is given by a closed-form expression that can be interpreted as F - A, where F is the coverage fraction ,i.e., the fraction of the input codeword that is covered by output fragments, and A is an alignment cost incurred due to the lack of ordering in the output fragments. We then consider a noisy version of the problem, where the fragments are corrupted by binary symmetric noise. We derive upper and lower bounds to the capacity, both of which can be seen as F - A expressions. These bounds match for specific choices of fragment length distributions, and they are approximately tight in cases where there are not too many short fragments. △ Less

Submitted 7 July, 2024; originally announced July 2024.

Comments: 43 pages, 3 figures

arXiv:2407.04692 [pdf, other]

Eigen-decomposition of Covariance matrices: An application to the BAO Linear Point

Authors: Jaemyoung Jason Lee, Farnik Nikakhtar, Aseem Paranjape, Ravi K. Sheth

Abstract: The Baryon Acoustic Oscillation (BAO) feature in the two-point correlation function (TPCF) of discrete tracers such as galaxies is an accurate standard ruler. The covariance matrix of the TPCF plays an important role in determining how the precision of this ruler depends on the number density and clustering strength of the tracers, as well as the survey volume. An eigen-decomposition of this matri… ▽ More The Baryon Acoustic Oscillation (BAO) feature in the two-point correlation function (TPCF) of discrete tracers such as galaxies is an accurate standard ruler. The covariance matrix of the TPCF plays an important role in determining how the precision of this ruler depends on the number density and clustering strength of the tracers, as well as the survey volume. An eigen-decomposition of this matrix provides an objective way to separate the contributions of cosmic variance from those of shot-noise to the statistical uncertainties. For the signal-to-noise levels that are expected in ongoing and next-generation surveys, the cosmic variance eigen-modes dominate. These modes are smooth functions of scale, meaning that: they are insensitive to the modest changes in binning that are allowed if one wishes to resolve the BAO feature in the TPCF; they provide a good description of the correlated residuals which result from fitting smooth functional forms to the measured TPCF; they motivate a simple but accurate approximation for the uncertainty on the Linear Point (LP) estimate of the BAO distance scale. This approximation allows one to quantify the precision of the BAO distance scale estimate without having to generate a large ensemble of mock catalogs and explains why: the uncertainty on the LP does not depend on the functional form fitted to the TPCF or the binning used; the LP is more constraining than the peak or dip scales in the TPCF; the evolved TPCF is less constraining than the initial one, so that reconstruction schemes can yield significant gains in precision. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: 10 pages, 8 figures, submitted to Physical Review D

arXiv:2407.02811 [pdf, other]

SPLITZ: Certifiable Robustness via Split Lipschitz Randomized Smoothing

Authors: Meiyu Zhong, Ravi Tandon

Abstract: Certifiable robustness gives the guarantee that small perturbations around an input to a classifier will not change the prediction. There are two approaches to provide certifiable robustness to adversarial examples: a) explicitly training classifiers with small Lipschitz constants, and b) Randomized smoothing, which adds random noise to the input to create a smooth classifier. We propose \textit{S… ▽ More Certifiable robustness gives the guarantee that small perturbations around an input to a classifier will not change the prediction. There are two approaches to provide certifiable robustness to adversarial examples: a) explicitly training classifiers with small Lipschitz constants, and b) Randomized smoothing, which adds random noise to the input to create a smooth classifier. We propose \textit{SPLITZ}, a practical and novel approach which leverages the synergistic benefits of both the above ideas into a single framework. Our main idea is to \textit{split} a classifier into two halves, constrain the Lipschitz constant of the first half, and smooth the second half via randomization. Motivation for \textit{SPLITZ} comes from the observation that many standard deep networks exhibit heterogeneity in Lipschitz constants across layers. \textit{SPLITZ} can exploit this heterogeneity while inheriting the scalability of randomized smoothing. We present a principled approach to train \textit{SPLITZ} and provide theoretical analysis to derive certified robustness guarantees during inference. We present a comprehensive comparison of robustness-accuracy tradeoffs and show that \textit{SPLITZ} consistently improves upon existing state-of-the-art approaches on MNIST and CIFAR-10 datasets. For instance, with $\ell_2$ norm perturbation budget of \textbf{$ε=1$}, \textit{SPLITZ} achieves $\textbf{43.2\%}$ top-1 test accuracy on CIFAR-10 dataset compared to state-of-art top-1 test accuracy $\textbf{39.8\%} △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2407.02741 [pdf]

18 GHz Solidly Mounted Resonator in Scandium Aluminum Nitride on SiO2/Ta2O5 Bragg Reflector

Authors: Omar Barrera, Nishanth Ravi, Kapil Saha, Supratik Dasgupta, Joshua Campbell, Jack Kramer, Eugene Kwon, Tzu-Hsuan Hsu, Sinwoo Cho, Ian Anderson, Pietro Simeoni, Jue Hou, Matteo Rinaldi, Mark S. Goorsky, Ruochen Lu

Abstract: This work reports an acoustic solidly mounted resonator (SMR) at 18.64 GHz, among the highest operating frequencies reported. The device is built in scandium aluminum nitride (ScAlN) on top of silicon dioxide (SiO2) and tantalum pentoxide (Ta2O5) Bragg reflectors on silicon (Si) wafer. The stack is analyzed with X-ray reflectivity (XRR) and high-resolution X-ray diffraction (HRXRD). The resonator… ▽ More This work reports an acoustic solidly mounted resonator (SMR) at 18.64 GHz, among the highest operating frequencies reported. The device is built in scandium aluminum nitride (ScAlN) on top of silicon dioxide (SiO2) and tantalum pentoxide (Ta2O5) Bragg reflectors on silicon (Si) wafer. The stack is analyzed with X-ray reflectivity (XRR) and high-resolution X-ray diffraction (HRXRD). The resonator shows a coupling coefficient (k2) of 2.0%, high series quality factor (Qs) of 156, shunt quality factor (Qp) of 142, and maximum Bode quality factor (Qmax) of 210. The third-order harmonics at 59.64 GHz is also observed with k2 around 0.6% and Q around 40. Upon further development, the reported acoustic resonator platform can enable various front-end signal-processing functions, e.g., filters and oscillators, at future frequency range 3 (FR3) bands. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 5 pages, 9 figures, 5 tables

arXiv:2407.01853 [pdf, other]

Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets

Authors: Sathish Reddy Indurthi, Wenxuan Zhou, Shamil Chollampatt, Ravi Agrawal, Kaiqiang Song, Lingxiao Zhao, Chenguang Zhu

Abstract: Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages. Traditional methods for creating multilingual IFT datasets such as translating existing English IFT datasets or converting existing NLP datasets into IFT dataset… ▽ More Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages. Traditional methods for creating multilingual IFT datasets such as translating existing English IFT datasets or converting existing NLP datasets into IFT datasets by templating, struggle to capture linguistic nuances and ensure prompt (instruction) diversity. To address this issue, we propose a novel method for collecting multilingual IFT datasets that preserves linguistic naturalness and ensures prompt diversity. This approach leverages English-focused LLMs, monolingual corpora, and a scoring function to create high-quality, diversified IFT datasets in multiple languages. Experiments demonstrate that LLMs finetuned using these IFT datasets show notable improvements in both generative and discriminative tasks, indicating enhanced language comprehension by LLMs in non-English contexts. Specifically, on the multilingual summarization task, LLMs using our IFT dataset achieved 17.57% and 15.23% improvements over LLMs fine-tuned with translation-based and template-based datasets, respectively. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2407.00263 [pdf, other]

From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models

Authors: Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, Eunjeong Hwang, Vered Shwartz

Abstract: Despite recent advancements in vision-language models, their performance remains suboptimal on images from non-western cultures due to underrepresentation in training datasets. Various benchmarks have been proposed to test models' cultural inclusivity, but they have limited coverage of cultures and do not adequately assess cultural diversity across universal as well as culture-specific local conce… ▽ More Despite recent advancements in vision-language models, their performance remains suboptimal on images from non-western cultures due to underrepresentation in training datasets. Various benchmarks have been proposed to test models' cultural inclusivity, but they have limited coverage of cultures and do not adequately assess cultural diversity across universal as well as culture-specific local concepts. To address these limitations, we introduce the GlobalRG benchmark, comprising two challenging tasks: retrieval across universals and cultural visual grounding. The former task entails retrieving culturally diverse images for universal concepts from 50 countries, while the latter aims at grounding culture-specific concepts within images from 15 countries. Our evaluation across a wide range of models reveals that the performance varies significantly across cultures -- underscoring the necessity for enhancing multicultural understanding in vision-language models. △ Less

Submitted 28 June, 2024; originally announced July 2024.

Comments: Under peer review

arXiv:2406.19256 [pdf, other]

AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI

Authors: Kaveen Hiniduma, Suren Byna, Jean Luca Bez, Ravi Madduri

Abstract: "Garbage In Garbage Out" is a universally agreed quote by computer scientists from various domains, including Artificial Intelligence (AI). As data is the fuel for AI, models trained on low-quality, biased data are often ineffective. Computer scientists who use AI invest a considerable amount of time and effort in preparing the data for AI. However, there are no standard methods or frameworks for… ▽ More "Garbage In Garbage Out" is a universally agreed quote by computer scientists from various domains, including Artificial Intelligence (AI). As data is the fuel for AI, models trained on low-quality, biased data are often ineffective. Computer scientists who use AI invest a considerable amount of time and effort in preparing the data for AI. However, there are no standard methods or frameworks for assessing the "readiness" of data for AI. To provide a quantifiable assessment of the readiness of data for AI processes, we define parameters of AI data readiness and introduce AIDRIN (AI Data Readiness Inspector). AIDRIN is a framework covering a broad range of readiness dimensions available in the literature that aid in evaluating the readiness of data quantitatively and qualitatively. AIDRIN uses metrics in traditional data quality assessment such as completeness, outliers, and duplicates for data evaluation. Furthermore, AIDRIN uses metrics specific to assess data for AI, such as feature importance, feature correlations, class imbalance, fairness, privacy, and FAIR (Findability, Accessibility, Interoperability, and Reusability) principle compliance. AIDRIN provides visualizations and reports to assist data scientists in further investigating the readiness of data. The AIDRIN framework enhances the efficiency of the machine learning pipeline to make informed decisions on data readiness for AI applications. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 12 pages, 9 figures, Accepted to SSDBM 2024

arXiv:2406.19150 [pdf, other]

RAVEN: Multitask Retrieval Augmented Vision-Language Learning

Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour… ▽ More The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resource intensive pre training, additional parameter requirements, unaddressed modality prioritization and lack of clear benefit over non-retrieval baselines. This paper introduces RAVEN, a multitask retrieval augmented VLM framework that enhances base VLMs through efficient, task specific fine-tuning. By integrating retrieval augmented samples without the need for additional retrieval-specific parameters, we show that the model acquires retrieval properties that are effective across multiple tasks. Our results and extensive ablations across retrieved modalities for the image captioning and VQA tasks indicate significant performance improvements compared to non retrieved baselines +1 CIDEr on MSCOCO, +4 CIDEr on NoCaps and nearly a +3\% accuracy on specific VQA question types. This underscores the efficacy of applying RAG approaches to VLMs, marking a stride toward more efficient and accessible multimodal learning. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.19040 [pdf, ps, other]

On Convex Optimization with Semi-Sensitive Features

Authors: Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Raghu Meka, Chiyuan Zhang

Abstract: We study the differentially private (DP) empirical risk minimization (ERM) problem under the semi-sensitive DP setting where only some features are sensitive. This generalizes the Label DP setting where only the label is sensitive. We give improved upper and lower bounds on the excess risk for DP-ERM. In particular, we show that the error only scales polylogarithmically in terms of the sensitive d… ▽ More We study the differentially private (DP) empirical risk minimization (ERM) problem under the semi-sensitive DP setting where only some features are sensitive. This generalizes the Label DP setting where only the label is sensitive. We give improved upper and lower bounds on the excess risk for DP-ERM. In particular, we show that the error only scales polylogarithmically in terms of the sensitive domain size, improving upon previous results that scale polynomially in the sensitive domain size (Ghazi et al., 2021). △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: To appear in COLT 2024

arXiv:2406.17005 [pdf, other]

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Authors: Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Cheng**g Wu, Ting Liu, Luoqi Liu, Xinyu Liu, **g Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, **gnan Luo , et al. (12 additional authors not shown)

Abstract: Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Segmentation track based on MeViS dataset. In the two new tracks, we provide additional videos and annotations that feature challenging elements, such as… ▽ More Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Segmentation track based on MeViS dataset. In the two new tracks, we provide additional videos and annotations that feature challenging elements, such as the disappearance and reappearance of objects, inconspicuous small objects, heavy occlusions, and crowded environments in MOSE. Moreover, we provide a new motion expression guided video segmentation dataset MeViS to study the natural language-guided video understanding in complex environments. These new videos, sentences, and annotations enable us to foster the development of a more comprehensive and robust pixel-level understanding of video scenes in complex environments and realistic scenarios. The MOSE challenge had 140 registered teams in total, 65 teams participated the validation phase and 12 teams made valid submissions in the final challenge phase. The MeViS challenge had 225 registered teams in total, 50 teams participated the validation phase and 5 teams made valid submissions in the final challenge phase. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: MOSE Challenge: https://henghuiding.github.io/MOSE/ChallengeCVPR2024, MeViS Challenge: https://henghuiding.github.io/MeViS/ChallengeCVPR2024

arXiv:2406.16305 [pdf, ps, other]

On Computing Pairwise Statistics with Local Differential Privacy

Authors: Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Adam Sealfon

Abstract: We study the problem of computing pairwise statistics, i.e., ones of the form $\binom{n}{2}^{-1} \sum_{i \ne j} f(x_i, x_j)$, where $x_i$ denotes the input to the $i$th user, with differential privacy (DP) in the local model. This formulation captures important metrics such as Kendall's $τ$ coefficient, Area Under Curve, Gini's mean difference, Gini's entropy, etc. We give several novel and generi… ▽ More We study the problem of computing pairwise statistics, i.e., ones of the form $\binom{n}{2}^{-1} \sum_{i \ne j} f(x_i, x_j)$, where $x_i$ denotes the input to the $i$th user, with differential privacy (DP) in the local model. This formulation captures important metrics such as Kendall's $τ$ coefficient, Area Under Curve, Gini's mean difference, Gini's entropy, etc. We give several novel and generic algorithms for the problem, leveraging techniques from DP algorithms for linear queries. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: Published in NeurIPS 2023

arXiv:2406.16302 [pdf, other]

doi 10.1111/cgf.15152

Residual path integrals for re-rendering

Authors: Bing Xu, Tzu-Mao Li, Iliyan Georgiev, Trevor Hedstrom, Ravi Ramamoorthi

Abstract: Conventional rendering techniques are primarily designed and optimized for single-frame rendering. In practical applications, such as scene editing and animation rendering, users frequently encounter scenes where only a small portion is modified between consecutive frames. In this paper, we develop a novel approach to incremental re-rendering of scenes with dynamic objects, where only a small part… ▽ More Conventional rendering techniques are primarily designed and optimized for single-frame rendering. In practical applications, such as scene editing and animation rendering, users frequently encounter scenes where only a small portion is modified between consecutive frames. In this paper, we develop a novel approach to incremental re-rendering of scenes with dynamic objects, where only a small part of a scene moves from one frame to the next. We formulate the difference (or residual) in the image between two frames as a (correlated) light-transport integral which we call the residual path integral. Efficient numerical solution of this integral then involves (1)~devising importance sampling strategies to focus on paths with non-zero residual-transport contributions and (2)~choosing appropriate map**s between the native path spaces of the two frames. We introduce a set of path importance sampling strategies that trace from the moving object(s) which are the sources of residual energy. We explore path map** strategies that generalize those from gradient-domain path tracing to our importance sampling techniques specially for dynamic scenes. Additionally, our formulation can be applied to material editing as a simpler special case. We demonstrate speed-ups over previous correlated sampling of path differences and over rendering the new frame independently. Our formulation brings new insights into the re-rendering problem and paves the way for devising new types of sampling techniques and path map**s with different trade-offs. △ Less

Submitted 23 June, 2024; originally announced June 2024.

Comments: 14 pages, 13 figures

ACM Class: I.3.0

arXiv:2406.16135 [pdf, other]

Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models

Authors: Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chulin Xie, Chiyuan Zhang

Abstract: Large language models (LLMs) are typically multilingual due to pretraining on diverse multilingual corpora. But can these models relate corresponding concepts across languages, effectively being crosslingual? This study evaluates six state-of-the-art LLMs on inherently crosslingual tasks. We observe that while these models show promising surface-level crosslingual abilities on machine translation… ▽ More Large language models (LLMs) are typically multilingual due to pretraining on diverse multilingual corpora. But can these models relate corresponding concepts across languages, effectively being crosslingual? This study evaluates six state-of-the-art LLMs on inherently crosslingual tasks. We observe that while these models show promising surface-level crosslingual abilities on machine translation and embedding space analyses, they struggle with deeper crosslingual knowledge transfer, revealing a crosslingual knowledge barrier in both general (MMLU benchmark) and domain-specific (Harry Potter quiz) contexts. We observe that simple inference-time mitigation methods offer only limited improvement. On the other hand, we propose fine-tuning of LLMs on mixed-language data, which effectively reduces these gaps, even when using out-of-domain datasets like WikiText. Our findings suggest the need for explicit optimization to unlock the full crosslingual potential of LLMs. Our code is publicly available at https://github.com/google-research/crosslingual-knowledge-barriers. △ Less

Submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.15721 [pdf, other]

Clapton: Clifford-Assisted Problem Transformation for Error Mitigation in Variational Quantum Algorithms

Authors: Lennart Maximilian Seifert, Siddharth Dangwal, Frederic T. Chong, Gokul Subramanian Ravi

Abstract: Variational quantum algorithms (VQAs) show potential for quantum advantage in the near term of quantum computing, but demand a level of accuracy that surpasses the current capabilities of NISQ devices. To systematically mitigate the impact of quantum device error on VQAs, we propose Clapton: Clifford-Assisted Problem Transformation for Error Mitigation in Variational Quantum Algorithms. Clapton le… ▽ More Variational quantum algorithms (VQAs) show potential for quantum advantage in the near term of quantum computing, but demand a level of accuracy that surpasses the current capabilities of NISQ devices. To systematically mitigate the impact of quantum device error on VQAs, we propose Clapton: Clifford-Assisted Problem Transformation for Error Mitigation in Variational Quantum Algorithms. Clapton leverages classically estimated good quantum states for a given VQA problem, classical simulable models of device noise, and the variational principle for VQAs. It applies transformations on the VQA problem's Hamiltonian to lower the energy estimates of known good VQA states in the presence of the modeled device noise. The Clapton hypothesis is that as long as the known good states of the VQA problem are close to the problem's ideal ground state and the device noise modeling is reasonably accurate (both of which are generally true), then the Clapton transformation substantially decreases the impact of device noise on the ground state of the VQA problem, thereby increasing the accuracy of the VQA solution. Clapton is built as an end-to-end application-to-device framework and achieves mean VQA initialization improvements of 1.7x to 3.7x, and up to a maximum of 13.3x, over the state-of-the-art baseline when evaluated for a variety of scientific applications from physics and chemistry on noise models and real quantum devices. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.14780 [pdf, other]

ACR: A Benchmark for Automatic Cohort Retrieval

Authors: Dung Ngoc Thai, Victor Ardulov, Jose Ulises Mena, Simran Tiwari, Gleb Erofeev, Ramy Eskander, Karim Tarabishy, Ravi B Parikh, Wael Salloum

Abstract: Identifying patient cohorts is fundamental to numerous healthcare tasks, including clinical trial recruitment and retrospective studies. Current cohort retrieval methods in healthcare organizations rely on automated queries of structured data combined with manual curation, which are time-consuming, labor-intensive, and often yield low-quality results. Recent advancements in large language models (… ▽ More Identifying patient cohorts is fundamental to numerous healthcare tasks, including clinical trial recruitment and retrospective studies. Current cohort retrieval methods in healthcare organizations rely on automated queries of structured data combined with manual curation, which are time-consuming, labor-intensive, and often yield low-quality results. Recent advancements in large language models (LLMs) and information retrieval (IR) offer promising avenues to revolutionize these systems. Major challenges include managing extensive eligibility criteria and handling the longitudinal nature of unstructured Electronic Medical Records (EMRs) while ensuring that the solution remains cost-effective for real-world application. This paper introduces a new task, Automatic Cohort Retrieval (ACR), and evaluates the performance of LLMs and commercial, domain-specific neuro-symbolic approaches. We provide a benchmark task, a query dataset, an EMR dataset, and an evaluation framework. Our findings underscore the necessity for efficient, high-quality ACR systems capable of longitudinal reasoning across extensive patient databases. △ Less

Submitted 1 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

arXiv:2406.14480 [pdf, other]

Parity and Lepton Masses in the Left Right Symmetric Model

Authors: Ravi Kuchimanchi

Abstract: Curiously in the minimal left right symmetric model, chiral ($χ$) symmetry that protects the electron's mass ($m_e$), due to parity (P) implies the vanishing of its neutrino mixing angles. We break the $χ$ symmetry to generate the observed neutrino mixing which causes the electron to acquire its mass on RGE running, and in turn determines the B-L gauge symmetry breaking scale ($v_R$) to be… ▽ More Curiously in the minimal left right symmetric model, chiral ($χ$) symmetry that protects the electron's mass ($m_e$), due to parity (P) implies the vanishing of its neutrino mixing angles. We break the $χ$ symmetry to generate the observed neutrino mixing which causes the electron to acquire its mass on RGE running, and in turn determines the B-L gauge symmetry breaking scale ($v_R$) to be $10^{10} GeV \lesssim v_R \leq 10^{15} GeV $ (and with fine-tuning can be at $10 TeV$ scale). If the muon's mass is also generated radiatively, the B-L breaking scale is $\sim 10^{14-15}$ GeV. Regardless of how the high scale $v_R$ is, this is a testable model for obtaining the mass of the electron (and muon), since on RGE running and P breaking, a large strong CP phase ($\barθ >> 10^{-10}$) which depends logarithmically on $v_R$ is generated if there is O(1) CP violation in leptonic Yukawa couplings. Hence we expect that leptonic CP phases including the Dirac CP phase $δ_{CP}$ of the PMNS matrix must be consistent with $0$ or $180^o$ to within a degree, which can be verified or excluded by neutrino experiments such as DUNE and Hyper-Kamiokande. In lieu of P, if charge conjugation C is used, the same results follow. However with C and no P, axions would likely need to be added anyway, in which case there is no constraint on $δ_{CP}$. △ Less

Submitted 10 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

Comments: "P & m_e" was the previous version's title of this paper

arXiv:2406.14322 [pdf, other]

Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Authors: Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Ravi Kumar, Daogao Liu, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

Abstract: Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy (DP) offers a promising solution by ensuring models are 'almost indistinguishable' with or without any particular privacy unit, current evaluations on LLMs most… ▽ More Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy (DP) offers a promising solution by ensuring models are 'almost indistinguishable' with or without any particular privacy unit, current evaluations on LLMs mostly treat each example (text record) as the privacy unit. This leads to uneven user privacy guarantees when contributions per user vary. We therefore study user-level DP motivated by applications where it necessary to ensure uniform privacy protection across users. We present a systematic evaluation of user-level DP for LLM fine-tuning on natural language generation tasks. Focusing on two mechanisms for achieving user-level DP guarantees, Group Privacy and User-wise DP-SGD, we investigate design choices like data selection strategies and parameter tuning for the best privacy-utility tradeoff. △ Less

Submitted 3 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

arXiv:2406.13967 [pdf, other]

Hardware-Efficient Randomized Compiling

Authors: Neelay Fruitwala, Akel Hashim, Abhi D. Rajagopala, Yilun Xu, Jordan Hines, Ravi K. Naik, Irfan Siddiqi, Katherine Klymko, Gang Huang, Kasra Nowrouzi

Abstract: Randomized compiling (RC) is an efficient method for tailoring arbitrary Markovian errors into stochastic Pauli channels. However, the standard procedure for implementing the protocol in software comes with a large experimental overhead -- namely, it scales linearly in the number of desired randomizations, each of which must be generated and measured independently. In this work, we introduce a har… ▽ More Randomized compiling (RC) is an efficient method for tailoring arbitrary Markovian errors into stochastic Pauli channels. However, the standard procedure for implementing the protocol in software comes with a large experimental overhead -- namely, it scales linearly in the number of desired randomizations, each of which must be generated and measured independently. In this work, we introduce a hardware-efficient algorithm for performing RC on a cycle-by-cycle basis on the lowest level of our FPGA-based control hardware during the execution of a circuit. Importantly, this algorithm performs a different randomization per shot with zero runtime overhead beyond measuring a circuit without RC. We implement our algorithm using the QubiC control hardware, where we demonstrate significant reduction in the overall runtime of circuits implemented with RC, as well as a significantly lower variance in measured observables. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.12815 [pdf, other]

Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation

Authors: Nikolas Koutsoubis, Yasin Yilmaz, Ravi P. Ramachandran, Matthew Schabath, Ghulam Rasool

Abstract: Machine learning (ML) and Artificial Intelligence (AI) have fueled remarkable advancements, particularly in healthcare. Within medical imaging, ML models hold the promise of improving disease diagnoses, treatment planning, and post-treatment monitoring. Various computer vision tasks like image classification, object detection, and image segmentation are poised to become routine in clinical analysi… ▽ More Machine learning (ML) and Artificial Intelligence (AI) have fueled remarkable advancements, particularly in healthcare. Within medical imaging, ML models hold the promise of improving disease diagnoses, treatment planning, and post-treatment monitoring. Various computer vision tasks like image classification, object detection, and image segmentation are poised to become routine in clinical analysis. However, privacy concerns surrounding patient data hinder the assembly of large training datasets needed for develo** and training accurate, robust, and generalizable models. Federated Learning (FL) emerges as a compelling solution, enabling organizations to collaborate on ML model training by sharing model training information (gradients) rather than data (e.g., medical images). FL's distributed learning framework facilitates inter-institutional collaboration while preserving patient privacy. However, FL, while robust in privacy preservation, faces several challenges. Sensitive information can still be gleaned from shared gradients that are passed on between organizations during model training. Additionally, in medical imaging, quantifying model confidence\uncertainty accurately is crucial due to the noise and artifacts present in the data. Uncertainty estimation in FL encounters unique hurdles due to data heterogeneity across organizations. This paper offers a comprehensive review of FL, privacy preservation, and uncertainty estimation, with a focus on medical imaging. Alongside a survey of current research, we identify gaps in the field and suggest future directions for FL research to enhance privacy and address noisy medical imaging data challenges. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 31 pages, 5 figures, 3 tables, Journal preprint

arXiv:2406.12411 [pdf, other]

TADM: Temporally-Aware Diffusion Model for Neurodegenerative Progression on Brain MRI

Authors: Mattia Litrico, Francesco Guarnera, Valerio Giuffirda, Daniele Ravì, Sebastiano Battiato

Abstract: Generating realistic images to accurately predict changes in the structure of brain MRI is a crucial tool for clinicians. Such applications help assess patients' outcomes and analyze how diseases progress at the individual level. However, existing methods for this task present some limitations. Some approaches attempt to model the distribution of MRI scans directly by conditioning the model on pat… ▽ More Generating realistic images to accurately predict changes in the structure of brain MRI is a crucial tool for clinicians. Such applications help assess patients' outcomes and analyze how diseases progress at the individual level. However, existing methods for this task present some limitations. Some approaches attempt to model the distribution of MRI scans directly by conditioning the model on patients' ages, but they fail to explicitly capture the relationship between structural changes in the brain and time intervals, especially on age-unbalanced datasets. Other approaches simply rely on interpolation between scans, which limits their clinical application as they do not predict future MRIs. To address these challenges, we propose a Temporally-Aware Diffusion Model (TADM), which introduces a novel approach to accurately infer progression in brain MRIs. TADM learns the distribution of structural changes in terms of intensity differences between scans and combines the prediction of these changes with the initial baseline scans to generate future MRIs. Furthermore, during training, we propose to leverage a pre-trained Brain-Age Estimator (BAE) to refine the model's training process, enhancing its ability to produce accurate MRIs that match the expected age gap between baseline and generated scans. Our assessment, conducted on the OASIS-3 dataset, uses similarity metrics and region sizes computed by comparing predicted and real follow-up scans on 3 relevant brain regions. TADM achieves large improvements over existing approaches, with an average decrease of 24% in region size error and an improvement of 4% in similarity metrics. These evaluations demonstrate the improvement of our model in mimicking temporal brain neurodegenerative progression compared to existing methods. Our approach will benefit applications, such as predicting patient outcomes or improving treatments for patients. △ Less

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.11827 [pdf, other]

WPO: Enhancing RLHF with Weighted Preference Optimization

Authors: Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu

Abstract: Reinforcement learning from human feedback (RLHF) is a promising solution to align large language models (LLMs) more closely with human values. Off-policy preference optimization, where the preference data is obtained from other models, is widely adopted due to its cost efficiency and scalability. However, off-policy preference optimization often suffers from a distributional gap between the polic… ▽ More Reinforcement learning from human feedback (RLHF) is a promising solution to align large language models (LLMs) more closely with human values. Off-policy preference optimization, where the preference data is obtained from other models, is widely adopted due to its cost efficiency and scalability. However, off-policy preference optimization often suffers from a distributional gap between the policy used for data collection and the target policy, leading to suboptimal optimization. In this paper, we propose a novel strategy to mitigate this problem by simulating on-policy learning with off-policy preference data. Our Weighted Preference Optimization (WPO) method adapts off-policy data to resemble on-policy data more closely by reweighting preference pairs according to their probability under the current policy. This method not only addresses the distributional gap problem but also enhances the optimization process without incurring additional costs. We validate our method on instruction following benchmarks including Alpaca Eval 2 and MT-bench. WPO not only outperforms Direct Preference Optimization (DPO) by up to 5.6% on Alpaca Eval 2 but also establishes a remarkable length-controlled winning rate against GPT-4-turbo of 48.6% based on Llama-3-8B-Instruct, making it the strongest 8B model on the leaderboard. We will release the code and models at https://github.com/wzhouad/WPO. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.10764 [pdf, other]

GNOME: Generating Negotiations through Open-Domain Map** of Exchanges

Authors: Darshan Deshpande, Shambhavi Sinha, Anirudh Ravi Kumar, Debaditya Pal, Jonathan May

Abstract: Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes exi… ▽ More Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes existing human-annotated, closed-domain datasets using Large Language Models and produces synthetic open-domain dialogues for negotiation. GNOME improves the generalizability of negotiation systems while reducing the expensive and subjective task of manual data curation. Through our experimental setup, we create a benchmark comparing encoder and decoder models trained on existing datasets against datasets created through GNOME. Our results show that models trained on our dataset not only perform better than previous state of the art models on domain specific strategy prediction, but also generalize better to previously unseen domains. △ Less

Submitted 15 June, 2024; originally announced June 2024.

arXiv:2406.10118 [pdf, other]

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Authors: Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse , et al. (36 additional authors not shown)

Abstract: Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due t… ▽ More Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due to the scarcity of high-quality datasets, compounded by the dominance of English training data, raising concerns about potential cultural misrepresentation. To address these challenges, we introduce SEACrowd, a collaborative initiative that consolidates a comprehensive resource hub that fills the resource gap by providing standardized corpora in nearly 1,000 SEA languages across three modalities. Through our SEACrowd benchmarks, we assess the quality of AI models on 36 indigenous languages across 13 tasks, offering valuable insights into the current AI landscape in SEA. Furthermore, we propose strategies to facilitate greater AI advancements, maximizing potential utility and resource equity for the future of AI in SEA. △ Less

Submitted 8 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

Comments: https://github.com/SEACrowd

arXiv:2406.09427 [pdf, other]

On Optimal Server Allocation for Moldable Jobs with Concave Speed-Up

Authors: Samira Ghanbarian, Arpan Mukhopadhyay, Ravi R. Mazumdar, Fabrice M. Guillemin

Abstract: A large proportion of jobs submitted to modern computing clusters and data centers are parallelizable and capable of running on a flexible number of computing cores or servers. Although allocating more servers to such a job results in a higher speed-up in the job's execution, it reduces the number of servers available to other jobs, which in the worst case, can result in an incoming job not findin… ▽ More A large proportion of jobs submitted to modern computing clusters and data centers are parallelizable and capable of running on a flexible number of computing cores or servers. Although allocating more servers to such a job results in a higher speed-up in the job's execution, it reduces the number of servers available to other jobs, which in the worst case, can result in an incoming job not finding any available server to run immediately upon arrival. Hence, a key question to address is: how to optimally allocate servers to jobs such that (i) the average execution time across jobs is minimized and (ii) almost all jobs find at least one server immediately upon arrival. To address this question, we consider a system with $n$ servers, where jobs are parallelizable up to $d^{(n)}$ servers and the speed-up function of jobs is concave and increasing. Jobs not finding any available servers upon entry are blocked and lost. We propose a simple server allocation scheme that achieves the minimum average execution time of accepted jobs while ensuring that the blocking probability of jobs vanishes as the system becomes large ($n \to \infty$). This result is established for various traffic conditions as well as for heterogeneous workloads. To prove our result, we employ Stein's method which also yields non-asymptotic bounds on the blocking probability and the mean execution time. Furthermore, our simulations show that the performance of the scheme is insensitive to the distribution of job execution times. △ Less

Submitted 15 April, 2024; originally announced June 2024.

MSC Class: 60J28 (Primary) 60K25; 68M20 (Secondary)

arXiv:2406.09379 [pdf, other]

The Stability of the BAO Linear Point under Modified Gravity

Authors: Jaemyoung Jason Lee, Bartolomeo Fiorini, Farnik Nikakhtar, Ravi K. Sheth

Abstract: Baryon Acoustic Oscillations (BAOs) are crucial in cosmological analysis, providing a standard ruler, as well as constraints on dark energy. In General Relativity models, the BAO Linear Point - the midpoint between the dip and the peak in the correlation function - has been shown to be rather robust to evolution and redshift space distortions. We show that this remains true even when the gravity m… ▽ More Baryon Acoustic Oscillations (BAOs) are crucial in cosmological analysis, providing a standard ruler, as well as constraints on dark energy. In General Relativity models, the BAO Linear Point - the midpoint between the dip and the peak in the correlation function - has been shown to be rather robust to evolution and redshift space distortions. We show that this remains true even when the gravity model is not General Relativity, at least for $f(R)$ and DGP gravity models which have the same expansion history as the standard $Λ$CDM. For the Linear Point to be able to distinguish between modified gravity (MG) and $Λ$CDM, survey volumes of order tens of cubic Gpc are required. △ Less

Submitted 19 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

Comments: 9 pages, 5 figures, submitted to Physical Review D, v2

arXiv:2406.08988 [pdf, other]

Probing the polarized emission from SMC X-1: the brightest X-ray pulsar observed by IXPE

Authors: Sofia V. Forsblom, Sergey S. Tsygankov, Juri Poutanen, Victor Doroshenko, Alexander A. Mushtukov, Mason Ng, Swati Ravi, Herman L. Marshall, Alessandro Di Marco, Fabio La Monaca, Christian Malacaria, Guglielmo Mastroserio, Vladislav Loktev, Andrea Possenti, Valery F. Suleimanov, Roberto Taverna, Ivan Agudo, Lucio A. Antonelli, Matteo Bachetti, Luca Baldini, Wayne H. Baumgartner, Ronaldo Bellazzini, Stefano Bianchi, Stephen D. Bongiorno, Raffaella Bonino , et al. (79 additional authors not shown)

Abstract: Recent observations of X-ray pulsars (XRPs) performed by the Imaging X-ray Polarimetry Explorer (IXPE) have made it possible to investigate the intricate details of these objects in a new way, thanks to the added value of X-ray polarimetry. Here we present the results of the IXPE observations of SMC X-1, a member of the small group of XRPs displaying super-orbital variability. SMC X-1 was observed… ▽ More Recent observations of X-ray pulsars (XRPs) performed by the Imaging X-ray Polarimetry Explorer (IXPE) have made it possible to investigate the intricate details of these objects in a new way, thanks to the added value of X-ray polarimetry. Here we present the results of the IXPE observations of SMC X-1, a member of the small group of XRPs displaying super-orbital variability. SMC X-1 was observed by IXPE three separate times during the high state of its super-orbital period. The observed luminosity in the 2-8 keV energy band of $L=2\times10^{38}$ erg/s makes SMC X-1 the brightest XRP ever observed by IXPE. We detect significant polarization in all three observations, with values of the phase-averaged polarization degree (PD) and polarization angle (PA) of $3.2\pm0.8$% and $97°\pm8°$ for Observation 1, $3.0\pm0.9$% and $90°\pm8°$ for Observation 2, and $5.5\pm1.1$% and $80°\pm6°$ for Observation 3, for the spectro-polarimetric analysis. The observed PD shows an increase over time with decreasing luminosity, while the PA decreases in decrements of 10°. The phase-resolved spectro-polarimetric analysis reveals significant detection of polarization in three out of seven phase bins, with the PD ranging between 2% and 10%, and a corresponding range in the PA from $\sim$70° to $\sim$100°. The pulse-phase resolved PD displays an apparent anti-correlation with the flux. Using the rotating vector model, we obtain constraints on the pulsar's geometrical properties for the individual observations. The position angle of the pulsar displays an evolution over time supporting the idea that we observe changes related to different super-orbital phases. Scattering in the wind of the precessing accretion disk may be responsible for the behavior of the polarimetric properties observed during the high-state of SMC X-1's super-orbital period. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: 11 pages, 11 figures, submitted to A&A

arXiv:2406.08428 [pdf, other]

Improving Noise Robustness through Abstractions and its Impact on Machine Learning

Authors: Alfredo Ibias, Karol Capala, Varun Ravi Varma, Anna Drozdz, Jose Sousa

Abstract: Noise is a fundamental problem in learning theory with huge effects in the application of Machine Learning (ML) methods, due to real world data tendency to be noisy. Additionally, introduction of malicious noise can make ML methods fail critically, as is the case with adversarial attacks. Thus, finding and develo** alternatives to improve robustness to noise is a fundamental problem in ML. In th… ▽ More Noise is a fundamental problem in learning theory with huge effects in the application of Machine Learning (ML) methods, due to real world data tendency to be noisy. Additionally, introduction of malicious noise can make ML methods fail critically, as is the case with adversarial attacks. Thus, finding and develo** alternatives to improve robustness to noise is a fundamental problem in ML. In this paper, we propose a method to deal with noise: mitigating its effect through the use of data abstractions. The goal is to reduce the effect of noise over the model's performance through the loss of information produced by the abstraction. However, this information loss comes with a cost: it can result in an accuracy reduction due to the missing information. First, we explored multiple methodologies to create abstractions, using the training dataset, for the specific case of numerical data and binary classification tasks. We also tested how these abstractions can affect robustness to noise with several experiments that explore the robustness of an Artificial Neural Network to noise when trained using raw data \emph{vs} when trained using abstracted data. The results clearly show that using abstractions is a viable approach for develo** noise robust ML methods. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.07664 [pdf, other]

Implications for Galactic Electron Density Structure from Pulsar Sightlines Intersecting HII Regions

Authors: S. K. Ocker, L. D. Anderson, J. Lazio, J. M. Cordes, V. Ravi

Abstract: Recent radio surveys have revealed pulsars with dispersion and scattering delays induced by ionized gas that are larger than the rest of the observed pulsar population, in some cases with electron column densities (or dispersion measures; DMs) larger than the maximum predictions of Galactic electron density models. By cross-matching the observed pulsar population against HII region catalogs, we sh… ▽ More Recent radio surveys have revealed pulsars with dispersion and scattering delays induced by ionized gas that are larger than the rest of the observed pulsar population, in some cases with electron column densities (or dispersion measures; DMs) larger than the maximum predictions of Galactic electron density models. By cross-matching the observed pulsar population against HII region catalogs, we show that the majority of pulsars with $\rm DM > 600$ pc cm$^{-3}$ and scattering delays $τ(1\ {\rm GHz}) > 10$ ms lie behind HII regions, and that HII region intersections may be relevant to as much as a third of the observed pulsar population. Accounting for HII regions resolves apparent discrepancies where Galactic electron density models place high-DM pulsars beyond the Galactic disk. By comparing emission measures (EMs) inferred from recombination line observations to pulsar DMs, we show that HII regions can contribute tens to hundreds of pc cm$^{-3}$ in electron column density along a pulsar LOS. We find that nearly all pulsars with significant excess (and deficit) scattering from the mean $τ$-DM relation are spatially coincident with known discrete ionized gas structures, including HII regions. Accounting for HII regions is critical to the interpretation of radio dispersion and scattering measurements as electron density tracers, both in the Milky Way and in other galaxies. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 23 pages, 10 figures. Comments appreciated

arXiv:2406.03986 [pdf, other]

On The Persona-based Summarization of Domain-Specific Documents

Authors: Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku

Abstract: In an ever-expanding world of domain-specific knowledge, the increasing complexity of consuming, and storing information necessitates the generation of summaries from large information repositories. However, every persona of a domain has different requirements of information and hence their summarization. For example, in the healthcare domain, a persona-based (such as Doctor, Nurse, Patient etc.)… ▽ More In an ever-expanding world of domain-specific knowledge, the increasing complexity of consuming, and storing information necessitates the generation of summaries from large information repositories. However, every persona of a domain has different requirements of information and hence their summarization. For example, in the healthcare domain, a persona-based (such as Doctor, Nurse, Patient etc.) approach is imperative to deliver targeted medical information efficiently. Persona-based summarization of domain-specific information by humans is a high cognitive load task and is generally not preferred. The summaries generated by two different humans have high variability and do not scale in cost and subject matter expertise as domains and personas grow. Further, AI-generated summaries using generic Large Language Models (LLMs) may not necessarily offer satisfactory accuracy for different domains unless they have been specifically trained on domain-specific data and can also be very expensive to use in day-to-day operations. Our contribution in this paper is two-fold: 1) We present an approach to efficiently fine-tune a domain-specific small foundation LLM using a healthcare corpus and also show that we can effectively evaluate the summarization quality using AI-based critiquing. 2) We further show that AI-based critiquing has good concordance with Human-based critiquing of the summaries. Hence, such AI-based pipelines to generate domain-specific persona-based summaries can be easily scaled to other domains such as legal, enterprise documents, education etc. in a very efficient and cost-effective manner. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Journal ref: ACL 2024 Findings (Association for Computational Linguistics)

arXiv:2406.03685 [pdf, other]

Shockingly Bright Warm Carbon Monoxide Molecular Features in the Supernova Remnant Cassiopeia A Revealed by JWST

Authors: J. Rho, S. -H. Park, R. Arendt, M. Matsuura, D. Milisavljevic, T. Temim, I. De Looze, W. P. Blair, A. Rest, O. Fox, A. P. Ravi, B. -C. Koo, M. Barlow, A. Burrows, R. Chevalier, G. Clayton, R. Fesen, C. Fransson, C. Fryer, H. L. Gomez, H. -T. Janka, F. Kirchschlarger, J. M. Laming, S. Orlando, D. Patnaude , et al. (14 additional authors not shown)

Abstract: We present JWST NIRCam (F356W and F444W filters) and MIRI (F770W) images and NIRSpec- IFU spectroscopy of the young supernova remnant Cassiopeia A (Cas A). We obtained the data as part of a JWST survey of Cas A. The NIRCam and MIRI images map the spatial distributions of synchrotron radiation, Ar-rich ejecta, and CO on both large and small scales, revealing remarkably complex structures. The CO em… ▽ More We present JWST NIRCam (F356W and F444W filters) and MIRI (F770W) images and NIRSpec- IFU spectroscopy of the young supernova remnant Cassiopeia A (Cas A). We obtained the data as part of a JWST survey of Cas A. The NIRCam and MIRI images map the spatial distributions of synchrotron radiation, Ar-rich ejecta, and CO on both large and small scales, revealing remarkably complex structures. The CO emission is stronger at the outer layers than the Ar ejecta, which indicates the reformation of CO molecules behind the reverse shock. NIRSpec-IFU spectra (3 - 5.5 microns) were obtained toward two representative knots in the NE and S fields. Both regions are dominated by the bright fundamental rovibrational band of CO in the two R and P branches, with strong [Ar VI] and relatively weaker, variable strength ejecta lines of [Si IX], [Ca IV], [Ca V] and [Mg IV]. The NIRSpec-IFU data resolve individual ejecta knots and filaments spatially and in velocity space. The fundamental CO band in the JWST spectra reveals unique shapes of CO, showing a few tens of sinusoidal patterns of rovibrational lines with pseudo-continuum underneath, which is attributed to the high-velocity widths of CO lines. The CO also shows high J lines at different vibrational transitions. Our results with LTE modeling of CO emission indicate a temperature of 1080 K and provide unique insight into the correlations between dust, molecules, and highly ionized ejecta in supernovae, and have strong ramifications for modeling dust formation that is led by CO cooling in the early Universe. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: accepted for the ApJ letter (17 pages and 10 figures)

arXiv:2406.02922 [pdf, other]

Saturated de Rham-Witt complexes with unit-root coefficients

Authors: Ravi Fernando

Abstract: The saturated de Rham-Witt complex, introduced by Bhatt-Lurie-Mathew, is a variant of the classical de Rham-Witt complex which provides a conceptual simplification of the construction and which is expected to produce better results for non-smooth varieties. In this paper, we introduce a generalization of the saturated de Rham-Witt complex which allows coefficients in a unit-root $F$-crystal. We de… ▽ More The saturated de Rham-Witt complex, introduced by Bhatt-Lurie-Mathew, is a variant of the classical de Rham-Witt complex which provides a conceptual simplification of the construction and which is expected to produce better results for non-smooth varieties. In this paper, we introduce a generalization of the saturated de Rham-Witt complex which allows coefficients in a unit-root $F$-crystal. We define our complex by a universal property in a category of so-called de Rham-Witt modules. We prove a number of results about it, including existence, quasicoherence, and comparisons to the de Rham-Witt complex of Bhatt-Lurie-Mathew and (in the smooth case) to crystalline cohomology and the classical de Rham-Witt complex with coefficients. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 91 pages

MSC Class: 14F30

arXiv:2406.02778 [pdf, other]

MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence… ▽ More Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence between the embedding space and the input feature space which aids in deriving feature importance of the original features. We theoretically justify our approach and demonstrate that, in Paley-Wiener spaces on combinatorial graphs, the spectral graph wavelets operator offers greater flexibility and better control over smoothness properties compared to the Laplacian operator. We validate the effectiveness of our proposed graph embedding on a variety of public datasets through a range of downstream tasks, including clustering and unsupervised feature importance. △ Less

Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.01947 [pdf, other]

Data-Driven Approaches for Thrust Prediction in Underwater Flap** Fin Propulsion Systems

Authors: Julian Lee, Kamal Viswanath, Alisha Sharma, Jason Geder, Ravi Ramamurti, Marius D. Pruessner

Abstract: Flap**-fin underwater vehicle propulsion systems provide an alternative to propeller-driven systems in situations that require involve a constrained environment or require high maneuverability. Testing new configurations through experiments or high-fidelity simulations is an expensive process, slowing development of new systems. This is especially true when introducing new fin geometries. In thi… ▽ More Flap**-fin underwater vehicle propulsion systems provide an alternative to propeller-driven systems in situations that require involve a constrained environment or require high maneuverability. Testing new configurations through experiments or high-fidelity simulations is an expensive process, slowing development of new systems. This is especially true when introducing new fin geometries. In this work, we propose machine learning approaches for thrust prediction given the system's fin geometries and kinematics. We introduce data-efficient fin shape parameterization strategies that enable our network to predict thrust profiles for unseen fin geometries given limited fin shapes in input data. In addition to faster development of systems, generalizable surrogate models offer fast, accurate predictions that could be used on an unmanned underwater vehicle control system. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 9 pages, 11 figures, AAAI 2021 Fall Series Symposium on Science-Guided AI

arXiv:2406.01936 [pdf]

Fluid Implicit Particles on Coadjoint Orbits

Authors: Mohammad Sina Nabizadeh, Ritoban Roy-Chowdhury, Hang Yin, Ravi Ramamoorthi, Albert Chern

Abstract: We propose Coadjoint Orbit FLIP (CO-FLIP), a high order accurate, structure preserving fluid simulation method in the hybrid Eulerian-Lagrangian framework. We start with a Hamiltonian formulation of the incompressible Euler Equations, and then, using a local, explicit, and high order divergence free interpolation, construct a modified Hamiltonian system that governs our discrete Euler flow. The re… ▽ More We propose Coadjoint Orbit FLIP (CO-FLIP), a high order accurate, structure preserving fluid simulation method in the hybrid Eulerian-Lagrangian framework. We start with a Hamiltonian formulation of the incompressible Euler Equations, and then, using a local, explicit, and high order divergence free interpolation, construct a modified Hamiltonian system that governs our discrete Euler flow. The resulting discretization, when paired with a geometric time integration scheme, is energy and circulation preserving (formally the flow evolves on a coadjoint orbit) and is similar to the Fluid Implicit Particle (FLIP) method. CO-FLIP enjoys multiple additional properties including that the pressure projection is exact in the weak sense, and the particle-to-grid transfer is an exact inverse of the grid-to-particle interpolation. The method is demonstrated numerically with outstanding stability, energy, and Casimir preservation. We show that the method produces benchmarks and turbulent visual effects even at low grid resolutions. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.01471 [pdf]

Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing

Authors: Luka Grbcic, Minok Park, Mahmoud Elzouka, Ravi Prasher, Juliane Müller, Costas P. Grigoropoulos, Sean D. Lubner, Vassilia Zorba, Wibe Albert de Jong

Abstract: We demonstrate a multi-fidelity (MF) machine learning ensemble framework for the inverse design of photonic surfaces, trained on a dataset of 11,759 samples that we fabricate using high throughput femtosecond laser processing. The MF ensemble combines an initial low fidelity model for generating design solutions, with a high fidelity model that refines these solutions through local optimization. T… ▽ More We demonstrate a multi-fidelity (MF) machine learning ensemble framework for the inverse design of photonic surfaces, trained on a dataset of 11,759 samples that we fabricate using high throughput femtosecond laser processing. The MF ensemble combines an initial low fidelity model for generating design solutions, with a high fidelity model that refines these solutions through local optimization. The combined MF ensemble can generate multiple disparate sets of laser-processing parameters that can each produce the same target input spectral emissivity with high accuracy (root mean squared errors < 2%). SHapley Additive exPlanations analysis shows transparent model interpretability of the complex relationship between laser parameters and spectral emissivity. Finally, the MF ensemble is experimentally validated by fabricating and evaluating photonic surface designs that it generates for improved efficiency energy harvesting devices. Our approach provides a powerful tool for advancing the inverse design of photonic surfaces in energy harvesting applications. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.00982 [pdf, ps, other]

Constructing Dynamic Feedback Linearizable Discretizations

Authors: Ashutosh **dal, Florentina Nicolau, David Martin Diego, Ravi Banavar

Abstract: Dynamic feedback linearization-based methods allow us to design control algorithms for a fairly large class of nonlinear systems in continuous time. However, this feature does not extend to their sampled counterparts, i.e., for a given dynamically feedback linearizable continuous time system, its numerical discretization may fail to be so. In this article, we present a way to construct discretizat… ▽ More Dynamic feedback linearization-based methods allow us to design control algorithms for a fairly large class of nonlinear systems in continuous time. However, this feature does not extend to their sampled counterparts, i.e., for a given dynamically feedback linearizable continuous time system, its numerical discretization may fail to be so. In this article, we present a way to construct discretization schemes (accurate up to first order) that result in schemes that are feedback linearizable. This result is an extension of our previous work, where we had considered only static feedback linearizable systems. The result presented here applies to a fairly general class of nonlinear systems, in particular, our analysis applies to both endogenous and exogenous types of feedback. While the results in this article are presented on a control affine form of nonlinear systems, they can be readily modified to general nonlinear systems. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.00611 [pdf, other]

DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

Authors: Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong

Abstract: Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers… ▽ More Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers for black-box models lack faithfulness guarantees, and self-interpretable models greatly compromise accuracy. To address these issues, we propose DISCRET, a self-interpretable ITE framework that synthesizes faithful, rule-based explanations for each sample. A key insight behind DISCRET is that explanations can serve dually as database queries to identify similar subgroups of samples. We provide a novel RL algorithm to efficiently synthesize these explanations from a large search space. We evaluate DISCRET on diverse tasks involving tabular, image, and text data. DISCRET outperforms the best self-interpretable models and has accuracy comparable to the best black-box models while providing faithful explanations. DISCRET is available at https://github.com/wuyinjun-1993/DISCRET-ICML2024. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: Accepted at ICML 2024. 22 pages, 5 figures

arXiv:2406.00365 [pdf, other]

SynthBA: Reliable Brain Age Estimation Across Multiple MRI Sequences and Resolutions

Authors: Lemuel Puglisi, Alessia Rondinella, Linda De Meo, Francesco Guarnera, Sebastiano Battiato, Daniele Ravì

Abstract: Brain age is a critical measure that reflects the biological ageing process of the brain. The gap between brain age and chronological age, referred to as brain PAD (Predicted Age Difference), has been utilized to investigate neurodegenerative conditions. Brain age can be predicted using MRIs and machine learning techniques. However, existing methods are often sensitive to acquisition-related varia… ▽ More Brain age is a critical measure that reflects the biological ageing process of the brain. The gap between brain age and chronological age, referred to as brain PAD (Predicted Age Difference), has been utilized to investigate neurodegenerative conditions. Brain age can be predicted using MRIs and machine learning techniques. However, existing methods are often sensitive to acquisition-related variabilities, such as differences in acquisition protocols, scanners, MRI sequences, and resolutions, significantly limiting their application in highly heterogeneous clinical settings. In this study, we introduce Synthetic Brain Age (SynthBA), a robust deep-learning model designed for predicting brain age. SynthBA utilizes an advanced domain randomization technique, ensuring effective operation across a wide array of acquisition-related variabilities. To assess the effectiveness and robustness of SynthBA, we evaluate its predictive capabilities on internal and external datasets, encompassing various MRI sequences and resolutions, and compare it with state-of-the-art techniques. Additionally, we calculate the brain PAD in a large cohort of subjects with Alzheimer's Disease (AD), demonstrating a significant correlation with AD-related measures of cognitive dysfunction. SynthBA holds the potential to facilitate the broader adoption of brain age prediction in clinical settings, where re-training or fine-tuning is often unfeasible. The SynthBA source code and pre-trained models are publicly available at https://github.com/LemuelPuglisi/SynthBA. △ Less

Submitted 1 June, 2024; originally announced June 2024.

arXiv:2406.00172 [pdf, other]

Dissecting the Crab Nebula with JWST: Pulsar wind, dusty filaments, and Ni/Fe abundance constraints on the explosion mechanism

Authors: Tea Temim, J. Martin Laming, P. J. Kavanagh, Nathan Smith, Patrick Slane, William P. Blair, Ilse De Looze, Niccolò Bucciantini, Anders Jerkstrand, Nicole Marcelina Gountanis, Ravi Sankrit, Dan Milisavljevic, Armin Rest, Maxim Lyutikov, Joseph DePasquale, Thomas Martin, Laurent Drissen, John Raymond, Ori D. Fox, Maryam Modjaz, Anatoly Spitkovsky, Lou Strolger

Abstract: We present JWST observations of the Crab Nebula, the iconic remnant of the historical SN 1054. The observations include NIRCam and MIRI imaging mosaics, plus MIRI/MRS IFU spectra that probe two select locations within the ejecta filaments. We derive a high-resolution map of dust emission and show that the grains are concentrated in the innermost, high-density filaments. These dense filaments coinc… ▽ More We present JWST observations of the Crab Nebula, the iconic remnant of the historical SN 1054. The observations include NIRCam and MIRI imaging mosaics, plus MIRI/MRS IFU spectra that probe two select locations within the ejecta filaments. We derive a high-resolution map of dust emission and show that the grains are concentrated in the innermost, high-density filaments. These dense filaments coincide with multiple synchrotron bays around the periphery of the Crab's pulsar wind nebula (PWN). We measure synchrotron spectral index changes in small-scale features within the PWN's torus region, including the well-known knot and wisp structures. The index variations are consistent with Doppler boosting of emission from particles with a broken power-law distribution, providing the first direct evidence that the curvature in the particle injection spectrum is tied to the acceleration mechanism at the termination shock. We detect multiple nickel and iron lines in the ejecta filaments and use photoionization models to derive nickel-to-iron abundance ratios that are a factor of 3-8 higher than the solar ratio. We also find that the previously reported order-of-magnitude higher Ni/Fe values from optical data are consistent with the lower values from JWST when we reanalyze the optical emission using updated atomic data and account for local extinction from dust. We discuss the implications of our results for understanding the nature of the explosion that produced the Crab Nebula and conclude that the observational properties are most consistent with a low-mass iron-core-collapse supernova, even though an electron-capture explosion cannot be ruled out. △ Less

Submitted 31 May, 2024; originally announced June 2024.

Comments: 32 pages, 3 tables, 20 figures, accepted for publication in ApJL

arXiv:2406.00066 [pdf, other]

Estimates on the domain of validity for Lyapunov-Schmidt reduction

Authors: Pranav Gupta, Anastasia Bizyaeva, Ravi Banavar

Abstract: Lyapunov-Schmidt reduction is a dimensionality reduction technique in nonlinear systems analysis that is commonly utilised in the study of bifurcation problems in high-dimensional systems. The method is a systematic procedure for reducing the dimensionality of systems of algebraic equations that have singular points, preserving essential features of their solution sets. In this article, we establi… ▽ More Lyapunov-Schmidt reduction is a dimensionality reduction technique in nonlinear systems analysis that is commonly utilised in the study of bifurcation problems in high-dimensional systems. The method is a systematic procedure for reducing the dimensionality of systems of algebraic equations that have singular points, preserving essential features of their solution sets. In this article, we establish estimates for the region of validity of the reduction by applying bounds on the implicit function theorem derived in [https://doi.org/10.1007/s00498-023-00370-5]. We then apply these bounds to an illustrative example of a two-dimensional system with a pitchfork bifurcation. △ Less

Submitted 30 May, 2024; originally announced June 2024.

Comments: An abbreviated version of this manuscript has been submitted to the IEEE Conference on Decision and Control taking place in Milan, in December 2024

arXiv:2405.20207 [pdf, other]

The energy shear of protohaloes

Authors: Marcello Musso, Giulia Despali, Ravi K. Sheth

Abstract: As it collapses to form a halo, the shape of a protohalo patch is deformed by the initial shear field. This deformation is often modeled using the "deformation" tensor, constructed from second derivatives of the gravitational potential, whose trace gives the initial overdensity. However, especially for lower mass protohalos, this matrix is not always positive definite: one of its eigenvalues has a… ▽ More As it collapses to form a halo, the shape of a protohalo patch is deformed by the initial shear field. This deformation is often modeled using the "deformation" tensor, constructed from second derivatives of the gravitational potential, whose trace gives the initial overdensity. However, especially for lower mass protohalos, this matrix is not always positive definite: one of its eigenvalues has a different sign from the others. We show that the evolution of a patch is better described by the "energy shear" tensor, which is positive definite and plays a direct role in the evolution. This positive-definiteness simplifies models of halo abundances, assembly and of the cosmic web. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 9 pages, 10 figures

arXiv:2405.19700 [pdf, other]

Initial measurement of reactor antineutrino oscillation at SNO+

Authors: SNO+ Collaboration, :, A. Allega, M. R. Anderson, S. Andringa, M. Askins, D. J. Auty, A. Bacon, J. Baker, F. Barão, N. Barros, R. Bayes, E. W. Beier, T. S. Bezerra, A. Bialek, S. D. Biller, E. Blucher, E. Caden, E. J. Callaghan, M. Chen, S. Cheng, B. Cleveland, D. Cookman, J. Corning, M. A. Cox , et al. (96 additional authors not shown)

Abstract: The SNO+ collaboration reports its first spectral analysis of long-baseline reactor antineutrino oscillation using 114 tonne-years of data. Fitting the neutrino oscillation probability to the observed energy spectrum yields constraints on the neutrino mass-squared difference $Δm^2_{21}$. In the ranges allowed by previous measurements, the best-fit $Δm^2_{21}$ is (8.85$^{+1.10}_{-1.33}$) $\times$ 1… ▽ More The SNO+ collaboration reports its first spectral analysis of long-baseline reactor antineutrino oscillation using 114 tonne-years of data. Fitting the neutrino oscillation probability to the observed energy spectrum yields constraints on the neutrino mass-squared difference $Δm^2_{21}$. In the ranges allowed by previous measurements, the best-fit $Δm^2_{21}$ is (8.85$^{+1.10}_{-1.33}$) $\times$ 10$^{-5}$ eV$^2$. This measurement is continuing in the next phases of SNO+ and is expected to surpass the present global precision on $Δm^2_{21}$ with about three years of data. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Showing 1–50 of 2,894 results for author: Ravi