-
Insulator-to-Metal Transition and Isotropic Gigantic Magnetoresistance in Layered Magnetic Semiconductors
Authors:
Gokul Acharya,
Bimal Neupane,
Chia-Hsiu Hsu,
Xian P. Yang,
David Graf,
Eun Sang Choi,
Krishna Pandey,
Md Rafique Un Nabi,
Santosh Karki Chhetri,
Rabindra Basnet,
Sumaya Rahman,
Jian Wang,
Zhengxin Hu,
Bo Da,
Hugh Churchill,
Guoqing Chang,
M. Zahid Hasan,
Yuanxi Wang,
** Hu
Abstract:
Magnetotransport, the response of electrical conduction to external magnetic field, acts as an important tool to reveal fundamental concepts behind exotic phenomena and plays a key role in enabling spintronic applications. Magnetotransport is generally sensitive to magnetic field orientations. In contrast, efficient and isotropic modulation of electronic transport, which is useful in technology ap…
▽ More
Magnetotransport, the response of electrical conduction to external magnetic field, acts as an important tool to reveal fundamental concepts behind exotic phenomena and plays a key role in enabling spintronic applications. Magnetotransport is generally sensitive to magnetic field orientations. In contrast, efficient and isotropic modulation of electronic transport, which is useful in technology applications such as omnidirectional sensing, is rarely seen, especially for pristine crystals. Here we propose a strategy to realize extremely strong modulation of electron conduction by magnetic field which is independent of field direction. GdPS, a layered antiferromagnetic semiconductor with resistivity anisotropies, supports a field-driven insulator-to-metal transition with a paradoxically isotropic gigantic negative magnetoresistance insensitive to magnetic field orientations. This isotropic magnetoresistance originates from the combined effects of a near-zero spin-orbit coupling of Gd3+-based half-filling f-electron system and the strong on-site f-d exchange coupling in Gd atoms. Our results not only provide a novel material system with extraordinary magnetotransport that offers a missing block for antiferromagnet-based ultrafast and efficient spintronic devices, but also demonstrate the key ingredients for designing magnetic materials with desired transport properties for advanced functionalities.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Optimization study of a Z-type airflow cooling system of a lithium-ion battery pack
Authors:
Santosh Argade,
Ashoke De
Abstract:
The present study aims to optimize the structural design of a Z-type flow lithium-ion battery pack with a forced air-cooling system (FACS) known as BTMS (Battery Thermal Management System). The main goal is to minimize Tmax (maximum temperature) and δTmax (maximum temperature difference) while ensuring an even airflow distribution within the battery module. The present study thoroughly investigate…
▽ More
The present study aims to optimize the structural design of a Z-type flow lithium-ion battery pack with a forced air-cooling system (FACS) known as BTMS (Battery Thermal Management System). The main goal is to minimize Tmax (maximum temperature) and δTmax (maximum temperature difference) while ensuring an even airflow distribution within the battery module. The present study thoroughly investigates critical factors such as the inlet air velocity, tapered inlet manifold, and the number of secondary outlets to evaluate their impact on thermal performance and airflow uniformity within the battery module. Increasing the inlet air velocity from 3 to 4.5 m/s significantly improves the thermal cooling performance of the BTMS, resulting in a decrease of 4.57 °C (10.05%) in Tmax and 0.29 °C (9.79%) in δTmax compared to the original 3 m/s velocity. Further, the study assesses the significance of a tapered inlet manifold as a critical factor, revealing its substantial impact on cooling performance and temperature reductions in battery cells 3-9. It also facilitates a more uniform airflow distribution, decreasing the velocity difference between channel 9 and channel 1 from 3.32 m/s to 2.50 m/s. Incorporating 7 secondary outlets significantly improves the heat dissipation ability of the BTMS, resulting in a decrease of 0.894 °C (2.18%) in Tmax and 2.23 °C (72.84%) in δTmax compared to the configuration with 0 secondary outlets. By optimizing these parameters, the aim is to enhance BTMS's capabilities, improving LIB packs' performance and reliability.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
Authors:
Hui Wei,
Maxwell A. Xu,
Colin Samplawski,
James M. Rehg,
Santosh Kumar,
Benjamin M. Marlin
Abstract:
Wearable sensors enable health researchers to continuously collect data pertaining to the physiological state of individuals in real-world settings. However, such data can be subject to extensive missingness due to a complex combination of factors. In this work, we study the problem of imputation of missing step count data, one of the most ubiquitous forms of wearable sensor data. We construct a n…
▽ More
Wearable sensors enable health researchers to continuously collect data pertaining to the physiological state of individuals in real-world settings. However, such data can be subject to extensive missingness due to a complex combination of factors. In this work, we study the problem of imputation of missing step count data, one of the most ubiquitous forms of wearable sensor data. We construct a novel and large scale data set consisting of a training set with over 3 million hourly step count observations and a test set with over 2.5 million hourly step count observations. We propose a domain knowledge-informed sparse self-attention model for this task that captures the temporal multi-scale nature of step-count data. We assess the performance of the model relative to baselines and conduct ablation studies to verify our specific model designs.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Chandra detects low-luminosity AGN with $M_\mathrm{BH}=10^{4}-10^{6}~M_\mathrm{\odot}$ in nearby ($z<0.5$), dwarf and star-forming galaxies
Authors:
Mainak Singha,
Julissa Sarmiento,
Sangeeta Malhotra,
James E. Rhoads,
L. Y. Aaron Yung,
Junxian Wang,
Zhen-Ya Zheng,
Ruqiu Lin,
Keunho Kim,
Jialai Kang,
Santosh Harish
Abstract:
We searched the Chandra and XMM archives for observations of 900 green pea galaxies to find AGN signatures. Green peas are low-mass galaxies with prominent emission lines, similar in size and star formation rate to high-redshift dwarf galaxies. Of the 29 observations found, 9 show X-ray detections with $S/N>3$. The 2-10 keV X-ray luminosity for these 9 sources exceeds…
▽ More
We searched the Chandra and XMM archives for observations of 900 green pea galaxies to find AGN signatures. Green peas are low-mass galaxies with prominent emission lines, similar in size and star formation rate to high-redshift dwarf galaxies. Of the 29 observations found, 9 show X-ray detections with $S/N>3$. The 2-10 keV X-ray luminosity for these 9 sources exceeds $10^{40}~\mathrm{erg~s}^{-1}$, with 2 sources exceeding $10^{41}~\mathrm{erg~s}^{-1}$, suggesting the presence of intermediate-mass black holes (IMBH) or low-luminosity AGN (LLAGN) with BH masses between $100-10^6M_\mathrm{\odot}$. All X-ray detected sources (plus 6 additional sources) show He~II$\lambda4686$ emission and a broad component of the H$α$ emission line, indicating winds. The line widths of the broad H$α$ and He II$\lambda4686$ emitting gas clouds are weakly correlated ($R^{2}=0.15$), suggesting He II$\lambda4686$ emission is inconsistent with winds from super-Eddington accretors. However, the ratio of X-ray luminosity to star formation rate shows an anti-correlation with metallicity in 5 out of 9 X-ray detected sources, implying ultraluminous X-ray sources are key contributors to the observed X-ray luminosity. This could be due to super-Eddington accretors or IMBH. The X-ray emission is much higher than that produced by Wolf-Rayet stars and supernovae-driven winds. Thus, the X-ray luminosity in these 9 sources can only be explained by black holes with masses over $100~M_\mathrm{\odot}$. Our findings suggest the presence of LLAGN in these galaxies, with broad H$α$ line widths implying BH masses of $10^4-10^6M_\mathrm{\odot}$. Given Green Peas' role as significant Lyman Continuum leakers, LLAGN in these galaxies could have contributed significantly to cosmic reionization.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Coordination of Transmission and Distribution Systems in Load Restoration
Authors:
Santosh Sharma
Abstract:
The power distribution system is evolving in the form of an intelligent grid. The proliferation of distributed energy resources (DERs) makes the previously passive system active and more complicated. With the adoption of de-carbonization principles, large-scale coal and nuclear power plants are being gradually replaced by renewables and carbon-free DERs. With this rapid transformation, the power s…
▽ More
The power distribution system is evolving in the form of an intelligent grid. The proliferation of distributed energy resources (DERs) makes the previously passive system active and more complicated. With the adoption of de-carbonization principles, large-scale coal and nuclear power plants are being gradually replaced by renewables and carbon-free DERs. With this rapid transformation, the power system operates with less inertia and minimal margins. In recent years, power systems have been facing apocalyptic weather events more frequently, and large-scale blackouts have become regular. After the complete or partial blackouts, the power system goes through different stages before it reaches the normal operating condition. The load restoration is the stage where the power system is fully established after the blackouts; however, due to the limiting ram** rates of centralized generation, the energization of large amounts of loads is delayed by some time. To mitigate the negative impact of ram** rates of centralized generation, DERs in distribution systems are proposed to serve the loads in both transmission and distribution systems in coordination with limited centralized generation in the transmission system. The problem is formulated as a centralized or integrated transmission and distribution (T$\&$D) coordination model. The modified IEEE 14 bus test case and IEEE 13 node test feeders are used to validate the proposed strategy; the results indicate the validity of the proposed model.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
A Scotogenic model with U(1) symmetry and a scalar dark matter
Authors:
Anjan Kumar Barik,
Najimuddin Khan,
Santosh Kumar Rai
Abstract:
We study a scotogenic model augmented with an additional U(1) gauge and a discrete Z2 symmetry. The lightest Z2-odd particle in our model becomes the dark matter (DM) candidate while tiny neutrino masses are realized at one loop. We explore the parameter space of the model for which the DM relic density is satisfied, and the correct low-energy neutrino observables are reproduced. The extended gaug…
▽ More
We study a scotogenic model augmented with an additional U(1) gauge and a discrete Z2 symmetry. The lightest Z2-odd particle in our model becomes the dark matter (DM) candidate while tiny neutrino masses are realized at one loop. We explore the parameter space of the model for which the DM relic density is satisfied, and the correct low-energy neutrino observables are reproduced. The extended gauge symmetry includes beyond Standard Model (SM) particle spectrum consisting of vector-like fermions and scalars. We also highlight possible collider signatures of these particles at the LHC.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Does GPT Really Get It? A Hierarchical Scale to Quantify Human vs AI's Understanding of Algorithms
Authors:
Mirabel Reid,
Santosh S. Vempala
Abstract:
As Large Language Models (LLMs) perform (and sometimes excel at) more and more complex cognitive tasks, a natural question is whether AI really understands. The study of understanding in LLMs is in its infancy, and the community has yet to incorporate well-trodden research in philosophy, psychology, and education. We initiate this, specifically focusing on understanding algorithms, and propose a h…
▽ More
As Large Language Models (LLMs) perform (and sometimes excel at) more and more complex cognitive tasks, a natural question is whether AI really understands. The study of understanding in LLMs is in its infancy, and the community has yet to incorporate well-trodden research in philosophy, psychology, and education. We initiate this, specifically focusing on understanding algorithms, and propose a hierarchy of levels of understanding. We use the hierarchy to design and conduct a study with human subjects (undergraduate and graduate students) as well as large language models (generations of GPT), revealing interesting similarities and differences. We expect that our rigorous criteria will be useful to keep track of AI's progress in such cognitive domains.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Single Channel-based Motor Imagery Classification using Fisher's Ratio and Pearson Correlation
Authors:
Sonal Santosh Baberwal,
Tomas Ward,
Shirley Coyle
Abstract:
Motor imagery-based BCI systems have been promising and gaining popularity in rehabilitation and Activities of daily life(ADL). Despite this, the technology is still emerging and has not yet been outside the laboratory constraints. Channel reduction is one contributing avenue to make these systems part of ADL. Although Motor Imagery classification heavily depends on spatial factors, single channel…
▽ More
Motor imagery-based BCI systems have been promising and gaining popularity in rehabilitation and Activities of daily life(ADL). Despite this, the technology is still emerging and has not yet been outside the laboratory constraints. Channel reduction is one contributing avenue to make these systems part of ADL. Although Motor Imagery classification heavily depends on spatial factors, single channel-based classification remains an avenue to be explored thoroughly. Since Fisher's ratio and Pearson Correlation are powerful measures actively used in the domain, we propose an integrated framework (FRPC integrated framework) that integrates Fisher's Ratio to select the best channel and Pearson correlation to select optimal filter banks and extract spectral and temporal features respectively. The framework is tested for a 2-class motor imagery classification on 2 open-source datasets and 1 collected dataset and compared with state-of-art work. Apart from implementing the framework, this study also explores the most optimal channel among all the subjects and later explores classes where the single-channel framework is efficient.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Charm and Bottom Hadrons in Hot Hadronic Matter
Authors:
Santosh K. Das,
Juan M. Torres-Rincon,
Ralf Rapp
Abstract:
Heavy quarks, and the hadrons containing them, are excellent probes of the QCD medium formed in high-energy heavy-ion collisions, as they provide direct information on the transport properties of the medium and how quarks color-neutralize into hadrons. Large theoretical and phenomenological efforts have been dedicated thus far to assess the diffusion of charm and bottom quarks in the quark-gluon p…
▽ More
Heavy quarks, and the hadrons containing them, are excellent probes of the QCD medium formed in high-energy heavy-ion collisions, as they provide direct information on the transport properties of the medium and how quarks color-neutralize into hadrons. Large theoretical and phenomenological efforts have been dedicated thus far to assess the diffusion of charm and bottom quarks in the quark-gluon plasma and their subsequent hadronization into heavy-flavor (HF) hadrons. However, the fireball formed in heavy-ion collisions also features an extended hadronic phase, and therefore any quantitative analysis of experimental observables needs to account for rescattering of charm and bottom hadrons. This is further reinforced by the presence of a QCD cross-over transition and the notion that the interaction strength is maximal in the vicinity of the pseudo-critical temperature. We review existing approaches for evaluating the interactions of open HF hadrons in a hadronic heat bath and the pertinent results for scattering amplitudes, spectral functions and transport coefficients. While most of the work to date has focused on $D$ mesons, we also discuss excited states as well as HF baryons and the bottom sector. Both the HF hadro-chemistry and bottom observables will play a key role in future experimental measurements. We also conduct a survey of transport calculations in heavy-ion collisions that have included effects of hadronic HF diffusion and assess their sensitivity to various observables.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Machine learning evaluation in the Global Event Processor FPGA for the ATLAS trigger upgrade
Authors:
Zhixing Jiang,
Scott Hauck,
Dennis Yin,
Bowen Zuo,
Ben Carlson,
Shih-Chieh Hsu,
Allison Deiana,
Rohin Narayan,
Santosh Parajuli,
Jeff Eastlack
Abstract:
The Global Event Processor (GEP) FPGA is an area-constrained, performance-critical element of the Large Hadron Collider's (LHC) ATLAS experiment. It needs to very quickly determine which small fraction of detected events should be retained for further processing, and which other events will be discarded. This system involves a large number of individual processing tasks, brought together within th…
▽ More
The Global Event Processor (GEP) FPGA is an area-constrained, performance-critical element of the Large Hadron Collider's (LHC) ATLAS experiment. It needs to very quickly determine which small fraction of detected events should be retained for further processing, and which other events will be discarded. This system involves a large number of individual processing tasks, brought together within the overall Algorithm Processing Platform (APP), to make filtering decisions at an overall latency of no more than 8ms. Currently, such filtering tasks are hand-coded implementations of standard deterministic signal processing tasks.
In this paper we present methods to automatically create machine learning based algorithms for use within the APP framework, and demonstrate several successful such deployments. We leverage existing machine learning to FPGA flows such as hls4ml and fwX to significantly reduce the complexity of algorithm design. These have resulted in implementations of various machine learning algorithms with latencies of 1.2us and less than 5% resource utilization on an Xilinx XCVU9P FPGA. Finally, we implement these algorithms into the GEP system and present their actual performance.
Our work shows the potential of using machine learning in the GEP for high-energy physics applications. This can significantly improve the performance of the trigger system and enable the ATLAS experiment to collect more data and make more discoveries. The architecture and approach presented in this paper can also be applied to other applications that require real-time processing of large volumes of data.
△ Less
Submitted 7 May, 2024;
originally announced June 2024.
-
Optimal Bailouts in Diversified Financial Networks
Authors:
Krishna Dasaratha,
Santosh Venkatesh,
Rakesh Vohra
Abstract:
Widespread default involves substantial deadweight costs which could be countered by injecting capital into failing firms. Injections have positive spillovers that can trigger a repayment cascade. But which firms should a regulator bailout so as to minimize the total injection of capital while ensuring solvency of all firms? While the problem is, in general, NP-hard, for a wide range of networks t…
▽ More
Widespread default involves substantial deadweight costs which could be countered by injecting capital into failing firms. Injections have positive spillovers that can trigger a repayment cascade. But which firms should a regulator bailout so as to minimize the total injection of capital while ensuring solvency of all firms? While the problem is, in general, NP-hard, for a wide range of networks that arise from a stochastic block model, we show that the optimal bailout can be implemented by a simple policy that targets firms based on their characteristics and position in the network. Specific examples of the setting include core-periphery networks.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Towards Supporting Legal Argumentation with NLP: Is More Data Really All You Need?
Authors:
T. Y. S. S Santosh,
Kevin D. Ashley,
Katie Atkinson,
Matthias Grabmair
Abstract:
Modeling legal reasoning and argumentation justifying decisions in cases has always been central to AI & Law, yet contemporary developments in legal NLP have increasingly focused on statistically classifying legal conclusions from text. While conceptually simpler, these approaches often fall short in providing usable justifications connecting to appropriate legal concepts. This paper reviews both…
▽ More
Modeling legal reasoning and argumentation justifying decisions in cases has always been central to AI & Law, yet contemporary developments in legal NLP have increasingly focused on statistically classifying legal conclusions from text. While conceptually simpler, these approaches often fall short in providing usable justifications connecting to appropriate legal concepts. This paper reviews both traditional symbolic works in AI & Law and recent advances in legal NLP, and distills possibilities of integrating expert-informed knowledge to strike a balance between scalability and explanation in symbolic vs. data-driven approaches. We identify open challenges and discuss the potential of modern NLP models and methods that integrate
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Accretion Geometry of GX 339-4 in the Hard State: AstroSat View
Authors:
Swadesh Chand,
Gulab C. Dewangan,
Andrzej A. Zdziarski,
Dipankar Bhattacharya,
N. P. S. Mithun,
Santosh V. Vadawale
Abstract:
We perform broadband ($0.7-100$ keV) spectral analysis of five hard state observations of the low-mass back hole X-ray binary GX~339--4 taken by AstroSat during the rising phase of three outbursts from $2019$ to $2022$. We find that the outburst in 2021 was the only successful/full outburst, while the source was unable to make transition to the soft state during the other two outbursts in 2019 and…
▽ More
We perform broadband ($0.7-100$ keV) spectral analysis of five hard state observations of the low-mass back hole X-ray binary GX~339--4 taken by AstroSat during the rising phase of three outbursts from $2019$ to $2022$. We find that the outburst in 2021 was the only successful/full outburst, while the source was unable to make transition to the soft state during the other two outbursts in 2019 and 2022. Our spectral analysis employs two different model combinations, requiring two separate Comptonizing regions and their associated reflection components, and soft X-ray excess emission. The harder Comptonizing component dominates the overall bolometric luminosity, while the softer one remains relatively weak. Our spectral fits indicate that the disk evolves with the source luminosity, where the inner disk radius decreases with increasing luminosity. However, the disk remains substantially truncated throughout all the observations at the source luminosity of $\sim2-8\%\times$ of the Eddington luminosity. We note that our assumption of the soft X-ray excess emission as disk blackbody may not be realistic, and this kind of soft excess may arise due the non-homogeneity in the disk/corona geometry. Our temporal analysis deriving the power density spectra suggests that the break frequency increases with the source luminosity. Furthermore, our analysis demonstrates a consistency between the inner disk radii estimated from break frequency of the power density spectra and those obtained from the reflection modelling, supporting the truncated disk geometry in the hard state.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
COSMOS-Web: The over-abundance and physical nature of "little red dots"--Implications for early galaxy and SMBH assembly
Authors:
Hollis B. Akins,
Caitlin M. Casey,
Erini Lambrides,
Natalie Allen,
Irham T. Andika,
Malte Brinch,
Jaclyn B. Champagne,
Olivia Cooper,
Xuheng Ding,
Nicole E. Drakos,
Andreas Faisst,
Steven L. Finkelstein,
Maximilien Franco,
Seiji Fujimoto,
Fabrizio Gentile,
Steven Gillman,
Ghassem Gozaliasl,
Santosh Harish,
Christopher C. Hayward,
Michaela Hirschmann,
Olivier Ilbert,
Jeyhan S. Kartaltepe,
Dale D. Kocevski,
Anton M. Koekemoer,
Vasily Kokorev
, et al. (16 additional authors not shown)
Abstract:
JWST has revealed a population of compact and extremely red galaxies at $z>4$, which likely host active galactic nuclei (AGN). We present a sample of 434 ``little red dots'' (LRDs), selected from the 0.54 deg$^2$ COSMOS-Web survey. We fit galaxy and AGN SED models to derive redshifts and physical properties; the sample spans $z\sim5$-$9$ after removing brown dwarf contaminants. We consider two ext…
▽ More
JWST has revealed a population of compact and extremely red galaxies at $z>4$, which likely host active galactic nuclei (AGN). We present a sample of 434 ``little red dots'' (LRDs), selected from the 0.54 deg$^2$ COSMOS-Web survey. We fit galaxy and AGN SED models to derive redshifts and physical properties; the sample spans $z\sim5$-$9$ after removing brown dwarf contaminants. We consider two extreme physical scenarios: either LRDs are all AGN, and their continuum emission is dominated by the accretion disk, or they are all compact star-forming galaxies, and their continuum is dominated by stars. If LRDs are AGN-dominated, our sample exhibits bolometric luminosities $\sim10^{45-47}$ erg\,s$^{-1}$, spanning the gap between JWST AGN in the literature and bright, rare quasars. We derive a bolometric luminosity function (LF) $\sim100$ times the (UV-selected) quasar LF, implying a non-evolving black hole accretion density of $\sim10^{-4}$ M$_\odot$ yr$^{-1}$ Mpc$^{-3}$ from $z\sim2$-$9$. By contrast, if LRDs are dominated by star formation, we derive stellar masses $\sim10^{8.5-10}\,M_\odot$. MIRI/F770W is key to deriving accurate stellar masses; without it, we derive a mass function inconsistent with $Λ$CDM. The median stellar mass profile is broadly consistent with the maximal stellar mass surface densities seen in the nearby universe, though the most massive $\sim50$\% of objects exceed this limit, requiring substantial AGN contribution to the continuum. Nevertheless, stacking all available X-ray, mid-IR, far-IR/sub-mm, and radio data yields non-detections. Whether dominated by dusty AGN, compact star-formation, or both, the high masses/luminosities and remarkable abundance of LRDs implies a dominant mode of early galaxy/SMBH growth.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator
Authors:
Mandovi Mukherjee,
Xiangyu Mao,
Nael Rahman,
Coleman DeLude,
Joe Driscoll,
Sudarshan Sharma,
Payman Behnam,
Uday Kamal,
Jongseok Woo,
Daehyun Kim,
Sharjeel Khan,
Jianming Tong,
Jamin Seo,
Prachi Sinha,
Madhavan Swaminathan,
Tushar Krishna,
Santosh Pande,
Justin Romberg,
Saibal Mukhopadhyay
Abstract:
A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous…
▽ More
A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous control to extract concurrency in compute as well as low latency. It achieves a $518$ MHz per channel bandwidth in a prototype $4$-node system. The maximum emulation range supported in this paradigm is $9.5$ km with $0.24$ $μ$s of per-sample emulation latency. 2). The FPGA-based implementation, evaluated on a Xilinx ZCU104 board, demonstrates a $9$-node test case (two Transmitters, one Receiver, and $6$ passive reflectors) with an emulation range of $1.13$ km to $27.3$ km at $215$ MHz bandwidth.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Real-time Digital RF Emulation -- I: The Direct Path Computational Model
Authors:
Coleman DeLude,
Joe Driscoll,
Mandovi Mukherjee,
Nael Rahman,
Uday Kamal,
Xiangyu Mao,
Sharjeel Khan,
Hariharan Sivaraman,
Eric Huang,
Jeffrey McHarg,
Madhavan Swaminathan,
Santosh Pande,
Saibal Mukhopadhyay,
Justin Romberg
Abstract:
In this paper we consider the problem of develo** a computational model for emulating an RF channel. The motivation for this is that an accurate and scalable emulator has the potential to minimize the need for field testing, which is expensive, slow, and difficult to replicate. Traditionally, emulators are built using a tapped delay line model where long filters modeling the physical interaction…
▽ More
In this paper we consider the problem of develo** a computational model for emulating an RF channel. The motivation for this is that an accurate and scalable emulator has the potential to minimize the need for field testing, which is expensive, slow, and difficult to replicate. Traditionally, emulators are built using a tapped delay line model where long filters modeling the physical interactions of objects are implemented directly. For an emulation scenario consisting of $M$ objects all interacting with one another, the tapped delay line model's computational requirements scale as $O(M^3)$ per sample: there are $O(M^2)$ channels, each with $O(M)$ complexity. In this paper, we develop a new ``direct path" model that, while remaining physically faithful, allows us to carefully factor the emulator operations, resulting in an $O(M^2)$ per sample scaling of the computational requirements. The impact of this is drastic, a $200$ object scenario sees about a $100\times$ reduction in the number of per sample computations. Furthermore, the direct path model gives us a natural way to distribute the computations for an emulation: each object is mapped to a computational node, and these nodes are networked in a fully connected communication graph.
Alongside a discussion of the model and the physical phenomena it emulates, we show how to efficiently parameterize antenna responses and scattering profiles within this direct path framework. To verify the model and demonstrate its viability in hardware, we provide several numerical experiments produced using a cycle level C++ simulator of a hardware implementation of the model.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Report on laser-induced fluorescence transitions relevant for the microelectronics industry and sustainability applications
Authors:
V. S. Santosh K. Kondeti,
Shurik Yatom,
Ivan Romadanov,
Yevgeny Raitses,
Leonid Dorf,
Andrei Khomenko
Abstract:
A wide variety of feed gases are used to generate low-temperature plasmas for the microelectronics and the sustainability applications. These plasmas often have a complex combination of reactive and non-reactive species which may have spatial and temporal variations in the density, the temperature and the energy. Accurate knowledge of these parameters and their variations is critically important f…
▽ More
A wide variety of feed gases are used to generate low-temperature plasmas for the microelectronics and the sustainability applications. These plasmas often have a complex combination of reactive and non-reactive species which may have spatial and temporal variations in the density, the temperature and the energy. Accurate knowledge of these parameters and their variations is critically important for understanding and advancing these applications through validated and predictive modeling and design of relevant devices. Laser-induced fluorescence (LIF) provides both spatial and temporally resolved information about the plasma-produced radicals, ions, and metastables. However, the use of this powerful diagnostic tool requires the knowledge of optical transitions including excitation and fluorescence wavelengths which may not be available or scattered through a huge literature domain. In this manuscript, we collected, analyzed and compiled the available transitions for laser-induced fluorescence for more than 160 chemical species relevant to the microelectronics industry and the sustainability applications. A list of species with overlap** LIF excitation and fluorescence wavelengths have been identified. This summary is intended to serve as a data reference for LIF transitions and should be updated in the future.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Coin-Flip** In The Brain: Statistical Learning with Neuronal Assemblies
Authors:
Max Dabagia,
Daniel Mitropolsky,
Christos H. Papadimitriou,
Santosh S. Vempala
Abstract:
How intelligence arises from the brain is a central problem in science. A crucial aspect of intelligence is dealing with uncertainty -- develo** good predictions about one's environment, and converting these predictions into decisions. The brain itself seems to be noisy at many levels, from chemical processes which drive development and neuronal activity to trial variability of responses to stim…
▽ More
How intelligence arises from the brain is a central problem in science. A crucial aspect of intelligence is dealing with uncertainty -- develo** good predictions about one's environment, and converting these predictions into decisions. The brain itself seems to be noisy at many levels, from chemical processes which drive development and neuronal activity to trial variability of responses to stimuli. One hypothesis is that the noise inherent to the brain's mechanisms is used to sample from a model of the world and generate predictions. To test this hypothesis, we study the emergence of statistical learning in NEMO, a biologically plausible computational model of the brain based on stylized neurons and synapses, plasticity, and inhibition, and giving rise to assemblies -- a group of neurons whose coordinated firing is tantamount to recalling a location, concept, memory, or other primitive item of cognition. We show in theory and simulation that connections between assemblies record statistics, and ambient noise can be harnessed to make probabilistic choices between assemblies. This allows NEMO to create internal models such as Markov chains entirely from the presentation of sequences of stimuli. Our results provide a foundation for biologically plausible probabilistic computation, and add theoretical support to the hypothesis that noise is a useful component of the brain's mechanism for cognition.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Realization of higher coordinated Er in high-pressure cotunnite phase of Er$_2$Ti$_2$O$_7$
Authors:
M. Modak,
Rahul Kaiwart,
Santosh K. Gupta,
A. Dwivedi,
K. K. Pandey,
A. K. Poswal,
H. K. Poswal
Abstract:
In this article we report the structural stability of Er$_2$Ti$_2$O$_7$ cubic pyrochlore with pressure using x-ray diffraction, Raman spectroscopy, photoluminescence, x-ray absorption and ab-initio calculations. Our studies establish a phase transformation in Er$_2$Ti$_2$O$_7$ from ambient cubic phase to high-pressure orthorhombic (cotunnite) phase, initiated at ~40 GPa. The transformation is slug…
▽ More
In this article we report the structural stability of Er$_2$Ti$_2$O$_7$ cubic pyrochlore with pressure using x-ray diffraction, Raman spectroscopy, photoluminescence, x-ray absorption and ab-initio calculations. Our studies establish a phase transformation in Er$_2$Ti$_2$O$_7$ from ambient cubic phase to high-pressure orthorhombic (cotunnite) phase, initiated at ~40 GPa. The transformation is sluggish and it does not complete even at the highest measured pressure in our study i.e. ~60.0 GPa. This is further supported by the first principle calculations which reveal that cotunnite phase is energetically more stable than the ambient phase above ~53 GPa. After complete release of pressure, the high-pressure cotunnite phase is retained while the fraction of untransformed pyrochlore phase becomes amorphous. Furthermore, the EXAFS data of the recovered sample at L3 edge of Er3+ ion show an increase in the coordination number of cations from eight at ambient to nine in the high-pressure phase. The mechanism of structural transformation is explained in terms of accumulation of cation antisite defects and subsequent disordering of cations and anions in their respective sublattice. The amorphization of the pyrochlore phase upon release is interpreted as the inability of accommodating the point defects at ambient conditions, which are formed in the pyrochlore lattice under compression.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Authors:
Li Lin,
Santosh,
Xin Wang,
Shu Hu
Abstract:
AI-generated faces have enriched human life, such as entertainment, education, and art. However, they also pose misuse risks. Therefore, detecting AI-generated faces becomes crucial, yet current detectors show biased performance across different demographic groups. Mitigating biases can be done by designing algorithmic fairness methods, which usually require demographically annotated face datasets…
▽ More
AI-generated faces have enriched human life, such as entertainment, education, and art. However, they also pose misuse risks. Therefore, detecting AI-generated faces becomes crucial, yet current detectors show biased performance across different demographic groups. Mitigating biases can be done by designing algorithmic fairness methods, which usually require demographically annotated face datasets for model training. However, no existing dataset comprehensively encompasses both demographic attributes and diverse generative methods, which hinders the development of fair detectors for AI-generated faces. In this work, we introduce the AI-Face dataset, the first million-scale demographically annotated AI-generated face image dataset, including real faces, faces from deepfake videos, and faces generated by Generative Adversarial Networks and Diffusion Models. Based on this dataset, we conduct the first comprehensive fairness benchmark to assess various AI face detectors and provide valuable insights and findings to promote the future fair design of AI face detectors. Our AI-Face dataset and benchmark code are publicly available at https://github.com/Purdue-M2/AI-Face-FairnessBench.
△ Less
Submitted 4 June, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
DSAM: A Deep Learning Framework for Analyzing Temporal and Spatial Dynamics in Brain Networks
Authors:
Bishal Thapaliya,
Robyn Miller,
Jiayu Chen,
Yu-** Wang,
Esra Akbas,
Ram Sapkota,
Bhaskar Ray,
Pranav Suresh,
Santosh Ghimire,
Vince Calhoun,
**gyu Liu
Abstract:
Resting-state functional magnetic resonance imaging (rs-fMRI) is a noninvasive technique pivotal for understanding human neural mechanisms of intricate cognitive processes. Most rs-fMRI studies compute a single static functional connectivity matrix across brain regions of interest, or dynamic functional connectivity matrices with a sliding window approach. These approaches are at risk of oversimpl…
▽ More
Resting-state functional magnetic resonance imaging (rs-fMRI) is a noninvasive technique pivotal for understanding human neural mechanisms of intricate cognitive processes. Most rs-fMRI studies compute a single static functional connectivity matrix across brain regions of interest, or dynamic functional connectivity matrices with a sliding window approach. These approaches are at risk of oversimplifying brain dynamics and lack proper consideration of the goal at hand. While deep learning has gained substantial popularity for modeling complex relational data, its application to uncovering the spatiotemporal dynamics of the brain is still limited. We propose a novel interpretable deep learning framework that learns goal-specific functional connectivity matrix directly from time series and employs a specialized graph neural network for the final classification. Our model, DSAM, leverages temporal causal convolutional networks to capture the temporal dynamics in both low- and high-level feature representations, a temporal attention unit to identify important time points, a self-attention unit to construct the goal-specific connectivity matrix, and a novel variant of graph neural network to capture the spatial dynamics for downstream classification. To validate our approach, we conducted experiments on the Human Connectome Project dataset with 1075 samples to build and interpret the model for the classification of sex group, and the Adolescent Brain Cognitive Development Dataset with 8520 samples for independent testing. Compared our proposed framework with other state-of-art models, results suggested this novel approach goes beyond the assumption of a fixed connectivity matrix and provides evidence of goal-specific brain connectivity patterns, which opens up the potential to gain deeper insights into how the human brain adapts its functional connectivity specific to the task at hand.
△ Less
Submitted 19 May, 2024;
originally announced May 2024.
-
Heavy Neutrino as Dark Matter in a Neutrinophilic U(1) Model
Authors:
Waleed Abdallah,
Anjan Kumar Barik,
Santosh Kumar Rai,
Tousik Samui
Abstract:
We study the prospect of heavy singlet neutrinos as a dark matter (DM) candidate within a neutrinophilic U(1) model, where the Standard Model (SM) is extended with a U(1) gauge symmetry, and neutrino mass and oscillation parameters are explained through an inverse see-saw mechanism. The lightest of the heavy neutrinos plays the role of the DM while the newly introduced scalars and the extra gauge…
▽ More
We study the prospect of heavy singlet neutrinos as a dark matter (DM) candidate within a neutrinophilic U(1) model, where the Standard Model (SM) is extended with a U(1) gauge symmetry, and neutrino mass and oscillation parameters are explained through an inverse see-saw mechanism. The lightest of the heavy neutrinos plays the role of the DM while the newly introduced scalars and the extra gauge boson Z' act as mediators between the dark sector and the SM sector. We show the range of model parameters where this DM candidate can be accommodated in the Weakly Interacting Massive Particle (WIMP) or Feebly Interacting Massive Particle (FIMP) scenario. The observed DM relic density is achieved via the new gauge boson and singlet scalar portals in the WIMP scenario whereas within the FIMP scenario, these two particles assume a distinct yet pivotal role in generating the observed relic density of dark matter.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks
Authors:
T. Y. S. S Santosh,
Tuan-Quang Vuong,
Matthias Grabmair
Abstract:
This study investigates the challenges posed by the dynamic nature of legal multi-label text classification tasks, where legal concepts evolve over time. Existing models often overlook the temporal dimension in their training process, leading to suboptimal performance of those models over time, as they treat training data as a single homogeneous block. To address this, we introduce ChronosLex, an…
▽ More
This study investigates the challenges posed by the dynamic nature of legal multi-label text classification tasks, where legal concepts evolve over time. Existing models often overlook the temporal dimension in their training process, leading to suboptimal performance of those models over time, as they treat training data as a single homogeneous block. To address this, we introduce ChronosLex, an incremental training paradigm that trains models on chronological splits, preserving the temporal order of the data. However, this incremental approach raises concerns about overfitting to recent data, prompting an assessment of mitigation strategies using continual learning and temporal invariant methods. Our experimental results over six legal multi-label text classification datasets reveal that continual learning methods prove effective in preventing overfitting thereby enhancing temporal generalizability, while temporal invariant methods struggle to capture these dynamics of temporal shifts.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Continual Learning in Medical Imaging from Theory to Practice: A Survey and Practical Analysis
Authors:
Mohammad Areeb Qazi,
Anees Ur Rehman Hashmi,
Santosh Sanjeev,
Ibrahim Almakky,
Numan Saeed,
Mohammad Yaqub
Abstract:
Deep Learning has shown great success in resha** medical imaging, yet it faces numerous challenges hindering widespread application. Issues like catastrophic forgetting and distribution shifts in the continuously evolving data stream increase the gap between research and applications. Continual Learning offers promise in addressing these hurdles by enabling the sequential acquisition of new know…
▽ More
Deep Learning has shown great success in resha** medical imaging, yet it faces numerous challenges hindering widespread application. Issues like catastrophic forgetting and distribution shifts in the continuously evolving data stream increase the gap between research and applications. Continual Learning offers promise in addressing these hurdles by enabling the sequential acquisition of new knowledge without forgetting previous learnings in neural networks. In this survey, we comprehensively review the recent literature on continual learning in the medical domain, highlight recent trends, and point out the practical issues. Specifically, we survey the continual learning studies on classification, segmentation, detection, and other tasks in the medical domain. Furthermore, we develop a taxonomy for the reviewed studies, identify the challenges, and provide insights to overcome them. We also critically discuss the current state of continual learning in medical imaging, including identifying open problems and outlining promising future directions. We hope this survey will provide researchers with a useful overview of the developments in the field and will further increase interest in the community. To keep up with the fast-paced advancements in this field, we plan to routinely update the repository with the latest relevant papers at https://github.com/BioMedIA-MBZUAI/awesome-cl-in-medical .
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Curve of Growth Analysis of SZ Lyn
Authors:
Janaka Adassuriya,
Shashikiran Ganesh,
Peter de Cat,
Santosh Joshi,
Chandana Jayaratne
Abstract:
We present one high-resolution and a time series of 561 low-resolution follow-up spectroscopic observations of SZ Lyn. It is a high-amplitude Delta Scuti-type pulsating star in a binary system. The photometric observations reveal the existence of radial and non-radial oscillation modes in SZ Lyn. In spectroscopy, the variation of equivalent width of the line profiles reflects the temperature varia…
▽ More
We present one high-resolution and a time series of 561 low-resolution follow-up spectroscopic observations of SZ Lyn. It is a high-amplitude Delta Scuti-type pulsating star in a binary system. The photometric observations reveal the existence of radial and non-radial oscillation modes in SZ Lyn. In spectroscopy, the variation of equivalent width of the line profiles reflects the temperature variations. The equivalent widths of the Balmer lines, H-alpha, H-hbeta, and H-gamma were measured over the pulsation cycle of SZ Lyn using time sequence spectra. Hence, the temperature profile of SZ Lyn was derived using the curve of growth analysis. Furthermore, the stellar parameters were determined through the best fit analysis of observed and synthetic high-resolution spectral lines. The best fit determines a model of Teff=6750 K, log(g)=3.5 dex, and vrot=10 km/s for solar abundance.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Testing spatial curvature in an anisotropic extension of $w$CDM model with low redshift data
Authors:
Vikrant Yadav,
Rajpal,
Pardeep,
Manish Yadav,
Santosh Kumar Yadav
Abstract:
In this letter, we report the observational constraints on a Bianchi type I anisotropic extension of $w$CDM model with spatial curvature from observational data including Baryon Acoustic Oscillations (BAO), Cosmic chronometers (CC), Big Bang nucleosynthesis (BBN), Pantheon+ (PP) compilation of SNe Ia and SH0ES Cepheid host distance anchors. The anisotropy is found to be of the order $10^{-13}$, wh…
▽ More
In this letter, we report the observational constraints on a Bianchi type I anisotropic extension of $w$CDM model with spatial curvature from observational data including Baryon Acoustic Oscillations (BAO), Cosmic chronometers (CC), Big Bang nucleosynthesis (BBN), Pantheon+ (PP) compilation of SNe Ia and SH0ES Cepheid host distance anchors. The anisotropy is found to be of the order $10^{-13}$, which interplay with spatial curvature to reduce $H_0$ tension by $\sim 1σ$ as found in the analyses with BAO+CC+BBN+PP combination of data, while no significant effect of anisotropy is observed with BAO+CC+BBN+PPSH0ES combination of data. A closed Universe is favored by $w$CDM as well as anisotropic $w$CDM models with spatial curvature in analyses with BAO+CC+BBN+PP combination of data. An observation of an open Universe from $w$CDM model with spatial curvature in analyses with BAO+CC+BBN+PPSH0ES combination of data and a closed Universe from anisotropic $w$CDM model with curvature in analyses with same combination of data is made. The quintessence form of dark energy is favored at 95\% CL in both analyses.
△ Less
Submitted 19 May, 2024;
originally announced May 2024.
-
DarsakX: A Python Package for Designing and Analyzing Imaging Performance of X-ray Telescopes
Authors:
Neeraj K. Tiwari,
Santosh V. Vadawale,
N. P. S. Mithun,
C. S. Vaishnava,
Bharath Saiguhan
Abstract:
The imaging performance and sensitivity of an X-ray telescope when observing astrophysical sources are primarily governed by the optical design, geometrical uncertainties (figure errors, surface roughness, and mirror alignment inaccuracies), and the reflectivity properties of the X-ray reflecting mirror surface. To thoroughly evaluate the imaging performance of an X-ray telescope with an optical d…
▽ More
The imaging performance and sensitivity of an X-ray telescope when observing astrophysical sources are primarily governed by the optical design, geometrical uncertainties (figure errors, surface roughness, and mirror alignment inaccuracies), and the reflectivity properties of the X-ray reflecting mirror surface. To thoroughly evaluate the imaging performance of an X-ray telescope with an optical design similar to Wolter-1 optics, which comprises multiple shells with known geometrical uncertainties and mirror reflectivity properties, appropriate computational tools are essential. These tools are used to estimate the angular resolution and effective area for various source energies and locations and, more importantly, to assess the impact of figure errors on the telescope's imaging performance. Additionally, they can also be used to optimize optics geometry by modifying it in reference to the Wolter-1 optics, aiming to minimize the optical aberration associated with the Wolter-1 configuration. In this paper, we introduce DarsakX, a Python-based ray tracing computational tool specifically designed to estimate the imaging performance of a multi-shell X-ray telescope. DarsakX has the capability to simulate the impact of figure errors present in the axial direction of a mirror shell. The geometrical shape of the mirror shells can be defined as a combination of figure error with the base optics, such as Wolter-1 or Conical optics. Additionally, DarsakX allows the exploration of new optical designs involving two reflections similar to Wolter-1 optics but with an improved angular resolution for wide-field telescopes. Developed through an analytical approach, DarsakX ensures computational efficiency, enabling fast processing.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Consciousness Driven Spike Timing Dependent Plasticity
Authors:
Sushant Yadav,
Santosh Chaudhary,
Rajesh Kumar
Abstract:
Spiking Neural Networks (SNNs), recognized for their biological plausibility and energy efficiency, employ sparse and asynchronous spikes for communication. However, the training of SNNs encounters difficulties coming from non-differentiable activation functions and the movement of spike-based inter-layer data. Spike-Timing Dependent Plasticity (STDP), inspired by neurobiology, plays a crucial rol…
▽ More
Spiking Neural Networks (SNNs), recognized for their biological plausibility and energy efficiency, employ sparse and asynchronous spikes for communication. However, the training of SNNs encounters difficulties coming from non-differentiable activation functions and the movement of spike-based inter-layer data. Spike-Timing Dependent Plasticity (STDP), inspired by neurobiology, plays a crucial role in SNN's learning, but its still lacks the conscious part of the brain used for learning. Considering the issue, this research work proposes a Consciousness Driven STDP (CD-STDP), an improved solution addressing inherent limitations observed in conventional STDP models. CD-STDP, designed to infuse the conscious part as coefficients of long-term potentiation (LTP) and long-term depression (LTD), exhibit a dynamic nature. The model connects LTP and LTD coefficients to current and past state of synaptic activities, respectively, enhancing consciousness and adaptability. This consciousness empowers the model to effectively learn while understanding the input patterns. The conscious coefficient adjustment in response to current and past synaptic activity extends the model's conscious and other cognitive capabilities, offering a refined and efficient approach for real-world applications. Evaluations on MNIST, FashionMNIST and CALTECH datasets showcase $CD$-STDP's remarkable accuracy of 98.6%, 85.61% and 99.0%, respectively, in a single hidden layer SNN. In addition, analysis of conscious elements and consciousness of the proposed model on SNN is performed.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Authors:
Kailash Gogineni,
Sai Santosh Dayapule,
Juan Gómez-Luna,
Karthikeya Gogineni,
Peng Wei,
Tian Lan,
Mohammad Sadrosadati,
Onur Mutlu,
Guru Venkataramani
Abstract:
Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing…
▽ More
Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing RL algorithms like Tabular Q-learning and SARSA on UPMEM PIM systems and optimizing for hardware. Our experiments on OpenAI GYM environments using UPMEM hardware demonstrate superior performance compared to CPU and GPU implementations.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
QBER: Quantifying Cyber Risks for Strategic Decisions
Authors:
Muriel Figueredo Franco,
Aiatur Rahaman Mullick,
Santosh Jha
Abstract:
Quantifying cyber risks is essential for organizations to grasp their vulnerability to threats and make informed decisions. However, current approaches still need to work on blending economic viewpoints to provide insightful analysis. To bridge this gap, we introduce QBER approach to offer decision-makers measurable risk metrics. The QBER evaluates losses from cyberattacks, performs detailed risk…
▽ More
Quantifying cyber risks is essential for organizations to grasp their vulnerability to threats and make informed decisions. However, current approaches still need to work on blending economic viewpoints to provide insightful analysis. To bridge this gap, we introduce QBER approach to offer decision-makers measurable risk metrics. The QBER evaluates losses from cyberattacks, performs detailed risk analyses based on existing cybersecurity measures, and provides thorough cost assessments. Our contributions involve outlining cyberattack probabilities and risks, identifying Technical, Economic, and Legal (TEL) impacts, creating a model to gauge impacts, suggesting risk mitigation strategies, and examining trends and challenges in implementing widespread Cyber Risk Quantification (CRQ). The QBER approach serves as a guided approach for organizations to assess risks and strategically invest in cybersecurity.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Uncertainty Quantification and Propagation in Atomistic Machine Learning
Authors:
** Dai,
Santosh Adhikari,
Mingjian Wen
Abstract:
Machine learning (ML) offers promising new approaches to tackle complex problems and has been increasingly adopted in chemical and materials sciences. Broadly speaking, ML models employ generic mathematical functions and attempt to learn essential physics and chemistry from a large amount of data. Consequently, because of the lack of physical or chemical principles in the functional form, the reli…
▽ More
Machine learning (ML) offers promising new approaches to tackle complex problems and has been increasingly adopted in chemical and materials sciences. Broadly speaking, ML models employ generic mathematical functions and attempt to learn essential physics and chemistry from a large amount of data. Consequently, because of the lack of physical or chemical principles in the functional form, the reliability of the predictions is oftentimes not guaranteed, particularly for data far out of distribution. It is critical to quantify the uncertainty in model predictions and understand how the uncertainty propagates to downstream chemical and materials applications. Herein, we review and categorize existing uncertainty quantification (UQ) methods for atomistic ML under a united framework of probabilistic modeling with the aim of elucidating the similarities and differences between them. We also discuss performance metrics to evaluate the calibration, precision, accuracy, and efficiency of the UQ methods and techniques for model recalibration. In addition, we discuss uncertainty propagation (UP) in widely used simulation techniques in chemical and materials science, such as molecular dynamics and microkinetic modeling. We also provide remarks on the challenges and future opportunities of UQ and UP in atomistic ML.
△ Less
Submitted 8 May, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies
Authors:
Yunbum Kook,
Santosh S. Vempala,
Matthew S. Zhang
Abstract:
We present a new random walk for uniformly sampling high-dimensional convex bodies. It achieves state-of-the-art runtime complexity with stronger guarantees on the output than previously known, namely in Rényi divergence (which implies TV, $\mathcal{W}_2$, KL, $χ^2$). The proof departs from known approaches for polytime algorithms for the problem -- we utilize a stochastic diffusion perspective to…
▽ More
We present a new random walk for uniformly sampling high-dimensional convex bodies. It achieves state-of-the-art runtime complexity with stronger guarantees on the output than previously known, namely in Rényi divergence (which implies TV, $\mathcal{W}_2$, KL, $χ^2$). The proof departs from known approaches for polytime algorithms for the problem -- we utilize a stochastic diffusion perspective to show contraction to the target distribution with the rate of convergence determined by functional isoperimetric constants of the stationary density.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results
Authors:
Yuekun Dai,
Dafeng Zhang,
Xiaoming Li,
Zongsheng Yue,
Chongyi Li,
Shangchen Zhou,
Ruicheng Feng,
Peiqing Yang,
Zhezhu **,
Guanqun Liu,
Chen Change Loy,
Lize Zhang,
Shuai Liu,
Chaoyu Feng,
Luyang Wang,
Shuan Chen,
Guangqi Shao,
Xiaotao Wang,
Lei Lei,
Qirui Yang,
Qihua Cheng,
Zhiqiang Xu,
Yihao Liu,
Huan**g Yue,
**gyu Yang
, et al. (38 additional authors not shown)
Abstract:
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…
▽ More
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging (MIPI). Building on the achievements of the previous MIPI Workshops held at ECCV 2022 and CVPR 2023, we introduce our third MIPI challenge including three tracks focusing on novel image sensors and imaging algorithms. In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2024. In total, 170 participants were successfully registered, and 14 teams submitted results in the final testing phase. The developed solutions in this challenge achieved state-of-the-art performance on Nighttime Flare Removal. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2024/.
△ Less
Submitted 27 May, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
Authors:
Mihir Parmar,
Nisarg Patel,
Neeraj Varshney,
Mutsumi Nakamura,
Man Luo,
Santosh Mashetty,
Arindam Mitra,
Chitta Baral
Abstract:
Recently developed large language models (LLMs) have been shown to perform remarkably well on a wide range of language understanding tasks. But, can they really "reason" over the natural language? This question has been receiving significant research attention and many reasoning skills such as commonsense, numerical, and qualitative have been studied. However, the crucial skill pertaining to 'logi…
▽ More
Recently developed large language models (LLMs) have been shown to perform remarkably well on a wide range of language understanding tasks. But, can they really "reason" over the natural language? This question has been receiving significant research attention and many reasoning skills such as commonsense, numerical, and qualitative have been studied. However, the crucial skill pertaining to 'logical reasoning' has remained underexplored. Existing work investigating this reasoning ability of LLMs has focused only on a couple of inference rules (such as modus ponens and modus tollens) of propositional and first-order logic. Addressing the above limitation, we comprehensively evaluate the logical reasoning ability of LLMs on 25 different reasoning patterns spanning over propositional, first-order, and non-monotonic logics. To enable systematic evaluation, we introduce LogicBench, a natural language question-answering dataset focusing on the use of a single inference rule. We conduct detailed analysis with a range of LLMs such as GPT-4, ChatGPT, Gemini, Llama-2, and Mistral using chain-of-thought prompting. Experimental results show that existing LLMs do not fare well on LogicBench; especially, they struggle with instances involving complex reasoning and negations. Furthermore, they sometimes overlook contextual information necessary for reasoning to arrive at the correct conclusion. We believe that our work and findings facilitate future research for evaluating and enhancing the logical reasoning ability of LLMs. Data and code are available at https://github.com/Mihir3009/LogicBench.
△ Less
Submitted 6 June, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
Evolution of Magnetism in Magnetic Topological Semimetal NdSb$_x$Te$_{2-x+δ}$
Authors:
Santosh Karki Chhetri,
Rabindra Basnet,
Jian Wang,
Krishna Pandey,
Gokul Acharya,
Md Rafique Un Nabi,
Dinesh Upreti,
Josh Sakon,
Mansour Mortazavi,
** Hu
Abstract:
Magnetic topological semimetals LnSbTe (Ln = Lanthanide) have attracted intensive attention because of the presence of interplay between magnetism, topological, and electron correlations depending on the choices of magnetic Ln elements. Recently, varying Sb-Te composition has been found to effectively control the electronic and magnetic states in LnSbxTe$_{2-x}$. With this motivation, we report th…
▽ More
Magnetic topological semimetals LnSbTe (Ln = Lanthanide) have attracted intensive attention because of the presence of interplay between magnetism, topological, and electron correlations depending on the choices of magnetic Ln elements. Recently, varying Sb-Te composition has been found to effectively control the electronic and magnetic states in LnSbxTe$_{2-x}$. With this motivation, we report the evolution of magnetic properties with Sb-Te substitution in NdSb$_x$Te$_{2-x+δ}$. Our work reveals the interesting non-monotonic change in magnetic ordering temperature with varying composition stoichiometry. In addition, reducing the Sb content x drives the reorientation of moments from in-plane (ab-plane) to out-of-plane (c-axis) direction that results in the distinct magnetic structures for two end compounds NdTe$_2$ ($x = 0$) and NdSbTe ($x = 1$). Furthermore, the moment orientation in NdSb$_x$Te$_{2-x+δ}$ is also found to be strongly tunable upon application of weak magnetic field, leading to rich magnetic phases depending on the composition stoichiometry, temperature, and magnetic field. Such strong tuning of magnetism in this material establishes it as a promising platform for investigating tunable topological states and correlated topological physics.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
DynaMMo: Dynamic Model Merging for Efficient Class Incremental Learning for Medical Images
Authors:
Mohammad Areeb Qazi,
Ibrahim Almakky,
Anees Ur Rehman Hashmi,
Santosh Sanjeev,
Mohammad Yaqub
Abstract:
Continual learning, the ability to acquire knowledge from new data while retaining previously learned information, is a fundamental challenge in machine learning. Various approaches, including memory replay, knowledge distillation, model regularization, and dynamic network expansion, have been proposed to address this issue. Thus far, dynamic network expansion methods have achieved state-of-the-ar…
▽ More
Continual learning, the ability to acquire knowledge from new data while retaining previously learned information, is a fundamental challenge in machine learning. Various approaches, including memory replay, knowledge distillation, model regularization, and dynamic network expansion, have been proposed to address this issue. Thus far, dynamic network expansion methods have achieved state-of-the-art performance at the cost of incurring significant computational overhead. This is due to the need for additional model buffers, which makes it less feasible in resource-constrained settings, particularly in the medical domain. To overcome this challenge, we propose Dynamic Model Merging, DynaMMo, a method that merges multiple networks at different stages of model training to achieve better computational efficiency. Specifically, we employ lightweight learnable modules for each task and combine them into a unified model to minimize computational overhead. DynaMMo achieves this without compromising performance, offering a cost-effective solution for continual learning in medical applications. We evaluate DynaMMo on three publicly available datasets, demonstrating its effectiveness compared to existing approaches. DynaMMo offers around 10-fold reduction in GFLOPS with a small drop of 2.76 in average accuracy when compared to state-of-the-art dynamic-based approaches. The code implementation of this work will be available upon the acceptance of this work at https://github.com/BioMedIA-MBZUAI/DynaMMo.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Insights on the Optical and Infrared Nature of MAXI J0709-159: Implications for High-Mass X-ray Binaries
Authors:
Suman Bhattacharyya,
Blesson Mathew,
Gourav Banerjee,
Sindhu G,
S. Muneer,
S. Pramod Kumar,
Santosh Joshi
Abstract:
In our previous study (Bhattacharyya et al., 2022), HD~54786, the optical counterpart of the MAXI J0709-159 system, was identified to be an evolved star, departing from the main sequence, based on comparisons with non-X-ray binary systems. In this paper, using color-magnitude diagram (CMD) analysis for High-Mass X-ray Binaries (HMXBs) and statistical t-tests, we found evidence supporting HD 54786'…
▽ More
In our previous study (Bhattacharyya et al., 2022), HD~54786, the optical counterpart of the MAXI J0709-159 system, was identified to be an evolved star, departing from the main sequence, based on comparisons with non-X-ray binary systems. In this paper, using color-magnitude diagram (CMD) analysis for High-Mass X-ray Binaries (HMXBs) and statistical t-tests, we found evidence supporting HD 54786's potential membership in both Be/X-ray binaries (BeXRBs) and supergaint X-ray binaries (SgXBs) populations of HMXBs. Hence, our study points towards dual optical characteristics of HD~54786, as an X-ray binary star and also belonging to a distinct evolutionary phase from BeXRB towards SgXB. Our further analysis suggests that MAXI J0709-159, associated with HD 54786, exhibits low-level activity during the current epoch and possesses a limited amount of circumstellar material. Although similarities with the previously studied BeXRB system LSI +61$^{\circ}$ 235 (Coe et al., 1994) are noted, continued monitoring and data collection are essential to fully comprehend the complexities of MAXI J0709-159 and its evolutionary trajectory within the realm of HMXBs.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
Influence of strain and point defects on the electronic structure and related properties of (111)NiO epitaxial films
Authors:
Bhabani Prasad Sahu,
Poonam Sharma,
Santosh Kumar Yadav,
Alok Shukla,
Subhabrata Dhar
Abstract:
(111)NiO epitaxial films are grown on c-sapphire substrates at various growth temperatures ranging from room-temperature to 600C using pulsed laser deposition (PLD) technique. Two series of samples, where different laser fluences are used to ablate the target, are studied here. Films grown with higher laser fluence, are found to be embedded with Ni-clusters crystallographically aligned with the (1…
▽ More
(111)NiO epitaxial films are grown on c-sapphire substrates at various growth temperatures ranging from room-temperature to 600C using pulsed laser deposition (PLD) technique. Two series of samples, where different laser fluences are used to ablate the target, are studied here. Films grown with higher laser fluence, are found to be embedded with Ni-clusters crystallographically aligned with the (111)NiO matrix. While the layers grown with lower laser energy density exhibit p-type conductivity specially at low growth temperatures. X-ray diffraction study shows the coexistence of biaxial compressive and tensile hydrostatic strains in these samples, which results in an expansion of the lattice primarily along the growth direction. This effective uniaxial expansion {epsilon}_perpendicular increases with the reduction of the growth temperature. Band gap of these samples is found to decrease linearly with {epsilon}_perpendicular. This result is validated by density functional theory (DFT) calculations. Experimental findings and the theoretical study further indicate that V_Ni + O_I and V_O + Ni_I complexes exist as the dominant native defects in samples grown with Ni-deficient (low laser fluence) and Ni-rich (high laser fluence) conditions, respectively. P-type conductivity observed in the samples grown in Ni-deficient condition is more likely to be resulting from V_Ni + O_I defects than Ni-vacancies (V_Ni).
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images
Authors:
Santosh,
Li Lin,
Irene Amerini,
Xin Wang,
Shu Hu
Abstract:
Diffusion models (DMs) have revolutionized image generation, producing high-quality images with applications spanning various fields. However, their ability to create hyper-realistic images poses significant challenges in distinguishing between real and synthetic content, raising concerns about digital authenticity and potential misuse in creating deepfakes. This work introduces a robust detection…
▽ More
Diffusion models (DMs) have revolutionized image generation, producing high-quality images with applications spanning various fields. However, their ability to create hyper-realistic images poses significant challenges in distinguishing between real and synthetic content, raising concerns about digital authenticity and potential misuse in creating deepfakes. This work introduces a robust detection framework that integrates image and text features extracted by CLIP model with a Multilayer Perceptron (MLP) classifier. We propose a novel loss that can improve the detector's robustness and handle imbalanced datasets. Additionally, we flatten the loss landscape during the model training to improve the detector's generalization capabilities. The effectiveness of our method, which outperforms traditional detection techniques, is demonstrated through extensive experiments, underscoring its potential to set a new state-of-the-art approach in DM-generated image detection. The code is available at https://github.com/Purdue-M2/Robust_DM_Generated_Image_Detection.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Tao: Re-Thinking DL-based Microarchitecture Simulation
Authors:
Santosh Pandey,
Amir Yazdanbakhsh,
Hang Liu
Abstract:
Microarchitecture simulators are indispensable tools for microarchitecture designers to validate, estimate, and optimize new hardware that meets specific design requirements. While the quest for a fast, accurate and detailed microarchitecture simulation has been ongoing for decades, existing simulators excel and fall short at different aspects: (i) Although execution-driven simulation is accurate…
▽ More
Microarchitecture simulators are indispensable tools for microarchitecture designers to validate, estimate, and optimize new hardware that meets specific design requirements. While the quest for a fast, accurate and detailed microarchitecture simulation has been ongoing for decades, existing simulators excel and fall short at different aspects: (i) Although execution-driven simulation is accurate and detailed, it is extremely slow and requires expert-level experience to design. (ii) Trace-driven simulation reuses the execution traces in pursuit of fast simulation but faces accuracy concerns and fails to achieve significant speedup. (iii) Emerging deep learning (DL)-based simulations are remarkably fast and have acceptable accuracy but fail to provide adequate low-level microarchitectural performance metrics crucial for microarchitectural bottleneck analysis. Additionally, they introduce substantial overheads from trace regeneration and model re-training when simulating a new microarchitecture.
Re-thinking the advantages and limitations of the aforementioned simulation paradigms, this paper introduces TAO that redesigns the DL-based simulation with three primary contributions: First, we propose a new training dataset design such that the subsequent simulation only needs functional trace as inputs, which can be rapidly generated and reused across microarchitectures. Second, we redesign the input features and the DL model using self-attention to support predicting various performance metrics. Third, we propose techniques to train a microarchitecture agnostic embedding layer that enables fast transfer learning between different microarchitectural configurations and reduces the re-training overhead of conventional DL-based simulators. Our extensive evaluation shows TAO can reduce the overall training and simulation time by 18.06x over the state-of-the-art DL-based endeavors.
△ Less
Submitted 29 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Study of the Balmer decrements for Galactic classical Be stars using the Himalayan Chandra Telescope of India
Authors:
Gourav Banerjee,
Blesson Mathew,
Suman Bhattacharyya,
Ashish Devaraj,
Sreeja S Kartha,
Santosh Joshi
Abstract:
In a recent study, Banerjee et al. (2021) produced an atlas of all major emission lines found in a large sample of 115 Galactic field Be stars using the 2-m Himalayan Chandra Telescope (HCT) facility located at Ladakh, India. This paper presents our further exploration of these stars to estimate the electron density in their discs. Our study using Balmer decrement values indicate that their discs…
▽ More
In a recent study, Banerjee et al. (2021) produced an atlas of all major emission lines found in a large sample of 115 Galactic field Be stars using the 2-m Himalayan Chandra Telescope (HCT) facility located at Ladakh, India. This paper presents our further exploration of these stars to estimate the electron density in their discs. Our study using Balmer decrement values indicate that their discs are generally optically thick in nature with electron density (n_e) in their circumstellar envelopes (CEs) being in excess of 10^13 cm^-3 for around 65% of the stars. For another 19% stars, the average n_e in their discs probably range between 10^12 cm^-3 and 10^13 cm^-3. We noticed that the nature of the Hα and H\b{eta} line profiles might not influence the observed Balmer decrement values (i.e. D_34 and D_54) of the sample of stars. Interestingly, we also found that around 50% of the Be stars displaying D_34 greater than 2.7 are of earlier spectral types, i.e. within B0 -B3.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Learning to rank quantum circuits for hardware-optimized performance enhancement
Authors:
Gavin S. Hartnett,
Aaron Barbosa,
Pranav S. Mundada,
Michael Hush,
Michael J. Biercuk,
Yuval Baum
Abstract:
We introduce and experimentally test a machine-learning-based method for ranking logically equivalent quantum circuits based on expected performance estimates derived from a training procedure conducted on real hardware. We apply our method to the problem of layout selection, in which abstracted qubits are assigned to physical qubits on a given device. Circuit measurements performed on IBM hardwar…
▽ More
We introduce and experimentally test a machine-learning-based method for ranking logically equivalent quantum circuits based on expected performance estimates derived from a training procedure conducted on real hardware. We apply our method to the problem of layout selection, in which abstracted qubits are assigned to physical qubits on a given device. Circuit measurements performed on IBM hardware indicate that the maximum and median fidelities of logically equivalent layouts can differ by an order of magnitude. We introduce a circuit score used for ranking that is parameterized in terms of a physics-based, phenomenological error model whose parameters are fit by training a ranking-loss function over a measured dataset. The dataset consists of quantum circuits exhibiting a diversity of structures and executed on IBM hardware, allowing the model to incorporate the contextual nature of real device noise and errors without the need to perform an exponentially costly tomographic protocol. We perform model training and execution on the 16-qubit ibmq_guadalupe device and compare our method to two common approaches: random layout selection and a publicly available baseline called Mapomatic. Our model consistently outperforms both approaches, predicting layouts that exhibit lower noise and higher performance. In particular, we find that our best model leads to a $1.8\times$ reduction in selection error when compared to the baseline approach and a $3.2\times$ reduction when compared to random selection. Beyond delivering a new form of predictive quantum characterization, verification, and validation, our results reveal the specific way in which context-dependent and coherent gate errors appear to dominate the divergence from performance estimates extrapolated from simple proxy measures.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
$c {\bar c}$ and $b {\bar b}$ suppression in Glasma
Authors:
Pooja,
Mohammad Yousuf Jamal,
Partha Pratim Bhaduri,
Marco Ruggieri,
Santosh K. Das
Abstract:
This study investigates the evolution and dissociation dynamics of $c\bar{c}$ and $b\bar{b}$ pairs within the Glasma medium, resulting from the collision of ultra-relativistic heavy ions. An attractive potential is used to create the pairs, but the strong Glasma fields dominate over it, causing an increase in pair separation and subsequent dissociation. The observed finite probability of dissociat…
▽ More
This study investigates the evolution and dissociation dynamics of $c\bar{c}$ and $b\bar{b}$ pairs within the Glasma medium, resulting from the collision of ultra-relativistic heavy ions. An attractive potential is used to create the pairs, but the strong Glasma fields dominate over it, causing an increase in pair separation and subsequent dissociation. The observed finite probability of dissociation for these states reveals the intricate interplay between QCD dynamics and the suppression of $c\bar{c}$ and $b\bar{b}$ states during the Glasma phase. The research highlights differences between $c\bar{c}$ and $b\bar{b}$ pairs, revealing the role of quark flavor in the dissociation process. Dissociation spectra analysis indicates a peak shift towards higher momentum, reflecting a slight energy gain by the pairs. This investigation provides valuable insights into the complex dynamics of $c\bar{c}$ and $b\bar{b}$ pairs in the Glasma, which may help in better interpretation of experimental results on further integration with subsequent phases of the created matter.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Field-induced spin polarization in lightly Cr-substituted layered antiferromagnet NiPS3
Authors:
Rabindra Basnet,
Dinesh Upreti,
Taksh Patel,
Santosh Karki Chhetri,
Gokul Acharya,
Md Rafique Un Nabi,
Manish Mani Sharma,
Josh Sakon,
Mansour Mortazavi,
** Hu
Abstract:
Tuning magnetic properties in layered magnets is an important route to realize novel phenomenon related to two-dimensional (2D) magnetism. Recently, tuning antiferromagnetic (AFM) properties through substitution and intercalation techniques have been widely studied in MPX3 compounds. Interesting phenomena, such as diverse AFM structures and even the signatures of ferrimagnetism, have been reported…
▽ More
Tuning magnetic properties in layered magnets is an important route to realize novel phenomenon related to two-dimensional (2D) magnetism. Recently, tuning antiferromagnetic (AFM) properties through substitution and intercalation techniques have been widely studied in MPX3 compounds. Interesting phenomena, such as diverse AFM structures and even the signatures of ferrimagnetism, have been reported. However, long-range ferromagnetic (FM) ordering has remained elusive. In this work, we explored the magnetic properties of the previously unreported Cr-substituted NiPS3. We found that Cr substitution is extremely efficient in controlling spin orientation in NiPS3. Our study reveals a field-induced spin polarization in lightly (9%) Cr-substituted NiPS3, which is likely attributed to the attenuation of AFM interactions and magnetic anisotropy due to Cr do**. Our work provides a possible strategy to achieve FM phase in AFM MPX3, which could be useful for investigating 2D magnetism as well as potential device applications.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents
Authors:
T. Y. S. S Santosh,
Hassan Sarwat,
Ahmed Abdou,
Matthias Grabmair
Abstract:
Rhetorical Role Labeling (RRL) of legal judgments is essential for various tasks, such as case summarization, semantic search and argument mining. However, it presents challenges such as inferring sentence roles from context, interrelated roles, limited annotated data, and label imbalance. This study introduces novel techniques to enhance RRL performance by leveraging knowledge from semantically s…
▽ More
Rhetorical Role Labeling (RRL) of legal judgments is essential for various tasks, such as case summarization, semantic search and argument mining. However, it presents challenges such as inferring sentence roles from context, interrelated roles, limited annotated data, and label imbalance. This study introduces novel techniques to enhance RRL performance by leveraging knowledge from semantically similar instances (neighbours). We explore inference-based and training-based approaches, achieving remarkable improvements in challenging macro-F1 scores. For inference-based methods, we explore interpolation techniques that bolster label predictions without re-training. While in training-based methods, we integrate prototypical learning with our novel discourse-aware contrastive method that work directly on embedding spaces. Additionally, we assess the cross-domain applicability of our methods, demonstrating their effectiveness in transferring knowledge across diverse legal domains.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
On cumulative and relative cumulative past information generating function
Authors:
Santosh Kumar Chaudhary,
Nitin Gupta,
Achintya Roy
Abstract:
In this paper, we introduce the cumulative past information generating function (CPIG) and relative cumulative past information generating function (RCPIG). We study its properties. We establish its relation with generalized cumulative past entropy (GCPE). We defined CPIG stochastic order and its relation with dispersive order. We provide the results for the CPIG measure of the convoluted random v…
▽ More
In this paper, we introduce the cumulative past information generating function (CPIG) and relative cumulative past information generating function (RCPIG). We study its properties. We establish its relation with generalized cumulative past entropy (GCPE). We defined CPIG stochastic order and its relation with dispersive order. We provide the results for the CPIG measure of the convoluted random variables in terms of the measures of its components. We found some inequality relating to Shannon entropy, CPIG and GCPE. Some characterization and estimation results are also discussed regarding CPIG. We defined divergence measures between two random variables, Jensen-cumulative past information generating function(JCPIG), Jensen fractional cumulative past entropy measure, cumulative past Taneja entropy, and Jensen cumulative past Taneja entropy information measure.
△ Less
Submitted 22 April, 2024; v1 submitted 31 March, 2024;
originally announced April 2024.
-
ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights
Authors:
T. Y. S. S Santosh,
Rashid Gustav Haddad,
Matthias Grabmair
Abstract:
In common law jurisdictions, legal practitioners rely on precedents to construct arguments, in line with the doctrine of \emph{stare decisis}. As the number of cases grow over the years, prior case retrieval (PCR) has garnered significant attention. Besides lacking real-world scale, existing PCR datasets do not simulate a realistic setting, because their queries use complete case documents while o…
▽ More
In common law jurisdictions, legal practitioners rely on precedents to construct arguments, in line with the doctrine of \emph{stare decisis}. As the number of cases grow over the years, prior case retrieval (PCR) has garnered significant attention. Besides lacking real-world scale, existing PCR datasets do not simulate a realistic setting, because their queries use complete case documents while only masking references to prior cases. The query is thereby exposed to legal reasoning not yet available when constructing an argument for an undecided case as well as spurious patterns left behind by citation masks, potentially short-circuiting a comprehensive understanding of case facts and legal principles. To address these limitations, we introduce a PCR dataset based on judgements from the European Court of Human Rights (ECtHR), which explicitly separate facts from arguments and exhibit precedential practices, aiding us to develop this PCR dataset to foster systems' comprehensive understanding. We benchmark different lexical and dense retrieval approaches with various negative sampling strategies, adapting them to deal with long text sequences using hierarchical variants. We found that difficulty-based negative sampling strategies were not effective for the PCR task, highlighting the need for investigation into domain-specific difficulty criteria. Furthermore, we observe performance of the dense models degrade with time and calls for further research into temporal adaptation of retrieval models. Additionally, we assess the influence of different views , Halsbury's and Goodhart's, in practice in ECtHR jurisdiction using PCR task.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Query-driven Relevant Paragraph Extraction from Legal Judgments
Authors:
T. Y. S. S Santosh,
Elvin Quero Hernandez,
Matthias Grabmair
Abstract:
Legal professionals often grapple with navigating lengthy legal judgements to pinpoint information that directly address their queries. This paper focus on this task of extracting relevant paragraphs from legal judgements based on the query. We construct a specialized dataset for this task from the European Court of Human Rights (ECtHR) using the case law guides. We assess the performance of curre…
▽ More
Legal professionals often grapple with navigating lengthy legal judgements to pinpoint information that directly address their queries. This paper focus on this task of extracting relevant paragraphs from legal judgements based on the query. We construct a specialized dataset for this task from the European Court of Human Rights (ECtHR) using the case law guides. We assess the performance of current retrieval models in a zero-shot way and also establish fine-tuning benchmarks using various models. The results highlight the significant gap between fine-tuned and zero-shot performance, emphasizing the challenge of handling distribution shift in the legal domain. We notice that the legal pre-training handles distribution shift on the corpus side but still struggles on query side distribution shift, with unseen legal queries. We also explore various Parameter Efficient Fine-Tuning (PEFT) methods to evaluate their practicality within the context of information retrieval, shedding light on the effectiveness of different PEFT methods across diverse configurations with pre-training and model architectures influencing the choice of PEFT method.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
LexAbSumm: Aspect-based Summarization of Legal Decisions
Authors:
T. Y. S. S Santosh,
Mahmoud Aly,
Matthias Grabmair
Abstract:
Legal professionals frequently encounter long legal judgments that hold critical insights for their work. While recent advances have led to automated summarization solutions for legal documents, they typically provide generic summaries, which may not meet the diverse information needs of users. To address this gap, we introduce LexAbSumm, a novel dataset designed for aspect-based summarization of…
▽ More
Legal professionals frequently encounter long legal judgments that hold critical insights for their work. While recent advances have led to automated summarization solutions for legal documents, they typically provide generic summaries, which may not meet the diverse information needs of users. To address this gap, we introduce LexAbSumm, a novel dataset designed for aspect-based summarization of legal case decisions, sourced from the European Court of Human Rights jurisdiction. We evaluate several abstractive summarization models tailored for longer documents on LexAbSumm, revealing a challenge in conditioning these models to produce aspect-specific summaries. We release LexAbSum to facilitate research in aspect-based summarization for legal domain.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
CuSINeS: Curriculum-driven Structure Induced Negative Sampling for Statutory Article Retrieval
Authors:
T. Y. S. S Santosh,
Kristina Kaiser,
Matthias Grabmair
Abstract:
In this paper, we introduce CuSINeS, a negative sampling approach to enhance the performance of Statutory Article Retrieval (SAR). CuSINeS offers three key contributions. Firstly, it employs a curriculum-based negative sampling strategy guiding the model to focus on easier negatives initially and progressively tackle more difficult ones. Secondly, it leverages the hierarchical and sequential infor…
▽ More
In this paper, we introduce CuSINeS, a negative sampling approach to enhance the performance of Statutory Article Retrieval (SAR). CuSINeS offers three key contributions. Firstly, it employs a curriculum-based negative sampling strategy guiding the model to focus on easier negatives initially and progressively tackle more difficult ones. Secondly, it leverages the hierarchical and sequential information derived from the structural organization of statutes to evaluate the difficulty of samples. Lastly, it introduces a dynamic semantic difficulty assessment using the being-trained model itself, surpassing conventional static methods like BM25, adapting the negatives to the model's evolving competence. Experimental results on a real-world expert-annotated SAR dataset validate the effectiveness of CuSINeS across four different baselines, demonstrating its versatility.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.