-
Operator algebra, quantum entanglement, and emergent geometry from matrix degrees of freedom
Authors:
Vaibhav Gautam,
Masanori Hanada,
Antal Jevicki
Abstract:
For matrix model and QFT, we discuss how dual gravitational geometry emerges from matrix degrees of freedom (specifically, adjoint scalars in super Yang-Mills theory) and how operator algebra that describes an arbitrary region of the bulk geometry can be constructed. We pay attention to the subtle difference between the notions of wave packets that describe low-energy excitations: QFT wave packet…
▽ More
For matrix model and QFT, we discuss how dual gravitational geometry emerges from matrix degrees of freedom (specifically, adjoint scalars in super Yang-Mills theory) and how operator algebra that describes an arbitrary region of the bulk geometry can be constructed. We pay attention to the subtle difference between the notions of wave packets that describe low-energy excitations: QFT wave packet associated with the spatial dimensions of QFT, matrix wave packet associated with the emergent dimensions from matrix degrees of freedom, and bulk wave packet which is a combination of QFT and matrix wave packets. In QFT, there is an intriguing interplay between QFT wave packet and matrix wave packet that connects quantum entanglement and emergent geometry. We propose that the bulk wave packet is the physical object in QFT that describes the emergent geometry from entanglement. This proposal sets a unified view on two seemingly different mechanisms of holographic emergent geometry: one based on matrix eigenvalues and the other based on quantum entanglement. Further intuition comes from the similarity to a traversable wormhole discussed as the dual description of the coupled SYK model by Maldacena and Qi: the bulk can be seen as an eternal traversable wormhole connecting boundary regions.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP
Authors:
Marius Mosbach,
Vagrant Gautam,
Tomás Vergara-Browne,
Dietrich Klakow,
Mor Geva
Abstract:
Interpretability and analysis (IA) research is a growing subfield within NLP with the goal of develo** a deeper understanding of the behavior or inner workings of NLP systems and methods. Despite growing interest in the subfield, a commonly voiced criticism is that it lacks actionable insights and therefore has little impact on NLP. In this paper, we seek to quantify the impact of IA research on…
▽ More
Interpretability and analysis (IA) research is a growing subfield within NLP with the goal of develo** a deeper understanding of the behavior or inner workings of NLP systems and methods. Despite growing interest in the subfield, a commonly voiced criticism is that it lacks actionable insights and therefore has little impact on NLP. In this paper, we seek to quantify the impact of IA research on the broader field of NLP. We approach this with a mixed-methods analysis of: (1) a citation graph of 185K+ papers built from all papers published at ACL and EMNLP conferences from 2018 to 2023, and (2) a survey of 138 members of the NLP community. Our quantitative results show that IA work is well-cited outside of IA, and central in the NLP citation graph. Through qualitative analysis of survey responses and manual annotation of 556 papers, we find that NLP researchers build on findings from IA work and perceive it is important for progress in NLP, multiple subfields, and rely on its findings and terminology for their own work. Many novel methods are proposed based on IA findings and highly influenced by them, but highly influential non-IA work cites IA findings without being driven by them. We end by summarizing what is missing in IA work today and provide a call to action, to pave the way for a more impactful future of IA research.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Understanding "Democratization" in NLP and ML Research
Authors:
Arjun Subramonian,
Vagrant Gautam,
Dietrich Klakow,
Zeerak Talat
Abstract:
Recent improvements in natural language processing (NLP) and machine learning (ML) and increased mainstream adoption have led to researchers frequently discussing the "democratization" of artificial intelligence. In this paper, we seek to clarify how democratization is understood in NLP and ML publications, through large-scale mixed-methods analyses of papers using the keyword "democra*" published…
▽ More
Recent improvements in natural language processing (NLP) and machine learning (ML) and increased mainstream adoption have led to researchers frequently discussing the "democratization" of artificial intelligence. In this paper, we seek to clarify how democratization is understood in NLP and ML publications, through large-scale mixed-methods analyses of papers using the keyword "democra*" published in NLP and adjacent venues. We find that democratization is most frequently used to convey (ease of) access to or use of technologies, without meaningfully engaging with theories of democratization, while research using other invocations of "democra*" tends to be grounded in theories of deliberation and debate. Based on our findings, we call for researchers to enrich their use of the term democratization with appropriate theory, towards democratic technologies beyond superficial access.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP
Authors:
Vagrant Gautam,
Arjun Subramonian,
Anne Lauscher,
Os Keyes
Abstract:
Personal names simultaneously differentiate individuals and categorize them in ways that are important in a given society. While the natural language processing community has thus associated personal names with sociodemographic characteristics in a variety of tasks, researchers have engaged to varying degrees with the established methodological problems in doing so. To guide future work, we presen…
▽ More
Personal names simultaneously differentiate individuals and categorize them in ways that are important in a given society. While the natural language processing community has thus associated personal names with sociodemographic characteristics in a variety of tasks, researchers have engaged to varying degrees with the established methodological problems in doing so. To guide future work, we present an interdisciplinary background on names and naming. We then survey the issues inherent to associating names with sociodemographic attributes, covering problems of validity (e.g., systematic error, construct validity), as well as ethical concerns (e.g., harms, differential impact, cultural insensitivity). Finally, we provide guiding questions along with normative recommendations to avoid validity and ethical pitfalls when dealing with names and sociodemographic characteristics in natural language processing.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Wilson Loops and Random Matrices
Authors:
Georg Bergner,
Vaibhav Gautam,
Masanori Hanada,
Jack Holden
Abstract:
Linear confinement with Casimir scaling of the string tension in confining gauge theories is a consequence of a certain property of the Polyakov loop related to random matrices. This mechanism does not depend on the details of the theories (neither the gauge group nor dimensions) and explains approximate Casimir scaling below string-breaking length. In this paper, we study 3d SU(2) pure Yang-Mills…
▽ More
Linear confinement with Casimir scaling of the string tension in confining gauge theories is a consequence of a certain property of the Polyakov loop related to random matrices. This mechanism does not depend on the details of the theories (neither the gauge group nor dimensions) and explains approximate Casimir scaling below string-breaking length. In this paper, we study 3d SU(2) pure Yang-Mills theory numerically and find the same random-matrix behavior for rectangular Wilson loops. We conjecture that this is a universal feature of strongly coupled confining gauge theories.
△ Less
Submitted 20 June, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?
Authors:
Vagrant Gautam,
Eileen Bingert,
Dawei Zhu,
Anne Lauscher,
Dietrich Klakow
Abstract:
Robust, faithful and harm-free pronoun use for individuals is an important goal for language models as their use increases, but prior work tends to study only one or two of these characteristics at a time. To measure progress towards the combined goal, we introduce the task of pronoun fidelity: given a context introducing a co-referring entity and pronoun, the task is to reuse the correct pronoun…
▽ More
Robust, faithful and harm-free pronoun use for individuals is an important goal for language models as their use increases, but prior work tends to study only one or two of these characteristics at a time. To measure progress towards the combined goal, we introduce the task of pronoun fidelity: given a context introducing a co-referring entity and pronoun, the task is to reuse the correct pronoun later. We present RUFF, a carefully-designed dataset of over 5 million instances to measure robust pronoun fidelity in English, and we evaluate 37 popular large language models across architectures (encoder-only, decoder-only and encoder-decoder) and scales (11M-70B parameters). When an individual is introduced with a pronoun, models can mostly faithfully reuse this pronoun in the next sentence, but they are significantly worse with she/her/her, singular they and neopronouns. Moreover, models are easily distracted by non-adversarial sentences discussing other people; even one additional sentence with a distractor pronoun causes accuracy to drop on average by 34%. Our results show that pronoun fidelity is neither robust, nor due to reasoning, in a simple, naturalistic setting where humans achieve nearly 100% accuracy. We encourage researchers to bridge the gaps we find and to carefully evaluate reasoning in settings where superficial repetition might inflate perceptions of model performance.
△ Less
Submitted 1 May, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
Boundary scattering in massless $AdS_3$
Authors:
Daniele Bielli,
Vaibhav Gautam,
Vasileios Moustakis,
Andrea Prinsloo,
Alessandro Torrielli
Abstract:
We study the boundary integrability problem of the massless sector of $AdS_3 \times S^3 \times T^4 $ string theory. Exploiting the difference-form of the massless scattering theory, we find a very simple and exhaustive list of reflection matrices for all the possible boundary coideal subalgebras - singlet and vector representations, right and left boundary - and check basic properties of our solut…
▽ More
We study the boundary integrability problem of the massless sector of $AdS_3 \times S^3 \times T^4 $ string theory. Exploiting the difference-form of the massless scattering theory, we find a very simple and exhaustive list of reflection matrices for all the possible boundary coideal subalgebras - singlet and vector representations, right and left boundary - and check basic properties of our solutions, primarily the boundary Yang-Baxter equation, for all possible combinations of scattering particles.
△ Less
Submitted 6 May, 2024; v1 submitted 27 March, 2024;
originally announced March 2024.
-
What explains the success of cross-modal fine-tuning with ORCA?
Authors:
Paloma García-de-Herreros,
Vagrant Gautam,
Philipp Slusallek,
Dietrich Klakow,
Marius Mosbach
Abstract:
ORCA (Shen et al., 2023) is a recent technique for cross-modal fine-tuning, i.e., applying pre-trained transformer models to modalities beyond their training data. The technique consists primarily of training an embedder and fine-tuning the embedder and model. Despite its high performance on a variety of downstream tasks, we do not understand precisely how each of these components contribute to OR…
▽ More
ORCA (Shen et al., 2023) is a recent technique for cross-modal fine-tuning, i.e., applying pre-trained transformer models to modalities beyond their training data. The technique consists primarily of training an embedder and fine-tuning the embedder and model. Despite its high performance on a variety of downstream tasks, we do not understand precisely how each of these components contribute to ORCA's success. Therefore, we run a series of ablations and find that embedder training does not help 2D tasks at all, contrary to what the original paper posits. In 1D tasks, some amount of embedder training is necessary but more is not better. In 4 out of 6 datasets we experiment with, it is model fine-tuning that makes the biggest difference. Through our ablations and baselines, we contribute a better understanding of the individual components of ORCA.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis
Authors:
Miaoran Zhang,
Vagrant Gautam,
Mingyang Wang,
Jesujoba O. Alabi,
Xiaoyu Shen,
Dietrich Klakow,
Marius Mosbach
Abstract:
In-context learning is a popular inference strategy where large language models solve a task using only a few labeled demonstrations without needing any parameter updates. Although there have been extensive studies on English in-context learning, multilingual in-context learning remains under-explored, and we lack an in-depth understanding of the role of demonstrations in this context. To address…
▽ More
In-context learning is a popular inference strategy where large language models solve a task using only a few labeled demonstrations without needing any parameter updates. Although there have been extensive studies on English in-context learning, multilingual in-context learning remains under-explored, and we lack an in-depth understanding of the role of demonstrations in this context. To address this gap, we conduct a multidimensional analysis of multilingual in-context learning, experimenting with 5 models from different model families, 9 datasets covering classification and generation tasks, and 56 typologically diverse languages. Our results reveal that the effectiveness of demonstrations varies significantly across models, tasks, and languages. We also find that strong instruction-following models including Llama 2-Chat, GPT-3.5, and GPT-4 are largely insensitive to the quality of demonstrations. Instead, a carefully crafted template often eliminates the benefits of demonstrations for some tasks and languages altogether. These findings show that the importance of demonstrations might be overestimated. Our work highlights the need for granular evaluation across multiple axes towards a better understanding of in-context learning.
△ Less
Submitted 7 June, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Color Confinement and Random Matrices -- A random walk down group manifold toward Casimir scaling --
Authors:
Georg Bergner,
Vaibhav Gautam,
Masanori Hanada
Abstract:
We explain the microscopic origin of linear confinement potential with the Casimir scaling in generic confining gauge theories. In the low-temperature regime of confining gauge theories such as QCD, Polyakov lines are slowly varying Haar random modulo exponentially small corrections with respect to the inverse temperature, as shown by one of the authors (M.~H.) and Watanabe. With exact Haar random…
▽ More
We explain the microscopic origin of linear confinement potential with the Casimir scaling in generic confining gauge theories. In the low-temperature regime of confining gauge theories such as QCD, Polyakov lines are slowly varying Haar random modulo exponentially small corrections with respect to the inverse temperature, as shown by one of the authors (M.~H.) and Watanabe. With exact Haar randomness, computation of the two-point correlator of Polyakov loops reduces to the problem of random walk on group manifold. Linear confinement potential with approximate Casimir scaling except at short distances follows naturally from slowly varying Haar randomness. With exponentially small corrections to Haar randomness, string breaking and loss of Casimir scaling at long distance follow. Hence we obtain the Casimir scaling which is only approximate and holds only at intermediate distance, which is precisely needed to explain the results of lattice simulations. For $(1+1)$-dimensional theories, there is a simplification that admits the Casimir scaling at short distances as well.
△ Less
Submitted 9 January, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
A Lightweight Method to Generate Unanswerable Questions in English
Authors:
Vagrant Gautam,
Miaoran Zhang,
Dietrich Klakow
Abstract:
If a question cannot be answered with the available information, robust systems for question answering (QA) should know _not_ to answer. One way to build QA models that do this is with additional training data comprised of unanswerable questions, created either by employing annotators or through automated methods for unanswerable question generation. To show that the model complexity of existing a…
▽ More
If a question cannot be answered with the available information, robust systems for question answering (QA) should know _not_ to answer. One way to build QA models that do this is with additional training data comprised of unanswerable questions, created either by employing annotators or through automated methods for unanswerable question generation. To show that the model complexity of existing automated approaches is not justified, we examine a simpler data augmentation method for unanswerable question generation in English: performing antonym and entity swaps on answerable questions. Compared to the prior state-of-the-art, data generated with our training-free and lightweight strategy results in better models (+1.6 F1 points on SQuAD 2.0 data with BERT-large), and has higher human-judged relatedness and readability. We quantify the raw benefits of our approach compared to no augmentation across multiple encoder models, using different amounts of generated data, and also on TydiQA-MinSpan data (+9.3 F1 points with BERT-large). Our results establish swaps as a simple but strong baseline for future work.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Joint-YODNet: A Light-weight Object Detector for UAVs to Achieve Above 100fps
Authors:
Vipin Gautam,
Shitala Prasad,
Sharad Sinha
Abstract:
Small object detection via UAV (Unmanned Aerial Vehicle) images captured from drones and radar is a complex task with several formidable challenges. This domain encompasses numerous complexities that impede the accurate detection and localization of small objects. To address these challenges, we propose a novel method called JointYODNet for UAVs to detect small objects, leveraging a joint loss fun…
▽ More
Small object detection via UAV (Unmanned Aerial Vehicle) images captured from drones and radar is a complex task with several formidable challenges. This domain encompasses numerous complexities that impede the accurate detection and localization of small objects. To address these challenges, we propose a novel method called JointYODNet for UAVs to detect small objects, leveraging a joint loss function specifically designed for this task. Our method revolves around the development of a joint loss function tailored to enhance the detection performance of small objects. Through extensive experimentation on a diverse dataset of UAV images captured under varying environmental conditions, we evaluated different variations of the loss function and determined the most effective formulation. The results demonstrate that our proposed joint loss function outperforms existing methods in accurately localizing small objects. Specifically, our method achieves a recall of 0.971, and a F1Score of 0.975, surpassing state-of-the-art techniques. Additionally, our method achieves a [email protected](%) of 98.6, indicating its robustness in detecting small objects across varying scales
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
AaP-ReID: Improved Attention-Aware Person Re-identification
Authors:
Vipin Gautam,
Shitala Prasad,
Sharad Sinha
Abstract:
Person re-identification (ReID) is a well-known problem in the field of computer vision. The primary objective is to identify a specific individual within a gallery of images. However, this task is challenging due to various factors, such as pose variations, illumination changes, obstructions, and the presence ofconfusing backgrounds. Existing ReID methods often fail to capture discriminative feat…
▽ More
Person re-identification (ReID) is a well-known problem in the field of computer vision. The primary objective is to identify a specific individual within a gallery of images. However, this task is challenging due to various factors, such as pose variations, illumination changes, obstructions, and the presence ofconfusing backgrounds. Existing ReID methods often fail to capture discriminative features (e.g., head, shoes, backpacks) and instead capture irrelevant features when the target is occluded. Motivated by the success of part-based and attention-based ReID methods, we improve AlignedReID++ and present AaP-ReID, a more effective method for person ReID that incorporates channel-wise attention into a ResNet-based architecture. Our method incorporates the Channel-Wise Attention Bottleneck (CWAbottleneck) block and can learn discriminating features by dynamically adjusting the importance ofeach channel in the feature maps. We evaluated Aap-ReID on three benchmark datasets: Market-1501, DukeMTMC-reID, and CUHK03. When compared with state-of-the-art person ReID methods, we achieve competitive results with rank-1 accuracies of 95.6% on Market-1501, 90.6% on DukeMTMC-reID, and 82.4% on CUHK03.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
YOLORe-IDNet: An Efficient Multi-Camera System for Person-Tracking
Authors:
Vipin Gautam,
Shitala Prasad,
Sharad Sinha
Abstract:
The growing need for video surveillance in public spaces has created a demand for systems that can track individuals across multiple cameras feeds in real-time. While existing tracking systems have achieved impressive performance using deep learning models, they often rely on pre-existing images of suspects or historical data. However, this is not always feasible in cases where suspicious individu…
▽ More
The growing need for video surveillance in public spaces has created a demand for systems that can track individuals across multiple cameras feeds in real-time. While existing tracking systems have achieved impressive performance using deep learning models, they often rely on pre-existing images of suspects or historical data. However, this is not always feasible in cases where suspicious individuals are identified in real-time and without prior knowledge. We propose a person-tracking system that combines correlation filters and Intersection Over Union (IOU) constraints for robust tracking, along with a deep learning model for cross-camera person re-identification (Re-ID) on top of YOLOv5. The proposed system quickly identifies and tracks suspect in real-time across multiple cameras and recovers well after full or partial occlusion, making it suitable for security and surveillance applications. It is computationally efficient and achieves a high F1-Score of 79% and an IOU of 59% comparable to existing state-of-the-art algorithms, as demonstrated in our evaluation on a publicly available OTB-100 dataset. The proposed system offers a robust and efficient solution for the real-time tracking of individuals across multiple camera feeds. Its ability to track targets without prior knowledge or historical data is a significant improvement over existing systems, making it well-suited for public safety and surveillance applications.
△ Less
Submitted 23 September, 2023;
originally announced September 2023.
-
Automated rendering of multi-stranded DNA complexes with pseudoknots
Authors:
Malgorzata Nowicka,
Vinay K. Gautam,
Pekka Orponen
Abstract:
We present a general method for rendering representations of multi-stranded DNA complexes from textual descriptions into 2D diagrams. The complexes can be arbitrarily pseudoknotted, and if a planar rendering is possible, the method will determine one in time which is essentially linear in the size of the textual description. (That is, except for a final stochastic fine-tuning step.) If a planar re…
▽ More
We present a general method for rendering representations of multi-stranded DNA complexes from textual descriptions into 2D diagrams. The complexes can be arbitrarily pseudoknotted, and if a planar rendering is possible, the method will determine one in time which is essentially linear in the size of the textual description. (That is, except for a final stochastic fine-tuning step.) If a planar rendering is not possible, the method will compute a visually pleasing approximate rendering in quadratic time. Examples of diagrams produced by the method are presented in the paper.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Stability of nucleic acid bases in concentrated sulfuric acid: Implications for the habitability of Venus' clouds
Authors:
Sara Seager,
Janusz J. Petkowski,
Maxwell D. Seager,
John H. Grimes Jr.,
Zachary Zinsli,
Heidi R. Vollmer-Snarr,
Mohamed K. Abd El-Rahman,
David S. Wishart,
Brian L. Lee,
Vasuk Gautam,
Lauren Herrington,
William Bains,
Charles Darrow
Abstract:
What constitutes a habitable planet is a frontier to be explored and requires pushing the boundaries of our terracentric viewpoint for what we deem to be a habitable environment. Despite Venus' 700 K surface temperature being too hot for any plausible solvent and most organic covalent chemistry, Venus' cloud-filled atmosphere layers at 48 to 60 km above the surface hold the main requirements for l…
▽ More
What constitutes a habitable planet is a frontier to be explored and requires pushing the boundaries of our terracentric viewpoint for what we deem to be a habitable environment. Despite Venus' 700 K surface temperature being too hot for any plausible solvent and most organic covalent chemistry, Venus' cloud-filled atmosphere layers at 48 to 60 km above the surface hold the main requirements for life: suitable temperatures for covalent bonds; an energy source (sunlight); and a liquid solvent. Yet, the Venus clouds are widely thought to be incapable of supporting life because the droplets are composed of concentrated liquid sulfuric acid-an aggressive solvent that is assumed to rapidly destroy most biochemicals of life on Earth. Recent work, however, demonstrates that a rich organic chemistry can evolve from simple precursor molecules seeded into concentrated sulfuric acid, a result that is corroborated by domain knowledge in industry that such chemistry leads to complex molecules, including aromatics. We aim to expand the set of molecules known to be stable in concentrated sulfuric acid. Here, we show that nucleic acid bases adenine, cytosine, guanine, thymine, and uracil, as well as 2,6-diaminopurine and the "core" nucleic acid bases purine and pyrimidine, are stable in sulfuric acid in the Venus cloud temperature and sulfuric acid concentration range, using UV spectroscopy and combinations of 1D and 2D 1H 13C 15N NMR spectroscopy. The stability of nucleic acid bases in concentrated sulfuric acid advances the idea that chemistry to support life may exist in the Venus cloud particle environment.
△ Less
Submitted 19 June, 2023;
originally announced June 2023.
-
Performance of a front-end prototype ASIC for the ATLAS High Granularity Timing Detector
Authors:
C. Agapopoulou,
L. A. Beresford,
D. E. Boumediene,
L. Castillo García,
S. Conforti,
C. de la Taille,
L. D. Corpe,
M. J. Da Cunha Sargedas de Sousa,
P. Dinaucourt,
A. Falou,
V. Gautam,
D. Gong,
C. Grieco,
S. Grinstein,
S. Guindon,
A. Howard,
O. Kurdysh,
E. Kuwertz,
C. Li,
N. Makovec,
B. Markovic,
G. Martin-Chassal,
R. Mazzini,
C. Milke,
M. Morenas
, et al. (12 additional authors not shown)
Abstract:
This paper presents the design and characterisation of a front-end prototype ASIC for the ATLAS High Granularity Timing Detector, which is planned for the High-Luminosity phase of the LHC. This prototype, called ALTIROC1, consists of a 5$\times$5-pad matrix and contains the analog part of the single-channel readout (preamplifier, discriminator, two TDCs and SRAM). Two preamplifier architectures (t…
▽ More
This paper presents the design and characterisation of a front-end prototype ASIC for the ATLAS High Granularity Timing Detector, which is planned for the High-Luminosity phase of the LHC. This prototype, called ALTIROC1, consists of a 5$\times$5-pad matrix and contains the analog part of the single-channel readout (preamplifier, discriminator, two TDCs and SRAM). Two preamplifier architectures (transimpedance and voltage) were implemented and tested. The ASIC was characterised both alone and as a module when connected to a 5$\times$5-pad array of LGAD sensors. In calibration measurements, the ASIC operating alone was found to satisfy the technical requirements for the project, with similar performances for both preamplifier types. In particular, the jitter was found to be 15$\pm$1~ps (35$\pm$1~ps) for an injected charge of 10~fC (4~fC). A degradation in performance was observed when the ASIC was connected to the LGAD array. This is attributed to digital couplings at the entrance of the preamplifiers. When the ASIC is connected to the LGAD array, the lowest detectable charge increased from 1.5~fC to 3.4~fC. As a consequence, the jitter increased for an injected charge of 4~fC. Despite this increase, ALTIROC1 still satisfies the maximum jitter specification (below 65~ps) for the HGTD project. This coupling issue also affects the time over threshold measurements and the time-walk correction can only be performed with transimpedance preamplifiers. Beam test measurements with a pion beam at CERN were also undertaken to evaluate the performance of the module. The best time resolution obtained using only ALTIROC TDC data was 46.3$\pm$0.7~ps for a restricted time of arrival range where the coupling issue is minimized. The residual time-walk contribution is equal to 23~ps and is the dominant electronic noise contribution to the time resolution at 15~fC.
△ Less
Submitted 25 July, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Factoring the Matrix of Domination: A Critical Review and Reimagination of Intersectionality in AI Fairness
Authors:
Anaelia Ovalle,
Arjun Subramonian,
Vagrant Gautam,
Gilbert Gee,
Kai-Wei Chang
Abstract:
Intersectionality is a critical framework that, through inquiry and praxis, allows us to examine how social inequalities persist through domains of structure and discipline. Given AI fairness' raison d'etre of "fairness", we argue that adopting intersectionality as an analytical framework is pivotal to effectively operationalizing fairness. Through a critical review of how intersectionality is dis…
▽ More
Intersectionality is a critical framework that, through inquiry and praxis, allows us to examine how social inequalities persist through domains of structure and discipline. Given AI fairness' raison d'etre of "fairness", we argue that adopting intersectionality as an analytical framework is pivotal to effectively operationalizing fairness. Through a critical review of how intersectionality is discussed in 30 papers from the AI fairness literature, we deductively and inductively: 1) map how intersectionality tenets operate within the AI fairness paradigm and 2) uncover gaps between the conceptualization and operationalization of intersectionality. We find that researchers overwhelmingly reduce intersectionality to optimizing for fairness metrics over demographic subgroups. They also fail to discuss their social context and when mentioning power, they mostly situate it only within the AI pipeline. We: 3) outline and assess the implications of these gaps for critical inquiry and praxis, and 4) provide actionable recommendations for AI fairness researchers to engage with intersectionality in their work by grounding it in AI epistemology.
△ Less
Submitted 20 July, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Performance in beam tests of Carbon-enriched irradiated Low Gain Avalanche Detectors for the ATLAS High Granularity Timing Detector
Authors:
S. Ali,
H. Arnold,
S. L. Auwens,
L. A. Beresford,
D. E. Boumediene,
A. M. Burger,
L. Cadamuro,
L. Castillo García,
L. D. Corpe,
M. J. Da Cunha Sargedas de Sousa,
D. Dannheim,
V. Dao,
A. Gabrielli,
Y. El Ghazali,
H. El Jarrari,
V. Gautam,
S. Grinstein,
J. Guimarães da Costa,
S. Guindon,
X. Jia,
G. Kramberger,
Y. Liu,
K. Ma,
N. Makovec,
S. Manzoni
, et al. (12 additional authors not shown)
Abstract:
The High Granularity Timing Detector (HGTD) will be installed in the ATLAS experiment to mitigate pile-up effects during the High Luminosity (HL) phase of the Large Hadron Collider (LHC) at CERN. Low Gain Avalanche Detectors (LGADs) will provide high-precision measurements of the time of arrival of particles at the HGTD, improving the particle-vertex assignment. To cope with the high-radiation env…
▽ More
The High Granularity Timing Detector (HGTD) will be installed in the ATLAS experiment to mitigate pile-up effects during the High Luminosity (HL) phase of the Large Hadron Collider (LHC) at CERN. Low Gain Avalanche Detectors (LGADs) will provide high-precision measurements of the time of arrival of particles at the HGTD, improving the particle-vertex assignment. To cope with the high-radiation environment, LGADs have been optimized by adding carbon in the gain layer, thus reducing the acceptor removal rate after irradiation. Performances of several carbon-enriched LGAD sensors from different vendors, and irradiated with high fluences of 1.5 and 2.5 x 10^15 neq/cm2, have been measured in beam test campaigns during the years 2021 and 2022 at CERN SPS and DESY. This paper presents the results obtained with data recorded by an oscilloscope synchronized with a beam telescope which provides particle position information within a resolution of a few um. Collected charge, time resolution and hit efficiency measurements are presented. In addition, the efficiency uniformity is also studied as a function of the position of the incident particle inside the sensor pad.
△ Less
Submitted 17 March, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
A study of integrable form factors in massless relativistic $AdS_2$
Authors:
Daniele Bielli,
Vaibhav Gautam,
Alessandro Torrielli
Abstract:
In this paper we initiate the study of form factors for the massless scattering of integrable $AdS_2$ superstrings, where the difference-form of the $S$-matrix can be exploited to implement the relativistic form factor bootstrap. The non-standard nature of the $S$-matrix implies that traditional methods do not apply. We use the fact that the massless $AdS_2$ $S$-matrix is a limit of a better-behav…
▽ More
In this paper we initiate the study of form factors for the massless scattering of integrable $AdS_2$ superstrings, where the difference-form of the $S$-matrix can be exploited to implement the relativistic form factor bootstrap. The non-standard nature of the $S$-matrix implies that traditional methods do not apply. We use the fact that the massless $AdS_2$ $S$-matrix is a limit of a better-behaved $S$-matrix found by Fendley. We show that the previously conjectured massless $AdS_2$ dressing factor coincides with the limit of the De Martino - Moriconi improved dressing factor for the Fendley $S$-matrix. After finding a method to construct integral representations of relativistic dressing factors satisfying specific assumptions, we obtain analytic proofs of crossing and unitarity relations and propose a solution to the form factors constraints in the two-particle case.
△ Less
Submitted 23 July, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Linear confinement in the partially-deconfined phase
Authors:
Vaibhav Gautam,
Masanori Hanada,
Jack Holden,
Enrico Rinaldi
Abstract:
We consider the partially-deconfined saddle of large-$N$ pure Yang-Mills theory lying between confined and deconfined phases, in which the color degrees of freedom split into confined and deconfined sectors. Based on the microscopic mechanism of deconfinement, we argue that a flux tube is formed in the confined sector and a linear confinement potential is generated. The string tension should not d…
▽ More
We consider the partially-deconfined saddle of large-$N$ pure Yang-Mills theory lying between confined and deconfined phases, in which the color degrees of freedom split into confined and deconfined sectors. Based on the microscopic mechanism of deconfinement, we argue that a flux tube is formed in the confined sector and a linear confinement potential is generated. The string tension should not depend on the size of the confined sector. We provide evidence by studying the finite-temperature strong-coupling lattice gauge theory. In particular, we make analytic predictions assuming linear confinement in the confined sector, and then confirm these by numerical simulations. We discuss some implications of the conjecture to QCD and holography.
△ Less
Submitted 16 March, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.
-
Producing Histopathology Phantom Images using Generative Adversarial Networks to improve Tumor Detection
Authors:
Vidit Gautam
Abstract:
Advance in medical imaging is an important part in deep learning research. One of the goals of computer vision is development of a holistic, comprehensive model which can identify tumors from histology slides obtained via biopsies. A major problem that stands in the way is lack of data for a few cancer-types. In this paper, we ascertain that data augmentation using GANs can be a viable solution to…
▽ More
Advance in medical imaging is an important part in deep learning research. One of the goals of computer vision is development of a holistic, comprehensive model which can identify tumors from histology slides obtained via biopsies. A major problem that stands in the way is lack of data for a few cancer-types. In this paper, we ascertain that data augmentation using GANs can be a viable solution to reduce the unevenness in the distribution of different cancer types in our dataset. Our demonstration showed that a dataset augmented to a 50% increase causes an increase in tumor detection from 80% to 87.5%
△ Less
Submitted 21 May, 2022;
originally announced May 2022.
-
Matrix Entanglement
Authors:
Vaibhav Gautam,
Masanori Hanada,
Antal Jevicki,
Cheng Peng
Abstract:
In gauge/gravity duality, matrix degrees of freedom on the gauge theory side play important roles for the emergent geometry. In this paper, we discuss how the entanglement on the gravity side can be described as the entanglement between matrix degrees of freedom. Our approach, which we call 'matrix entanglement', is different from 'target-space entanglement' proposed and discussed recently by seve…
▽ More
In gauge/gravity duality, matrix degrees of freedom on the gauge theory side play important roles for the emergent geometry. In this paper, we discuss how the entanglement on the gravity side can be described as the entanglement between matrix degrees of freedom. Our approach, which we call 'matrix entanglement', is different from 'target-space entanglement' proposed and discussed recently by several groups. We consider several classes of quantum states to which our approach can play important roles. When applied to fuzzy sphere, matrix entanglement can be used to define the usual spatial entanglement in two-brane or five-brane world-volume theory nonperturbatively in a regularized setup. Another application is to a small black hole in AdS5*S5 that can evaporate without being attached to a heat bath, for which our approach suggests a gauge theory origin of the Page curve. The confined degrees of freedom in the partially-deconfined states play the important roles.
△ Less
Submitted 13 May, 2022; v1 submitted 13 April, 2022;
originally announced April 2022.
-
The E-Intelligence System
Authors:
Vibhor Gautam,
Vikalp Shishodia
Abstract:
Electronic Intelligence (ELINT), often known as E-Intelligence, is intelligence obtained through electronic sensors. Other than personal communications, ELINT intelligence is usually obtained. The goal is usually to determine a target's capabilities, such as radar placement. Active or passive sensors can be employed to collect data. A provided signal is analyzed and contrasted to collected data fo…
▽ More
Electronic Intelligence (ELINT), often known as E-Intelligence, is intelligence obtained through electronic sensors. Other than personal communications, ELINT intelligence is usually obtained. The goal is usually to determine a target's capabilities, such as radar placement. Active or passive sensors can be employed to collect data. A provided signal is analyzed and contrasted to collected data for recognized signal types. The information may be stored if the signal type is detected; it can be classed as new if no match is found. ELINT collects and categorizes data. In a military setting (and others that have adopted the usage, such as a business), intelligence helps an organization make decisions that can provide them a strategic advantage over the competition. The term "intel" is frequently shortened. The two main subfields of signals intelligence (SIGINT) are ELINT and Communications Intelligence (COMINT). The US Department of Defense specifies the terminologies, and intelligence communities use the categories of data reviewed worldwide.
△ Less
Submitted 5 January, 2022;
originally announced January 2022.
-
A Cross-lingual Natural Language Processing Framework for Infodemic Management
Authors:
Ridam Pal,
Rohan Pandey,
Vaibhav Gautam,
Kanav Bhagat,
Tavpritesh Sethi
Abstract:
The COVID-19 pandemic has put immense pressure on health systems which are further strained due to the misinformation surrounding it. Under such a situation, providing the right information at the right time is crucial. There is a growing demand for the management of information spread using Artificial Intelligence. Hence, we have exploited the potential of Natural Language Processing for identify…
▽ More
The COVID-19 pandemic has put immense pressure on health systems which are further strained due to the misinformation surrounding it. Under such a situation, providing the right information at the right time is crucial. There is a growing demand for the management of information spread using Artificial Intelligence. Hence, we have exploited the potential of Natural Language Processing for identifying relevant information that needs to be disseminated amongst the masses. In this work, we present a novel Cross-lingual Natural Language Processing framework to provide relevant information by matching daily news with trusted guidelines from the World Health Organization. The proposed pipeline deploys various techniques of NLP such as summarizers, word embeddings, and similarity metrics to provide users with news articles along with a corresponding healthcare guideline. A total of 36 models were evaluated and a combination of LexRank based summarizer on Word2Vec embedding with Word Mover distance metric outperformed all other models. This novel open-source approach can be used as a template for proactive dissemination of relevant healthcare information in the midst of misinformation spread associated with epidemics.
△ Less
Submitted 30 October, 2020;
originally announced October 2020.
-
A Machine Learning Application for Raising WASH Awareness in the Times of COVID-19 Pandemic
Authors:
Rohan Pandey,
Vaibhav Gautam,
Ridam Pal,
Harsh Bandhey,
Lovedeep Singh Dhingra,
Himanshu Sharma,
Chirag Jain,
Kanav Bhagat,
Arushi,
Lajjaben Patel,
Mudit Agarwal,
Samprati Agrawal,
Rishabh Jalan,
Akshat Wadhwa,
Ayush Garg,
Vihaan Misra,
Yashwin Agrawal,
Bhavika Rana,
Ponnurangam Kumaraguru,
Tavpritesh Sethi
Abstract:
Background: The COVID-19 pandemic has uncovered the potential of digital misinformation in sha** the health of nations. The deluge of unverified information that spreads faster than the epidemic itself is an unprecedented phenomenon that has put millions of lives in danger. Mitigating this Infodemic requires strong health messaging systems that are engaging, vernacular, scalable, effective and c…
▽ More
Background: The COVID-19 pandemic has uncovered the potential of digital misinformation in sha** the health of nations. The deluge of unverified information that spreads faster than the epidemic itself is an unprecedented phenomenon that has put millions of lives in danger. Mitigating this Infodemic requires strong health messaging systems that are engaging, vernacular, scalable, effective and continuously learn the new patterns of misinformation.
Objective: We created WashKaro, a multi-pronged intervention for mitigating misinformation through conversational AI, machine translation and natural language processing. WashKaro provides the right information matched against WHO guidelines through AI, and delivers it in the right format in local languages.
Methods: We theorize (i) an NLP based AI engine that could continuously incorporate user feedback to improve relevance of information, (ii) bite sized audio in the local language to improve penetrance in a country with skewed gender literacy ratios, and (iii) conversational but interactive AI engagement with users towards an increased health awareness in the community. Results: A total of 5026 people who downloaded the app during the study window, among those 1545 were active users. Our study shows that 3.4 times more females engaged with the App in Hindi as compared to males, the relevance of AI-filtered news content doubled within 45 days of continuous machine learning, and the prudence of integrated AI chatbot Satya increased thus proving the usefulness of an mHealth platform to mitigate health misinformation.
Conclusion: We conclude that a multi-pronged machine learning application delivering vernacular bite-sized audios and conversational AI is an effective approach to mitigate health misinformation.
△ Less
Submitted 30 October, 2020; v1 submitted 16 March, 2020;
originally announced March 2020.
-
Diamond nano-pillar arrays for quantum microscopy of neuronal signals
Authors:
Liam Hanlon,
Vini Gautam,
James D. A. Wood,
Prithvi Reddy,
Michael S. J. Barson,
Marika Niihori,
Alexander R. J. Silalahi,
Ben Corry,
Joerg Wrachtrup,
Matthew J. Sellars,
Vincent R. Daria,
Patrick Maletinsky,
Gregory J. Stuart,
Marcus W. Doherty
Abstract:
Modern neuroscience is currently limited in its capacity to perform long term, wide-field measurements of neuron electromagnetics with nanoscale resolution. Quantum microscopy using the nitrogen vacancy centre (NV) can provide a potential solution to this problem with electric and magnetic field sensing at nano-scale resolution and good biocompatibility. However, the performance of existing NV sen…
▽ More
Modern neuroscience is currently limited in its capacity to perform long term, wide-field measurements of neuron electromagnetics with nanoscale resolution. Quantum microscopy using the nitrogen vacancy centre (NV) can provide a potential solution to this problem with electric and magnetic field sensing at nano-scale resolution and good biocompatibility. However, the performance of existing NV sensing technology does not allow for studies of small mammalian neurons yet. In this paper, we propose a solution to this problem by engineering NV quantum sensors in diamond nanopillar arrays. The pillars improve light collection efficiency by guiding excitation/emission light, which improves sensitivity. More importantly, they also improve the size of the signal at the NV by removing screening charges as well as coordinating the neuron growth to the tips of the pillars where the NV is located. Here, we provide a growth study to demonstrate coordinated neuron growth as well as the first simulation of nano-scopic neuron electric and magnetic fields to assess the enhancement provided by the nanopillar geometry.
△ Less
Submitted 25 January, 2019;
originally announced January 2019.
-
`Zero-spin-photon hypothesis' finds another important application: Could possibly solve the `infinity-problem' of QED without the need of renormalization
Authors:
R. C. Gupta,
Anirudh Pradhan,
V. P. Gautam,
M. S. Kalara,
B. Das,
Sushant Gupta
Abstract:
`Zero-spin-photon hypothesis' as proposed in an earlier paper [1] states that: `due to inevitable consequence of the second-law of thermodynamics and spin-conservation, the `zero-spin-photon' is generated in pair-production process (of elementary particles), which decays into neutrino and antineutrino'. The zero-spin photon hypothesis explains [1] several riddles of physics and universe. In the…
▽ More
`Zero-spin-photon hypothesis' as proposed in an earlier paper [1] states that: `due to inevitable consequence of the second-law of thermodynamics and spin-conservation, the `zero-spin-photon' is generated in pair-production process (of elementary particles), which decays into neutrino and antineutrino'. The zero-spin photon hypothesis explains [1] several riddles of physics and universe. In the present paper, it is shown that `the zero-spin photon hypothesis' when incorporated into the higer-order Feynman diagram (with a closed-loop) could possibly solve the half-a-century-old and famous `infinity-problem' of QED, and thus could avoid the need of the so called `re-normalization' procedure.
△ Less
Submitted 21 September, 2009; v1 submitted 21 January, 2009;
originally announced January 2009.
-
Zero-spin-photon hypothesis: `Zero-spin-photon generation in pair-production and its subsequent decay into neutrino and antineutrino' - solves many-riddles of physics and universe
Authors:
R. C. Gupta,
Anirudh Pradhan,
Ruchi Gupta,
Sanjay Gupta,
V. P. Gautam,
B. Das,
Sushant Gupta
Abstract:
`What is work and what is heat' is re-investigated from the perspective of second law of thermodynamics. It is shown that the inevitable consequence of second law of thermodynamics and spin conservation necessitates the possible generation of zero spin photon in pair production process, and its subsequent decay explains the birth of neutrino and antineutrino. The proposed neutrino-genesis, solve…
▽ More
`What is work and what is heat' is re-investigated from the perspective of second law of thermodynamics. It is shown that the inevitable consequence of second law of thermodynamics and spin conservation necessitates the possible generation of zero spin photon in pair production process, and its subsequent decay explains the birth of neutrino and antineutrino. The proposed neutrino-genesis, solves many riddles of physics and universe. The riddles considered and explained are about: (i) mysterious neutrino (and antineutrino) and its bizarre properties such as handed-ness and parity-violation, (ii) questionable asymmetry/ excess of matter over antimatter, (iii) possibility of existence of antimatter world and (iv) parity (P) violation and aspects of CP and CPT violation or restoration in the universe.
△ Less
Submitted 22 July, 2009; v1 submitted 25 November, 2005;
originally announced November 2005.
-
A categorical construction of 2-dimensional extended Topological Quantum Field Theory
Authors:
Vishvajit V. S. Gautam
Abstract:
In this paper we propose a naive construction of 2-dimensional extended topological quantum field theories (TQFTs), which can be further generalized to the higher-dimension extended TQFTs.
In this paper we propose a naive construction of 2-dimensional extended topological quantum field theories (TQFTs), which can be further generalized to the higher-dimension extended TQFTs.
△ Less
Submitted 29 August, 2003;
originally announced August 2003.
-
A note on Closure Operators in Category of Modules
Authors:
Vishvajit V. S. Gautam
Abstract:
In this article we give application of closure operators in category of modules. Our main result shows that every subcategory A of injective modules of R-mod (under a mild condition) induces a torsion theory of R-mod.
In this article we give application of closure operators in category of modules. Our main result shows that every subcategory A of injective modules of R-mod (under a mild condition) induces a torsion theory of R-mod.
△ Less
Submitted 29 July, 2003; v1 submitted 25 March, 2003;
originally announced March 2003.
-
A note on Closure Operators in Category of Groups
Authors:
Vishvajit V. S. Gautam
Abstract:
We give some applications of closure operators in category of groups and link them with the join problem of subnormal subgroups.
We give some applications of closure operators in category of groups and link them with the join problem of subnormal subgroups.
△ Less
Submitted 11 December, 2002;
originally announced December 2002.
-
On Regular Closure Operators and Cowellpowered Subcategories
Authors:
Vishvajit V. S. Gautam
Abstract:
Many Properties of a category X, as for instance the existence of an adjoint or a factorization system, are a consequence of the cowellpoweredness of X. In the absence of cowellpoweredness, for general results, fairly strong assumption on the category are needed. This paper provides a number of novel and useful observations to tackle the cowellpoweredness problem of subcategories by means of reg…
▽ More
Many Properties of a category X, as for instance the existence of an adjoint or a factorization system, are a consequence of the cowellpoweredness of X. In the absence of cowellpoweredness, for general results, fairly strong assumption on the category are needed. This paper provides a number of novel and useful observations to tackle the cowellpoweredness problem of subcategories by means of regular closure operators. Our exposition focusses on the question when two subcategories A and B induce the same regular closure operators, then information about (non)-cowellpoweredness of A may be gained from corresponding property of B, and vice versa.
△ Less
Submitted 13 August, 2002; v1 submitted 12 June, 2002;
originally announced June 2002.
-
Some equivalent of extremally disconnected spaces
Authors:
Vishvajit V S Gautam
Abstract:
We give some equivalent characterizations of exremally disconnected spaces
We give some equivalent characterizations of exremally disconnected spaces
△ Less
Submitted 12 June, 2002;
originally announced June 2002.