
doi 10.14778/3476249.3476298

Phoebe: A Learning-based Checkpoint Optimizer

Authors: Yiwen Zhu, Matteo Interlandi, Abhishek Roy, Krishnadhan Das, Hiren Patel, Malay Bag, Hitesh Sharma, Alekh Jindal

Abstract: Easy-to-use programming interfaces paired with cloud-scale processing engines have enabled big data system users to author arbitrarily complex analytical jobs over massive volumes of data. However, as the complexity and scale of analytical jobs increase, they encounter a number of unforeseen problems: hotspots with large intermediate data on temporary storage, longer job recovery time after failures, and worse query optimizer estimates are examples of issues that we are facing at Microsoft. To address these issues, we propose Phoebe, an efficient learning-based checkpoint optimizer. Given a set of constraints and an objective function at compile-time, Phoebe is able to determine the decomposition of job plans, and the optimal set of checkpoints to preserve their outputs to durable global storage. Phoebe consists of three machine learning predictors and one optimization module. For each stage of a job, Phoebe makes accurate predictions for: (1) the execution time, (2) the output size, and (3) the start/end time taking into account the inter-stage dependencies. Using these predictions, we formulate checkpoint optimization as an integer programming problem and propose a scalable heuristic algorithm that meets the latency requirement of the production environment. We demonstrate the effectiveness of Phoebe in production workloads, and show that we can free the temporary storage on hotspots by more than 70% and restart failed jobs 68% faster on average with minimum performance impact. Phoebe also illustrates that adding multiple sets of checkpoints is not cost-efficient, which dramatically reduces the complexity of the optimization.

Submitted 5 October, 2021; originally announced October 2021.

Journal ref: Proceedings of the VLDB Endowment 14 (11), 2505-2518, 2021

arXiv:2110.01395 [pdf]

Prediction of IPL Match Outcome Using Machine Learning Techniques

Authors: Srikantaiah K C, Aryan Khetan, Baibhav Kumar, Divy Tolani, Harshal Patel

Abstract: Cricket is India's most popular sport and is played all over the nation in formats such as T20, ODI, and Test. The Indian Premier League (IPL) is a national cricket league in which players are drawn from the regional teams of India, the national team, and international teams. Factors such as live streaming, radio, and TV broadcasts have made this league popular among cricket fans. The prediction of the outcome of the IPL matches is very important for online traders and sponsors. We can predict the match between two teams based on various factors like team composition, batting and bowling averages of each player in the team, and the team's success in their previous matches, in addition to traditional factors such as toss, venue, and day-night, and the probability of winning by batting first at a specified match venue against a specific team. In this paper, we propose a model for predicting the outcome of IPL matches using machine learning algorithms, namely SVM, Random Forest Classifier (RFC), Logistic Regression, and K-Nearest Neighbor. Experimental results showed that the Random Forest algorithm outperforms the other algorithms with an accuracy of 88.10%.

Submitted 30 September, 2021; originally announced October 2021.

Comments: 8 pages. Atlantis Highlights in Computer Sciences, Proceedings of the 3rd International Conference on Integrated Intelligent Computing Communication & Security ICIIC 2021

arXiv:2109.01630 [pdf, other]

Do Minimal Parity Solutions to the Strong CP Problem Work?

Authors: Jordy de Vries, Patrick Draper, Hiren H. Patel

Abstract: One class of solutions to the strong CP problem relies on generalized parity symmetries. A minimal model of this type, constructed by Babu and Mohapatra and based on a softly broken parity symmetry, has the remarkable property that the effective QCD vacuum angle $\bar\theta$ vanishes up to one-loop order. We compute the leading two-loop contributions to $\bar\theta$ in this model and estimate subleading contributions. In contrast to previous estimates, we argue that $\bar\theta$ is not suppressed by the weak scale, and we find contributions of order $10^{-3}$-$10^{-2}$ multiplying unknown mixing angles and phases. Thus the model does not generically address the strong CP problem, but it might be made consistent with $\bar\theta<10^{-10}$ in some corners of parameter space. For such non-generic parameters, $\bar\theta$ is still likely to be just below present bounds, and therefore provides the dominant source of hadronic EDMs. We discuss the resulting EDM phenomenology.

Submitted 3 September, 2021; originally announced September 2021.

Comments: 8 pages, 1 figure

arXiv:2108.08791 [pdf, other]

Image Inpainting using Partial Convolution

Authors: Harsh Patel, Amey Kulkarni, Shivam Sahni, Udit Vyas

Abstract: Image Inpainting is one of the very popular tasks in the field of image processing with broad applications in computer vision. In various practical applications, images are often deteriorated by noise due to the presence of corrupted, lost, or undesirable information. There have been various restoration techniques used in the past with both classical and deep learning approaches for handling such issues. Some traditional methods include image restoration by filling gap pixels using the nearby known pixels or using the moving average over the same. The aim of this paper is to perform image inpainting using robust deep learning methods that use partial convolution layers.

Submitted 19 August, 2021; originally announced August 2021.

arXiv:2108.08665 [pdf, other]

Trust as a Metric for Resiliency in Signed Social Networks

Authors: Harsh Patel, Shivam Sahni, Pushkar Mujumdar

Abstract: Recent technological advancements have resulted in a surge in online trading, raising severe concerns about theft and fraud, especially on platforms like Bitcoin OTC (over-the-counter), where users' identities remain anonymous. To mitigate the risk, it has become essential to capture the reputation of users based on their trade histories. The who-trusts-whom signed network of people has the capability to reflect the nature of such positive and negative relations between the users. It can be used to analyze linkage patterns, strength, and resiliency of such platforms. Due to the dynamic nature of trust between individuals, these trust networks are often vulnerable to link or node failures, making it critical to understand the stability of such systems. In this paper, we consider the problem of quantifying the resiliency of signed networks with the help of trustworthy community structures. We propose a metric for computing the trustworthiness of a community structure. Using the trustworthiness scores of all community structures, we generate a pipeline for assessing the resiliency of a signed network. We also show how these generated resiliency scores are concordant with the true nature of the network.

Submitted 19 August, 2021; originally announced August 2021.

arXiv:2108.05935 [pdf, other]

Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets

Authors: Nitin Gupta, Hima Patel, Shazia Afzal, Naveen Panwar, Ruhi Sharma Mittal, Shanmukha Guttula, Abhinav Jain, Lokesh Nagalapatti, Sameep Mehta, Sandeep Hans, Pranay Lohia, Aniya Aggarwal, Diptikalyan Saha

Abstract: The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks. However, these techniques are not applicable to detecting data issues in the context of machine learning tasks, such as noisy labels or the existence of overlapping classes. We attempt to re-look at the data quality issues in the context of building a machine learning pipeline and build a tool that can detect, explain and remediate issues in the data, and systematically and automatically capture all the changes applied to the data. We introduce the Data Quality Toolkit for machine learning as a library of some key quality metrics and relevant remediation techniques to analyze and enhance the readiness of structured training datasets for machine learning projects. The toolkit can reduce the turn-around times of data preparation pipelines and streamline the data quality assessment process. Our toolkit is publicly available via the IBM API Hub [1] platform; any developer can assess data quality using IBM's Data Quality for AI APIs [2]. Detailed tutorials are also available on IBM Learning Path [3].

Submitted 5 September, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

arXiv:2107.08594 [pdf, other]

Optimal Resource Allocation for Serverless Queries

Authors: Anish Pimpley, Shuo Li, Anubha Srivastava, Vishal Rohra, Yi Zhu, Soundararajan Srinivasan, Alekh Jindal, Hiren Patel, Shi Qiao, Rathijit Sen

Abstract: Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-data services. At the same time, it is incredibly hard for users to allocate resources per query in serverless processing systems, and they frequently misallocate by orders of magnitude. Unfortunately, prior work focused on predicting peak allocation while ignoring aggressive trade-offs between resource allocation and run-time. Additionally, these methods fail to predict allocation for queries that have not been observed in the past. In this paper, we tackle both these problems. We introduce a system for optimal resource allocation that can predict performance with aggressive trade-offs, for both new and past observed queries. We introduce the notion of a performance characteristic curve (PCC) as a parameterized representation that can compactly capture the relationship between resources and performance. To tackle training data sparsity, we introduce a novel data augmentation technique to efficiently synthesize the entire PCC using a single run of the query. Lastly, we demonstrate the advantages of a constrained loss function coupled with GNNs, over traditional ML methods, for capturing the domain specific behavior through an extensive experimental evaluation over SCOPE big data workloads at Microsoft.

Submitted 18 July, 2021; originally announced July 2021.

arXiv:2106.15644 [pdf, other]

doi 10.1016/j.aop.2022.168867

Construction of Quantum Target Space from World-Sheet States using Quantum State Tomography

Authors: Salman Sajad Wani, Arshid Shabir, Junaid Ul Hassan, S. Kannan, Hrishikesh Patel, C. Sudheesh, Mir Faizal

Abstract: In this paper, we will construct the quantum states of target space coordinates from world-sheet states, using quantum state tomography. To perform quantum state tomography of an open string, we will construct suitable quadrature operators. We do this by first defining the quadrature operators on the world-sheet, and then using them to construct the quantum target space quadrature operators for an open string. We will connect the quantum target space to classical geometry using coherent string states. We will be using a novel construction based on a string displacement operator to construct these coherent states. The coherent states of the world-sheet will also be used to construct the coherent states in target space.

Submitted 29 June, 2021; originally announced June 2021.

Comments: 13 pages, 4 figures, 1 appendix

Journal ref: Annals Phys. 441 (2022) 168867

arXiv:2105.07809 [pdf, other]

Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report

Authors: Andrey Ignatov, Cheng-Ming Chiang, Hsien-Kai Kuo, Anastasia Sycheva, Radu Timofte, Min-Hung Chen, Man-Yu Lee, Yu-Syuan Xu, Yu Tseng, Shusong Xu, ** Guo, Chao-Hung Chen, Ming-Chun Hsyu, Wen-Chia Tsai, Chao-Wei Chen, Grigory Malivenko, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Zheng Shaolong, Hao Dejun, Xie Fen, Feng Zhuang, et al. (16 additional authors not shown)

Abstract: As the quality of mobile cameras starts to play a crucial role in modern smartphones, more and more attention is now being paid to ISP algorithms used to improve various perceptual aspects of mobile photos. In this Mobile AI challenge, the target was to develop an end-to-end deep learning-based image signal processing (ISP) pipeline that can replace classical hand-crafted ISPs and achieve nearly real-time performance on smartphone NPUs. For this, the participants were provided with a novel learned ISP dataset consisting of RAW-RGB image pairs captured with the Sony IMX586 Quad Bayer mobile sensor and a professional 102-megapixel medium format camera. The runtime of all models was evaluated on the MediaTek Dimensity 1000+ platform with a dedicated AI processing unit capable of accelerating both floating-point and quantized neural networks. The proposed solutions are fully compatible with the above NPU and are capable of processing Full HD photos under 60-100 milliseconds while achieving high fidelity results. A detailed description of all models developed in this challenge is provided in this paper.

Submitted 17 May, 2021; originally announced May 2021.

Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/

arXiv:2105.04559 [pdf, other]

doi 10.1103/PhysRevD.104.083021

Discovering new forces with gravitational waves from supermassive black holes

Authors: Jeff A. Dror, Benjamin V. Lehmann, Hiren H. Patel, Stefano Profumo

Abstract: Supermassive black hole binary mergers generate a stochastic gravitational wave background detectable by pulsar timing arrays. While the amplitude of this background is subject to significant uncertainties, the frequency dependence is a robust prediction of general relativity. We show that the effects of new forces beyond the Standard Model can modify this prediction and introduce unique features into the spectral shape. In particular, we consider the possibility that black holes in binaries are charged under a new long-range force, and we find that pulsar timing arrays are capable of robustly detecting such forces. Supermassive black holes and their environments can acquire charge due to high-energy particle production or dark sector interactions, making the measurement of the spectral shape a powerful test of fundamental physics.

Submitted 18 October, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

Comments: 10 pages, 3 figures. Matched published version

Journal ref: Phys. Rev. D 104, 083021 (2021)

arXiv:2102.11916 [pdf, other]

Event Camera Based Real-Time Detection and Tracking of Indoor Ground Robots

Authors: Himanshu Patel, Craig Iaboni, Deepan Lobo, Ji-won Choi, Pramod Abichandani

Abstract: This paper presents a real-time method to detect and track multiple mobile ground robots using event cameras. The method uses density-based spatial clustering of applications with noise (DBSCAN) to detect the robots and a single k-dimensional ($k$-d) tree to accurately keep track of them as they move in an indoor arena. Robust detections and tracks are maintained in the face of event camera noise and lack of events (due to robots moving slowly or stopping). An off-the-shelf RGB camera-based tracking system was used to provide ground truth. Experiments including up to 4 robots are performed to study the effect of i) varying DBSCAN parameters, ii) the event accumulation time, iii) the number of robots in the arena, iv) the speed of the robots, and v) variation in ambient light conditions on the detection and tracking performance. The experimental results showed 100% detection and tracking fidelity in the face of event camera noise and robots stopping for tests involving up to 3 robots (and upwards of 93% for 4 robots). When the lighting conditions were varied, a graceful degradation in detection and tracking fidelity was observed.

Submitted 2 August, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

arXiv:2101.06949 [pdf, other]

HinFlair: pre-trained contextual string embeddings for pos tagging and text classification in the Hindi language

Authors: Harsh Patel

Abstract: Recent advancements in language models based on recurrent neural networks and the transformer architecture have achieved state-of-the-art results on a wide range of natural language processing tasks such as POS tagging, named entity recognition, and text classification. However, most of these language models are pre-trained in high resource languages like English, German, and Spanish. Multi-lingual language models include Indian languages like Hindi, Telugu, and Bengali in their training corpus, but they often fail to represent the linguistic features of these languages as they are not the primary language of the study. We introduce HinFlair, which is a language representation model (contextual string embeddings) pre-trained on a large monolingual Hindi corpus. Experiments were conducted on 6 text classification datasets and a Hindi dependency treebank to analyze the performance of these contextualized string embeddings for the Hindi language. Results show that HinFlair outperforms previous state-of-the-art publicly available pre-trained embeddings for downstream tasks like text classification and POS tagging. Also, HinFlair when combined with FastText embeddings outperforms many transformers-based language models trained particularly for the Hindi language.

Submitted 18 January, 2021; originally announced January 2021.

arXiv:2012.13065 [pdf, other]

doi 10.1088/1475-7516/2021/11/041

Comments on Axions, Domain Walls, and Cosmic Strings

Authors: Michael Dine, Nicolas Fernandez, Akshay Ghalsasi, Hiren H. Patel

Abstract: Axions have for some time been considered a plausible candidate for dark matter. They can be produced through misalignment, but it has been argued that when inflation occurs before a Peccei-Quinn transition, appreciable production can result from cosmic strings. This has been the subject of extensive simulations. But there are reasons to be skeptical about the possible role of axion strings. We review and elaborate on these questions, and provide several qualitative arguments that parametrically strings are already accounted for by the assumption of random misalignment angles. The arguments are based on considerations of the collective modes of the string solutions, on computations of axion radiation in particular models, and reviews of simulations.

Submitted 18 November, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

Comments: 31 pages, 3 figures; published version

Journal ref: JCAP 11 (2021) 041

arXiv:2012.05516 [pdf, other]

Explainable Link Prediction for Privacy-Preserving Contact Tracing

Authors: Balaji Ganesan, Hima Patel, Sameep Mehta

Abstract: Contact tracing has been used to identify people who were in close proximity to those infected with the SARS-CoV-2 coronavirus. A number of digital contact tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contact tracing applications, which make people reluctant to install or update their infection status on these applications. In this concept paper, we present ideas from Graph Neural Networks and explainability that could improve trust in these applications and encourage adoption by people.

Submitted 10 December, 2020; originally announced December 2020.

Comments: 8 pages, 7 figures, SpicyFL 2020 Workshop at NeurIPS 2020

arXiv:2011.07313 [pdf]

Classification of Reverse-Engineered Class Diagram and Forward-Engineered Class Diagram using Machine Learning

Authors: Kaushil Mangaroliya, Het Patel

Abstract: A UML class diagram is very important for visualizing the whole software we are working on, and it helps in understanding the whole system in the easiest way possible by showing the system's classes, their attributes, methods, and relations with other objects. In the real world, there are two types of class diagram that engineers work with, namely 1) the Forward Engineered Class Diagram (FwCD), which is hand-made as part of the forward-looking development process, and 2) the Reverse Engineered Class Diagram (RECD), which is reverse engineered from the source code. In the software industry, while working with new open software projects, it is important to know which type of class diagram is in use. Which UML diagram was used in a particular project is an important factor to know. To solve this problem, we propose to build a classifier that can classify a UML diagram into FwCD or RECD. We propose to solve this problem by using a supervised Machine Learning technique. The approach involves analyzing the features that are useful in classifying class diagrams. Different Machine Learning models are used in this process, and the Random Forest algorithm has proved to be the best of all. Performance testing was done on 999 class diagrams.

Submitted 14 November, 2020; originally announced November 2020.

arXiv:2011.03340 [pdf, ps, other]

doi 10.1038/s41598-021-86355-3

Testing Short Distance Anisotropy in Space

Authors: Robert B. Mann, Idrus Husin, Hrishikesh Patel, Mir Faizal, Anto Sulaksono, Agus Suroso

Abstract: The isotropy of space is not a logical requirement but rather is an empirical question; indeed there is suggestive evidence that the universe might be anisotropic. A plausible source of these anisotropies could be quantum gravity corrections. If these corrections happen to be between the electroweak scale and the Planck scale, then these anisotropies can have measurable consequences at short distances and their effects can be measured using ultra sensitive condensed matter systems. We investigate how such anisotropic quantum gravity corrections modify low energy physics through an anisotropic deformation of the Heisenberg algebra. We discuss how such anisotropies might be observed using a scanning tunneling microscope.

Submitted 5 July, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

Comments: 15 pages, 1 figure

Journal ref: Sci. Rep. 11, 1, 7474 (2021)

arXiv:2011.01504 [pdf, other]

BioNerFlair: biomedical named entity recognition using flair embedding and sequence tagger

Authors: Harsh Patel

Abstract: Motivation: The proliferation of Biomedical research articles has made the task of information retrieval more important than ever. Scientists and Researchers are having difficulty in finding articles that contain information relevant to them. Proper extraction of biomedical entities like Disease, Drug/chem, Species, Gene/protein, can considerably improve the filtering of articles resulting in better extraction of relevant information. Performance on BioNer benchmarks has progressively improved because of progression in transformers-based models like BERT, XLNet, OpenAI, GPT2, etc. These models give excellent results; however, they are computationally expensive and we can achieve better scores for domain-specific tasks using other contextual string-based models and LSTM-CRF based sequence tagger. Results: We introduce BioNerFlair, a method to train models for biomedical named entity recognition using Flair plus GloVe embeddings and Bidirectional LSTM-CRF based sequence tagger. With almost the same generic architecture widely used for named entity recognition, BioNerFlair outperforms previous state-of-the-art models. I performed experiments on 8 benchmark datasets for biomedical named entity recognition. Compared to current state-of-the-art models, BioNerFlair achieves the best F1-score of 90.17 beyond 84.72 on the BioCreative II gene mention (BC2GM) corpus, best F1-score of 94.03 beyond 92.36 on the BioCreative IV chemical and drug (BC4CHEMD) corpus, best F1-score of 88.73 beyond 78.58 on the JNLPBA corpus, best F1-score of 91.1 beyond 89.71 on the NCBI disease corpus, and best F1-score of 85.48 beyond 78.98 on the Species-800 corpus, while near-best results were observed on the BC5CDR-chem, BC5CDR-disease, and LINNAEUS corpora.

Submitted 3 November, 2020; originally announced November 2020.

arXiv:2010.07213 [pdf, other]

Data Readiness Report

Authors: Shazia Afzal, Rajmohan C, Manish Kesarwani, Sameep Mehta, Hima Patel

Abstract: Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which limits their reuse and results in loss of productivity. We introduce the concept of a Data Readiness Report as an accompanying documentation to a dataset that allows data consumers to get detailed insights into the quality of input data. Data characteristics and challenges on various quality dimensions are identified and documented keeping in mind the principles of transparency and explainability. The Data Readiness Report also serves as a record of all data assessment operations including applied transformations. This provides a detailed lineage for the purpose of data governance and management. In effect, the report captures and documents the actions taken by various personas in a data readiness and assessment workflow. Over time this becomes a repository of best practices and can potentially drive a recommendation system for building automated data readiness workflows on the lines of AutoML [8]. We anticipate that together with the Datasheets [9], Dataset Nutrition Label [11], FactSheets [1] and Model Cards [15], the Data Readiness Report makes significant progress towards Data and AI lifecycle documentation.

Submitted 15 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

arXiv:2009.14457 [pdf, other]

Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning

Authors: Subhojeet Pramanik, Shashank Mujumdar, Hima Patel

Abstract: Recent approaches in the literature have exploited the multi-modal information in documents (text, layout, image) to serve specific downstream document tasks. However, they are limited by (i) their inability to learn cross-modal representations across text, layout and image dimensions for documents and (ii) their inability to process multi-page documents. Pre-training techniques have been shown in the Natural Language Processing (NLP) domain to learn generic textual representations from large unlabelled datasets, applicable to various downstream NLP tasks. In this paper, we propose a multi-task learning-based framework that utilizes a combination of self-supervised and supervised pre-training tasks to learn a generic document representation applicable to various downstream document tasks. Specifically, we introduce Document Topic Modelling and Document Shuffle Prediction as novel pre-training tasks to learn rich image representations along with the text and layout representations for documents. We utilize the Longformer network architecture as the backbone to encode the multi-modal information from multi-page documents in an end-to-end fashion. We showcase the applicability of our pre-training framework on a variety of real-world document tasks such as document classification, document information extraction, and document retrieval. We evaluate our framework on different standard document datasets and conduct exhaustive experiments to compare performance against various ablations of our framework and state-of-the-art baselines.

Submitted 5 January, 2022; v1 submitted 30 September, 2020; originally announced September 2020.

arXiv:2009.12076 [pdf, other]

Assessing the Interplay between travel patterns and SARS-CoV-2 outbreak in realistic urban setting

Authors: Rohan Patil, Raviraj Dave, Harsh Patel, Viraj M Shah, Deep Chakrabarti, Udit Bhatia

Abstract: The dense social contact networks and high mobility in congested urban areas facilitate the rapid transmission of infectious diseases. Typical mechanistic epidemiological models are either based on uniform mixing with ad-hoc contact processes or need real-time or archived population mobility data to simulate the social networks. However, the rapid and global transmission of the novel coronavirus (SARS-CoV-2) has led to unprecedented lockdowns at global and regional scales, leaving the archived datasets of limited use. While it is often hypothesized that population density is a significant driver in disease propagation, the disparate disease trajectories and infection rates exhibited by different cities with comparable densities require a high-resolution description of the disease and its drivers. In this study, we explore the impact of the creation of containment zones on travel patterns within the city. Further, we use a dynamical network-based infectious disease model to understand the key drivers of disease spread at sub-kilometer scales, demonstrated in the city of Ahmedabad, India, which has been classified as a SARS-CoV-2 hotspot. We find that in addition to the contact network and population density, road connectivity patterns and ease of transit are strongly correlated with the rate of transmission of the disease. Given the limited access to real-time traffic data during lockdowns, we generate road connectivity networks using open-source imagery and travel patterns from open-source surveys and government reports. Within the proposed framework, we then analyze the relative merits of social distancing, enforced lockdowns, and enhanced testing and quarantining in mitigating the disease spread.

Submitted 25 September, 2020; originally announced September 2020.

arXiv:2009.04012 [pdf, other]

doi 10.1088/1475-7516/2021/07/024

Asymptotic analysis of the Boltzmann equation for dark matter relic abundance

Authors: Logan A. Morrison, Hiren H. Patel, Jaryd F. Ulbricht

Abstract: A solution to the Boltzmann equation governing the thermal relic abundance of cold dark matter is constructed by matched asymptotic approximations. The approximation of the relic density is an asymptotic series valid when the abundance does not deviate significantly from its equilibrium value until small temperatures. Resonance and threshold effects are taken into account at leading order and found to be negligible unless the annihilation cross section is negligible at threshold. Comparisons are made to previously attempted constructions and to the freeze-out approximation commonly employed in the literature. Extensions to higher-order matching are outlined, and implications for solving related systems are discussed. We compare our results to a numerical determination of the relic abundance using a benchmark model and find a fantastic agreement. The method developed also serves as a solution to a wide class of problems containing an infinite order turning point.

Submitted 8 September, 2020; originally announced September 2020.

Comments: 15 pages, 6 figures

Journal ref: JCAP07(2021)024

arXiv:2009.01258 [pdf, other]

doi 10.1103/PhysRevD.102.115042

Electron EDM in the complex two-Higgs doublet model

Authors: Wolfgang Altmannshofer, Stefania Gori, Nick Hamer, Hiren H. Patel

Abstract: We present the first complete two loop calculation of the electron EDM in the complex two-Higgs doublet model. We confirm gauge-independence by demonstrating analytic cancellation of the gauge parameter $ξ$ in the background field gauge and the 't Hooft $R_ξ$ gauge. We also investigate the behavior of the electron EDM near the decoupling limit, and determine the short- and long-distance contributions by matching onto an effective field theory. Compared with earlier studies of the electron EDM in the complex two-Higgs doublet model, we note disagreements in several places and provide diagnoses where possible. We also provide expressions for EDMs of light quarks.

Submitted 28 September, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

Comments: 18 pages, 14 figures v2: references added; comparison with 1311.4704 corrected; Mathematica notebook containing all relevant equations added as ancillary file

Journal ref: Phys. Rev. D 102, 115042 (2020)

arXiv:2006.04532 [pdf, other]

Detecting Problem Statements in Peer Assessments

Authors: Yunkai Xiao, Gabriel Zingle, Qinjin Jia, Harsh R. Shah, Yi Zhang, Tianyi Li, Mohsin Karovaliya, Weixiang Zhao, Yang Song, Jie Ji, Ashwin Balasubramaniam, Harshit Patel, Priyankha Bhalasubbramanian, Vikram Patel, Edward F. Gehringer

Abstract: Effective peer assessment requires students to be attentive to the deficiencies in the work they rate. Thus, their reviews should identify problems. But what ways are there to check that they do? We attempt to automate the process of deciding whether a review comment detects a problem. We use over 18,000 review comments that were labeled by the reviewees as either detecting or not detecting a problem with the work. We deploy several traditional machine-learning models, as well as neural-network models using GloVe and BERT embeddings. We find that the best performer is the Hierarchical Attention Network classifier, followed by the Bidirectional Gated Recurrent Units (GRU) Attention and Capsule model with scores of 93.1% and 90.5% respectively. The best non-neural network model was the support vector machine with a score of 89.71%. This is followed by the Stochastic Gradient Descent model and the Logistic Regression model with 89.70% and 88.98%.

Submitted 29 May, 2020; originally announced June 2020.

Comments: 8 pages, 9 images. Extended version of a paper published at EDM 2020, 13th International Conference on Educational Data Mining

ACM Class: I.2.7

arXiv:2003.04732 [pdf, other]

Link Prediction using Graph Neural Networks for Master Data Management

Authors: Balaji Ganesan, Srinivas Parkala, Neeraj R Singh, Sumit Bhatia, Gayatri Mishra, Matheen Ahmed Pasha, Hima Patel, Somashekar Naganna

Abstract: Learning graph representations of n-ary relational data has a number of real-world applications like anti-money laundering, fraud detection, and customer due diligence. Contact tracing of COVID-19 positive persons could also be posed as a Link Prediction problem. Predicting links between people using Graph Neural Networks requires more careful ethical and privacy considerations than in domains where GNNs have typically been applied so far. We introduce novel methods for anonymizing data, model training, explainability and verification for Link Prediction in Master Data Management, and discuss our results.

Submitted 28 August, 2020; v1 submitted 7 March, 2020; originally announced March 2020.

Comments: 10 pages, 11 figures

arXiv:2003.02924 [pdf, other]

doi 10.1142/S0217751X21501153

Probing Short Gravity using Temporal Lensing

Authors: Mir Faizal, Hrishikesh Patel

Abstract: It is known that probing gravity in the submillimeter-micrometer range is difficult due to the relative weakness of the gravitational force. We intend to overcome this challenge by using extreme temporal precision to monitor transient events in a gravitational field. We propose a compressed ultrafast photography system called T-CUP to serve this purpose. We show that the T-CUP's precision of 10 trillion frames per second can allow us to better resolve gravity at short distances. We also show the feasibility of the setup in measuring Yukawa and power-law corrections to gravity which have substantial theoretical motivation.

Submitted 29 June, 2021; v1 submitted 3 March, 2020; originally announced March 2020.

Comments: 15 pages, 5 figures

Journal ref: Int. J. Mod. Phys. A 36, 17, 2150115 (2021)

arXiv:2003.00200 [pdf, ps, other]

doi 10.1142/S0217751X21501025

Compactification, T-Duality and Quantum Erasers

Authors: Salman Sajad Wani, Dylan Sutherland, Behnam Pourhassan, Mir Faizal, Hrishikesh Patel

Abstract: Using T-duality, we will argue that a zero point length exists in the low energy effective field theory of string theory on compactified extra dimensions. Furthermore, if we neglect the oscillator modes, this zero point length would modify low energy quantum mechanical systems. As this zero point length is fixed geometrically, it is important to analyze how it modifies purely quantum mechanical effects. Thus, we will analyze its effects on quantum erasers, because they are based on quantum effects like entanglement. It will be observed that the behavior of these quantum erasers gets modified by this zero point length. As the zero point length is fixed by the radius of compactification, we argue that these results demonstrate a deeper connection between geometry and quantum effects.

Submitted 29 February, 2020; originally announced March 2020.

Comments: 14 pages, 4 figures

Journal ref: Int. J. Mod. Phys. A 36, 18, 2150102 (2021)

arXiv:2002.12449 [pdf, other]

Behavior of Cross Sections for Large Numbers of Particles

Authors: Michael Dine, Hiren H. Patel, Jaryd F. Ulbricht

Abstract: It has been suggested that scattering cross sections at very high energies for producing large numbers of Higgs particles may exhibit factorial growth, and that curing this growth might be relevant to other questions in the Standard Model. We point out, first, that the question is inherently non-perturbative; low orders in the formal perturbative expansion do not give a good approximation to the scattering amplitude for sufficiently large N for any fixed, small value of the coupling. Focusing on $λφ^{4}$ theory, we argue that there may be a systematic approximation scheme for processes where N particles near threshold scatter to produce N particles, and discuss the leading contributions to the scattering amplitude and cross sections in this limit. Scattering amplitudes do not grow as rapidly as in perturbation theory. Additionally, partial and total cross sections do not show factorial growth. In the case of cross sections for $2 \to N$ particles, there is no systematic large N approximation available. That said, we provide evidence that non-perturbatively, there is no factorial growth in partial or total cross sections.

Submitted 27 February, 2020; originally announced February 2020.

Comments: 22 pages, 3 figures

arXiv:2002.12393 [pdf, other]

Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings

Authors: Tarique Siddiqui, Alekh Jindal, Shi Qiao, Hiren Patel, Wangchao Le

Abstract: Query processing over big data is ubiquitous in modern clouds, where the system takes care of picking both the physical query execution plans and the resources needed to run those plans, using a cost-based query optimizer. A good cost model, therefore, is akin to better resource efficiency and lower operational costs. Unfortunately, the production workloads at Microsoft show that costs are very complex to model for big data systems. In this work, we investigate two key questions: (i) can we learn accurate cost models for big data systems, and (ii) can we integrate the learned models within the query optimizer. To answer these, we make three core contributions. First, we exploit workload patterns to learn a large number of individual cost models and combine them to achieve high accuracy and coverage over a long period. Second, we propose extensions to the Cascades framework to pick optimal resources, i.e., number of containers, during query planning. And third, we integrate the learned cost models within the Cascades-style query optimizer of SCOPE at Microsoft. We evaluate the resulting system, Cleo, in a production environment using both production and TPC-H workloads. Our results show that the learned cost models are 2 to 3 orders of magnitude more accurate, and 20X more correlated with the actual runtimes, with a large majority (70%) of the plan changes leading to substantial improvements in latency as well as resource usage.

Submitted 27 February, 2020; originally announced February 2020.

Comments: To appear at SIGMOD 2020

arXiv:2002.10943 [pdf, other]

Data Augmentation for Personal Knowledge Base Population

Authors: Lingraj S Vannur, Balaji Ganesan, Lokesh Nagalapatti, Hima Patel, MN Thippeswamy

Abstract: Cold start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the different tasks that are part of KBP, the overall F1 of the end-to-end system remains quite low. This problem is more acute in personal knowledge bases, which present additional challenges with regard to data protection, fairness and privacy. In this work, we present a system that uses rule based annotators and a graph neural network for missing link prediction, to populate a more complete, fair and diverse knowledge base from the TACRED dataset.

Submitted 18 August, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

Comments: 8 pages, 9 figures, 6 tables. under review. arXiv admin note: text overlap with arXiv:2001.08013

arXiv:2002.01400 [pdf, other]

doi 10.1007/JHEP05(2020)069

Implications for Electric Dipole Moments of a Leptoquark Scenario for the $B$-Physics Anomalies

Authors: Wolfgang Altmannshofer, Stefania Gori, Hiren H. Patel, Stefano Profumo, Douglas Tuckler

Abstract: Vector leptoquarks can address the lepton flavor universality anomalies in decays associated with the $b \to c \ell ν$ and $b \to s \ell \ell$ transitions, as observed in recent years. Generically, these leptoquarks yield new sources of CP violation. In this paper, we explore constraints and discovery potential for electric dipole moments (EDMs) in leptonic and hadronic systems. We provide the most generic expressions for dipole moments induced by vector leptoquarks at one loop. We find that $O(1)$ CP-violating phases in tau and muon couplings can lead to corresponding EDMs within reach of next-generation EDM experiments, and that existing bounds on the electron EDM already put stringent constraints on CP-violating electron couplings.

Submitted 17 February, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

Comments: 36 pages, 4 figures; muon EDM projection updated, references added, conclusions unchanged

arXiv:2001.08013 [pdf, other]

A Neural Architecture for Person Ontology population

Authors: Balaji Ganesan, Riddhiman Dasgupta, Akshay Parekh, Hima Patel, Berthold Reinwald

Abstract: A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, de-identification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual process, because it requires a fixed set of semantic relations between concepts. In this work, we present a system for automatically populating a person ontology graph from unstructured data using neural models for Entity Classification and Relation Extraction. We introduce a new dataset for these tasks and discuss our results.

Submitted 22 January, 2020; originally announced January 2020.

Comments: 6 pages, 10 figures. arXiv admin note: substantial text overlap with arXiv:1811.09368

arXiv:1912.08220 [pdf, other]

doi 10.1103/PhysRevLett.126.131801

Parity-Violating Møller Scattering at NNLO: Closed Fermion Loops

Authors: Yong Du, Ayres Freitas, Hiren H. Patel, Michael J. Ramsey-Musolf

Abstract: A complete, gauge-invariant computation of two loop virtual corrections involving closed fermion loops to the polarized Møller scattering asymmetry is presented. The set of contributions involving two closed fermion loops and the set involving one closed fermion loop are numerically similar in magnitude to the one-loop bosonic corrections and yield an overall correction of 1.3% relative to the tree-level asymmetry. We estimate sizes of remaining two-loop contributions and discuss implications for the upcoming MOLLER experiment.

Submitted 2 April, 2021; v1 submitted 17 December, 2019; originally announced December 2019.

Comments: 6 pages; v2: numerical error and few typos corrected, published version

Report number: ACFI-T19-15

Journal ref: Phys. Rev. Lett. 126, 131801 (2021)

arXiv:1912.05581 [pdf, other]

doi 10.1103/PhysRevD.101.095001

Loop Dominated Signals from Neutrino Portal Dark Matter

Authors: Hiren H. Patel, Stefano Profumo, Bibhushan Shakya

Abstract: We study scenarios where loop processes give the dominant contributions to dark matter decay or annihilation despite the presence of tree level channels. We illustrate this possibility in a specific model where dark matter is part of a hidden sector that communicates with the Standard Model sector via a heavy neutrino portal. We explain the underpinning rationale for how loop processes mediated by the portal neutrinos can parametrically dominate over tree level decay channels, and demonstrate that this qualitatively changes the indirect detection signals in positrons, neutrinos, and gamma rays.

Submitted 11 December, 2019; originally announced December 2019.

Comments: 7 pages, 2 figures

Report number: CERN-TH-2019-203

Journal ref: Phys. Rev. D 101, 095001 (2020)

arXiv:1911.03137 [pdf]

doi 10.1016/j.atmosenv.2020.117428

Hierarchical network design for nitrogen dioxide measurement in urban environments, part 1: proxy selection

Authors: Lena Weissert, Georgia Miskell, Elaine Miles, Kyle Alberti, Brandon Feenstra, Hamesh Patel, Vasileios Papapostolou, Andrea Polidori, Geoff S Henshaw, Jennifer A Salmond, David E Williams

Abstract: Previous studies have shown that a hierarchical network comprising a number of compliant reference stations and a much larger number of low-cost sensors can deliver reliable air quality data at high temporal and spatial resolution for ozone at neighbourhood scales. Key to this framework is the concept of a proxy: a reliable (regulatory) data source whose results have sufficient statistical similarity over some period of time to those from any given low-cost measurement site. This enables the low-cost instruments to be calibrated remotely, avoiding the need for costly on-site calibration of dense networks. This paper assesses the suitability of this method for local air pollutants such as nitrogen dioxide, which show large temporal and spatial variability in concentration. The proxy technique is evaluated using the data from the network of regulatory air monitoring stations measuring nitrogen dioxide in Southern California to avoid errors introduced by low-cost instrument performance. Proxies chosen based on land use similarity typically signalled less than 0.1 percent false alarms. Although poor proxy performance was observed when the local geography was unusual (a semi-enclosed valley), in this instance the closest neighbour station proved to be an appropriate alternative. The method also struggled when wind speeds were low and very local sources presumably dominated the concentration patterns. Overall, we demonstrate that the technique can be applied to nitrogen dioxide, and that appropriate proxies can be found even within a spatially sparse network of stations in a region with large spatio-temporal variation in concentration.

Submitted 8 November, 2019; originally announced November 2019.

Comments: 24 pages, 7 figures, supplementary information appended to main text

arXiv:1911.03136 [pdf]

Hierarchical network design for nitrogen dioxide measurement in urban environments, part 2: network-based sensor calibration

Authors: Lena Weissert, Elaine Miles, Georgia Miskell, Kyle Alberti, Brandon Feenstra, Geoff S Henshaw, Vasileios Papapostolou, Hamesh Patel, Andrea Polidori, Jennifer A Salmond, David E Williams

Abstract: We present a management and data correction framework for low-cost electrochemical sensors for nitrogen dioxide (NO2) deployed within a hierarchical network of low-cost and regulatory-grade instruments. The framework is founded on the idea that it is possible in a suitably configured network to identify a source of reliable proxy data for each sensor site that has a similar probability distribution of measurement values over a suitable time period. Previous work successfully applied these ideas to a sensor system with a simple linear 2-parameter (slope and offset) response. Applying these ideas to electrochemical sensors for NO2 presents significant additional difficulties for which we demonstrate solutions. The three NO2 sensor response parameters (offset, ozone (O3) response slope, and NO2 response slope) are known to vary significantly as a consequence of ambient humidity and temperature variations. Here we demonstrate that these response parameters can be estimated by minimising the Kullback-Leibler divergence between sensor-estimated and proxy NO2 distributions over a 3-day window. We then estimate an additional offset term by using co-location data. This offset term is dependent on climate and spatially correlated and can thus be projected across the network. Co-location data also estimates the time-, space- and concentration-dependent error distribution between sensors and regulatory-grade instruments. We show how the parameter variations can be used to indicate both sensor failure and failure of the proxy assumption. We apply the procedures to a network of 56 sensors distributed across the Inland Empire and Los Angeles County regions, demonstrating the need for reliable data from dense networks of monitors to supplement the existing regulatory networks.

Submitted 8 November, 2019; originally announced November 2019.

Comments: 40 pages inclusive of supporting information. 13 figures in main text; 15 figures in SI

arXiv:1911.03130 [pdf]

doi 10.1016/j.atmosenv.2020.117287

Low-cost sensor networks and land-use regression: interpolating nitrogen dioxide concentration at high temporal and spatial resolution in Southern California

Authors: Lena Weissert, Kyle Alberti, Elaine Miles, Georgia Miskell, Brandon Feenstra, Geoff S Henshaw, Vasileios Papapostolou, Hamesh Patel, Andrea Polidori, Jennifer A Salmond, David E Williams

Abstract: The development of low-cost sensors and novel calibration algorithms offers new opportunities to supplement existing regulatory networks to measure air pollutants at a high spatial resolution and at hourly and sub-hourly timescales. We use a random forest model on data from a network of low-cost sensors to describe the effect of land use features on local-scale air quality, extend this model to describe the hourly-scale variation of air quality at high spatial resolution, and show that deviations from the model can be used to identify particular conditions and locations where air quality differs from the expected land-use effect. The conditions and locations under which deviations were detected conform to expectations based on general experience.

Submitted 8 November, 2019; originally announced November 2019.

Comments: 26 pages, 11 figures

arXiv:1911.00681 [pdf, other]

Machine Translation Evaluation using Bi-directional Entailment

Authors: Rakesh Khobragade, Heaven Patel, Anand Namdev, Anish Mishra, Pushpak Bhattacharyya

Abstract: In this paper, we propose a new metric for Machine Translation (MT) evaluation, based on bi-directional entailment. We show that machine generated translation can be evaluated by determining paraphrasing with a reference translation provided by a human translator. We hypothesize, and show through experiments, that paraphrasing can be detected by evaluating entailment relationship in the forward and backward direction. Unlike conventional metrics, like BLEU or METEOR, our approach uses deep learning to determine the semantic similarity between candidate and reference translation for generating scores rather than relying upon simple n-gram overlap. We use BERT's pre-trained implementation of transformer networks, fine-tuned on MNLI corpus, for natural language inferencing. We apply our evaluation metric on WMT'14 and WMT'17 dataset to evaluate systems participating in the translation task and find that our metric has a better correlation with the human annotated score compared to the other traditional metrics at system level.

Submitted 2 November, 2019; originally announced November 2019.

arXiv:1909.02029 [pdf, other]

doi 10.1103/PhysRevD.100.095015

The Majoron at two loops

Authors: Julian Heeck, Hiren H. Patel

Abstract: We present singlet-Majoron couplings to Standard Model particles through two loops at leading order in the seesaw expansion, including couplings to gauge bosons as well as flavor-changing quark interactions. We discuss and compare the relevant phenomenological constraints on Majoron production as well as decaying Majoron dark matter. A comparison with standard seesaw observables in low-scale settings highlights the importance of searches for lepton-flavor-violating two-body decays $\ell \to \ell' +$Majoron in both the muon and tau sectors.

Submitted 14 November, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

Comments: 14 pages, matches PRD version

Report number: UCI-TR-2019-23

Journal ref: Phys. Rev. D 100, 095015 (2019)

arXiv:1909.00084 [pdf, other]

Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML

Authors: Ashvin Agrawal, Rony Chatterjee, Carlo Curino, Avrilia Floratou, Neha Gowdal, Matteo Interlandi, Alekh Jindal, Kostantinos Karanasos, Subru Krishnan, Brian Kroth, Jyoti Leeka, Kwanghyun Park, Hiren Patel, Olga Poppe, Fotis Psallidas, Raghu Ramakrishnan, Abhishek Roy, Karla Saur, Rathijit Sen, Markus Weimer, Travis Wright, Yiwen Zhu

Abstract: Machine learning (ML) has proven itself in high-value web applications such as search ranking and is emerging as a powerful tool in a much broader range of enterprise scenarios including voice recognition and conversational understanding for customer support, autotuning for videoconferencing, intelligent feedback loops in large-scale sysops, manufacturing and autonomous vehicle management, complex financial predictions, just to name a few. Meanwhile, as the value of data is increasingly recognized and monetized, concerns about securing valuable data and risks to individual privacy have been growing. Consequently, rigorous data management has emerged as a key requirement in enterprise settings. How will these trends (ML growing popularity, and stricter data governance) intersect? What are the unmet requirements for applying ML in enterprise settings? What are the technical challenges for the DB community to solve? In this paper, we present our vision of how ML and database systems are likely to come together, and early steps we take towards making this vision a reality.

Submitted 27 December, 2019; v1 submitted 30 August, 2019; originally announced September 2019.

arXiv:1908.07018 [pdf, other]

Tale of tails using rule augmented sequence labeling for event extraction

Authors: Ayush Maheshwari, Hrishikesh Patel, Nandan Rathod, Ritesh Kumar, Ganesh Ramakrishnan, Pushpak Bhattacharyya

Abstract: The problem of event extraction is a relatively difficult task for low resource languages due to the non-availability of sufficient annotated data. Moreover, the task becomes complex for tail (rarely occurring) labels wherein extremely less data is available. In this paper, we present a new dataset (InDEE-2019) in the disaster domain for multiple Indic languages, collected from news websites. Using this dataset, we evaluate several rule-based mechanisms to augment deep learning based models. We formulate our problem of event extraction as a sequence labeling task and perform extensive experiments to study and understand the effectiveness of different approaches. We further show that tail labels can be easily incorporated by creating new rules without the requirement of large annotated data.

Submitted 31 January, 2020; v1 submitted 19 August, 2019; originally announced August 2019.

Comments: 9 pages, 4 figures, 6 tables

Journal ref: StarAI Workshop at AAAI 2020

arXiv:1907.02783 [pdf, ps, other]

doi 10.1142/S0218271821501224

Proposed experimental test of Randall-Sundrum Models

Authors: Behnam Pourhassan, Anha Bhat, Hrishikesh Patel, Mir Faizal, Nicholas Mantella

Abstract: The Randall-Sundrum models are expected to modify the short distance behavior of general relativity. In this paper, we will propose an experimental test for this short distance modification due to Randall-Sundrum models. This will be done by analyzing motion of a particle which is moving in spherical gravitational field with a drag force. The position at which the particle stops will be different in general relativity and Randall-Sundrum model. This difference in the distance moved by the particle before stopping can be measured using a Nanoelectromechanical setup. Thus, it is possible to experimentally test Randall-Sundrum models using currently available technology.

Submitted 27 January, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

Journal ref: Int. J. Mod. Phys. D 31, 01, 2150122 (2022)

arXiv:1906.08421 [pdf]

doi 10.1016/j.atmosenv.2019.116870

Reliable data from low cost ozone sensors in a hierarchical network

Authors: Georgia Miskell, Kyle Alberti, Brandon Feenstra, Geoff S Henshaw, Vasileios Papapostolou, Hamesh Patel, Andrea Polidori, Jennifer A Salmond, Lena Weissert, David E Williams

Abstract: We demonstrate how a hierarchical network comprising a number of compliant reference stations and a much larger number of low-cost sensors can deliver reliable high temporal-resolution ozone data at neighbourhood scales. The framework, demonstrated originally for a smaller scale regional network deployed in the Lower Fraser Valley, BC was tested and refined using two much more extensive networks of gas-sensitive semiconductor-based (GSS) sensors deployed at neighbourhood scales in Los Angeles: one of ~20 and one of ~45 GSS ozone sensors. Of these, ten sensors were co-located with different regulatory measurement stations, allowing a rigorous test of the accuracy of the algorithms used for off-site calibration and adjustment of low cost sensors. The method is based on adjusting the gain and offset of the low-cost sensor to match the first two moments of the probability distribution of the sensor result to that of a proxy: a calibrated independent measurement (usually derived from regulatory monitors) whose probability distribution evaluated over a time that emphasizes diurnal variations is similar to that at the test location. The regulatory measurement station physically closest to the low-cost sensor was a good proxy for most sites. The algorithms developed were successful in detecting and correcting sensor drift, and in identifying locations where geographical features resulted in significantly different patterns of ozone variation due to the relative dominance of different dispersion, emission and chemical processes.
The entire network results show very large variations in ozone concentration that take place on short time- and distance scales across the Los-Angeles region. Such patterns were not captured by the more sparsely distributed stations of the existing regulatory network and demonstrate the need for reliable data from dense networks of monitors.

Submitted 19 June, 2019; originally announced June 2019.

Comments: 28 pages, 12 figures, Supplementary information appended has 14 pages

MSC Class: 62P12

Journal ref: Atmospheric Environment 214 (2019) 116870

arXiv:1906.04245 [pdf, other]

doi 10.1103/PhysRevC.104.024314

Decay spectroscopy of $^{50}$Sc and $^{50m}$Sc to $^{50}$Ti

Authors: M. Bowry, C. E. Jones, A. B. Garnsworthy, G. C. Ball, S. Cruz, S. Georges, G. Hackman, J. D. Holt, J. Measures, B. Olaizola, H. P. Patel, C. J. Pearson, C. E. Svensson

Abstract: The $β$ decay of the isomeric and ground state of $^{50}$Sc to the semi-magic nucleus $^{50}_{22}$Ti$_{28}$ has been studied using a $^{50}$Ca beam delivered to the GRIFFIN $γ$-ray spectrometer at the TRIUMF-ISAC facility. $β$-decay branching ratios are reported to 16 excited states with a total of 38 $γ$-ray transitions linking them. These new data significantly expands the information available over previous studies. Relative intensities are measured to less than 0.001$\%$ that of the strongest transition with the majority of $γ$-ray transitions observed here in $β$ decay for the first time. The data are compared to shell-model calculations utilizing both phenomenologically-derived interactions employed in the ${\it pf}$ shell as well as a state-of-the-art, ${\it ab~initio}$ based interaction built in the valence-space in-medium similarity renormalization group framework.

Submitted 13 August, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

Comments: Regular article, nuclear beta decay, gamma-ray spectroscopy, ab initio nuclear model, 12 pages, 5 figures, 3 tables

Journal ref: Phys. Rev. C 104, 024314 (2021)

arXiv:1905.02196 [pdf, other]

doi 10.1145/3306618.3314263

Mapping Missing Population in Rural India: A Deep Learning Approach with Satellite Imagery

Authors: Wenjie Hu, Jay Harshadbhai Patel, Zoe-Alanah Robert, Paul Novosad, Samuel Asher, Zhongyi Tang, Marshall Burke, David Lobell, Stefano Ermon

Abstract: Millions of people worldwide are absent from their country's census. Accurate, current, and granular population metrics are critical to improving government allocation of resources, to measuring disease control, to responding to natural disasters, and to studying any aspect of human life in these communities. Satellite imagery can provide sufficient information to build a population map without the cost and time of a government census. We present two Convolutional Neural Network (CNN) architectures which efficiently and effectively combine satellite imagery inputs from multiple sources to accurately predict the population density of a region. In this paper, we use satellite imagery from rural villages in India and population labels from the 2011 SECC census. Our best model achieves better performance than previous papers as well as LandScan, a community standard for global population distribution.

Submitted 4 May, 2019; originally announced May 2019.

Comments: 7 pages

ACM Class: I.2.10; I.2.6; J.2; J.4

Journal ref: AAAI/ACM Conference on AI, Ethics, and Society (AIES '19), January 27-28, 2019, Honolulu, HI, USA

arXiv:1903.01985 [pdf]

doi 10.1109/PVSC.2018.8547869

Detailed Performance Loss Analysis of Silicon Solar Cells using High-Throughput Metrology Methods

Authors: Mohammad Jobayer Hossain, Geoffrey Gregory, Hardik Patel, Siyu Guo, Eric J. Schneller, Andrew M. Gabor, Zhihao Yang, Adrienne L. Blum, Kristopher O. Davis

Abstract: In this work, novel, high-throughput metrology methods are used to perform a detailed performance loss analysis of approximately 400 industrial crystalline silicon solar cells, all coming from the same production line. The characterization sequence includes a non-destructive transfer length method (TLM) measurement technique featuring circular TLM structures hidden within the busbar region of the cells. It also includes a very fast external quantum efficiency and reflectance measurement technique. More traditional measurements, like illuminated current-voltage, Suns-VOC, and photoluminescence imaging are also used to carry out the loss analysis. The variance of the individual loss parameters and their impact on cell performance are investigated and quantified for this large group of industrial solar cells. Some important correlations between the measured loss parameters are found. The nature of these distributions and correlations provide important insights about loss mechanisms in a cell and help prioritize efforts to optimize the performance of the production line.

Submitted 26 February, 2019; originally announced March 2019.

Comments: 5 pages, 6 figures, conference

arXiv:1812.05308 [pdf, other]

FDFNet : A Secure Cancelable Deep Finger Dorsal Template Generation Network Secured via. Bio-Hashing

Authors: Avantika Singh, Ashish Arora, Shreya Hasmukh Patel, Gaurav Jaswal, Aditya Nigam

Abstract: Present world has already been consistently exploring the fine edges of online and digital world by imposing multiple challenging problems/scenarios. Similar to physical world, personal identity management is very crucial in-order to provide any secure online system. Last decade has seen a lot of work in this area using biometrics such as face, fingerprint, iris etc. Still there exist several vulnerabilities and one should have to address the problem of compromised biometrics much more seriously, since they cannot be modified easily once compromised. In this work, we have proposed a secure cancelable finger dorsal template generation network (learning domain specific features) secured via. Bio-Hashing. Proposed system effectively protects the original finger dorsal images by withdrawing compromised template and reassigning the new one. A novel Finger-Dorsal Feature Extraction Net (FDFNet) has been proposed for extracting the discriminative features. This network is exclusively trained on trait specific features without using any kind of pre-trained architecture. Later Bio-Hashing, a technique based on assigning a tokenized random number to each user, has been used to hash the features extracted from FDFNet. To test the performance of the proposed architecture, we have tested it over two benchmark public finger knuckle datasets: PolyU FKP and PolyU Contactless FKI. The experimental results shows the effectiveness of the proposed system in terms of security and accuracy.

Submitted 13 December, 2018; originally announced December 2018.

Comments: Accepted in ISBA 2019: International Conference on Identity, Security and Behavior Analysis

arXiv:1809.08473 [pdf, other]

doi 10.1109/ICSESS.2018.8663851

Split-Scale: Scaling Bitcoin by Partitioning the UTXO Space

Authors: Kazım Rıfat Özyılmaz, Harsh Patel, Ankit Malik

Abstract: The Bitcoin protocol is a significant milestone in the history of money. However, its adoption is currently constrained by the transaction limits of the system. As the chief problem of blockchain technology, the scaling issue has attracted many valuable solutions both on-chain and off-chain. In this paper, our goal is to explore the notion of unspent transaction outputs (UTXOs) to propose an augmented Bitcoin protocol that can scale gracefully. Our proposal aims to increase the transaction throughput by partitioning the UTXO space and splitting the blockchain. In addition, a new type of Bitcoin node is introduced to preserve the capability to run validating nodes in low-bandwidth environments, despite the increased transaction throughput.

Submitted 18 January, 2019; v1 submitted 22 September, 2018; originally announced September 2018.

Comments: Accepted for publication in 9th IEEE International Conference on Software Engineering and Service Science (ICSESS 2018) on 09.07.2018 - published version may differ

arXiv:1809.07183 [pdf, other]

doi 10.1016/j.nima.2018.11.115

The GRIFFIN Facility for Decay-Spectroscopy Studies at TRIUMF-ISAC

Authors: A. B. Garnsworthy, C. E. Svensson, M. Bowry, R. Dunlop, A. D. MacLean, B. Olaizola, J. K. Smith, F. A. Ali, C. Andreoiu, J. E. Ash, W. H. Ashfield, G. C. Ball, T. Ballast, C. Bartlett, Z. Beadle, P. C. Bender, N. Bernier, S. S. Bhattacharjee, H. Bidaman, V. Bildstein, D. Bishop, P. Boubel, R. Braid, D. Brennan, T. Bruhn, et al. (79 additional authors not shown)

Abstract: Gamma-Ray Infrastructure For Fundamental Investigations of Nuclei, GRIFFIN, is a new high-efficiency $γ$-ray spectrometer designed for use in decay spectroscopy experiments with low-energy radioactive ion beams provided by TRIUMF's Isotope Separator and Accelerator (ISAC-I) facility. GRIFFIN is composed of sixteen Compton-suppressed large-volume clover-type high-purity germanium (HPGe) $γ$-ray detectors combined with a suite of ancillary detection systems and coupled to a custom digital data acquisition system. The infrastructure and detectors of the spectrometer as well as the performance characteristics and the analysis techniques applied to the experimental data are described.

Submitted 6 December, 2018; v1 submitted 17 September, 2018; originally announced September 2018.

arXiv:1808.07615 [pdf, ps, other]

doi 10.1103/PhysRevD.98.076008

Two-loop effective potential for generalized gauge fixing

Authors: Stephen P. Martin, Hiren H. Patel

Abstract: We obtain the two-loop effective potential for general renormalizable theories, using a generalized gauge-fixing scheme that includes as special cases the background-field $R_ξ$ gauges, the Fermi gauges, and the familiar Landau gauge, and using dimensional regularization in the bare and $\overline{\mathrm{MS}}$ renormalization schemes. As examples, the results are then specialized to the Abelian Higgs model and to the Standard Model. In the case of the Standard Model, we study how the vacuum expectation value and the minimum vacuum energy depend numerically on the gauge-fixing parameters. The results at fixed two-loop order exhibit non-convergent behavior for sufficiently large gauge-fixing parameters; this can presumably be addressed by a resummation of higher-order contributions.

Submitted 22 August, 2018; originally announced August 2018.

Comments: 53 pages

Journal ref: Phys. Rev. D 98, 076008 (2018)

arXiv:1807.10197 [pdf, other]

doi 10.1103/PhysRevLett.121.241804

Reduced hadronic uncertainty in the determination of $V_{ud}$

Authors: Chien-Yeah Seng, Mikhail Gorchtein, Hiren H. Patel, Michael J. Ramsey-Musolf

Abstract: We analyze the universal radiative correction $Δ_R^V$ to neutron and superallowed nuclear $β$ decay by expressing the hadronic $γW$-box contribution in terms of a dispersion relation, which we identify as an integral over the first Nachtmann moment of the $γW$ interference structure function $F_3^{(0)}$. By connecting the needed input to existing data on neutrino and antineutrino scattering, we obtain an updated value of $Δ_R^V = 0.02467(22)$, wherein the hadronic uncertainty is reduced. Assuming other Standard Model theoretical calculations and experimental measurements remain unchanged, we obtain an updated value of $|V_{ud}| = 0.97366(15)$, raising tension with the first row CKM unitarity constraint. We comment on ways current and future experiments can provide input to our dispersive analysis.

Submitted 16 August, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

Comments: 5 pages, 5 figures, references updated; version submitted to PRL

Journal ref: Phys. Rev. Lett. 121, 241804 (2018)

Showing 51–100 of 135 results for author: Patel, H