-
Phoebe: A Learning-based Checkpoint Optimizer
Authors:
Yiwen Zhu,
Matteo Interlandi,
Abhishek Roy,
Krishnadhan Das,
Hiren Patel,
Malay Bag,
Hitesh Sharma,
Alekh **dal
Abstract:
Easy-to-use programming interfaces paired with cloud-scale processing engines have enabled big data system users to author arbitrarily complex analytical jobs over massive volumes of data. However, as the complexity and scale of analytical jobs increase, they encounter a number of unforeseen problems, hotspots with large intermediate data on temporary storage, longer job recovery time after failur…
▽ More
Easy-to-use programming interfaces paired with cloud-scale processing engines have enabled big data system users to author arbitrarily complex analytical jobs over massive volumes of data. However, as the complexity and scale of analytical jobs increase, they encounter a number of unforeseen problems, hotspots with large intermediate data on temporary storage, longer job recovery time after failures, and worse query optimizer estimates being examples of issues that we are facing at Microsoft.
To address these issues, we propose Phoebe, an efficient learning-based checkpoint optimizer. Given a set of constraints and an objective function at compile-time, Phoebe is able to determine the decomposition of job plans, and the optimal set of checkpoints to preserve their outputs to durable global storage. Phoebe consists of three machine learning predictors and one optimization module. For each stage of a job, Phoebe makes accurate predictions for: (1) the execution time, (2) the output size, and (3) the start/end time taking into account the inter-stage dependencies. Using these predictions, we formulate checkpoint optimization as an integer programming problem and propose a scalable heuristic algorithm that meets the latency requirement of the production environment.
We demonstrate the effectiveness of Phoebe in production workloads, and show that we can free the temporary storage on hotspots by more than 70% and restart failed jobs 68% faster on average with minimum performance impact. Phoebe also illustrates that adding multiple sets of checkpoints is not cost-efficient, which dramatically reduces the complexity of the optimization.
△ Less
Submitted 5 October, 2021;
originally announced October 2021.
-
Prediction of IPL Match Outcome Using Machine Learning Techniques
Authors:
Srikantaiah K C,
Aryan Khetan,
Baibhav Kumar,
Divy Tolani,
Harshal Patel
Abstract:
India's most popular sport is cricket and is played across all over the nation in different formats like T20, ODI, and Test. The Indian Premier League (IPL) is a national cricket match where players are drawn from regional teams of India, National Team and also from international team. Many factors like live streaming, radio, TV broadcast made this league as popular among cricket fans. The predict…
▽ More
India's most popular sport is cricket and is played across all over the nation in different formats like T20, ODI, and Test. The Indian Premier League (IPL) is a national cricket match where players are drawn from regional teams of India, National Team and also from international team. Many factors like live streaming, radio, TV broadcast made this league as popular among cricket fans. The prediction of the outcome of the IPL matches is very important for online traders and sponsors. We can predict the match between two teams based on various factors like team composition, batting and bowling averages of each player in the team, and the team's success in their previous matches, in addition to traditional factors such as toss, venue, and day-night, the probability of winning by batting first at a specified match venue against a specific team. In this paper, we have proposed a model for predicting outcome of the IPL matches using Machine learning Algorithms namely SVM, Random Forest Classifier (RFC), Logistic Regression and K-Nearest Neighbor. Experimental results showed that the Random Forest algorithm outperforms other algorithms with an accuracy of 88.10%.
△ Less
Submitted 30 September, 2021;
originally announced October 2021.
-
Do Minimal Parity Solutions to the Strong CP Problem Work?
Authors:
Jordy de Vries,
Patrick Draper,
Hiren H. Patel
Abstract:
One class of solutions to the strong CP problem relies on generalized parity symmetries. A minimal model of this type, constructed by Babu and Mohapatra and based on a softly broken parity symmetry, has the remarkable property that effective QCD vacuum angle $\barθ$ vanishes up to one-loop order. We compute the leading two-loop contributions to $\barθ$ in this model and estimate subleading contrib…
▽ More
One class of solutions to the strong CP problem relies on generalized parity symmetries. A minimal model of this type, constructed by Babu and Mohapatra and based on a softly broken parity symmetry, has the remarkable property that effective QCD vacuum angle $\barθ$ vanishes up to one-loop order. We compute the leading two-loop contributions to $\barθ$ in this model and estimate subleading contributions. In contrast to previous estimates, we argue that $\bar θ$ is not suppressed by the weak scale, and we find contributions of order $10^{-3}$-$10^{-2}$ multiplying unknown mixing angles and phases. Thus the model does not generically address the strong CP problem, but it might be made consistent with $\barθ<10^{-10}$ in some corners of parameter space. For such non-generic parameters, $\barθ$ is still likely to be just below present bounds, and therefore provides the dominant source of hadronic EDMs. We discuss the resulting EDM phenomenology.
△ Less
Submitted 3 September, 2021;
originally announced September 2021.
-
Image Inpainting using Partial Convolution
Authors:
Harsh Patel,
Amey Kulkarni,
Shivam Sahni,
Udit Vyas
Abstract:
Image Inpainting is one of the very popular tasks in the field of image processing with broad applications in computer vision. In various practical applications, images are often deteriorated by noise due to the presence of corrupted, lost, or undesirable information. There have been various restoration techniques used in the past with both classical and deep learning approaches for handling such…
▽ More
Image Inpainting is one of the very popular tasks in the field of image processing with broad applications in computer vision. In various practical applications, images are often deteriorated by noise due to the presence of corrupted, lost, or undesirable information. There have been various restoration techniques used in the past with both classical and deep learning approaches for handling such issues. Some traditional methods include image restoration by filling gap pixels using the nearby known pixels or using the moving average over the same. The aim of this paper is to perform image inpainting using robust deep learning methods that use partial convolution layers.
△ Less
Submitted 19 August, 2021;
originally announced August 2021.
-
Trust as a Metric for Resiliency in Signed Social Networks
Authors:
Harsh Patel,
Shivam Sahni,
Pushkar Mujumdar
Abstract:
Recent technological advancements have resulted in a surge in online trading, raising severe concerns about theft and fraud, especially on platforms like Bitcoin OTC (over-the-counter), where users' identities remain anonymous. To mitigate the risk, it has become essential to capture the reputation of users based on their trade histories. The who-trusts-whom signed network of people has the capabi…
▽ More
Recent technological advancements have resulted in a surge in online trading, raising severe concerns about theft and fraud, especially on platforms like Bitcoin OTC (over-the-counter), where users' identities remain anonymous. To mitigate the risk, it has become essential to capture the reputation of users based on their trade histories. The who-trusts-whom signed network of people has the capability to reflect the nature of such positive and negative relations between the users. It can be used to analyze linkage patterns, strength, and resiliency of such platforms. Due to the dynamic nature of trust between individuals, these trust networks are often vulnerable to link or node failures, making it critical to understand the stability of such systems. In this paper, we consider the problem of quantifying the resiliency of signed networks with the help of trustworthy community structures. We propose a metric for computing the Trustworthiness of a community structure. Using the trustworthiness scores of all communities structures, we generate a pipeline for assessing the resiliency of a signed network. We also show how these generated resiliency scores are concordant with the true nature of the network.
△ Less
Submitted 19 August, 2021;
originally announced August 2021.
-
Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets
Authors:
Nitin Gupta,
Hima Patel,
Shazia Afzal,
Naveen Panwar,
Ruhi Sharma Mittal,
Shanmukha Guttula,
Abhinav Jain,
Lokesh Nagalapatti,
Sameep Mehta,
Sandeep Hans,
Pranay Lohia,
Aniya Aggarwal,
Diptikalyan Saha
Abstract:
The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks. However these techniques are not applicable to detect data issues in the context of machine learning tasks, like noisy labels, existence of overlap** classes…
▽ More
The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks. However these techniques are not applicable to detect data issues in the context of machine learning tasks, like noisy labels, existence of overlap** classes etc. We attempt to re-look at the data quality issues in the context of building a machine learning pipeline and build a tool that can detect, explain and remediate issues in the data, and systematically and automatically capture all the changes applied to the data. We introduce the Data Quality Toolkit for machine learning as a library of some key quality metrics and relevant remediation techniques to analyze and enhance the readiness of structured training datasets for machine learning projects. The toolkit can reduce the turn-around times of data preparation pipelines and streamline the data quality assessment process. Our toolkit is publicly available via IBM API Hub [1] platform, any developer can assess the data quality using the IBM's Data Quality for AI apis [2]. Detailed tutorials are also available on IBM Learning Path [3].
△ Less
Submitted 5 September, 2021; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Optimal Resource Allocation for Serverless Queries
Authors:
Anish Pimpley,
Shuo Li,
Anubha Srivastava,
Vishal Rohra,
Yi Zhu,
Soundararajan Srinivasan,
Alekh **dal,
Hiren Patel,
Shi Qiao,
Rathijit Sen
Abstract:
Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-data services. At the same time, it is incredibly hard for users to allocate resources per query in serverless processing systems, and they frequently misallocate by orders of magnitude. Unfortunately, prior work focused on predicting peak allocation while ignoring aggressive trade-offs between resource al…
▽ More
Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-data services. At the same time, it is incredibly hard for users to allocate resources per query in serverless processing systems, and they frequently misallocate by orders of magnitude. Unfortunately, prior work focused on predicting peak allocation while ignoring aggressive trade-offs between resource allocation and run-time. Additionally, these methods fail to predict allocation for queries that have not been observed in the past. In this paper, we tackle both these problems. We introduce a system for optimal resource allocation that can predict performance with aggressive trade-offs, for both new and past observed queries. We introduce the notion of a performance characteristic curve (PCC) as a parameterized representation that can compactly capture the relationship between resources and performance. To tackle training data sparsity, we introduce a novel data augmentation technique to efficiently synthesize the entire PCC using a single run of the query. Lastly, we demonstrate the advantages of a constrained loss function coupled with GNNs, over traditional ML methods, for capturing the domain specific behavior through an extensive experimental evaluation over SCOPE big data workloads at Microsoft.
△ Less
Submitted 18 July, 2021;
originally announced July 2021.
-
Construction of Quantum Target Space from World-Sheet States using Quantum State Tomography
Authors:
Salman Sajad Wani,
Arshid Shabir,
Junaid Ul Hassan,
S. Kannan,
Hrishikesh Patel,
C. Sudheesh,
Mir Faizal
Abstract:
In this paper, we will construct the quantum states of target space coordinates from world-sheet states, using quantum state tomography. To perform quantum state tomography of an open string, we will construct suitable quadrature operators. We do this by first defining the quadrature operators in world-sheet, and then using them to construct the quantum target space quadrature operators for an ope…
▽ More
In this paper, we will construct the quantum states of target space coordinates from world-sheet states, using quantum state tomography. To perform quantum state tomography of an open string, we will construct suitable quadrature operators. We do this by first defining the quadrature operators in world-sheet, and then using them to construct the quantum target space quadrature operators for an open string. We will connect the quantum target space to classical geometry using coherent string states. We will be using a novel construction based on a string displacement operator to construct these coherent states. The coherent states of the world-sheet will also be used to construct the coherent states in target space.
△ Less
Submitted 29 June, 2021;
originally announced June 2021.
-
Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report
Authors:
Andrey Ignatov,
Cheng-Ming Chiang,
Hsien-Kai Kuo,
Anastasia Sycheva,
Radu Timofte,
Min-Hung Chen,
Man-Yu Lee,
Yu-Syuan Xu,
Yu Tseng,
Shusong Xu,
** Guo,
Chao-Hung Chen,
Ming-Chun Hsyu,
Wen-Chia Tsai,
Chao-Wei Chen,
Grigory Malivenko,
Minsu Kwon,
Myungje Lee,
Jaeyoon Yoo,
Changbeom Kang,
Shinjo Wang,
Zheng Shaolong,
Hao Dejun,
Xie Fen,
Feng Zhuang
, et al. (16 additional authors not shown)
Abstract:
As the quality of mobile cameras starts to play a crucial role in modern smartphones, more and more attention is now being paid to ISP algorithms used to improve various perceptual aspects of mobile photos. In this Mobile AI challenge, the target was to develop an end-to-end deep learning-based image signal processing (ISP) pipeline that can replace classical hand-crafted ISPs and achieve nearly r…
▽ More
As the quality of mobile cameras starts to play a crucial role in modern smartphones, more and more attention is now being paid to ISP algorithms used to improve various perceptual aspects of mobile photos. In this Mobile AI challenge, the target was to develop an end-to-end deep learning-based image signal processing (ISP) pipeline that can replace classical hand-crafted ISPs and achieve nearly real-time performance on smartphone NPUs. For this, the participants were provided with a novel learned ISP dataset consisting of RAW-RGB image pairs captured with the Sony IMX586 Quad Bayer mobile sensor and a professional 102-megapixel medium format camera. The runtime of all models was evaluated on the MediaTek Dimensity 1000+ platform with a dedicated AI processing unit capable of accelerating both floating-point and quantized neural networks. The proposed solutions are fully compatible with the above NPU and are capable of processing Full HD photos under 60-100 milliseconds while achieving high fidelity results. A detailed description of all models developed in this challenge is provided in this paper.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
Discovering new forces with gravitational waves from supermassive black holes
Authors:
Jeff A. Dror,
Benjamin V. Lehmann,
Hiren H. Patel,
Stefano Profumo
Abstract:
Supermassive black hole binary mergers generate a stochastic gravitational wave background detectable by pulsar timing arrays. While the amplitude of this background is subject to significant uncertainties, the frequency dependence is a robust prediction of general relativity. We show that the effects of new forces beyond the Standard Model can modify this prediction and introduce unique features…
▽ More
Supermassive black hole binary mergers generate a stochastic gravitational wave background detectable by pulsar timing arrays. While the amplitude of this background is subject to significant uncertainties, the frequency dependence is a robust prediction of general relativity. We show that the effects of new forces beyond the Standard Model can modify this prediction and introduce unique features into the spectral shape. In particular, we consider the possibility that black holes in binaries are charged under a new long-range force, and we find that pulsar timing arrays are capable of robustly detecting such forces. Supermassive black holes and their environments can acquire charge due to high-energy particle production or dark sector interactions, making the measurement of the spectral shape a powerful test of fundamental physics.
△ Less
Submitted 18 October, 2021; v1 submitted 10 May, 2021;
originally announced May 2021.
-
Event Camera Based Real-Time Detection and Tracking of Indoor Ground Robots
Authors:
Himanshu Patel,
Craig Iaboni,
Deepan Lobo,
Ji-won Choi,
Pramod Abichandani
Abstract:
This paper presents a real-time method to detect and track multiple mobile ground robots using event cameras. The method uses density-based spatial clustering of applications with noise (DBSCAN) to detect the robots and a single k-dimensional ($k - d$) tree to accurately keep track of them as they move in an indoor arena. Robust detections and tracks are maintained in the face of event camera nois…
▽ More
This paper presents a real-time method to detect and track multiple mobile ground robots using event cameras. The method uses density-based spatial clustering of applications with noise (DBSCAN) to detect the robots and a single k-dimensional ($k - d$) tree to accurately keep track of them as they move in an indoor arena. Robust detections and tracks are maintained in the face of event camera noise and lack of events (due to robots moving slowly or stop**). An off-the-shelf RGB camera-based tracking system was used to provide ground truth. Experiments including up to 4 robots are performed to study the effect of i) varying DBSCAN parameters, ii) the event accumulation time, iii) the number of robots in the arena, iv) the speed of the robots, and v) variation in ambient light conditions on the detection and tracking performance. The experimental results showed 100% detection and tracking fidelity in the face of event camera noise and robots stop** for tests involving up to 3 robots (and upwards of 93% for 4 robots). When the lighting conditions were varied, a graceful degradation in detection and tracking fidelity was observed.
△ Less
Submitted 2 August, 2021; v1 submitted 23 February, 2021;
originally announced February 2021.
-
HinFlair: pre-trained contextual string embeddings for pos tagging and text classification in the Hindi language
Authors:
Harsh Patel
Abstract:
Recent advancements in language models based on recurrent neural networks and transformers architecture have achieved state-of-the-art results on a wide range of natural language processing tasks such as pos tagging, named entity recognition, and text classification. However, most of these language models are pre-trained in high resource languages like English, German, Spanish. Multi-lingual langu…
▽ More
Recent advancements in language models based on recurrent neural networks and transformers architecture have achieved state-of-the-art results on a wide range of natural language processing tasks such as pos tagging, named entity recognition, and text classification. However, most of these language models are pre-trained in high resource languages like English, German, Spanish. Multi-lingual language models include Indian languages like Hindi, Telugu, Bengali in their training corpus, but they often fail to represent the linguistic features of these languages as they are not the primary language of the study. We introduce HinFlair, which is a language representation model (contextual string embeddings) pre-trained on a large monolingual Hindi corpus. Experiments were conducted on 6 text classification datasets and a Hindi dependency treebank to analyze the performance of these contextualized string embeddings for the Hindi language. Results show that HinFlair outperforms previous state-of-the-art publicly available pre-trained embeddings for downstream tasks like text classification and pos tagging. Also, HinFlair when combined with FastText embeddings outperforms many transformers-based language models trained particularly for the Hindi language.
△ Less
Submitted 18 January, 2021;
originally announced January 2021.
-
Comments on Axions, Domain Walls, and Cosmic Strings
Authors:
Michael Dine,
Nicolas Fernandez,
Akshay Ghalsasi,
Hiren H. Patel
Abstract:
Axions have for some time been considered a plausible candidate for dark matter. They can be produced through misalignment, but it has been argued that when inflation occurs before a Peccei-Quinn transition, appreciable production can result from cosmic strings. This has been the subject of extensive simulations. But there are reasons to be skeptical about the possible role of axion strings. We re…
▽ More
Axions have for some time been considered a plausible candidate for dark matter. They can be produced through misalignment, but it has been argued that when inflation occurs before a Peccei-Quinn transition, appreciable production can result from cosmic strings. This has been the subject of extensive simulations. But there are reasons to be skeptical about the possible role of axion strings. We review and elaborate on these questions, and argue that parametrically strings are already accounted for by the assumption of random misalignment angles. We review and elaborate on these questions, and provide several qualitative arguments that parametrically strings are already accounted for by the assumption of random misalignment angles. The arguments are base on considerations of the collective modes of the string solutions, on computations of axion radiation in particular models, and reviews of simulations.
△ Less
Submitted 18 November, 2021; v1 submitted 23 December, 2020;
originally announced December 2020.
-
Explainable Link Prediction for Privacy-Preserving Contact Tracing
Authors:
Balaji Ganesan,
Hima Patel,
Sameep Mehta
Abstract:
Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contract tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contract tracing applications, which make people reluctant to install or update t…
▽ More
Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contract tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contract tracing applications, which make people reluctant to install or update their infection status on these applications. In this concept paper, we present ideas from Graph Neural Networks and explainability, that could improve trust in these applications, and encourage adoption by people.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
Classification of Reverse-Engineered Class Diagram and Forward-Engineered Class Diagram using Machine Learning
Authors:
Kaushil Mangaroliya,
Het Patel
Abstract:
UML Class diagram is very important to visualize the whole software we are working on and helps understand the whole system in the easiest way possible by showing the system classes, its attributes, methods, and relations with other objects. In the real world, there are two types of Class diagram engineers work with namely 1) Forward Engineered Class Diagram (FwCD) which are hand-made as part of t…
▽ More
UML Class diagram is very important to visualize the whole software we are working on and helps understand the whole system in the easiest way possible by showing the system classes, its attributes, methods, and relations with other objects. In the real world, there are two types of Class diagram engineers work with namely 1) Forward Engineered Class Diagram (FwCD) which are hand-made as part of the forward-looking development process, and 2). Reverse Engineered Class Diagram (RECD) which are those diagrams that are reverse engineered from the source code. In the software industry while working with new open software projects it is important to know which type of class diagram it is. Which UML diagram was used in a particular project is an important factor to be known? To solve this problem, we propose to build a classifier that can classify a UML diagram into FwCD or RECD. We propose to solve this problem by using a supervised Machine Learning technique. The approach in this involves analyzing the features that are useful in classifying class diagrams. Different Machine Learning models are used in this process and the Random Forest algorithm has proved to be the best out of all. Performance testing was done on 999 Class diagrams.
△ Less
Submitted 14 November, 2020;
originally announced November 2020.
-
Testing Short Distance Anisotropy in Space
Authors:
Robert B. Mann,
Idrus Husin,
Hrishikesh Patel,
Mir Faizal,
Anto Sulaksono,
Agus Suroso
Abstract:
The isotropy of space is not a logical requirement but rather is an empirical question; indeed there is suggestive evidence that universe might be anisotropic. A plausible source of these anisotropies could be quantum gravity corrections. If these corrections happen to be between the electroweak scale and the Planck scale, then these anisotropies can have measurable consequences at short distances…
▽ More
The isotropy of space is not a logical requirement but rather is an empirical question; indeed there is suggestive evidence that universe might be anisotropic. A plausible source of these anisotropies could be quantum gravity corrections. If these corrections happen to be between the electroweak scale and the Planck scale, then these anisotropies can have measurable consequences at short distances and their effects can be measured using ultra sensitive condensed matter systems. We investigate how such anisotropic quantum gravity corrections modify low energy physics through an anisotropic deformation of the Heisenberg algebra. We discuss how such anisotropies might be observed using a scanning tunneling microscope.
△ Less
Submitted 5 July, 2021; v1 submitted 5 November, 2020;
originally announced November 2020.
-
BioNerFlair: biomedical named entity recognition using flair embedding and sequence tagger
Authors:
Harsh Patel
Abstract:
Motivation: The proliferation of Biomedical research articles has made the task of information retrieval more important than ever. Scientists and Researchers are having difficulty in finding articles that contain information relevant to them. Proper extraction of biomedical entities like Disease, Drug/chem, Species, Gene/protein, can considerably improve the filtering of articles resulting in bett…
▽ More
Motivation: The proliferation of Biomedical research articles has made the task of information retrieval more important than ever. Scientists and Researchers are having difficulty in finding articles that contain information relevant to them. Proper extraction of biomedical entities like Disease, Drug/chem, Species, Gene/protein, can considerably improve the filtering of articles resulting in better extraction of relevant information. Performance on BioNer benchmarks has progressively improved because of progression in transformers-based models like BERT, XLNet, OpenAI, GPT2, etc. These models give excellent results; however, they are computationally expensive and we can achieve better scores for domain-specific tasks using other contextual string-based models and LSTM-CRF based sequence tagger. Results: We introduce BioNerFlair, a method to train models for biomedical named entity recognition using Flair plus GloVe embeddings and Bidirectional LSTM-CRF based sequence tagger. With almost the same generic architecture widely used for named entity recognition, BioNerFlair outperforms previous state-of-the-art models. I performed experiments on 8 benchmarks datasets for biomedical named entity recognition. Compared to current state-of-the-art models, BioNerFlair achieves the best F1-score of 90.17 beyond 84.72 on the BioCreative II gene mention (BC2GM) corpus, best F1-score of 94.03 beyond 92.36 on the BioCreative IV chemical and drug (BC4CHEMD) corpus, best F1-score of 88.73 beyond 78.58 on the JNLPBA corpus, best F1-score of 91.1 beyond 89.71 on the NCBI disease corpus, best F1-score of 85.48 beyond 78.98 on the Species-800 corpus, while near best results was observed on BC5CDR-chem, BC3CDR-disease, and LINNAEUS corpus.
△ Less
Submitted 3 November, 2020;
originally announced November 2020.
-
Data Readiness Report
Authors:
Shazia Afzal,
Rajmohan C,
Manish Kesarwani,
Sameep Mehta,
Hima Patel
Abstract:
Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which limits their reuse and results in loss of productivity. We introduce the concept of a Data Readiness Report as an accompanying documentation to a dataset that allow…
▽ More
Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which limits their reuse and results in loss of productivity. We introduce the concept of a Data Readiness Report as an accompanying documentation to a dataset that allows data consumers to get detailed insights into the quality of input data. Data characteristics and challenges on various quality dimensions are identified and documented kee** in mind the principles of transparency and explainability. The Data Readiness Report also serves as a record of all data assessment operations including applied transformations. This provides a detailed lineage for the purpose of data governance and management. In effect, the report captures and documents the actions taken by various personas in a data readiness and assessment workflow. Overtime this becomes a repository of best practices and can potentially drive a recommendation system for building automated data readiness workflows on the lines of AutoML [8]. We anticipate that together with the Datasheets [9], Dataset Nutrition Label [11], FactSheets [1] and Model Cards [15], the Data Readiness Report makes significant progress towards Data and AI lifecycle documentation.
△ Less
Submitted 15 October, 2020; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning
Authors:
Subhojeet Pramanik,
Shashank Mujumdar,
Hima Patel
Abstract:
Recent approaches in literature have exploited the multi-modal information in documents (text, layout, image) to serve specific downstream document tasks. However, they are limited by their - (i) inability to learn cross-modal representations across text, layout and image dimensions for documents and (ii) inability to process multi-page documents. Pre-training techniques have been shown in Natural…
▽ More
Recent approaches in literature have exploited the multi-modal information in documents (text, layout, image) to serve specific downstream document tasks. However, they are limited by their - (i) inability to learn cross-modal representations across text, layout and image dimensions for documents and (ii) inability to process multi-page documents. Pre-training techniques have been shown in Natural Language Processing (NLP) domain to learn generic textual representations from large unlabelled datasets, applicable to various downstream NLP tasks. In this paper, we propose a multi-task learning-based framework that utilizes a combination of self-supervised and supervised pre-training tasks to learn a generic document representation applicable to various downstream document tasks. Specifically, we introduce Document Topic Modelling and Document Shuffle Prediction as novel pre-training tasks to learn rich image representations along with the text and layout representations for documents. We utilize the Longformer network architecture as the backbone to encode the multi-modal information from multi-page documents in an end-to-end fashion. We showcase the applicability of our pre-training framework on a variety of different real-world document tasks such as document classification, document information extraction, and document retrieval. We evaluate our framework on different standard document datasets and conduct exhaustive experiments to compare performance against various ablations of our framework and state-of-the-art baselines.
△ Less
Submitted 5 January, 2022; v1 submitted 30 September, 2020;
originally announced September 2020.
-
Assessing the Interplay between travel patterns and SARS-CoV-2 outbreak in realistic urban setting
Authors:
Rohan Patil,
Raviraj Dave,
Harsh Patel,
Viraj M Shah,
Deep Chakrabarti,
Udit Bhatia
Abstract:
The dense social contact networks and high mobility in congested urban areas facilitate the rapid transmission of infectious diseases. Typical mechanistic epidemiological models are either based on uniform mixing with ad-hoc contact processes or need real-time or archived population mobility data to simulate the social networks. However, the rapid and global transmission of the novel coronavirus (…
▽ More
The dense social contact networks and high mobility in congested urban areas facilitate the rapid transmission of infectious diseases. Typical mechanistic epidemiological models are either based on uniform mixing with ad-hoc contact processes or need real-time or archived population mobility data to simulate the social networks. However, the rapid and global transmission of the novel coronavirus (SARS-CoV-2) has led to unprecedented lockdowns at global and regional scales, leaving the archived datasets to limited use. While it is often hypothesized that population density is a significant driver in disease propagation, the disparate disease trajectories and infection rates exhibited by the different cities with comparable densities require a high-resolution description of the disease and its drivers. In this study, we explore the impact of the creation of containment zones on travel patterns within the city. Further, we use a dynamical network-based infectious disease model to understand the key drivers of disease spread at sub-kilometer scales demonstrated in the city of Ahmedabad, India, which has been classified as a SARS-CoV-2 hotspot. We find that in addition to the contact network and population density, road connectivity patterns and ease of transit are strongly correlated with the rate of transmission of the disease. Given the limited access to real-time traffic data during lockdowns, we generate road connectivity networks using open-source imageries and travel patterns from open-source surveys and government reports. Within the proposed framework, we then analyze the relative merits of social distancing, enforced lockdowns, and enhanced testing and quarantining mitigating the disease spread.
△ Less
Submitted 25 September, 2020;
originally announced September 2020.
-
Asymptotic analysis of the Boltzmann equation for dark matter relic abundance
Authors:
Logan A. Morrison,
Hiren H. Patel,
Jaryd F. Ulbricht
Abstract:
A solution to the Boltzmann equation governing the thermal relic abundance of cold dark matter is constructed by matched asymptotic approximations. The approximation of the relic density is an asymptotic series valid when the abundance does not deviate significantly from its equilibrium value until small temperatures. Resonance and threshold effects are taken into account at leading order and foun…
▽ More
A solution to the Boltzmann equation governing the thermal relic abundance of cold dark matter is constructed by matched asymptotic approximations. The approximation of the relic density is an asymptotic series valid when the abundance does not deviate significantly from its equilibrium value until small temperatures. Resonance and threshold effects are taken into account at leading order and found to be negligible unless the annihilation cross section is negligible at threshold. Comparisons are made to previously attempted constructions and to the freeze out approximation commonly employed in the literature. Extensions to higher order matching is outlined, and implications for solving related systems are discussed. We compare our results to a numerical determination of the relic abundance using a benchmark model and find a fantastic agreement. The method developed also serves as a solution to a wide class of problems containing an infinite order turning point.
△ Less
Submitted 8 September, 2020;
originally announced September 2020.
-
Electron EDM in the complex two-Higgs doublet model
Authors:
Wolfgang Altmannshofer,
Stefania Gori,
Nick Hamer,
Hiren H. Patel
Abstract:
We present the first complete two loop calculation of the electron EDM in the complex two-Higgs doublet model. We confirm gauge-independence by demonstrating analytic cancellation of the gauge parameter $ξ$ in the background field gauge and the 't Hooft $R_ξ$ gauge. We also investigate the behavior of the electron EDM near the decoupling limit, and determine the short- and long-distance contributi…
▽ More
We present the first complete two loop calculation of the electron EDM in the complex two-Higgs doublet model. We confirm gauge-independence by demonstrating analytic cancellation of the gauge parameter $ξ$ in the background field gauge and the 't Hooft $R_ξ$ gauge. We also investigate the behavior of the electron EDM near the decoupling limit, and determine the short- and long-distance contributions by matching onto an effective field theory. Compared with earlier studies of the electron EDM in the complex two-Higgs doublet model, we note disagreements in several places and provide diagnoses where possible. We also provide expressions for EDMs of light quarks.
△ Less
Submitted 28 September, 2020; v1 submitted 2 September, 2020;
originally announced September 2020.
-
Detecting Problem Statements in Peer Assessments
Authors:
Yunkai Xiao,
Gabriel Zingle,
Qin** Jia,
Harsh R. Shah,
Yi Zhang,
Tianyi Li,
Mohsin Karovaliya,
Weixiang Zhao,
Yang Song,
Jie Ji,
Ashwin Balasubramaniam,
Harshit Patel,
Priyankha Bhalasubbramanian,
Vikram Patel,
Edward F. Gehringer
Abstract:
Effective peer assessment requires students to be attentive to the deficiencies in the work they rate. Thus, their reviews should identify problems. But what ways are there to check that they do? We attempt to automate the process of deciding whether a review comment detects a problem. We use over 18,000 review comments that were labeled by the reviewees as either detecting or not detecting a prob…
▽ More
Effective peer assessment requires students to be attentive to the deficiencies in the work they rate. Thus, their reviews should identify problems. But what ways are there to check that they do? We attempt to automate the process of deciding whether a review comment detects a problem. We use over 18,000 review comments that were labeled by the reviewees as either detecting or not detecting a problem with the work. We deploy several traditional machine-learning models, as well as neural-network models using GloVe and BERT embeddings. We find that the best performer is the Hierarchical Attention Network classifier, followed by the Bidirectional Gated Recurrent Units (GRU) Attention and Capsule model with scores of 93.1% and 90.5% respectively. The best non-neural network model was the support vector machine with a score of 89.71%. This is followed by the Stochastic Gradient Descent model and the Logistic Regression model with 89.70% and 88.98%.
△ Less
Submitted 29 May, 2020;
originally announced June 2020.
-
Link Prediction using Graph Neural Networks for Master Data Management
Authors:
Balaji Ganesan,
Srinivas Parkala,
Neeraj R Singh,
Sumit Bhatia,
Gayatri Mishra,
Matheen Ahmed Pasha,
Hima Patel,
Somashekar Naganna
Abstract:
Learning graph representations of n-ary relational data has a number of real world applications like anti-money laundering, fraud detection, and customer due diligence. Contact tracing of COVID19 positive persons could also be posed as a Link Prediction problem. Predicting links between people using Graph Neural Networks requires careful ethical and privacy considerations than in domains where GNN…
▽ More
Learning graph representations of n-ary relational data has a number of real world applications like anti-money laundering, fraud detection, and customer due diligence. Contact tracing of COVID19 positive persons could also be posed as a Link Prediction problem. Predicting links between people using Graph Neural Networks requires careful ethical and privacy considerations than in domains where GNNs have typically been applied so far. We introduce novel methods for anonymizing data, model training, explainability and verification for Link Prediction in Master Data Management, and discuss our results.
△ Less
Submitted 28 August, 2020; v1 submitted 7 March, 2020;
originally announced March 2020.
-
Probing Short Gravity using Temporal Lensing
Authors:
Mir Faizal,
Hrishikesh Patel
Abstract:
It is known that probing gravity in the submillimeter-micrometer range is difficult due to the relative weakness of the gravitational force. We intend to overcome this challenge by using extreme temporal precision to monitor transient events in a gravitational field. We propose a compressed ultrafast photography system called T-CUP to serve this purpose. We show that the T-CUP's precision of 10 tr…
▽ More
It is known that probing gravity in the submillimeter-micrometer range is difficult due to the relative weakness of the gravitational force. We intend to overcome this challenge by using extreme temporal precision to monitor transient events in a gravitational field. We propose a compressed ultrafast photography system called T-CUP to serve this purpose. We show that the T-CUP's precision of 10 trillion frames per second can allow us to better resolve gravity at short distances. We also show the feasibility of the setup in measuring Yukawa and power-law corrections to gravity which have substantial theoretical motivation.
△ Less
Submitted 29 June, 2021; v1 submitted 3 March, 2020;
originally announced March 2020.
-
Compactification, T-Duality and Quantum Erasers
Authors:
Salman Sajad Wani,
Dylan Sutherland,
Behnam Pourhassan,
Mir Faizal,
Hrishikesh Patel
Abstract:
Using T-duality, we will argue that a zero point length exists in the low energy effective field theory of string theory on compactified extra dimensions. Furthermore, if we neglect the oscillator modes, this zero point length would modify low quantum mechanical systems. As this zero length is fixed geometrically, it is important to analyze how it modifies purely quantum mechanical effects. Thus,…
▽ More
Using T-duality, we will argue that a zero point length exists in the low energy effective field theory of string theory on compactified extra dimensions. Furthermore, if we neglect the oscillator modes, this zero point length would modify low quantum mechanical systems. As this zero length is fixed geometrically, it is important to analyze how it modifies purely quantum mechanical effects. Thus, we will analyze its effects on quantum erasers, because they are based on quantum effects like entanglement. It will be observed that the behavior of these quantum erasers gets modified by this zero point length. As the zero point length is fixed by the radius of compactification, we argue that these results demonstrate a deeper connection between geometry and quantum effects.
△ Less
Submitted 29 February, 2020;
originally announced March 2020.
-
Behavior of Cross Sections for Large Numbers of Particles
Authors:
Michael Dine,
Hiren H. Patel,
Jaryd F. Ulbricht
Abstract:
It has been suggested that scattering cross sections at very high energies for producing large numbers of Higgs particles may exhibit factorial growth, and that curing this growth might be relevant to other questions in the Standard Model. We point out, first, that the question is inherently non-perturbative; low orders in the formal perturbative expansion do not give a good approximation to the s…
▽ More
It has been suggested that scattering cross sections at very high energies for producing large numbers of Higgs particles may exhibit factorial growth, and that curing this growth might be relevant to other questions in the Standard Model. We point out, first, that the question is inherently non-perturbative; low orders in the formal perturbative expansion do not give a good approximation to the scattering amplitude for sufficiently large N for any fixed, small value of the coupling. Focusing on $λφ^{4}$ theory, we argue that there may be a systematic approximation scheme for processes where N particles near threshold scatter to produce N particles, and discuss the leading contributions to the scattering amplitude and cross sections in this limit. Scattering amplitudes do not grow as rapidly as in perturbation theory. Additionally, partial and total cross sections do not show factorial growth. In the case of cross sections for $2 \to N$ particles, there is no systematic large N approximation available. That said, we provide evidence that non-perturbatively, there is no factorial growth in partial or total cross sections.
△ Less
Submitted 27 February, 2020;
originally announced February 2020.
-
Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings
Authors:
Tarique Siddiqui,
Alekh **dal,
Shi Qiao,
Hiren Patel,
Wangchao le
Abstract:
Query processing over big data is ubiquitous in modern clouds, where the system takes care of picking both the physical query execution plans and the resources needed to run those plans, using a cost-based query optimizer. A good cost model, therefore, is akin to better resource efficiency and lower operational costs. Unfortunately, the production workloads at Microsoft show that costs are very co…
▽ More
Query processing over big data is ubiquitous in modern clouds, where the system takes care of picking both the physical query execution plans and the resources needed to run those plans, using a cost-based query optimizer. A good cost model, therefore, is akin to better resource efficiency and lower operational costs. Unfortunately, the production workloads at Microsoft show that costs are very complex to model for big data systems. In this work, we investigate two key questions: (i) can we learn accurate cost models for big data systems, and (ii) can we integrate the learned models within the query optimizer. To answer these, we make three core contributions. First, we exploit workload patterns to learn a large number of individual cost models and combine them to achieve high accuracy and coverage over a long period. Second, we propose extensions to Cascades framework to pick optimal resources, i.e, number of containers, during query planning. And third, we integrate the learned cost models within the Cascade-style query optimizer of SCOPE at Microsoft. We evaluate the resulting system, Cleo, in a production environment using both production and TPC-H workloads. Our results show that the learned cost models are 2 to 3 orders of magnitude more accurate, and 20X more correlated with the actual runtimes, with a large majority (70%) of the plan changes leading to substantial improvements in latency as well as resource usage.
△ Less
Submitted 27 February, 2020;
originally announced February 2020.
-
Data Augmentation for Personal Knowledge Base Population
Authors:
Lingraj S Vannur,
Balaji Ganesan,
Lokesh Nagalapatti,
Hima Patel,
MN Thippeswamy
Abstract:
Cold start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the different tasks that are part of KBP, the overall F1 of the end-to-end system remains quite low. This problem is more acute in personal knowledge bases, which present additional challenges with regard to d…
▽ More
Cold start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the different tasks that are part of KBP, the overall F1 of the end-to-end system remains quite low. This problem is more acute in personal knowledge bases, which present additional challenges with regard to data protection, fairness and privacy. In this work, we present a system that uses rule based annotators and a graph neural network for missing link prediction, to populate a more complete, fair and diverse knowledge base from the TACRED dataset.
△ Less
Submitted 18 August, 2020; v1 submitted 23 February, 2020;
originally announced February 2020.
-
Implications for Electric Dipole Moments of a Leptoquark Scenario for the $B$-Physics Anomalies
Authors:
Wolfgang Altmannshofer,
Stefania Gori,
Hiren H. Patel,
Stefano Profumo,
Douglas Tuckler
Abstract:
Vector leptoquarks can address the lepton flavor universality anomalies in decays associated with the $b \to c \ell ν$ and $b \to s \ell \ell$ transitions, as observed in recent years. Generically, these leptoquarks yield new sources of CP violation. In this paper, we explore constraints and discovery potential for electric dipole moments (EDMs) in leptonic and hadronic systems. We provide the mos…
▽ More
Vector leptoquarks can address the lepton flavor universality anomalies in decays associated with the $b \to c \ell ν$ and $b \to s \ell \ell$ transitions, as observed in recent years. Generically, these leptoquarks yield new sources of CP violation. In this paper, we explore constraints and discovery potential for electric dipole moments (EDMs) in leptonic and hadronic systems. We provide the most generic expressions for dipole moments induced by vector leptoquarks at one loop. We find that $O(1)$ CP-violating phases in tau and muon couplings can lead to corresponding EDMs within reach of next-generation EDM experiments, and that existing bounds on the electron EDM already put stringent constraints on CP-violating electron couplings.
△ Less
Submitted 17 February, 2020; v1 submitted 4 February, 2020;
originally announced February 2020.
-
A Neural Architecture for Person Ontology population
Authors:
Balaji Ganesan,
Riddhiman Dasgupta,
Akshay Parekh,
Hima Patel,
Berthold Reinwald
Abstract:
A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, didentification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual pr…
▽ More
A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, didentification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual process, because it requires a fixed set of semantic relations between concepts. In this work, we present a system for automatically populating a person ontology graph from unstructured data using neural models for Entity Classification and Relation Extraction. We introduce a new dataset for these tasks and discuss our results.
△ Less
Submitted 22 January, 2020;
originally announced January 2020.
-
Parity-Violating Møller Scattering at NNLO: Closed Fermion Loops
Authors:
Yong Du,
Ayres Freitas,
Hiren H. Patel,
Michael J. Ramsey-Musolf
Abstract:
A complete, gauge-invariant computation of two loop virtual corrections involving closed fermion loops to the polarized Møller scattering asymmetry is presented. The set of contributions involving two closed fermion loops and the set involving one closed fermion loop are numerically similar in magnitude to the one-loop bosonic corrections and yield an overall correction of 1.3% relative to the tre…
▽ More
A complete, gauge-invariant computation of two loop virtual corrections involving closed fermion loops to the polarized Møller scattering asymmetry is presented. The set of contributions involving two closed fermion loops and the set involving one closed fermion loop are numerically similar in magnitude to the one-loop bosonic corrections and yield an overall correction of 1.3% relative to the tree-level asymmetry. We estimate sizes of remaining two-loop contributions and discuss implications for the upcoming MOLLER experiment.
△ Less
Submitted 2 April, 2021; v1 submitted 17 December, 2019;
originally announced December 2019.
-
Loop Dominated Signals from Neutrino Portal Dark Matter
Authors:
Hiren H. Patel,
Stefano Profumo,
Bibhushan Shakya
Abstract:
We study scenarios where loop processes give the dominant contributions to dark matter decay or annihilation despite the presence of tree level channels. We illustrate this possibility in a specific model where dark matter is part of a hidden sector that communicates with the Standard Model sector via a heavy neutrino portal. We explain the underpinning rationale for how loop processes mediated by…
▽ More
We study scenarios where loop processes give the dominant contributions to dark matter decay or annihilation despite the presence of tree level channels. We illustrate this possibility in a specific model where dark matter is part of a hidden sector that communicates with the Standard Model sector via a heavy neutrino portal. We explain the underpinning rationale for how loop processes mediated by the portal neutrinos can parametrically dominate over tree level decay channels, and demonstrate that this qualitatively changes the indirect detection signals in positrons, neutrinos, and gamma rays.
△ Less
Submitted 11 December, 2019;
originally announced December 2019.
-
Hierarchical network design for nitrogen dioxide measurement in urban environments, part 1: proxy selection
Authors:
Lena Weissert,
Georgia Miskell,
Elaine Miles,
Kyle Alberti,
Brandon Feenstra,
Hamesh Patel,
Vasileios Papapostolou,
Andrea Polidori,
Geoff S Henshaw,
Jennifer A Salmond,
David E Williams
Abstract:
Previous studies have shown that a hierarchical network comprising a number of compliant reference stations and a much larger number of low-cost sensors can deliver reliable air quality data at high temporal and spatial resolution for ozone at neighbourhood scales. Key to this framework is the concept of a proxy: a reliable (regulatory) data source whose results have sufficient statistical similar…
▽ More
Previous studies have shown that a hierarchical network comprising a number of compliant reference stations and a much larger number of low-cost sensors can deliver reliable air quality data at high temporal and spatial resolution for ozone at neighbourhood scales. Key to this framework is the concept of a proxy: a reliable (regulatory) data source whose results have sufficient statistical similarity over some period of time to those from any given low-cost measurement site. This enables the low-cost instruments to be calibrated remotely, avoiding the need for costly on-site calibration of dense networks. This paper assesses the suitability of this method for local air pollutants such as nitrogen dioxide which show large temporal and spatial variability in concentration. The proxy technique is evaluated using the data from the network of regulatory air monitoring stations measuring nitrogen dioxide in Southern California to avoid errors introduced by low-cost instrument performance. Proxies chosen based on land use similarity signalled typically less than 0.1 percent false alarms. Although poor proxy performance was observed when the local geography was unusual (a semi-enclosed valley) in this instance the closest neighbour station proved to be an appropriate alternative. The method also struggled when wind speeds were low and very local sources presumably dominated the concentration patterns. Overall, we demonstrate that the technique can be applied to nitrogen dioxide, and that appropriate proxies can be found even within a spatially sparse network of stations in a region with large spatio-temporal variation in concentration.
△ Less
Submitted 8 November, 2019;
originally announced November 2019.
-
Hierarchical network design for nitrogen dioxide measurement in urban environments, part 2: network-based sensor calibration
Authors:
Lena Weissert,
Elaine Miles,
Georgia Miskell,
Kyle Alberti,
Brandon Feenstra,
Geoff S Henshaw,
Vasileios Papapostolou,
Hamesh Patel,
Andrea Polidori,
Jennifer A Salmond,
David E Williams
Abstract:
We present a management and data correction framework for low-cost electrochemical sensors for nitrogen dioxide (NO2) deployed within a hierarchical network of low-cost and regulatory-grade instruments. The framework is founded on the idea that it is possible in a suitably configured network to identify a source of reliable proxy data for each sensor site that has a similar probability distributio…
▽ More
We present a management and data correction framework for low-cost electrochemical sensors for nitrogen dioxide (NO2) deployed within a hierarchical network of low-cost and regulatory-grade instruments. The framework is founded on the idea that it is possible in a suitably configured network to identify a source of reliable proxy data for each sensor site that has a similar probability distribution of measurement values over a suitable time period. Previous work successfully applied these ideas to a sensor system with a simple linear 2-parameter (slope and offset) response. Applying these ideas to electrochemical sensors for NO2 presents significant additional difficulties for which we demonstrate solutions. The three NO2 sensor response parameters (offset, ozone (O3) response slope, and NO2 response slope) are known to vary significantly as a consequence of ambient humidity and temperature variations. Here we demonstrate that these response parameters can be estimated by minimising the Kullback-Leibler divergence between sensor-estimated and proxy NO2 distributions over a 3-day window. We then estimate an additional offset term by using co-location data. This offset term is dependent on climate and spatially correlated and can thus be projected across the network. Co-location data also estimates the time-, space- and concentration-dependent error distribution between sensors and regulatory-grade instruments. We show how the parameter variations can be used to indicate both sensor failure and failure of the proxy assumption. We apply the procedures to a network of 56 sensors distributed across the Inland Empire and Los Angeles County regions, demonstrating the need for reliable data from dense networks of monitors to supplement the existing regulatory networks.
△ Less
Submitted 8 November, 2019;
originally announced November 2019.
-
Low-cost sensor networks and land-use regression: interpolating nitrogen dioxide concentration at high temporal and spatial resolution in Southern California
Authors:
Lena Weissert,
Kyle Alberti,
Elaine Miles,
Georgia Miskell,
Brandon Feenstra,
Geoff S Henshaw,
Vasileios Papapostolou,
Hamesh Patel,
Andrea Polidori,
Jennifer A Salmond,
David E Williams
Abstract:
The development of low-cost sensors and novel calibration algorithms offer new opportunities to supplement existing regulatory networks to measure air pollutants at a high spatial resolution and at hourly and sub-hourly timescales. We use a random forest model on data from a network of low-cost sensors to describe the effect of land use features on local-scale air quality, extend this model to des…
▽ More
The development of low-cost sensors and novel calibration algorithms offer new opportunities to supplement existing regulatory networks to measure air pollutants at a high spatial resolution and at hourly and sub-hourly timescales. We use a random forest model on data from a network of low-cost sensors to describe the effect of land use features on local-scale air quality, extend this model to describe the hourly-scale variation of air quality at high spatial resolution, and show that deviations from the model can be used to identify particular conditions and locations where air quality differs from the expected land-use effect. The conditions and locations under which deviations were detected conform to expectations based on general experience.
△ Less
Submitted 8 November, 2019;
originally announced November 2019.
-
Machine Translation Evaluation using Bi-directional Entailment
Authors:
Rakesh Khobragade,
Heaven Patel,
Anand Namdev,
Anish Mishra,
Pushpak Bhattacharyya
Abstract:
In this paper, we propose a new metric for Machine Translation (MT) evaluation, based on bi-directional entailment. We show that machine generated translation can be evaluated by determining paraphrasing with a reference translation provided by a human translator. We hypothesize, and show through experiments, that paraphrasing can be detected by evaluating entailment relationship in the forward an…
▽ More
In this paper, we propose a new metric for Machine Translation (MT) evaluation, based on bi-directional entailment. We show that machine generated translation can be evaluated by determining paraphrasing with a reference translation provided by a human translator. We hypothesize, and show through experiments, that paraphrasing can be detected by evaluating entailment relationship in the forward and backward direction. Unlike conventional metrics, like BLEU or METEOR, our approach uses deep learning to determine the semantic similarity between candidate and reference translation for generating scores rather than relying upon simple n-gram overlap. We use BERT's pre-trained implementation of transformer networks, fine-tuned on MNLI corpus, for natural language inferencing. We apply our evaluation metric on WMT'14 and WMT'17 dataset to evaluate systems participating in the translation task and find that our metric has a better correlation with the human annotated score compared to the other traditional metrics at system level.
△ Less
Submitted 2 November, 2019;
originally announced November 2019.
-
The Majoron at two loops
Authors:
Julian Heeck,
Hiren H. Patel
Abstract:
We present singlet-Majoron couplings to Standard Model particles through two loops at leading order in the seesaw expansion, including couplings to gauge bosons as well as flavor-changing quark interactions. We discuss and compare the relevant phenomenological constraints on Majoron production as well as decaying Majoron dark matter. A comparison with standard seesaw observables in low-scale setti…
▽ More
We present singlet-Majoron couplings to Standard Model particles through two loops at leading order in the seesaw expansion, including couplings to gauge bosons as well as flavor-changing quark interactions. We discuss and compare the relevant phenomenological constraints on Majoron production as well as decaying Majoron dark matter. A comparison with standard seesaw observables in low-scale settings highlights the importance of searches for lepton-flavor-violating two-body decays $\ell \to \ell' +$Majoron in both the muon and tau sectors.
△ Less
Submitted 14 November, 2019; v1 submitted 4 September, 2019;
originally announced September 2019.
-
Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML
Authors:
Ashvin Agrawal,
Rony Chatterjee,
Carlo Curino,
Avrilia Floratou,
Neha Gowdal,
Matteo Interlandi,
Alekh **dal,
Kostantinos Karanasos,
Subru Krishnan,
Brian Kroth,
Jyoti Leeka,
Kwanghyun Park,
Hiren Patel,
Olga Poppe,
Fotis Psallidas,
Raghu Ramakrishnan,
Abhishek Roy,
Karla Saur,
Rathijit Sen,
Markus Weimer,
Travis Wright,
Yiwen Zhu
Abstract:
Machine learning (ML) has proven itself in high-value web applications such as search ranking and is emerging as a powerful tool in a much broader range of enterprise scenarios including voice recognition and conversational understanding for customer support, autotuning for videoconferencing, intelligent feedback loops in large-scale sysops, manufacturing and autonomous vehicle management, complex…
▽ More
Machine learning (ML) has proven itself in high-value web applications such as search ranking and is emerging as a powerful tool in a much broader range of enterprise scenarios including voice recognition and conversational understanding for customer support, autotuning for videoconferencing, intelligent feedback loops in large-scale sysops, manufacturing and autonomous vehicle management, complex financial predictions, just to name a few. Meanwhile, as the value of data is increasingly recognized and monetized, concerns about securing valuable data and risks to individual privacy have been growing. Consequently, rigorous data management has emerged as a key requirement in enterprise settings. How will these trends (ML growing popularity, and stricter data governance) intersect? What are the unmet requirements for applying ML in enterprise settings? What are the technical challenges for the DB community to solve? In this paper, we present our vision of how ML and database systems are likely to come together, and early steps we take towards making this vision a reality.
△ Less
Submitted 27 December, 2019; v1 submitted 30 August, 2019;
originally announced September 2019.
-
Tale of tails using rule augmented sequence labeling for event extraction
Authors:
Ayush Maheshwari,
Hrishikesh Patel,
Nandan Rathod,
Ritesh Kumar,
Ganesh Ramakrishnan,
Pushpak Bhattacharyya
Abstract:
The problem of event extraction is a relatively difficult task for low resource languages due to the non-availability of sufficient annotated data. Moreover, the task becomes complex for tail (rarely occurring) labels wherein extremely less data is available. In this paper, we present a new dataset (InDEE-2019) in the disaster domain for multiple Indic languages, collected from news websites. Usin…
▽ More
The problem of event extraction is a relatively difficult task for low resource languages due to the non-availability of sufficient annotated data. Moreover, the task becomes complex for tail (rarely occurring) labels wherein extremely less data is available. In this paper, we present a new dataset (InDEE-2019) in the disaster domain for multiple Indic languages, collected from news websites. Using this dataset, we evaluate several rule-based mechanisms to augment deep learning based models. We formulate our problem of event extraction as a sequence labeling task and perform extensive experiments to study and understand the effectiveness of different approaches. We further show that tail labels can be easily incorporated by creating new rules without the requirement of large annotated data.
△ Less
Submitted 31 January, 2020; v1 submitted 19 August, 2019;
originally announced August 2019.
-
Proposed experimental test of Randall-Sundrum Models
Authors:
Behnam Pourhassan,
Anha Bhat,
Hrishikesh Patel,
Mir Faizal,
Nicholas Mantella
Abstract:
The Randall-Sundrum models are expected to modify the short distance behavior of general relativity. In this paper, we will propose an experimental test for this short distance modification due to Randall-Sundrum models. This will be done by analyzing motion of a particle which is moving in spherical gravitational field with a drag force. The position at which the particle stops will be different…
▽ More
The Randall-Sundrum models are expected to modify the short distance behavior of general relativity. In this paper, we will propose an experimental test for this short distance modification due to Randall-Sundrum models. This will be done by analyzing motion of a particle which is moving in spherical gravitational field with a drag force. The position at which the particle stops will be different in general relativity and Randall-Sundrum model. This difference in the distance moved by the particle before stop** can be measured using a Nanoelectromechanical setup. Thus, it is possible to experimentally test Randall-Sundrum models using currently available technology.
△ Less
Submitted 27 January, 2020; v1 submitted 5 July, 2019;
originally announced July 2019.
-
Reliable data from low cost ozone sensors in a hierarchical network
Authors:
Georgia Miskell,
Kyle Alberti,
Brandon Feenstra,
Geoff S Henshaw,
Vasileios Papapostolou,
Hamesh Patel,
Andrea Polidori,
Jennifer A Salmond,
Lena Weissert,
David E Williams
Abstract:
We demonstrate how a hierarchical network comprising a number of compliant reference stations and a much larger number of low-cost sensors can deliver reliable high temporal-resolution ozone data at neighbourhood scales. The framework, demonstrated originally for a smaller scale regional network deployed in the Lower Fraser Valley, BC was tested and refined using two much more extensive networks o…
▽ More
We demonstrate how a hierarchical network comprising a number of compliant reference stations and a much larger number of low-cost sensors can deliver reliable high temporal-resolution ozone data at neighbourhood scales. The framework, demonstrated originally for a smaller scale regional network deployed in the Lower Fraser Valley, BC was tested and refined using two much more extensive networks of gas-sensitive semiconductor-based (GSS) sensors deployed at neighbourhood scales in Los Angeles: one of ~20 and one of ~45 GSS ozone sensors. Of these, ten sensors were co-located with different regulatory measurement stations, allowing a rigorous test of the accuracy of the algorithms used for off-site calibration and adjustment of low cost sensors. The method is based on adjusting the gain and offset of the low-cost sensor to match the first two moments of the probability distribution of the sensor result to that of a proxy: a calibrated independent measurement (usually derived from regulatory monitors) whose probability distribution evaluated over a time that emphasizes diurnal variations is similar to that at the test location. The regulatory measurement station physically closest to the low-cost sensor was a good proxy for most sites. The algorithms developed were successful in detecting and correcting sensor drift, and in identifying locations where geographical features resulted in significantly different patterns of ozone variation due to the relative dominance of different dispersion, emission and chemical processes. The entire network results show very large variations in ozone concentration that take place on short time- and distance scales across the Los-Angeles region. Such patterns were not captured by the more sparsely distributed stations of the existing regulatory network and demonstrate the need for reliable data from dense networks of monitors.
△ Less
Submitted 19 June, 2019;
originally announced June 2019.
-
Decay spectroscopy of $^{50}$Sc and $^{50m}$Sc to $^{50}$Ti
Authors:
M. Bowry,
C. E. Jones,
A. B. Garnsworthy,
G. C. Ball,
S. Cruz,
S. Georges,
G. Hackman,
J. D. Holt,
J. Measures,
B. Olaizola,
H. P. Patel,
C. J. Pearson,
C. E. Svensson
Abstract:
The $β$ decay of the isomeric and ground state of $^{50}$Sc to the semi-magic nucleus $^{50}_{22}$Ti$_{28}$ has been studied using a $^{50}$Ca beam delivered to the GRIFFIN $γ$-ray spectrometer at the TRIUMF-ISAC facility. $β$-decay branching ratios are reported to 16 excited states with a total of 38 $γ$-ray transitions linking them. These new data significantly expands the information available…
▽ More
The $β$ decay of the isomeric and ground state of $^{50}$Sc to the semi-magic nucleus $^{50}_{22}$Ti$_{28}$ has been studied using a $^{50}$Ca beam delivered to the GRIFFIN $γ$-ray spectrometer at the TRIUMF-ISAC facility. $β$-decay branching ratios are reported to 16 excited states with a total of 38 $γ$-ray transitions linking them. These new data significantly expands the information available over previous studies. Relative intensities are measured to less than 0.001$\%$ that of the strongest transition with the majority of $γ$-ray transitions observed here in $β$ decay for the first time. The data are compared to shell-model calculations utilizing both phenomenologically-derived interactions employed in the ${\it pf}$ shell as well as a state-of-the-art, ${\it ab~initio}$ based interaction built in the valence-space in-medium similarity renormalization group framework.
△ Less
Submitted 13 August, 2021; v1 submitted 10 June, 2019;
originally announced June 2019.
-
Map** Missing Population in Rural India: A Deep Learning Approach with Satellite Imagery
Authors:
Wenjie Hu,
Jay Harshadbhai Patel,
Zoe-Alanah Robert,
Paul Novosad,
Samuel Asher,
Zhongyi Tang,
Marshall Burke,
David Lobell,
Stefano Ermon
Abstract:
Millions of people worldwide are absent from their country's census. Accurate, current, and granular population metrics are critical to improving government allocation of resources, to measuring disease control, to responding to natural disasters, and to studying any aspect of human life in these communities. Satellite imagery can provide sufficient information to build a population map without th…
▽ More
Millions of people worldwide are absent from their country's census. Accurate, current, and granular population metrics are critical to improving government allocation of resources, to measuring disease control, to responding to natural disasters, and to studying any aspect of human life in these communities. Satellite imagery can provide sufficient information to build a population map without the cost and time of a government census. We present two Convolutional Neural Network (CNN) architectures which efficiently and effectively combine satellite imagery inputs from multiple sources to accurately predict the population density of a region. In this paper, we use satellite imagery from rural villages in India and population labels from the 2011 SECC census. Our best model achieves better performance than previous papers as well as LandScan, a community standard for global population distribution.
△ Less
Submitted 4 May, 2019;
originally announced May 2019.
-
Detailed Performance Loss Analysis of Silicon Solar Cells using High-Throughput Metrology Methods
Authors:
Mohammad Jobayer Hossain,
Geoffrey Gregory,
Hardik Patel,
Siyu Guo,
Eric J. Schneller,
Andrew M. Gabor,
Zhihao Yang,
Adrienne L. Blum,
Kristopher O. Davis
Abstract:
In this work, novel, high-throughput metrology methods are used to perform a detailed performance loss analysis of approximately 400 industrial crystalline silicon solar cells, all coming from the same production line. The characterization sequence includes a non-destructive transfer length method (TLM) measurement technique featuring circular TLM structures hidden within the busbar region of the…
▽ More
In this work, novel, high-throughput metrology methods are used to perform a detailed performance loss analysis of approximately 400 industrial crystalline silicon solar cells, all coming from the same production line. The characterization sequence includes a non-destructive transfer length method (TLM) measurement technique featuring circular TLM structures hidden within the busbar region of the cells. It also includes a very fast external quantum efficiency and reflectance measurement technique. More traditional measurements, like illuminated current-voltage, Suns-VOC, and photoluminescence imaging are also used to carry out the loss analysis. The variance of the individual loss parameters and their impact on cell performance are investigated and quantified for this large group of industrial solar cells. Some important correlations between the measured loss parameters are found. The nature of these distributions and correlations provide important insights about loss mechanisms in a cell and help prioritize efforts to optimize the performance of the production line.
△ Less
Submitted 26 February, 2019;
originally announced March 2019.
-
FDFNet : A Secure Cancelable Deep Finger Dorsal Template Generation Network Secured via. Bio-Hashing
Authors:
Avantika Singh,
Ashish Arora,
Shreya Hasmukh Patel,
Gaurav Jaswal,
Aditya Nigam
Abstract:
Present world has already been consistently exploring the fine edges of online and digital world by imposing multiple challenging problems/scenarios. Similar to physical world, personal identity management is very crucial in-order to provide any secure online system. Last decade has seen a lot of work in this area using biometrics such as face, fingerprint, iris etc. Still there exist several vuln…
▽ More
Present world has already been consistently exploring the fine edges of online and digital world by imposing multiple challenging problems/scenarios. Similar to physical world, personal identity management is very crucial in-order to provide any secure online system. Last decade has seen a lot of work in this area using biometrics such as face, fingerprint, iris etc. Still there exist several vulnerabilities and one should have to address the problem of compromised biometrics much more seriously, since they cannot be modified easily once compromised. In this work, we have proposed a secure cancelable finger dorsal template generation network (learning domain specific features) secured via. Bio-Hashing. Proposed system effectively protects the original finger dorsal images by withdrawing compromised template and reassigning the new one. A novel Finger-Dorsal Feature Extraction Net (FDFNet) has been proposed for extracting the discriminative features. This network is exclusively trained on trait specific features without using any kind of pre-trained architecture. Later Bio-Hashing, a technique based on assigning a tokenized random number to each user, has been used to hash the features extracted from FDFNet. To test the performance of the proposed architecture, we have tested it over two benchmark public finger knuckle datasets: PolyU FKP and PolyU Contactless FKI. The experimental results shows the effectiveness of the proposed system in terms of security and accuracy.
△ Less
Submitted 13 December, 2018;
originally announced December 2018.
-
Split-Scale: Scaling Bitcoin by Partitioning the UTXO Space
Authors:
Kazım Rıfat Özyılmaz,
Harsh Patel,
Ankit Malik
Abstract:
The Bitcoin protocol is a significant milestone in the history of money. However, its adoption is currently constrained by the transaction limits of the system. As the chief problem of blockchain technology, the scaling issue has attracted many valuable solutions both on-chain and off-chain. In this paper, our goal is to explore the notion of unspent transaction outputs (UTXOs) to propose an augme…
▽ More
The Bitcoin protocol is a significant milestone in the history of money. However, its adoption is currently constrained by the transaction limits of the system. As the chief problem of blockchain technology, the scaling issue has attracted many valuable solutions both on-chain and off-chain. In this paper, our goal is to explore the notion of unspent transaction outputs (UTXOs) to propose an augmented Bitcoin protocol that can scale gracefully. Our proposal aims to increase the transaction throughput by partitioning the UTXO space and splitting the blockchain. In addition, a new type of Bitcoin node is introduced to preserve the capability to run validating nodes in low-bandwidth environments, despite the increased transaction throughput.
△ Less
Submitted 18 January, 2019; v1 submitted 22 September, 2018;
originally announced September 2018.
-
The GRIFFIN Facility for Decay-Spectroscopy Studies at TRIUMF-ISAC
Authors:
A. B. Garnsworthy,
C. E. Svensson,
M. Bowry,
R. Dunlop,
A. D. MacLean,
B. Olaizola,
J. K. Smith,
F. A. Ali,
C. Andreoiu,
J. E. Ash,
W. H. Ashfield,
G. C. Ball,
T. Ballast,
C. Bartlett,
Z. Beadle,
P. C. Bender,
N. Bernier,
S. S. Bhattacharjee,
H. Bidaman,
V. Bildstein,
D. Bishop,
P. Boubel,
R. Braid,
D. Brennan,
T. Bruhn
, et al. (79 additional authors not shown)
Abstract:
Gamma-Ray Infrastructure For Fundamental Investigations of Nuclei, GRIFFIN, is a new high-efficiency $γ$-ray spectrometer designed for use in decay spectroscopy experiments with low-energy radioactive ion beams provided by TRIUMF's Isotope Separator and Accelerator (ISAC-I) facility. GRIFFIN is composed of sixteen Compton-suppressed large-volume clover-type high-purity germanium (HPGe) $γ$-ray det…
▽ More
Gamma-Ray Infrastructure For Fundamental Investigations of Nuclei, GRIFFIN, is a new high-efficiency $γ$-ray spectrometer designed for use in decay spectroscopy experiments with low-energy radioactive ion beams provided by TRIUMF's Isotope Separator and Accelerator (ISAC-I) facility. GRIFFIN is composed of sixteen Compton-suppressed large-volume clover-type high-purity germanium (HPGe) $γ$-ray detectors combined with a suite of ancillary detection systems and coupled to a custom digital data acquisition system. The infrastructure and detectors of the spectrometer as well as the performance characteristics and the analysis techniques applied to the experimental data are described.
△ Less
Submitted 6 December, 2018; v1 submitted 17 September, 2018;
originally announced September 2018.
-
Two-loop effective potential for generalized gauge fixing
Authors:
Stephen P. Martin,
Hiren H. Patel
Abstract:
We obtain the two-loop effective potential for general renormalizable theories, using a generalized gauge-fixing scheme that includes as special cases the background-field $R_ξ$ gauges, the Fermi gauges, and the familiar Landau gauge, and using dimensional regularization in the bare and \MSbar renormalization schemes. As examples, the results are then specialized to the Abelian Higgs model and to…
▽ More
We obtain the two-loop effective potential for general renormalizable theories, using a generalized gauge-fixing scheme that includes as special cases the background-field $R_ξ$ gauges, the Fermi gauges, and the familiar Landau gauge, and using dimensional regularization in the bare and \MSbar renormalization schemes. As examples, the results are then specialized to the Abelian Higgs model and to the Standard Model. In the case of the Standard Model, we study how the vacuum expectation value and the minimum vacuum energy depend numerically on the gauge-fixing parameters. The results at fixed two-loop order exhibit non-convergent behavior for sufficiently large gauge-fixing parameters; this can presumably be addressed by a resummation of higher-order contributions.
△ Less
Submitted 22 August, 2018;
originally announced August 2018.
-
Reduced hadronic uncertainty in the determination of $V_{ud}$
Authors:
Chien-Yeah Seng,
Mikhail Gorchtein,
Hiren H. Patel,
Michael J. Ramsey-Musolf
Abstract:
We analyze the universal radiative correction $Δ_R^V$ to neutron and superallowed nuclear $β$ decay by expressing the hadronic $γW$-box contribution in terms of a dispersion relation, which we identify as an integral over the first Nachtmann moment of the $γW$ interference structure function $F_3^{(0)}$. By connecting the needed input to existing data on neutrino and antineutrino scattering, we ob…
▽ More
We analyze the universal radiative correction $Δ_R^V$ to neutron and superallowed nuclear $β$ decay by expressing the hadronic $γW$-box contribution in terms of a dispersion relation, which we identify as an integral over the first Nachtmann moment of the $γW$ interference structure function $F_3^{(0)}$. By connecting the needed input to existing data on neutrino and antineutrino scattering, we obtain an updated value of $Δ_R^V = 0.02467(22)$, wherein the hadronic uncertainty is reduced. Assuming other Standard Model theoretical calculations and experimental measurements remain unchanged, we obtain an updated value of $|V_{ud}| = 0.97366(15)$, raising tension with the first row CKM unitarity constraint. We comment on ways current and future experiments can provide input to our dispersive analysis.
△ Less
Submitted 16 August, 2018; v1 submitted 26 July, 2018;
originally announced July 2018.