-
Studying word order through iterative shuffling
Authors:
Nikolay Malkin,
Sameera Lanka,
Pranav Goel,
Nebojsa Jojic
Abstract:
As neural language models approach human performance on NLP benchmark tasks, their advances are widely seen as evidence of an increasingly complex understanding of syntax. This view rests upon a hypothesis that has not yet been empirically tested: that word order encodes meaning essential to performing these tasks. We refute this hypothesis in many cases: in the GLUE suite and in various genres of English text, the words in a sentence or phrase can rarely be permuted to form a phrase carrying substantially different information. Our surprising result relies on inference by iterative shuffling (IBIS), a novel, efficient procedure that finds the ordering of a bag of words having the highest likelihood under a fixed language model. IBIS can use any black-box model without additional training and is superior to existing word ordering algorithms. Coalescing our findings, we discuss how shuffling inference procedures such as IBIS can benefit language modeling and constrained generation.
Submitted 10 September, 2021;
originally announced September 2021.
-
Predicting the Reproducibility of Social and Behavioral Science Papers Using Supervised Learning Models
Authors:
Jian Wu,
Rajal Nivargi,
Sree Sai Teja Lanka,
Arjun Manoj Menon,
Sai Ajay Modukuri,
Nishanth Nakshatri,
Xin Wei,
Zhuoer Wang,
James Caverlee,
Sarah M. Rajtmajer,
C. Lee Giles
Abstract:
In recent years, significant effort has been invested in verifying the reproducibility and robustness of research claims in social and behavioral sciences (SBS), much of which has involved resource-intensive replication projects. In this paper, we investigate prediction of the reproducibility of SBS papers using machine learning methods based on a set of features. We propose a framework that extracts five types of features from scholarly work that can be used to support assessments of reproducibility of published research claims. Bibliometric features, venue features, and author features are collected from public APIs or extracted using open source machine learning libraries with customized parsers. Statistical features, such as p-values, are extracted by recognizing patterns in the body text. Semantic features, such as funding information, are obtained from public APIs or are extracted using natural language processing models. We analyze pairwise correlations between individual features and their importance for predicting a set of human-assessed ground truth labels. In doing so, we identify a subset of 9 top features that play relatively more important roles in predicting the reproducibility of SBS papers in our corpus. Results are verified by comparing the performance of 10 supervised predictive classifiers trained on different sets of features.
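As a rough illustration of extracting statistical features "by recognizing patterns in the body text", the sketch below pulls reported p-values out of a passage with a regular expression. The pattern and the function name `extract_p_values` are assumptions made for this example; the abstract does not specify the paper's actual patterns or parsers.

```python
import re

# Hypothetical pattern (not from the paper): matches forms such as
# "p < .05", "p = 0.032", or "P<=0.001".
P_VALUE_PATTERN = re.compile(
    r"\bp\s*(<=|>=|<|>|=)\s*(0?\.\d+|[01](?:\.\d+)?)",
    re.IGNORECASE,
)


def extract_p_values(text):
    """Return a list of (comparator, value) pairs for p-values found in text."""
    return [
        (match.group(1), float(match.group(2)))
        for match in P_VALUE_PATTERN.finditer(text)
    ]


if __name__ == "__main__":
    sample = "The effect was significant (p < .05), but not for controls, p = 0.32."
    print(extract_p_values(sample))
```

A real pipeline would likely need further patterns (for example scientific notation or "n.s." markers); this sketch covers only the simplest decimal forms.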
Submitted 21 October, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Smart Grid: A Survey of Architectural Elements, Machine Learning and Deep Learning Applications and Future Directions
Authors:
Navod Neranjan Thilakarathne,
Mohan Krishna Kagita,
Dr. Surekha Lanka,
Hussain Ahmad
Abstract:
The Smart grid (SG), generally known as the next-generation power grid, emerged as a replacement for ill-suited power systems in the 21st century. It is integrated with advanced communication and computing capabilities, and thus it is expected to enhance the reliability and the efficiency of energy distribution with minimum effects. With the massive infrastructure it holds and the underlying communication network in the system, it introduced a large volume of data that demands various techniques for proper analysis and decision making. Big data analytics, machine learning (ML), and deep learning (DL) play a key role in the analysis of this massive amount of data and the generation of valuable insights. This paper explores and surveys the Smart grid architectural elements and the machine learning and deep learning-based applications and approaches in the context of the Smart grid. In addition, in terms of machine learning-based data analytics, this paper highlights the limitations of the current research and outlines future directions.
Submitted 15 October, 2020;
originally announced October 2020.
-
A Detail Study of Security and Privacy issues of Internet of Things
Authors:
Mohan Krishna Kagita,
Navod Thilakarathne,
Dharmendra Singh Rajput,
Dr Surekha Lanka
Abstract:
The Internet of Things, or IoT, refers to the billions of physical objects around the planet that are now connected to the Internet, many of which store and exchange data without human interaction. In recent years the Internet of Things (IoT) has become a groundbreaking technical innovation with a massive impact on how information is handled in corporate companies and on how computer devices, and even kitchen equipment and appliances, are designed and made. The main focus of this chapter is to systematically review the security and privacy of the Internet of Things in the present world. Most internet users are genuine, yet others are cybercriminals intent on misusing information. With such possibilities, users should know the potential security and privacy issues of IoT devices. IoT innovations are applied on numerous levels in systems that we use in our day-to-day life. Data confidentiality is a significant issue. The interconnection of various networks makes it impossible for users to assert extensive control of their data. Finally, this chapter discusses the IoT security concerns in the literature and provides a critical review of the current approaches and proposed solutions to present issues in the privacy protection of IoT devices.
Submitted 14 September, 2020;
originally announced September 2020.
-
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay
Authors:
Sameera Lanka,
Tianfu Wu
Abstract:
Experience replay is an important technique for addressing sample-inefficiency in deep reinforcement learning (RL), but faces difficulty in learning from binary and sparse rewards due to disproportionately few successful experiences in the replay buffer. Hindsight experience replay (HER) was recently proposed to tackle this difficulty by manipulating unsuccessful transitions, but in doing so, HER introduces a significant bias in the replay buffer experiences and therefore achieves a suboptimal improvement in sample-efficiency. In this paper, we present an analysis on the source of bias in HER, and propose a simple and effective method to counter the bias, to most effectively harness the sample-efficiency provided by HER. Our method, motivated by counter-factual reasoning and called ARCHER, extends HER with a trade-off to make rewards calculated for hindsight experiences numerically greater than real rewards. We validate our algorithm on two continuous control environments from DeepMind Control Suite - Reacher and Finger, which simulate manipulation tasks with a robotic arm - in combination with various reward functions, task complexities and goal sampling strategies. Our experiments consistently demonstrate that countering bias using more aggressive hindsight rewards increases sample efficiency, thus establishing the greater benefit of ARCHER in RL applications with limited computing budget.
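The idea of making hindsight rewards numerically greater than real rewards can be sketched as follows. This is only an illustration under stated assumptions: the additive `bonus`, the tolerance `tol`, the dictionary layout of a transition, and both function names are hypothetical, and the abstract does not give ARCHER's actual reward formula or trade-off.

```python
import math


def sparse_reward(achieved, goal, tol=0.05):
    """Binary sparse reward: 0.0 if the goal is reached within tol, else -1.0."""
    return 0.0 if math.dist(achieved, goal) <= tol else -1.0


def relabel_with_bonus(transition, hindsight_goal, bonus=0.5):
    """Relabel a transition with a hindsight goal (as in HER) and boost its reward.

    The additive bonus is an assumption for illustration; it simply makes the
    hindsight reward larger than the real reward for the same outcome.
    """
    real = sparse_reward(transition["achieved"], hindsight_goal)
    # Copy the transition so the original replay-buffer entry is untouched.
    return {**transition, "goal": hindsight_goal, "reward": real + bonus}


if __name__ == "__main__":
    failed = {"achieved": [1.0, 1.0], "goal": [0.0, 0.0], "reward": -1.0}
    # Pretend the state actually reached was the goal all along.
    print(relabel_with_bonus(failed, [1.0, 1.0]))
```

With `bonus=0.0` the sketch reduces to plain hindsight relabeling, which makes the effect of the more aggressive reward easy to compare.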
Submitted 6 September, 2018; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Umbilical Cord Blood Banking and its Therapeutic Uses
Authors:
Nivethika Sivakumaran,
Imesha Rashmini Rathnayaka,
Rashida Shabbir,
Sasini Sandareka Wimalsinghe,
J. A. Sumalimina Jayakody,
Mahisha Chandrasekaran,
Mawatha,
Sri Lanka
Abstract:
Umbilical cord blood (UBC) can be viewed as the most promising source of stem cells, as its collection cost is minimal and its benefits are immense. Cord blood is used to treat malignant and nonmalignant diseases; this is due to its progenitor characteristics, known as stem cells. Its immunological immaturity and high plasticity have made it superior to other sources of stem cells. The stem cells collected from cord blood have neutral differentiation capabilities which allow medical professionals to produce functional neural cells from these stem cells. Cord Blood Banking (CBB) is the storing of the umbilical cord blood which is collected immediately after the delivery of the baby. Great care and concern are needed for proper storage of these progenitor cells; hence cord blood banks come into play. They are of three types: public, private, and direct donation banks. Clinical trials are still at a very early stage, with much still to be uncovered, but the results obtained have demonstrated high potential and more scope for the development of effective therapies and treatments for rare disorders.
Submitted 21 February, 2018;
originally announced February 2018.