Search | arXiv e-print repository

Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning

Authors: Jacob Wiebe, Ranwa Al Mallah, Li Li

Abstract: Recent advancements in deep learning techniques have opened new possibilities for designing solutions for autonomous cyber defence. Teams of intelligent agents in computer network defence roles may reveal promising avenues to safeguard cyber and kinetic assets. In a simulated game environment, agents are evaluated on their ability to jointly mitigate attacker activity in host-based defence scenari… ▽ More Recent advancements in deep learning techniques have opened new possibilities for designing solutions for autonomous cyber defence. Teams of intelligent agents in computer network defence roles may reveal promising avenues to safeguard cyber and kinetic assets. In a simulated game environment, agents are evaluated on their ability to jointly mitigate attacker activity in host-based defence scenarios. Defender systems are evaluated against heuristic attackers with the goals of compromising network confidentiality, integrity, and availability. Value-based Independent Learning and Centralized Training Decentralized Execution (CTDE) cooperative Multi-Agent Reinforcement Learning (MARL) methods are compared revealing that both approaches outperform a simple multi-agent heuristic defender. This work demonstrates the ability of cooperative MARL to learn effective cyber defence tactics against varied threats. △ Less

Submitted 25 August, 2023; originally announced October 2023.

Comments: Presented at 2nd International Workshop on Adaptive Cyber Defense, 2023 (arXiv:2308.09520)

Report number: ACD/2023/106

arXiv:2201.10035 [pdf, other]

doi 10.1016/j.ces.2022.117469

Maximizing information from chemical engineering data sets: Applications to machine learning

Authors: Alexander Thebelt, Johannes Wiebe, Jan Kronqvist, Calvin Tsay, Ruth Misener

Abstract: It is well-documented how artificial intelligence can have (and already is having) a big impact on chemical engineering. But classical machine learning approaches may be weak for many chemical engineering applications. This review discusses how challenging data characteristics arise in chemical engineering applications. We identify four characteristics of data arising in chemical engineering appli… ▽ More It is well-documented how artificial intelligence can have (and already is having) a big impact on chemical engineering. But classical machine learning approaches may be weak for many chemical engineering applications. This review discusses how challenging data characteristics arise in chemical engineering applications. We identify four characteristics of data arising in chemical engineering applications that make applying classical artificial intelligence approaches difficult: (1) high variance, low volume data, (2) low variance, high volume data, (3) noisy/corrupt/missing data, and (4) restricted data with physics-based limitations. For each of these four data characteristics, we discuss applications where these data characteristics arise and show how current chemical engineering research is extending the fields of data science and machine learning to incorporate these challenges. Finally, we identify several challenges for future research. △ Less

Submitted 24 January, 2022; originally announced January 2022.

Comments: 34 pages, 3 figures, 1 table

arXiv:1909.12328 [pdf, other]

Approximation Algorithms for Process Systems Engineering

Authors: Dimitrios Letsios, Radu Baltean-Lugojan, Francesco Ceccon, Miten Mistry, Johannes Wiebe, Ruth Misener

Abstract: Designing and analyzing algorithms with provable performance guarantees enables efficient optimization problem solving in different application domains, e.g.\ communication networks, transportation, economics, and manufacturing. Despite the significant contributions of approximation algorithms in engineering, only limited and isolated works contribute from this perspective in process systems engin… ▽ More Designing and analyzing algorithms with provable performance guarantees enables efficient optimization problem solving in different application domains, e.g.\ communication networks, transportation, economics, and manufacturing. Despite the significant contributions of approximation algorithms in engineering, only limited and isolated works contribute from this perspective in process systems engineering. The current paper discusses three representative, NP-hard problems in process systems engineering: (i) pooling, (ii) process scheduling, and (iii) heat exchanger network synthesis. We survey relevant results and raise major open questions. Further, we present approximation algorithms applications which are relevant to process systems engineering: (i) better mathematical modeling, (ii) problem classification, (iii) designing solution methods, and (iv) dealing with uncertainty. This paper aims to motivate further research at the intersection of approximation algorithms and process systems engineering. △ Less

Submitted 26 September, 2019; originally announced September 2019.

arXiv:1707.01890 [pdf, other]

An Interactive Tool for Natural Language Processing on Clinical Text

Authors: Gaurav Trivedi, Phuong Pham, Wendy Chapman, Rebecca Hwa, Janyce Wiebe, Harry Hochheiser

Abstract: Natural Language Processing (NLP) systems often make use of machine learning techniques that are unfamiliar to end-users who are interested in analyzing clinical records. Although NLP has been widely used in extracting information from clinical text, current systems generally do not support model revision based on feedback from domain experts. We present a prototype tool that allows end users to… ▽ More Natural Language Processing (NLP) systems often make use of machine learning techniques that are unfamiliar to end-users who are interested in analyzing clinical records. Although NLP has been widely used in extracting information from clinical text, current systems generally do not support model revision based on feedback from domain experts. We present a prototype tool that allows end users to visualize and review the outputs of an NLP system that extracts binary variables from clinical text. Our tool combines multiple visualizations to help the users understand these results and make any necessary corrections, thus forming a feedback loop and hel** improve the accuracy of the NLP models. We have tested our prototype in a formative think-aloud user study with clinicians and researchers involved in colonoscopy research. Results from semi-structured interviews and a System Usability Scale (SUS) analysis show that the users are able to quickly start refining NLP models, despite having very little or no experience with machine learning. Observations from these sessions suggest revisions to the interface to better support review workflow and interpretation of results. △ Less

Submitted 7 July, 2017; v1 submitted 6 July, 2017; originally announced July 2017.

Comments: 8 pages, 2 figures, 2 tables, Presented at IUI TextVis 2015 Workshop

arXiv:1404.6491 [pdf, ps, other]

An Account of Opinion Implicatures

Authors: Janyce Wiebe, Lingjia Deng

Abstract: While previous sentiment analysis research has concentrated on the interpretation of explicitly stated opinions and attitudes, this work initiates the computational study of a type of opinion implicature (i.e., opinion-oriented inference) in text. This paper described a rule-based framework for representing and analyzing opinion implicatures which we hope will contribute to deeper automatic interp… ▽ More While previous sentiment analysis research has concentrated on the interpretation of explicitly stated opinions and attitudes, this work initiates the computational study of a type of opinion implicature (i.e., opinion-oriented inference) in text. This paper described a rule-based framework for representing and analyzing opinion implicatures which we hope will contribute to deeper automatic interpretation of subjective language. In the course of understanding implicatures, the system recognizes implicit sentiments (and beliefs) toward various events and entities in the sentence, often attributed to different sources (holders) and of mixed polarities; thus, it produces a richer interpretation than is typical in opinion analysis. △ Less

Submitted 23 April, 2014; originally announced April 2014.

Comments: 50 Pages. Submitted to the journal, Language Resources and Evaluation

arXiv:cs/9901005 [pdf, ps, other]

An Empirical Approach to Temporal Reference Resolution (journal version)

Authors: Janyce Wiebe, Thomas P. O'Hara, Thorsten Ohrstrom-Sandgren, Kenneth K. McKeever

Abstract: Scheduling dialogs, during which people negotiate the times of appointments, are common in everyday life. This paper reports the results of an in-depth empirical investigation of resolving explicit temporal references in scheduling dialogs. There are four phases of this work: data annotation and evaluation, model development, system implementation and evaluation, and model evaluation and analysi… ▽ More Scheduling dialogs, during which people negotiate the times of appointments, are common in everyday life. This paper reports the results of an in-depth empirical investigation of resolving explicit temporal references in scheduling dialogs. There are four phases of this work: data annotation and evaluation, model development, system implementation and evaluation, and model evaluation and analysis. The system and model were developed primarily on one set of data, and then applied later to a much more complex data set, to assess the generalizability of the model for the task being performed. Many different types of empirical methods are applied to pinpoint the strengths and weaknesses of the approach. Detailed annotation instructions were developed and an intercoder reliability study was performed, showing that naive annotators can reliably perform the targeted annotations. A fully automatic system has been developed and evaluated on unseen test data, with good results on both data sets. We adopt a pure realization of a recency-based focus model to identify precisely when it is and is not adequate for the task being addressed. In addition to system results, an in-depth evaluation of the model itself is presented, based on detailed manual annotations. The results are that few errors occur specifically due to the model of focus being used, and the set of anaphoric relations defined in the model are low in ambiguity for both data sets. △ Less

Submitted 13 January, 1999; originally announced January 1999.

Comments: Tar archive with LaTeX source, postscript figures, and style files

ACM Class: I.2.7

Journal ref: Journal of Artificial Intelligence Research (JAIR), 9:247-293

arXiv:cmp-lg/9710008 [pdf, ps, other]

Probabilistic Event Categorization

Authors: Janyce Wiebe, Rebecca Bruce, Lei Duan

Abstract: This paper describes the automation of a new text categorization task. The categories assigned in this task are more syntactically, semantically, and contextually complex than those typically assigned by fully automatic systems that process unseen test data. Our system for assigning these categories is a probabilistic classifier, developed with a recent method for formulating a probabilistic mod… ▽ More This paper describes the automation of a new text categorization task. The categories assigned in this task are more syntactically, semantically, and contextually complex than those typically assigned by fully automatic systems that process unseen test data. Our system for assigning these categories is a probabilistic classifier, developed with a recent method for formulating a probabilistic model from a predefined set of potential features. This paper focuses on feature selection. It presents a number of fully automatic features. It identifies and evaluates various approaches to organizing collocational properties into features, and presents the results of experiments covarying type of organization and type of property. We find that one organization is not best for all kinds of properties, so this is an experimental parameter worth investigating in NLP systems. In addition, the results suggest a way to take advantage of properties that are low frequency but strongly indicative of a class. The problems of recognizing and organizing the various kinds of contextual information required to perform a linguistically complex categorization task have rarely been systematically investigated in NLP. △ Less

Submitted 30 October, 1997; v1 submitted 30 October, 1997; originally announced October 1997.

Journal ref: Recent Advances in Natural Language Processing (RANLP-97), European Commission, DG XIII, Tzigov Chark, Bulgaria, September 1997, pp. 163--170.

arXiv:cmp-lg/9706020 [pdf, ps, other]

An Empirical Approach to Temporal Reference Resolution

Authors: Janyce Wiebe, Tom O'Hara, Kenneth McKeever, Thorsten Oehrstroem-Sandgren

Abstract: This paper presents the results of an empirical investigation of temporal reference resolution in scheduling dialogs. The algorithm adopted is primarily a linear-recency based approach that does not include a model of global focus. A fully automatic system has been developed and evaluated on unseen test data with good results. This paper presents the results of an intercoder reliability study, a… ▽ More This paper presents the results of an empirical investigation of temporal reference resolution in scheduling dialogs. The algorithm adopted is primarily a linear-recency based approach that does not include a model of global focus. A fully automatic system has been developed and evaluated on unseen test data with good results. This paper presents the results of an intercoder reliability study, a model of temporal reference resolution that supports linear recency and has very good coverage, the results of the system evaluated on unseen test data, and a detailed analysis of the dialogs assessing the viability of the approach. △ Less

Submitted 16 June, 1997; originally announced June 1997.

Comments: 13 pages, latex using aclap.sty

Journal ref: Proceedings of the Second Conference On Empirical Methods in Natural Language Processing (EMNLP-2), August 1-2, 1997, Providence, RI

arXiv:cmp-lg/9702016 [pdf, ps, other]

Instructions for Temporal Annotation of Scheduling Dialogs

Authors: Tom O'Hara, Janyce Wiebe, Karen Payne

Abstract: Human annotation of natural language facilitates standardized evaluation of natural language processing systems and supports automated feature extraction. This document consists of instructions for annotating the temporal information in scheduling dialogs, dialogs in which the participants schedule a meeting with one another. Task-oriented dialogs, such as these are, would arise in many useful a… ▽ More Human annotation of natural language facilitates standardized evaluation of natural language processing systems and supports automated feature extraction. This document consists of instructions for annotating the temporal information in scheduling dialogs, dialogs in which the participants schedule a meeting with one another. Task-oriented dialogs, such as these are, would arise in many useful applications, for instance, automated information providers and automated phone operators. Explicit instructions support good inter-rater reliability and serve as documentation for the classes being annotated. △ Less

Submitted 27 February, 1997; originally announced February 1997.

Comments: 14 pages

Report number: MCCS-97-308

arXiv:cmp-lg/9702008 [pdf, ps, other]

Sequential Model Selection for Word Sense Disambiguation

Authors: Ted Pedersen, Rebecca Bruce, Janyce Wiebe

Abstract: Statistical models of word-sense disambiguation are often based on a small number of contextual features or on a model that is assumed to characterize the interactions among a set of features. Model selection is presented as an alternative to these approaches, where a sequential search of possible models is conducted in order to find the model that best characterizes the interactions among featu… ▽ More Statistical models of word-sense disambiguation are often based on a small number of contextual features or on a model that is assumed to characterize the interactions among a set of features. Model selection is presented as an alternative to these approaches, where a sequential search of possible models is conducted in order to find the model that best characterizes the interactions among features. This paper expands existing model selection methodology and presents the first comparative study of model selection search strategies and evaluation criteria when applied to the problem of building probabilistic classifiers for word-sense disambiguation. △ Less

Submitted 11 February, 1997; originally announced February 1997.

Comments: 8 pages, Latex, uses aclap.sty

Journal ref: Proceedings of the Fifth Conference on Applied Natural Language Processing, April 1997, Washington, DC

arXiv:cmp-lg/9604018 [pdf, ps]

The Measure of a Model

Authors: Rebecca Bruce, Janyce Wiebe, Ted Pedersen

Abstract: This paper describes measures for evaluating the three determinants of how well a probabilistic classifier performs on a given test set. These determinants are the appropriateness, for the test set, of the results of (1) feature selection, (2) formulation of the parametric form of the model, and (3) parameter estimation. These are part of any model formulation procedure, even if not broken out a… ▽ More This paper describes measures for evaluating the three determinants of how well a probabilistic classifier performs on a given test set. These determinants are the appropriateness, for the test set, of the results of (1) feature selection, (2) formulation of the parametric form of the model, and (3) parameter estimation. These are part of any model formulation procedure, even if not broken out as separate steps, so the tradeoffs explored in this paper are relevant to a wide variety of methods. The measures are demonstrated in a large experiment, in which they are used to analyze the results of roughly 300 classifiers that perform word-sense disambiguation. △ Less

Submitted 28 April, 1996; originally announced April 1996.

Comments: 12 pages, uuencoded compressed postscript file

Journal ref: In Proceedings of the Empirical Methods in Natural Language Processing Conference, May 1996, Philadelphia, PA

arXiv:cmp-lg/9407019 [pdf, ps]

Tracking Point of View in Narrative

Authors: Janyce M. Wiebe

Abstract: Third-person fictional narrative text is composed not only of passages that objectively narrate events, but also of passages that present characters' thoughts, perceptions, and inner states. Such passages take a character's ``psychological point of view''. A language understander must determine the current psychological point of view in order to distinguish the beliefs of the characters from the… ▽ More Third-person fictional narrative text is composed not only of passages that objectively narrate events, but also of passages that present characters' thoughts, perceptions, and inner states. Such passages take a character's ``psychological point of view''. A language understander must determine the current psychological point of view in order to distinguish the beliefs of the characters from the facts of the story, to correctly attribute beliefs and other attitudes to their sources, and to understand the discourse relations among sentences. Tracking the psychological point of view is not a trivial problem, because many sentences are not explicitly marked for point of view, and whether the point of view of a sentence is objective or that of a character (and if the latter, which character it is) often depends on the context in which the sentence appears. Tracking the psychological point of view is the problem addressed in this work. The approach is to seek, by extensive examinations of naturally-occurring narrative, regularities in the ways that authors manipulate point of view, and to develop an algorithm that tracks point of view on the basis of the regularities found. This paper presents this algorithm, gives demonstrations of an implemented system, and describes the results of some preliminary empirical studies, which lend support to the algorithm. △ Less

Submitted 22 July, 1994; originally announced July 1994.

Comments: 55 pages, uuencoded compressed ps, appears in Computational Linguistics 20:2, pp. 233-287 (electronic version does not reflect all copy-editing changes)

Journal ref: Computational Lingustics 20:2, 233-287

arXiv:cmp-lg/9406005 [pdf, ps]

Word-Sense Disambiguation Using Decomposable Models

Authors: Rebecca Bruce, Janyce Wiebe

Abstract: Most probabilistic classifiers used for word-sense disambiguation have either been based on only one contextual feature or have used a model that is simply assumed to characterize the interdependencies among multiple contextual features. In this paper, a different approach to formulating a probabilistic model is presented along with a case study of the performance of models produced in this mann… ▽ More Most probabilistic classifiers used for word-sense disambiguation have either been based on only one contextual feature or have used a model that is simply assumed to characterize the interdependencies among multiple contextual features. In this paper, a different approach to formulating a probabilistic model is presented along with a case study of the performance of models produced in this manner for the disambiguation of the noun "interest". We describe a method for formulating probabilistic models that use multiple contextual features for word-sense disambiguation, without requiring untested assumptions regarding the form of the model. Using this approach, the joint distribution of all variables is described by only the most systematic variable interactions, thereby limiting the number of parameters to be estimated, supporting computational efficiency, and providing an understanding of the data. △ Less

Submitted 1 June, 1994; originally announced June 1994.

Comments: 8 pages, Unix compressed, uuencoded Postscript file

Report number: To appear in ACL-94

Showing 1–13 of 13 results for author: Wiebe, J