Skip to main content

Showing 1–36 of 36 results for author: Levy, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16086  [pdf, other

    cs.CL

    SEAM: A Stochastic Benchmark for Multi-Document Tasks

    Authors: Gili Lior, Avi Caciularu, Arie Cattan, Shahar Levy, Ori Shapira, Gabriel Stanovsky

    Abstract: Various tasks, such as summarization, multi-hop question answering, or coreference resolution, are naturally phrased over collections of real-world documents. Such tasks present a unique set of challenges, revolving around the lack of coherent narrative structure across documents, which often leads to contradiction, omission, or repetition of information. Despite their real-world application and c… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2403.11092  [pdf, other

    cs.CL cs.AI cs.CV cs.CY eess.IV

    Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts

    Authors: Michael Saxon, Yiran Luo, Sharon Levy, Chitta Baral, Yezhou Yang, William Yang Wang

    Abstract: Benchmarks of the multilingual capabilities of text-to-image (T2I) models compare generated images prompted in a test language to an expected image distribution over a concept set. One such benchmark, "Conceptual Coverage Across Languages" (CoCo-CroLa), assesses the tangible noun inventory of T2I models by prompting them to generate pictures from a concept list translated to seven languages and co… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 Main Conference

  3. arXiv:2403.04858  [pdf, other

    cs.CL

    Evaluating Biases in Context-Dependent Health Questions

    Authors: Sharon Levy, Tahilin Sanchez Karver, William D. Adler, Michelle R. Kaufman, Mark Dredze

    Abstract: Chat-based large language models have the opportunity to empower individuals lacking high-quality healthcare access to receive personalized information across a variety of topics. However, users may ask underspecified questions that require additional context for a model to correctly answer. We study how large language model biases are exhibited through these contextual questions in the healthcare… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2312.01217  [pdf, other

    cs.SI cs.CL cs.LG

    Understanding Opinions Towards Climate Change on Social Media

    Authors: Yashaswi Pupneja, Joseph Zou, Sacha Lévy, Shenyang Huang

    Abstract: Social media platforms such as Twitter (now known as X) have revolutionized how the public engage with important societal and political topics. Recently, climate change discussions on social media became a catalyst for political polarization and the spreading of misinformation. In this work, we aim to understand how real world events influence the opinions of individuals towards climate change rel… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  5. arXiv:2310.09624  [pdf, other

    cs.CL cs.AI cs.LG

    ASSERT: Automated Safety Scenario Red Teaming for Evaluating the Robustness of Large Language Models

    Authors: Alex Mei, Sharon Levy, William Yang Wang

    Abstract: As large language models are integrated into society, robustness toward a suite of prompts is increasingly important to maintain reliability in a high-variance environment.Robustness evaluations must comprehensively encapsulate the various settings in which a user may invoke an intelligent system. This paper proposes ASSERT, Automated Safety Scenario Red Teaming, consisting of three methods -- sem… ▽ More

    Submitted 11 November, 2023; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing

  6. arXiv:2310.01618  [pdf, other

    cs.LG math.NA

    Operator Learning Meets Numerical Analysis: Improving Neural Networks through Iterative Methods

    Authors: Emanuele Zappala, Daniel Levine, Sizhuang He, Syed Rizvi, Sacha Levy, David van Dijk

    Abstract: Deep neural networks, despite their success in numerous applications, often function without established theoretical foundations. In this paper, we bridge this gap by drawing parallels between deep learning and classical numerical analysis. By framing neural networks as operators with fixed points representing desired solutions, we develop a theoretical framework grounded in iterative methods for… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 27 pages (13+14). 8 Figures and 5 tables. Comments are welcome!

  7. arXiv:2308.13699  [pdf, other

    cs.SI cs.LG

    Party Prediction for Twitter

    Authors: Kellin Pelrine, Anne Imouza, Zachary Yang, Jacob-Junqi Tian, Sacha Lévy, Gabrielle Desrosiers-Brisebois, Aarash Feizi, Cécile Amadoro, André Blais, Jean-François Godbout, Reihaneh Rabbany

    Abstract: A large number of studies on social media compare the behaviour of users from different political parties. As a basic step, they employ a predictive model for inferring their political affiliation. The accuracy of this model can change the conclusions of a downstream analysis significantly, yet the choice between different models seems to be made arbitrarily. In this paper, we provide a comprehens… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  8. arXiv:2307.04228  [pdf, other

    physics.geo-ph cs.LG

    Bayesian tomography using polynomial chaos expansion and deep generative networks

    Authors: Giovanni Angelo Meles, Macarena Amaya, Shiran Levy, Stefano Marelli, Niklas Linde

    Abstract: Implementations of Markov chain Monte Carlo (MCMC) methods need to confront two fundamental challenges: accurate representation of prior information and efficient evaluation of likelihoods. Principal component analysis (PCA) and related techniques can in some cases facilitate the definition and sampling of the prior distribution, as well as the training of accurate surrogate models, using for inst… ▽ More

    Submitted 19 October, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: 25 pages, 15 figures

  9. arXiv:2305.11242  [pdf, other

    cs.CL

    Comparing Biases and the Impact of Multilingual Training across Multiple Languages

    Authors: Sharon Levy, Neha Anna John, Ling Liu, Yogarshi Vyas, Jie Ma, Yoshinari Fu**uma, Miguel Ballesteros, Vittorio Castelli, Dan Roth

    Abstract: Studies in bias and fairness in natural language processing have primarily examined social biases within a single language and/or across few attributes (e.g. gender, race). However, biases can manifest differently across various languages for individual attributes. As a result, it is critical to examine biases within each language and attribute. Of equal importance is to study how these biases com… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  10. arXiv:2304.11122  [pdf, other

    cs.DC

    Measuring Thread Timing to Assess the Feasibility of Early-bird Message Delivery

    Authors: W. Pepper Marts, Matthew G. F. Dosanjh, Whit Schonbein, Scott Levy, Patrick G. Bridges

    Abstract: Early-bird communication is a communication/computation overlap technique that combines fine-grained communication with partitioned communication to improve application run-time. Communication is divided among the compute threads such that each individual thread can initiate transmission of its portion of the data as soon as it is complete rather than waiting for all of the threads. However, the b… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Report number: SAND2023-02469O

  11. arXiv:2212.09667  [pdf, other

    cs.CL cs.AI cs.LG

    Foveate, Attribute, and Rationalize: Towards Physically Safe and Trustworthy AI

    Authors: Alex Mei, Sharon Levy, William Yang Wang

    Abstract: Users' physical safety is an increasing concern as the market for intelligent systems continues to grow, where unconstrained systems may recommend users dangerous actions that can lead to serious injury. Covertly unsafe text is an area of particular interest, as such text may arise from everyday scenarios and are challenging to detect as harmful. We propose FARM, a novel framework leveraging exter… ▽ More

    Submitted 19 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: In Findings of the 2023 Conference of the Association for Computational Linguistics

  12. arXiv:2210.12152  [pdf, other

    cs.CL cs.AI

    WikiWhy: Answering and Explaining Cause-and-Effect Questions

    Authors: Matthew Ho, Aditya Sharma, Justin Chang, Michael Saxon, Sharon Levy, Yujie Lu, William Yang Wang

    Abstract: As large language models (LLMs) grow larger and more sophisticated, assessing their "reasoning" capabilities in natural language grows more challenging. Recent question answering (QA) benchmarks that attempt to assess reasoning are often limited by a narrow scope of covered situations and subject matters. We introduce WikiWhy, a QA dataset built around a novel auxiliary task: explaining why an ans… ▽ More

    Submitted 30 November, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

  13. arXiv:2210.10045  [pdf, other

    cs.CL cs.AI

    SafeText: A Benchmark for Exploring Physical Safety in Language Models

    Authors: Sharon Levy, Emily Allaway, Melanie Subbiah, Lydia Chilton, Desmond Patton, Kathleen McKeown, William Yang Wang

    Abstract: Understanding what constitutes safe text is an important issue in natural language processing and can often prevent the deployment of models deemed harmful and unsafe. One such type of safety that has been scarcely studied is commonsense physical safety, i.e. text that is not explicitly violent and requires additional commonsense knowledge to comprehend that it leads to physical harm. We create th… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  14. arXiv:2210.09306  [pdf, other

    cs.AI cs.CL cs.LG

    Mitigating Covertly Unsafe Text within Natural Language Systems

    Authors: Alex Mei, Anisha Kabir, Sharon Levy, Melanie Subbiah, Emily Allaway, John Judge, Desmond Patton, Bruce Bimber, Kathleen McKeown, William Yang Wang

    Abstract: An increasingly prevalent problem for intelligent technologies is text safety, as uncontrolled systems may generate recommendations to their users that lead to injury or life-threatening consequences. However, the degree of explicitness of a generated statement that can cause physical harm varies. In this paper, we distinguish types of text that can lead to physical harm and establish one particul… ▽ More

    Submitted 20 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: In Findings of the 2022 Conference on Empirical Methods in Natural Language Processing

  15. arXiv:2209.11135  [pdf, other

    cs.SI cs.IR

    Active Keyword Selection to Track Evolving Topics on Twitter

    Authors: Sacha Lévy, Farimah Poursafaei, Kellin Pelrine, Reihaneh Rabbany

    Abstract: How can we study social interactions on evolving topics at a mass scale? Over the past decade, researchers from diverse fields such as economics, political science, and public health have often done this by querying Twitter's public API endpoints with hand-picked topical keywords to search or stream discussions. However, despite the API's accessibility, it remains difficult to select and update ke… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 10 pages, 3 figures

  16. arXiv:2205.09830  [pdf, ps, other

    cs.CL

    Towards Understanding Gender-Seniority Compound Bias in Natural Language Generation

    Authors: Samhita Honnavalli, Aesha Parekh, Lily Ou, Sophie Groenwold, Sharon Levy, Vicente Ordonez, William Yang Wang

    Abstract: Women are often perceived as junior to their male counterparts, even within the same job titles. While there has been significant progress in the evaluation of gender bias in natural language processing (NLP), existing studies seldom investigate how biases toward gender groups change when compounded with other societal biases. In this work, we investigate how seniority impacts the degree of gender… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 6 pages, LREC 2022

  17. arXiv:2204.13243  [pdf, other

    cs.CL

    HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data

    Authors: Kai Nakamura, Sharon Levy, Yi-Lin Tuan, Wenhu Chen, William Yang Wang

    Abstract: A pressing challenge in current dialogue systems is to successfully converse with users on topics with information distributed across different modalities. Previous work in multiturn dialogue systems has primarily focused on either text or table information. In more realistic scenarios, having a joint understanding of both is critical as knowledge is typically distributed over both unstructured an… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Findings of ACL 2022

  18. arXiv:2204.05959  [pdf

    cs.DC cs.PF

    "Smarter" NICs for faster molecular dynamics: a case study

    Authors: Sara Karamati, Clayton Hughes, K. Scott Hemmert, Ryan E. Grant, W. Whit Schonbein, Scott Levy, Thomas M. Conte, Jeffrey Young, Richard W. Vuduc

    Abstract: This work evaluates the benefits of using a "smart" network interface card (SmartNIC) as a compute accelerator for the example of the MiniMD molecular dynamics proxy application. The accelerator is NVIDIA's BlueField-2 card, which includes an 8-core Arm processor along with a small amount of DRAM and storage. We test the networking and data movement performance of these cards compared to a standar… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  19. arXiv:2201.11153  [pdf, other

    cs.CL cs.IR

    Addressing Issues of Cross-Linguality in Open-Retrieval Question Answering Systems For Emergent Domains

    Authors: Alon Albalak, Sharon Levy, William Yang Wang

    Abstract: Open-retrieval question answering systems are generally trained and tested on large datasets in well-established domains. However, low-resource settings such as new and emerging domains would especially benefit from reliable question answering systems. Furthermore, multilingual and cross-lingual resources in emergent domains are scarce, leading to few or no such systems. In this paper, we demonstr… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 6 pages, 8 figures

  20. arXiv:2110.13819  [pdf, other

    cs.CV cs.GR cs.LG eess.IV

    CloudFindr: A Deep Learning Cloud Artifact Masker for Satellite DEM Data

    Authors: Kalina Borkiewicz, Viraj Shah, J. P. Naiman, Chuanyue Shen, Stuart Levy, Jeff Carpenter

    Abstract: Artifact removal is an integral component of cinematic scientific visualization, and is especially challenging with big datasets in which artifacts are difficult to define. In this paper, we describe a method for creating cloud artifact masks which can be used to remove artifacts from satellite imagery using a combination of traditional image processing together with deep learning based on U-Net.… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  21. arXiv:2110.06962  [pdf, other

    cs.CL cs.IR

    Open-Domain Question-Answering for COVID-19 and Other Emergent Domains

    Authors: Sharon Levy, Kevin Mo, Wenhan Xiong, William Yang Wang

    Abstract: Since late 2019, COVID-19 has quickly emerged as the newest biomedical domain, resulting in a surge of new information. As with other emergent domains, the discussion surrounding the topic has been rapidly changing, leading to the spread of misinformation. This has created the need for a public space for users to ask questions and receive credible, scientific answers. To fulfill this need, we turn… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 Demo

  22. arXiv:2109.08490  [pdf, other

    cs.LG cs.AI cs.RO

    Integrating Deep Reinforcement and Supervised Learning to Expedite Indoor Map**

    Authors: Elchanan Zwecher, Eran Iceland, Sean R. Levy, Shmuel Y. Hayoun, Oren Gal, Ariel Barel

    Abstract: The challenge of map** indoor environments is addressed. Typical heuristic algorithms for solving the motion planning problem are frontier-based methods, that are especially effective when the environment is completely unknown. However, in cases where prior statistical data on the environment's architectonic features is available, such algorithms can be far from optimal. Furthermore, their calcu… ▽ More

    Submitted 27 February, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted to ICRA-22 conference (23-27 May, 2022)

  23. arXiv:2109.03858  [pdf, other

    cs.CL

    Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

    Authors: Shahar Levy, Koren Lazar, Gabriel Stanovsky

    Abstract: Recent works have found evidence of gender bias in models of machine translation and coreference resolution using mostly synthetic diagnostic datasets. While these quantify bias in a controlled experiment, they often do so on a small scale and consist mostly of artificial, out-of-distribution sentences. In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gende… ▽ More

    Submitted 10 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to Findings of EMNLP 2021

  24. Modeling Disclosive Transparency in NLP Application Descriptions

    Authors: Michael Saxon, Sharon Levy, Xinyi Wang, Alon Albalak, William Yang Wang

    Abstract: Broader disclosive transparency$-$truth and clarity in communication regarding the function of AI systems$-$is widely considered desirable. Unfortunately, it is a nebulous concept, difficult to both define and quantify. This is problematic, as previous work has demonstrated possible trade-offs and negative consequences to disclosive transparency, such as a confusion effect, where "too much informa… ▽ More

    Submitted 10 September, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: To appear at EMNLP 2021. 15 pages, 10 figures, 7 tables

    Journal ref: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp 2023-2037

  25. arXiv:2101.00379  [pdf, other

    cs.CL cs.CY

    Investigating Memorization of Conspiracy Theories in Text Generation

    Authors: Sharon Levy, Michael Saxon, William Yang Wang

    Abstract: The adoption of natural language generation (NLG) models can leave individuals vulnerable to the generation of harmful information memorized by the models, such as conspiracy theories. While previous studies examine conspiracy theories in the context of social media, they have not evaluated their presence in the new space of generative language models. In this work, we investigate the capability o… ▽ More

    Submitted 8 June, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: ACL 2021 Findings

  26. arXiv:2011.02043  [pdf, other

    cs.LG cs.RO

    Deep-Learning-Aided Path Planning and Map Construction for Expediting Indoor Map**

    Authors: Elchanan Zwecher, Eran Iceland, Shmuel Y. Hayoun, Ahavatya Revivo, Sean R. Levy, Ariel Barel

    Abstract: The problem of autonomous indoor map** is addressed. The goal is to minimize the time to achieve a predefined percentage of exposure with some desired level of certainty. The use of a pre-trained generative deep neural network, acting as a map predictor, in both the path planning and the map construction is proposed in order to expedite the map** process. This method is examined in combination… ▽ More

    Submitted 13 August, 2022; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: Submitted to Robotics and Autonomous Systems journal

  27. arXiv:2010.02510  [pdf, other

    cs.CL cs.AI

    Investigating African-American Vernacular English in Transformer-Based Text Generation

    Authors: Sophie Groenwold, Lily Ou, Aesha Parekh, Samhita Honnavalli, Sharon Levy, Diba Mirza, William Yang Wang

    Abstract: The growth of social media has encouraged the written use of African American Vernacular English (AAVE), which has traditionally been used only in oral contexts. However, NLP models have historically been developed using dominant English varieties, such as Standard American English (SAE), due to text corpora availability. We investigate the performance of GPT-2 on AAVE text by creating a dataset o… ▽ More

    Submitted 29 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 7 pages, EMNLP 2020

  28. arXiv:2007.15759  [pdf, other

    cs.CR

    The Program with a Personality: Analysis of Elk Cloner, the First Personal Computer Virus

    Authors: Scott Levy, Jedidiah R. Crandall

    Abstract: Although self-replicating programs and viruses have existed since the 1960s and 70s, Elk Cloner was the first virus to circulate among personal computers in the wild. Despite its historical significance, it received comparatively little attention when it first appeared in 1982. In this paper, we: present the first detailed examination of the operation and structure of Elk Cloner; discuss the effec… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

  29. arXiv:2006.03202  [pdf, other

    cs.CL cs.LG cs.SI

    Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment

    Authors: Sharon Levy, William Yang Wang

    Abstract: The spread of COVID-19 has become a significant and troubling aspect of society in 2020. With millions of cases reported across countries, new outbreaks have occurred and followed patterns of previously affected areas. Many disease detection models do not incorporate the wealth of social media data that can be utilized for modeling and predicting its spread. In this case, it is useful to ask, can… ▽ More

    Submitted 15 October, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

  30. arXiv:2006.00084  [pdf, other

    astro-ph.IM astro-ph.EP cs.GR

    Clustering-informed Cinematic Astrophysical Data Visualization with Application to the Moon-forming Terrestrial Synestia

    Authors: Patrick D. Aleo, Simon J. Lock, Donna J. Cox, Stuart A. Levy, J. P. Naiman, A. J. Christensen, Kalina Borkiewicz, Robert Patterson

    Abstract: Scientific visualization tools are currently not optimized to create cinematic, production-quality representations of numerical data for the purpose of science communication. In our pipeline \texttt{Estra}, we outline a step-by-step process from a raw simulation into a finished render as a way to teach non-experts in the field of visualization how to achieve production-quality outputs on their own… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: 19 pages, 16 figures, submitted to MNRAS

  31. arXiv:2004.13939  [pdf, ps, other

    cs.CL

    Evaluating Transformer-Based Multilingual Text Classification

    Authors: Sophie Groenwold, Samhita Honnavalli, Lily Ou, Aesha Parekh, Sharon Levy, Diba Mirza, William Yang Wang

    Abstract: As NLP tools become ubiquitous in today's technological landscape, they are increasingly applied to languages with a variety of typological structures. However, NLP research does not focus primarily on typological differences in its analysis of state-of-the-art language models. As a result, NLP tools perform unequally across languages with different syntactic and morphological structures. Through… ▽ More

    Submitted 30 April, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Total of 15 pages (9 pages for paper, 2 pages for references, 4 pages for appendix). Changed title

  32. arXiv:1911.03854  [pdf, other

    cs.CL cs.CY cs.IR

    r/Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection

    Authors: Kai Nakamura, Sharon Levy, William Yang Wang

    Abstract: Fake news has altered society in negative ways in politics and culture. It has adversely affected both online social network systems as well as offline communities and conversations. Using automatic machine learning classification models is an efficient way to combat the widespread dissemination of fake news. However, a lack of effective, comprehensive datasets has been a problem for fake news res… ▽ More

    Submitted 12 March, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Accepted LREC 2020

  33. arXiv:1910.07130  [pdf, other

    cs.SI cs.IR

    SCG: Spotting Coordinated Groups in Social Media

    Authors: Junhao Wang, Sacha Levy, Ren Wang, Aayushi Kulshrestha, Reihaneh Rabbany

    Abstract: Recent events have led to a burgeoning awareness on the misuse of social media sites to affect political events, sway public opinion, and confuse the voters. Such serious, hostile mass manipulation has motivated a large body of works on bots/troll detection and fake news detection, which mostly focus on classifying at the user level based on the content generated by the users. In this study, we jo… ▽ More

    Submitted 1 September, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  34. arXiv:1811.01147  [pdf, other

    cs.AI

    SafeRoute: Learning to Navigate Streets Safely in an Urban Environment

    Authors: Sharon Levy, Wenhan Xiong, Elizabeth Belding, William Yang Wang

    Abstract: Recent studies show that 85% of women have changed their traveled route to avoid harassment and assault. Despite this, current map** tools do not empower users with information to take charge of their personal safety. We propose SafeRoute, a novel solution to the problem of navigating cities and avoiding street harassment and crime. Unlike other street navigation applications, SafeRoute introduc… ▽ More

    Submitted 2 November, 2018; originally announced November 2018.

    Comments: 8 pages

  35. Intrusion Detection System for Applications using Linux Containers

    Authors: Amr S. Abed, Charles Clancy, David S. Levy

    Abstract: Linux containers are gaining increasing traction in both individual and industrial use, and as these containers get integrated into mission-critical systems, real-time detection of malicious cyber attacks becomes a critical operational requirement. This paper introduces a real-time host-based intrusion detection system that can be used to passively detect malfeasance against applications within Li… ▽ More

    Submitted 9 November, 2016; originally announced November 2016.

    Comments: The final publication is available at http://link.springer.com/chapter/10.1007%2F978-3-319-24858-5_8. arXiv admin note: substantial text overlap with arXiv:1611.03053

    Journal ref: STM 2015. LNCS, vol. 9331, pp. 123-135. Springer, Heidelberg (2015)

  36. Applying Bag of System Calls for Anomalous Behavior Detection of Applications in Linux Containers

    Authors: Amr S. Abed, T. Charles Clancy, David S. Levy

    Abstract: In this paper, we present the results of using bags of system calls for learning the behavior of Linux containers for use in anomaly-detection based intrusion detection system. By using system calls of the containers monitored from the host kernel for anomaly detection, the system does not require any prior knowledge of the container nature, neither does it require altering the container or the ho… ▽ More

    Submitted 9 November, 2016; originally announced November 2016.

    Comments: Published version available on IEEE Xplore (http://ieeexplore.ieee.org/document/7414047/) arXiv admin note: substantial text overlap with arXiv:1611.03056

    Journal ref: 2015 IEEE Globecom Workshops (GC Wkshps), San Diego, CA, 2015, pp. 1-5