Showing 1–40 of 40 results for author: Jones, L

Searching in archive cs.
  1. arXiv:2404.10706  [pdf, other]

    cs.CY cs.CL cs.HC cs.SI

    Cross-Language Evolution of Divergent Collective Memory Around the Arab Spring

    Authors: H. Laurie Jones, Brian C. Keegan

    Abstract: The Arab Spring was a historic set of protests beginning in 2011 that toppled governments and led to major conflicts. Collective memories of events like these can vary significantly across social contexts in response to political, cultural, and linguistic factors. While Wikipedia plays an important role in documenting both historic and current events, little attention has been given to how Wikiped…

    Submitted 16 April, 2024; originally announced April 2024.

  2. arXiv:2312.02312  [pdf, other]

    cs.LG cs.AI cs.CV

    Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games

    Authors: Lukas Schäfer, Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, Dave Bignell, Siddhartha Sen, Andrea Treviño Gavito, Sam Devlin

    Abstract: Video games have served as useful benchmarks for the decision making community, but going beyond Atari games towards training agents in modern games has been prohibitively expensive for the vast majority of the research community. Recent progress in the research, development and open release of large vision models has the potential to amortize some of these costs across the community. However, it…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Preprint

  3. Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation

    Authors: Llion Jones, Richard Sproat, Haruko Ishikawa, Alexander Gutkin

    Abstract: If one sees the place name Houston Mercer Dog Run in New York, how does one know how to pronounce it? Assuming one knows that Houston in New York is pronounced "how-ston" and not like the Texas city, then one can probably guess that "how-ston" is also used in the name of the dog park. We present a novel architecture that learns to use the pronunciations of neighboring names in order to guess the p…

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 16 pages, to appear Transactions of the Association for Computational Linguistics

  4. arXiv:2203.04540  [pdf, other]

    cs.AI cs.LG

    MetaCon: Unified Predictive Segments System with Trillion Concept Meta-Learning

    Authors: Keqian Li, Yifan Hu, Logan Palanisamy, Lisa Jones, Akshay Gupta, Jason Grigsby, Ili Selinger, Matt Gillingham, Fei Tan

    Abstract: Accurate understanding of users in terms of predicative segments play an essential role in the day to day operation of modern internet enterprises. Nevertheless, there are significant challenges that limit the quality of data, especially on long tail predictive tasks. In this work, we present MetaCon, our unified predicative segments system with scalable, trillion concepts meta learning that addre…

    Submitted 9 March, 2022; originally announced March 2022.

  5. SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays

    Authors: Thi Ngoc Tho Nguyen, Douglas L. Jones, Karn N. Watcharasupat, Huy Phan, Woon-Seng Gan

    Abstract: Polyphonic sound event localization and detection (SELD) has many practical applications in acoustic sensing and monitoring. However, the development of real-time SELD has been limited by the demanding computational requirement of most recent SELD systems. In this work, we introduce SALSA-Lite, a fast and effective feature for polyphonic SELD using microphone array inputs. SALSA-Lite is a lightwei…

    Submitted 4 May, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:2110.00275

    Journal ref: Proceedings of the 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 716-720

  6. SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection

    Authors: Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon-Seng Gan

    Abstract: Sound event localization and detection (SELD) consists of two subtasks, which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses amplitude and/or phase differences between microphones to estimate source directions. As a result, it is often di…

    Submitted 6 June, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: (c) 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 1749-1762, 2022

  7. arXiv:2107.14549  [pdf, other]

    cs.SD cs.LG eess.AS

    Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 from Audio Challenges

    Authors: Alican Akman, Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Lyn Jones, Björn W. Schuller

    Abstract: We report on cross-running the recent COVID-19 Identification ResNet (CIdeR) on the two Interspeech 2021 COVID-19 diagnosis from cough and speech audio challenges: ComParE and DiCOVA. CIdeR is an end-to-end deep learning neural network originally designed to classify whether an individual is COVID-positive or COVID-negative based on coughing and breathing audio recordings from a published crowdsou…

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: 5 pages, 1 figure

  8. arXiv:2107.10471  [pdf, ps, other]

    eess.AS cs.AI cs.LG cs.SD eess.SP

    Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning

    Authors: Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Zhen Jian Lee, Douglas L. Jones, Woon Seng Gan

    Abstract: The Sørensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection. Conventional training of polyphonic sound event detection systems with binary cross-e…

    Submitted 2 October, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Submitted to the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2021

  9. arXiv:2107.10469  [pdf, other]

    eess.AS cs.AI cs.LG cs.SD eess.SP

    What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis

    Authors: Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Zhen Jian Lee, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan

    Abstract: Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD inherits the challenges of both tasks, such as noise, reverberation, interference, polyphony, and non-stationarity of sound sources. Furthermore, SELD often faces an additional challenge of assigning correct corresp…

    Submitted 2 October, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted for the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2021

    Journal ref: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop, pp. 120-124

  10. arXiv:2106.15813  [pdf, other]

    eess.AS cs.SD

    DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

    Authors: Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani

    Abstract: Single-channel speech enhancement (SE) is an important task in speech processing. A widely used framework combines an analysis/synthesis filterbank with a mask prediction network, such as the Conv-TasNet architecture. In such systems, the denoising performance and computational efficiency are mainly affected by the structure of the mask prediction network. In this study, we aim to improve the sequ…

    Submitted 5 August, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: 5 pages, 2 figure. accepted for WASPAA 2021

  11. arXiv:2106.15190  [pdf, other]

    eess.AS cs.AI cs.LG cs.SD eess.SP

    DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection

    Authors: Thi Ngoc Tho Nguyen, Karn Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan

    Abstract: Sound event localization and detection consists of two subtasks which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses magnitude or phase differences between microphones to estimate source directions. Therefore, it is often difficult to joi…

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 5 pages, Technical Report for DCASE 2021 Challenge Task 3. arXiv admin note: text overlap with arXiv:2110.00275

  12. arXiv:2106.05111  [pdf, other]

    cs.CL cs.SD eess.AS

    A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition

    Authors: Shigeki Karita, Yotaro Kubo, Michiel Adriaan Unico Bacchiani, Llion Jones

    Abstract: End-to-end (E2E) modeling is advantageous for automatic speech recognition (ASR) especially for Japanese since word-based tokenization of Japanese is not trivial, and E2E modeling is able to model character sequences directly. This paper focuses on the latest E2E modeling techniques, and investigates their performances on character-based Japanese ASR by conducting comparative experiments. The resu…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: to be published in INTERSPEECH2021

  13. arXiv:2104.03113  [pdf, other]

    cs.LG cs.MA

    Scaling Scaling Laws with Board Games

    Authors: Andy L. Jones

    Abstract: The largest experiments in machine learning now require resources far beyond the budget of all but a few institutions. Fortunately, it has recently been shown that the results of these huge experiments can often be extrapolated from the results of a sequence of far smaller, cheaper experiments. In this work, we show that not only can the extrapolation be done based on the size of the model, but on…

    Submitted 15 April, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

  14. arXiv:2104.02443  [pdf]

    cs.SE cs.AI cs.CL cs.LG cs.PL

    CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing

    Authors: Ahmed Elnaggar, Wei Ding, Llion Jones, Tom Gibbs, Tamas Feher, Christoph Angerer, Silvia Severini, Florian Matthes, Burkhard Rost

    Abstract: Currently, a growing number of mature natural language processing applications make people's life more convenient. Such applications are built by source code - the language in software engineering. However, the applications for understanding source code language to ease the software engineering process are under-researched. Simultaneously, the transformer model, especially its combination with tra…

    Submitted 12 May, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: 28 pages, 6 tables and 1 figure

  15. arXiv:2102.08359  [pdf, other]

    cs.SD cs.LG eess.AS

    End-2-End COVID-19 Detection from Breath & Cough Audio

    Authors: Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Alice Baird, Lyn Jones, Björn W. Schuller

    Abstract: Our main contributions are as follows: (I) We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples, achieving ROC-AUC of 0.846; (II) Our model, the COVID-19 Identification ResNet, (CIdeR), has potential for rapid scalability, minimal cost and improving performance as more data becomes available. This could enable regular CO…

    Submitted 6 January, 2021; originally announced February 2021.

    Comments: 5 pages

    MSC Class: 68T11 ACM Class: I.2; I.5; J.3

  16. arXiv:2008.10530  [pdf, other]

    q-bio.PE cs.LG physics.soc-ph

    A New Mathematical Model for Controlled Pandemics Like COVID-19 : AI Implemented Predictions

    Authors: Liam Dowling Jones, Malik Magdon-Ismail, Laura Mersini-Houghton, Steven Meshnick

    Abstract: We present a new mathematical model to explicitly capture the effects that the three restriction measures: the lockdown date and duration, social distancing and masks, and, schools and border closing, have in controlling the spread of COVID-19 infections $i(r, t)$. Before restrictions were introduced, the random spread of infections as described by the SEIR model grew exponentially. The addition o…

    Submitted 24 August, 2020; originally announced August 2020.

  17. arXiv:2007.06225  [pdf]

    cs.LG cs.CL cs.DC stat.ML

    ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing

    Authors: Ahmed Elnaggar, Michael Heinzinger, Christian Dallago, Ghalia Rihawi, Yu Wang, Llion Jones, Tom Gibbs, Tamas Feher, Christoph Angerer, Martin Steinegger, Debsindhu Bhowmik, Burkhard Rost

    Abstract: Computational biology and bioinformatics provide vast data gold-mines from protein sequences, ideal for Language Models taken from NLP. These LMs reach for new prediction frontiers at low inference costs. Here, we trained two auto-regressive models (Transformer-XL, XLNet) and four auto-encoder models (BERT, Albert, Electra, T5) on data from UniRef and BFD containing up to 393 billion amino acids.…

    Submitted 4 May, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 17 pages, 9 figures, 4 tables

  18. arXiv:1911.11373  [pdf, other]

    eess.AS cs.SD

    A two-step system for sound event localization and detection

    Authors: T. N. T. Nguyen, D. L. Jones, R. Ranjan, S. Jayabalan, W. S. Gan

    Abstract: Sound event detection and sound event localization requires different features from audio input signals. While sound event detection mainly relies on time-frequency patterns to distinguish different event classes, sound event localization uses magnitude or phase differences between microphones to estimate source directions. Therefore, we propose a two-step system to do sound event localization and…

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 5 pages

  19. arXiv:1911.08682  [pdf, other]

    stat.AP cs.SI stat.ME

    Ensuring Reliable Monte Carlo Estimates of Network Properties

    Authors: Haema Nilakanta, Zack W. Almquist, Galin L. Jones

    Abstract: The literature in social network analysis has largely focused on methods and models which require complete network data; however there exist many networks which can only be studied via sampling methods due to the scale or complexity of the network, access limitations, or the population of interest is hard to reach. In such cases, the application of random walk-based Markov chain Monte Carlo (MCMC)…

    Submitted 21 November, 2019; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: 27 pages

    MSC Class: 62H12; 62P25; 62P30; 62D05

  20. arXiv:1906.03524  [pdf, other]

    cs.HC

    PizzaBox: Studying Internet Connected Physical Object Manipulation based Food Ordering

    Authors: Luke Jones, Charith Perera

    Abstract: This paper presents the designing and testing of PizzaBox, a 3D printed, interactive food ordering system that aims to differ from conventional food ordering systems and provide an entertaining and unique experience when ordering a pizza by incorporating underlying technologies that support ubiquitous computing. The PizzaBox has gone through both low and medium fidelity testing while working colla…

    Submitted 8 June, 2019; originally announced June 2019.

  21. arXiv:1902.08295  [pdf, other]

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w…

    Submitted 21 February, 2019; originally announced February 2019.

  22. arXiv:1808.04444  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Character-Level Language Modeling with Deeper Self-Attention

    Authors: Rami Al-Rfou, Dokook Choe, Noah Constant, Mandy Guo, Llion Jones

    Abstract: LSTMs and other RNN variants have shown strong performance on character-level language modeling. These models are typically trained using truncated backpropagation through time, and it is common to assume that their success stems from their ability to remember long-term contexts. In this paper, we show that a deep (64-layer) transformer model with fixed context outperforms RNN variants by a large…

    Submitted 10 December, 2018; v1 submitted 9 August, 2018; originally announced August 2018.

    Comments: 8 pages, 7 figures

  23. arXiv:1805.08804  [pdf, ps, other]

    cs.DC

    Optimal Record and Replay under Causal Consistency

    Authors: Russell L. Jones, Muhammad S. Khan, Nitin H. Vaidya

    Abstract: We investigate the minimum record needed to replay executions of processes that share causally consistent memory. For a version of causal consistency, we identify optimal records under both offline and online recording setting. Under the offline setting, a central authority has information about every process' view of the execution and can decide what information to record for each process. Under…

    Submitted 29 October, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: Added a new RnR model and results for that model. Also added some text for better reading and some references

  24. arXiv:1804.09849  [pdf, other]

    cs.CL cs.AI

    The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation

    Authors: Mia Xu Chen, Orhan Firat, Ankur Bapna, Melvin Johnson, Wolfgang Macherey, George Foster, Llion Jones, Niki Parmar, Mike Schuster, Zhifeng Chen, Yonghui Wu, Macduff Hughes

    Abstract: The past year has witnessed rapid advances in sequence-to-sequence (seq2seq) modeling for Machine Translation (MT). The classic RNN-based approaches to MT were first out-performed by the convolutional seq2seq model, which was then out-performed by the more recent Transformer model. Each of these new approaches consists of a fundamental architecture accompanied by a set of modeling and training tec…

    Submitted 26 April, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

  25. arXiv:1803.07416  [pdf, other]

    cs.LG cs.CL stat.ML

    Tensor2Tensor for Neural Machine Translation

    Authors: Ashish Vaswani, Samy Bengio, Eugene Brevdo, Francois Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Łukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit

    Abstract: Tensor2Tensor is a library for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model.

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1706.03762

  26. arXiv:1706.05137  [pdf, other]

    cs.LG stat.ML

    One Model To Learn Them All

    Authors: Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, Jakob Uszkoreit

    Abstract: Deep learning yields great results across many fields, from speech recognition, image classification, to translation. But for each problem, getting a deep model to work well involves research into the architecture and a long period of tuning. We present a single model that yields good results on a number of problems spanning multiple domains. In particular, this single model is trained concurrentl…

    Submitted 15 June, 2017; originally announced June 2017.

  27. arXiv:1706.03762  [pdf, other]

    cs.CL cs.LG

    Attention Is All You Need

    Authors: Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

    Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experi…

    Submitted 1 August, 2023; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: 15 pages, 5 figures

  28. arXiv:1705.00615  [pdf, other]

    eess.SY cs.NI

    Guided-Processing Outperforms Duty-Cycling for Energy-Efficient Systems

    Authors: Long N. Le, Douglas L. Jones

    Abstract: Energy-efficiency is highly desirable for sensing systems in the Internet of Things (IoT). A common approach to achieve low-power systems is duty-cycling, where components in a system are turned off periodically to meet an energy budget. However, this work shows that such an approach is not necessarily optimal in energy-efficiency, and proposes \textit{guided-processing} as a fundamentally better…

    Submitted 1 May, 2017; originally announced May 2017.

    Comments: preprint, the published version is in IEEE Transactions on Circuits and Systems I, Special Issue on Circuits and Systems for the Internet of Things - From Sensing to Sensemaking, 2017. arXiv admin note: substantial text overlap with arXiv:1705.00596

  29. Feature-Sharing in Cascade Detection Systems with Multiple Applications

    Authors: Long N. Le, Douglas L. Jones

    Abstract: Traditional distributed detection systems are often designed for a single target application. However, with the emergence of the Internet of Things (IoT) paradigm, next-generation systems are expected to be a shared infrastructure for multiple applications. To this end, we propose a modular, cascade design for resource-efficient, multi-task detection systems. Two (classes of) applications are cons…

    Submitted 1 May, 2017; originally announced May 2017.

    Comments: preprint, the published version is in IEEE Journal of Selected Topics in Signal Processing, Special Issue on Cooperative Signal Processing for Heterogeneous and Multi-Task Wireless Sensor Networks, 2017

  30. arXiv:1612.00516  [pdf, other]

    stat.ML cs.LG

    Canonical Correlation Analysis for Analyzing Sequences of Medical Billing Codes

    Authors: Corinne L. Jones, Sham M. Kakade, Lucas W. Thornblade, David R. Flum, Abraham D. Flaxman

    Abstract: We propose using canonical correlation analysis (CCA) to generate features from sequences of medical billing codes. Applying this novel use of CCA to a database of medical billing codes for patients with diverticulitis, we first demonstrate that the CCA embeddings capture meaningful relationships among the codes. We then generate features from these embeddings and establish their usefulness in pre…

    Submitted 6 January, 2017; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: Accepted at NIPS 2016 Workshop on Machine Learning for Health

  31. arXiv:1608.03542  [pdf, other]

    cs.CL

    WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia

    Authors: Daniel Hewlett, Alexandre Lacoste, Llion Jones, Illia Polosukhin, Andrew Fandrianto, Jay Han, Matthew Kelcey, David Berthelot

    Abstract: We present WikiReading, a large-scale natural language understanding task and publicly-available dataset with 18 million instances. The task is to predict textual values from the structured knowledge base Wikidata by reading the text of the corresponding Wikipedia articles. The task contains a rich variety of challenging classification and extraction sub-tasks, making it well-suited for end-to-end…

    Submitted 15 March, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Journal ref: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016, pp. 1535-1545

  32. arXiv:1504.04317  [pdf, ps, other]

    cs.IR cs.CL cs.CR

    Towards a relation extraction framework for cyber-security concepts

    Authors: Corinne L. Jones, Robert A. Bridges, Kelly Huffer, John Goodall

    Abstract: In order to assist security analysts in obtaining information pertaining to their network, such as novel vulnerabilities, exploits, or patches, information retrieval methods tailored to the security domain are needed. As labeled text data is scarce and expensive, we follow developments in semi-supervised Natural Language Processing and implement a bootstrapping algorithm for extracting security en…

    Submitted 16 April, 2015; originally announced April 2015.

    Comments: 4 pages in Cyber & Information Security Research Conference 2015, ACM

    ACM Class: H.3.3

  33. arXiv:1501.03311  [pdf, other]

    cs.IT cs.MM cs.NI cs.PF

    Optimized Network-coded Scalable Video Multicasting over eMBMS Networks

    Authors: Andrea Tassi, Ioannis Chatzigeorgiou, Dejan Vukobratović, Andrew L. Jones

    Abstract: Delivery of multicast video services over fourth generation (4G) networks such as 3GPP Long Term Evolution-Advanced (LTE-A) is gaining momentum. In this paper, we address the issue of efficiently multicasting layered video services by defining a novel resource allocation framework that aims to maximize the service coverage whilst keeping the radio resource footprint low. A key point in the propose…

    Submitted 20 January, 2015; v1 submitted 14 January, 2015; originally announced January 2015.

    Comments: Proc. of IEEE ICC 2015 - Mobile and Wireless Networking Symposium, to appear

  34. arXiv:1501.03307  [pdf, other]

    cs.IT cs.MM cs.PF

    Binary Systematic Network Coding for Progressive Packet Decoding

    Authors: Andrew L. Jones, Ioannis Chatzigeorgiou, Andrea Tassi

    Abstract: We consider binary systematic network codes and investigate their capability of decoding a source message either in full or in part. We carry out a probability analysis, derive closed-form expressions for the decoding probability and show that systematic network coding outperforms conventional network coding. We also develop an algorithm based on Gaussian elimination that allows progressive decodi…

    Submitted 14 January, 2015; originally announced January 2015.

    Comments: Proc. of IEEE ICC 2015 - Communication Theory Symposium, to appear

  35. Optimal Simultaneous Detection and Signal and Noise Power Estimation

    Authors: Long Le, Douglas L. Jones

    Abstract: Simultaneous detection and estimation is important in many engineering applications. In particular, there are many applications where it is important to perform signal detection and Signal-to-Noise-Ratio (SNR) estimation jointly. Application of existing frameworks in the literature that handle simultaneous detection and estimation is not straightforward for this class of application. This paper th…

    Submitted 15 October, 2014; originally announced October 2014.

    Comments: appears in 2014 IEEE International Symposium on Information Theory (ISIT)

  36. arXiv:1308.4941  [pdf, other]

    cs.IR cs.CL

    Automatic Labeling for Entity Extraction in Cyber Security

    Authors: Robert A. Bridges, Corinne L. Jones, Michael D. Iannacone, Kelly M. Testa, John R. Goodall

    Abstract: Timely analysis of cyber-security information necessitates automated information extraction from unstructured text. While state-of-the-art extraction methods produce extremely accurate results, they require ample training data, which is generally unavailable for specialized applications, such as detecting security related entities; moreover, manual annotation of corpora is very costly and often no…

    Submitted 9 June, 2014; v1 submitted 22 August, 2013; originally announced August 2013.

    Comments: 10 pages

  37. Ganga: a tool for computational-task management and easy access to Grid resources

    Authors: J. T. Mościcki, F. Brochu, J. Ebke, U. Egede, J. Elmsheuser, K. Harrison, R. W. L. Jones, H. C. Lee, D. Liko, A. Maier, A. Muraru, G. N. Patrick, K. Pajchel, W. Reece, B. H. Samset, M. W. Slater, A. Soroko, C. L. Tan, D. C. Vanderster, M. Williams

    Abstract: In this paper, we present the computational task-management tool Ganga, which allows for the specification, submission, bookkeeping and post-processing of computational tasks on a wide set of distributed resources. Ganga has been developed to solve a problem increasingly common in scientific projects, which is that researchers must regularly switch between different processing systems, each with…

    Submitted 9 June, 2009; v1 submitted 16 February, 2009; originally announced February 2009.

    Comments: Extended and clarified information on the Grid computing context for Ganga, supported job model etc. Additional minor corrections and clarifications. Updated the author list as agreed with the Ganga team

    ACM Class: C.2.4; H.3.4; J.2; J.3

  38. arXiv:cs/0612047  [pdf, ps, other]

    cs.HC cs.AI

    Social Browsing on Flickr

    Authors: Kristina Lerman, Laurie Jones

    Abstract: The new social media sites - blogs, wikis, del.icio.us and Flickr, among others - underscore the transformation of the Web to a participatory medium in which users are actively creating, evaluating and distributing information. The photo-sharing site Flickr, for example, allows users to upload photographs, view photos created by others, comment on those photos, etc. As is common to other social…

    Submitted 7 December, 2006; originally announced December 2006.

    Comments: 8 pages; submitted to the International Conference on Weblogs and Social Media

  39. arXiv:cs/0306085  [pdf, ps, other]

    cs.SE

    GANGA: a user-Grid interface for Atlas and LHCb

    Authors: K. Harrison, W. T. L. P. Lavrijsen, P. Mato, A. Soroko, C. L. Tan, C. E. Tull, N. Brook, R. W. L. Jones

    Abstract: The Gaudi/Athena and Grid Alliance (GANGA) is a front-end for the configuration, submission, monitoring, bookkeeping, output collection, and reporting of computing jobs run on a local batch system or on the grid. In particular, GANGA handles jobs that use applications written for the Gaudi software framework shared by the Atlas and LHCb experiments. GANGA exploits the commonality of Gaudi-based…

    Submitted 13 June, 2003; originally announced June 2003.

    Comments: 9 pages, 3 figures, CHEP 2003, March 2003, La Jolla, California, PSN TUCT002

    ACM Class: D.2.6

  40. arXiv:cs/0203015  [pdf]

    cs.SD cs.LO

    Towards Experimental Nanosound Using Almost Disjoint Set Theory

    Authors: Cameron L Jones

    Abstract: Music composition using digital audio sequence editors is increasingly performed in a visual workspace where sound complexes are built from discrete sound objects, called gestures that are arranged in time and space to generate a continuous composition. The visual workspace, common to most industry standard audio loop sequencing software, is premised on the arrangement of gestures defined with g…

    Submitted 12 March, 2002; originally announced March 2002.

    Comments: 20 pages, 4 figures, 1 table

    ACM Class: H.5.5