Skip to main content

Showing 1–50 of 50 results for author: Teng, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16810  [pdf

    cs.CL

    Performance evaluation of Reddit Comments using Machine Learning and Natural Language Processing methods in Sentiment Analysis

    Authors: Xiaoxia Zhang, Xiuyuan Qi, Zixin Teng

    Abstract: Sentiment analysis, an increasingly vital field in both academia and industry, plays a pivotal role in machine learning applications, particularly on social media platforms like Reddit. However, the efficacy of sentiment analysis models is hindered by the lack of expansive and fine-grained emotion datasets. To address this gap, our study leverages the GoEmotions dataset, comprising a diverse range… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures, to be published in Computational and Experimental Simulations in Engineering - Proceedings of ICCES 2024 - Volume 2

  2. arXiv:2404.18130  [pdf, other

    cs.AI cs.CL

    Logic Agent: Enhancing Validity with Logic Rule Invocation

    Authors: Hanmeng Liu, Zhiyang Teng, Chaoli Zhang, Yue Zhang

    Abstract: Chain-of-Thought (CoT) prompting has emerged as a pivotal technique for augmenting the inferential capabilities of language models during reasoning tasks. Despite its advancements, CoT often grapples with challenges in validating reasoning validity and ensuring informativeness. Addressing these limitations, this paper introduces the Logic Agent (LA), an agent-based framework aimed at enhancing the… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  3. arXiv:2401.08232  [pdf, other

    cs.CV

    Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization

    Authors: Chongzhi Zhang, Mingyuan Zhang, Zhiyang Teng, Jiayi Li, Xizhou Zhu, Lewei Lu, Ziwei Liu, Aixin Sun

    Abstract: Natural Language Video Localization (NLVL), grounding phrases from natural language descriptions to corresponding video segments, is a complex yet critical task in video understanding. Despite ongoing advancements, many existing solutions lack the capability to globally capture temporal dynamics of the video data. In this study, we present a novel approach to NLVL that aims to address this issue.… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  4. arXiv:2312.16418  [pdf, other

    cs.LG cs.AI cs.SI

    Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks

    Authors: Chenyang Qiu, Guoshun Nan, Tianyu Xiong, Wendi Deng, Di Wang, Zhiyang Teng, Lijuan Sun, Qimei Cui, Xiaofeng Tao

    Abstract: Graph convolution networks (GCNs) are extensively utilized in various graph tasks to mine knowledge from spatial data. Our study marks the pioneering attempt to quantitatively investigate the GCN robustness over omnipresent heterophilic graphs for node classification. We uncover that the predominant vulnerability is caused by the structural out-of-distribution (OOD) issue. This finding motivates u… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: To be appeared in the proceedings of AAAI-2024

  5. arXiv:2311.07996  [pdf, other

    cs.CL

    How Well Do Text Embedding Models Understand Syntax?

    Authors: Yan Zhang, Zhaopeng Feng, Zhiyang Teng, Zuozhu Liu, Haizhou Li

    Abstract: Text embedding models have significantly contributed to advancements in natural language processing by adeptly capturing semantic properties of textual data. However, the ability of these models to generalize across a wide range of syntactic contexts remains under-explored. In this paper, we first develop an evaluation set, named \textbf{SR}, to scrutinize the capability for syntax understanding o… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP-Findings 2023, datasets and code are released

  6. arXiv:2310.09107  [pdf, other

    cs.CL cs.AI

    GLoRE: Evaluating Logical Reasoning of Large Language Models

    Authors: Hanmeng liu, Zhiyang Teng, Ruoxi Ning, Jian Liu, Qiji Zhou, Yue Zhang

    Abstract: Recently, large language models (LLMs), including notable models such as GPT-4 and burgeoning community models, have showcased significant general language understanding abilities. However, there has been a scarcity of attempts to assess the logical reasoning capacities of these LLMs, an essential facet of natural language understanding. To encourage further investigation in this area, we introduc… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  7. arXiv:2310.05130  [pdf, other

    cs.CL

    Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature

    Authors: Guangsheng Bao, Yanbin Zhao, Zhiyang Teng, Linyi Yang, Yue Zhang

    Abstract: Large language models (LLMs) have shown the ability to produce fluent and cogent content, presenting both productivity opportunities and societal risks. To build trustworthy AI systems, it is imperative to distinguish between machine-generated and human-authored content. The leading zero-shot detector, DetectGPT, showcases commendable performance but is marred by its intensive computational costs.… ▽ More

    Submitted 22 February, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  8. arXiv:2307.07763  [pdf, other

    cs.RO cs.CV eess.IV

    Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents

    Authors: Ke Cao, Rui** Liu, Ze Wang, Kunyu Peng, Jiaming Zhang, Junwei Zheng, Zhifeng Teng, Kailun Yang, Rainer Stiefelhagen

    Abstract: The mobile robot relies on SLAM (Simultaneous Localization and Map**) to provide autonomous navigation and task execution in complex and unknown environments. However, it is hard to develop a dedicated algorithm for mobile robots due to dynamic and challenging situations, such as poor lighting conditions and motion blur. To tackle this issue, we propose a tightly-coupled LiDAR-visual SLAM based… ▽ More

    Submitted 25 December, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Accepted to ROBIO 2023

  9. arXiv:2305.16166  [pdf, other

    cs.CL

    Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis

    Authors: Xuming Hu, Zhijiang Guo, Zhiyang Teng, Irwin King, Philip S. Yu

    Abstract: Multimodal relation extraction (MRE) is the task of identifying the semantic relationships between two entities based on the context of the sentence image pair. Existing retrieval-augmented approaches mainly focused on modeling the retrieved textual knowledge, but this may not be able to accurately identify complex relations. To improve the prediction, this research proposes to retrieve textual an… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  10. arXiv:2305.13718  [pdf, other

    cs.CL

    Exploring Self-supervised Logic-enhanced Training for Large Language Models

    Authors: Fangkai Jiao, Zhiyang Teng, Bosheng Ding, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty

    Abstract: Existing efforts to improve logical reasoning ability of language models have predominantly relied on supervised fine-tuning, hindering generalization to new domains and/or tasks. The development of Large Langauge Models (LLMs) has demonstrated the capacity of compressing abundant knowledge into a single proxy, enabling them to tackle multiple tasks effectively. Our preliminary experiments, nevert… ▽ More

    Submitted 16 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 16 pages, NAACL 2024

  11. arXiv:2305.12878  [pdf, other

    cs.CL

    Non-Autoregressive Document-Level Machine Translation

    Authors: Guangsheng Bao, Zhiyang Teng, Hao Zhou, Jianhao Yan, Yue Zhang

    Abstract: Non-autoregressive translation (NAT) models achieve comparable performance and superior speed compared to auto-regressive translation (AT) models in the context of sentence-level machine translation (MT). However, their abilities are unexplored in document-level MT, hindering their usage in real scenarios. In this paper, we conduct a comprehensive examination of typical NAT models in the context o… ▽ More

    Submitted 9 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EMNLP2023 Findings camera-ready version. Review soundness 443 and excitement 443

  12. arXiv:2305.12147  [pdf, other

    cs.CL cs.AI

    LogiCoT: Logical Chain-of-Thought Instruction-Tuning

    Authors: Hanmeng Liu, Zhiyang Teng, Leyang Cui, Chaoli Zhang, Qiji Zhou, Yue Zhang

    Abstract: Generative Pre-trained Transformer 4 (GPT-4) demonstrates impressive chain-of-thought reasoning ability. Recent work on self-instruction tuning, such as Alpaca, has focused on enhancing the general proficiency of models. These instructions enable the model to achieve performance comparable to GPT-3.5 on general tasks like open-domain text generation and paraphrasing. However, they fall short of he… ▽ More

    Submitted 28 October, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  13. arXiv:2305.04505  [pdf, other

    cs.CL

    Target-Side Augmentation for Document-Level Machine Translation

    Authors: Guangsheng Bao, Zhiyang Teng, Yue Zhang

    Abstract: Document-level machine translation faces the challenge of data sparsity due to its long input length and a small amount of training data, increasing the risk of learning spurious patterns. To address this challenge, we propose a target-side augmentation method, introducing a data augmentation (DA) model to generate many potential translations for each source document. Learning on these wider range… ▽ More

    Submitted 4 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL2023 main conference

  14. arXiv:2305.04493  [pdf, other

    cs.CL

    Token-Level Fitting Issues of Seq2seq Models

    Authors: Guangsheng Bao, Zhiyang Teng, Yue Zhang

    Abstract: Sequence-to-sequence (seq2seq) models have been widely used for natural language processing, computer vision, and other deep learning tasks. We find that seq2seq models trained with early-stop** suffer from issues at the token level. In particular, while some tokens in the vocabulary demonstrate overfitting, others underfit when training is stopped. Experiments show that the phenomena are pervas… ▽ More

    Submitted 22 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023 Workshop on RepL4NLP, 9 pages

  15. arXiv:2305.04205  [pdf, other

    cs.CV cs.RO eess.IV

    Bi-Mapper: Holistic BEV Semantic Map** for Autonomous Driving

    Authors: Siyu Li, Kailun Yang, Hao Shi, Jiaming Zhang, Jiacheng Lin, Zhifeng Teng, Zhiyong Li

    Abstract: A semantic map of the road scene, covering fundamental road elements, is an essential ingredient in autonomous driving systems. It provides important perception foundations for positioning and planning when rendered in the Bird's-Eye-View (BEV). Currently, the prior knowledge of hypothetical depth can guide the learning of translating front perspective views into BEV directly with the help of cali… ▽ More

    Submitted 6 September, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L). The source code is publicly available at https://github.com/lynn-yu/Bi-Mapper

  16. arXiv:2304.03439  [pdf, other

    cs.CL cs.AI

    Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4

    Authors: Hanmeng Liu, Ruoxi Ning, Zhiyang Teng, Jian Liu, Qiji Zhou, Yue Zhang

    Abstract: Harnessing logical reasoning ability is a comprehensive natural language understanding endeavor. With the release of Generative Pretrained Transformer 4 (GPT-4), highlighted as "advanced" at reasoning tasks, we are eager to learn the GPT-4 performance on various logical reasoning tasks. This report analyses multiple logical reasoning datasets, with popular benchmarks like LogiQA and ReClor, and ne… ▽ More

    Submitted 5 May, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  17. arXiv:2303.11910  [pdf, other

    cs.CV

    360BEV: Panoramic Semantic Map** for Indoor Bird's-Eye View

    Authors: Zhifeng Teng, Jiaming Zhang, Kailun Yang, Kunyu Peng, Hao Shi, Simon Reiß, Ke Cao, Rainer Stiefelhagen

    Abstract: Seeing only a tiny part of the whole is not knowing the full circumstance. Bird's-eye-view (BEV) perception, a process of obtaining allocentric maps from egocentric views, is restricted when using a narrow Field of View (FoV) alone. In this work, map** from 360° panoramas to BEV semantics, the 360BEV task, is established for the first time to achieve holistic representations of indoor scenes in… ▽ More

    Submitted 4 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Code and datasets are available at the project page: https://jamycheung.github.io/360BEV.html. Accepted to WACV 2024

  18. NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation

    Authors: Quchen Fu, Zhongwei Teng, Marco Georgaklis, Jules White, Douglas C. Schmidt

    Abstract: Translating natural language into Bash Commands is an emerging research field that has gained attention in recent years. Most efforts have focused on producing more accurate translation models. To the best of our knowledge, only two datasets are available, with one based on the other. Both datasets involve scra** through known data sources (through platforms like stack overflow, crowdsourcing, e… ▽ More

    Submitted 18 June, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Journal ref: Journal of Machine Learning Theory, Applications and Practice 2023

  19. arXiv:2209.13877  [pdf, other

    cs.CL

    YATO: Yet Another deep learning based Text analysis Open toolkit

    Authors: Zeqiang Wang, Yile Wang, Jiageng Wu, Zhiyang Teng, Jie Yang

    Abstract: We introduce YATO, an open-source, easy-to-use toolkit for text analysis with deep learning. Different from existing heavily engineered toolkits and platforms, YATO is lightweight and user-friendly for researchers from cross-disciplinary areas. Designed in a hierarchical structure, YATO supports free combinations of three types of widely used features including 1) traditional neural networks (CNN,… ▽ More

    Submitted 18 October, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  20. arXiv:2209.13773  [pdf, other

    cs.CL

    METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19 Related Tweets

    Authors: Peilin Zhou, Zeqiang Wang, Dading Chong, Zhijiang Guo, Yining Hua, Zichang Su, Zhiyang Teng, Jiageng Wu, Jie Yang

    Abstract: The COVID-19 pandemic continues to bring up various topics discussed or debated on social media. In order to explore the impact of pandemics on people's lives, it is crucial to understand the public's concerns and attitudes towards pandemic-related entities (e.g., drugs, vaccines) on social media. However, models trained on existing named entity recognition (NER) or targeted sentiment analysis (TS… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures, 6 tables, accepted by NeurIPS 2022 Datasets and Benchmarks track

  21. arXiv:2209.03834  [pdf, other

    cs.CL

    Pre-Training a Graph Recurrent Network for Language Representation

    Authors: Yile Wang, Linyi Yang, Zhiyang Teng, Ming Zhou, Yue Zhang

    Abstract: Transformer-based pre-trained models have gained much advance in recent years, becoming one of the most important backbones in natural language processing. Recent work shows that the attention mechanism inside Transformer may not be necessary, both convolutional neural networks and multi-layer perceptron based models have also been investigated as Transformer alternatives. In this paper, we consid… ▽ More

    Submitted 26 October, 2022; v1 submitted 8 September, 2022; originally announced September 2022.

    Comments: NeurIPS Efficient Natural Language and Speech Processing (ENLSP) Workshop 2022

  22. Deep Learning Models on CPUs: A Methodology for Efficient Training

    Authors: Quchen Fu, Ramesh Chukka, Keith Achorn, Thomas Atta-fosu, Deepak R. Canchi, Zhongwei Teng, Jules White, Douglas C. Schmidt

    Abstract: GPUs have been favored for training deep learning models due to their highly parallelized architecture. As a result, most studies on training optimization focus on GPUs. There is often a trade-off, however, between cost and efficiency when deciding on how to choose the proper hardware for training. In particular, CPU servers can be beneficial if training on CPUs was more efficient, as they incur f… ▽ More

    Submitted 18 June, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Journal ref: Journal of Machine Learning Theory, Applications and Practice (2023)

  23. arXiv:2203.14965  [pdf, other

    cs.CR

    A Systematic Survey of Attack Detection and Prevention in Connected and Autonomous Vehicles

    Authors: Trupil Limbasiya, Ko Zheng Teng, Sudipta Chattopadhyay, Jianying Zhou

    Abstract: The number of Connected and Autonomous Vehicles (CAVs) is increasing rapidly in various smart transportation services and applications, considering many benefits to society, people, and the environment. Several research surveys for CAVs were conducted by primarily focusing on various security threats and vulnerabilities in the domain of CAVs to classify different types of attacks, impacts of attac… ▽ More

    Submitted 5 August, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

    Comments: This article is published in the Vehicular Communications journal

  24. arXiv:2203.06517  [pdf, other

    cs.SD eess.AS

    SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System

    Authors: Zhongwei Teng, Quchen Fu, Jules White, Maria E. Powell, Douglas C. Schmidt

    Abstract: Research in the past several years has boosted the performance of automatic speaker verification systems and countermeasure systems to deliver low Equal Error Rates (EERs) on each system. However, research on joint optimization of both systems is still limited. The Spoofing-Aware Speaker Verification (SASV) 2022 challenge was proposed to encourage the development of integrated SASV systems with ne… ▽ More

    Submitted 24 March, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

    Comments: Update Experiment Results in ASV2019 protocol

  25. Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

    Authors: Xiang Bai, Hanchen Wang, Liya Ma, Yongchao Xu, Jiefeng Gan, Ziwei Fan, Fan Yang, Ke Ma, Jiehua Yang, Song Bai, Chang Shu, Xinyu Zou, Renhao Huang, Changzheng Zhang, Xiaowu Liu, Dandan Tu, Chuou Xu, Wenqing Zhang, Xi Wang, Anguo Chen, Yu Zeng, Dehua Yang, Ming-Wei Wang, Nagaraj Holalkere, Neil J. Halin , et al. (21 additional authors not shown)

    Abstract: Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI),… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Nature Machine Intelligence

  26. arXiv:2110.07310  [pdf, other

    cs.CL

    Solving Aspect Category Sentiment Analysis as a Text Generation Task

    Authors: Jian Liu, Zhiyang Teng, Leyang Cui, Hanmeng Liu, Yue Zhang

    Abstract: Aspect category sentiment analysis has attracted increasing research attention. The dominant methods make use of pre-trained language models by learning effective aspect category-specific representations, and adding specific output layers to its pre-trained representation. We consider a more direct way of making use of pre-trained language models, by casting the ACSA tasks into natural language ge… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 main conference

  27. arXiv:2109.02774  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    FastAudio: A Learnable Audio Front-End for Spoof Speech Detection

    Authors: Quchen Fu, Zhongwei Teng, Jules White, Maria Powell, Douglas C. Schmidt

    Abstract: Voice assistants, such as smart speakers, have exploded in popularity. It is currently estimated that the smart speaker adoption rate has exceeded 35% in the US adult population. Manufacturers have integrated speaker identification technology, which attempts to determine the identity of the person speaking, to provide personalized services to different members of the same family. Speaker identific… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  28. arXiv:2109.02773  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model

    Authors: Zhongwei Teng, Quchen Fu, Jules White, Maria Powell, Douglas C. Schmidt

    Abstract: An emerging trend in audio processing is capturing low-level speech representations from raw waveforms. These representations have shown promising results on a variety of tasks, such as speech recognition and speech separation. Compared to handcrafted features, learning speech features via backpropagation provides the model greater flexibility in how it represents data for different tasks theoreti… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  29. arXiv:2107.11963  [pdf, other

    cs.HC

    Can we infer player behavior tendencies from a player's decision-making data? Integrating Theory of Mind to Player Modeling

    Authors: Murtuza N. Shergadwala, Zhaoqing Teng, Magy Seif El-Nasr

    Abstract: Game AI systems need the theory of mind, which is the humanistic ability to infer others' mental models, preferences, and intent. Such systems would enable inferring players' behavior tendencies that contribute to the variations in their decision-making behaviors. To that end, in this paper, we propose the use of inverse Bayesian inference to infer behavior tendencies given a descriptive cognitive… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  30. arXiv:2106.13740  [pdf

    cs.HC cs.AI

    Advancing Methodology for Social Science Research Using Alternate Reality Games: Proof-of-Concept Through Measuring Individual Differences and Adaptability and their impact on Team Performance

    Authors: Magy Seif El-Nasr, Casper Harteveld, Paul Fombelle, Truong-Huy Nguyen, Paola Rizzo, Dylan Schouten, Abdelrahman Madkour, Chaima Jemmali, Erica Kleinman, Nithesh Javvaji, Zhaoqing Teng, Extra Ludic Inc

    Abstract: While work in fields of CSCW (Computer Supported Collaborative Work), Psychology and Social Sciences have progressed our understanding of team processes and their effect performance and effectiveness, current methods rely on observations or self-report, with little work directed towards studying team processes with quantifiable measures based on behavioral data. In this report we discuss work tack… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

    Journal ref: DARPA Report, 2018

  31. arXiv:2105.14761  [pdf, other

    cs.CL cs.LG

    G-Transformer for Document-level Machine Translation

    Authors: Guangsheng Bao, Yue Zhang, Zhiyang Teng, Boxing Chen, Weihua Luo

    Abstract: Document-level MT models are still far from satisfactory. Existing work extend translation unit from single sentence to multiple sentences. However, study shows that when we further enlarge the translation unit to a whole document, supervised training of Transformer can fail. In this paper, we find such failure is not caused by overfitting, but by sticking around local minima during training. Our… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted by ACL2021 main track

  32. arXiv:2103.02523  [pdf, other

    cs.CL

    NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash Commands

    Authors: Mayank Agarwal, Tathagata Chakraborti, Quchen Fu, David Gros, Xi Victoria Lin, Jaron Maene, Kartik Talamadupula, Zhongwei Teng, Jules White

    Abstract: The NLC2CMD Competition hosted at NeurIPS 2020 aimed to bring the power of natural language processing to the command line. Participants were tasked with building models that can transform descriptions of command line tasks in English to their Bash syntax. This is a report on the competition with details of the task, metrics, data, attempted solutions, and lessons learned.

    Submitted 8 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Appears in PMLR Volume 133: NeurIPS 2020 Competition and Demonstration Track. Competition URL: http://ibm.biz/nlc2cmd

  33. arXiv:2012.15197  [pdf, other

    cs.CL cs.AI

    SemGloVe: Semantic Co-occurrences for GloVe from BERT

    Authors: Leilei Gan, Zhiyang Teng, Yue Zhang, Linchao Zhu, Fei Wu, Yi Yang

    Abstract: GloVe learns word embeddings by leveraging statistical information from word co-occurrence matrices. However, word pairs in the matrices are extracted from a predefined local context window, which might lead to limited word pairs and potentially semantic irrelevant word pairs. In this paper, we propose SemGloVe, which distills semantic co-occurrences from BERT into static GloVe word embeddings. Pa… ▽ More

    Submitted 24 November, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: 10 pages, 3 figures, 5 tables

  34. arXiv:2012.04395  [pdf, other

    cs.CL

    End-to-End Chinese Parsing Exploiting Lexicons

    Authors: Yuan Zhang, Zhiyang Teng, Yue Zhang

    Abstract: Chinese parsing has traditionally been solved by three pipeline systems including word-segmentation, part-of-speech tagging and dependency parsing modules. In this paper, we propose an end-to-end Chinese parsing model based on character inputs which jointly learns to output word segmentation, part-of-speech tags and dependency structures. In particular, our parsing model relies on word-char graph… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  35. arXiv:2010.04383  [pdf, other

    cs.CL

    Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation

    Authors: Yan Zhang, Zhijiang Guo, Zhiyang Teng, Wei Lu, Shay B. Cohen, Zuozhu Liu, Lidong Bing

    Abstract: AMR-to-text generation is used to transduce Abstract Meaning Representation structures (AMR) into text. A key challenge in this task is to efficiently learn effective graph representations. Previously, Graph Convolution Networks (GCNs) were used to encode input AMRs, however, vanilla GCNs are not able to capture non-local information and additionally, they follow a local (first-order) information… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: Accepted to EMNLP 2020, long paper

  36. arXiv:2008.06388  [pdf

    cs.LG cs.CV eess.IV stat.ML

    Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans

    Authors: Michael Roberts, Derek Driggs, Matthew Thorpe, Julian Gilbey, Michael Yeung, Stephan Ursprung, Angelica I. Aviles-Rivero, Christian Etmann, Cathal McCague, Lucian Beer, Jonathan R. Weir-McCall, Zhongzhao Teng, Effrossyni Gkrania-Klotsas, James H. F. Rudd, Evis Sala, Carola-Bibiane Schönlieb

    Abstract: Machine learning methods offer great promise for fast and accurate detection and prognostication of COVID-19 from standard-of-care chest radiographs (CXR) and computed tomography (CT) images. Many articles have been published in 2020 describing new machine learning-based models for both of these tasks, but it is unclear which are of potential clinical utility. In this systematic review, we search… ▽ More

    Submitted 5 January, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: 35 pages, 3 figures, 2 tables, updated to the period 1 January 2020 - 3 October 2020

    Journal ref: Nature Machine Intelligence 3, 199-217 (2021)

  37. Dialogue State Induction Using Neural Latent Variable Models

    Authors: Qingkai Min, Libo Qin, Zhiyang Teng, Xiao Liu, Yue Zhang

    Abstract: Dialogue state modules are a useful component in a task-oriented dialogue system. Traditional methods find dialogue states by manually labeling training corpora, upon which neural models are trained. However, the labeling process can be costly, slow, error-prone, and more importantly, cannot cover the vast range of domains in real-world dialogues for customer service. We propose the task of dialog… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: IJCAI 2020

  38. arXiv:2006.11199  [pdf, other

    cs.HC

    Modeling Individual and Team Behavior through Spatio-temporal Analysis

    Authors: Sabbir Ahmad, Andy Bryant, Erica Kleinman, Zhaoqing Teng, Truong-Huy D. Nguyen, Magy Seif El-Nasr

    Abstract: Modeling players' behaviors in games has gained increased momentum in the past few years. This area of research has wide applications, including modeling learners and understanding player strategies, to mention a few. In this paper, we present a new methodology, called Interactive Behavior Analytics (IBA), comprised of two visualization systems, a labeling mechanism, and abstraction algorithms tha… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Journal ref: CHI Play 2019

  39. arXiv:2006.10823  [pdf, other

    cs.HC

    "And then they died": Using Action Sequences for Data Driven,Context Aware Gameplay Analysis

    Authors: Erica Kleinman, Sabbir Ahmad, Zhaoqing Teng, Andy Bryant, Truong-Huy D. Nguyen, Casper Harteveld, Magy Seif El-Nasr

    Abstract: Many successful games rely heavily on data analytics to understand players and inform design. Popular methodologies focus on machine learning and statistical analysis of aggregated data. While effective in extracting information regarding player action, much of the context regarding when and how those actions occurred is lost. Qualitative methods allow researchers to examine context and derive mea… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Journal ref: Foundations of Digital Games 2020

  40. arXiv:2004.02757  [pdf, other

    cs.CV cs.LG stat.ML

    Efficient Deep Representation Learning by Adaptive Latent Space Sampling

    Authors: Yuanhan Mo, Shuo Wang, Chengliang Dai, Rui Zhou, Zhongzhao Teng, Wenjia Bai, Yike Guo

    Abstract: Supervised deep learning requires a large amount of training samples with annotations (e.g. label class for classification task, pixel- or voxel-wised label map for segmentation tasks), which are expensive and time-consuming to obtain. During the training of a deep neural network, the annotated samples are fed into the network in a mini-batch way, where they are often regarded of equal importance.… ▽ More

    Submitted 12 April, 2020; v1 submitted 19 March, 2020; originally announced April 2020.

  41. arXiv:1910.02450  [pdf

    cs.LG stat.ML

    Mobile APP User Attribute Prediction by Heterogeneous Information Network Modeling

    Authors: Hekai Zhang, Jibing Gong, Zhiyong Teng, Dan Wang, Hongfei Wang, Linfeng Du, Zakirul Alam Bhuiyan

    Abstract: User-based attribute information, such as age and gender, is usually considered as user privacy information. It is difficult for enterprises to obtain user-based privacy attribute information. However, user-based privacy attribute information has a wide range of applications in personalized services, user behavior analysis and other aspects. this paper advances the HetPathMine model and puts forwa… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: 10 pages,3 figures,International Conference on Dependability in Sensor, Cloud, and Big Data Systems and Applications

  42. arXiv:1908.05957  [pdf, other

    cs.CL

    Densely Connected Graph Convolutional Networks for Graph-to-Sequence Learning

    Authors: Zhijiang Guo, Yan Zhang, Zhiyang Teng, Wei Lu

    Abstract: We focus on graph-to-sequence learning, which can be framed as transducing graph structures to sequences for text generation. To capture structural information associated with graphs, we investigate the problem of encoding graphs using graph convolutional networks (GCNs). Unlike various existing approaches where shallow architectures were used for capturing local structural information only, we in… ▽ More

    Submitted 9 September, 2019; v1 submitted 16 August, 2019; originally announced August 2019.

    Comments: Conditional accepted by TACL on December 2018, accepted by TACL on February 2019

  43. arXiv:1904.09445  [pdf, other

    cs.CR cs.IT eess.SY

    Performance and Resilience of Cyber-Physical Control Systems with Reactive Attack Mitigation

    Authors: Subhash Lakshminarayana, Jabir Shabbir Karachiwala, Teo Zhan Teng, Rui Tan, David K. Y. Yau

    Abstract: This paper studies the performance and resilience of a linear cyber-physical control system (CPCS) with attack detection and reactive attack mitigation in the context of power grids. It addresses the problem of deriving an optimal sequence of false data injection attacks that maximizes the state estimation error of the power system. The results provide basic understanding about the limit of the at… ▽ More

    Submitted 20 April, 2019; originally announced April 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1706.01628

    Journal ref: IEEE Trans. on Smart Grids, 2019

  44. arXiv:1904.05020   

    cs.CV

    Imitating Targets from all sides: An Unsupervised Transfer Learning method for Person Re-identification

    Authors: Jiajie Tian, Zhu Teng, Rui Li, Yan Li, Baopeng Zhang, Jian** Fan

    Abstract: Person re-identification (Re-ID) models usually show a limited performance when they are trained on one dataset and tested on another dataset due to the inter-dataset bias (e.g. completely different identities and backgrounds) and the intra-dataset difference (e.g. camera invariance). In terms of this issue, given a labelled source training set and an unlabelled target training set, we propose an… ▽ More

    Submitted 27 April, 2021; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: The author and result of model have changed

  45. arXiv:1811.02629  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko , et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem… ▽ More

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  46. arXiv:1808.04850  [pdf, other

    cs.CL

    Two Local Models for Neural Constituent Parsing

    Authors: Zhiyang Teng, Yue Zhang

    Abstract: Non-local features have been exploited by syntactic parsers for capturing dependencies between sub output structures. Such features have been a key to the success of state-of-the-art statistical parsers. With the rise of deep learning, however, it has been shown that local output decisions can give highly competitive accuracies, thanks to the power of dense neural input representations that embody… ▽ More

    Submitted 28 August, 2018; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: COLING 2018

  47. arXiv:1709.07574  [pdf, other

    cs.CR eess.SY

    Modeling and Detecting False Data Injection Attacks against Railway Traction Power Systems

    Authors: Subhash Lakshminarayana, Teo Zhan Teng, Rui Tan, David K. Y. Yau

    Abstract: Modern urban railways extensively use computerized sensing and control technologies to achieve safe, reliable, and well-timed operations. However, the use of these technologies may provide a convenient leverage to cyber-attackers who have bypassed the air gaps and aim at causing safety incidents and service disruptions. In this paper, we study false data injection (FDI) attacks against railways' t… ▽ More

    Submitted 21 September, 2017; originally announced September 2017.

    Comments: IEEE/IFIP DSN-2016 and ACM Trans. on Cyber-Physical Systems

  48. arXiv:1708.07279  [pdf, other

    cs.CL

    Combining Discrete and Neural Features for Sequence Labeling

    Authors: Jie Yang, Zhiyang Teng, Meishan Zhang, Yue Zhang

    Abstract: Neural network models have recently received heated research attention in the natural language processing community. Compared with traditional models with discrete features, neural models have two main advantages. First, they take low-dimensional, real-valued embedding vectors as inputs, which can be trained over large raw data, thereby addressing the issue of feature sparsity in discrete models.… ▽ More

    Submitted 24 August, 2017; originally announced August 2017.

    Comments: Accepted by International Conference on Computational Linguistics and Intelligent Text Processing (CICLing) 2016, April

  49. arXiv:1706.01628  [pdf, other

    cs.CR cs.IT eess.SY

    Optimal Attack against Cyber-Physical Control Systems with Reactive Attack Mitigation

    Authors: Subhash Lakshminarayana, Teo Zhan Teng, David K. Y. Yau, Rui Tan

    Abstract: This paper studies the performance and resilience of a cyber-physical control system (CPCS) with attack detection and reactive attack mitigation. It addresses the problem of deriving an optimal sequence of false data injection attacks that maximizes the state estimation error of the system. The results provide basic understanding about the limit of the attack impact. The design of the optimal atta… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

  50. arXiv:1611.06788  [pdf, other

    cs.CL

    Bidirectional Tree-Structured LSTM with Head Lexicalization

    Authors: Zhiyang Teng, Yue Zhang

    Abstract: Sequential LSTM has been extended to model tree structures, giving competitive results for a number of tasks. Existing methods model constituent trees by bottom-up combinations of constituent nodes, making direct use of input word information only for leaf nodes. This is different from sequential LSTMs, which contain reference to input words for each node. In this paper, we propose a method for au… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: 12 pages, 6 figures