Showing 1–50 of 122 results for author: Mou, L

  1. arXiv:2406.09402  [pdf, other]

    cs.CV cs.AI cs.LG

    Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion

    Authors: Linzhan Mou, Jun-Kun Chen, Yu-Xiong Wang

    Abstract: This paper proposes Instruct 4D-to-4D that achieves 4D awareness and spatial-temporal consistency for 2D diffusion models to generate high-quality instruction-guided dynamic scene editing results. Traditional applications of 2D diffusion models in dynamic scene editing often result in inconsistency, primarily due to their inherent frame-by-frame editing methodology. Addressing the complexities of…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: CVPR 2024

  2. arXiv:2406.05995  [pdf, other]

    cs.CL cs.AI cs.LG

    A Dual-View Approach to Classifying Radiology Reports by Co-Training

    Authors: Yutong Han, Yan Yuan, Lili Mou

    Abstract: Radiology report analysis provides valuable information that can aid with public health initiatives, and has been attracting increasing attention from the research community. In this work, we present a novel insight that the structure of a radiology report (namely, the Findings and Impression sections) offers different views of a radiology scan. Based on this intuition, we further propose a co-tra…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by LREC-COLING 2024

  3. arXiv:2406.01721  [pdf, other]

    cs.CL

    Rotation and Permutation for Advanced Outlier Management and Efficient Quantization of LLMs

    Authors: Haokun Lin, Haobo Xu, Yichen Wu, Jingzhi Cui, Yingtao Zhang, Linzhan Mou, Linqi Song, Zhenan Sun, Ying Wei

    Abstract: Quantizing large language models (LLMs) presents significant challenges, primarily due to outlier activations that compromise the efficiency of low-bit representation. Traditional approaches mainly focus on solving Normal Outliers: activations with consistently high magnitudes across all tokens. However, these techniques falter when dealing with Massive Outliers, which are significantly higher in v…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 26 pages, 13 figures

  4. arXiv:2406.00341  [pdf, other]

    eess.IV cs.CV

    DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation

    Authors: Qihang Xie, Mengguo Guo, Lei Mou, Dan Zhang, Da Chen, Caifeng Shan, Yitian Zhao, Ruisheng Su, Jiong Zhang

    Abstract: Cerebrovascular diseases (CVDs) remain a leading cause of global disability and mortality. Digital Subtraction Angiography (DSA) sequences, recognized as the golden standard for diagnosing CVDs, can clearly visualize the dynamic flow and reveal pathological conditions within the cerebrovasculature. Therefore, precise segmentation of cerebral arteries (CAs) and classification between their main tru…

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.11277  [pdf, other]

    cs.CL cs.AI cs.LG

    Action Controlled Paraphrasing

    Authors: Ning Shi, Zijun Wu, Lili Mou

    Abstract: Recent studies have demonstrated the potential to control paraphrase generation, such as through syntax, which has broad applications in various downstream tasks. However, these methods often require detailed parse trees or syntactic exemplars, which are not user-friendly. Furthermore, an inference gap exists, as control specifications are only available during training but not inference. In this…

    Submitted 18 May, 2024; originally announced May 2024.

  6. arXiv:2403.00144  [pdf, other]

    cs.CL cs.AI cs.LG

    EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation

    Authors: Yuqiao Wen, Behzad Shayegh, Chenyang Huang, Yanshuai Cao, Lili Mou

    Abstract: The ability of zero-shot translation emerges when we train a multilingual model with certain translation directions; the model can then directly translate in unseen directions. Alternatively, zero-shot translation can be accomplished by pivoting through a third language (e.g., English). In our work, we observe that both direct and pivot translations are noisy and achieve less satisfactory performa…

    Submitted 29 February, 2024; originally announced March 2024.

    ACM Class: I.2.7; I.2.6; I.2.m; I.5.1; I.7.m

  7. arXiv:2403.00143  [pdf, other]

    cs.CL cs.AI cs.LG

    Ensemble-Based Unsupervised Discontinuous Constituency Parsing by Tree Averaging

    Authors: Behzad Shayegh, Yuqiao Wen, Lili Mou

    Abstract: We address unsupervised discontinuous constituency parsing, where we observe a high variance in the performance of the only previous model. We propose to build an ensemble of different runs of the existing discontinuous parser by averaging the predicted trees, to stabilize and boost performance. To begin with, we provide comprehensive computational complexity analysis (in terms of P and NP-complet…

    Submitted 29 February, 2024; originally announced March 2024.

  8. arXiv:2402.11325  [pdf, other]

    cs.CV

    ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models

    Authors: Zhenghang Yuan, Zhitong Xiong, Lichao Mou, Xiao Xiang Zhu

    Abstract: An in-depth comprehension of global land cover is essential in Earth observation, forming the foundation for a multitude of applications. Although remote sensing technology has advanced rapidly, leading to a proliferation of satellite imagery, the inherent complexity of these images often makes them difficult for non-expert users to understand. Natural language, as a carrier of human knowledge, ca…

    Submitted 26 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  9. arXiv:2402.03295  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks

    Authors: Yongchang Hao, Yanshuai Cao, Lili Mou

    Abstract: Second-order optimization approaches like the generalized Gauss-Newton method are considered more powerful as they utilize the curvature information of the objective function with preconditioning matrices. Albeit offering tempting theoretical benefits, they are not easily applicable to modern deep learning. The major reason is due to the quadratic memory and cubic time complexity to compute the in…

    Submitted 5 February, 2024; originally announced February 2024.

  10. arXiv:2402.03293  [pdf, other]

    cs.LG cs.AI stat.ML

    Flora: Low-Rank Adapters Are Secretly Gradient Compressors

    Authors: Yongchang Hao, Yanshuai Cao, Lili Mou

    Abstract: Despite large neural networks demonstrating remarkable abilities to complete different tasks, they require excessive memory usage to store the optimization states for training. To alleviate this, the low-rank adaptation (LoRA) is proposed to reduce the optimization states by training fewer parameters. However, LoRA restricts overall weight update matrices to be low-rank, limiting the model perform…

    Submitted 12 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted @ ICML 2024

  11. arXiv:2310.01717  [pdf, other]

    cs.CL cs.AI cs.LG

    Ensemble Distillation for Unsupervised Constituency Parsing

    Authors: Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu, Jackie C. K. Cheung, Lili Mou

    Abstract: We investigate the unsupervised constituency parsing task, which organizes words and phrases of a sentence into a hierarchical structure without using linguistically annotated data. We observe that existing unsupervised parsers capture differing aspects of parsing structures, which can be leveraged to enhance unsupervised parsing performance. To this end, we propose a notion of "tree averaging," b…

    Submitted 25 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by International Conference on Learning Representations (ICLR) 2024

  12. arXiv:2310.01691  [pdf, other]

    cs.CL cs.AI

    Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models

    Authors: Zijun Wu, Yongkang Wu, Lili Mou

    Abstract: Prompt tuning in natural language processing (NLP) has become an increasingly popular method for adapting large language models to specific tasks. However, the transferability of these prompts, especially continuous prompts, between different models remains a challenge. In this work, we propose a zero-shot continuous prompt transfer method, where source prompts are encoded into relative space and…

    Submitted 2 October, 2023; originally announced October 2023.

  13. arXiv:2309.13072  [pdf, other]

    cs.CL cs.AI cs.LG

    Weakly Supervised Reasoning by Neuro-Symbolic Approaches

    Authors: Xianggen Liu, Zhengdong Lu, Lili Mou

    Abstract: Deep learning has largely improved the performance of various natural language processing (NLP) tasks. However, most deep learning models are black-box machinery, and lack explicit interpretation. In this chapter, we will introduce our recent progress on neuro-symbolic approaches to NLP, which combines different schools of AI, namely, symbolism and connectionism. Generally, we will design a neural…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Compendium of Neurosymbolic Artificial Intelligence, 665--692, 2023, IOS Press

  14. Search and Learning for Unsupervised Text Generation

    Authors: Lili Mou

    Abstract: With the advances of deep learning techniques, text generation is attracting increasing interest in the artificial intelligence (AI) community, because of its wide applications and because it is an essential component of AI. Traditional text generation systems are trained in a supervised way, requiring massive labeled parallel corpora. In this paper, I will introduce our recent work on search and…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: AI Magazine, 43(4), 344--352, 2022

  15. arXiv:2309.04919  [pdf, other]

    cs.CL

    Unsupervised Chunking with Hierarchical RNN

    Authors: Zijun Wu, Anup Anand Deshmukh, Yongkang Wu, Jimmy Lin, Lili Mou

    Abstract: In Natural Language Processing (NLP), predicting linguistic structures, such as parsing and chunking, has mostly relied on manual annotations of syntactic structures. This paper introduces an unsupervised approach to chunking, a syntactic task that involves grouping words in a non-hierarchical manner. We present a two-layer Hierarchical Recurrent Neural Network (HRNN) designed to model word-to-chu…

    Submitted 9 September, 2023; originally announced September 2023.

  16. arXiv:2308.07903  [pdf, other]

    cs.CV cs.AI cs.GR

    Relightable and Animatable Neural Avatar from Sparse-View Video

    Authors: Zhen Xu, Sida Peng, Chen Geng, Linzhan Mou, Zihan Yan, Jiaming Sun, Hujun Bao, Xiaowei Zhou

    Abstract: This paper tackles the challenge of creating relightable and animatable neural avatars from sparse-view (or even monocular) videos of dynamic humans under unknown illumination. Compared to studio environments, this setting is more practical and accessible but poses an extremely challenging ill-posed problem. Previous neural human reconstruction methods are able to reconstruct animatable avatars fr…

    Submitted 17 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Project page: https://zju3dv.github.io/relightable_avatar

  17. arXiv:2307.15190  [pdf, other]

    cs.CL cs.LG

    f-Divergence Minimization for Sequence-Level Knowledge Distillation

    Authors: Yuqiao Wen, Zichao Li, Wenyu Du, Lili Mou

    Abstract: Knowledge distillation (KD) is the process of transferring knowledge from a large model to a small one. It has gained increasing attention in the natural language processing community, driven by the demands of compressing ever-growing language models. In this work, we propose an f-DISTILL framework, which formulates sequence-level knowledge distillation as minimizing a generalized f-divergence fun…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted by ACL 2023

    ACM Class: I.2.7; I.2.6; I.2.m; I.5.1; I.7.m

  18. arXiv:2307.11100  [pdf, other]

    cs.CV cs.MM

    CSSL-RHA: Contrastive Self-Supervised Learning for Robust Handwriting Authentication

    Authors: Jingyao Wang, Luntian Mou, Changwen Zheng, Wen Gao

    Abstract: Handwriting authentication is a valuable tool used in various fields, such as fraud prevention and cultural heritage protection. However, it remains a challenging task due to the complex features, severe damage, and lack of supervision. In this paper, we propose a novel Contrastive Self-Supervised Learning framework for Robust Handwriting Authentication (CSSL-RHA) to address these issues. It can d…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 10 pages, 4 figures, 3 tables, submitted to ACM MM 2023

  19. arXiv:2307.03461  [pdf, other]

    cs.CV eess.IV

    A Deep Active Contour Model for Delineating Glacier Calving Fronts

    Authors: Konrad Heidler, Lichao Mou, Erik Loebel, Mirko Scheinert, Sébastien Lefèvre, Xiao Xiang Zhu

    Abstract: Choosing how to encode a real-world problem as a machine learning task is an important design decision in machine learning. The task of glacier calving front modeling has often been approached as a semantic segmentation task. Recent studies have shown that combining segmentation with edge detection can improve the accuracy of calving front detectors. Building on this observation, we completely rep…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: This work has been accepted by IEEE TGRS for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2306.08625  [pdf, other]

    cs.CV

    RRSIS: Referring Remote Sensing Image Segmentation

    Authors: Zhenghang Yuan, Lichao Mou, Yuansheng Hua, Xiao Xiang Zhu

    Abstract: Localizing desired objects from remote sensing images is of great use in practical applications. Referring image segmentation, which aims at segmenting out the objects to which a given expression refers, has been extensively studied in natural images. However, almost no research attention is given to this task of remote sensing imagery. Considering its potential for real-world applications, in thi…

    Submitted 1 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

  21. arXiv:2306.00483  [pdf, other]

    cs.CV

    Overcoming Language Bias in Remote Sensing Visual Question Answering via Adversarial Training

    Authors: Zhenghang Yuan, Lichao Mou, Xiao Xiang Zhu

    Abstract: The Visual Question Answering (VQA) system offers a user-friendly interface and enables human-computer interaction. However, VQA models commonly face the challenge of language bias, resulting from the learned superficial correlation between questions and answers. To address this issue, in this study, we present a novel framework to reduce the language bias of the VQA for remote sensing data (RSVQA…

    Submitted 1 June, 2023; originally announced June 2023.

  22. arXiv:2305.14914  [pdf, other]

    cs.CV

    GAMUS: A Geometry-aware Multi-modal Semantic Segmentation Benchmark for Remote Sensing Data

    Authors: Zhitong Xiong, Sining Chen, Yi Wang, Lichao Mou, Xiao Xiang Zhu

    Abstract: Geometric information in the normalized digital surface models (nDSM) is highly correlated with the semantic class of the land cover. Exploiting two modalities (RGB and nDSM (height)) jointly has great potential to improve the segmentation performance. However, it is still an under-explored field in remote sensing due to the following challenges. First, the scales of existing datasets are relative…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 13 pages

  23. arXiv:2304.03844  [pdf, other]

    cs.CV

    Multilingual Augmentation for Robust Visual Question Answering in Remote Sensing Images

    Authors: Zhenghang Yuan, Lichao Mou, Xiao Xiang Zhu

    Abstract: Aiming at answering questions based on the content of remotely sensed images, visual question answering for remote sensing data (RSVQA) has attracted much attention nowadays. However, previous works in RSVQA have focused little on the robustness of RSVQA. As we aim to enhance the reliability of RSVQA models, how to learn robust representations against new words and different question templates wit…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: This paper was submitted to the JURSE 2023 conference on November 5, 2022

  24. arXiv:2302.07224  [pdf, other]

    cs.CV

    Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask

    Authors: Shangzhan Zhang, Sida Peng, Tianrun Chen, Linzhan Mou, Haotong Lin, Kaicheng Yu, Yiyi Liao, Xiaowei Zhou

    Abstract: We introduce a novel approach that takes a single semantic mask as input to synthesize multi-view consistent color images of natural scenes, trained with a collection of single images from the Internet. Prior works on 3D-aware image synthesis either require multi-view supervision or learning category-level prior for specific classes of objects, which can hardly work for natural scenes. Our key ide…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Project website: https://zju3dv.github.io/paintingnature/

  25. arXiv:2301.11997  [pdf, other]

    cs.CL cs.AI cs.LG

    Prompt-Based Editing for Text Style Transfer

    Authors: Guoqing Luo, Yu Tong Han, Lili Mou, Mauajama Firdaus

    Abstract: Prompting approaches have been recently explored in text style transfer, where a textual prompt is used to query a pretrained language model to generate style-transferred texts word by word in an autoregressive manner. However, such a generation process is less controllable and early prediction errors may affect future word predictions. In this paper, we present a prompt-based editing approach for…

    Submitted 22 December, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted by EMNLP Findings 2023

  26. arXiv:2210.08708  [pdf, other]

    cs.LG cs.AI cs.CL

    Teacher Forcing Recovers Reward Functions for Text Generation

    Authors: Yongchang Hao, Yuxin Liu, Lili Mou

    Abstract: Reinforcement learning (RL) has been widely used in text generation to alleviate the exposure bias issue or to utilize non-parallel datasets. The reward function plays an important role in making RL training successful. However, previous reward functions are typically task-specific and sparse, restricting the use of RL. In our work, we propose a task-agnostic approach that derives a step-wise rewa…

    Submitted 18 January, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  27. arXiv:2209.14627  [pdf, other]

    cs.CL cs.AI cs.LG

    An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation

    Authors: Yuqiao Wen, Yongchang Hao, Yanshuai Cao, Lili Mou

    Abstract: Open-domain dialogue systems aim to interact with humans through natural language texts in an open-ended fashion. Despite the recent success of super large dialogue systems such as ChatGPT, using medium-to-small-sized dialogue systems remains the common practice as they are more lightweight and accessible; however, generating diverse dialogue responses is challenging, especially with smaller model…

    Submitted 24 March, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted by ICLR 2023

    ACM Class: I.2.7; I.2.6

  28. Anomaly Detection in Aerial Videos with Transformers

    Authors: Pu Jin, Lichao Mou, Gui-Song Xia, Xiao Xiang Zhu

    Abstract: Unmanned aerial vehicles (UAVs) are widely applied for purposes of inspection, search, and rescue operations by the virtue of low-cost, large-coverage, real-time, and high-resolution data acquisition capacities. Massive volumes of aerial videos are produced in these processes, in which normal events often account for an overwhelming proportion. It is extremely difficult to localize and extract abn…

    Submitted 25 September, 2022; originally announced September 2022.

  29. FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification

    Authors: Pu Jin, Lichao Mou, Yuansheng Hua, Gui-Song Xia, Xiao Xiang Zhu

    Abstract: Unmanned aerial vehicles (UAVs) are now widely applied to data acquisition due to its low cost and fast mobility. With the increasing volume of aerial videos, the demand for automatically parsing these videos is surging. To achieve this, current researches mainly focus on extracting a holistic feature with convolutions along both spatial and temporal dimensions. However, these methods are limited…

    Submitted 22 September, 2022; originally announced September 2022.

  30. arXiv:2208.05162  [pdf, ps, other]

    cs.SD cs.LG cs.MM eess.AS

    Controlling Perceived Emotion in Symbolic Music Generation with Monte Carlo Tree Search

    Authors: Lucas N. Ferreira, Lili Mou, Jim Whitehead, Levi H. S. Lelis

    Abstract: This paper presents a new approach for controlling emotion in symbolic music generation with Monte Carlo Tree Search. We use Monte Carlo Tree Search as a decoding mechanism to steer the probability distribution learned by a language model towards a given emotion. At every step of the decoding process, we use Predictor Upper Confidence for Trees (PUCT) to search for sequences that maximize the aver…

    Submitted 1 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at the 18th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE-22)

  31. arXiv:2206.13188  [pdf, other]

    cs.CV

    Self-supervised Learning in Remote Sensing: A Review

    Authors: Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham, Lichao Mou, Xiao Xiang Zhu

    Abstract: In deep learning research, self-supervised learning (SSL) has received great attention triggering interest within both the computer vision and remote sensing communities. While there has been a big success in computer vision, most of the potential of SSL in the domain of earth observation remains locked. In this paper, we provide an introduction to, and a review of the concepts and latest developm…

    Submitted 2 September, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted by IEEE Geoscience and Remote Sensing Magazine. 32 pages, 22 content pages

  32. arXiv:2206.12117  [pdf, other]

    cs.CV cs.LG

    Self Supervised Learning for Few Shot Hyperspectral Image Classification

    Authors: Nassim Ait Ali Braham, Lichao Mou, Jocelyn Chanussot, Julien Mairal, Xiao Xiang Zhu

    Abstract: Deep learning has proven to be a very effective approach for Hyperspectral Image (HSI) classification. However, deep neural networks require large annotated datasets to generalize well. This limits the applicability of deep learning for HSI classification, where manually labelling thousands of pixels for every scene is impractical. In this paper, we propose to leverage Self Supervised Learning (SS…

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted in IGARSS 2022

  33. arXiv:2205.14522  [pdf, other]

    cs.CL cs.LG

    A Character-Level Length-Control Algorithm for Non-Autoregressive Sentence Summarization

    Authors: Puyuan Liu, Xiang Zhang, Lili Mou

    Abstract: Sentence summarization aims at compressing a long sentence into a short one that keeps the main gist, and has extensive real-world applications such as headline generation. In previous work, researchers have developed various approaches to improve the ROUGE score, which is the main evaluation metric for summarization, whereas controlling the summary length has not drawn much attention. In our work…

    Submitted 16 October, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: Accepted by NeurIPS22

  34. arXiv:2205.14521  [pdf, other]

    cs.CL cs.LG

    Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization

    Authors: Puyuan Liu, Chenyang Huang, Lili Mou

    Abstract: Text summarization aims to generate a short summary for an input text. In this work, we propose a Non-Autoregressive Unsupervised Summarization (NAUS) approach, which does not require parallel data for training. Our NAUS first performs edit-based search towards a heuristically defined score, and generates a summary as pseudo-groundtruth. Then, we train an encoder-only non-autoregressive Transforme…

    Submitted 28 May, 2022; originally announced May 2022.

    Report number: Published in ACL, pages 7916--7929, 2022

  35. From Easy to Hard: Learning Language-guided Curriculum for Visual Question Answering on Remote Sensing Data

    Authors: Zhenghang Yuan, Lichao Mou, Qi Wang, Xiao Xiang Zhu

    Abstract: Visual question answering (VQA) for remote sensing scene has great potential in intelligent human-computer interaction system. Although VQA in computer vision has been widely researched, VQA for remote sensing data (RSVQA) is still in its infancy. There are two characteristics that need to be specially considered for the RSVQA task. 1) No object annotations are available in RSVQA datasets, which m…

    Submitted 6 May, 2022; originally announced May 2022.

  36. arXiv:2204.12679  [pdf, other]

    cs.CL

    Document-Level Relation Extraction with Sentences Importance Estimation and Focusing

    Authors: Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao

    Abstract: Document-level relation extraction (DocRE) aims to determine the relation between two entities from a document of multiple sentences. Recent studies typically represent the entire document by sequence- or graph-based models to predict the relations of all entity pairs. However, we find that such a model is not robust and exhibits bizarre behaviors: it predicts correctly when an entire test documen…

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Accepted to NAACL 2022

  37. arXiv:2202.04348  [pdf, other]

    cs.LG

    MBCT: Tree-Based Feature-Aware Binning for Individual Uncertainty Calibration

    Authors: Siguang Huang, Yunli Wang, Lili Mou, Huayue Zhang, Han Zhu, Chuan Yu, Bo Zheng

    Abstract: Most machine learning classifiers only concern classification accuracy, while certain applications (such as medical diagnosis, meteorological forecasting, and computation advertising) require the model to predict the true probability, known as a calibrated estimate. In previous work, researchers have developed several calibration methods to post-process the outputs of a predictor to obtain calibra…

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: WWW 2022

  38. arXiv:2201.06219  [pdf, other]

    cs.CL cs.AI

    An Empirical Study on the Overlapping Problem of Open-Domain Dialogue Datasets

    Authors: Yuqiao Wen, Guoqing Luo, Lili Mou

    Abstract: Open-domain dialogue systems aim to converse with humans through text, and dialogue research has heavily relied on benchmark datasets. In this work, we observe the overlapping problem in DailyDialog and OpenSubtitles, two popular open-domain dialogue benchmark datasets. Our systematic analysis then shows that such overlapping can be exploited to obtain fake state-of-the-art performance. Finally, w…

    Submitted 8 May, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: Accepted by LREC 2022

    ACM Class: I.2.7; I.2.6

  39. Change Detection Meets Visual Question Answering

    Authors: Zhenghang Yuan, Lichao Mou, Zhitong Xiong, Xiaoxiang Zhu

    Abstract: The Earth's surface is continually changing, and identifying changes plays an important role in urban planning and sustainability. Although change detection techniques have been successfully developed for many years, these techniques are still limited to experts and facilitators in related fields. In order to provide every user with flexible access to change information and help them better unders…

    Submitted 2 October, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

  40. arXiv:2112.02770  [pdf, other]

    cs.CL

    Search and Learn: Improving Semantic Coverage for Data-to-Text Generation

    Authors: Shailza Jolly, Zi Xuan Zhang, Andreas Dengel, Lili Mou

    Abstract: Data-to-text generation systems aim to generate text descriptions based on input data (often represented in the tabular form). A typical system uses huge training samples for learning the correspondence between tables and texts. However, large training sets are expensive to obtain, limiting the applicability of these approaches in real-world scenarios. In this work, we focus on few-shot data-to-te…

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI'22

  41. arXiv:2111.09460  [pdf, other]

    eess.IV cs.CV

    Large-scale Building Height Retrieval from Single SAR Imagery based on Bounding Box Regression Networks

    Authors: Yao Sun, Lichao Mou, Yuanyuan Wang, Sina Montazeri, Xiao Xiang Zhu

    Abstract: Building height retrieval from synthetic aperture radar (SAR) imagery is of great importance for urban applications, yet highly challenging owing to the complexity of SAR data. This paper addresses the issue of building height retrieval in large-scale urban areas from a single TerraSAR-X spotlight or stripmap image. Based on the radar viewing geometry, we propose that this problem can be formulate…

    Submitted 17 November, 2021; originally announced November 2021.

  42. arXiv:2110.07515  [pdf, other]

    cs.CL

    Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision

    Authors: Chenyang Huang, Hao Zhou, Osmar R. Zaïane, Lili Mou, Lei Li

    Abstract: How do we perform efficient inference while retaining high translation quality? Existing neural machine translation models, such as Transformer, achieve high performance, but they decode words one by one, which is inefficient. Recent non-autoregressive translation models speed up the inference, but their quality is still inferior. In this work, we propose DSLP, a highly efficient and high-performa…

    Submitted 14 October, 2021; originally announced October 2021.

  43. Simulated annealing for optimization of graphs and sequences

    Authors: Xianggen Liu, Pengyong Li, Fandong Meng, Hao Zhou, Huasong Zhong, Jie Zhou, Lili Mou, Sen Song

    Abstract: Optimization of discrete structures aims at generating a new structure with the better property given an existing one, which is a fundamental problem in machine learning. Different from the continuous optimization, the realistic applications of discrete optimization (e.g., text generation) are very challenging due to the complex and long-range constraints, including both syntax and semantics, in d…

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: This article is an accepted manuscript of Neurocomputing. arXiv admin note: substantial text overlap with arXiv:1909.03588

    Journal ref: Neurocomputing, 465:310-324 (2021)

  44. Simulated Annealing for Emotional Dialogue Systems

    Authors: Chengzhang Dong, Chenyang Huang, Osmar Zaïane, Lili Mou

    Abstract: Explicitly modeling emotions in dialogue generation has important applications, such as building empathetic personal companions. In this study, we consider the task of expressing a specific emotion for dialogue generation. Previous approaches take the emotion as an input signal, which may be ignored during inference. We instead propose a search-based emotional dialogue system by simulated annealin…

    Submitted 22 September, 2021; originally announced September 2021.

    MSC Class: 68T50

  45. arXiv:2109.08927  [pdf, other]

    cs.CL cs.AI

    Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic

    Authors: Zijun Wu, Zi Xuan Zhang, Atharva Naik, Zhijian Mei, Mauajama Firdaus, Lili Mou

    Abstract: Natural language inference (NLI) aims to determine the logical relationship between two sentences, such as Entailment, Contradiction, and Neutral. In recent years, deep learning models have become a prevailing approach to NLI, but they lack interpretability and explainability. In this work, we address the explainability of NLI by weakly supervised logical reasoning, and propose an Explainable Phra…

    Submitted 22 February, 2023; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: Accepted by ICLR 2023

  46. SCIDA: Self-Correction Integrated Domain Adaptation from Single- to Multi-label Aerial Images

    Authors: Tianze Yu, Jianzhe Lin, Lichao Mou, Yuansheng Hua, Xiaoxiang Zhu, Z. Jane Wang

    Abstract: Most publicly available datasets for image classification are with single labels, while images are inherently multi-labeled in our daily life. Such an annotation gap makes many pre-trained single-label classification models fail in practical scenarios. This annotation issue is more concerned for aerial images: Aerial data collected from sensors naturally cover a relatively large land area with mul…

    Submitted 29 November, 2021; v1 submitted 15 August, 2021; originally announced August 2021.

  47. Bi-Temporal Semantic Reasoning for the Semantic Change Detection in HR Remote Sensing Images

    Authors: Lei Ding, Haitao Guo, Sicong Liu, Lichao Mou, Jing Zhang, Lorenzo Bruzzone

    Abstract: Semantic change detection (SCD) extends the multi-class change detection (MCD) task to provide not only the change locations but also the detailed land-cover/land-use (LCLU) categories before and after the observation intervals. This fine-grained semantic change information is very useful in many applications. Recent studies indicate that the SCD can be modeled through a triple-branch Convolutiona…

    Submitted 5 January, 2022; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: Manuscript to IEEE TGRS

  48. arXiv:2108.04222  [pdf, other]

    cs.CV

    Segmentation of VHR EO Images using Unsupervised Learning

    Authors: Sudipan Saha, Lichao Mou, Muhammad Shahzad, Xiao Xiang Zhu

    Abstract: Semantic segmentation is a crucial step in many Earth observation tasks. Large quantity of pixel-level annotation is required to train deep networks for semantic segmentation. Earth observation techniques are applied to varieties of applications and since classes vary widely depending on the applications, therefore, domain knowledge is often required to label Earth observation images, impeding ava…

    Submitted 10 August, 2021; v1 submitted 9 July, 2021; originally announced August 2021.

  49. arXiv:2108.00688  [pdf, other]

    cs.CV

    Self-supervised Audiovisual Representation Learning for Remote Sensing Data

    Authors: Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu

    Abstract: Many current deep learning approaches make extensive use of backbone networks pre-trained on large datasets like ImageNet, which are then fine-tuned to perform a certain task. In remote sensing, the lack of comparable large annotated datasets and the wide diversity of sensing platforms impedes similar developments. In order to contribute towards the availability of pre-trained backbone networks in…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  50. arXiv:2106.06462  [pdf, other]

    cs.CL

    Semi-Supervised and Unsupervised Sense Annotation via Translations

    Authors: Bradley Hauer, Grzegorz Kondrak, Yixing Luan, Arnob Mallik, Lili Mou

    Abstract: Acquisition of multilingual training data continues to be a challenge in word sense disambiguation (WSD). To address this problem, unsupervised approaches have been proposed to automatically generate sense annotations for training supervised WSD systems. We present three new methods for creating sense-annotated corpora which leverage translations, parallel bitexts, lexical resources, as well as co…

    Submitted 17 September, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: In proceedings of RANLP 2021