Skip to main content

Showing 1–31 of 31 results for author: Xia, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09438  [pdf, other

    math.OC cs.LG stat.ML

    Develo** Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

    Authors: Nachuan Xiao, Kuangyu Ding, Xiaoyin Hu, Kim-Chuan Toh

    Abstract: In this paper, we consider the minimization of a nonsmooth nonconvex objective function $f(x)$ over a closed convex subset $\mathcal{X}$ of $\mathbb{R}^n$, with additional nonsmooth nonconvex constraints $c(x) = 0$. We develop a unified framework for develo** Lagrangian-based methods, which takes a single-step update to the primal variables by some subgradient methods in each iteration. These su… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 30 pages, 4 figures

  2. arXiv:2403.11565  [pdf, other

    math.OC cs.LG

    Decentralized Stochastic Subgradient Methods for Nonsmooth Nonconvex Optimization

    Authors: Siyuan Zhang, Nachuan Xiao, Xin Liu

    Abstract: In this paper, we concentrate on decentralized optimization problems with nonconvex and nonsmooth objective functions, especially on the decentralized training of nonsmooth neural networks. We introduce a unified framework to analyze the global convergence of decentralized stochastic subgradient-based methods. We prove the global convergence of our proposed framework under mild conditions, by esta… ▽ More

    Submitted 27 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 22 pages

  3. arXiv:2402.12743  [pdf

    cs.CR cs.LG

    APT-MMF: An advanced persistent threat actor attribution method based on multimodal and multilevel feature fusion

    Authors: Nan Xiao, Bo Lang, Ting Wang, Yikai Chen

    Abstract: Threat actor attribution is a crucial defense strategy for combating advanced persistent threats (APTs). Cyber threat intelligence (CTI), which involves analyzing multisource heterogeneous data from APTs, plays an important role in APT actor attribution. The current attribution methods extract features from different CTI perspectives and employ machine learning models to classify CTI reports accor… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2402.03783  [pdf, other

    cs.CV

    Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning

    Authors: Fudan Zheng, **dong Cao, Weijiang Yu, Zhiguang Chen, Nong Xiao, Yutong Lu

    Abstract: Most advances in medical image recognition supporting clinical auxiliary diagnosis meet challenges due to the low-resource situation in the medical field, where annotations are highly expensive and professional. This low-resource problem can be alleviated by leveraging the transferable representations of large-scale pre-trained vision-language models via relevant medical text prompts. However, exi… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted by Pattern Recognition

  5. arXiv:2402.03754  [pdf, other

    cs.CV

    Intensive Vision-guided Network for Radiology Report Generation

    Authors: Fudan Zheng, Mengfei Li, Ying Wang, Weijiang Yu, Ruixuan Wang, Zhiguang Chen, Nong Xiao, Yutong Lu

    Abstract: Automatic radiology report generation is booming due to its huge application potential for the healthcare industry. However, existing computer vision and natural language processing approaches to tackle this problem are limited in two aspects. First, when extracting image features, most of them neglect multi-view reasoning in vision and model single-view structure of medical images, such as space-… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted by Physics in Medicine & Biology

  6. arXiv:2402.02149  [pdf, other

    cs.CV cs.LG

    Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance

    Authors: Xinyu Peng, Ziyang Zheng, Wenrui Dai, Nuoqian Xiao, Chenglin Li, Junni Zou, Hongkai Xiong

    Abstract: Recent diffusion models provide a promising zero-shot solution to noisy linear inverse problems without retraining for specific inverse problems. In this paper, we reveal that recent methods can be uniformly interpreted as employing a Gaussian approximation with hand-crafted isotropic covariance for the intractable denoising posterior to approximate the conditional posterior mean. Inspired by this… ▽ More

    Submitted 2 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  7. arXiv:2312.16046  [pdf, other

    cs.LG cs.AI physics.ao-ph

    AdaNAS: Adaptively Post-processing with Self-supervised Neural Architecture Search for Ensemble Rainfall Forecasts

    Authors: Yingpeng Wen, Weijiang Yu, Fudan Zheng, Dan Huang, Nong Xiao

    Abstract: Previous post-processing studies on rainfall forecasts using numerical weather prediction (NWP) mainly focus on statistics-based aspects, while learning-based aspects are rarely investigated. Although some manually-designed models are proposed to raise accuracy, they are customized networks, which need to be repeatedly tried and verified, at a huge cost in time and labor. Therefore, a self-supervi… ▽ More

    Submitted 4 February, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  8. arXiv:2310.08858  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Adam-family Methods with Decoupled Weight Decay in Deep Learning

    Authors: Kuangyu Ding, Nachuan Xiao, Kim-Chuan Toh

    Abstract: In this paper, we investigate the convergence properties of a wide class of Adam-family methods for minimizing quadratically regularized nonsmooth nonconvex optimization problems, especially in the context of training nonsmooth neural networks with weight decay. Motivated by the AdamW method, we propose a novel framework for Adam-family methods with decoupled weight decay. Within our framework, th… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 26 pages

  9. arXiv:2307.10053  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    SGD-type Methods with Guaranteed Global Stability in Nonsmooth Nonconvex Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Kim-Chuan Toh

    Abstract: In this paper, we focus on providing convergence guarantees for variants of the stochastic subgradient descent (SGD) method in minimizing nonsmooth nonconvex functions. We first develop a general framework to establish global stability for general stochastic subgradient methods, where the corresponding differential inclusion admits a coercive Lyapunov function. We prove that, with sufficiently sma… ▽ More

    Submitted 13 May, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 36 pages

  10. arXiv:2305.17351  [pdf, other

    cs.CL

    Disambiguated Lexically Constrained Neural Machine Translation

    Authors: **peng Zhang, Nini Xiao, Ke Wang, Chuanqi Dong, Xiangyu Duan, Yuqi Zhang, Min Zhang

    Abstract: Lexically constrained neural machine translation (LCNMT), which controls the translation generation with pre-specified constraints, is important in many practical applications. Current approaches to LCNMT typically assume that the pre-specified lexical constraints are contextually appropriate. This assumption limits their application to real-world scenarios where a source lexicon may have multiple… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 as a long paper (Findings), 12 pages, 3 figures

  11. arXiv:2305.03938  [pdf, other

    math.OC cs.LG stat.ML

    Adam-family Methods for Nonsmooth Optimization with Convergence Guarantees

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we present a comprehensive study on the convergence properties of Adam-family methods for nonsmooth optimization, especially in the training of nonsmooth neural networks. We introduce a novel two-timescale framework that adopts a two-timescale updating scheme, and prove its convergence properties under mild assumptions. Our proposed framework encompasses various popular Adam-family… ▽ More

    Submitted 19 February, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 53 pages

  12. arXiv:2212.02698  [pdf, other

    math.OC cs.MS

    CDOpt: A Python Package for a Class of Riemannian Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: Optimization over the embedded submanifold defined by constraints $c(x) = 0$ has attracted much interest over the past few decades due to its wide applications in various areas. Plenty of related optimization packages have been developed based on Riemannian optimization approaches, which rely on some basic geometrical materials of Riemannian manifolds, including retractions, vector transports, etc… ▽ More

    Submitted 28 March, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 31 pages

  13. arXiv:2211.08987  [pdf, other

    cs.CL

    TSMind: Alibaba and Soochow University's Submission to the WMT22 Translation Suggestion Task

    Authors: Xin Ge, Ke Wang, Jiayi Wang, Nini Xiao, Xiangyu Duan, Yu Zhao, Yuqi Zhang

    Abstract: This paper describes the joint submission of Alibaba and Soochow University, TSMind, to the WMT 2022 Shared Task on Translation Suggestion (TS). We participate in the English-German and English-Chinese tasks. Basically, we utilize the model paradigm fine-tuning on the downstream tasks based on large-scale pre-trained models, which has recently achieved great success. We choose FAIR's WMT19 English… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  14. arXiv:2211.08545  [pdf, other

    cs.CV cs.CL

    MapQA: A Dataset for Question Answering on Choropleth Maps

    Authors: Shuaichen Chang, David Palzer, Jialin Li, Eric Fosler-Lussier, Ningchuan Xiao

    Abstract: Choropleth maps are a common visual representation for region-specific tabular data and are used in a number of different venues (newspapers, articles, etc). These maps are human-readable but are often challenging to deal with when trying to extract data for screen readers, analyses, or other related tasks. Recent research into Visual-Question Answering (VQA) has studied question answering on huma… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  15. arXiv:2205.04686  [pdf, ps, other

    cs.CL

    AdMix: A Mixed Sample Data Augmentation Method for Neural Machine Translation

    Authors: Chang **, Shigui Qiu, Nini Xiao, Hao Jia

    Abstract: In Neural Machine Translation (NMT), data augmentation methods such as back-translation have proven their effectiveness in improving translation performance. In this paper, we propose a novel data augmentation approach for NMT, which is independent of any additional training data. Our approach, AdMix, consists of two parts: 1) introduce faint discrete noise (word replacement, word drop**, word s… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  16. arXiv:2108.02365  [pdf

    cs.CV cs.CL

    Hybrid Reasoning Network for Video-based Commonsense Captioning

    Authors: Weijiang Yu, Jian Liang, Lei Ji, Lu Li, Yuejian Fang, Nong Xiao, Nan Duan

    Abstract: The task of video-based commonsense captioning aims to generate event-wise captions and meanwhile provide multiple commonsense descriptions (e.g., attribute, effect and intention) about the underlying event in the video. Prior works explore the commonsense captions by using separate networks for different commonsense types, which is time-consuming and lacks mining the interaction of different comm… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 11 pages, 6 figures

    MSC Class: 68T07

  17. Identifying High Accuracy Regions in Traffic Camera Images to Enhance the Estimation of Road Traffic Metrics: A Quadtree-Based Method

    Authors: Yue Lin, Ningchuan Xiao

    Abstract: The growing number of real-time camera feeds in urban areas has made it possible to provide high-quality traffic data for effective transportation planning, operations, and management. However, deriving reliable traffic metrics from these camera feeds has been a challenge due to the limitations of current vehicle detection techniques, as well as the various camera conditions such as height and res… ▽ More

    Submitted 14 June, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

    Comments: Transportation Research Record (2022)

  18. arXiv:2106.03084  [pdf, other

    cs.CL

    Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

    Authors: **peng Zhang, Baijun Ji, Nini Xiao, Xiangyu Duan, Min Zhang, Yangbin Shi, Weihua Luo

    Abstract: Bilingual Lexicon Induction (BLI) aims to map words in one language to their translations in another, and is typically through learning linear projections to align monolingual word representation spaces. Two classes of word representations have been explored for BLI: static word embeddings and contextual representations, but there is no studies to combine both. In this paper, we propose a simple y… ▽ More

    Submitted 10 June, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted to Findings of ACL2021

  19. arXiv:2105.00381  [pdf, other

    cs.CV cs.AI

    AGMB-Transformer: Anatomy-Guided Multi-Branch Transformer Network for Automated Evaluation of Root Canal Therapy

    Authors: Yunxiang Li, Guodong Zeng, Yifan Zhang, Jun Wang, Qianni Zhang, Qun **, Lingling Sun, Qisi Lian, Neng Xia, Ruizi Peng, Kai Tang, Yaqi Wang, Shuai Wang

    Abstract: Accurate evaluation of the treatment result on X-ray images is a significant and challenging step in root canal therapy since the incorrect interpretation of the therapy results will hamper timely follow-up which is crucial to the patients' treatment outcome. Nowadays, the evaluation is performed in a manual manner, which is time-consuming, subjective, and error-prone. In this paper, we aim to aut… ▽ More

    Submitted 28 October, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: under review

  20. arXiv:2103.13814  [pdf, other

    cs.LG

    Dynamic Weighted Learning for Unsupervised Domain Adaptation

    Authors: Ni Xiao, Lei Zhang

    Abstract: Unsupervised domain adaptation (UDA) aims to improve the classification performance on an unlabeled target domain by leveraging information from a fully labeled source domain. Recent approaches explore domain-invariant and class-discriminant representations to tackle this task. These methods, however, ignore the interaction between domain alignment learning and class discrimination learning. As a… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: This paper has been accepted by CVPR2021

  21. arXiv:2011.02763  [pdf, other

    cs.CV cs.LG

    Robust Unsupervised Video Anomaly Detection by Multi-Path Frame Prediction

    Authors: Xuanzhao Wang, Zheng** Che, Bo Jiang, Ning Xiao, Ke Yang, Jian Tang, Jie** Ye, **gyu Wang, Qi Qi

    Abstract: Video anomaly detection is commonly used in many applications such as security surveillance and is very challenging.A majority of recent video anomaly detection approaches utilize deep reconstruction models, but their performance is often suboptimal because of insufficient reconstruction error differences between normal and abnormal video frames in practice. Meanwhile, frame prediction-based anoma… ▽ More

    Submitted 27 May, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

    Comments: Paper accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS). Article DOI: 10.1109/TNNLS.2021.3083152

  22. arXiv:2003.08770  [pdf, other

    cs.CV eess.IV

    ElixirNet: Relation-aware Network Architecture Adaptation for Medical Lesion Detection

    Authors: Chenhan Jiang, Shaoju Wang, Hang Xu, Xiaodan Liang, Nong Xiao

    Abstract: Most advances in medical lesion detection network are limited to subtle modification on the conventional detection network designed for natural images. However, there exists a vast domain gap between medical images and natural images where the medical image detection often suffers from several domain-specific challenges, such as high lesion/background similarity, dominant tiny lesions, and severe… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 7 pages, 5 figure, AAAI2020

  23. The Rockerverse: Packages and Applications for Containerization with R

    Authors: Daniel Nüst, Dirk Eddelbuettel, Dom Bennett, Robrecht Cannoodt, Dav Clark, Gergely Daroczi, Mark Edmondson, Colin Fay, Ellis Hughes, Lars Kjeldgaard, Sean Lopp, Ben Marwick, Heather Nolis, Jacqueline Nolis, Hong Ooi, Karthik Ram, Noam Ross, Lori Shepherd, Péter Sólymos, Tyson Lee Swetnam, Nitesh Turaga, Charlotte Van Petegem, Jason Williams, Craig Willis, Nan Xiao

    Abstract: The Rocker Project provides widely used Docker images for R across different application scenarios. This article surveys downstream projects that build upon the Rocker Project images and presents the current state of R packages for managing Docker images and controlling containers. These use cases cover diverse topics such as package development, reproducible research, collaborative work, cloud-ba… ▽ More

    Submitted 17 August, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: Source code for article available at https://github.com/nuest/rockerverse-paper/ Updated version includes some new paragraphs and corrections throughout the text; full diff available at https://github.com/nuest/rockerverse-paper/compare/preprint.v2...preprint.v3

    MSC Class: 68N01 ACM Class: D.2.6; D.2.7; K.6.3

    Journal ref: The R Journal (2020), 12:1, pages 437-461

  24. arXiv:1910.11475  [pdf, other

    cs.CV

    Heterogeneous Graph Learning for Visual Commonsense Reasoning

    Authors: Weijiang Yu, **gwen Zhou, Weihao Yu, Xiaodan Liang, Nong Xiao

    Abstract: Visual commonsense reasoning task aims at leading the research field into solving cognition-level reasoning with the ability of predicting correct answers and meanwhile providing convincing reasoning paths, resulting in three sub-tasks i.e., Q->A, QA->R and Q->AR. It poses great challenges over the proper semantic alignment between vision and linguistic domains and knowledge reasoning to generate… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: 11 pages, 5 figures

    MSC Class: 68T01

  25. arXiv:1910.01923  [pdf, other

    cs.CV

    Layout-Graph Reasoning for Fashion Landmark Detection

    Authors: Weijiang Yu, Xiaodan Liang, Ke Gong, Chenhan Jiang, Nong Xiao, Liang Lin

    Abstract: Detecting dense landmarks for diverse clothes, as a fundamental technique for clothes analysis, has attracted increasing research attention due to its huge application potential. However, due to the lack of modeling underlying semantic layout constraints among landmarks, prior works often detect ambiguous and structure-inconsistent landmarks of multiple overlapped clothes in one person. In this pa… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 9 pages, 5 figures, CVPR2019

    MSC Class: I.4.9

  26. arXiv:1909.09677  [pdf, other

    cs.CV cs.AI cs.MM

    Gradual Network for Single Image De-raining

    Authors: Zhe Huang, Weijiang Yu, Wayne Zhang, Litong Feng, Nong Xiao

    Abstract: Most advances in single image de-raining meet a key challenge, which is removing rain streaks with different scales and shapes while preserving image details. Existing single image de-raining approaches treat rain-streak removal as a process of pixel-wise regression directly. However, they are lacking in mining the balance between over-de-raining (e.g. removing texture details in rain-free regions… ▽ More

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: In Proceedings of the 27th ACM International Conference on Multimedia (MM 2019)

  27. arXiv:1904.09824  [pdf, other

    cs.CL

    Judging Chemical Reaction Practicality From Positive Sample only Learning

    Authors: Shu Jiang, Zhuosheng Zhang, Hai Zhao, Jiangtong Li, Yang Yang, Bao-Liang Lu, Ning Xia

    Abstract: Chemical reaction practicality is the core task among all symbol intelligence based chemical information processing, for example, it provides indispensable clue for further automatic synthesis route inference. Considering that chemical reactions have been represented in a language form, we propose a new solution to generally judge the practicality of organic reaction without considering complex qu… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

  28. Cross-Modal Attentional Context Learning for RGB-D Object Detection

    Authors: Guanbin Li, Yukang Gan, Hejun Wu, Nong Xiao, Liang Lin

    Abstract: Recognizing objects from simultaneously sensed photometric (RGB) and depth channels is a fundamental yet practical problem in many machine vision applications such as robot gras** and autonomous driving. In this paper, we address this problem by develo** a Cross-Modal Attentional Context (CMAC) learning framework, which enables the full exploitation of the context information from both RGB and… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: Accept as a regular paper to IEEE Transactions on Image Processing

  29. arXiv:1810.10697  [pdf

    cs.NI

    COUSTIC: Combinatorial Double auction for Task Assignment in Device-to-Device Clouds

    Authors: Yutong Zhai, Liusheng Huang, Long Chen, Ning Xiao, Yangyang Geng

    Abstract: With the emerging technologies of Internet of Things (IOTs), the capabilities of mobile devices have increased tremendously. However, in the big data era, to complete tasks on one device is still challenging. As an emerging technology, crowdsourcing utilizing crowds of devices to facilitate large scale sensing tasks has gaining more and more research attention. Most of existing works either assume… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: 17 pages, 7 figures, Accepted by 18th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2018)

  30. arXiv:1612.01057  [pdf, other

    cs.CV

    Learning to Segment Object Candidates via Recursive Neural Networks

    Authors: Tianshui Chen, Liang Lin, Xian Wu, Nong Xiao, Xiaonan Luo

    Abstract: To avoid the exhaustive search over locations and scales, current state-of-the-art object detection systems usually involve a crucial component generating a batch of candidate object proposals from images. In this paper, we present a simple yet effective approach for segmenting object proposals via a deep architecture of recursive neural networks (ReNNs), which hierarchically groups regions for de… ▽ More

    Submitted 28 July, 2018; v1 submitted 3 December, 2016; originally announced December 2016.

    Comments: Accepted at TIP

  31. arXiv:1208.4589  [pdf, other

    cs.GT

    Road Pricing for Spreading Peak Travel: Modeling and Design

    Authors: Tichakorn Wongpiromsarn, Nan Xiao, Keyou You, Kai Sim, Lihua Xie, Emilio Frazzoli, Daniela Rus

    Abstract: A case study of the Singapore road network provides empirical evidence that road pricing can significantly affect commuter trip timing behaviors. In this paper, we propose a model of trip timing decisions that reasonably matches the observed commuters' behaviors. Our model explicitly captures the difference in individuals' sensitivity to price, travel time and early or late arrival at destination.… ▽ More

    Submitted 16 July, 2012; originally announced August 2012.