
Showing 1–50 of 58 results for author: Ni, X

  1. arXiv:2406.01264  [pdf, other]

    cs.CV

    FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis

    Authors: Linshan Wu, Jiaxin Zhuang, Xuefeng Ni, Hao Chen

    Abstract: AI-driven tumor analysis has garnered increasing attention in healthcare. However, its progress is significantly hindered by the lack of annotated tumor cases, which requires radiologists to invest a lot of effort in collecting and annotation. In this paper, we introduce a highly practical solution for robust tumor synthesis and segmentation, termed FreeTumor, which refers to annotation-free synth… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Preprint

  2. arXiv:2405.10251  [pdf, other]

    cs.CL

    A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks

    Authors: Xuanfan Ni, Piji Li

    Abstract: Recent efforts have evaluated large language models (LLMs) in areas such as commonsense reasoning, mathematical reasoning, and code generation. However, to the best of our knowledge, no work has specifically investigated the performance of LLMs in natural language generation (NLG) tasks, a pivotal criterion for determining model excellence. Thus, this paper conducts a comprehensive evaluation of w… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: CCL2023

  3. arXiv:2405.01718  [pdf, other]

    cs.LG math.OC stat.ML

    Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

    Authors: Xinyi Ni, Lifeng Lai

    Abstract: Robust Markov Decision Processes (RMDPs) have received significant research interest, offering an alternative to standard Markov Decision Processes (MDPs) that often assume fixed transition probabilities. RMDPs address this by optimizing for the worst-case scenarios within ambiguity sets. While earlier studies on RMDPs have largely centered on risk-neutral reinforcement learning (RL), with the goa… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2404.12000  [pdf, other]

    cs.SE

    How far are AI-powered programming assistants from meeting developers' needs?

    Authors: Xin Tan, Xiao Long, Xianjun Ni, Yinghao Zhu, Jing Jiang, Li Zhang

    Abstract: Recent In-IDE AI coding assistant tools (ACATs) like GitHub Copilot have significantly impacted developers' coding habits. While some studies have examined their effectiveness, there lacks in-depth investigation into the actual assistance process. To bridge this gap, we simulate real development scenarios encompassing three typical types of software development tasks and recruit 27 computer scienc… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  5. arXiv:2404.08978  [pdf, other]

    cs.LG cs.AI

    Incremental Residual Concept Bottleneck Models

    Authors: Chenming Shang, Shiji Zhou, Hengyuan Zhang, Xinzhe Ni, Yujiu Yang, Yuwang Wang

    Abstract: Concept Bottleneck Models (CBMs) map the black-box visual representations extracted by deep neural networks onto a set of interpretable concepts and use the concepts to make predictions, enhancing the transparency of the decision-making process. Multimodal pre-trained models can match visual representations with textual concept embeddings, allowing for obtaining the interpretable concept bottlenec… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  6. arXiv:2404.05446  [pdf, other]

    cs.CL

    XL$^2$Bench: A Benchmark for Extremely Long Context Understanding with Long-range Dependencies

    Authors: Xuanfan Ni, Hengyi Cai, Xiaochi Wei, Shuaiqiang Wang, Dawei Yin, Piji Li

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across diverse tasks but are constrained by their small context window sizes. Various efforts have been proposed to expand the context window to accommodate even up to 200K input tokens. Meanwhile, building high-quality benchmarks with much longer text lengths and more demanding tasks to provide comprehensive evaluations is of i… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Work in progress

  7. arXiv:2404.01067  [pdf, other]

    cs.CL

    Exploring the Mystery of Influential Data for Mathematical Reasoning

    Authors: Xinzhe Ni, Yeyun Gong, Zhibin Gou, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen

    Abstract: Selecting influential data for fine-tuning on downstream tasks is a key factor for both performance and computation efficiency. Recent works have shown that training with only limited data can show a superior performance on general tasks. However, the feasibility on mathematical reasoning tasks has not been validated. To go further, there exist two open questions for mathematical reasoning: how to… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  8. arXiv:2312.14839  [pdf, other]

    cs.GR

    Simulating Parametric Thin Shells by Bicubic Hermite Elements

    Authors: Xingyu Ni, Xuwen Chen, Cheng Yu, Bin Wang, Baoquan Chen

    Abstract: In this study, we present the bicubic Hermite element method (BHEM), a new computational framework devised for the elastodynamic simulation of parametric thin-shell structures. The BHEM is constructed based on parametric quadrilateral Hermite patches, which serve as a unified representation for shell geometry, simulation, collision avoidance, as well as rendering. Compared with the commonly utiliz… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  9. Demystifying DeFi MEV Activities in Flashbots Bundle

    Authors: Zihao Li, Jianfeng Li, Zheyuan He, Xiapu Luo, Ting Wang, Xiaoze Ni, Wenwu Yang, Xi Chen, Ting Chen

    Abstract: Decentralized Finance, mushrooming in permissionless blockchains, has attracted a recent surge in popularity. Due to the transparency of permissionless blockchains, opportunistic traders can compete to earn revenue by extracting Miner Extractable Value (MEV), which undermines both the consensus security and efficiency of blockchain systems. The Flashbots bundle mechanism further aggravates the MEV… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: This submission serves as our full paper version with the appendix

  10. arXiv:2311.07306  [pdf, other]

    cs.CV

    What Large Language Models Bring to Text-rich VQA?

    Authors: Xuejing Liu, Wei Tang, Xinzhe Ni, Jinghui Lu, Rui Zhao, Zechao Li, Fei Tan

    Abstract: Text-rich VQA, namely Visual Question Answering based on text recognition in the images, is a cross-modal task that requires both image comprehension and text recognition. In this work, we focus on investigating the advantages and bottlenecks of LLM-based approaches in addressing this problem. To address the above concern, we separate the vision and language modules, where we leverage external OCR… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  11. arXiv:2311.01949  [pdf, other]

    cs.CL

    Hint-enhanced In-Context Learning wakes Large Language Models up for knowledge-intensive tasks

    Authors: Yifan Wang, Qingyan Guo, Xinzhe Ni, Chufan Shi, Lemao Liu, Haiyun Jiang, Yujiu Yang

    Abstract: In-context learning (ICL) ability has emerged with the increasing scale of large language models (LLMs), enabling them to learn input-label mappings from demonstrations and perform well on downstream tasks. However, under the standard ICL setting, LLMs may sometimes neglect query-related information in demonstrations, leading to incorrect predictions. To address this limitation, we propose a new p… ▽ More

    Submitted 18 April, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted by ICASSP 2024

  12. arXiv:2310.06613  [pdf, other]

    cs.DC

    BandMap: Application Mapping with Bandwidth Allocation for Coarse-Grained Reconfigurable Array

    Authors: Xiaobing Ni, Jiaheng Ruan, Mengke Ge, Wendi Sun, Song Chen, Yi Kang

    Abstract: This paper proposes an application mapping algorithm, BandMap, for coarse-grained reconfigurable array (CGRA), which allocates the bandwidth in PE array according to the transferring demands of data, especially the data with high spatial reuse, to reduce the routing PEs. To cover bandwidth allocation, BandMap maps the data flow graphs (DFGs), abstracted from applications, by solving the maximum in… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  13. arXiv:2309.13063  [pdf, other]

    cs.IR cs.AI cs.CL

    Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies

    Authors: Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Scott Counts, Sarkar Snigdha Sarathi Das, Ali Montazer, Sathish Manivannan, Jennifer Neville, Xiaochuan Ni, Nagu Rangan, Tara Safavi, Siddharth Suri, Mengting Wan, Leijie Wang, Longqi Yang

    Abstract: Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics.… ▽ More

    Submitted 9 May, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Report number: MSR-TR-2023-32

  14. arXiv:2308.08446  [pdf, other]

    cs.IR cs.LG

    CSPM: A Contrastive Spatiotemporal Preference Model for CTR Prediction in On-Demand Food Delivery Services

    Authors: Guyu Jiang, Xiaoyun Li, Rongrong Jing, Ruoqi Zhao, Xingliang Ni, Guodong Cao, Ning Hu

    Abstract: Click-through rate (CTR) prediction is a crucial task in the context of an online on-demand food delivery (OFD) platform for precisely estimating the probability of a user clicking on food items. Unlike universal e-commerce platforms such as Taobao and Amazon, user behaviors and interests on the OFD platform are more location and time-sensitive due to limited delivery ranges and regional commodity… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  15. arXiv:2307.10168  [pdf, other]

    cs.CL cs.HC

    LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs

    Authors: Tongshuang Wu, Haiyi Zhu, Maya Albayrak, Alexis Axon, Amanda Bertsch, Wenxing Deng, Ziqi Ding, Bill Guo, Sireesh Gururaja, Tzu-Sheng Kuo, Jenny T. Liang, Ryan Liu, Ihita Mandal, Jeremiah Milbauer, Xiaolin Ni, Namrata Padmanabhan, Subhashini Ramkumar, Alexis Sudjianto, Jordan Taylor, Ying-Jui Tseng, Patricia Vaidos, Zhijin Wu, Wei Wu, Chenyang Yang

    Abstract: LLMs have shown promise in replicating human-like behavior in crowdsourcing tasks that were previously thought to be exclusive to human abilities. However, current efforts focus mainly on simple atomic tasks. We explore whether LLMs can replicate more complex crowdsourcing pipelines. We find that modern LLMs can simulate some of crowdworkers' abilities in these "human computation algorithms," but… ▽ More

    Submitted 19 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  16. arXiv:2306.00190  [pdf, other]

    cs.HC

    Contextualizing Problems to Student Interests at Scale in Intelligent Tutoring System Using Large Language Models

    Authors: Gautam Yadav, Ying-Jui Tseng, Xiaolin Ni

    Abstract: Contextualizing problems to align with student interests can significantly improve learning outcomes. However, this task often presents scalability challenges due to resource and time constraints. Recent advancements in Large Language Models (LLMs) like GPT-4 offer potential solutions to these issues. This study explores the ability of GPT-4 in the contextualization of problems within CTAT, an int… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  17. arXiv:2305.14304  [pdf, other]

    quant-ph cs.AR

    A Classical Architecture For Digital Quantum Computers

    Authors: Fang Zhang, Xing Zhu, Rui Chao, Cupjin Huang, Linghang Kong, Guoyang Chen, Dawei Ding, Haishan Feng, Yihuai Gao, Xiaotong Ni, Liwei Qiu, Zhe Wei, Yueming Yang, Yang Zhao, Yaoyun Shi, Weifeng Zhang, Peng Zhou, Jianxin Chen

    Abstract: Scaling bottlenecks the making of digital quantum computers, posing challenges from both the quantum and the classical components. We present a classical architecture to cope with a comprehensive list of the latter challenges all at once, and implement it fully in an end-to-end system by integrating a multi-core RISC-V CPU with our in-house control electronics. Our architecture enables sca… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 12 pages, 12 figures

  18. DesignTracking: Track and Replay BIM-based Design Process

    Authors: Xiang-Rui Ni, Zhe Zheng, Jia-Rui Lin, Zhen-Zhong Hu, Xin Zhang

    Abstract: Among different phases of the life cycle of a building or facility, design is of the utmost importance to ensure safety, efficiency and sustainability of the building or facility. How to control and improve design quality and efficiency has been explored for years, and more studies emerged with the popularization of Building Information Modelling (BIM). However, most of them focused on the extract… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Journal ref: Creative Construction Conference 2023

  19. arXiv:2303.14956  [pdf, other]

    cs.CL

    Unified Text Structuralization with Instruction-tuned Language Models

    Authors: Xuanfan Ni, Piji Li, Huayang Li

    Abstract: Text structuralization is one of the important fields of natural language processing (NLP) consists of information extraction (IE) and structure formalization. However, current studies of text structuralization suffer from a shortage of manually annotated high-quality datasets from different domains and languages, which require specialized professional knowledge. In addition, most IE methods are d… ▽ More

    Submitted 30 March, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 13 pages, 5 figures

  20. arXiv:2302.14012  [pdf]

    quant-ph cs.ET

    Drone-based quantum key distribution

    Authors: Xiao-Hui Tian, Ran Yang, Ji-Ning Zhang, Hua Yu, Yao Zhang, Pengfei Fan, Mengwen Chen, Changsheng Gu, Xin Ni, Mingzhe Hu, Xun Cao, Xiaopeng Hu, Gang Zhao, Yan-Qing Lu, Zhi-Jun Yin, Hua-Ying Liu, Yan-Xiao Gong, Zhenda Xie, Shi-Ning Zhu

    Abstract: Drone-based quantum link has the potential to realize mobile quantum network, and entanglement distribution has been demonstrated using one and two drones. Here we report the first drone-based quantum key distribution (QKD), with average secure key rate larger than 8 kHz using decoy-state BB84 protocol with polarization coding. Compact acquisition, pointing, and tracking (APT) system and QKD modul… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  21. arXiv:2301.04748  [pdf, other]

    cs.CV

    LSDM: Long-Short Diffeomorphic Motion for Weakly-Supervised Ultrasound Landmark Tracking

    Authors: Zhihua Liu, Bin Yang, Yan Shen, Xuejun Ni, Huiyu Zhou

    Abstract: Accurate tracking of an anatomical landmark over time has been of high interests for disease assessment such as minimally invasive surgery and tumor radiation therapy. Ultrasound imaging is a promising modality benefiting from low-cost and real-time acquisition. However, generating a precise landmark tracklet is very challenging, as attempts can be easily distorted by different interference such a… ▽ More

    Submitted 31 January, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

  22. arXiv:2212.04873  [pdf, other]

    cs.CV cs.AI

    Multimodal Prototype-Enhanced Network for Few-Shot Action Recognition

    Authors: Xinzhe Ni, Yong Liu, Hao Wen, Yatai Ji, Jing Xiao, Yujiu Yang

    Abstract: Current methods for few-shot action recognition mainly fall into the metric learning framework following ProtoNet, which demonstrates the importance of prototypes. Although they achieve relatively good performance, the effect of multimodal information is ignored, e.g. label texts. In this work, we propose a novel MultimOdal PRototype-ENhanced Network (MORN), which uses the semantic information of… ▽ More

    Submitted 21 May, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Accepted by ICMR 2024 (oral)

  23. A Random Forest and Current Fault Texture Feature-Based Method for Current Sensor Fault Diagnosis in Three-Phase PWM VSR

    Authors: Lei Kou, Xiao-dong Gong, Yi Zheng, Xiu-hui Ni, Yang Li, Quan-de Yuan, Ya-nan Dong

    Abstract: Three-phase PWM voltage-source rectifier (VSR) systems have been widely used in various energy conversion systems, where current sensors are the key component for state monitoring and system control. The current sensor faults may bring hidden danger or damage to the whole system; therefore, this paper proposed a random forest (RF) and current fault texture feature-based method for current sensor f… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Frontiers in Energy Research

    MSC Class: 68Q04

    ACM Class: I.2

  24. arXiv:2210.01689  [pdf]

    cs.CV

    Vision-based Warning System for Maintenance Personnel on Short-Term Roadwork Site

    Authors: Xiao Ni, Walpola Layantha Perera, Carsten Kühnel, Christian Vollrath

    Abstract: We propose a vision-based warning system for the maintenance personnel working on short-term construction sites. Traditional solutions use passive protection, like setting up traffic cones, safety beacons, or even nothing. However, such methods cannot function as physical safety barriers to separate working areas from used lanes. In contrast, our system provides active protection, leveraging acous… ▽ More

    Submitted 20 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

  25. arXiv:2206.01833  [pdf, other]

    cs.RO cs.MA eess.SY

    Leveraging Heterogeneous Capabilities in Multi-Agent Systems for Environmental Conflict Resolution

    Authors: Michael Enqi Cao, Jonas Warnke, Yunhai Han, Xinpei Ni, Ye Zhao, Samuel Coogan

    Abstract: In this paper, we introduce a high-level controller synthesis framework that enables teams of heterogeneous agents to assist each other in resolving environmental conflicts that appear at runtime. This conflict resolution method is built upon temporal-logic-based reactive synthesis to guarantee safety and task completion under specific environment assumptions. In heterogeneous multi-agent systems,… ▽ More

    Submitted 1 September, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Submitted to The International Symposium on Safety, Security, and Rescue Robotics (SSRR) 2022

  26. Integrating Quantum Processor Device and Control Optimization in a Gradient-based Framework

    Authors: Xiaotong Ni, Hui-Hai Zhao, Lei Wang, Feng Wu, Jianxin Chen

    Abstract: In a quantum processor, the device design and external controls together contribute to the quality of the target quantum operations. As we continuously seek better alternative qubit platforms, we explore the increasingly large device and control design space. Thus, optimization becomes more and more challenging. In this work, we demonstrate that the figure of merit reflecting a design goal can be… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Journal ref: npj Quantum Information volume 8, 106 (2022)

  27. arXiv:2110.09005  [pdf, other]

    eess.SP cs.LG

    Unsupervised Learned Kalman Filtering

    Authors: Guy Revach, Nir Shlezinger, Timur Locher, Xiaoyong Ni, Ruud J. G. van Sloun, Yonina C. Eldar

    Abstract: In this paper we adapt KalmanNet, which is a recently proposed deep neural network (DNN)-aided system whose architecture follows the operation of the model-based Kalman filter (KF), to learn its mapping in an unsupervised manner, i.e., without requiring ground-truth states. The unsupervised adaptation is achieved by exploiting the hybrid model-based/data-driven architecture of KalmanNet, which in… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 5 Pages, 5 Figures, Submitted to ICASSP 2022

  28. arXiv:2109.10393  [pdf, other]

    cs.CV cs.LG

    Towards a Real-Time Facial Analysis System

    Authors: Bishwo Adhikari, Xingyang Ni, Esa Rahtu, Heikki Huttunen

    Abstract: Facial analysis is an active research area in computer vision, with many practical applications. Most of the existing studies focus on addressing one specific task and maximizing its performance. For a complete facial analysis system, one needs to solve these tasks efficiently to ensure a smooth experience. In this work, we present a system-level design of a real-time facial analysis system. With… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Accepted in IEEE MMSP 2021

  29. arXiv:2108.07147  [pdf, other]

    cs.CV

    On the Importance of Encrypting Deep Features

    Authors: Xingyang Ni, Heikki Huttunen, Esa Rahtu

    Abstract: In this study, we analyze model inversion attacks with only two assumptions: feature vectors of user data are known, and a black-box API for inference is provided. On the one hand, limitations of existing studies are addressed by opting for a more practical setting. Experiments have been conducted on state-of-the-art models in person re-identification, and two attack scenarios (i.e., recognizing a… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: First Version

  30. arXiv:2108.04089  [pdf, other]

    cs.NI

    A Self-Configurable Grouping Method for Integrated Wi-SUN FAN and TSCH-based Networks

    Authors: Xinyu Ni, Michael Baddeley, Nan Jiang, Yichao Jin

    Abstract: Recent applications in large-scale wireless mesh networks (WSN), e.g., Advanced Metering Infrastructure (AMI) scenarios, expect to support an extended number of nodes with higher throughput, which cannot be sufficiently supported by the current WSN protocols. Two prior protocols, Wi-SUN Field Area Network (Wi-SUN FAN) and IETF 6TiSCH standards, are popularly used that are respectively based on asy… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  31. arXiv:2108.01298  [pdf, other]

    cs.AR

    Synthesizing Brain-Network-Inspired Interconnections for Large-Scale Network-on-Chips

    Authors: Mengke Ge, Xiaobing Ni, Qi Xu, Song Chen, Jinglei Huang, Yi Kang, Feng Wu

    Abstract: Brain network is a large-scale complex network with scale-free, small-world, and modularity properties, which largely supports this high-efficiency massive system. In this paper, we propose to synthesize brain-network-inspired interconnections for large-scale network-on-chips. Firstly, we propose a method to generate brain-network-inspired topologies with limited scale-free and power-law small-wor… ▽ More

    Submitted 26 August, 2021; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: 19 pages, 15 figures, 8 tables, accepted by ACM TODAES

  32. arXiv:2107.10043  [pdf, other]

    eess.SP cs.LG stat.ML

    KalmanNet: Neural Network Aided Kalman Filtering for Partially Known Dynamics

    Authors: Guy Revach, Nir Shlezinger, Xiaoyong Ni, Adria Lopez Escoriza, Ruud J. G. van Sloun, Yonina C. Eldar

    Abstract: State estimation of dynamical systems in real-time is a fundamental task in signal processing. For systems that are well-represented by a fully known linear Gaussian state space (SS) model, the celebrated Kalman filter (KF) is a low complexity optimal solution. However, both linearity of the underlying SS model and accurate knowledge of it are often not encountered in practice. Here, we present Ka… ▽ More

    Submitted 10 March, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in IEEE Transactions on Signal Processing - TSP

  33. arXiv:2106.11593  [pdf, other]

    cs.LG cs.AI

    A Vertical Federated Learning Framework for Graph Convolutional Network

    Authors: Xiang Ni, Xiaolong Xu, Lingjuan Lyu, Changhua Meng, Weiqiang Wang

    Abstract: Recently, Graph Neural Network (GNN) has achieved remarkable success in various real-world problems on graph data. However in most industries, data exists in the form of isolated islands and the data privacy and security is also an important issue. In this paper, we propose FedVGCN, a federated GCN learning paradigm for privacy-preserving node classification task under data vertically partitioned… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

  34. arXiv:2105.05639  [pdf, other]

    cs.CV

    FlipReID: Closing the Gap between Training and Inference in Person Re-Identification

    Authors: Xingyang Ni, Esa Rahtu

    Abstract: Since neural networks are data-hungry, incorporating data augmentation in training is a widely adopted technique that enlarges datasets and improves generalization. On the other hand, aggregating predictions of multiple augmented samples (i.e., test-time augmentation) could boost performance even further. In the context of person re-identification models, it is common practice to extract embedding… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: First Version

  35. arXiv:2104.11645  [pdf, other]

    cs.NI cs.LG

    Software-Defined Edge Computing: A New Architecture Paradigm to Support IoT Data Analysis

    Authors: Di Wu, Xiaofeng Xie, Xiang Ni, Bin Fu, Hanhui Deng, Haibo Zeng, Zhijin Qin

    Abstract: The rapid deployment of Internet of Things (IoT) applications leads to massive data that need to be processed. These IoT applications have specific communication requirements on latency and bandwidth, and present new features on their generated data such as time-dependency. Therefore, it is desirable to reshape the current IoT architectures by exploring their inherent nature of communication and c… ▽ More

    Submitted 25 April, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

  36. arXiv:2102.08465  [pdf, other]

    cs.SI cs.DL cs.IR cs.LG

    Prioritizing Original News on Facebook

    Authors: Xiuyan Ni, Shujian Bu, Igor L. Markov

    Abstract: This work outlines how we prioritize original news, a critical indicator of news quality. By examining the landscape and life-cycle of news posts on our social media platform, we identify challenges of building and deploying an originality score. We pursue an approach based on normalized PageRank values and three-step clustering, and refresh the score on an hourly basis to capture the dynamics of… ▽ More

    Submitted 14 March, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 9 pages, 8 figures, 6 tables, 2 algorithm pseudocodes

    Journal ref: CIKM 2021

  37. arXiv:2101.06907  [pdf, other]

    cs.IT

    Quartic Perturbation-based Outage-constrained Robust Design in Two-hop One-way Relay Networks

    Authors: Sissi Xiaoxiao Wu, Sherry Xue-Ying Ni, Jiaying Li, Anthony Man-Cho So

    Abstract: In this work, we study a classic robust design problem in two-hop one-way relay system. We are particularly interested in the scenario where channel uncertainty exists in both the transmitter-to-relay and relay-to-receiver links. By considering the problem design that minimizes the average amplify-and-forward power budget at the relay side while satisfying SNR outage requirements, an outage-constr… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  38. arXiv:2012.13099  [pdf, other]

    cs.LG cs.AI

    Cooperative Policy Learning with Pre-trained Heterogeneous Observation Representations

    Authors: Wenlei Shi, Xinran Wei, Jia Zhang, Xiaoyuan Ni, Arthur Jiang, Jiang Bian, Tie-Yan Liu

    Abstract: Multi-agent reinforcement learning (MARL) has been increasingly explored to learn the cooperative policy towards maximizing a certain global reward. Many existing studies take advantage of graph neural networks (GNN) in MARL to propagate critical collaborative information over the interaction graph, built upon inter-connected agents. Nevertheless, the vanilla GNN approach yields substantial defect… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

    Comments: accepted as an oral paper in AAMAS 2021

  39. arXiv:2011.08410  [pdf, other]

    cs.CV

    Semi-Supervised Few-Shot Atomic Action Recognition

    Authors: Xiaoyuan Ni, Sizhe Song, Yu-Wing Tai, Chi-Keung Tang

    Abstract: Despite excellent progress has been made, the performance on action recognition still heavily relies on specific datasets, which are difficult to extend new action classes due to labor-intensive labeling. Moreover, the high diversity in Spatio-temporal appearance requires robust and representative action feature aggregation and attention. To address the above issues, we focus on atomic actions and… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: 7 pages, 3 figures, 2 tables

  40. Developing and Improving Risk Models using Machine-learning Based Algorithms

    Authors: Yan Wang, Xuelei Sherry Ni

    Abstract: The objective of this study is to develop a good risk model for classifying business delinquency by simultaneously exploring several machine learning based methods including regularization, hyper-parameter optimization, and model ensembling algorithms. The rationale under the analyses is firstly to obtain good base binary classifiers (include Logistic Regression ($LR$), K-Nearest Neighbors ($KNN$)… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

  41. Improving Investment Suggestions for Peer-to-Peer (P2P) Lending via Integrating Credit Scoring into Profit Scoring

    Authors: Yan Wang, Xuelei Sherry Ni

    Abstract: In the peer-to-peer (P2P) lending market, lenders lend the money to the borrowers through a virtual platform and earn the possible profit generated by the interest rate. From the perspective of lenders, they want to maximize the profit while minimizing the risk. Therefore, many studies have used machine learning algorithms to help the lenders identify the "best" loans for making investments. The s… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

  42. arXiv:2007.07875  [pdf, other]

    cs.CV

    Adaptive L2 Regularization in Person Re-Identification

    Authors: Xingyang Ni, Liang Fang, Heikki Huttunen

    Abstract: We introduce an adaptive L2 regularization mechanism in the setting of person re-identification. In the literature, it is common practice to utilize hand-picked regularization factors which remain constant throughout the training procedure. Unlike existing approaches, the regularization factors in our proposed method are updated adaptively through backpropagation. This is achieved by incorporating… ▽ More

    Submitted 18 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Accepted at ICPR 2020

  43. Vehicle Attribute Recognition by Appearance: Computer Vision Methods for Vehicle Type, Make and Model Classification

    Authors: Xingyang Ni, Heikki Huttunen

    Abstract: This paper studies vehicle attribute recognition by appearance. In the literature, image-based target recognition has been extensively investigated in many use cases, such as facial recognition, but less so in the field of vehicle attribute recognition. We survey a number of algorithms that identify vehicle properties ranging from coarse-grained level (vehicle type) to fine-grained level (vehicle… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Published in Journal of Signal Processing Systems

  44. arXiv:1911.08517  [pdf, other]

    cs.LG cs.DC eess.SP

    Generalizable Resource Allocation in Stream Processing via Deep Reinforcement Learning

    Authors: Xiang Ni, Jing Li, Mo Yu, Wang Zhou, Kun-Lung Wu

    Abstract: This paper considers the problem of resource allocation in stream processing, where continuous data flows must be processed in real time in a large distributed system. To maximize system throughput, the resource allocation strategy that partitions the computation tasks of a stream processing graph onto computing devices must simultaneously balance workload distribution and minimize communication.…

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  45. arXiv:1903.05535  [pdf]

    stat.ML cs.LG

    Predicting class-imbalanced business risk using resampling, regularization, and model ensembling algorithms

    Authors: Yan Wang, Xuelei Sherry Ni

    Abstract: We aim at developing and improving the imbalanced business risk modeling via jointly using proper evaluation criteria, resampling, cross-validation, classifier regularization, and ensembling techniques. Area Under the Receiver Operating Characteristic Curve (AUC of ROC) is used for model comparison based on 10-fold cross validation. Two undersampling strategies including random undersampling (RUS)…

    Submitted 13 March, 2019; originally announced March 2019.

    Journal ref: International Journal of Managing Information Technology (IJIMIT) Vol. 11, No. 1, February 2019

  46. arXiv:1902.04954  [pdf, other]

    cs.LG q-fin.GN stat.ML

    Risk Prediction of Peer-to-Peer Lending Market by a LSTM Model with Macroeconomic Factor

    Authors: Yan Wang, Xuelei Sherry Ni

    Abstract: In the peer to peer (P2P) lending platform, investors hope to maximize their return while minimizing the risk through a comprehensive understanding of the P2P market. A low and stable average default rate across all the borrowers denotes a healthy P2P market and provides investors more confidence in a promising investment. Therefore, having a powerful model to describe the trend of the default rat…

    Submitted 9 September, 2020; v1 submitted 13 February, 2019; originally announced February 2019.

  47. arXiv:1901.08433  [pdf]

    stat.ML cs.LG

    A XGBoost risk model via feature selection and Bayesian hyper-parameter optimization

    Authors: Yan Wang, Xuelei Sherry Ni

    Abstract: This paper aims to explore models based on the extreme gradient boosting (XGBoost) approach for business risk classification. Feature selection (FS) algorithms and hyper-parameter optimizations are simultaneously considered during model training. The five most commonly used FS methods including weight by Gini, weight by Chi-square, hierarchical variable clustering, weight by correlation, and weigh…

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Accepted by International Journal of Database Management Systems (IJDMS)

  48. arXiv:1901.00251  [pdf, other]

    stat.ML cs.LG

    An Automatic Interaction Detection Hybrid Model for Bankcard Response Classification

    Authors: Yan Wang, Xuelei Sherry Ni, Brian Stone

    Abstract: In this paper, we propose a hybrid bankcard response model, which integrates decision tree based chi-square automatic interaction detection (CHAID) into logistic regression. In the first stage of the hybrid model, CHAID analysis is used to detect the possibly potential variable interactions. Then in the second stage, these potential interactions are served as the additional input variables in logi…

    Submitted 1 January, 2019; originally announced January 2019.

    Journal ref: The 2018 5th International Conference on Systems and Informatics (ICSAI2018)

  49. arXiv:1812.02546  [pdf]

    stat.ML cs.LG

    A two-stage hybrid model by using artificial neural networks as feature construction algorithms

    Authors: Yan Wang, Xuelei Sherry Ni, Brian Stone

    Abstract: We propose a two-stage hybrid approach with neural networks as the new feature construction algorithms for bankcard response classifications. The hybrid model uses a very simple neural network structure as the new feature construction tool in the first stage, then the newly created features are used as the additional input variables in logistic regression in the second stage. The model is compared…

    Submitted 6 December, 2018; originally announced December 2018.

  50. Neural Network Decoders for Large-Distance 2D Toric Codes

    Authors: Xiaotong Ni

    Abstract: We still do not have perfect decoders for topological codes that can satisfy all needs of different experimental setups. Recently, a few neural network based decoders have been studied, with the motivation that they can adapt to a wide range of noise models, and can easily run on dedicated chips without a full-fledged computer. The latter feature might lead to fast speed and the ability to operate…

    Submitted 7 April, 2020; v1 submitted 18 September, 2018; originally announced September 2018.

    Journal ref: Quantum 4, 310 (2020)