Skip to main content

Showing 1–50 of 144 results for author: Feng, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00451  [pdf, other

    cs.RO

    Language-Guided Object-Centric Diffusion Policy for Collision-Aware Robotic Manipulation

    Authors: Hang Li, Qian Feng, Zhi Zheng, Jianxiang Feng, Alois Knoll

    Abstract: Learning from demonstrations faces challenges in generalizing beyond the training data and is fragile even to slight visual variations. To tackle this problem, we introduce Lan-o3dp, a language guided object centric diffusion policy that takes 3d representation of task relevant objects as conditional input and can be guided by cost function for safety constraints at inference time. Lan-o3dp enable… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 11 pages, 8 figures

  2. arXiv:2406.14080  [pdf, other

    cs.CV cs.GR

    CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification

    Authors: Faxu Guo, Quan Feng, Sen Yang, Wanxia Yang

    Abstract: Hyperspectral remote sensing (HIS) enables the detailed capture of spectral information from the Earth's surface, facilitating precise classification and identification of surface crops due to its superior spectral diagnostic capabilities. However, current convolutional neural networks (CNNs) focus on local features in hyperspectral data, leading to suboptimal performance when classifying intricat… ▽ More

    Submitted 20 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 15 pages, 11figures

    ACM Class: I.4.6

  3. arXiv:2406.06802  [pdf, other

    stat.ML cs.LG

    Satisficing Exploration in Bandit Optimization

    Authors: Qing Feng, Tianyi Ma, Ruihao Zhu

    Abstract: Motivated by the concept of satisficing in decision-making, we consider the problem of satisficing exploration in bandit optimization. In this setting, the learner aims at selecting satisficing arms (arms with mean reward exceeding a certain threshold value) as frequently as possible. The performance is measured by satisficing regret, which is the cumulative deficit of the chosen arm's mean reward… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.00430  [pdf, other

    cs.RO cs.AI

    Evaluating Uncertainty-based Failure Detection for Closed-Loop LLM Planners

    Authors: Zhi Zheng, Qian Feng, Hang Li, Alois Knoll, Jianxiang Feng

    Abstract: Recently, Large Language Models (LLMs) have witnessed remarkable performance as zero-shot task planners for robotic manipulation tasks. However, the open-loop nature of previous works makes LLM-based planning error-prone and fragile. On the other hand, failure detection approaches for closed-loop planning are often limited by task-specific heuristics or following an unrealistic assumption that the… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted at ICRA 2024 Workshop on Back to the Future: Robot Learning Going Probabilistic. Website: https://sites.google.com/view/konwloop/home

  5. arXiv:2405.17741  [pdf, other

    cs.AI

    LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design

    Authors: Rui Kong, Qiyang Li, Xinyu Fang, Qingtian Feng, Qingfeng He, Yazhu Dong, Weijun Wang, Yuanchun Li, Linghe Kong, Yunxin Liu

    Abstract: Recent literature has found that an effective method to customize or further improve large language models (LLMs) is to add dynamic adapters, such as low-rank adapters (LoRA) with Mixture-of-Experts (MoE) structures. Though such dynamic adapters incur modest computational complexity, they surprisingly lead to huge inference latency overhead, slowing down the decoding speed by 2.5+ times. In this p… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.16928  [pdf

    cs.SI cs.GT

    TopoLa: a novel embedding framework for understanding complex networks

    Authors: Kai Zheng, Qilong Feng, Yaohang Li, Qichang Zhao, **hui Xu, Jianxin Wang

    Abstract: Complex networks, which are the abstractions of many real-world systems, present a persistent challenge across disciplines for people to decipher their underlying information. Recently, hyperbolic geometry of latent spaces has gained traction in network analysis, due to its ability to preserve certain local intrinsic properties of the nodes. In this study, we explore the problem from a much broade… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 85 pages, 17 figures

  7. arXiv:2405.07839  [pdf, other

    cs.LG cs.AI stat.ML

    Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

    Authors: Haoyang Zheng, Hengrong Du, Qi Feng, Wei Deng, Guang Lin

    Abstract: Replica exchange stochastic gradient Langevin dynamics (reSGLD) is an effective sampler for non-convex learning in large-scale datasets. However, the simulation may encounter stagnation issues when the high-temperature chain delves too deeply into the distribution tails. To tackle this issue, we propose reflected reSGLD (r2SGLD): an algorithm tailored for constrained non-convex exploration by util… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 28 pages, 13 figures

  8. arXiv:2405.04602  [pdf, other

    cs.SE

    An Empirical Study of Kotlin-Java Interactions

    Authors: Qiong Feng, Huan Ji, Xiaotian Ma, Peng Liang

    Abstract: Background: Since Google introduced Kotlin as an official programming language for develo** Android apps in 2017, Kotlin has gained widespread adoption in Android development. The interoperability of Java and Kotlin's design nature allows them to coexist and interact with each other smoothly within a project. Aims: However, there is limited research on how Java and Kotlin interact with each othe… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  9. arXiv:2405.00739  [pdf, other

    cs.LG cs.CV eess.IV

    Why does Knowledge Distillation Work? Rethink its Attention and Fidelity Mechanism

    Authors: Chenqi Guo, Shiwei Zhong, Xiaofeng Liu, Qianli Feng, Yinglong Ma

    Abstract: Does Knowledge Distillation (KD) really work? Conventional wisdom viewed it as a knowledge transfer procedure where a perfect mimicry of the student to its teacher is desired. However, paradoxical studies indicate that closely replicating the teacher's behavior does not consistently improve student generalization, posing questions on its possible causes. Confronted with this gap, we hypothesize th… ▽ More

    Submitted 29 April, 2024; originally announced May 2024.

  10. arXiv:2404.11947  [pdf, other

    cs.LG cs.CV

    VCC-INFUSE: Towards Accurate and Efficient Selection of Unlabeled Examples in Semi-supervised Learning

    Authors: Shijie Fang, Qianhan Feng, Tong Lin

    Abstract: Despite the progress of Semi-supervised Learning (SSL), existing methods fail to utilize unlabeled data effectively and efficiently. Many pseudo-label-based methods select unlabeled examples based on inaccurate confidence scores from the classifier. Most prior work also uses all available unlabeled data without pruning, making it difficult to handle large amounts of unlabeled data. To address thes… ▽ More

    Submitted 21 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted paper of IJCAI 2024. Shijie Fang and Qianhan Feng contributed equally to this paper. New version, some problems and typos are fixed

  11. arXiv:2404.01941  [pdf, other

    cs.CV

    LPSNet: End-to-End Human Pose and Shape Estimation with Lensless Imaging

    Authors: Haoyang Ge, Qiao Feng, Hailong Jia, Xiongzheng Li, Xiangjun Yin, You Zhou, **gyu Yang, Kun Li

    Abstract: Human pose and shape (HPS) estimation with lensless imaging is not only beneficial to privacy protection but also can be used in covert surveillance scenarios due to the small size and simple structure of this device. However, this task presents significant challenges due to the inherent ambiguity of the captured measurements and lacks effective methods for directly estimating human pose and shape… ▽ More

    Submitted 8 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. More results available at https://cic.tju.edu.cn/faculty/likun/projects/LPSNet

  12. arXiv:2403.19983  [pdf, other

    eess.IV cs.CV

    A multi-stage semi-supervised learning for ankle fracture classification on CT images

    Authors: Hongzhi Liu, Guicheng Li, Jiacheng Nie, Hui Tang, Chunfeng Yang, Qian** Feng, Hailin Xu, Yang Chen

    Abstract: Because of the complicated mechanism of ankle injury, it is very difficult to diagnose ankle fracture in clinic. In order to simplify the process of fracture diagnosis, an automatic diagnosis model of ankle fracture was proposed. Firstly, a tibia-fibula segmentation network is proposed for the joint tibiofibular region of the ankle joint, and the corresponding segmentation dataset is established o… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  13. arXiv:2403.12986  [pdf, other

    cs.CV cs.LG

    BaCon: Boosting Imbalanced Semi-supervised Learning via Balanced Feature-Level Contrastive Learning

    Authors: Qianhan Feng, Lu**g Xie, Shijie Fang, Tong Lin

    Abstract: Semi-supervised Learning (SSL) reduces the need for extensive annotations in deep learning, but the more realistic challenge of imbalanced data distribution in SSL remains largely unexplored. In Class Imbalanced Semi-supervised Learning (CISSL), the bias introduced by unreliable pseudo-labels can be exacerbated by imbalanced data distributions. Most existing methods address this issue at instance-… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accpeted paper of AAAI2024

  14. arXiv:2403.10616  [pdf, other

    cs.LG cs.CL

    DiPaCo: Distributed Path Composition

    Authors: Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Adhiguna Kuncoro, Yani Donchev, Rachita Chhaparia, Ionel Gog, Marc'Aurelio Ranzato, Jiajun Shen, Arthur Szlam

    Abstract: Progress in machine learning (ML) has been fueled by scaling neural network models. This scaling has been enabled by ever more heroic feats of engineering, necessary for accommodating ML approaches that require high bandwidth communication between devices working in parallel. In this work, we propose a co-designed modular architecture and training approach for ML models, dubbed DIstributed PAth CO… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  15. arXiv:2403.10242  [pdf, other

    cs.CV

    FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model

    Authors: Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang

    Abstract: Reconstructing detailed 3D objects from single-view images remains a challenging task due to the limited information available. In this paper, we introduce FDGaussian, a novel two-stage framework for single-image 3D reconstruction. Recent methods typically utilize pre-trained 2D diffusion models to generate plausible novel views from the input image, yet they encounter issues with either multi-vie… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  16. arXiv:2403.09143  [pdf, other

    cs.GR

    A New Split Algorithm for 3D Gaussian Splatting

    Authors: Qiyuan Feng, Gengchen Cao, Haoxiang Chen, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu

    Abstract: 3D Gaussian splatting models, as a novel explicit 3D representation, have been applied in many domains recently, such as explicit geometric editing and geometry generation. Progress has been rapid. However, due to their mixed scales and cluttered shapes, 3D Gaussian splatting models can produce a blurred or needle-like effect near the surface. At the same time, 3D Gaussian splatting models tend to… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 10 figures

  17. arXiv:2403.00239  [pdf, other

    cs.CR

    OPAF: Optimized Secure Two-Party Computation Protocols for Nonlinear Activation Functions in Recurrent Neural Network

    Authors: Qian Feng, Zhihua Xia, Zhifeng Xu, Jiasi Weng, Jian Weng

    Abstract: Deep neural network (DNN) typically involves convolutions, pooling, and activation function. Due to the growing concern about privacy, privacy-preserving DNN becomes a hot research topic. Generally, the convolution and pooling operations can be supported by additive homomorphic and secure comparison, but the secure implementation of activation functions is not so straightforward for the requiremen… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  18. arXiv:2402.18180  [pdf, other

    cs.CY

    Human Simulacra: Benchmarking the Personification of Large Language Models

    Authors: Qiuejie Xie, Qiming Feng, Tianqi Zhang, Qingqiu Li, Linyi Yang, Yuejie Zhang, Rui Feng, Liang He, Shang Gao, Yue Zhang

    Abstract: Large language models (LLMs) are recognized as systems that closely mimic aspects of human intelligence. This capability has attracted attention from the social science community, who see the potential in leveraging LLMs to replace human participants in experiments, thereby reducing research costs and complexity. In this paper, we introduce a framework for large language models personification, in… ▽ More

    Submitted 9 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  19. arXiv:2402.01036  [pdf, other

    math.PR cs.LG stat.ML

    Fisher information dissipation for time inhomogeneous stochastic differential equations

    Authors: Qi Feng, Xinzhe Zuo, Wuchen Li

    Abstract: We provide a Lyapunov convergence analysis for time-inhomogeneous variable coefficient stochastic differential equations (SDEs). Three typical examples include overdamped, irreversible drift, and underdamped Langevin dynamics. We first formula the probability transition equation of Langevin dynamics as a modified gradient flow of the Kullback-Leibler divergence in the probability space with respec… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 9 figures, 36 pages

  20. arXiv:2401.16865  [pdf, other

    cs.SE

    Depends-Kotlin: A Cross-Language Kotlin Dependency Extractor

    Authors: Qiong Feng, Xiaotian Ma, Huan Ji, Peng Liang

    Abstract: Since Google introduced Kotlin as an official programming language for develo** Android apps in 2017, Kotlin has gained widespread adoption in Android development. However, compared to Java, there is limited support for Kotlin code dependency analysis, which is the foundation to software analysis. To bridge this gap, we developed Depends-Kotlin to extract entities and their dependencies in Kotli… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  21. arXiv:2401.14176  [pdf, other

    cs.SE cs.AI

    Copilot Refinement: Addressing Code Smells in Copilot-Generated Python Code

    Authors: Beiqi Zhang, Peng Liang, Qiong Feng, Yujia Fu, Zengyang Li

    Abstract: As one of the most popular dynamic languages, Python experiences a decrease in readability and maintainability when code smells are present. Recent advancements in Large Language Models have sparked growing interest in AI-enabled tools for both code generation and refactoring. GitHub Copilot is one such tool that has gained widespread usage. Copilot Chat, released on September 2023, functions as a… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  22. arXiv:2401.03228  [pdf, other

    stat.ML cs.LG

    Reflected Schrödinger Bridge for Constrained Generative Modeling

    Authors: Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, Ricky T. Q. Chen

    Abstract: Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  23. arXiv:2312.17210  [pdf, other

    stat.ML cs.AI cs.LG

    Continual Learning via Sequential Function-Space Variational Inference

    Authors: Tim G. J. Rudner, Freddie Bickford Smith, Qixuan Feng, Yee Whye Teh, Yarin Gal

    Abstract: Sequential Bayesian inference over predictive functions is a natural framework for continual learning from streams of data. However, applying it to neural networks has proved challenging in practice. Addressing the drawbacks of existing techniques, we propose an optimization objective derived by formulating continual learning as sequential function-space variational inference. In contrast to exist… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Published in Proceedings of the 39th International Conference on Machine Learning (ICML 2022)

  24. arXiv:2312.09743  [pdf, other

    cs.CV cs.GR

    SLS4D: Sparse Latent Space for 4D Novel View Synthesis

    Authors: Qi-Yuan Feng, Hao-Xiang Chen, Qun-Ce Xu, Tai-Jiang Mu

    Abstract: Neural radiance field (NeRF) has achieved great success in novel view synthesis and 3D representation for static scenarios. Existing dynamic NeRFs usually exploit a locally dense grid to fit the deformation field; however, they fail to capture the global dynamics and concomitantly yield models of heavy parameters. We observe that the 4D space is inherently sparse. Firstly, the deformation field is… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 10 pages, 6 figures

  25. arXiv:2312.08591  [pdf, other

    cs.CV

    Joint2Human: High-quality 3D Human Generation via Compact Spherical Embedding of 3D Joints

    Authors: Muxin Zhang, Qiao Feng, Zhuo Su, Chao Wen, Zhou Xue, Kun Li

    Abstract: 3D human generation is increasingly significant in various applications. However, the direct use of 2D generative methods in 3D generation often results in losing local details, while methods that reconstruct geometry from generated images struggle with global view consistency. In this work, we introduce Joint2Human, a novel method that leverages 2D diffusion models to generate detailed 3D human g… ▽ More

    Submitted 6 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  26. arXiv:2312.05826   

    cs.CV

    R2Human: Real-Time 3D Human Appearance Rendering from a Single Image

    Authors: Yuanwang Yang, Qiao Feng, Yu-Kun Lai, Kun Li

    Abstract: Rendering 3D human appearance in different views is crucial for achieving holographic communication and immersive VR/AR. Existing methods either rely on multi-camera setups or have low-quality rendered images from a single image. In this paper, we propose R2Human, the first approach for real-time inference and rendering of photorealistic 3D human appearance from a single image. The core of our app… ▽ More

    Submitted 13 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: The manuscript should be revised because the authors aim to achieve better organization in writing and more experimental analysis

  27. arXiv:2312.05804  [pdf, other

    cs.CV

    Layered 3D Human Generation via Semantic-Aware Diffusion Model

    Authors: Yi Wang, Jian Ma, Ruizhi Shao, Qiao Feng, Yu-Kun Lai, Yebin Liu, Kun Li

    Abstract: The generation of 3D clothed humans has attracted increasing attention in recent years. However, existing work cannot generate layered high-quality 3D humans with consistent body structures. As a result, these methods are unable to arbitrarily and separately change and edit the body and clothing of the human. In this paper, we propose a text-driven layered 3D human generation framework based on a… ▽ More

    Submitted 19 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  28. arXiv:2311.08105  [pdf, other

    cs.LG cs.CL

    DiLoCo: Distributed Low-Communication Training of Language Models

    Authors: Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, Marc'Aurelio Ranzato, Arthur Szlam, Jiajun Shen

    Abstract: Large language models (LLM) have become a critical component in many applications of machine learning. However, standard approaches to training LLM require a large number of tightly interconnected accelerators, with devices exchanging gradients and other intermediate states at each optimization step. While it is difficult to build and maintain a single computing cluster hosting many accelerators,… ▽ More

    Submitted 2 December, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  29. arXiv:2310.17923  [pdf, other

    cs.RO

    Multi-fingered Dynamic Gras** for Unknown Objects

    Authors: Yannick Burkhardt, Qian Feng, Jianxiang Feng, Karan Sharma, Zhaopeng Chen, Alois Knoll

    Abstract: Dexterous gras** of unseen objects in dynamic environments is an essential prerequisite for the advanced manipulation of autonomous robots. Prior advances rely on several assumptions that simplify the setup, including environment stationarity, pre-defined objects, and low-dimensional end-effectors. Though easing the problem and enabling progress, it undermined the complexity of the real world. A… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  30. arXiv:2310.10647  [pdf, other

    cs.CV cs.AI cs.LG

    A Survey on Video Diffusion Models

    Authors: Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang

    Abstract: The recent wave of AI-generated content (AIGC) has witnessed substantial success in computer vision, with the diffusion model playing a crucial role in this achievement. Due to their impressive generative capabilities, diffusion models are gradually superseding methods based on GANs and auto-regressive Transformers, demonstrating exceptional performance not only in image generation and editing, bu… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  31. Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability

    Authors: Taichi Higasa, Keitaro Tanaka, Qi Feng, Shigeo Morishima

    Abstract: Language learners should regularly engage in reading challenging materials as part of their study routine. Nevertheless, constantly referring to dictionaries is time-consuming and distracting. This paper presents a novel gaze-driven sentence simplification system designed to enhance reading comprehension while maintaining their focus on the content. Our system incorporates machine learning models… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: Accepted by ACM ICMI 2023 workshops (Multimodal, Interactive Interfaces for Education)

  32. arXiv:2309.10375  [pdf, other

    cs.CV

    Pointing out Human Answer Mistakes in a Goal-Oriented Visual Dialogue

    Authors: Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima

    Abstract: Effective communication between humans and intelligent agents has promising applications for solving complex problems. One such approach is visual dialogue, which leverages multimodal context to assist humans. However, real-world scenarios occasionally involve human mistakes, which can cause intelligent agents to fail. While most prior research assumes perfect answers from human interlocutors, we… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted at ICCVW 2023

  33. arXiv:2308.16419  [pdf, other

    cs.NI

    Deadline Aware Two-Timescale Resource Allocation for VR Video Streaming

    Authors: Qingxuan Feng, Peng Yang, Zhixuan Huang, Jiayin Chen, Ning Zhang

    Abstract: In this paper, we investigate resource allocation problem in the context of multiple virtual reality (VR) video flows sharing a certain link, considering specific deadline of each video frame and the impact of different frames on video quality. Firstly, we establish a queuing delay bound estimation model, enabling link node to proactively discard frames that will exceed the deadline. Secondly, we… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  34. arXiv:2308.15030  [pdf, other

    cs.AI

    SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget

    Authors: Rui Kong, Yuanchun Li, Qingtian Feng, Weijun Wang, Xiaozhou Ye, Ye Ouyang, Linghe Kong, Yunxin Liu

    Abstract: Mixture of experts (MoE) is a popular technique to improve capacity of Large Language Models (LLMs) with conditionally-activated parallel experts. However, serving MoE models on memory-constrained devices is challenging due to the large parameter size. Typical solutions such as memory swap** or expert pruning may lead to significantly higher latency or severe accuracy loss. In this paper, we int… ▽ More

    Submitted 29 May, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: Accepted at ACL 2024

  35. arXiv:2308.13420  [pdf, other

    cs.NE cs.AI cs.LG

    Reinforcement Learning-assisted Evolutionary Algorithm: A Survey and Research Opportunities

    Authors: Yanjie Song, Yutong Wu, Yangyang Guo, Ran Yan, P. N. Suganthan, Yue Zhang, Witold Pedrycz, Swagatam Das, Rammohan Mallipeddi, Oladayo Solomon Ajani. Qiang Feng

    Abstract: Evolutionary algorithms (EA), a class of stochastic search methods based on the principles of natural evolution, have received widespread acclaim for their exceptional performance in various real-world optimization problems. While researchers worldwide have proposed a wide variety of EAs, certain limitations remain, such as slow convergence speed and poor generalization capabilities. Consequently,… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 28 pages, 16 figures

    Report number: SWEVO-S-2023-00771

  36. arXiv:2308.13042  [pdf, other

    cs.CV cs.HC

    Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation

    Authors: Qi Feng, Hubert P. H. Shum, Shigeo Morishima

    Abstract: Pre-captured immersive environments using omnidirectional cameras provide a wide range of virtual reality applications. Previous research has shown that manipulating the eye height in egocentric virtual environments can significantly affect distance perception and immersion. However, the influence of eye height in pre-captured real environments has received less attention due to the difficulty of… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 10 pages, 13 figures, 3 tables, submitted to ISMAR 2023

  37. arXiv:2308.05110  [pdf, other

    cs.LG

    Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke

    Authors: Qizhang Feng, Jiayi Yuan, Forhan Bin Emdad, Karim Hanna, Xia Hu, Zhe He

    Abstract: Stroke is a significant cause of mortality and morbidity, necessitating early predictive strategies to minimize risks. Traditional methods for evaluating patients, such as Acute Physiology and Chronic Health Evaluation (APACHE II, IV) and Simplified Acute Physiology Score III (SAPS III), have limited accuracy and interpretability. This paper proposes a novel approach: an interpretable, attention-b… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  38. arXiv:2308.01046  [pdf, other

    cs.NI

    Flexible Coherent Optical Access: Architectures, Algorithms, and Demonstrations

    Authors: Ji Zhou, Haide Wang, Kuo Zhang, Xi Chen, Qiguang Feng, Keshuang Zheng, Yijia Zhao, Zhen Dong, Tao Gui, Zhicheng Ye, Liangchuan Li

    Abstract: To cope with the explosive bandwidth demand, significant progress has been made in the ITU-T standardization sector to define a higher-speed passive optical network (PON) with a 50Gb/s line rate. Recently, 50G PON becomes mature gradually, which means it is time to discuss beyond 50G PON. For ensuring an acceptable optical power budget, beyond 50G PON will potentially use coherent technologies, wh… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: The paper has been submitted to the Journal of Lightwave Technology

  39. arXiv:2308.00947  [pdf, other

    eess.IV cs.CV cs.LG

    Decomposing and Coupling Saliency Map for Lesion Segmentation in Ultrasound Images

    Authors: Zhenyuan Ning, Yixiao Mao, Qian** Feng, Shengzhou Zhong, Yu Zhang

    Abstract: Complex scenario of ultrasound image, in which adjacent tissues (i.e., background) share similar intensity with and even contain richer texture patterns than lesion region (i.e., foreground), brings a unique challenge for accurate lesion segmentation. This work presents a decomposition-coupling network, called DC-Net, to deal with this challenge in a (foreground-background) saliency map disentangl… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 18 pages, 18 figures

  40. arXiv:2307.05918  [pdf, other

    cs.DB

    DSPC: Efficiently Answering Shortest Path Counting on Dynamic Graphs

    Authors: Qingshuai Feng, You Peng, Wenjie Zhang, Xuemin Lin, Ying Zhang

    Abstract: The widespread use of graph data in various applications and the highly dynamic nature of today's networks have made it imperative to analyze structural trends in dynamic graphs on a continual basis. The shortest path is a fundamental concept in graph analysis and recent research shows that counting the number of shortest paths between two vertices is crucial in applications like potential friend… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  41. arXiv:2306.10599  [pdf, ps, other

    cs.SE

    An Empirical Study of Untangling Patterns of Two-Class Dependency Cycles

    Authors: Qiong Feng, Shuwen Liu, Huan Ji, Xiaotian Ma, Peng Liang

    Abstract: Dependency cycles pose a significant challenge to software quality and maintainability. However, there is limited understanding of how practitioners resolve dependency cycles in real-world scenarios. This paper presents an empirical study investigating the recurring patterns employed by software developers to resolve dependency cycles between two classes in practice. We analyzed the data from 38 o… ▽ More

    Submitted 17 December, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: Preprint accepted for publication in Empirical Software Engineering, 2023

  42. arXiv:2306.05760  [pdf, other

    cs.LG cs.AI

    Efficient GNN Explanation via Learning Removal-based Attribution

    Authors: Yao Rong, Guanchu Wang, Qizhang Feng, Ninghao Liu, Zirui Liu, Enkelejda Kasneci, Xia Hu

    Abstract: As Graph Neural Networks (GNNs) have been widely used in real-world applications, model explanations are required not only by users but also by legal regulations. However, simultaneously achieving high fidelity and low computational costs in generating explanations has been a challenge for current methods. In this work, we propose a framework of GNN explanation named LeArn Removal-based Attributio… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  43. Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning

    Authors: Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima

    Abstract: This paper presents a novel metric learning approach to address the performance gap between normal and silent speech in visual speech recognition (VSR). The difference in lip movements between the two poses a challenge for existing VSR models, which exhibit degraded accuracy when applied to silent speech. To solve this issue and tackle the scarcity of training data for silent speech, we propose to… ▽ More

    Submitted 16 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by INTERSPEECH 2023

  44. arXiv:2305.12895  [pdf, other

    cs.LG

    DEGREE: Decomposition Based Explanation For Graph Neural Networks

    Authors: Qizhang Feng, Ninghao Liu, Fan Yang, Ruixiang Tang, Mengnan Du, Xia Hu

    Abstract: Graph Neural Networks (GNNs) are gaining extensive attention for their application in graph data. However, the black-box nature of GNNs prevents users from understanding and trusting the models, thus hampering their applicability. Whereas explaining GNNs remains a challenge, most existing methods fall into approximation based and perturbation based approaches with suffer from faithfulness problems… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  45. arXiv:2304.13712  [pdf, other

    cs.CL cs.AI cs.LG

    Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

    Authors: **gfeng Yang, Hongye **, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Bing Yin, Xia Hu

    Abstract: This paper presents a comprehensive and practical guide for practitioners and end-users working with Large Language Models (LLMs) in their downstream natural language processing (NLP) tasks. We provide discussions and insights into the usage of LLMs from the perspectives of models, data, and downstream tasks. Firstly, we offer an introduction and brief summary of current GPT- and BERT-style LLMs.… ▽ More

    Submitted 27 April, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

  46. arXiv:2304.07500  [pdf, other

    cs.CV

    The 7th AI City Challenge

    Authors: Milind Naphade, Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Qi Feng, Vitaly Ablavsky, Stan Sclaroff, Pranamesh Chakraborty, Sanjita Prajapati, Alice Li, Shangru Li, Krishna Kunadharaju, Shenxin Jiang, Rama Chellappa

    Abstract: The AI City Challenge's seventh edition emphasizes two domains at the intersection of computer vision and artificial intelligence - retail business and Intelligent Traffic Systems (ITS) - that have considerable untapped potential. The 2023 challenge had five tracks, which drew a record-breaking number of participation requests from 508 teams across 46 countries. Track 1 was a brand new track that… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: Summary of the 7th AI City Challenge Workshop in conjunction with CVPR 2023

  47. arXiv:2303.11470  [pdf, other

    cs.CR cs.AI cs.LG cs.MM

    Did You Train on My Dataset? Towards Public Dataset Protection with Clean-Label Backdoor Watermarking

    Authors: Ruixiang Tang, Qizhang Feng, Ninghao Liu, Fan Yang, Xia Hu

    Abstract: The huge supporting training data on the Internet has been a key factor in the success of deep learning models. However, this abundance of public-available data also raises concerns about the unauthorized exploitation of datasets for commercial purposes, which is forbidden by dataset licenses. In this paper, we propose a backdoor-based watermarking approach that serves as a general framework for s… ▽ More

    Submitted 10 April, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  48. arXiv:2303.00460  [pdf, other

    cs.RO eess.SY

    Multi-Arm Robot Task Planning for Fruit Harvesting Using Multi-Agent Reinforcement Learning

    Authors: Tao Li, Feng Xie, Ya Xiong, Qingchun Feng

    Abstract: The emergence of harvesting robotics offers a promising solution to the issue of limited agricultural labor resources and the increasing demand for fruits. Despite notable advancements in the field of harvesting robotics, the utilization of such technology in orchards is still limited. The key challenge is to improve operational efficiency. Taking into account inner-arm conflicts, couplings of DoF… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  49. Graph-based Village Level Poverty Identification

    Authors: **g Ma, Liangwei Yang, Qiong Feng, Weizhi Zhang, Philip S. Yu

    Abstract: Poverty status identification is the first obstacle to eradicating poverty. Village-level poverty identification is very challenging due to the arduous field investigation and insufficient information. The development of the Web infrastructure and its modeling tools provides fresh approaches to identifying poor villages. Upon those techniques, we build a village graph for village poverty status id… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 5 pages, accepted by theWebConf 2023

  50. arXiv:2302.02553  [pdf, ps, other

    cs.CV

    A Correction-Based Dynamic Enhancement Framework towards Underwater Detection

    Authors: Yanling Qiu, Qianxue Feng, Boqin Cai, Hongan Wei, Weiling Chen

    Abstract: To assist underwater object detection for better performance, image enhancement technology is often used as a pre-processing step. However, most of the existing enhancement methods tend to pursue the visual quality of an image, instead of providing effective help for detection tasks. In fact, image enhancement algorithms should be optimized with the goal of utility improvement. In this paper, to a… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.