Skip to main content

Showing 1–50 of 94 results for author: Wei, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18664  [pdf, other

    cs.CL cs.LG

    Evaluating Copyright Takedown Methods for Language Models

    Authors: Boyi Wei, Weijia Shi, Yangsibo Huang, Noah A. Smith, Chiyuan Zhang, Luke Zettlemoyer, Kai Li, Peter Henderson

    Abstract: Language models (LMs) derive their capabilities from extensive training on diverse data, including potentially copyrighted material. These models can memorize and generate content similar to their training data, posing potential concerns. Therefore, model creators are motivated to develop mitigation methods that prevent generating protected content. We term this procedure as copyright takedowns fo… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 31 pages, 9 figures, 14 tables

  2. arXiv:2406.18364  [pdf

    cs.CL cs.AI

    Research on Information Extraction of LCSTS Dataset Based on an Improved BERTSum-LSTM Model

    Authors: Yiming Chen, Haobin Chen, Simin Liu, Yunyun Liu, Fanhao Zhou, Bing Wei

    Abstract: With the continuous advancement of artificial intelligence, natural language processing technology has become widely utilized in various fields. At the same time, there are many challenges in creating Chinese news summaries. First of all, the semantics of Chinese news is complex, and the amount of information is enormous. Extracting critical information from Chinese news presents a significant cha… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: submitted to ICMIII 2024

  3. arXiv:2406.15485  [pdf, other

    cs.CL cs.CV

    SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection

    Authors: Xingjian Hu, Baole Wei, Liangcai Gao

    Abstract: Text line detection is a key task in historical document analysis facing many challenges of arbitrary-shaped text lines, dense texts, and text lines with high aspect ratios, etc. In this paper, we propose a general framework for historical document text detection (SegHist), enabling existing segmentation-based text detection methods to effectively address the challenges, especially text lines with… ▽ More

    Submitted 25 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by ICDAR2024

  4. arXiv:2406.14598  [pdf, other

    cs.AI

    SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

    Authors: Tinghao Xie, Xiangyu Qi, Yi Zeng, Yangsibo Huang, Udari Madhushani Sehwag, Kaixuan Huang, Luxi He, Boyi Wei, Dacheng Li, Ying Sheng, Ruoxi Jia, Bo Li, Kai Li, Danqi Chen, Peter Henderson, Prateek Mittal

    Abstract: Evaluating aligned large language models' (LLMs) ability to recognize and reject unsafe user requests is crucial for safe, policy-compliant deployments. Existing evaluation efforts, however, face three limitations that we address with SORRY-Bench, our proposed benchmark. First, existing methods often use coarse-grained taxonomies of unsafe topics, and are over-representing some fine-grained topics… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.10101  [pdf, other

    cs.SE

    Requirements are All You Need: From Requirements to Code with LLMs

    Authors: Bingyang Wei

    Abstract: The pervasive use of textual formats in the documentation of software requirements presents a great opportunity for applying large language models (LLMs) to software engineering tasks. High-quality software requirements not only enhance the manual software development process but also position organizations to fully harness the potential of the emerging LLMs technology. This paper introduces a tai… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2406.08909  [pdf, other

    cs.CV

    A Label-Free and Non-Monotonic Metric for Evaluating Denoising in Event Cameras

    Authors: Chenyang Shi, Shasha Guo, Boyi Wei, Hanxiao Liu, Yibo Zhang, Ningfang Song, **g **

    Abstract: Event cameras are renowned for their high efficiency due to outputting a sparse, asynchronous stream of events. However, they are plagued by noisy events, especially in low light conditions. Denoising is an essential task for event cameras, but evaluating denoising performance is challenging. Label-dependent denoising metrics involve artificially adding noise to clean sequences, complicating evalu… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2406.05746  [pdf

    cs.AI cs.HC cs.LG

    Methodology and Real-World Applications of Dynamic Uncertain Causality Graph for Clinical Diagnosis with Explainability and Invariance

    Authors: Zhan Zhang, Qin Zhang, Yang Jiao, Lin Lu, Lin Ma, Aihua Liu, Xiao Liu, Juan Zhao, Yajun Xue, Bing Wei, Mingxia Zhang, Ru Gao, Hong Zhao, Jie Lu, Fan Li, Yang Zhang, Yiming Wang, Lei Zhang, Fengwei Tian, Jie Hu, Xin Gou

    Abstract: AI-aided clinical diagnosis is desired in medical care. Existing deep learning models lack explainability and mainly focus on image analysis. The recently developed Dynamic Uncertain Causality Graph (DUCG) approach is causality-driven, explainable, and invariant across different application scenarios, without problems of data collection, labeling, fitting, privacy, bias, generalization, high cost… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Journal ref: Artificaial Intelligence Review, (2024) 57:151

  8. arXiv:2406.05707  [pdf, other

    cs.CL cs.AI

    QGEval: A Benchmark for Question Generation Evaluation

    Authors: Wei** Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu

    Abstract: Automatically generated questions often suffer from problems such as unclear expression or factual inaccuracies, requiring a reliable and comprehensive evaluation of their quality. Human evaluation is frequently used in the field of question generation (QG) and is one of the most accurate evaluation methods. It also serves as the standard for automatic metrics. However, there is a lack of unified… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  9. arXiv:2405.19769  [pdf, other

    cs.CV

    All-In-One Medical Image Restoration via Task-Adaptive Routing

    Authors: Zhiwen Yang, Haowei Chen, Ziniu Qian, Yang Yi, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

    Abstract: Although single-task medical image restoration (MedIR) has witnessed remarkable success, the limited generalizability of these methods poses a substantial obstacle to wider application. In this paper, we focus on the task of all-in-one medical image restoration, aiming to address multiple distinct MedIR tasks with a single universal model. Nonetheless, due to significant differences between differ… ▽ More

    Submitted 28 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: This article has been early accepted by MICCAI 2024

  10. arXiv:2405.19524  [pdf, other

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Gei**, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  11. arXiv:2405.15914  [pdf, other

    cs.CV

    ExactDreamer: High-Fidelity Text-to-3D Content Creation via Exact Score Matching

    Authors: Yumin Zhang, Xingyu Miao, Haoran Duan, Bo Wei, Tejal Shah, Yang Long, Rajiv Ranjan

    Abstract: Text-to-3D content creation is a rapidly evolving research area. Given the scarcity of 3D data, current approaches often adapt pre-trained 2D diffusion models for 3D synthesis. Among these approaches, Score Distillation Sampling (SDS) has been widely adopted. However, the issue of over-smoothing poses a significant limitation on the high-fidelity generation of 3D models. To address this challenge,… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  12. arXiv:2405.15544  [pdf, other

    q-bio.QM cs.AI cs.LG

    Knowledge-enhanced Relation Graph and Task Sampling for Few-shot Molecular Property Prediction

    Authors: Zeyu Wang, Tianyi Jiang, Yao Lu, Xiaoze Bao, Shanqing Yu, Bin Wei, Qi Xuan

    Abstract: Recently, few-shot molecular property prediction (FSMPP) has garnered increasing attention. Despite impressive breakthroughs achieved by existing methods, they often overlook the inherent many-to-many relationships between molecules and properties, which limits their performance. For instance, similar substructures of molecules can inspire the exploration of new compounds. Additionally, the relati… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  13. arXiv:2405.10674  [pdf, other

    cs.CV cs.AI

    From Sora What We Can See: A Survey of Text-to-Video Generation

    Authors: Rui Sun, Yumin Zhang, Tejal Shah, Jiahao Sun, Shuoying Zhang, Wenqi Li, Haoran Duan, Bo Wei, Rajiv Ranjan

    Abstract: With impressive achievements made, artificial intelligence is on the path forward to artificial general intelligence. Sora, developed by OpenAI, which is capable of minute-level world-simulative abilities can be considered as a milestone on this developmental path. However, despite its notable successes, Sora still encounters various obstacles that need to be resolved. In this survey, we embark fr… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: A comprehensive list of text-to-video generation studies in this survey is available at https://github.com/soraw-ai/Awesome-Text-to-Video-Generation

  14. arXiv:2403.14374  [pdf, other

    cs.CL cs.IR

    FIT-RAG: Black-Box RAG with Factual Information and Token Reduction

    Authors: Yuren Mao, Xuemei Dong, Wenyi Xu, Yunjun Gao, Bin Wei, Ying Zhang

    Abstract: Due to the extraordinarily large number of parameters, fine-tuning Large Language Models (LLMs) to update long-tail or out-of-date knowledge is impractical in lots of applications. To avoid fine-tuning, we can alternatively treat a LLM as a black-box (i.e., freeze the parameters of the LLM) and augment it with a Retrieval-Augmented Generation (RAG) system, namely black-box RAG. Recently, black-box… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  15. arXiv:2402.05162  [pdf, other

    cs.LG cs.AI cs.CL

    Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

    Authors: Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

    Abstract: Large language models (LLMs) show inherent brittleness in their safety mechanisms, as evidenced by their susceptibility to jailbreaking and even non-malicious fine-tuning. This study explores this brittleness of safety alignment by leveraging pruning and low-rank modifications. We develop methods to identify critical regions that are vital for safety guardrails, and that are disentangled from util… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 22 pages, 9 figures. Project page is available at https://boyiwei.com/alignment-attribution/

  16. arXiv:2401.08185  [pdf

    cs.CV cs.AI eess.IV

    DPAFNet:Dual Path Attention Fusion Network for Single Image Deraining

    Authors: Bingcai Wei

    Abstract: Rainy weather will have a significant impact on the regular operation of the imaging system. Based on this premise, image rain removal has always been a popular branch of low-level visual tasks, especially methods using deep neural networks. However, most neural networks are but-branched, such as only using convolutional neural networks or Transformers, which is unfavourable for the multidimension… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  17. arXiv:2311.13317  [pdf, other

    cs.CV

    Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution

    Authors: Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei

    Abstract: Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR). Previous methods predominantly employ discriminative Convolutional Neural Networks (CNNs) augmented with diverse forms of text guidance to address this issue. Nevertheless, they remain deficie… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  18. arXiv:2309.09984  [pdf

    q-bio.NC cs.NE

    BDEC:Brain Deep Embedded Clustering model

    Authors: Xiaoxiao Ma, Chunzhi Yi, Zhicai Zhong, Hui Zhou, Baichun Wei, Haiqi Zhu, Feng Jiang

    Abstract: An essential premise for neuroscience brain network analysis is the successful segmentation of the cerebral cortex into functionally homogeneous regions. Resting-state functional magnetic resonance imaging (rs-fMRI), capturing the spontaneous activities of the brain, provides the potential for cortical parcellation. Previous parcellation methods can be roughly categorized into three groups, mainly… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  19. arXiv:2309.07170  [pdf, other

    eess.SP cs.LG

    Overview of Human Activity Recognition Using Sensor Data

    Authors: Rebeen Ali Hamad, Wai Lok Woo, Bo Wei, Longzhi Yang

    Abstract: Human activity recognition (HAR) is an essential research field that has been used in different applications including home and workplace automation, security and surveillance as well as healthcare. Starting from conventional machine learning methods to the recently develo** deep learning techniques and the Internet of things, significant contributions have been shown in the HAR area in the last… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  20. arXiv:2307.05249  [pdf, other

    eess.IV cs.CV cs.LG

    DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

    Authors: Zhiwen Yang, Yang Zhou, Hui Zhang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Multi-center positron emission tomography (PET) image synthesis aims at recovering low-dose PET images from multiple different centers. The generalizability of existing methods can still be suboptimal for a multi-center study due to domain shifts, which result from non-identical data distribution among centers with different imaging systems/protocols. While some approaches address domain shifts by… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: This article has been early accepted by MICCAI 2023,but has not been fully edited. Content may change prior to final publication

  21. arXiv:2306.17659  [pdf, other

    cs.CV

    Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

    Authors: Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Large-scale visual-language pre-trained models (VLPM) have proven their excellent performance in downstream object detection for natural scenes. However, zero-shot nuclei detection on H\&E images via VLPMs remains underexplored. The large gap between medical images and the web-originated text-image pairs used for pre-training makes it a challenging task. In this paper, we attempt to explore the po… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: This article has been accepted by MICCAI 2023,but has not been fully edited. Content may change prior to final publication

  22. Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

    Authors: Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which is especially time-consuming and laborious for the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is under… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI https://doi.org/10.1109/TMI.2023.3275609, IEEE Transactions on Medical Imaging. Code: https://github.com/wuyongjianCODE/Cyclic

  23. arXiv:2305.10198  [pdf, other

    cs.CV eess.IV

    IDO-VFI: Identifying Dynamics via Optical Flow Guidance for Video Frame Interpolation with Events

    Authors: Chenyang Shi, Hanxiao Liu, **g **, Wenzhuo Li, Yuzhen Li, Boyi Wei, Yibo Zhang

    Abstract: Video frame interpolation aims to generate high-quality intermediate frames from boundary frames and increase frame rate. While existing linear, symmetric and nonlinear models are used to bridge the gap from the lack of inter-frame motion, they cannot reconstruct real motions. Event cameras, however, are ideal for capturing inter-frame dynamics with their extremely high temporal resolution. In thi… ▽ More

    Submitted 18 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  24. arXiv:2305.03270  [pdf, other

    cs.RO

    Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

    Authors: Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham , et al. (15 additional authors not shown)

    Abstract: We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Published at Robotics: Science and Systems 2023

  25. arXiv:2303.17614  [pdf, other

    cs.HC cs.AI eess.SP

    Estimating Continuous Muscle Fatigue For Multi-Muscle Coordinated Exercise: A Pilot Study

    Authors: Chunzhi Yi, Baichun Wei, Wei **, Jianfei Zhu, Seungmin Rho, Zhiyuan Chen, Feng Jiang

    Abstract: Assessing the progression of muscle fatigue for daily exercises provides vital indicators for precise rehabilitation, personalized training dose, especially under the context of Metaverse. Assessing fatigue of multi-muscle coordination-involved daily exercises requires the neuromuscular features that represent the fatigue-induced characteristics of spatiotemporal adaptions of multiple muscles and… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: submitted to IEEE JBHI

  26. arXiv:2303.15107  [pdf, other

    cs.HC

    ActiveSelfHAR: Incorporating Self Training into Active Learning to Improve Cross-Subject Human Activity Recognition

    Authors: Baichun Wei, Chunzhi Yi, Qi Zhang, Haiqi Zhu, Jianfei Zhu, Feng Jiang

    Abstract: Deep learning-based human activity recognition (HAR) methods have shown great promise in the applications of smart healthcare systems and wireless body sensor network (BSN). Despite their demonstrated performance in laboratory settings, the real-world implementation of such methods is still hindered by the cross-subject issue when adapting to new users. To solve this issue, we propose ActiveSelfHA… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  27. arXiv:2303.04365  [pdf, other

    cs.CV

    SANDFORMER: CNN and Transformer under Gated Fusion for Sand Dust Image Restoration

    Authors: Jun Shi, Bingcai Wei, Gang Zhou, Liye Zhang

    Abstract: Although Convolutional Neural Networks (CNN) have made good progress in image restoration, the intrinsic equivalence and locality of convolutions still constrain further improvements in image quality. Recent vision transformer and self-attention have achieved promising results on various computer vision tasks. However, directly utilizing Transformer for image restoration is a challenging task. In… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  28. arXiv:2302.11095  [pdf, other

    cs.CV

    MM-SFENet: Multi-scale Multi-task Localization and Classification of Bladder Cancer in MRI with Spatial Feature Encoder Network

    Authors: Yu Ren, Guoli Wang, **** Wang, Kunmeng Liu, Quan** Liu, Hongfu Sun, Xiang Li, Benzheng Wei

    Abstract: Background and Objective: Bladder cancer is a common malignant urinary carcinoma, with muscle-invasive and non-muscle-invasive as its two major subtypes. This paper aims to achieve automated bladder cancer invasiveness localization and classification based on MRI. Method: Different from previous efforts that segment bladder wall and tumor, we propose a novel end-to-end multi-scale multi-task spati… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  29. arXiv:2302.11082  [pdf, other

    cs.CV

    BB-GCN: A Bi-modal Bridged Graph Convolutional Network for Multi-label Chest X-Ray Recognition

    Authors: Guoli Wang, **** Wang, **yu Cong, Kunmeng Liu, Benzheng Wei

    Abstract: Multi-label chest X-ray (CXR) recognition involves simultaneously diagnosing and identifying multiple labels for different pathologies. Since pathological labels have rich information about their relationship to each other, modeling the co-occurrence dependencies between pathological labels is essential to improve recognition performance. However, previous methods rely on state variable coding and… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: under Computers in Biology and Medicine submission

  30. arXiv:2302.03222  [pdf, other

    cs.CL

    Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support

    Authors: Stephen Obadinma, Faiza Khan Khattak, Shirley Wang, Tania Sidhom, Elaine Lau, Sean Robertson, **gcheng Niu, Winnie Au, Alif Munim, Karthik Raja K. Bhaskar, Bencheng Wei, Iris Ren, Waqar Muhammad, Erin Li, Bukola Ishola, Michael Wang, Griffin Tanner, Yu-Jia Shiah, Sean X. Zhang, Kwesi P. Apponsah, Kanishk Patel, Jaswinder Narain, Deval Pandya, Xiaodan Zhu, Frank Rudzicz , et al. (1 additional authors not shown)

    Abstract: Building Agent Assistants that can help improve customer service support requires inputs from industry users and their customers, as well as knowledge about state-of-the-art Natural Language Processing (NLP) technology. We combine expertise from academia and industry to bridge the gap and build task/domain-specific Neural Agent Assistants (NAA) with three high-level components for: (1) Intent Iden… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Camera Ready Version of Paper Published in EMNLP 2022 Industry Track

  31. arXiv:2212.14479  [pdf, other

    cs.NI cs.LG

    Pensieve 5G: Implementation of RL-based ABR Algorithm for UHD 4K/8K Content Delivery on Commercial 5G SA/NR-DC Network

    Authors: Kasidis Arunruangsirilert, Bo Wei, Hang Song, Jiro Katto

    Abstract: While the rollout of the fifth-generation mobile network (5G) is underway across the globe with the intention to deliver 4K/8K UHD videos, Augmented Reality (AR), and Virtual Reality (VR) content to the mass amounts of users, the coverage and throughput are still one of the most significant issues, especially in the rural areas, where only 5G in the low-frequency band are being deployed. This call… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: 2023 IEEE Wireless Communications and Networking Conference (WCNC), 26-29 March 2023, Glasgow, Scotland, UK

  32. arXiv:2212.10866  [pdf

    cs.CR

    CyberEye: Obtaining Data from Virtual Desktop by Video

    Authors: Bin Wei

    Abstract: VDI is no longer safe and reliable anymore. VDI(Virtual Desktop Infrastructure, also called Cloud Desktop) is being widely used as working interface to avoid data exfiltration. With VDI client, end users can access internal data without obtaining data actually. In this paper, we present a new approach named CyberEye, to extract data from VDI by video even data transmission has been forbidden. By e… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Open source code: https://github.com/bin-will/cybereye This paper contains 17 pages, 12 figures

  33. arXiv:2212.08418  [pdf, other

    cs.NI

    rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments

    Authors: Bo Wei, Mingcen Gao, Chengwen Luo, Sen Wang, ** Zhang

    Abstract: In this paper, we propose rWiFiSLAM, an indoor localisation system based on WiFi ranging measurements. Indoor localisation techniques play an important role in mobile robots when they cannot access good quality GPS signals in indoor environments. Indoor localisation also has many other applications, such as rescue, smart buildings, etc. Inertial Measurement Units (IMU) have been used for Pedestria… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  34. arXiv:2209.12029  [pdf, other

    cs.LG cs.AI

    Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation

    Authors: Kang Xu, Yan Ma, Bingsheng Wei, Wei Li

    Abstract: While Reinforcement Learning can achieve impressive results for complex tasks, the learned policies are generally prone to fail in downstream tasks with even minor model mismatch or unexpected perturbations. Recent works have demonstrated that a policy population with diverse behavior characteristics can generalize to downstream environments with various discrepancies. However, such policies might… ▽ More

    Submitted 20 May, 2023; v1 submitted 24 September, 2022; originally announced September 2022.

  35. arXiv:2209.02916  [pdf, other

    cs.LG cs.AR

    Hardware Acceleration of Sampling Algorithms in Sample and Aggregate Graph Neural Networks

    Authors: Yuchen Gui, Boyi Wei, Wei Yuan, Xi **

    Abstract: Sampling is an important process in many GNN structures in order to train larger datasets with a smaller computational complexity. However, compared to other processes in GNN (such as aggregate, backward propagation), the sampling process still costs tremendous time, which limits the speed of training. To reduce the time of sampling, hardware acceleration is an ideal choice. However, state of the… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  36. arXiv:2207.12744  [pdf, other

    cs.CV cs.AI

    Distribution Learning Based on Evolutionary Algorithm Assisted Deep Neural Networks for Imbalanced Image Classification

    Authors: Yudi Zhao, Kuangrong Hao, Chaochen Gu, Bing Wei

    Abstract: To address the trade-off problem of quality-diversity for the generated images in imbalanced classification tasks, we research on over-sampling based methods at the feature level instead of the data level and focus on searching the latent feature space for optimal distributions. On this basis, we propose an iMproved Estimation Distribution Algorithm based Latent featUre Distribution Evolution (MED… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  37. arXiv:2206.09427  [pdf

    cs.NI cs.MM eess.IV

    QuDASH: Quantum-inspired rate adaptation approach for DASH video streaming

    Authors: Bo Wei, Hang Song, Makoto Nakamura, Koichi Kimura, Nozomu Togawa, Jiro Katto

    Abstract: Internet traffic is dramatically increasing with the development of network technologies and video streaming traffic accounts for large amount within the total traffic, which reveals the importance to guarantee the quality of content delivery service. Based on the network conditions, adaptive bitrate (ABR) control is utilized as a common technique which can choose the proper bitrate to ensure the… ▽ More

    Submitted 21 October, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

    Comments: Accepted Version

    Journal ref: IEEE Access, 2023

  38. arXiv:2205.08878  [pdf, other

    cs.CV

    Transformer based multiple instance learning for weakly supervised histopathology image segmentation

    Authors: Ziniu Qian, Kailu Li, Maode Lai, Eric I-Chao Chang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Hispathological image segmentation algorithms play a critical role in computer aided diagnosis technology. The development of weakly supervised segmentation algorithm alleviates the problem of medical image annotation that it is time-consuming and labor-intensive. As a subset of weakly supervised learning, Multiple Instance Learning (MIL) has been proven to be effective in segmentation. However, t… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Provisional accepted for MICCAI 2022

  39. Semi-Cycled Generative Adversarial Networks for Real-World Face Super-Resolution

    Authors: Hao Hou, Jun Xu, Yingkun Hou, Xiaotao Hu, Benzheng Wei, Dinggang Shen

    Abstract: Real-world face super-resolution (SR) is a highly ill-posed image restoration task. The fully-cycled Cycle-GAN architecture is widely employed to achieve promising performance on face SR, but prone to produce artifacts upon challenging cases in real-world scenarios, since joint participation in the same degradation branch will impact final performance due to huge domain gap between real-world and… ▽ More

    Submitted 25 January, 2023; v1 submitted 8 May, 2022; originally announced May 2022.

  40. arXiv:2203.10435  [pdf

    cs.CV

    Vision Transformer with Convolutions Architecture Search

    Authors: Haichao Zhang, Kuangrong Hao, Witold Pedrycz, Lei Gao, Xuesong Tang, Bing Wei

    Abstract: Transformers exhibit great advantages in handling computer vision tasks. They model image classification tasks by utilizing a multi-head attention mechanism to process a series of patches consisting of split images. However, for complex tasks, Transformer in computer vision not only requires inheriting a bit of dynamic attention and global context, but also needs to introduce features concerning n… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  41. arXiv:2201.04286  [pdf, other

    cs.NE cs.LG

    Evolutionary Action Selection for Gradient-based Policy Learning

    Authors: Yan Ma, Tianxing Liu, Bingsheng Wei, Yi Liu, Kang Xu, Wei Li

    Abstract: Evolutionary Algorithms (EAs) and Deep Reinforcement Learning (DRL) have recently been integrated to take the advantage of the both methods for better exploration and exploitation.The evolutionary part in these hybrid methods maintains a population of policy networks.However, existing methods focus on optimizing the parameters of policy network, which is usually high-dimensional and tricky for EA.… ▽ More

    Submitted 16 September, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

  42. arXiv:2111.12618  [pdf, other

    cs.IR

    Group based Personalized Search by Integrating Search Behaviour and Friend Network

    Authors: Yujia Zhou, Zhicheng Dou, Bingzheng Wei, Ruobing Xievand Ji-Rong Wen

    Abstract: The key to personalized search is to build the user profile based on historical behaviour. To deal with the users who lack historical data, group based personalized models were proposed to incorporate the profiles of similar users when re-ranking the results. However, similar users are mostly found based on simple lexical or topical similarity in search behaviours. In this paper, we propose a neur… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 10 pages

  43. arXiv:2109.12293  [pdf

    cs.NI cs.MM eess.IV

    Adaptive video transmission using QUBO method and Digital Annealer based on Ising machine

    Authors: Bo Wei, Hang Song, Jiro Katto

    Abstract: With the dramatically increasing video streaming in the total network traffic, it is critical to develop effective algorithms to promote the content delivery service of high quality. Adaptive bitrate (ABR) control is the most essential technique which determines the proper bitrate to be chosen based on network conditions, thus realize high-quality video streaming. In this paper, a novel ABR strate… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

  44. arXiv:2109.10485  [pdf, other

    cs.CL

    The NiuTrans Machine Translation Systems for WMT21

    Authors: Shuhan Zhou, Tao Zhou, Binghao Wei, Yingfeng Luo, Yongyu Mu, Zefan Zhou, Chenglong Wang, Xuanjun Zhou, Chuanhao Lv, Yi **g, Laohu Wang, **gnan Zhang, Canan Huang, Zhongxiang Yan, Chi Hu, Bei Li, Tong Xiao, **gbo Zhu

    Abstract: This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. We made submissions to 9 language directions, including English$\leftrightarrow$$\{$Chinese, Japanese, Russian, Icelandic$\}$ and English$\rightarrow$Hausa tasks. Our primary systems are built on several effective variants of Transformer, e.g., Transformer-DLCL, ODE-Transformer. We also utilize… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  45. arXiv:2109.03391  [pdf

    cs.AI

    Visual Sensation and Perception Computational Models for Deep Learning: State of the art, Challenges and Prospects

    Authors: Bing Wei, Yudi Zhao, Kuangrong Hao, Lei Gao

    Abstract: Visual sensation and perception refers to the process of sensing, organizing, identifying, and interpreting visual information in environmental awareness and understanding. Computational models inspired by visual perception have the characteristics of complexity and diversity, as they come from many subjects such as cognition science, information science, and artificial intelligence. In this paper… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

  46. arXiv:2109.01765  [pdf

    cs.AI

    Effective user intent mining with unsupervised word representation models and topic modelling

    Authors: Bencheng Wei

    Abstract: Understanding the intent behind chat between customers and customer service agents has become a crucial problem nowadays due to an exponential increase in the use of the Internet by people from different cultures and educational backgrounds. More importantly, the explosion of e-commerce has led to a significant increase in text conversation between customers and agents. In this paper, we propose a… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

  47. arXiv:2108.03305  [pdf, other

    cs.CL cs.AI

    Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning

    Authors: Bencheng Wei, Jason Li, Ajay Gupta, Hafiza Umair, Atsu Vovor, Natalie Durzynski

    Abstract: Toxic online speech has become a crucial problem nowadays due to an exponential increase in the use of internet by people from different cultures and educational backgrounds. Differentiating if a text message belongs to hate speech and offensive language is a key challenge in automatic detection of toxic text content. In this paper, we propose an approach to automatically classify tweets into thre… ▽ More

    Submitted 22 August, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

  48. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  49. arXiv:2106.06971  [pdf, other

    eess.IV cs.CV

    NLHD: A Pixel-Level Non-Local Retinex Model for Low-Light Image Enhancement

    Authors: Hao Hou, Yingkun Hou, Yuxuan Shi, Benzheng Wei, Jun Xu

    Abstract: Retinex model has been applied to low-light image enhancement in many existing methods. More appropriate decomposition of a low-light image can help achieve better image enhancement. In this paper, we propose a new pixel-level non-local Haar transform based illumination and reflectance decomposition method (NLHD). The unique low-frequency coefficient of Haar transform on each similar pixel group i… ▽ More

    Submitted 15 June, 2021; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: 14 pages, 11 figures

  50. arXiv:2103.12529  [pdf

    cs.CV

    Enhanced Gradient for Differentiable Architecture Search

    Authors: Haichao Zhang, Kuangrong Hao, Lei Gao, Xuesong Tang, Bing Wei

    Abstract: In recent years, neural architecture search (NAS) methods have been proposed for the automatic generation of task-oriented network architecture in image classification. However, the architectures obtained by existing NAS approaches are optimized only for classification performance and do not adapt to devices with limited computational resources. To address this challenge, we propose a neural netwo… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.