Showing 1–50 of 84 results for author: Su, R

  1. arXiv:2407.00431 [pdf, other]

    cs.CV

    Location embedding based pairwise distance learning for fine-grained diagnosis of urinary stones

    Authors: Qiangguo Jin, Jiapeng Huang, Changming Sun, Hui Cui, Ping Xuan, Ran Su, Leyi Wei, Yu-Jie Wu, Chia-An Wu, Henry B. L. Duh, Yueh-Hsun Lu

    Abstract: The precise diagnosis of urinary stones is crucial for devising effective treatment strategies. The diagnostic process, however, is often complicated by the low contrast between stones and surrounding tissues, as well as the variability in stone locations across different patients. To address this issue, we propose a novel location embedding based pairwise distance learning network (LEPD-Net) that…

    Submitted 29 June, 2024; originally announced July 2024.

    Journal ref: MICCAI 2024

  2. arXiv:2406.14964 [pdf, other]

    cs.CV

    VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation

    Authors: Zixuan Chen, Ruijie Su, Jiahao Zhu, Lingxiao Yang, Jian-Huang Lai, Xiaohua Xie

    Abstract: Text-to-3D generation aims to create 3D assets from text-to-image diffusion models. However, existing methods face an inherent bottleneck in generation quality because the widely-used objectives such as Score Distillation Sampling (SDS) inappropriately omit U-Net jacobians for swift generation, leading to significant bias compared to the "true" gradient obtained by full denoising sampling. This bi…

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.00341 [pdf, other]

    eess.IV cs.CV

    DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation

    Authors: Qihang Xie, Mengguo Guo, Lei Mou, Dan Zhang, Da Chen, Caifeng Shan, Yitian Zhao, Ruisheng Su, Jiong Zhang

    Abstract: Cerebrovascular diseases (CVDs) remain a leading cause of global disability and mortality. Digital Subtraction Angiography (DSA) sequences, recognized as the golden standard for diagnosing CVDs, can clearly visualize the dynamic flow and reveal pathological conditions within the cerebrovasculature. Therefore, precise segmentation of cerebral arteries (CAs) and classification between their main tru…

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2405.09744 [pdf, other]

    cs.CL cs.AI

    Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts

    Authors: Ruolin Su, Biing-Hwang Juang

    Abstract: Task-oriented dialogue systems are broadly used in virtual assistants and other automated services, providing interfaces between users and machines to facilitate specific tasks. Nowadays, task-oriented dialogue systems have greatly benefited from pre-trained language models (PLMs). However, their task-solving performance is constrained by the inherent capacities of PLMs, and scaling these models i…

    Submitted 15 May, 2024; originally announced May 2024.

  5. arXiv:2405.08935 [pdf, other]

    cs.RO

    Function based sim-to-real learning for shape control of deformable free-form surfaces

    Authors: Yingjun Tian, Guoxin Fang, Renbo Su, Weiming Wang, Simeon Gill, Andrew Weightman, Charlie C. L. Wang

    Abstract: For the shape control of deformable free-form surfaces, simulation plays a crucial role in establishing the mapping between the actuation parameters and the deformed shapes. The differentiation of this forward kinematic mapping is usually employed to solve the inverse kinematic problem for determining the actuation parameters that can realize a target shape. However, the free-form surfaces obtaine…

    Submitted 14 May, 2024; originally announced May 2024.

  6. arXiv:2405.05017 [pdf, other]

    cs.SE

    6G Software Engineering: A Systematic Mapping Study

    Authors: Ruoyu Su, Xiaozhou Li, Davide Taibi

    Abstract: 6G will revolutionize the software world allowing faster cellular communications and a massive number of connected devices. 6G will enable a shift towards a continuous edge-to-cloud architecture. Current cloud solutions, where all the data is transferred and computed in the cloud, are not sustainable in such a large network of devices. Current technologies, including development methods, software…

    Submitted 8 May, 2024; originally announced May 2024.

  7. arXiv:2404.11119 [pdf, other]

    cs.IR cs.MM

    DRepMRec: A Dual Representation Learning Framework for Multimodal Recommendation

    Authors: Kangning Zhang, Yingjie Qin, Ruilong Su, Yifan Liu, Jiarui Jin, Weinan Zhang, Yong Yu

    Abstract: Multimodal Recommendation focuses mainly on how to effectively integrate behavior and multimodal information in the recommendation task. Previous works suffer from two major issues. Firstly, the training process tightly couples the behavior module and multimodal module by jointly optimizing them using the sharing model parameters, which leads to suboptimal performance since behavior signals and mo…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 8 pages, 9 figures

  8. Inter- and intra-uncertainty based feature aggregation model for semi-supervised histopathology image segmentation

    Authors: Qiangguo Jin, Hui Cui, Changming Sun, Yang Song, Jiangbin Zheng, Leilei Cao, Leyi Wei, Ran Su

    Abstract: Acquiring pixel-level annotations is often limited in applications such as histology studies that require domain expertise. Various semi-supervised learning approaches have been developed to work with limited ground truth annotations, such as the popular teacher-student models. However, hierarchical prediction uncertainty within the student model (intra-uncertainty) and image prediction uncertaint…

    Submitted 19 March, 2024; originally announced March 2024.

    Journal ref: Expert Systems with Applications, 2024, 238: 122093

  9. arXiv:2403.12384 [pdf, other]

    cs.IR cs.LG

    An Aligning and Training Framework for Multimodal Recommendations

    Authors: Yifan Liu, Kangning Zhang, Xiangyuan Ren, Yanhua Huang, Jiarui Jin, Yingjie Qin, Ruilong Su, Ruiwen Xu, Weinan Zhang

    Abstract: With the development of multimedia applications, multimodal recommendations play an essential role, as they can leverage rich contexts beyond user and item interactions. Existing methods mainly use them to help learn ID features; however, there exist semantic gaps among multimodal content features and ID features. Directly using multimodal information as an auxiliary would lead to misalignment in…

    Submitted 21 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 11 pages, revise some typos, correct some explanations

  10. arXiv:2403.05820 [pdf, other]

    cs.SD cs.CL eess.AS

    An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

    Authors: Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang

    Abstract: Acoustic-to-articulatory inversion (AAI) is to convert audio into articulator movements, such as ultrasound tongue imaging (UTI) data. An issue of existing AAI methods is only using the personalized acoustic information to derive the general patterns of tongue motions, and thus the quality of generated UTI data is limited. To address this issue, this paper proposes an audio-textual diffusion model…

    Submitted 12 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted at ICASSP 2024

  11. arXiv:2403.05753 [pdf, other]

    eess.IV cs.CV

    UDCR: Unsupervised Aortic DSA/CTA Rigid Registration Using Deep Reinforcement Learning and Overlap Degree Calculation

    Authors: Wentao Liu, Bowen Liang, Weijin Xu, Tong Tian, Qingsheng Lu, Xipeng Pan, Haoyuan Li, Siyu Tian, Huihua Yang, Ruisheng Su

    Abstract: The rigid registration of aortic Digital Subtraction Angiography (DSA) and Computed Tomography Angiography (CTA) can provide 3D anatomical details of the vasculature for the interventional surgical treatment of conditions such as aortic dissection and aortic aneurysms, holding significant value for clinical research. However, the current methods for 2D/3D image registration are dependent on manual…

    Submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2403.05748 [pdf, other]

    cs.RO

    Image-Guided Autonomous Guidewire Navigation in Robot-Assisted Endovascular Interventions using Reinforcement Learning

    Authors: Wentao Liu, Tong Tian, Weijin Xu, Bowen Liang, Qingsheng Lu, Xipeng Pan, Wenyi Zhao, Huihua Yang, Ruisheng Su

    Abstract: Autonomous robots in endovascular interventions possess the potential to navigate guidewires with safety and reliability, while reducing human error and shortening surgical time. However, current methods of guidewire navigation based on Reinforcement Learning (RL) depend on manual demonstration data or magnetic guidance. In this work, we propose an Image-guided Autonomous Guidewire Navigation (IAG…

    Submitted 8 March, 2024; originally announced March 2024.

  13. Kernel Correlation-Dissimilarity for Multiple Kernel k-Means Clustering

    Authors: Rina Su, Yu Guo, Caiying Wu, Qiyu Jin, Tieyong Zeng

    Abstract: The main objective of the Multiple Kernel k-Means (MKKM) algorithm is to extract non-linear information and achieve optimal clustering by optimizing base kernel matrices. Current methods enhance information diversity and reduce redundancy by exploiting interdependencies among multiple kernels based on correlations or dissimilarities. Nevertheless, relying solely on a single metric, such as correla…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 36 pages. This paper was accepted by Pattern Recognition on January 31, 2024

    Journal ref: Pattern Recognition, 2024, 150:110307

  14. arXiv:2401.11867 [pdf, other]

    cs.SE

    Modular Monolith: Is This the Trend in Software Architecture?

    Authors: Ruoyu Su, Xiaozhou Li

    Abstract: Recently modular monolith architecture has attracted the attention of practitioners, as Google proposed "Service Weaver" framework to enable developers to write applications as modular monolithic and deploy them as a set of microservices. Google considered it as a framework that has the best of both worlds and it seems to be a trend in software architecture. This paper aims to understand the defin…

    Submitted 22 January, 2024; originally announced January 2024.

  15. arXiv:2401.07041 [pdf, other]

    eess.IV cs.CV

    An automated framework for brain vessel centerline extraction from CTA images

    Authors: Sijie Liu, Ruisheng Su, Jianghang Su, Jingmin Xin, Jiayi Wu, Wim van Zwam, Pieter Jan van Doormaal, Aad van der Lugt, Wiro J. Niessen, Nanning Zheng, Theo van Walsum

    Abstract: Accurate automated extraction of brain vessel centerlines from CTA images plays an important role in diagnosis and therapy of cerebrovascular diseases, such as stroke. However, this task remains challenging due to the complex cerebrovascular structure, the varying imaging quality, and vessel pathology effects. In this paper, we consider automatic lumen segmentation generation without additional an…

    Submitted 13 January, 2024; originally announced January 2024.

  16. arXiv:2401.04570 [pdf, other]

    eess.IV cs.CV

    An Automatic Cascaded Model for Hemorrhagic Stroke Segmentation and Hemorrhagic Volume Estimation

    Authors: Weijin Xu, Zhuang Sha, Huihua Yang, Rongcai Jiang, Zhanying Li, Wentao Liu, Ruisheng Su

    Abstract: Hemorrhagic Stroke (HS) has a rapid onset and is a serious condition that poses a great health threat. Promptly and accurately delineating the bleeding region and estimating the volume of bleeding in Computer Tomography (CT) images can assist clinicians in treatment planning, leading to improved treatment outcomes for patients. In this paper, a cascaded 3D model is constructed based on UNet to per…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted by SWITCH2023: Stroke Workshop on Imaging and Treatment CHallenges, a workshop at MICCAI 2023

  17. arXiv:2311.06345 [pdf, other]

    cs.CL

    Schema Graph-Guided Prompt for Multi-Domain Dialogue State Tracking

    Authors: Ruolin Su, Ting-Wei Wu, Biing-Hwang Juang

    Abstract: Tracking dialogue states is an essential topic in task-oriented dialogue systems, which involve filling in the necessary information in pre-defined slots corresponding to a schema. While general pre-trained language models have been shown effective in slot-filling, their performance is limited when applied to specific domains. We propose a graph-based framework that learns domain-specific prompts…

    Submitted 10 November, 2023; originally announced November 2023.

  18. AngioMoCo: Learning-based Motion Correction in Cerebral Digital Subtraction Angiography

    Authors: Ruisheng Su, Matthijs van der Sluijs, Sandra Cornelissen, Wim van Zwam, Aad van der Lugt, Wiro Niessen, Danny Ruijters, Theo van Walsum, Adrian Dalca

    Abstract: Cerebral X-ray digital subtraction angiography (DSA) is the standard imaging technique for visualizing blood flow and guiding endovascular treatments. The quality of DSA is often negatively impacted by body motion during acquisition, leading to decreased diagnostic value. Time-consuming iterative methods address motion correction based on non-rigid registration, and employ sparse key points and no…

    Submitted 9 October, 2023; originally announced October 2023.

  19. arXiv:2308.15281 [pdf, ps, other]

    cs.SE

    Back to the Future: From Microservice to Monolith

    Authors: Ruoyu Su, Xiaozhou Li, Davide Taibi

    Abstract: Recently the trend of companies switching from microservice back to monolith has increased, leading to intense debate in the industry. We conduct a multivocal literature review, to investigate reasons for the phenomenon and key aspects to pay attention to during the switching back and analyze the opinions of other practitioners. The results pave the way for further research and provide guidance fo…

    Submitted 29 August, 2023; originally announced August 2023.

  20. arXiv:2307.12519 [pdf, other]

    cs.LG

    DEPHN: Different Expression Parallel Heterogeneous Network using virtual gradient optimization for Multi-task Learning

    Authors: Menglin Kong, Ri Su, Shaojie Zhao, Muzhou Hou

    Abstract: Recommendation system algorithm based on multi-task learning (MTL) is the major method for Internet operators to understand users and predict their behaviors in the multi-behavior scenario of platform. Task correlation is an important consideration of MTL goals, traditional models use shared-bottom models and gating experts to realize shared representation learning and information differentiation.…

    Submitted 24 July, 2023; originally announced July 2023.

  21. arXiv:2307.12518 [pdf, other]

    cs.LG cs.AI cs.IR

    FaFCNN: A General Disease Classification Framework Based on Feature Fusion Neural Networks

    Authors: Menglin Kong, Shaojie Zhao, Juan Cheng, Xingquan Li, Ri Su, Muzhou Hou, Cong Cao

    Abstract: There are two fundamental problems in applying deep learning/machine learning methods to disease classification tasks, one is the insufficient number and poor quality of training samples; another one is how to effectively fuse multiple source features and thus train robust classification models. To address these problems, inspired by the process of human learning knowledge, we propose the Feature-…

    Submitted 24 July, 2023; originally announced July 2023.

  22. arXiv:2307.02935 [pdf, other]

    cs.CV

    DisAsymNet: Disentanglement of Asymmetrical Abnormality on Bilateral Mammograms using Self-adversarial Learning

    Authors: Xin Wang, Tao Tan, Yuan Gao, Luyi Han, Tianyu Zhang, Chunyao Lu, Regina Beets-Tan, Ruisheng Su, Ritse Mann

    Abstract: Asymmetry is a crucial characteristic of bilateral mammograms (Bi-MG) when abnormalities are developing. It is widely utilized by radiologists for diagnosis. The question of 'what the symmetrical Bi-MG would look like when the asymmetrical abnormalities have been removed?' has not yet received strong attention in the development of algorithms on mammograms. Addressing this question could provide…

    Submitted 6 July, 2023; originally announced July 2023.

  23. arXiv:2306.12153 [pdf, other]

    eess.IV cs.CV

    DIAS: A Dataset and Benchmark for Intracranial Artery Segmentation in DSA sequences

    Authors: Wentao Liu, Tong Tian, Lemeng Wang, Weijin Xu, Lei Li, Haoyuan Li, Wenyi Zhao, Siyu Tian, Xipeng Pan, Huihua Yang, Feng Gao, Yiming Deng, Xin Yang, Ruisheng Su

    Abstract: The automated segmentation of Intracranial Arteries (IA) in Digital Subtraction Angiography (DSA) plays a crucial role in the quantification of vascular morphology, significantly contributing to computer-assisted stroke research and clinical practice. Current research primarily focuses on the segmentation of single-frame DSA using proprietary datasets. However, these methods face challenges due to…

    Submitted 13 June, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

  24. arXiv:2305.12058 [pdf, other]

    cs.IR cs.AI cs.LG

    DADIN: Domain Adversarial Deep Interest Network for Cross Domain Recommender Systems

    Authors: Menglin Kong, Muzhou Hou, Shaojie Zhao, Feng Liu, Ri Su, Yinghao Chen

    Abstract: Click-Through Rate (CTR) prediction is one of the main tasks of the recommendation system, which is conducted by a user for different items to give the recommendation results. Cross-domain CTR prediction models have been proposed to overcome problems of data sparsity, long tail distribution of user-item interactions, and cold start of items or users. In order to make knowledge transfer from source…

    Submitted 19 May, 2023; originally announced May 2023.

  25. arXiv:2304.02948 [pdf, other]

    cs.AI cs.LG physics.ao-ph

    FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead

    Authors: Kang Chen, Tao Han, Junchao Gong, Lei Bai, Fenghua Ling, Jing-Jia Luo, Xi Chen, Leiming Ma, Tianning Zhang, Rui Su, Yuanzheng Ci, Bin Li, Xiaokang Yang, Wanli Ouyang

    Abstract: We present FengWu, an advanced data-driven global medium-range weather forecast system based on Artificial Intelligence (AI). Different from existing data-driven weather forecast methods, FengWu solves the medium-range forecast problem from a multi-modal and multi-task perspective. Specifically, a deep learning architecture equipped with model-specific encoder-decoders and cross-modal fusion Trans…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 12 pages

  26. arXiv:2303.01091 [pdf, other]

    cs.CV

    OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution

    Authors: Gaochao Song, Luo Zhang, Ran Su, Jianfeng Shi, Ying He, Qian Sun

    Abstract: Implicit neural representation (INR) is a popular approach for arbitrary-scale image super-resolution (SR), as a key component of INR, position encoding improves its representation ability. Motivated by position encoding, we propose orthogonal position encoding (OPE) - an extension of position encoding - and an OPE-Upscale module to replace the INR-based upsampling module for arbitrary-scale image…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023. 11 pages

  27. arXiv:2302.13201 [pdf, other]

    cs.CL

    CLICKER: Attention-Based Cross-Lingual Commonsense Knowledge Transfer

    Authors: Ruolin Su, Zhongkai Sun, Sixing Lu, Chengyuan Ma, Chenlei Guo

    Abstract: Recent advances in cross-lingual commonsense reasoning (CSR) are facilitated by the development of multilingual pre-trained models (mPTMs). While mPTMs show the potential to encode commonsense knowledge for different languages, transferring commonsense knowledge learned in large-scale English corpus to other languages is challenging. To address this problem, we propose the attention-based Cross-LI…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  28. arXiv:2302.13013 [pdf, other]

    cs.CL

    Choice Fusion as Knowledge for Zero-Shot Dialogue State Tracking

    Authors: Ruolin Su, Jingfeng Yang, Ting-Wei Wu, Biing-Hwang Juang

    Abstract: With the demanding need for deploying dialogue systems in new domains with less cost, zero-shot dialogue state tracking (DST), which tracks user's requirements in task-oriented dialogues without training on desired domains, draws attention increasingly. Although prior works have leveraged question-answering (QA) data to reduce the need for in-domain training in DST, they fail to explicitly model k…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  29. Research on data integration of overseas discrete archives from the perspective of digital humanities

    Authors: Rina Su, Yumeng Li, Xin Yang, Xin Yin, Tao Chen

    Abstract: The digitization of displaced archives is of great historical and cultural significance. Through the construction of digital humanistic platforms represented by MISS platform, and the comprehensive application of IIIF technology, knowledge graph technology, ontology technology, and other popular information technologies. We can find that the digital framework of displaced archives built through th…

    Submitted 9 February, 2023; originally announced February 2023.

    Journal ref: International Journal of Web & Semantic Technology, 2023, Vol. 14, No. 1

  30. arXiv:2212.01575 [pdf]

    cs.LG q-bio.BM

    Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery

    Authors: Chao Pang, Yu Wang, Yi Jiang, Ruheng Wang, Ran Su, Leyi Wei

    Abstract: In this work, we propose MEDICO, a Multi-viEw Deep generative model for molecule generation, structural optimization, and the SARS-CoV-2 Inhibitor disCOvery. To the best of our knowledge, MEDICO is the first-of-this-kind graph generative model that can generate molecular graphs similar to the structure of targeted molecules, with a multi-view representation learning framework to sufficiently and a…

    Submitted 3 December, 2022; originally announced December 2022.

  31. Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization

    Authors: Weiqi Sun, Rui Su, Qian Yu, Dong Xu

    Abstract: Weakly supervised temporal action localization (WTAL) aims to localize actions in untrimmed videos with only weak supervision information (e.g. video-level labels). Most existing models handle all input videos with a fixed temporal scale. However, such models are not sensitive to actions whose pace of the movements is different from the "normal" speed, especially slow-motion action instances, whi…

    Submitted 21 November, 2022; originally announced November 2022.

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology, 2022

  32. arXiv:2211.09375 [pdf, other]

    cs.CV

    3D-QueryIS: A Query-based Framework for 3D Instance Segmentation

    Authors: Jiaheng Liu, Tong He, Honghui Yang, Rui Su, Jiayi Tian, Junran Wu, Hongcheng Guo, Ke Xu, Wanli Ouyang

    Abstract: Previous top-performing methods for 3D instance segmentation often maintain inter-task dependencies and the tendency towards a lack of robustness. Besides, inevitable variations of different datasets make these methods become particularly sensitive to hyper-parameter values and manifest poor generalization capability. In this paper, we address the aforementioned challenges by proposing a novel que…

    Submitted 17 November, 2022; originally announced November 2022.

  33. arXiv:2211.05100 [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  34. arXiv:2210.12385 [pdf, other]

    q-bio.QM cs.AI

    Deep Learning in Single-Cell Analysis

    Authors: Dylan Molho, Jiayuan Ding, Zhaoheng Li, Hongzhi Wen, Wenzhuo Tang, Yixin Wang, Julian Venegas, Wei Jin, Renming Liu, Runze Su, Patrick Danaher, Robert Yang, Yu Leo Lei, Yuying Xie, Jiliang Tang

    Abstract: Single-cell technologies are revolutionizing the entire field of biology. The large volumes of data generated by single-cell technologies are high-dimensional, sparse, heterogeneous, and have complicated dependency structures, making analyses using conventional machine learning approaches challenging and impractical. In tackling these challenges, deep learning often demonstrates superior performan…

    Submitted 5 November, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: 77 pages, 11 figures, 15 tables, deep learning, single-cell analysis

  35. arXiv:2210.05258 [pdf, other]

    eess.IV cs.CV cs.LG

    EOCSA: Predicting Prognosis of Epithelial Ovarian Cancer with Whole Slide Histopathological Images

    Authors: Tianling Liu, Ran Su, Changming Sun, Xiuting Li, Leyi Wei

    Abstract: Ovarian cancer is one of the most serious cancers that threaten women around the world. Epithelial ovarian cancer (EOC), as the most commonly seen subtype of ovarian cancer, has rather high mortality rate and poor prognosis among various gynecological cancers. Survival analysis outcome is able to provide treatment advices to doctors. In recent years, with the development of medical imaging technol…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Published in Expert Systems with Applications 2022

  36. arXiv:2210.01799 [pdf, other]

    cs.LG cs.AI

    STGIN: A Spatial Temporal Graph-Informer Network for Long Sequence Traffic Speed Forecasting

    Authors: Ruikang Luo, Yaofeng Song, Liping Huang, Yicheng Zhang, Rong Su

    Abstract: Accurate long series forecasting of traffic information is critical for the development of intelligent traffic systems. We may benefit from the rapid growth of neural network analysis technology to better understand the underlying functioning patterns of traffic networks as a result of this progress. Due to the fact that traffic data and facility utilization circumstances are sequentially dependen…

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: 12 pages, 18 figures and 2 tables

  37. arXiv:2210.00674 [pdf]

    cs.LG q-bio.GN q-bio.QM

    Multi-view information fusion using multi-view variational autoencoders to predict proximal femoral strength

    Authors: Chen Zhao, Joyce H Keyak, Xuewei Cao, Qiuying Sha, Li Wu, Zhe Luo, Lanjuan Zhao, Qing Tian, Chuan Qiu, Ray Su, Hui Shen, Hong-Wen Deng, Weihua Zhou

    Abstract: The aim of this paper is to design a deep learning-based model to predict proximal femoral strength using multi-view information fusion. Method: We developed new models using multi-view variational autoencoder (MVAE) for feature representation learning and a product of expert (PoE) model for multi-view information fusion. We applied the proposed models to an in-house Louisiana Osteoporosis Study (…

    Submitted 27 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 16 pages, 3 figures

  38. arXiv:2209.13500 [pdf, other]

    cs.CV cs.AI

    Dense-TNT: Efficient Vehicle Type Classification Neural Network Using Satellite Imagery

    Authors: Ruikang Luo, Yaofeng Song, Han Zhao, Yicheng Zhang, Yi Zhang, Nanbin Zhao, Liping Huang, Rong Su

    Abstract: Accurate vehicle type classification serves a significant role in the intelligent transportation system. It is critical for ruler to understand the road conditions and usually contributive for the traffic light control system to response correspondingly to alleviate traffic congestion. New technologies and comprehensive data sources, such as aerial photos and remote sensing data, provide richer an…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 10 pages, 8 figures, 5 tables

  39. arXiv:2209.11318 [pdf, other]

    cs.RO

    OpenPneu: Compact platform for pneumatic actuation with multi-channels

    Authors: Yingjun Tian, Renbo Su, Xilong Wang, Nur Banu Altin, Guoxin Fang, Charlie C. L. Wang

    Abstract: This paper presents a compact system, OpenPneu, to support the pneumatic actuation for multi-chambers on soft robots. Micro-pumps are employed in the system to generate airflow and therefore no extra input as compressed air is needed. Our system conducts modular design to provide good scalability, which has been demonstrated on a prototype with ten air channels. Each air channel of OpenPneu is equ…

    Submitted 22 September, 2022; originally announced September 2022.

  40. arXiv:2209.03356 [pdf, other]

    cs.LG eess.SY

    AST-GIN: Attribute-Augmented Spatial-Temporal Graph Informer Network for Electric Vehicle Charging Station Availability Forecasting

    Authors: Ruikang Luo, Yaofeng Song, Liping Huang, Yicheng Zhang, Rong Su

    Abstract: Electric Vehicle (EV) charging demand and charging station availability forecasting is one of the challenges in the intelligent transportation system. With the accurate EV station situation prediction, suitable charging behaviors could be scheduled in advance to relieve range anxiety. Many existing deep learning methods are proposed to address this issue, however, due to the complex road network s…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 10 pages; 17 figures; Under review for IEEE Transactions on Vehicular Technology

  41. arXiv:2208.07167 [pdf, other]

    cs.CV cs.AI

    Where is VALDO? VAscular Lesions Detection and segmentatiOn challenge at MICCAI 2021

    Authors: Carole H. Sudre, Kimberlin Van Wijnen, Florian Dubost, Hieab Adams, David Atkinson, Frederik Barkhof, Mahlet A. Birhanu, Esther E. Bron, Robin Camarasa, Nish Chaturvedi, Yuan Chen, Zihao Chen, Shuai Chen, Qi Dou, Tavia Evans, Ivan Ezhov, Haojun Gao, Marta Girones Sanguesa, Juan Domingo Gispert, Beatriz Gomez Anson, Alun D. Hughes, M. Arfan Ikram, Silvia Ingala, H. Rolf Jaeger, Florian Kofler, et al. (24 additional authors not shown)

    Abstract: Imaging markers of cerebral small vessel disease provide valuable information on brain health, but their manual assessment is time-consuming and hampered by substantial intra- and interrater variability. Automated rating may benefit biomedical research, as well as clinical assessment, but diagnostic reliability of existing algorithms is unknown. Here, we present the results of the VAscular…

    Submitted 15 August, 2022; originally announced August 2022.

  42. Act-Aware Slot-Value Predicting in Multi-Domain Dialogue State Tracking

    Authors: Ruolin Su, Ting-Wei Wu, Biing-Hwang Juang

    Abstract: As an essential component in task-oriented dialogue systems, dialogue state tracking (DST) aims to track human-machine interactions and generate state representations for managing the dialogue. Representations of dialogue states are dependent on the domain ontology and the user's goals. In several task-oriented dialogues with a limited scope of objectives, dialogue states can be represented as a s…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Published in Spoken Dialogue Systems I, Interspeech 2021. Code is now publicly available on GitHub: https://github.com/youlandasu/ACT-AWARE-DST

    Journal ref: Proc. Interspeech 2021, 236-240 (2021)

  43. arXiv:2207.10388  [pdf, other]

    cs.CV

    NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition

    Authors: Boyang Xia, Wenhao Wu, Haoran Wang, Rui Su, Dongliang He, Haosen Yang, Xiaoran Fan, Wanli Ouyang

    Abstract: It is challenging for artificial intelligence systems to achieve accurate video recognition under the scenario of low computation costs. Adaptive inference based efficient video recognition methods typically preview videos and focus on salient parts to reduce computation costs. Most existing works focus on complex networks learning with video classification based objectives. Taking all frames as p…

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022

  44. Optimizing out-of-plane stiffness for soft grippers

    Authors: Renbo Su, Yingjun Tian, Mingwei Du, Charlie C. L. Wang

    Abstract: In this paper, we presented a data-driven framework to optimize the out-of-plane stiffness for soft grippers to achieve mechanical properties as hard-to-twist and easy-to-bend. The effectiveness of this method is demonstrated in the design of a soft pneumatic bending actuator (SPBA). First, a new objective function is defined to quantitatively evaluate the out-of-plane stiffness as well as the ben…

    Submitted 29 July, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

  45. arXiv:2206.15076  [pdf, other]

    cs.CL

    BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

    Authors: Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Samuele Garda, Myungsun Kang, Ruisi Su, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Matthias Samwald, Stephen Bach, Stella Biderman, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Periñán, Théo Gigant, Patrick Haller, Jenny Chim, Jose David Posada, John Michael Giorgi, Karthik Rangasai Sivaraman, et al. (18 additional authors not shown)

    Abstract: Training and evaluating language models increasingly requires the construction of meta-datasets, diverse collections of curated data with clear provenance. Natural language prompting has recently led to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful i…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Submitted to NeurIPS 2022 Datasets and Benchmarks Track

  46. arXiv:2206.14541  [pdf, other]

    cs.LG cs.CV

    Why patient data cannot be easily forgotten?

    Authors: Ruolin Su, Xiao Liu, Sotirios A. Tsaftaris

    Abstract: Rights provisioned within data protection regulations permit patients to request that knowledge about their information be eliminated by data holders. With the advent of AI learned on data, one can imagine that such rights can extend to requests for forgetting knowledge of patients' data within AI models. However, forgetting patients' imaging data from AI models is still an under-explored proble…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Ruolin Su and Xiao Liu contributed equally. Accepted by MICCAI 2022

  47. arXiv:2206.09184  [pdf, other]

    cs.LG cs.IR cs.NI

    PHN: Parallel heterogeneous network with soft gating for CTR prediction

    Authors: Ri Su, Alphonse Houssou Hounye, Cong Cao, Muzhou Hou

    Abstract: The Click-through Rate (CTR) prediction task is a basic task in recommendation systems. Most previous CTR models were built on the Wide & Deep structure and gradually evolved into parallel structures with different modules. However, the simple accumulation of parallel structures can lead to higher structural complexity and longer training time. Based on the Sigmoid activation func…

    Submitted 18 June, 2022; originally announced June 2022.

  48. arXiv:2204.08185  [pdf, ps, other]

    cs.IT

    Completion Delay of Random Linear Network Coding in Full-Duplex Relay Networks

    Authors: Rina Su, Qifu Tyler Sun, Zhongshan Zhang, Zongpeng Li

    Abstract: As the next-generation wireless networks thrive, full-duplex and relay techniques are combined to improve the network performance. Random linear network coding (RLNC) is another popular technique to enhance the efficiency and reliability of wireless communications. In this paper, in order to explore the potential of RLNC in full-duplex relay networks, we investigate two fundamental perfect RLNC sc…

    Submitted 3 November, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

  49. arXiv:2204.07579  [pdf, other]

    cs.LG cs.AI

    Interpretable Fault Diagnosis of Rolling Element Bearings with Temporal Logic Neural Network

    Authors: Gang Chen, Yu Lu, Rong Su, Zhaodan Kong

    Abstract: Machine learning-based methods have achieved successful applications in machinery fault diagnosis. However, the main limitation that exists for these methods is that they operate as a black box and are generally not interpretable. This paper proposes a novel neural network structure, called temporal logic neural network (TLNN), in which the neurons of the network are logic propositions. More impor…

    Submitted 19 April, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

  50. arXiv:2204.04090  [pdf, other]

    cs.LG

    Single-level Adversarial Data Synthesis based on Neural Tangent Kernels

    Authors: Yu-Rong Zhang, Ruei-Yang Su, Sheng Yen Chou, Shan-Hung Wu

    Abstract: Generative adversarial networks (GANs) have achieved impressive performance in data synthesis and have driven the development of many applications. However, GANs are known to be hard to train due to their bilevel objective, which leads to the problems of convergence, mode collapse, and gradient vanishing. In this paper, we propose a new generative model called the generative adversarial N…

    Submitted 20 November, 2022; v1 submitted 8 April, 2022; originally announced April 2022.