Showing 1–50 of 113 results for author: Hu, F

Searching in archive cs.
  1. arXiv:2406.04316  [pdf, other]

    cs.CV

    Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking

    Authors: Jiyao Zhang, Weiyao Huang, Bo Peng, Mingdong Wu, Fei Hu, Zijian Chen, Bo Zhao, Hao Dong

    Abstract: 6D Object Pose Estimation is a crucial yet challenging task in computer vision, suffering from a significant lack of large-scale datasets. This scarcity impedes comprehensive evaluation of model performance, limiting research advancements. Furthermore, the restricted number of available instances or categories curtails its applications. To address these issues, this paper introduces Omni6DPose, a…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2406.02609  [pdf, other]

    cs.LG cs.AI

    Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation

    Authors: Jiayao Tan, Fan Lyu, Chenggong Ni, Tingliang Feng, Fuyuan Hu, Zhang Zhang, Shaochuang Zhao, Liang Wang

    Abstract: Continual Test-Time Adaptation (CTTA) aims to adapt a pre-trained model to a sequence of target domains during the test phase without accessing the source data. To adapt to unlabeled data from unknown domains, existing methods rely on constructing pseudo-labels for all samples and updating the model through self-training. However, these pseudo-labels often involve noise, leading to insufficient ad…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.03335 by other authors

  3. arXiv:2405.14602  [pdf, other]

    cs.LG

    Controllable Continual Test-Time Adaptation

    Authors: Ziqi Shi, Fan Lyu, Ye Liu, Fanhua Shang, Fuyuan Hu, Wei Feng, Zhang Zhang, Liang Wang

    Abstract: Continual Test-Time Adaptation (CTTA) is an emerging and challenging task where a model trained in a source domain must adapt to continuously changing conditions during testing, without access to the original source data. CTTA is prone to error accumulation due to uncontrollable domain shifts, leading to blurred decision boundaries between categories. Existing CTTA methods primarily focus on suppr…

    Submitted 28 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.13097  [pdf, other]

    cs.CV

    NieR: Normal-Based Lighting Scene Rendering

    Authors: Hongsheng Wang, Yang Wang, Yalan Liu, Fayuan Hu, Shengyu Zhang, Fei Wu, Feng Lin

    Abstract: In real-world road scenes, diverse material properties lead to complex light reflection phenomena, making accurate color reproduction crucial for enhancing the realism and safety of simulated driving environments. However, existing methods often struggle to capture the full spectrum of lighting effects, particularly in dynamic scenarios where viewpoint changes induce significant material color var…

    Submitted 21 May, 2024; originally announced May 2024.

  5. arXiv:2405.12961  [pdf, other]

    cs.LG cs.AI physics.chem-ph q-bio.QM

    Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale

    Authors: Shriram Chennakesavalu, Frank Hu, Sebastian Ibarraran, Grant M. Rotskoff

    Abstract: Searching through chemical space is an exceptionally challenging problem because the number of possible molecules grows combinatorially with the number of atoms. Large, autoregressive models trained on databases of chemical compounds have yielded powerful generators, but we still lack robust strategies for generating molecules with desired properties. This molecular search problem closely resemble…

    Submitted 21 May, 2024; originally announced May 2024.

  6. arXiv:2405.09133  [pdf, other]

    cs.LG

    Overcoming Domain Drift in Online Continual Learning

    Authors: Fan Lyu, Daofeng Liu, Linglan Zhao, Zhang Zhang, Fanhua Shang, Fuyuan Hu, Wei Feng, Liang Wang

    Abstract: Online Continual Learning (OCL) empowers machine learning models to acquire new knowledge online across a sequence of tasks. However, OCL faces a significant challenge: catastrophic forgetting, wherein the model learned in previous tasks is substantially overwritten upon encountering new tasks, leading to a biased forgetting of prior knowledge. Moreover, the continual domain drift in sequential lea…

    Submitted 15 May, 2024; originally announced May 2024.

  7. arXiv:2405.08298  [pdf, other]

    cs.LG

    Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments

    Authors: Ke Liu, Fan Hu, Hui Lin, Xi Cheng, Jianan Chen, Jilin Song, Siyuan Feng, Gaofeng Su, Chen Zhu

    Abstract: This paper explores the optimization of Ground Delay Programs (GDP), a prevalent Traffic Management Initiative used in Air Traffic Management (ATM) to reconcile capacity and demand discrepancies at airports. Employing Reinforcement Learning (RL) to manage the inherent uncertainties in the national airspace system-such as weather variability, fluctuating flight demands, and airport arrival rates-we…

    Submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2405.07488  [pdf, other]

    cs.LG cs.RO cs.SC

    Predictive Modeling of Flexible EHD Pumps using Kolmogorov-Arnold Networks

    Authors: Yanhong Peng, Miao He, Fangchao Hu, Zebing Mao, Xia Huang, Jun Ding

    Abstract: We present a novel approach to predicting the pressure and flow rate of flexible electrohydrodynamic pumps using the Kolmogorov-Arnold Network. Inspired by the Kolmogorov-Arnold representation theorem, KAN replaces fixed activation functions with learnable spline-based activation functions, enabling it to approximate complex nonlinear functions more effectively than traditional models like Multi-L…

    Submitted 13 May, 2024; originally announced May 2024.

  9. arXiv:2402.12747  [pdf, other]

    cs.NI

    Enhanced Physical Layer Security for Full-duplex Symbiotic Radio with AN Generation and Forward Noise Suppression

    Authors: Chi Jin, Zheng Chang, Fengye Hu, Hsiao-Hwa Chen, Timo Hamalainen

    Abstract: Due to the constraints on power supply and limited encryption capability, data security based on physical layer security (PLS) techniques in backscatter communications has attracted a lot of attention. In this work, we propose to enhance PLS in a full-duplex symbiotic radio (FDSR) system with a proactive eavesdropper, which may overhear the information and interfere with legitimate communications simul…

    Submitted 20 February, 2024; originally announced February 2024.

  10. arXiv:2402.07790  [pdf, other]

    cs.LG

    From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration

    Authors: Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

    Abstract: The assessment of binary classifier performance traditionally centers on discriminative ability using metrics, such as accuracy. However, these metrics often disregard the model's inherent uncertainty, especially when dealing with sensitive decision-making domains, such as finance or healthcare. Given that model-predicted scores are commonly seen as event probabilities, calibration is crucial for…

    Submitted 12 February, 2024; originally announced February 2024.

  11. arXiv:2402.07233  [pdf, other]

    cs.CL cs.AI

    TransGPT: Multi-modal Generative Pre-trained Transformer for Transportation

    Authors: Peng Wang, Xiang Wei, Fangxu Hu, Wenjuan Han

    Abstract: Natural language processing (NLP) is a key component of intelligent transportation systems (ITS), but it faces many challenges in the transportation domain, such as domain-specific knowledge and data, and multi-modal inputs and outputs. This paper presents TransGPT, a novel (multi-modal) large language model for the transportation domain, which consists of two independent variants: TransGPT-SM for…

    Submitted 11 February, 2024; originally announced February 2024.

  12. arXiv:2401.16197  [pdf, other]

    cs.LG cs.CY

    Geospatial Disparities: A Case Study on Real Estate Prices in Paris

    Authors: Agathe Fernandes Machado, François Hu, Philipp Ratz, Ewen Gallic, Arthur Charpentier

    Abstract: Driven by an increasing prevalence of trackers, ever more IoT sensors, and the declining cost of computing power, geospatial information has come to play a pivotal role in contemporary predictive models. While enhancing prognostic performance, geospatial data also has the potential to perpetuate many historical socio-economic patterns, raising concerns about a resurgence of biases and exclusionary…

    Submitted 29 January, 2024; originally announced January 2024.

  13. arXiv:2311.17041  [pdf, other]

    cs.CV cs.AI cs.CL

    Efficient In-Context Learning in Vision-Language Models for Egocentric Videos

    Authors: Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Joyce Chai

    Abstract: Recent advancements in text-only large language models (LLMs) have highlighted the benefit of in-context learning for adapting to new tasks with a few demonstrations. However, extending in-context learning to large vision-language models (VLMs) using a huge amount of naturalistic vision-language data has shown limited success, particularly for egocentric videos, due to high data collection costs.…

    Submitted 29 November, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 10 pages, LaTeX; added acknowledgments

  14. arXiv:2311.16514  [pdf, other]

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real…

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  15. arXiv:2310.20508  [pdf, other]

    stat.ML cs.CY cs.LG

    Parametric Fairness with Statistical Guarantees

    Authors: François HU, Philipp Ratz, Arthur Charpentier

    Abstract: Algorithmic fairness has gained prominence due to societal and regulatory concerns about biases in Machine Learning models. Common group fairness metrics like Equalized Odds for classification or Demographic Parity for both classification and regression are widely used and a host of computationally advantageous post-processing methods have been developed around them. However, these metrics often l…

    Submitted 31 October, 2023; originally announced October 2023.

  16. arXiv:2310.20268  [pdf, other]

    cs.CV cs.AI

    Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning

    Authors: Fuyuan Hu, Jian Zhang, Fan Lyu, Linyan Li, Fenglei Xu

    Abstract: Few-shot class-incremental learning (FSCIL) aims to build machine learning models that can continually learn new concepts from a few data samples, without forgetting knowledge of old classes. The challenge of FSCIL lies in the limited data of new classes, which not only leads to significant overfitting issues but also exacerbates the notorious catastrophic forgetting problems. As proved in early…

    Submitted 31 October, 2023; originally announced October 2023.

  17. arXiv:2310.19113  [pdf, other]

    cs.CV cs.AI eess.SP

    Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision

    Authors: Jiayao Tan, Fan Lyu, Linyan Li, Fuyuan Hu, Tingliang Feng, Fenglei Xu, Rui Yao

    Abstract: Vehicle-to-everything (V2X) perception is an innovative technology that enhances vehicle perception accuracy, thereby elevating the security and reliability of autonomous systems. However, existing V2X perception methods focus on static scenes from mainly vehicle-based vision, which is constrained by sensor capabilities and communication loads. To adapt V2X perception models to dynamic scenes, we…

    Submitted 29 October, 2023; originally announced October 2023.

  18. arXiv:2310.18364  [pdf, other]

    cs.CL cs.AI

    From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning

    Authors: Zheyuan Zhang, Shane Storks, Fengyuan Hu, Sungryull Sohn, Moontae Lee, Honglak Lee, Joyce Chai

    Abstract: Pre-trained language models (PLMs) have shown impressive performance in various language tasks. However, they are prone to spurious correlations, and often generate illusory information. In real-world applications, PLMs should justify decisions with formalized, coherent reasoning chains, but this challenge remains under-explored. Cognitive psychology theorizes that humans are capable of utilizing…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference

  19. arXiv:2310.16162  [pdf, other]

    cs.LG

    Brainchop: Next Generation Web-Based Neuroimaging Application

    Authors: Mohamed Masoud, Pratyush Reddy, Farfalla Hu, Sergey Plis

    Abstract: Performing volumetric image processing directly within the browser, particularly with medical data, presents unprecedented challenges compared to conventional backend tools. These challenges arise from limitations inherent in browser environments, such as constrained computational resources and the availability of frontend machine learning libraries. Consequently, there is a shortage of neuroimagi…

    Submitted 24 October, 2023; originally announced October 2023.

  20. arXiv:2310.16003  [pdf, other]

    cs.CV

    CVPR 2023 Text Guided Video Editing Competition

    Authors: Jay Zhangjie Wu, Xiuyu Li, Difei Gao, Zhen Dong, Jinbin Bai, Aishani Singh, Xiaoyu Xiang, Youzeng Li, Zuwei Huang, Yuanxi Sun, Rui He, Feng Hu, Junhua Hu, Hai Huang, Hanyu Zhu, Xu Cheng, Jie Tang, Mike Zheng Shou, Kurt Keutzer, Forrest Iandola

    Abstract: Humans watch more than a billion hours of video per day. Most of this video was edited manually, which is a tedious process. However, AI-enabled video-generation and video-editing is on the rise. Building on text-to-image models like Stable Diffusion and Imagen, generative AI has improved dramatically on video tasks. But it's hard to evaluate progress in these video tasks because there is no stand…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Project page: https://sites.google.com/view/loveucvpr23/track4

  21. arXiv:2310.03121  [pdf]

    physics.chem-ph cs.LG

    OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials

    Authors: Peter Eastman, Raimondas Galvelis, Raúl P. Peláez, Charlles R. A. Abreu, Stephen E. Farr, Emilio Gallicchio, Anton Gorenko, Michael M. Henry, Frank Hu, Jing Huang, Andreas Krämer, Julien Michel, Joshua A. Mitchell, Vijay S. Pande, João PGLM Rodrigues, Jaime Rodriguez-Guerra, Andrew C. Simmonett, Sukrit Singh, Jason Swails, Philip Turner, Yuanqing Wang, Ivy Zhang, John D. Chodera, Gianni De Fabritiis, Thomas E. Markland

    Abstract: Machine learning plays an important and growing role in molecular simulation. The newest version of the OpenMM molecular dynamics toolkit introduces new features to support the use of machine learning potentials. Arbitrary PyTorch models can be added to a simulation and used to compute forces and energy. A higher-level interface allows users to easily model their molecules of interest with general…

    Submitted 29 November, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 16 pages, 5 figures

    ACM Class: J.2; J.3

  22. arXiv:2309.06627  [pdf, other]

    stat.ML cs.CY cs.LG

    A Sequentially Fair Mechanism for Multiple Sensitive Attributes

    Authors: François Hu, Philipp Ratz, Arthur Charpentier

    Abstract: In the standard use case of Algorithmic Fairness, the goal is to eliminate the relationship between a sensitive variable and a corresponding score. Throughout recent years, the scientific community has developed a host of definitions and tools to solve this task, which work well in many practical applications. However, the applicability and effectiveness of these tools and definitions become less s…

    Submitted 14 January, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  23. Vision-Based Human Pose Estimation via Deep Learning: A Survey

    Authors: Gongjin Lan, Yu Wu, Fei Hu, Qi Hao

    Abstract: Human pose estimation (HPE) has attracted a significant amount of attention from the computer vision community in the past decades. Moreover, HPE has been applied to various domains, such as human-computer interaction, sports analysis, and human tracking via images and videos. Recently, deep learning-based approaches have shown state-of-the-art performance in HPE-based applications. Although deep…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 16 pages, 4 figures

  24. arXiv:2308.11090  [pdf, other]

    cs.CV cs.LG stat.AP

    Fairness Explainability using Optimal Transport with Applications in Image Classification

    Authors: Philipp Ratz, François Hu, Arthur Charpentier

    Abstract: Ensuring trust and accountability in Artificial Intelligence systems demands explainability of its outcomes. Despite significant progress in Explainable AI, human biases still taint a substantial portion of its training data, raising concerns about unfairness or discriminatory tendencies. Current approaches in the field of Algorithmic Fairness focus on mitigating such biases in the outcomes of a m…

    Submitted 31 October, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  25. arXiv:2308.06806  [pdf, other]

    cs.DC

    A Dynamic Distributed Scheduler for Computing on the Edge

    Authors: Fei Hu, Kunal Mehta, Shivakant Mishra, Mohammad AlMutawa

    Abstract: Edge computing has become a promising computing paradigm for building IoT (Internet of Things) applications, particularly for applications with specific constraints such as latency or privacy requirements. Due to resource constraints at the edge, it is important to efficiently utilize all available computing resources to satisfy these constraints. A key challenge in utilizing these computing resou…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 11 pages, 14 figures

  26. arXiv:2308.05782  [pdf, other]

    eess.IV cs.CV

    Multi-scale Multi-site Renal Microvascular Structures Segmentation for Whole Slide Imaging in Renal Pathology

    Authors: Franklin Hu, Ruining Deng, Shunxing Bao, Haichun Yang, Yuankai Huo

    Abstract: Segmentation of microvascular structures, such as arterioles, venules, and capillaries, from human kidney whole slide images (WSI) has become a focal point in renal pathology. Current manual segmentation techniques are time-consuming and not feasible for large-scale digital pathology images. While deep learning-based methods offer a solution for automatic segmentation, most suffer from a limitatio…

    Submitted 10 August, 2023; originally announced August 2023.

  27. arXiv:2307.09748  [pdf, other]

    cs.CV

    Watch out Venomous Snake Species: A Solution to SnakeCLEF2023

    Authors: Feiran Hu, Peng Wang, Yangyang Li, Chenlong Duan, Zijian Zhu, Fei Wang, Faen Zhang, Yong Li, Xiu-Shen Wei

    Abstract: The SnakeCLEF2023 competition aims at the development of advanced algorithms for snake species identification through the analysis of images and accompanying metadata. This paper presents a method leveraging both images and metadata. Modern CNN models and strong data augmentation are utilized to learn better representations of images. To relieve the challenge of long-tailed distribut…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: This work was the winning solution of the SnakeCLEF2023 challenge

  28. arXiv:2306.15704  [pdf, other]

    cs.CV

    MAE-GEBD:Winning the CVPR'2023 LOVEU-GEBD Challenge

    Authors: Yuanxi Sun, Rui He, Youzeng Li, Zuwei Huang, Feng Hu, Xu Cheng, Jie Tang

    Abstract: The Generic Event Boundary Detection (GEBD) task aims to build a model for segmenting videos into segments by detecting general event boundaries applicable to various classes. In this paper, based on last year's MAE-GEBD method, we have improved our model performance on the GEBD task by adjusting the data processing strategy and loss function. Based on last year's approach, we extended the applica…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Winner of CVPR2023 LOVEU GEBD Challenge

  29. arXiv:2306.12912  [pdf, other]

    stat.ML cs.CY cs.LG

    Mitigating Discrimination in Insurance with Wasserstein Barycenters

    Authors: Arthur Charpentier, François Hu, Philipp Ratz

    Abstract: The insurance industry is heavily reliant on predictions of risks based on characteristics of potential customers. Although the use of said models is common, researchers have long pointed out that such practices perpetuate discrimination based on sensitive features such as gender or race. Given that such discrimination can often be attributed to historical data biases, an elimination or at least m…

    Submitted 22 June, 2023; originally announced June 2023.

  30. Fairness in Multi-Task Learning via Wasserstein Barycenters

    Authors: François Hu, Philipp Ratz, Arthur Charpentier

    Abstract: Algorithmic Fairness is an established field in machine learning that aims to reduce biases in data. Recent advances have proposed various methods to ensure fairness in a univariate environment, where the goal is to de-bias a single task. However, extending fairness to a multi-task setting, where more than one objective is optimised using a shared representation, remains underexplored. To bridge t…

    Submitted 6 July, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  31. arXiv:2303.14334  [pdf, other]

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney , et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan…

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  32. arXiv:2303.13862  [pdf, other]

    cs.CV

    Two-level Graph Network for Few-Shot Class-Incremental Learning

    Authors: Hao Chen, Linyan Li, Fan Lyu, Fuyuan Hu, Fenglei Xu

    Abstract: Few-shot class-incremental learning (FSCIL) aims to design machine learning algorithms that can continually learn new concepts from a few data points, without forgetting knowledge of old classes. The difficulty lies in that limited data from new classes not only leads to significant overfitting issues but also exacerbates the notorious catastrophic forgetting problems. However, existing FSCIL metho…

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2203.06953 by other authors

  33. arXiv:2303.11661  [pdf, other]

    eess.IV cs.CV

    Advanced Multi-Microscopic Views Cell Semi-supervised Segmentation

    Authors: Fang Hu, Xuexue Sun, Ke Qing, Fenxi Xiao, Zhi Wang, Xiaolu Fan

    Abstract: Although deep learning (DL) shows powerful potential in cell segmentation tasks, it suffers from poor generalization as DL-based methods originally simplified cell segmentation in detecting cell membrane boundary, lacking prominent cellular structures to position overall differentiating. Moreover, the scarcity of annotated cell images limits the performance of DL models. Segmentation limitations o…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 23 pages

  34. arXiv:2303.02954  [pdf, other]

    cs.LG cs.CV

    Centroid Distance Distillation for Effective Rehearsal in Continual Learning

    Authors: Daofeng Liu, Fan Lyu, Linyan Li, Fuyuan Hu

    Abstract: Rehearsal, retraining on a stored small data subset of old tasks, has been proven effective in solving catastrophic forgetting in continual learning. However, because the sampled data may have a large bias towards the original dataset, retraining on them is susceptible to driving continual domain drift of old tasks in feature space, resulting in forgetting. In this paper, we focus on tackling the cont…

    Submitted 6 March, 2023; originally announced March 2023.

  35. arXiv:2302.09320  [pdf]

    cs.AI

    Anomaly Detection of UAV State Data Based on Single-class Triangular Global Alignment Kernel Extreme Learning Machine

    Authors: Feisha Hu, Qi Wang, Haijian Shao, Shang Gao, Hualong Yu

    Abstract: Unmanned Aerial Vehicles (UAVs) are widely used and meet many demands in military and civilian fields. With the continuous enrichment and extensive expansion of application scenarios, the safety of UAVs is constantly being challenged. To address this challenge, we propose algorithms to detect anomalous data collected from drones to improve drone safety. We deployed a one-class kernel extreme learn…

    Submitted 18 February, 2023; originally announced February 2023.

  36. arXiv:2302.09293  [pdf, other]

    cs.CY physics.data-an

    Periodicity Intensity Reveals Insights into Time Series Data: Three Use Cases

    Authors: Alan F. Smeaton, Feiyan Hu

    Abstract: Periodic phenomena are oscillating signals found in many naturally-occurring time series. A periodogram can be used to measure the intensities of oscillations at different frequencies over an entire time series but sometimes we are interested in measuring how periodicity intensity at a specific frequency varies throughout the time series. This can be done by calculating periodicity intensity withi…

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 14 pages, 6 figures, a

    Journal ref: Algorithms 2023, 16, 119

  37. arXiv:2301.10140  [pdf, other]

    cs.DL cs.CL

    The Semantic Scholar Open Data Platform

    Authors: Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin , et al. (23 additional authors not shown)

    Abstract: The volume of scientific output is creating an urgent need for automated tools to help scientists keep up with developments in their field. Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by helping scholars discover and understand scientific literature. We combine public and proprietary data sources using state-of-the-art techniques for scholarly PDF conte…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 8 pages, 6 figures

  38. arXiv:2301.04619  [pdf, other]

    cs.CV

    TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation

    Authors: Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness

    Abstract: Video saliency prediction has recently attracted the attention of the research community, as it is an upstream task for several practical applications. However, current solutions are particularly computationally demanding, especially due to the wide usage of spatio-temporal 3D convolutions. We observe that, while different model architectures achieve similar performance on benchmarks, visual variation…

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: WACV2023

  39. arXiv:2301.02885  [pdf, other]

    cs.SI

    SCOREH+: A High-Order Node Proximity Spectral Clustering on Ratios-of-Eigenvectors Algorithm for Community Detection

    Authors: Yanhui Zhu, Fang Hu, Lei Hsin Kuo, Jia liu

    Abstract: The research on complex networks has achieved significant progress in revealing the mesoscopic features of networks. Community detection is an important aspect of understanding real-world complex systems. We present in this paper a High-order node proximity Spectral Clustering on Ratios-of-Eigenvectors (SCOREH+) algorithm for locating communities in complex networks. The algorithm improves SCORE a…

    Submitted 17 December, 2023; v1 submitted 7 January, 2023; originally announced January 2023.

  40. arXiv:2212.00313  [pdf, other]

    cs.CV

    Concealed Object Detection for Passive Millimeter-Wave Security Imaging Based on Task-Aligned Detection Transformer

    Authors: Cheng Guo, Fei Hu, Yan Hu

    Abstract: Passive millimeter-wave (PMMW) is a significant potential technique for human security screening. Several popular object detection networks have been used for PMMW images. However, restricted by the low resolution and high noise of PMMW images, PMMW hidden object detection based on deep learning usually suffers from low accuracy and low classification confidence. To tackle the above problems, this…

    Submitted 7 July, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  41. arXiv:2211.15039  [pdf, other]

    cs.CV cs.MM

    Renmin University of China at TRECVID 2022: Improving Video Search by Feature Fusion and Negation Understanding

    Authors: Xirong Li, Aozhu Chen, Ziyue Wang, Fan Hu, Kaibin Tian, Xinru Chen, Chengbo Dong

    Abstract: We summarize our TRECVID 2022 Ad-hoc Video Search (AVS) experiments. Our solution is built with two new techniques, namely Lightweight Attentional Feature Fusion (LAFF) for combining diverse visual / textual features and Bidirectional Negation Learning (BNL) for addressing queries that contain negation cues. In particular, LAFF performs feature fusion at both early and late stages and at both text…

    Submitted 27 November, 2022; originally announced November 2022.

  42. arXiv:2211.14763  [pdf, other]

    cs.CV cs.AI

    Multi-Label Continual Learning using Augmented Graph Convolutional Network

    Authors: Kaile Du, Fan Lyu, Linyan Li, Fuyuan Hu, Wei Feng, Fenglei Xu, Xuefeng Xi, Hanjing Cheng

    Abstract: Multi-Label Continual Learning (MLCL) builds a class-incremental framework in a sequential multi-label image recognition data stream. The critical challenges of MLCL are the construction of label relationships on past-missing and future-missing partial labels of training data and the catastrophic forgetting on old classes, resulting in poor generalization. To solve the problems, the study proposes…

    Submitted 27 November, 2022; originally announced November 2022.

  43. Justification of Recommender Systems Results: A Service-based Approach

    Authors: Noemi Mauro, Zhongli Filippo Hu, Liliana Ardissono

    Abstract: With the increasing demand for predictable and accountable Artificial Intelligence, the ability to explain or justify recommender systems results by specifying how items are suggested, or why they are relevant, has become a primary goal. However, current models do not explicitly represent the services and actors that the user might encounter during the overall interaction with an item, from its se…

    Submitted 7 November, 2022; originally announced November 2022.

    Journal ref: User Modeling and User-Adapted Interaction (2022)

  44. arXiv:2211.01144  [pdf]

    cs.CR cs.LG cs.SE

    UniASM: Binary Code Similarity Detection without Fine-tuning

    Authors: Yeming Gu, Hui Shu, Fan Hu

    Abstract: Binary code similarity detection (BCSD) is widely used in various binary analysis tasks such as vulnerability search, malware detection, clone detection, and patch analysis. Recent studies have shown that the learning-based binary code embedding models perform better than the traditional feature-based approaches. In this paper, we propose a novel transformer-based binary code embedding model named…

    Submitted 6 April, 2023; v1 submitted 28 October, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  45. arXiv:2210.16730  [pdf]

    cs.AI cs.LG

    Graph Fuzzy System: Concepts, Models and Algorithms

    Authors: Fuping Hu, Zhaohong Deng, Kup-Sze Choi, Shitong Wang

    Abstract: Fuzzy systems (FSs) have enjoyed wide applications in various fields, including pattern recognition, intelligent control, data mining and bioinformatics, which is attributed to the strong interpretation and learning ability. In traditional application scenarios, FSs are mainly applied to model Euclidean space data and cannot be used to handle graph data of non-Euclidean structure in nature, such a…

    Submitted 20 September, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: This paper has been submitted to a journal

  46. arXiv:2210.00507  [pdf, other]

    cs.CV cs.LG

    Fast and Robust Video-Based Exercise Classification via Body Pose Tracking and Scalable Multivariate Time Series Classifiers

    Authors: Ashish Singh, Antonio Bevilacqua, Thach Le Nguyen, Feiyan Hu, Kevin McGuinness, Martin OReilly, Darragh Whelan, Brian Caulfield, Georgiana Ifrim

    Abstract: Technological advancements have spurred the usage of machine learning based applications in sports science. Physiotherapists, sports coaches and athletes actively look to incorporate the latest technologies in order to further improve performance and avoid injuries. While wearable sensors are very popular, their use is hindered by constraints on battery power and sensor calibration, especially for…

    Submitted 2 October, 2022; originally announced October 2022.

  47. arXiv:2209.01746  [pdf, other]

    cs.CV cs.GR

    SPCNet: Stepwise Point Cloud Completion Network

    Authors: Fei Hu, Honghua Chen, Xuequan Lu, Zhe Zhu, Jun Wang, Weiming Wang, Fu Lee Wang, Mingqiang Wei

    Abstract: How will you repair a physical object with large missings? You may first recover its global yet coarse shape and stepwise increase its local details. We are motivated to imitate the above physical repair procedure to address the point cloud completion task. We propose a novel stepwise point cloud completion network (SPCNet) for various 3D models with large missings. SPCNet has a hierarchical botto…

    Submitted 4 September, 2022; originally announced September 2022.

  48. Revisiting Code Search in a Two-Stage Paradigm

    Authors: Fan Hu, Yanlin Wang, Lun Du, Xirong Li, Hongyu Zhang, Shi Han, Dongmei Zhang

    Abstract: With a good code search engine, developers can reuse existing code snippets and accelerate software development process. Current code search methods can be divided into two categories: traditional information retrieval (IR) based and deep learning (DL) based approaches. DL-based approaches include the cross-encoder paradigm and the bi-encoder paradigm. However, both approaches have certain limitat…

    Submitted 27 March, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted by WSDM 2023

  49. arXiv:2208.11271  [pdf, other]

    cs.SE

    Tackling Long Code Search with Splitting, Encoding, and Aggregating

    Authors: Fan Hu, Yanlin Wang, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Xirong Li

    Abstract: Code search with natural language helps us reuse existing code snippets. Thanks to the Transformer-based pretraining models, the performance of code search has been improved significantly. However, due to the quadratic complexity of multi-head self-attention, there is a limit on the input token length. For efficient training on standard GPUs like V100, existing pretrained code models, including Gr…

    Submitted 26 March, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted by LREC-COLING 2024

  50. Bridging the gap between target-based and cell-based drug discovery with a graph generative multi-task model

    Authors: Fan Hu, Dongqi Wang, Huazhen Huang, Yishen Hu, Peng Yin

    Abstract: Drug discovery is vitally important for protecting human against disease. Target-based screening is one of the most popular methods to develop new drugs in the past several decades. This method efficiently screens candidate drugs inhibiting target protein in vitro, but it often fails due to inadequate activity of the selected drugs in vivo. Accurate computational methods are needed to bridge this…

    Submitted 8 August, 2022; originally announced August 2022.

    Journal ref: Journal of Chemical Information and Modeling, 2022