Showing 101–150 of 758 results for author: Fang, Z

  1. arXiv:2312.10714  [pdf, other]

    cs.CV

    Primitive-based 3D Human-Object Interaction Modelling and Programming

    Authors: Siqi Liu, Yong-Lu Li, Zhou Fang, Xinpeng Liu, Yang You, Cewu Lu

    Abstract: Embedding Human and Articulated Object Interaction (HAOI) in 3D is an important direction for a deeper human activity understanding. Different from previous works that use parametric and CAD models to represent humans and objects, in this work, we propose a novel 3D geometric primitive-based language to encode both humans and objects. Given our new paradigm, humans and objects are all compositions… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: AAAI2024

  2. arXiv:2312.09911  [pdf, other]

    cs.SD eess.AS

    Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

    Authors: Xueyao Zhang, Liumeng Xue, Yicheng Gu, Yuancheng Wang, Haorui He, Chaoren Wang, Xi Chen, Zihao Fang, Haopeng Chen, Junan Zhang, Tze Ying Tang, Lexiao Zou, Mingxuan Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu

    Abstract: Amphion is an open-source toolkit for Audio, Music, and Speech Generation, targeting to ease the way for junior researchers and engineers into these fields. It presents a unified framework that is inclusive of diverse generation tasks and models, with the added bonus of being easily extendable for new incorporation. The toolkit is designed with beginner-friendly workflows and pre-trained models, a… ▽ More

    Submitted 22 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Amphion Website: https://github.com/open-mmlab/Amphion

  3. arXiv:2312.09085  [pdf, other]

    cs.CL cs.AI cs.CR cs.CY

    The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation

    Authors: Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, Han Qiu

    Abstract: Large language models (LLMs) encapsulate vast amounts of knowledge but still remain vulnerable to external misinformation. Existing research mainly studied this susceptibility behavior in a single-turn setting. However, belief can change during a multi-turn conversation, especially a persuasive one. Therefore, in this study, we delve into LLMs' susceptibility to persuasive conversations, particula… ▽ More

    Submitted 31 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted to ACL'24 (Main). Camera-ready version

  4. VASP2KP: kp models and Lande g-factors from ab initio calculations

    Authors: Sheng Zhang, Haohao Sheng, Zhi-Da Song, Chenhao Liang, Yi Jiang, Song Sun, Quansheng Wu, Hongming Weng, Zhong Fang, Xi Dai, Zhijun Wang

    Abstract: The $k\cdot p$ method is significant in condensed matter physics for the compact and analytical Hamiltonian. In the presence of magnetic field, it is described by the effective Zeeman's coupling Hamiltonian with Landé $ g $-factors. Here, we develop an open-source package VASP2KP (including two parts: vasp2mat and mat2kp) to compute $k\cdot p$ parameters and Landé $g$-factors directly from the wav… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: Chin. Phys. Lett. 40, 127101 (2023)

  5. arXiv:2312.08682  [pdf, other]

    physics.optics physics.app-ph

    High-coherence parallelization in integrated photonics

    Authors: Xuguang Zhang, Zixuan Zhou, Yijun Guo, Minxue Zhuang, Warren Jin, Bitao Shen, Yujun Chen, Jiahui Huang, Zihan Tao, Ming Jin, Ruixuan Chen, Zhangfeng Ge, Zhou Fang, Ning Zhang, Yadong Liu, Pengfei Cai, Weiwei Hu, Haowen Shu, Dong Pan, John E. Bowers, Xingjun Wang, Lin Chang

    Abstract: Coherent optics has profoundly impacted diverse applications ranging from communications, LiDAR to quantum computations. However, building coherent systems in integrated photonics previously came at great expense in hardware integration and energy efficiency: the lack of a power-efficient way to generate highly coherent light necessitates bulky lasers and amplifiers, while frequency and phase reco… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  6. arXiv:2312.03703  [pdf, other]

    cs.CV

    Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning

    Authors: Xinshun Wang, Zhongbin Fang, Xia Li, Xiangtai Li, Mengyuan Liu

    Abstract: In-context learning provides a new perspective for multi-task modeling for vision and NLP. Under this setting, the model can perceive tasks from prompts and accomplish them without any extra task-specific head predictions or model fine-tuning. However, Skeleton sequence modeling via in-context learning remains unexplored. Directly applying existing in-context models from other areas onto skeleton… ▽ More

    Submitted 2 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://github.com/fanglaosi/Skeleton-in-Context

  7. arXiv:2312.01346  [pdf, other]

    hep-ph

    A holographic study on QCD phase transition and phase diagram with two flavors

    Authors: Xin-Yi Liu, Xiao-Chang Peng, Yue-Liang Wu, Zhen Fang

    Abstract: We investigate the chemical potential effects of the equation of state and the chiral transition in an Einstein-Maxwell-dilaton-scalar system, which is obtained from an improved soft-wall AdS/QCD model coupled with an Einstein-Maxwell-dilaton system. The equations of state obtained from the model are in quantitative agreement with the lattice results at both zero and nonzero chemical potentials. T… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  8. arXiv:2311.17267  [pdf, other]

    cs.CV

    E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer

    Authors: Jacob Zhiyuan Fang, Skyler Zheng, Vasu Sharma, Robinson Piramuthu

    Abstract: To build scalable models for challenging real-world tasks, it is important to learn from diverse, multi-modal data in various forms (e.g., videos, text, and images). Among the existing works, a plethora of them have focused on leveraging large but cumbersome cross-modal architectures. Regardless of their effectiveness, larger architectures unavoidably prevent the models from being extended to real… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  9. arXiv:2311.16754  [pdf, other]

    cs.CV cs.AI

    Towards Full-scene Domain Generalization in Multi-agent Collaborative Bird's Eye View Segmentation for Connected and Autonomous Driving

    Authors: Senkang Hu, Zhengru Fang, Xianhao Chen, Yuguang Fang, Sam Kwong

    Abstract: Collaborative perception has recently gained significant attention in autonomous driving, improving perception quality by enabling the exchange of additional information among vehicles. However, deploying collaborative perception systems can lead to domain shifts due to diverse environmental conditions and data heterogeneity among connected and autonomous vehicles (CAVs). To address these challeng… ▽ More

    Submitted 1 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  10. arXiv:2311.15315  [pdf]

    physics.optics physics.app-ph

    Integrated electro-optically tunable narrow-linewidth III-V laser

    Authors: Yiran Zhu, Shupeng Yu, Zhiwei Fang, Difeng Yin, Jian Liu, Zhe Wang, Yuan Zhou, Yu Ma, Haisu Zhang, Min Wang, Ya Cheng

    Abstract: We demonstrate an integrated electro-optically tunable narrow-linewidth III-V laser with an output power of 738.8 μW and an intrinsic linewidth of 45.55 kHz at the C band. The laser cavity is constructed using a fiber Bragg grating (FBG) and a tunable Sagnac loop reflector (TSLR) fabricated on thin film lithium niobate (TFLN). The combination of the FBG and the electro-optically tunable TSLR offer… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  11. arXiv:2311.12299  [pdf]

    physics.optics physics.app-ph

    Thin Film Lithium Niobate Electro-optic Isolator Fabricated by photolithography assisted chemo-mechanical etching (PLACE)

    Authors: Lang Gao, Youting Liang, Lvbin Song, Difeng Yin, Jia Qi, Jinming Chen, Zhaoxiang Liu, Jianping Yu, Jian Liu, Haisu Zhang, Zhiwei Fang, Hongxin Qi, Ya Cheng

    Abstract: We report a thin-film lithium niobate electro-optic isolator fabricated by photolithography-assisted chemo-mechanical etching in this work. The device demonstrates 39.50 dB isolation when subjected to a 24 GHz microwave of 25.5 dBm on its electrodes. The measured isolation remains consistently above 30 dB within the 1510 nm to 1600 nm wavelength range. The overall device insertion loss, specifical… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  12. Simulation method of urban evacuation based on mesoscopic cellular automata

    Authors: Wei Lv, Jinghui Wang, Zhiming Fang, Dun Mao

    Abstract: This study integrates pedestrian flow characteristics to formulate a mesoscopic cellular automata model tailored for simulating evacuations in large-scale scenarios. Departing from the conventional planar grid cell division, the model employs road cell segmentation, thereby physically enlarging the dimensions of individual cells. This augmentation accommodates an increased occupancy of individuals… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 13 pages, 14 figures

    Journal ref: Acta Physica Sinica, 70(10), 76-84.[In Chinese] (2021)

  13. Learning Contrastive Self-Distillation for Ultra-Fine-Grained Visual Categorization Targeting Limited Samples

    Authors: Ziye Fang, Xin Jiang, Hao Tang, Zechao Li

    Abstract: In the field of intelligent multimedia analysis, ultra-fine-grained visual categorization (Ultra-FGVC) plays a vital role in distinguishing intricate subcategories within broader categories. However, this task is inherently challenging due to the complex granularity of category subdivisions and the limited availability of data for each category. To address these challenges, this work proposes CSDN… ▽ More

    Submitted 25 February, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted for Publication in TCSVT

  14. Exploring crowd persistent dynamism from pedestrian crossing perspective: An empirical study

    Authors: Jinghui Wang, Wei Lv, Huihua Jiang, Zhiming Fang, Jian Ma

    Abstract: Crowd studies have gained increasing relevance due to the recurring incidents of crowd crush accidents. In addressing the issue of the crowd's persistent dynamism, this paper explored the macroscopic and microscopic features of pedestrians crossing in static and dynamic contexts, employing a series of systematic experiments. Firstly, empirical evidence has confirmed the existence of crowd's persis… ▽ More

    Submitted 26 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 31 pages, 17 figures

    Journal ref: Transportation Research Part C: Emerging Technologies, Volume 157, 2023, 104400

  15. arXiv:2311.03236  [pdf, other]

    cs.LG cs.MM

    Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources

    Authors: Haotian Zheng, Qizhou Wang, Zhen Fang, Xiaobo Xia, Feng Liu, Tongliang Liu, Bo Han

    Abstract: Out-of-distribution (OOD) detection discerns OOD data where the predictor cannot make valid predictions as in-distribution (ID) data, thereby increasing the reliability of open-world classification. However, it is typically hard to collect real out-of-distribution (OOD) data for training a predictor capable of discerning ID and OOD patterns. This obstacle gives rise to data generation-based learni… ▽ More

    Submitted 5 December, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  16. arXiv:2311.01796  [pdf, other]

    cs.LG

    Learning to Augment Distributions for Out-of-Distribution Detection

    Authors: Qizhou Wang, Zhen Fang, Yonggang Zhang, Feng Liu, Yixuan Li, Bo Han

    Abstract: Open-world classification systems should discern out-of-distribution (OOD) data whose labels deviate from those of in-distribution (ID) cases, motivating recent studies in OOD detection. Advanced works, despite their promising progress, may still fail in the open world, owing to the lack of knowledge about unseen OOD data in advance. Although one can access auxiliary OOD data (distinct from unseen… ▽ More

    Submitted 25 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

  17. arXiv:2311.01483  [pdf, other]

    cs.LG cs.AI cs.DC

    FedSN: A Novel Federated Learning Framework over LEO Satellite Networks

    Authors: Zheng Lin, Zhe Chen, Zihan Fang, Xianhao Chen, Xiong Wang, Yue Gao

    Abstract: Recently, a large number of Low Earth Orbit (LEO) satellites have been launched and deployed successfully in space by commercial companies, such as SpaceX. Due to multimodal sensors equipped by the LEO satellites, they serve not only for communication but also for various machine learning applications, such as space modulation recognition, remote sensing image classification, etc. However, the gro… ▽ More

    Submitted 2 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 14 pages, 17 figures

  18. arXiv:2311.00836  [pdf, ps, other]

    math.OC eess.SP math.PR stat.CO

    Effective filtering approach for joint parameter-state estimation in SDEs via Rao-Blackwellization and modularization

    Authors: Zhou Fang, Ankit Gupta, Mustafa Khammash

    Abstract: Stochastic filtering is a vibrant area of research in both control theory and statistics, with broad applications in many scientific fields. Despite its extensive historical development, there still lacks an effective method for joint parameter-state estimation in SDEs. The state-of-the-art particle filtering methods suffer from either sample degeneracy or information loss, with both issues stemmi… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures

    MSC Class: 62M20; 62F15; 65C05; 92-08; 93E11

  19. arXiv:2310.18764  [pdf]

    cond-mat.mtrl-sci

    Atomistic Processes of high-temperature plastic deformation of nanoscale body-centered cubic tungsten

    Authors: Sixue Zheng, Zhengwu Fang, Scott X. Mao

    Abstract: Much scientific and practical interest is currently focused on the atomic-scale mechanical behaviors of metallic nanocrystals with different crystal structures at room temperature, while the high-temperature plastic deformation in tungsten nanocrystals remains not well understood, due to the technical difficulty in elevating the experimental temperature during in situ mechanical tests in an extrem… ▽ More

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Modify the figure captions. Reduce the file size

  20. arXiv:2310.16655  [pdf, other]

    cs.LG

    Towards Control-Centric Representations in Reinforcement Learning from Images

    Authors: Chen Liu, Hongyu Zang, Xin Li, Yong Heng, Yifei Wang, Zhen Fang, Yisen Wang, Mingzhong Wang

    Abstract: Image-based Reinforcement Learning is a practical yet challenging task. A major hurdle lies in extracting control-centric representations while disregarding irrelevant information. While approaches that follow the bisimulation principle exhibit the potential in learning state representations to address this issue, they still grapple with the limited expressive capacity of latent dynamics and the i… ▽ More

    Submitted 27 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  21. arXiv:2310.16123  [pdf, other]

    cs.LG

    Anchor Space Optimal Transport: Accelerating Batch Processing of Multiple OT Problems

    Authors: Jianming Huang, Xun Su, Zhongxi Fang, Hiroyuki Kasai

    Abstract: The optimal transport (OT) theory provides an effective way to compare probability distributions on a defined metric space, but it suffers from cubic computational complexity. Although Sinkhorn's algorithm greatly reduces the computational complexity of OT solutions, the solutions of multiple OT problems are still time-consuming and memory-consuming in practice. However, many works on the comp… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 26 pages, 4 figures, 6 tables

  22. arXiv:2310.14541  [pdf, other]

    cs.CL

    Continual Named Entity Recognition without Catastrophic Forgetting

    Authors: Duzhen Zhang, Wei Cong, Jiahua Dong, Yahan Yu, Xiuyi Chen, Yonggang Zhang, Zhen Fang

    Abstract: Continual Named Entity Recognition (CNER) is a burgeoning area, which involves updating an existing model by incorporating new entity types sequentially. Nevertheless, continual learning approaches are often severely afflicted by catastrophic forgetting. This issue is intensified in CNER due to the consolidation of old entity types from previous steps into the non-entity type at each step, leading… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP2023 main conference as a long paper

  23. arXiv:2310.14344  [pdf, other]

    cs.CV cs.LG

    What's in a Prior? Learned Proximal Networks for Inverse Problems

    Authors: Zhenghan Fang, Sam Buchanan, Jeremias Sulam

    Abstract: Proximal operators are ubiquitous in inverse problems, commonly appearing as part of algorithmic strategies to regularize problems that are otherwise ill-posed. Modern deep learning models have been brought to bear for these tasks too, as in the framework of plug-and-play or deep unrolling, where they loosely resemble proximal operators. Yet, something essential is lost in employing these purely d… ▽ More

    Submitted 27 March, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

  24. arXiv:2310.11160  [pdf, other]

    cs.SD eess.AS

    Leveraging Diverse Semantic-based Audio Pretrained Models for Singing Voice Conversion

    Authors: Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Junan Zhang, Liumeng Xue, Jinchao Zhang, Jie Zhou, Zhizheng Wu

    Abstract: Singing Voice Conversion (SVC) is a technique that enables any singer to perform any song. To achieve this, it is essential to obtain speaker-agnostic representations from the source audio, which poses a significant challenge. A common solution involves utilizing a semantic-based audio pretrained model as a feature extractor. However, the degree to which the extracted features can meet the SVC req… ▽ More

    Submitted 27 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  25. arXiv:2310.11093  [pdf, other]

    cs.LG cs.CV

    SODA: Robust Training of Test-Time Data Adaptors

    Authors: Zige Wang, Yonggang Zhang, Zhen Fang, Long Lan, Wenjing Yang, Bo Han

    Abstract: Adapting models deployed to test distributions can mitigate the performance degradation caused by distribution shifts. However, privacy concerns may render model parameters inaccessible. One promising approach involves utilizing zeroth-order optimization (ZOO) to train a data adaptor to adapt the test data to fit the deployed models. Nevertheless, the data adaptor trained with ZOO typically brings… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  26. arXiv:2310.09475  [pdf]

    physics.chem-ph

    Twisted DNA origami-based chiral monolayers for spin filtering

    Authors: Haozhi Wang, Fangfei Yin, Linyun Li, Mingqiang Li, Zheng Fang, Chenyun Sun, Bochen Li, Jiye Shi, Jiang Li, Lihua Wang, Shiping Song, Xiaolei Zuo, Xiaoguo Liu, Chunhai Fan

    Abstract: DNA monolayers with inherent chirality play a pivotal role across various domains, including biosensors, DNA chips, and bioelectronics. Nonetheless, conventional DNA chiral monolayers, typically constructed from single-stranded DNA (ssDNA) or double-stranded DNA (dsDNA), often lack structural orderliness and design flexibility at the interface. Structural DNA nanotechnology emerges as a promising… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  27. arXiv:2310.07464  [pdf]

    eess.IV cs.LG q-bio.QM

    Deep Learning Predicts Biomarker Status and Discovers Related Histomorphology Characteristics for Low-Grade Glioma

    Authors: Zijie Fang, Yihan Liu, Yifeng Wang, Xiangyang Zhang, Yang Chen, Changjing Cai, Yiyang Lin, Ying Han, Zhi Wang, Shan Zeng, Hong Shen, Jun Tan, Yongbing Zhang

    Abstract: Biomarker detection is an indispensable part in the diagnosis and treatment of low-grade glioma (LGG). However, current LGG biomarker detection methods rely on expensive and complex molecular genetic testing, for which professionals are required to analyze the results, and intra-rater variability is often reported. To overcome these challenges, we propose an interpretable deep learning pipeline, a… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 47 pages, 6 figures

  28. arXiv:2310.06403  [pdf, other]

    cs.CV

    Boundary Discretization and Reliable Classification Network for Temporal Action Detection

    Authors: Zhenying Fang, Jun Yu, Richang Hong

    Abstract: Temporal action detection aims to recognize the action category and determine each action instance's starting and ending time in untrimmed videos. The mixed methods have achieved remarkable performance by seamlessly merging anchor-based and anchor-free approaches. Nonetheless, there are still two crucial issues within the mixed framework: (1) Brute-force merging and handcrafted anchor design hinde… ▽ More

    Submitted 7 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 12 pages, Source code: https://github.com/zhenyingfang/BDRC-Net

  29. arXiv:2310.00013  [pdf, other]

    cs.AI

    Adaptive Communications in Collaborative Perception with Domain Alignment for Autonomous Driving

    Authors: Senkang Hu, Zhengru Fang, Haonan An, Guowen Xu, Yuan Zhou, Xianhao Chen, Yuguang Fang

    Abstract: Collaborative perception among multiple connected and autonomous vehicles can greatly enhance perceptive capabilities by allowing vehicles to exchange supplementary information via communications. Despite advances in previous approaches, challenges still remain due to channel variations and data heterogeneity among collaborative vehicles. To address these issues, we propose ACC-DA, a channel-aware… ▽ More

    Submitted 16 March, 2024; v1 submitted 14 September, 2023; originally announced October 2023.

    Comments: 6 pages, 6 figures

  30. arXiv:2309.16730  [pdf]

    cs.LG cs.CY

    Explainable machine learning-based prediction model for diabetic nephropathy

    Authors: Jing-Mei Yin, Yang Li, Jun-Tang Xue, Guo-Wei Zong, Zhong-Ze Fang, Lang Zou

    Abstract: The aim of this study is to analyze the effect of serum metabolites on diabetic nephropathy (DN) and predict the prevalence of DN through a machine learning approach. The dataset consists of 548 patients from April 2018 to April 2019 in Second Affiliated Hospital of Dalian Medical University (SAHDMU). We select the optimal 38 features through a Least absolute shrinkage and selection operator (LASS… ▽ More

    Submitted 24 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  31. arXiv:2309.13660  [pdf]

    eess.SP

    Non-Uniform Sampling Reconstruction for Symmetrical NMR Spectroscopy by Exploiting Inherent Symmetry

    Authors: Enping Lin, Ze Fang, Yuqing Huang, Yu Yang, Zhong Chen

    Abstract: Symmetrical NMR spectroscopy constitutes a vital branch of multidimensional NMR spectroscopy, providing a powerful tool for the structural elucidation of biological macromolecules. Non-Uniform Sampling (NUS) serves as an effective strategy for averting the prohibitive acquisition time of multidimensional NMR spectroscopy by only sampling a few points according to NUS sampling schedules and reconst… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 30 pages, 6 figures

  32. arXiv:2309.13035  [pdf, other]

    cs.RO

    PyPose v0.6: The Imperative Programming Interface for Robotics

    Authors: Zitong Zhan, Xiangfu Li, Qihang Li, Haonan He, Abhinav Pandey, Haitao Xiao, Yangmengfei Xu, Xiangyu Chen, Kuan Xu, Kun Cao, Zhipeng Zhao, Zihan Wang, Huan Xu, Zihang Fang, Yutian Chen, Wentao Wang, Xu Fang, Yi Du, Tianhao Wu, Xiao Lin, Yuheng Qiu, Fan Yang, Jingnan Shi, Shaoshu Su, Yiren Lu, et al. (11 additional authors not shown)

    Abstract: PyPose is an open-source library for robot learning. It combines a learning-based approach with physics-based optimization, which enables seamless end-to-end robot learning. It has been used in many tasks due to its meticulously designed application programming interface (API) and efficient implementation. From its initial launch in early 2022, PyPose has experienced significant enhancements, inco… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  33. arXiv:2309.12559  [pdf, other]

    cs.LG cs.AI cs.CV

    Invariant Learning via Probability of Sufficient and Necessary Causes

    Authors: Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

    Abstract: Out-of-distribution (OOD) generalization is indispensable for learning models in the wild, where the testing distribution is typically unknown and different from the training distribution. Recent methods derived from causality have shown great potential in achieving OOD generalization. However, existing methods mainly focus on the invariance property of causes, while largely overlooking the property of \textit{suffic… ▽ More

    Submitted 10 May, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  34. arXiv:2309.11751  [pdf, other]

    cs.CV cs.AI cs.CR cs.LG

    How Robust is Google's Bard to Adversarial Image Attacks?

    Authors: Yinpeng Dong, Huanran Chen, Jiawei Chen, Zhengwei Fang, Xiao Yang, Yichi Zhang, Yu Tian, Hang Su, Jun Zhu

    Abstract: Multimodal Large Language Models (MLLMs) that integrate text and other modalities (especially vision) have achieved unprecedented performance in various multimodal tasks. However, due to the unsolved adversarial robustness problem of vision models, MLLMs can have more severe safety and security risks by introducing the vision inputs. In this work, we study the adversarial robustness of Google's Ba… ▽ More

    Submitted 14 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Technical report

  35. arXiv:2309.11705  [pdf, other]

    cs.LG cs.CV

    Meta OOD Learning for Continuously Adaptive OOD Detection

    Authors: Xinheng Wu, Jie Lu, Zhen Fang, Guangquan Zhang

    Abstract: Out-of-distribution (OOD) detection is crucial to modern deep learning applications by identifying and alerting about the OOD samples that should not be tested or used for making predictions. Current OOD detection methods have made significant progress when in-distribution (ID) and OOD samples are drawn from static distributions. However, this can be unrealistic when applied to real-world systems… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  36. arXiv:2309.09949  [pdf, other]

    cs.AI cs.CL

    How to Generate Popular Post Headlines on Social Media?

    Authors: Zhouxiang Fang, Min Yu, Zhendong Fu, Boning Zhang, Xuanwen Huang, Xiaoqi Tang, Yang Yang

    Abstract: Posts, as important containers of user-generated-content pieces on social media, are of tremendous social influence and commercial value. As an integral component of a post, the headline has a decisive contribution to the post's popularity. However, the current mainstream method for headline generation is still manual writing, which is unstable and requires extensive human effort. This drives us to… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  37. arXiv:2309.04270  [pdf, other]

    eess.SP cs.MA

    A Reliable and Resilient Framework for Multi-UAV Mutual Localization

    Authors: Zexin Fang, Bin Han, Hans D. Schotten

    Abstract: This paper presents a robust and secure framework for achieving accurate and reliable mutual localization in multiple unmanned aerial vehicle (UAV) systems. Challenges of accurate localization and security threats are addressed and corresponding solutions are brought forth and assessed in our paper with numerical simulations. The proposed solution incorporates two key components: the Mobility Adap… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted by the 2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall), Hong Kong, 10-13 October 2023

  38. arXiv:2309.03084  [pdf, other]

    cs.AI cs.GT cs.LG

    Pure Monte Carlo Counterfactual Regret Minimization

    Authors: Ju Qi, Ting Feng, Falun Hei, Zhemei Fang, Yunfeng Luo

    Abstract: Counterfactual Regret Minimization (CFR) and its variants are the best algorithms so far for solving large-scale incomplete information games. However, we believe that there are two problems with CFR: First, matrix multiplication is required in CFR iteration, and the time complexity of one iteration is too high; Secondly, the game characteristics in the real world are different. Just using one CFR… ▽ More

    Submitted 13 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  39. arXiv:2308.16320  [pdf, other]

    cs.GT

    Information Disclosure under Competition in Sharing Systems

    Authors: Ningning Ding, Zhixuan Fang, Jianwei Huang

    Abstract: Sharing systems have facilitated the redistribution of underused resources by providing convenient online marketplaces for individual sellers and buyers. However, sellers in these systems may not fully disclose the information of their shared commodities, due to strategic behaviors or privacy concerns. Sellers' strategic information disclosure significantly affects buyers' user experiences and sys… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  40. arXiv:2308.12939  [pdf, other]

    cs.LG math.NA physics.comp-ph

    Learning Only On Boundaries: a Physics-Informed Neural operator for Solving Parametric Partial Differential Equations in Complex Geometries

    Authors: Zhiwei Fang, Sifan Wang, Paris Perdikaris

    Abstract: Recently deep learning surrogates and neural operators have shown promise in solving partial differential equations (PDEs). However, they often require a large amount of training data and are limited to bounded domains. In this work, we present a novel physics-informed neural operator method to solve parametrized boundary value problems without labeled data. By reformulating the PDEs into boundary… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  41. arXiv:2308.12055  [pdf, other]

    cond-mat.mtrl-sci cond-mat.supr-con

    Majorana corner modes in unconventional monolayers of 1T-PtSe2 family

    Authors: Haohao Sheng, Yue Xie, Quansheng Wu, Hongming Weng, Xi Dai, B. Andrei Bernevig, Zhong Fang, Zhijun Wang

    Abstract: In this work, we propose that Majorana zero modes can be realized at the corners of a topologically trivial insulator with unconventionality. We demonstrate that 1T-PtSe$_2$ is a symmetry indicator-free (SI-free) unconventional insulator, originating from orbital hybridization between Pt $d$ and Se $p_{x,y}$ states. The new kind of SI-free unconventionality has no symmetry eigenvalue indication. I… ▽ More

    Submitted 14 December, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  42. arXiv:2308.11875  [pdf, other]

    cs.CV

    Motion-to-Matching: A Mixed Paradigm for 3D Single Object Tracking

    Authors: Zhiheng Li, Yu Lin, Yubo Cui, Shuo Li, Zheng Fang

    Abstract: 3D single object tracking with LiDAR points is an important task in the computer vision field. Previous methods usually adopt the matching-based or motion-centric paradigms to estimate the current target status. However, the former is sensitive to the similar distractors and the sparseness of point cloud due to relying on appearance matching, while the latter usually focuses on short-term motion c…

    Submitted 18 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at IEEE Robotics and Automation Letters (RAL)

  43. HICL: Hashtag-Driven In-Context Learning for Social Media Natural Language Understanding

    Authors: Hanzhuo Tan, Chunpu Xu, Jing Li, Yuqun Zhang, Zeyang Fang, Zeyu Chen, Baohua Lai

    Abstract: Natural language understanding (NLU) is integral to various social media applications. However, existing NLU models rely heavily on context for semantic learning, resulting in compromised performance when faced with short and noisy social media content. To address this issue, we leverage in-context learning (ICL), wherein language models learn to make inferences by conditioning on a handful of dem…

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: https://github.com/albertan017/HICL

    Journal ref: 10.1109/TNNLS.2024.3384987

  44. arXiv:2308.08740  [pdf]

    physics.optics

    On-chip coherent beam combination of waveguide amplifiers on Er$^{3+}$-doped thin film lithium niobate

    Authors: Rui Bao, Lvbin Song, Zhiwei Fang, **min Chen, Zhe Wang, Jian Liu, Lang Gao, Zhaoxiang Liu, Zhihao Zhang, Min Wang, Haisu Zhang, Ya Cheng

    Abstract: We demonstrate on-chip coherent beam combination of two waveguide amplifiers on Er$^{3+}$-doped thin film lithium niobate (Er: TFLN) platform. Our device is built based on an electro-optic modulator fabricated on Er: TFLN. The output power of the coherently combined amplifiers is measured as high as 12.9 mW, surpassing that of previous single waveguide amplifiers based on Er$^{3+}$-doped thin film…

    Submitted 16 August, 2023; originally announced August 2023.

  45. arXiv:2308.03666  [pdf, other]

    stat.ML cs.LG

    Bridging Trustworthiness and Open-World Learning: An Exploratory Neural Approach for Enhancing Interpretability, Generalization, and Robustness

    Authors: Shide Du, Zihan Fang, Shiyang Lan, Yanchao Tan, Manuel Günther, Shiping Wang, Wenzhong Guo

    Abstract: As researchers strive to narrow the gap between machine intelligence and human through the development of artificial intelligence technologies, it is imperative that we recognize the critical importance of trustworthiness in open-world, which has become ubiquitous in all aspects of daily life for everyone. However, several challenges may create a crisis of trust in current artificial intelligence…

    Submitted 18 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  46. arXiv:2308.01098  [pdf, other]

    cs.IR cs.AI

    Towards Better Query Classification with Multi-Expert Knowledge Condensation in JD Ads Search

    Authors: Kun-Peng Ning, Ming Pang, Zheng Fang, Xue Jiang, Xi-Wei Zhao, Chang-Ping Peng, Zhan-Gang Lin, Jing-He Hu, Jing-Ping Shao

    Abstract: Search query classification, as an effective way to understand user intents, is of great importance in real-world online ads systems. To ensure a lower latency, a shallow model (e.g. FastText) is widely used for efficient online inference. However, the representation ability of the FastText model is insufficient, resulting in poor classification performance, especially on some low-frequency querie…

    Submitted 19 November, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  47. arXiv:2307.16562  [pdf, other]

    cs.CR

    SAKSHI: Decentralized AI Platforms

    Authors: Suma Bhat, Canhui Chen, Zerui Cheng, Zhixuan Fang, Ashwin Hebbar, Sreeram Kannan, Ranvir Rana, Peiyao Sheng, Himanshu Tyagi, Pramod Viswanath, Xuechao Wang

    Abstract: Large AI models (e.g., Dall-E, GPT4) have electrified the scientific, technological and societal landscape through their superhuman capabilities. These services are offered largely in a traditional web2.0 format (e.g., OpenAI's GPT4 service). As more large AI models proliferate (personalizing and specializing to a variety of domains), there is a tremendous need to have a neutral trust-free platfor…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 23 pages, 9 figures

  48. arXiv:2307.12103  [pdf]

    physics.optics

    Non-volatile Phase-only Transmissive Spatial Light Modulators

    Authors: Zhuoran Fang, Rui Chen, Johannes E. Fröch, Quentin A. A. Tanguy, Asir Intisar Khan, Xiangjin Wu, Virat Tara, Arnab Manna, David Sharp, Christopher Munley, Forrest Miller, Yang Zhao, Sarah J. Geiger, Karl F. Böhringer, Matthew Reynolds, Eric Pop, Arka Majumdar

    Abstract: Free-space modulation of light is crucial for many applications, from light detection and ranging to virtual or augmented reality. Traditional means of modulating free-space light involves spatial light modulators based on liquid crystals and microelectromechanical systems, which are bulky, have large pixel areas (~10 micron x 10 micron), and require high driving voltage. Recent progress in meta-o…

    Submitted 22 July, 2023; originally announced July 2023.

  49. arXiv:2307.11530  [pdf, other]

    eess.IV cs.CV

    UWAT-GAN: Fundus Fluorescein Angiography Synthesis via Ultra-wide-angle Transformation Multi-scale GAN

    Authors: Zhaojie Fang, Zhanghao Chen, Pengxue Wei, Wangting Li, Shaochong Zhang, Ahmed Elazab, Gangyong Jia, Ruiquan Ge, Changmiao Wang

    Abstract: Fundus photography is an essential examination for clinical and differential diagnosis of fundus diseases. Recently, Ultra-Wide-angle Fundus (UWF) techniques, UWF Fluorescein Angiography (UWF-FA) and UWF Scanning Laser Ophthalmoscopy (UWF-SLO) have been gradually put into use. However, Fluorescein Angiography (FA) and UWF-FA require injecting sodium fluorescein which may have detrimental influence…

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 26th International Conference on Medical Image Computing and Computer Assisted Intervention

  50. arXiv:2307.10371  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Enumeration of spin-space groups: Towards a complete description of symmetries of magnetic orders

    Authors: Yi Jiang, Ziyin Song, Tiannian Zhu, Zhong Fang, Hongming Weng, Zheng-Xin Liu, Jian Yang, Chen Fang

    Abstract: Symmetries of three-dimensional periodic scalar fields are described by 230 space groups (SGs). Symmetries of three-dimensional periodic (pseudo-) vector fields, however, are described by the spin-space groups (SSGs), which were initially used to describe the symmetries of magnetic orders. In SSGs, the real-space and spin degrees of freedom are unlocked in the sense that an operation could have di…

    Submitted 19 July, 2023; originally announced July 2023.