Skip to main content

Showing 1–50 of 124 results for author: Bo, L

.
  1. arXiv:2406.16864  [pdf, other

    cs.CV cs.AI cs.GR

    StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

    Authors: Chongjie Ye, Lingteng Qiu, Xiaodong Gu, Qi Zuo, Yushuang Wu, Zilong Dong, Liefeng Bo, Yuliang Xiu, Xiaoguang Han

    Abstract: This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle with stochastic inference, conflicting with the deterministic nature of the Image2Normal task, and costly ensembling step, which slows down the e… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: HF Demo: hf.co/Stable-X, Video: https://www.youtube.com/watch?v=sylXTxG_U2U

  2. arXiv:2406.14927  [pdf, other

    cs.CV cs.RO

    Gaussian-Informed Continuum for Physical Property Identification and Simulation

    Authors: Junhao Cai, Yuji Yang, Weihao Yuan, Yisheng He, Zilong Dong, Liefeng Bo, Hui Cheng, Qifeng Chen

    Abstract: This paper studies the problem of estimating physical properties (system identification) through visual observations. To facilitate geometry-aware guidance in physical property estimation, we introduce a novel hybrid framework that leverages 3D Gaussian representation to not only capture explicit shapes but also enable the simulated continuum to deduce implicit shapes during training. We propose a… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures

  3. arXiv:2405.15176  [pdf, other

    cs.CV

    MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method

    Authors: Pan Liao, Feng Yang, Di Wu, Liu Bo

    Abstract: Monocular vision-based 3D object detection is crucial in various sectors, yet existing methods face significant challenges in terms of accuracy and computational efficiency. Building on the successful strategies in 2D detection and depth estimation, we propose MonoDETRNext, which seeks to optimally balance precision and processing speed. Our methodology includes the development of an efficient hyb… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. Suppression of the skyrmion Hall effect in synthetic ferrimagnets with gradient magnetization

    Authors: Lan Bo, Xichao Zhang, Masahito Mochizuki, Xuefeng Zhang

    Abstract: Magnetic skyrmions are promising building blocks for future spintronic devices. However, the skyrmion Hall effect (SkHE) remains an obstacle for practical applications based on the in-line transport of skyrmions. Here, we numerically study the static properties and current-driven dynamics of synthetic ferrimagnetic skyrmions. Inspired by graded-index magnonics, we introduce a linear gradient of sa… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

    Journal ref: Physical Review Research 6, 023199 (2024)

  5. arXiv:2404.02514  [pdf, other

    cs.CV

    Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

    Authors: Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, Qixing Huang

    Abstract: This paper enables high-fidelity, transferable NeRF editing by frequency decomposition. Recent NeRF editing pipelines lift 2D stylization results to 3D scenes while suffering from blurry results, and fail to capture detailed structures caused by the inconsistency between 2D editings. Our critical insight is that low-frequency components of images are more multiview-consistent after editing compare… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  6. arXiv:2404.00269  [pdf, other

    cs.CV

    IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images

    Authors: Yushuang Wu, Luyue Shi, Junhao Cai, Weihao Yuan, Lingteng Qiu, Zilong Dong, Liefeng Bo, Shuguang Cui, Xiaoguang Han

    Abstract: Generalizable 3D object reconstruction from single-view RGB-D images remains a challenging task, particularly with real-world data. Current state-of-the-art methods develop Transformer-based implicit field learning, necessitating an intensive learning paradigm that requires dense query-supervision uniformly sampled throughout the entire space. We propose a novel approach, IPoD, which harmonizes im… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: CVPR 2024

  7. arXiv:2403.15559  [pdf, other

    cs.CV cs.AI

    An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models

    Authors: Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, Qixing Huang

    Abstract: A fundamental problem in the texturing of 3D meshes using pre-trained text-to-image models is to ensure multi-view consistency. State-of-the-art approaches typically use diffusion models to aggregate multi-view inputs, where common issues are the blurriness caused by the averaging operation in the aggregation step or inconsistencies in local features. This paper introduces an optimization framewor… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  8. arXiv:2403.12396  [pdf, other

    cs.CV cs.RO

    OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation

    Authors: Junhao Cai, Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, Qifeng Chen

    Abstract: This paper studies a new open-set problem, the open-vocabulary category-level object pose and size estimation. Given human text descriptions of arbitrary novel object categories, the robot agent seeks to predict the position, orientation, and size of the target object in the observed scene image. To enable such generalizability, we first introduce OO3D-9D, a large-scale photorealistic dataset for… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  9. arXiv:2403.12010  [pdf, other

    cs.CV cs.AI cs.GR

    VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

    Authors: Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, Qixing Huang

    Abstract: Generating multi-view images based on text or single-image prompts is a critical capability for the creation of 3D content. Two fundamental questions on this topic are what data we use for training and how to ensure multi-view consistency. This paper introduces a novel framework that makes fundamental contributions to both questions. Unlike leveraging images from 2D diffusion models for training,… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: aigc3d.github.io/VideoMV/

  10. arXiv:2402.17485  [pdf, other

    cs.CV

    EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

    Authors: Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo

    Abstract: In this work, we tackle the challenge of enhancing the realism and expressiveness in talking head video generation by focusing on the dynamic and nuanced relationship between audio cues and facial movements. We identify the limitations of traditional techniques that often fail to capture the full spectrum of human expressions and the uniqueness of individual facial styles. To address these issues,… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2402.15219  [pdf, other

    cond-mat.mes-hall

    Global Rotation of Skyrmion Bags under Vertical Microwave Fields

    Authors: Lan Bo, Rongzhi Zhao, Xichao Zhang, Masahito Mochizuki, Xuefeng Zhang

    Abstract: Magnetic skyrmion bags are composite topological spin textures with arbitrary topological charges. Here, we computationally study the transient rotational motion of skyrmion bags, which is characterized by a global rotation of the inner skyrmions around the central point. Distinct from conventional rotational modes found in skyrmions, the observed rotation is a forced motion associated with the br… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

    Journal ref: J. Appl. Phys. 135, 063905 (2024)

  12. arXiv:2401.14886  [pdf, other

    cs.CR cs.SE

    Coca: Improving and Explaining Graph Neural Network-Based Vulnerability Detection Systems

    Authors: Sicong Cao, Xiaobing Sun, Xiaoxue Wu, David Lo, Lili Bo, Bin Li, Wei Liu

    Abstract: Recently, Graph Neural Network (GNN)-based vulnerability detection systems have achieved remarkable success. However, the lack of explainability poses a critical challenge to deploy black-box models in security-related domains. For this reason, several approaches have been proposed to explain the decision logic of the detection model by providing a set of crucial statements positively contributing… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: To appear in the Technical Track of ICSE 2024

  13. arXiv:2401.14617  [pdf, other

    cs.SE cs.AI

    A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research

    Authors: Sicong Cao, Xiaobing Sun, Ratnadira Widyasari, David Lo, Xiaoxue Wu, Lili Bo, Jiale Zhang, Bin Li, Wei Liu, Di Wu, Yixin Chen

    Abstract: The remarkable achievements of Artificial Intelligence (AI) algorithms, particularly in Machine Learning (ML) and Deep Learning (DL), have fueled their extensive deployment across multiple sectors, including Software Engineering (SE). However, due to their black-box nature, these promising AI-driven SE models are still far from being deployed in practice. This lack of explainability poses unwanted… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: submitted to ACM Computing Surveys. arXiv admin note: text overlap with arXiv:2202.06840 by other authors

  14. arXiv:2401.14257  [pdf, other

    cs.CV cs.AI

    Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation

    Authors: Minglin Chen, Weihao Yuan, Yukun Wang, Zhe Sheng, Yisheng He, Zilong Dong, Liefeng Bo, Yulan Guo

    Abstract: Recently, text-to-3D approaches have achieved high-fidelity 3D content generation using text description. However, the generated objects are stochastic and lack fine-grained control. Sketches provide a cheap approach to introduce such fine-grained control. Nevertheless, it is challenging to achieve flexible control from these sketches due to their abstraction and ambiguity. In this paper, we prese… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 11 pages, 9 figures

  15. arXiv:2401.10242  [pdf, other

    cs.OH cs.GR cs.HC cs.SD eess.AS

    DanceMeld: Unraveling Dance Phrases with Hierarchical Latent Codes for Music-to-Dance Synthesis

    Authors: Xin Gao, Li Hu, Peng Zhang, Bang Zhang, Liefeng Bo

    Abstract: In the realm of 3D digital human applications, music-to-dance presents a challenging task. Given the one-to-many relationship between music and dance, previous methods have been limited in their approach, relying solely on matching and generating corresponding dance movements based on music rhythm. In the professional field of choreography, a dance phrase consists of several dance poses and dance… ▽ More

    Submitted 30 November, 2023; originally announced January 2024.

    Comments: 10 pages, 8 figures

  16. arXiv:2312.17641  [pdf, other

    cs.CV

    Motion State: A New Benchmark Multiple Object Tracking

    Authors: Yang Feng, Liao Pan, Wu Di, Liu Bo, Zhang Xingle

    Abstract: In the realm of video analysis, the field of multiple object tracking (MOT) assumes paramount importance, with the motion state of objects-whether static or dynamic relative to the ground-holding practical significance across diverse scenarios. However, the extant literature exhibits a notable dearth in the exploration of this aspect. Deep learning methodologies encounter challenges in accurately… ▽ More

    Submitted 7 May, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

  17. arXiv:2312.15430  [pdf, other

    cs.CV

    Make-A-Character: High Quality Text-to-3D Character Generation within Minutes

    Authors: Jianqiang Ren, Chao He, Lin Liu, Jiahao Chen, Yutong Wang, Yafei Song, Jianfang Li, Tangli Xue, Siqi Hu, Tao Chen, Kunkun Zheng, Jian**g Xiang, Liefeng Bo

    Abstract: There is a growing demand for customized and expressive 3D characters with the emergence of AI agents and Metaverse, but creating 3D characters using traditional computer graphics tools is a complex and time-consuming task. To address these challenges, we propose a user-friendly framework named Make-A-Character (Mach) to create lifelike 3D avatars from text descriptions. The framework leverages th… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Technical Report

  18. arXiv:2312.13309  [pdf, other

    cs.CV cs.AI

    Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style

    Authors: Haohan Wang, Wei Feng, Yang Lu, Yaoyu Li, Zheng Zhang, **g**g Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Lixing Bo, **g** Shao

    Abstract: The state-of-the-art methods for e-commerce product background generation suffer from the inefficiency of designing product-wise prompts when scaling up the production, as well as the ineffectiveness of describing fine-grained styles when customizing personalized backgrounds for some specific brands. To address these obstacles, we integrate the category commonality and personalized style into diff… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 12 pages, 11 figures

  19. arXiv:2312.12726  [pdf, other

    cs.CV

    Reducing Shape-Radiance Ambiguity in Radiance Fields with a Closed-Form Color Estimation Method

    Authors: Qihang Fang, Yafei Song, Keqiang Li, Liefeng Bo

    Abstract: Neural radiance field (NeRF) enables the synthesis of cutting-edge realistic novel view images of a 3D scene. It includes density and color fields to model the shape and radiance of a scene, respectively. Supervised by the photometric loss in an end-to-end training manner, NeRF inherently suffers from the shape-radiance ambiguity problem, i.e., it can perfectly fit training views but does not guar… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: This work has been published in NeurIPS 2023

  20. arXiv:2312.06947  [pdf, other

    cs.CV

    MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

    Authors: Kangneng Zhou, Daiheng Gao, Xuan Wang, Jie Zhang, Peng Zhang, Xusen Sun, Longhao Zhang, Shiqi Yang, Bang Zhang, Liefeng Bo, Yaxing Wang, Ming-Ming Cheng

    Abstract: 3D-aware portrait editing has a wide range of applications in multiple fields. However, current approaches are limited due that they can only perform mask-guided or text-based editing. Even by fusing the two procedures into a model, the editing quality and stability cannot be ensured. To address this limitation, we propose \textbf{MaTe3D}: mask-guided text-based 3D-aware portrait editing. In this… ▽ More

    Submitted 3 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 13 pages, 13 figures

  21. arXiv:2312.01841  [pdf, other

    cs.CV

    VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

    Authors: Xusen Sun, Longhao Zhang, Hao Zhu, Peng Zhang, Bang Zhang, Xinya Ji, Kangneng Zhou, Daiheng Gao, Liefeng Bo, Xun Cao

    Abstract: Audio-driven talking head generation has drawn much attention in recent years, and many efforts have been made in lip-sync, expressive facial expressions, natural head pose generation, and high video quality. However, no model has yet led or tied on all these metrics due to the one-to-many map** between audio and motion. In this paper, we propose VividTalk, a two-stage generic framework that sup… ▽ More

    Submitted 6 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 10 pages, 8 figures

  22. arXiv:2311.17117  [pdf, other

    cs.CV

    Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

    Authors: Li Hu, Xin Gao, Peng Zhang, Ke Sun, Bang Zhang, Liefeng Bo

    Abstract: Character Animation aims to generating character videos from still images through driving signals. Currently, diffusion models have become the mainstream in visual generation research, owing to their robust generative capabilities. However, challenges persist in the realm of image-to-video, especially in character animation, where temporally maintaining consistency with detailed information from c… ▽ More

    Submitted 13 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Page: https://humanaigc.github.io/animate-anyone/

  23. arXiv:2311.16918  [pdf, other

    cs.CV cs.AI

    RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

    Authors: Lingteng Qiu, Guanying Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han

    Abstract: Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric prior and the complex entanglement of materials and lighting in natural images. Existing methods have shown promise by first creating the geometry through score-distillation sampling (SDS) applied to rendered surface normals, followed by appearance modeling. However, relying on a 2D RGB diffusion model to… ▽ More

    Submitted 24 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Project Page: https://aigc3d.github.io/richdreamer/

  24. arXiv:2311.14318  [pdf, ps, other

    q-fin.PM math.OC

    On optimal tracking portfolio in incomplete markets: The classical control and the reinforcement learning approaches

    Authors: Lijun Bo, Yijie Huang, Xiang Yu

    Abstract: This paper studies an infinite horizon optimal tracking portfolio problem using capital injection in incomplete market models. We consider the benchmark process modelled by a geometric Brownian motion with zero drift driven by some unhedgeable risk. The relaxed tracking formulation is adopted where the portfolio value compensated by the injected capital needs to outperform the benchmark process at… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: Optimal tracking portfolio, capital injection, incomplete market, stochastic control with reflection, continuous-time reinforcement learning, q-learning

  25. arXiv:2311.04555  [pdf, ps, other

    math.OC

    De Finetti's Control Problem with Poisson Observations under Spectrally Positive Markov Additive Process

    Authors: Lijun Bo, Wenyuan Wang, Kaixin Yan

    Abstract: We study a De Finetti's optimal dividend and capital injection problem under a Markov additive model. The surplus process before dividend and capital injection is assumed to follow a spectrally positive Markov additive process (MAP). Dividend payments are made only at the jump times of an independent Poisson process and capitals are injected to avoid bankruptcy. The aim of the paper is to characte… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 33 pages. arXiv admin note: substantial text overlap with arXiv:2210.07549

    MSC Class: 60G51; 93E20; 91G80

  26. arXiv:2310.17170  [pdf, other

    cs.CV

    DecoderTracker: Decoder-Only Method for Multiple-Object Tracking

    Authors: Liao Pan, Yang Feng, Wu Di, Liu Bo, Zhang Xingle

    Abstract: Decoder-only models, such as GPT, have demonstrated superior performance in many areas compared to traditional encoder-decoder structure transformer models. Over the years, end-to-end models based on the traditional transformer structure, like MOTR, have achieved remarkable performance in multi-object tracking. However, the significant computational resource consumption of these models leads to le… ▽ More

    Submitted 23 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  27. arXiv:2309.09602  [pdf, other

    cs.CL cs.AI cs.LG

    Proposition from the Perspective of Chinese Language: A Chinese Proposition Classification Evaluation Benchmark

    Authors: Conghui Niu, Mengyang Hu, Lin Bo, Xiaoli He, Dong Yu, Pengyuan Liu

    Abstract: Existing propositions often rely on logical constants for classification. Compared with Western languages that lean towards hypotaxis such as English, Chinese often relies on semantic or logical understanding rather than logical connectives in daily expressions, exhibiting the characteristics of parataxis. However, existing research has rarely paid attention to this issue. And accurately classifyi… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  28. arXiv:2308.04288  [pdf, other

    cs.CV

    Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

    Authors: Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, Qixing Huang

    Abstract: Fabricating and designing 3D garments has become extremely demanding with the increasing need for synthesizing realistic dressed persons for a variety of applications, e.g. 3D virtual try-on, digitalization of 2D clothes into 3D apparel, and cloth animation. It thus necessitates a simple and straightforward pipeline to obtain high-quality texture from simple input, such as 2D reference images. Sin… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 15 pages, 15 figures

  29. arXiv:2307.10583  [pdf

    cs.CR

    Deep fused flow and topology features for botnet detection basing on pretrained GCN

    Authors: Meng Xiaoyuan, Lang bo, Yanxi Liu, Yuhao Yan

    Abstract: Nowadays, botnets have become one of the major threats to cyber security. The characteristics of botnets are mainly reflected in bots network behavior and their intercommunication relationships. Existing botnet detection methods use flow features or topology features individually, which overlook the other type of feature. This affects model performance. In this paper, we propose a botnet detection… ▽ More

    Submitted 24 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

  30. Controllable Creation of Skyrmion Bags in a Ferromagnetic Nanodisk

    Authors: Lan Bo, Rongzhi Zhao, Chenglong Hu, Xichao Zhang, Xuefeng Zhang, Masahito Mochizuki

    Abstract: Skyrmion bags are composed of an outer skyrmion and arbitrary inner skyrmions, which have recently been observed in bulk chiral magnets, but still remain elusive in magnetic films. Here, we propose a method of creating skyrmion bags in a thin-film nanodisk, which includes three steps. Firstly, the size of outer skyrmion is enlarged by a vertical magnetic field, then inner skyrmions are nucleated a… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Journal ref: Physical Review B 107, 224431 (2023)

  31. arXiv:2306.08312  [pdf, ps, other

    math.PR math.AP

    A decomposition-homogenization method for Robin boundary problems on the nonnegative orthant

    Authors: Lijun Bo, Yijie Huang, Xiang Yu

    Abstract: This paper studies the existence and uniqueness of a classical solution to a type of Robin boundary problems on the nonnegative orthant. We propose a new decomposition-homogenization method for the Robin boundary problem based on probabilistic representations, which leads to two auxiliary Robin boundary problems admitting some simplified probabilistic representations. The auxiliary probabilistic r… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Keywords: Robin boundary problem, decomposition-homogenization method, probabilistic representation, classical solution, stochastic flow analysis

  32. arXiv:2305.13705  [pdf, other

    cs.CV

    DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models

    Authors: Lijun Li, Li'an Zhuo, Bang Zhang, Liefeng Bo, Chen Chen

    Abstract: Hand mesh reconstruction from the monocular image is a challenging task due to its depth ambiguity and severe occlusion, there remains a non-unique map** between the monocular image and hand mesh. To address this, we develop DiffHand, the first diffusion-based framework that approaches hand mesh reconstruction as a denoising diffusion process. Our one-stage pipeline utilizes noise to model the u… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  33. arXiv:2305.12497  [pdf, other

    cs.CV

    PanoContext-Former: Panoramic Total Scene Understanding with a Transformer

    Authors: Yuan Dong, Chuan Fang, Liefeng Bo, Zilong Dong, ** Tan

    Abstract: Panoramic image enables deeper understanding and more holistic perception of $360^\circ$ surrounding environment, which can naturally encode enriched scene context information compared to standard perspective image. Previous work has made lots of effort to solve the scene understanding task in a bottom-up form, thus each sub-task is processed separately and few correlations are explored in this pr… ▽ More

    Submitted 5 June, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  34. arXiv:2305.04808  [pdf, other

    cs.CL

    CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning

    Authors: Weiqi Wang, Tianqing Fang, Baixuan Xu, Chun Yi Louis Bo, Yangqiu Song, Lei Chen

    Abstract: Commonsense reasoning, aiming at endowing machines with a human-like ability to make situational presumptions, is extremely challenging to generalize. For someone who barely knows about "meditation," while is knowledgeable about "singing," he can still infer that "meditation makes people relaxed" from the existing knowledge that "singing makes people relaxed" by first conceptualizing "singing" as… ▽ More

    Submitted 10 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: ACL2023 Main Conference

  35. arXiv:2304.10802  [pdf, other

    math.OC q-fin.PM

    An extended Merton problem with relaxed benchmark tracking

    Authors: Lijun Bo, Yijie Huang, Xiang Yu

    Abstract: This paper studies a Merton's optimal portfolio and consumption problem in an extended formulation incorporating the tracking of a benchmark process described by a geometric Brownian motion. We consider a relaxed tracking formulation such that the wealth process compensated by a fictitious capital injection outperforms the benchmark at all times. The fund manager aims to maximize the expected util… ▽ More

    Submitted 7 March, 2024; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: Keywords: Benchmark tracking, capital injection, expected largest shortfall, consumption and portfolio choice, Neumann boundary condition

  36. arXiv:2304.05097  [pdf, other

    cs.CV

    One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field

    Authors: Weichuang Li, Longhao Zhang, Dong Wang, Bin Zhao, Zhigang Wang, Mulin Chen, Bang Zhang, Zhongjian Wang, Liefeng Bo, Xuelong Li

    Abstract: Talking head generation aims to generate faces that maintain the identity information of the source image and imitate the motion of the driving image. Most pioneering methods rely primarily on 2D representations and thus will inevitably suffer from face distortion when large head rotations are encountered. Recent works instead employ explicit 3D structural representations or implicit neural render… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023

  37. arXiv:2304.04351  [pdf, other

    cs.CV

    Evaluate Geometry of Radiance Fields with Low-frequency Color Prior

    Authors: Qihang Fang, Yafei Song, Keqiang Li, Li Shen, Huaiyu Wu, Gang Xiong, Liefeng Bo

    Abstract: A radiance field is an effective representation of 3D scenes, which has been widely adopted in novel-view synthesis and 3D reconstruction. It is still an open and challenging problem to evaluate the geometry, i.e., the density field, as the ground-truth is almost impossible to obtain. One alternative indirect solution is to transform the density field into a point-cloud and compute its Chamfer Dis… ▽ More

    Submitted 17 January, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: This paper has been accepted by AAAI 2024

  38. arXiv:2304.04233  [pdf, other

    cs.CR

    ODDFUZZ: Discovering Java Deserialization Vulnerabilities via Structure-Aware Directed Greybox Fuzzing

    Authors: Sicong Cao, Biao He, Xiaobing Sun, Yu Ouyang, Chao Zhang, Xiaoxue Wu, Ting Su, Lili Bo, Bin Li, Chuanlei Ma, Jiajia Li, Tao Wei

    Abstract: Java deserialization vulnerability is a severe threat in practice. Researchers have proposed static analysis solutions to locate candidate vulnerabilities and fuzzing solutions to generate proof-of-concept (PoC) serialized objects to trigger them. However, existing solutions have limited effectiveness and efficiency. In this paper, we propose a novel hybrid solution ODDFUZZ to efficiently discover… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: To appear in the Main Track of IEEE S&P 2023

  39. arXiv:2303.07593  [pdf, other

    cs.CR cs.SE

    Improving Java Deserialization Gadget Chain Mining via Overriding-Guided Object Generation

    Authors: Sicong Cao, Xiaobing Sun, Xiaoxue Wu, Lili Bo, Bin Li, Rongxin Wu, Wei Liu, Biao He, Yu Ouyang, Jiajia Li

    Abstract: Java (de)serialization is prone to causing security-critical vulnerabilities that attackers can invoke existing methods (gadgets) on the application's classpath to construct a gadget chain to perform malicious behaviors. Several techniques have been proposed to statically identify suspicious gadget chains and dynamically generate injection objects for fuzzing. However, due to their incomplete supp… ▽ More

    Submitted 3 April, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: To appear in the Technical Track of ICSE 2023

  40. arXiv:2303.06095  [pdf, other

    cs.IR cs.AI

    HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

    Authors: Jie Zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu

    Abstract: Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture. However, the MoE-based method, which aims to project all information in the same feature space, cannot effectively deal with the… ▽ More

    Submitted 13 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  41. Multi-Behavior Graph Neural Networks for Recommender System

    Authors: Lianghao Xia, Chao Huang, Yong Xu, Peng Dai, Liefeng Bo

    Abstract: Recommender systems have been demonstrated to be effective to meet user's personalized interests for many online services (e.g., E-commerce and online advertising platforms). Recent years have witnessed the emerging success of many deep learning-based recommendation models for augmenting collaborative filtering architectures with various neural network architectures, such as multi-layer perceptron… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: Published at IEEE Transactions on Nueral Networks and Learning Systems, 2022

  42. arXiv:2302.08302  [pdf, other

    math.OC q-fin.MF

    Stochastic control problems with state-reflections arising from relaxed benchmark tracking

    Authors: Lijun Bo, Yijie Huang, Xiang Yu

    Abstract: This paper studies stochastic control problems motivated by optimal consumption with wealth benchmark tracking. The benchmark process is modeled by a combination of a geometric Brownian motion and a running maximum process, indicating its increasing trend in the long run. We consider a relaxed tracking formulation such that the wealth compensated by the injected capital always dominates the benchm… ▽ More

    Submitted 25 April, 2024; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Keywords: Relaxed benchmark tracking, optimal consumption, Neumann boundary conditions, probabilistic representation, reflected diffusion process

  43. arXiv:2212.04701  [pdf, other

    cs.CV

    4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions

    Authors: Zhongshu Wang, Lingzhi Li, Zhen Shen, Li Shen, Liefeng Bo

    Abstract: In this paper, we present a novel and effective framework, named 4K-NeRF, to pursue high fidelity view synthesis on the challenging scenarios of ultra high resolutions, building on the methodology of neural radiance fields (NeRF). The rendering procedure of NeRF-based methods typically relies on a pixel-wise manner in which rays (or pixels) are treated independently on both training and inference… ▽ More

    Submitted 3 April, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  44. arXiv:2211.16386  [pdf, other

    cs.CV

    Compressing Volumetric Radiance Fields to 1 MB

    Authors: Lingzhi Li, Zhen Shen, Zhongshu Wang, Li Shen, Liefeng Bo

    Abstract: Approximating radiance fields with volumetric grids is one of promising directions for improving NeRF, represented by methods like Plenoxels and DVGO, which achieve super-fast training convergence and real-time rendering. However, these methods typically require a tremendous storage overhead, costing up to hundreds of megabytes of disk space and runtime memory for a single scene. We address this i… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  45. arXiv:2211.09035  [pdf, other

    cs.CV

    A Creative Industry Image Generation Dataset Based on Captions

    Authors: Xiang Yuejia, Lv Chuanhao, Liu Qingdazhu, Yang Xiaocui, Liu Bo, Ju Meizhi

    Abstract: Most image generation methods are difficult to precisely control the properties of the generated images, such as structure, scale, shape, etc., which limits its large-scale application in creative industries such as conceptual design and graphic design, and so on. Using the prompt and the sketch is a practical solution for controllability. Existing datasets lack either prompt or sketch and are not… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  46. arXiv:2210.17129  [pdf

    physics.plasm-ph

    Observation of tungsten impurity suppression with ECRH by an X-ray Crystal Spectrometer on EAST

    Authors: Lin Zichao, Zhang Hongming, Wang Fudi, Bae Chenonho, Fu Jia, Shen Yongcai, Lu Dian, ** Yifei, He Liang, Wang Minrui, Lin Guangle, Ye Kaixuan, Wang Shouxin, Zhao Hailin, Lyu Bo

    Abstract: Impurity degrades tokamak plasmas confinement by causing energy loss, diluting the fuel concentration, even terminating the discharges in some extreme cases. Previously, the suppression effects of on-axis Electron Cyclotron Resonance Heating (ECRH) on the impurity accumulation have been investigated on EAST by the extreme ultraviolet (EUV) spectroscopy. However, it is difficult to quantify the cha… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

  47. arXiv:2210.07549  [pdf, ps, other

    math.OC

    On De Finetti's control under Poisson observations: optimality of a double barrier strategy in a Markov additive model

    Authors: Lijun Bo, Wenyuan Wang, Kaixin Yan

    Abstract: In this paper we consider the De Finetti's optimal dividend and capital injection problem under a Markov additive model. We assume that the surplus process before dividends and capital injections follows a spectrally positive Markov additive process. Dividend payments are made only at the jump times of an independent Poisson process. Capitals are required to be injected whenever needed to ensure a… ▽ More

    Submitted 26 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2207.02661

  48. arXiv:2209.12304  [pdf

    stat.ME

    Issues in Implementing Regression Calibration Analyses

    Authors: Lillian Boe, Pamela A. Shaw, Douglas Midthune, Paul Gustafson, Victor Kipnis, Eunyoung Park, Daniela Sotres-Alvarez, Laurence Freedman

    Abstract: Regression calibration is a popular approach for correcting biases in estimated regression parameters when exposure variables are measured with error. This approach involves building a calibration equation to estimate the value of the unknown true exposure given the error-prone measurement and other confounding covariates. The estimated, or calibrated, exposure is then substituted for the true exp… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

  49. arXiv:2209.10061  [pdf, ps, other

    stat.ME stat.AP

    Practical considerations for sandwich variance estimation in two-stage regression settings

    Authors: Lillian A. Boe, Thomas Lumley, Pamela A. Shaw

    Abstract: We present a practical approach for computing the sandwich variance estimator in two-stage regression model settings. As a motivating example for two-stage regression, we consider regression calibration, a popular approach for addressing covariate measurement error. The sandwich variance approach has been rarely applied in regression calibration, despite that it requires less computation time than… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 18 pages of main manuscript including 2 figures and 4 tables; 14 pages of supplementary materials and references (including 2 tables)

  50. arXiv:2206.13341  [pdf, other

    q-fin.MF math.OC

    A mean field game approach to equilibrium consumption under external habit formation

    Authors: Lijun Bo, Shihua Wang, Xiang Yu

    Abstract: This paper studies the equilibrium consumption under external habit formation in a large population of agents. We first formulate problems under two types of conventional habit formation preferences, namely linear and multiplicative external habit formation, in a mean field game framework. In a log-normal market model with the asset specialization, we characterize one mean field equilibrium in ana… ▽ More

    Submitted 8 March, 2024; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Keywords: Catching up with the Joneses, linear habit formation, multiplicative habit formation, mean field equilibrium, approximate Nash equilibrium