Showing 1–22 of 22 results for author: Shao, D

  1. arXiv:2407.02157  [pdf, other]

    cs.CV cs.HC

    FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs

    Authors: Haodong Chen, Haojian Huang, Junhao Dong, Mingzhe Zheng, Dian Shao

    Abstract: Dynamic Facial Expression Recognition (DFER) is crucial for understanding human behavior. However, current methods exhibit limited performance mainly due to the scarcity of high-quality data, the insufficient utilization of facial dynamics, and the ambiguity of expression semantics, etc. To this end, we propose a novel framework, named Multi-modal Fine-grained CLIP for Dynamic Facial Expression Re…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Project Page: https://haroldchen19.github.io/FineCLIPER-Page/

  2. arXiv:2405.08498  [pdf, other]

    cs.LG stat.ML

    Learning Decision Policies with Instrumental Variables through Double Machine Learning

    Authors: Daqian Shao, Ashkan Soleymani, Francesco Quinzan, Marta Kwiatkowska

    Abstract: A common issue in learning decision-making policies in data-rich settings is spurious correlations in the offline dataset, which can be caused by hidden confounders. Instrumental variable (IV) regression, which utilises a key unconfounded variable known as the instrument, is a standard technique for learning causal relationships between confounded action, outcome, and context variables. Most recen…

    Submitted 28 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024

  3. arXiv:2405.07472  [pdf, other]

    cs.CV

    GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting

    Authors: Haodong Chen, Yongle Huang, Haojian Huang, Xiangsheng Ge, Dian Shao

    Abstract: The increasing prominence of e-commerce has underscored the importance of Virtual Try-On (VTON). However, previous studies predominantly focus on the 2D realm and rely heavily on extensive data for training. Research on 3D VTON primarily centers on garment-body shape compatibility, a topic extensively covered in 2D VTON. Thanks to advances in 3D scene editing, a 2D diffusion model has now been ada…

    Submitted 23 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: On-going work

  4. arXiv:2403.19969  [pdf, other]

    cs.CV cs.LG

    Separate, Dynamic and Differentiable (SMART) Pruner for Block/Output Channel Pruning on Computer Vision Tasks

    Authors: Guanhua Ding, Zexi Ye, Zhen Zhong, Gang Li, David Shao

    Abstract: Deep Neural Network (DNN) pruning has emerged as a key strategy to reduce model size, improve inference latency, and lower power consumption on DNN accelerators. Among various pruning techniques, block and output channel pruning have shown significant potential in accelerating hardware performance. However, their accuracy often requires further improvement. In response to this challenge, we introd…

    Submitted 29 March, 2024; originally announced March 2024.

  5. arXiv:2401.05338  [pdf, other]

    cs.CV cs.LG

    STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers

    Authors: Daqian Shao, Lukas Fesser, Marta Kwiatkowska

    Abstract: Robustness certification, which aims to formally certify the predictions of neural networks against adversarial inputs, has become an important tool for safety-critical applications. Despite considerable progress, existing certification methods are limited to elementary architectures, such as convolutional networks, recurrent networks and recently Transformers, on benchmark datase…

    Submitted 28 November, 2023; originally announced January 2024.

  6. arXiv:2309.00942  [pdf, other]

    cs.CV

    Tracking without Label: Unsupervised Multiple Object Tracking via Contrastive Similarity Learning

    Authors: Sha Meng, Dian Shao, Jiacheng Guo, Shan Gao

    Abstract: Unsupervised learning is a challenging task due to the lack of labels. Multiple Object Tracking (MOT), which inevitably suffers from mutual object interference, occlusion, etc., is even more difficult without label supervision. In this paper, we explore the latent consistency of sample features across video frames and propose an Unsupervised Contrastive Similarity Learning method, named UCSL, incl…

    Submitted 2 September, 2023; originally announced September 2023.

  7. arXiv:2308.15474  [pdf, other]

    cs.CV cs.AI q-bio.TO

    A General-Purpose Self-Supervised Model for Computational Pathology

    Authors: Richard J. Chen, Tong Ding, Ming Y. Lu, Drew F. K. Williamson, Guillaume Jaume, Bowen Chen, Andrew Zhang, Daniel Shao, Andrew H. Song, Muhammad Shaban, Mane Williams, Anurag Vaidya, Sharifa Sahai, Lukas Oldenburg, Luca L. Weishaupt, Judy J. Wang, Walt Williams, Long Phi Le, Georg Gerber, Faisal Mahmood

    Abstract: Tissue phenotyping is a fundamental computational pathology (CPath) task in learning objective characterizations of histopathologic biomarkers in anatomic pathology. However, whole-slide imaging (WSI) poses a complex computer vision problem in which the large-scale image resolutions of WSIs and the enormous diversity of morphological phenotypes preclude large-scale data annotation. Current efforts…

    Submitted 29 August, 2023; originally announced August 2023.

  8. arXiv:2305.04007  [pdf, other]

    cs.CV

    Weighted Point Cloud Normal Estimation

    Authors: Weijia Wang, Xuequan Lu, Di Shao, Xiao Liu, Richard Dazeley, Antonio Robles-Kelly, Wei Pan

    Abstract: Existing normal estimation methods for point clouds are often less robust to severe noise and complex geometric structures. Also, they usually ignore the contributions of different neighbouring points during normal estimation, which leads to less accurate results. In this paper, we introduce a weighted normal estimation method for 3D point cloud data. We innovate in two key points: 1) we develop a…

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: Accepted by ICME 2023

  9. arXiv:2305.01381  [pdf, other]

    cs.LG cs.AI cs.FL cs.RO

    Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees

    Authors: Daqian Shao, Marta Kwiatkowska

    Abstract: Linear Temporal Logic (LTL) is widely used to specify high-level objectives for system policies, and it is highly desirable for autonomous systems to learn the optimal policy with respect to such specifications. However, learning the optimal policy from LTL specifications is not trivial. We present a model-free Reinforcement Learning (RL) approach that efficiently learns an optimal policy for an u…

    Submitted 3 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted at the International Joint Conference on Artificial Intelligence 2023 (IJCAI)

    Journal ref: IJCAI/2023/0465

  10. arXiv:2304.13390  [pdf, other]

    cs.CV

    Group Equivariant BEV for 3D Object Detection

    Authors: Hongwei Liu, Jian Yang, Jianfeng Zhang, Dongheng Shao, Jielong Guo, Shaobo Li, Xuan Tang, Xian Wei

    Abstract: Recently, 3D object detection has attracted significant attention and achieved continuous improvement in real road scenarios. The environmental information is collected from a single sensor or multi-sensor fusion to detect interested objects. However, most of the current 3D object detection approaches focus on developing advanced network architectures to improve the detection precision of the obje…

    Submitted 28 June, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 8 pages, 3 figures

    MSC Class: 68T45; ACM Class: I.4.9

  11. Using Consensual Biterms from Text Structures of Requirements and Code to Improve IR-Based Traceability Recovery

    Authors: Hui Gao, Hongyu Kuang, Kexin Sun, Xiaoxing Ma, Alexander Egyed, Patrick Mäder, Guoping Rong, Dong Shao, He Zhang

    Abstract: Traceability approves trace links among software artifacts based on whether two artifacts are related by system functionalities. The traces are valuable for software development, but are difficult to obtain manually. To cope with the costly and fallible manual recovery, automated approaches are proposed to recover traces through textual similarities among software artifacts, such as those based on…

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: Accepted by the 37th IEEE/ACM International Conference on Automated Software Engineering (ASE 2022)

  12. A Cross-Company Ethnographic Study on Software Teams for DevOps and Microservices: Organization, Benefits, and Issues

    Authors: Xin Zhou, Huang Huang, He Zhang, Xin Huang, Dong Shao, Chenxing Zhong

    Abstract: Context: DevOps and microservices are acknowledged to be important new paradigms to tackle contemporary software demands and provide capabilities for rapid and reliable software development. Industrial reports show that they are quickly adopted together in massive software companies. However, because of the technical and organizational requirements, many difficulties against efficient implementati…

    Submitted 3 May, 2022; originally announced May 2022.

  13. arXiv:2201.02198  [pdf, other]

    eess.IV cs.CV cs.LG

    3D Intracranial Aneurysm Classification and Segmentation via Unsupervised Dual-branch Learning

    Authors: Di Shao, Xuequan Lu, Xiao Liu

    Abstract: Intracranial aneurysms are common nowadays and how to detect them intelligently is of great significance in digital health. While most existing deep learning research focused on medical images in a supervised way, we introduce an unsupervised method for the detection of intracranial aneurysms based on 3D point cloud data. In particular, our method consists of two stages: unsupervised pre-training…

    Submitted 16 January, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: under review (corresponding: {[email protected]})

  14. arXiv:2103.13154  [pdf]

    cs.SE

    Exploiting the Unique Expression for Improved Sentiment Analysis in Software Engineering Text

    Authors: Kexin Sun, Hui Gao, Hongyu Kuang, Xiaoxing Ma, Guoping Rong, Dong Shao, He Zhang

    Abstract: Sentiment analysis on software engineering (SE) texts has been widely used in SE research, such as evaluating app reviews or analyzing developers' sentiments in commit messages. To better support the use of automated sentiment analysis for SE tasks, researchers built an SE-domain-specified sentiment dictionary to further improve the accuracy of the results. Unfortunately, recent work reported t…

    Submitted 24 March, 2021; originally announced March 2021.

  15. arXiv:2010.01007  [pdf, other]

    cs.CV

    DecAug: Augmenting HOI Detection via Decomposition

    Authors: Yichen Xie, Hao-Shu Fang, Dian Shao, Yong-Lu Li, Cewu Lu

    Abstract: Human-object interaction (HOI) detection requires a large amount of annotated data. Current algorithms suffer from insufficient training samples and category imbalance within datasets. To increase data efficiency, in this paper, we propose an efficient and effective data augmentation method called DecAug for HOI detection. Based on our proposed object state similarity metric, object patterns acros…

    Submitted 2 October, 2020; originally announced October 2020.

  16. arXiv:2010.01005  [pdf, other]

    cs.CV

    DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection

    Authors: Hao-Shu Fang, Yichen Xie, Dian Shao, Cewu Lu

    Abstract: In recent years, human-object interaction (HOI) detection has achieved impressive advances. However, conventional two-stage methods are usually slow in inference. On the other hand, existing one-stage methods mainly focus on the union regions of interactions, which introduce unnecessary visual information as disturbances to HOI detection. To tackle the problems above, we propose a novel one-stage HOI…

    Submitted 19 January, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Paper is accepted. Code available at: https://github.com/MVIG-SJTU/DIRV

  17. arXiv:2008.04762  [pdf]

    physics.soc-ph cs.CY

    A validated multi-agent simulation test bed to evaluate congestion pricing policies on population segments by time-of-day in New York City

    Authors: Brian Yueshuai He, Jinkai Zhou, Ziyi Ma, Ding Wang, Di Sha, Mina Lee, Joseph Y. J. Chow, Kaan Ozbay

    Abstract: Evaluation of the demand for emerging transportation technologies and policies can vary by time of day due to spillbacks on roadways, rescheduling of travelers' activity patterns, and shifting to other modes that affect the level of congestion. These effects are not well-captured with static travel demand models. We calibrate and validate the first open-source multi-agent simulation model for New…

    Submitted 21 December, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Journal ref: Transport Policy 101 (2021) 145-161

  18. arXiv:2005.10229  [pdf, other]

    cs.CV

    Intra- and Inter-Action Understanding via Temporal Action Parsing

    Authors: Dian Shao, Yue Zhao, Bo Dai, Dahua Lin

    Abstract: Current methods for action recognition primarily rely on deep convolutional networks to derive feature embeddings of visual and motion features. While these methods have demonstrated remarkable performance on standard benchmarks, we are still in need of a better understanding as to how the videos, in particular their internal structures, relate to high-level semantics, which may lead to benefits i…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: CVPR 2020 Poster; Project page: https://sdolivia.github.io/TAPOS/

  19. arXiv:2004.06704  [pdf, other]

    cs.CV

    FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding

    Authors: Dian Shao, Yue Zhao, Bo Dai, Dahua Lin

    Abstract: On public benchmarks, current action recognition techniques have achieved great success. However, when used in real-world applications, e.g. sport analysis, which requires the capability of parsing an activity into phases and differentiating between subtly different actions, their performances remain far from being satisfactory. To take action recognition to a new level, we develop FineGym, a new…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 Oral (3 strong accepts); Project page: https://sdolivia.github.io/FineGym/

  20. arXiv:1901.00748  [pdf]

    cs.CY

    The Impact of Countdown Clocks on Subway Ridership in New York City

    Authors: Zhengbo Zou, Di Sha

    Abstract: Protecting the passengers' safety and increasing ridership are two never ending pursuits of public transit agencies. One of the proposed methods to achieve both goals for subway service is to implement real time train arriving countdown clocks in subway stations. Metropolitan Transportation Authority (MTA) of New York City (NYC) chose to install such countdown clocks in their stations starting fro…

    Submitted 26 December, 2018; originally announced January 2019.

  21. arXiv:1108.3980  [pdf]

    cs.RO

    Three-dimensional Torques and Power of Horse Forelimb Joints at Trot

    Authors: H. M. Clayton, D. H. Sha, D. R. Mullineaux

    Abstract: Reasons for Performing Study: Equine gait analysis has focused on 2D analysis in the sagittal plane, while descriptions of 3D kinetics and ground reaction force could provide more information on the Equine gait analysis. Hypothesis or Objectives: The aim of this study was to characterize the 3D torques and powers of the forelimb joints at trotting. Methods: Eight sound horses were used in the st…

    Submitted 19 August, 2011; originally announced August 2011.

    Comments: 18 pages, 4 figures, 15 tables

  22. arXiv:1004.1997  [pdf]

    cs.NE cs.DC cs.LG

    An optimized recursive learning algorithm for three-layer feedforward neural networks for mimo nonlinear system identifications

    Authors: Daohang Sha, Vladimir B. Bajic

    Abstract: Back-propagation with gradient method is the most popular learning algorithm for feed-forward neural networks. However, it is critical to determine a proper fixed learning rate for the algorithm. In this paper, an optimized recursive algorithm is presented for online learning based on matrix operation and optimization methods analytically, which can avoid the trouble to select a proper learning ra…

    Submitted 12 April, 2010; originally announced April 2010.

    Comments: 15 pages, 5 figures

    Journal ref: Intelligent Automation and Soft Computing, Vol. 17, No. 2, pp. 133-147, 2011