CASR: Refining Action Segmentation via Marginalizing Frame-level Causal Relationships

Du, Keqing; Yang, Xinyu; Chen, Hang

arXiv:2311.12401v3 (cs)

A newer version of this paper has been withdrawn by Keqing Du

[Submitted on 21 Nov 2023 (v1), revised 15 Jan 2024 (this version, v3), latest version 26 Jan 2024 (v4)]

Abstract: Integrating deep learning with causal discovery has increased the interpretability of Temporal Action Segmentation (TAS). However, frame-level causal relationships contain considerable noise that does not appear at the segment level, which makes it infeasible to express macro-level action semantics directly from them. We therefore propose the Causal Abstraction Segmentation Refiner (CASR), which refines the TAS results of various backbone models by enhancing video causality through marginalizing frame-level causal relationships. Specifically, we define equivalent frame-level and segment-level causal models, so that the causal adjacency matrix constructed from marginalized frame-level causal relationships can represent the segment-level causal relationships. CASR works by reducing the difference between the causal adjacency matrix we construct and the one derived from the pre-segmentation results of the backbone models. In addition, we propose a novel evaluation metric, Causal Edit Distance (CED), to evaluate causal interpretability. Extensive experiments on mainstream datasets indicate that CASR significantly surpasses various existing methods in action segmentation performance, as well as in causal explainability and generalization.
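
The abstract does not spell out how the marginalization or the CED metric is computed, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a binary frame-level adjacency matrix, assumes that a segment-level edge exists whenever any frame in one segment points to any frame in another, and uses a plain count of differing edges as a stand-in for an edit distance between adjacency matrices. The helper names `segments_from_labels`, `marginalize`, and `adjacency_edit_distance` are hypothetical.

```python
from typing import List

Matrix = List[List[int]]


def segments_from_labels(labels: List[str]) -> List[int]:
    """Map each frame to the index of the contiguous segment it belongs to."""
    seg_ids: List[int] = []
    current = -1
    previous = object()  # sentinel that never equals a real label
    for label in labels:
        if label != previous:
            current += 1
            previous = label
        seg_ids.append(current)
    return seg_ids


def marginalize(frame_adj: Matrix, seg_ids: List[int]) -> Matrix:
    """Collapse a T x T frame-level adjacency matrix to S x S.

    Assumption (not from the paper): segment i -> segment j has an edge
    if any frame in i has an edge to any frame in j, with i != j.
    """
    n_seg = max(seg_ids) + 1 if seg_ids else 0
    seg_adj = [[0] * n_seg for _ in range(n_seg)]
    for u, row in enumerate(frame_adj):
        for v, edge in enumerate(row):
            su, sv = seg_ids[u], seg_ids[v]
            if edge and su != sv:
                seg_adj[su][sv] = 1
    return seg_adj


def adjacency_edit_distance(a: Matrix, b: Matrix) -> int:
    """Count the edge insertions and deletions needed to turn a into b.

    Assumption: a simple stand-in, not the paper's definition of CED.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("adjacency matrices must have the same shape")
    return sum(x != y for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Toy example: five frames, two actions, and a chain of frame-level edges.
labels = ["pour", "pour", "stir", "stir", "stir"]
seg_ids = segments_from_labels(labels)
frame_adj = [[0] * 5 for _ in range(5)]
for t in range(4):
    frame_adj[t][t + 1] = 1  # frame t influences frame t + 1

seg_adj = marginalize(frame_adj, seg_ids)
reference = [[0, 1], [1, 0]]  # hypothetical segment-level graph to compare against
distance = adjacency_edit_distance(seg_adj, reference)
```

In this toy case only the edge from frame 1 to frame 2 crosses a segment boundary, so the within-segment edges vanish after marginalization and a single segment-level edge remains.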

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2311.12401 [cs.CV]
	(or arXiv:2311.12401v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.12401

Submission history

From: Keqing Du
[v1] Tue, 21 Nov 2023 07:28:51 UTC (755 KB)
[v2] Fri, 24 Nov 2023 08:51:13 UTC (755 KB)
[v3] Mon, 15 Jan 2024 07:32:28 UTC (756 KB)
[v4] Fri, 26 Jan 2024 07:32:39 UTC (1 KB) (withdrawn)
