-
Learning to Localize Temporal Events in Large-scale Video Data
Abstract: We address temporal localization of events in large-scale video data, in the context of the YouTube-8M Segments dataset. This emerging field within video recognition can enable applications to identify the precise time a specified event occurs in a video, which has broad implications for video search. To address this we present two separate approaches: (1) a gradient boosted decision tree model on…
Submitted 25 October, 2019; originally announced October 2019.
Comments: ICCV 2019, 3rd YouTube-8M Workshop
-
Deep Learning Methods for Efficient Large Scale Video Labeling
Abstract: We present a solution to the "Google Cloud and YouTube-8M Video Understanding Challenge" that placed 5th. The proposed model is an ensemble of three model families: two frame-level and one video-level. Training was performed on an augmented dataset, with cross-validation.
Submitted 14 June, 2017; originally announced June 2017.
Comments: 7 pages, 5 tables, 1 figure