Showing 1–2 of 2 results for author: Skalic, M

Search v0.5.6 released 2020-02-24

arXiv:1910.11631 [pdf, other]

cs.CV

Learning to Localize Temporal Events in Large-scale Video Data

Authors: Mikel Bober-Irizar, Miha Skalic, David Austin

Abstract: We address temporal localization of events in large-scale video data, in the context of the Youtube-8M Segments dataset. This emerging field within video recognition can enable applications to identify the precise time a specified event occurs in a video, which has broad implications for video search. To address this we present two separate approaches: (1) a gradient boosted decision tree model on a crafted dataset and (2) a combination of deep learning models based on frame-level data, video-level data, and a localization model. The combinations of these two approaches achieved 5th place in the 3rd Youtube-8M video recognition challenge.

Submitted 25 October, 2019; originally announced October 2019.

Comments: ICCV 2019, 3rd Youtube-8M Workshop

arXiv:1706.04572 [pdf, other]

stat.ML cs.CV cs.LG

Deep Learning Methods for Efficient Large Scale Video Labeling

Authors: Miha Skalic, Marcin Pekalski, Xingguo E. Pan

Abstract: We present a solution to "Google Cloud and YouTube-8M Video Understanding Challenge" that ranked 5th place. The proposed model is an ensemble of three model families, two frame level and one video level. The training was performed on augmented dataset, with cross validation.

Submitted 14 June, 2017; originally announced June 2017.

Comments: 7 pages, 5 tables, 1 figure
