Search | arXiv e-print repository

doi 10.1117/1.JEI.26.1.011018

Discovering Characteristic Landmarks on Ancient Coins using Convolutional Networks

Abstract: In this paper, we propose a novel method to find characteristic landmarks on ancient Roman imperial coins using deep convolutional neural network models (CNNs). We formulate an optimization problem to discover class-specific regions while guaranteeing a specific, controlled loss of accuracy. Analysis of visualizations of the discovered regions confirms not only that the proposed method successfully finds a set of characteristic regions per class, but also that the discovered regions are consistent with human expert annotations. We also propose a new framework for recognizing Roman coins that exploits the hierarchical structure of ancient Roman coins, using the state-of-the-art classification power of CNNs adapted to the new task of coin classification. Experimental results show that the proposed framework effectively recognizes ancient Roman coins. For this research, we have collected a new Roman coin dataset in which all coins are annotated and consist of obverse (head) and reverse (tail) images.

Submitted 30 June, 2015; v1 submitted 30 June, 2015; originally announced June 2015.

arXiv:1506.09124 [pdf, other]

Multi-Cue Structure Preserving MRF for Unconstrained Video Segmentation

Authors: Saehoon Yi, Vladimir Pavlovic

Abstract: Video segmentation is a stepping stone to understanding video context. Video segmentation enables one to represent a video by decomposing it into coherent regions which comprise whole or parts of objects. However, the challenge originates from the fact that most video segmentation algorithms are based on unsupervised learning, due to the high cost of pixelwise video annotation and the intra-class variability within similar unconstrained video classes. We propose a Markov Random Field model for unconstrained video segmentation that relies on tight integration of multiple cues: vertices are defined from contour-based superpixels, unary potentials from temporally smooth label likelihood, and pairwise potentials from the global structure of a video. The multi-cue structure is a breakthrough in extracting coherent object regions from unconstrained videos in the absence of supervision. Our experiments on the VSB100 dataset show that the proposed model significantly outperforms competing state-of-the-art algorithms. Qualitative analysis illustrates that the video segmentation result of the proposed model is consistent with human perception of objects.

Submitted 30 June, 2015; originally announced June 2015.

arXiv:1506.08928 [pdf, other]

Fast ADMM Algorithm for Distributed Optimization with Adaptive Penalty

Authors: Changkyu Song, Sejong Yoon, Vladimir Pavlovic

Abstract: We propose new methods to speed up convergence of the Alternating Direction Method of Multipliers (ADMM), a common optimization tool in the context of large scale and distributed learning. The proposed method accelerates the speed of convergence by automatically deciding the constraint penalty needed for parameter consensus in each iteration. In addition, we also propose an extension of the method that adaptively determines the maximum number of iterations to update the penalty. We show that this approach effectively leads to an adaptive, dynamic network topology underlying the distributed optimization. The utility of the new penalty update schemes is demonstrated on both synthetic and real data, including a computer vision application of distributed structure from motion.

Submitted 29 June, 2015; originally announced June 2015.

Comments: 8 pages manuscript, 2 pages appendix, 5 figures

arXiv:1303.1966 [pdf, ps, other]

doi 10.1103/PhysRevA.89.042332

Nonreversal and nonrepeating quantum walks

Authors: T. J. Proctor, K. E. Barr, B. Hanson, S. Martiel, V. Pavlovic, A. Bullivant, V. M. Kendon

Abstract: We introduce a variation of the discrete time quantum walk, the nonreversal quantum walk, which does not step back onto a position which it has just occupied. This allows us to simulate a dimer and we achieve it by introducing a new type of coin operator. The nonrepeating walk, which never moves in the same direction in consecutive time steps, arises by a permutation of this coin operator. We describe the basic properties of both walks and prove that the even-order joint moments of the nonrepeating walker are independent of the initial condition, being determined by five parameters derived from the coin instead. Numerical evidence suggests that the same is the case for the nonreversal walk. This contrasts strongly with previously studied coins, such as the Grover operator, where the initial condition can be used to control the standard deviation of the walker.

Submitted 26 June, 2014; v1 submitted 8 March, 2013; originally announced March 2013.

Comments: v4: 8 pages 4 figures. Published version

Journal ref: Phys. Rev. A 89, 042332 (2014)

arXiv:1301.6731 [pdf]

Variational Learning in Mixed-State Dynamic Graphical Models

Authors: Vladimir Pavlovic, Brendan J. Frey, Thomas S. Huang

Abstract: Many real-valued stochastic time-series are locally linear (Gaussian), but globally non-linear. For example, the trajectory of a human hand gesture can be viewed as a linear dynamic system driven by a nonlinear dynamic system that represents muscle actions. We present a mixed-state dynamic graphical model in which a hidden Markov model drives a linear dynamic system. This combination allows us to model both the discrete and continuous causes of trajectories such as human gestures. The number of computations needed for exact inference is exponential in the sequence length, so we derive an approximate variational inference technique that can also be used to learn the parameters of the discrete and continuous models. We show how the mixed-state model and the variational technique can be used to classify human hand gestures made with a computer mouse.

Submitted 23 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI1999)

Report number: UAI-P-1999-PG-522-530

arXiv:1301.5063

Heteroscedastic Conditional Ordinal Random Fields for Pain Intensity Estimation from Facial Images

Authors: Ognjen Rudovic, Maja Pantic, Vladimir Pavlovic

Abstract: We propose a novel method for automatic pain intensity estimation from facial images based on the framework of kernel Conditional Ordinal Random Fields (KCORF). We extend this framework to account for heteroscedasticity on the output labels (i.e., pain intensity scores) and introduce novel dynamic features, dynamic ranks, that impose temporal ordinal constraints on the static ranks (i.e., intensity scores). Our experimental results show that the proposed approach outperforms state-of-the-art methods for sequence classification with ordinal data and other ordinal regression models. The approach performs significantly better than other models in terms of the Intra-Class Correlation measure, which is the most accepted evaluation measure in the tasks of facial behaviour intensity estimation.

Submitted 3 April, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

Comments: This paper has been withdrawn by the authors due to a crucial sign error in equations 2 and 3

arXiv:1107.2553 [pdf, other]

Learning Hypergraph Labeling for Feature Matching

Authors: Toufiq Parag, Vladimir Pavlovic, Ahmed Elgammal

Abstract: This study poses the feature correspondence problem as a hypergraph node labeling problem. Candidate feature matches and their subsets (usually of size larger than two) are considered to be the nodes and hyperedges of a hypergraph. A hypergraph labeling algorithm, which models the subset-wise interaction by an undirected graphical model, is applied to label the nodes (feature correspondences) as correct or incorrect. We describe a method to learn the cost function of this labeling algorithm from labeled examples using a graphical model training algorithm. The proposed feature matching algorithm differs from most existing learning-based point matching methods in the form of the objective function, the cost function to be learned, and the optimization method applied to minimize it. The results on standard datasets demonstrate how learning over a hypergraph improves the matching performance over existing algorithms, notably one that also uses higher order information without learning.

Submitted 13 July, 2011; originally announced July 2011.

Showing 51–57 of 57 results for author: Pavlovic, V