Search | arXiv e-print repository

Study of Residual Networks for Image Recognition

Authors: Mohammad Sadegh Ebrahimi, Hossein Karkeh Abadi

Abstract: Deep neural networks demonstrate to have a high performance on image classification tasks while being more difficult to train. Due to the complexity and vanishing gradient problem, it normally takes a lot of time and more computational power to train deeper neural networks. Deep residual networks (ResNets) can make the training process faster and attain more accuracy compared to their equivalent n… ▽ More Deep neural networks demonstrate to have a high performance on image classification tasks while being more difficult to train. Due to the complexity and vanishing gradient problem, it normally takes a lot of time and more computational power to train deeper neural networks. Deep residual networks (ResNets) can make the training process faster and attain more accuracy compared to their equivalent neural networks. ResNets achieve this improvement by adding a simple skip connection parallel to the layers of convolutional neural networks. In this project we first design a ResNet model that can perform the image classification task on the Tiny ImageNet dataset with a high accuracy, then we compare the performance of this ResNet model with its equivalent Convolutional Network (ConvNet). Our findings illustrate that ResNets are more prone to overfitting despite their higher accuracy. Several methods to prevent overfitting such as adding dropout layers and stochastic augmentation of the training dataset has been studied in this work. △ Less

Submitted 21 April, 2018; originally announced May 2018.

Comments: 6 pages, 9 figures

arXiv:1710.05262 [pdf, other]

Stable Matchings in Metric Spaces: Modeling Real-World Preferences using Proximity

Authors: Hossein Karkeh Abadi, Balaji Prabhakar

Abstract: Suppose each of $n$ men and $n$ women is located at a point in a metric space. A woman ranks the men in order of their distance to her from closest to farthest, breaking ties at random. The men rank the women similarly. An interesting problem is to use these ranking lists and find a stable matching in the sense of Gale and Shapley. This problem formulation naturally models preferences in several r… ▽ More Suppose each of $n$ men and $n$ women is located at a point in a metric space. A woman ranks the men in order of their distance to her from closest to farthest, breaking ties at random. The men rank the women similarly. An interesting problem is to use these ranking lists and find a stable matching in the sense of Gale and Shapley. This problem formulation naturally models preferences in several real world applications; for example, dating sites, room renting/letting, ride hailing and labor markets. Two key questions that arise in this setting are: (a) When is the stable matching unique without resorting to tie breaks? (b) If $X$ is the distance between a randomly chosen stable pair, what is the distribution of $X$ and what is $E(X)$? We study dating sites and ride hailing as prototypical examples of stable matchings in discrete and continuous metric spaces, respectively. In the dating site model, each person is assigned to a point on the $k$-dimensional hypercube based on their answers to a set of binary $k$ questions. We consider two different metrics on the hypercube: Hamming and Weighted Hamming. Under both metrics, there are exponentially many stable matchings when $k = \lfloor\log n\rfloor$. There is a unique stable matching, with high probability, under the Hamming distance when $k = Ω(n^6)$, and under the Weighted Hamming distance when $k > (2+ε) \log n$ for some $ε>0$. In the ride hailing model, passengers and cabs are modeled as points on the line and matched based on Euclidean distance. Assuming the locations of the passengers and cabs are independent Poisson processes of different intensities, we derive bounds on the distribution of $X$ in terms of busy periods at a last-come-first-served preemptive-resume (LCFS-PR) queue. △ Less

Submitted 14 October, 2017; originally announced October 2017.

arXiv:1102.4099 [pdf, ps, other]

Capacity Achieving Linear Codes with Random Binary Sparse Generating Matrices

Authors: A. Makhdoumi Kakhaki, H. Karkeh Abadi, P. Pad, H. Saeedi, F. Marvasti, K. Alishahi

Abstract: In this paper, we prove the existence of capacity achieving linear codes with random binary sparse generating matrices. The results on the existence of capacity achieving linear codes in the literature are limited to the random binary codes with equal probability generating matrix elements and sparse parity-check matrices. Moreover, the codes with sparse generating matrices reported in the literat… ▽ More In this paper, we prove the existence of capacity achieving linear codes with random binary sparse generating matrices. The results on the existence of capacity achieving linear codes in the literature are limited to the random binary codes with equal probability generating matrix elements and sparse parity-check matrices. Moreover, the codes with sparse generating matrices reported in the literature are not proved to be capacity achieving. As opposed to the existing results in the literature, which are based on optimal maximum a posteriori decoders, the proposed approach is based on a different decoder and consequently is suboptimal. We also demonstrate an interesting trade-off between the sparsity of the generating matrix and the error exponent (a constant which determines how exponentially fast the probability of error decays as block length tends to infinity). An interesting observation is that for small block sizes, less sparse generating matrices have better performances while for large blok sizes, the performance of the random generating matrices become independent of the sparsity. Moreover, we prove the existence of capacity achieving linear codes with a given (arbitrarily low) density of ones on rows of the generating matrix. In addition to proving the existence of capacity achieving sparse codes, an important conclusion of our paper is that for a sufficiently large code length, no search is necessary in practice to find a deterministic matrix by proving that any arbitrarily selected sequence of sparse generating matrices is capacity achieving with high probability. The focus in this paper is on the binary symmetric and binary erasure channels.her discrete memory-less symmetric channels. △ Less

Submitted 29 August, 2011; v1 submitted 20 February, 2011; originally announced February 2011.

Comments: Submitted to IEEE transaction on Information Theory

Showing 1–3 of 3 results for author: Abadi, H K