-
The Magic XRoom: A Flexible VR Platform for Controlled Emotion Elicitation and Recognition
Authors:
S. M. Hossein Mousavi,
Matteo Besenzoni,
Davide Andreoletti,
Achille Peternier,
Silvia Giordano
Abstract:
Affective computing has recently gained popularity, especially in the field of human-computer interaction systems, where effectively evoking and detecting emotions is of paramount importance to enhance users' experience. However, several issues are hindering progress in the field. In fact, the complexity of emotions makes it difficult to understand their triggers and control their elicitation. Additionally, effective emotion recognition requires analyzing multiple sensor data, such as facial expressions and physiological signals. These factors combined make it hard to collect high-quality datasets that can be used for research purposes (e.g., development of emotion recognition algorithms). Despite these challenges, Virtual Reality (VR) holds promise as a solution. By providing a controlled and immersive environment, VR enables the replication of real-world emotional experiences and facilitates the tracking of signals indicative of emotional states. However, controlling emotion elicitation remains a challenging task even within VR. This research paper introduces the Magic Xroom, a VR platform designed to enhance control over emotion elicitation by leveraging the theory of flow. This theory establishes a mapping between an individual's skill level, task difficulty, and perceived emotions. In the Magic Xroom, the user's skill level is continuously assessed, and task difficulty is adjusted accordingly to evoke specific emotions. Furthermore, user signals are collected using sensors, and virtual panels are utilized to determine the ground truth emotional states, making the Magic Xroom an ideal platform for collecting extensive datasets. The paper provides detailed implementation information, highlights the main properties of the Magic Xroom, and presents examples of virtual scenarios to illustrate its capabilities.
Submitted 12 July, 2024;
originally announced July 2024.
-
Bayesian Inference for Estimating Heat Sources through Temperature Assimilation
Authors:
Hanieh Mousavi,
Jeff D. Eldredge
Abstract:
This paper introduces a Bayesian inference framework for two-dimensional steady-state heat conduction, focusing on the estimation of unknown distributed heat sources in a thermally-conducting medium with uniform conductivity. The goal is to infer heater locations, strengths, and shapes using temperature assimilation in the Euclidean space, employing a Fourier series to represent each heater's shape. The Markov Chain Monte Carlo (MCMC) method, incorporating the random-walk Metropolis-Hastings algorithm and parallel tempering, is utilized for posterior distribution exploration in both unbounded and wall-bounded domains. Strong correlations between heat strength and heater area prompt caution against simultaneously estimating these two quantities. It is found that multiple solutions arise in cases where the number of temperature sensors is less than the number of unknown states. Moreover, smaller heaters introduce greater uncertainty in estimated strength. The diffusive nature of heat conduction smooths out any deformations in the temperature contours, especially in the presence of multiple heaters positioned near each other, impacting convergence. In wall-bounded domains with Neumann boundary conditions, the inference of heater parameters tends to be more accurate than in unbounded domains.
Submitted 17 April, 2024;
originally announced May 2024.
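The random-walk Metropolis-Hastings step named in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the one-dimensional Gaussian target `log_posterior`, the function name `random_walk_metropolis`, and every parameter value are assumptions chosen for illustration, and parallel tempering is omitted.

```python
import math
import random


def log_posterior(theta: float) -> float:
    """Toy unnormalised log-posterior (assumed): Gaussian with mean 2, variance 1."""
    return -0.5 * (theta - 2.0) ** 2


def random_walk_metropolis(log_target, start, step, n_samples, seed=0):
    """Draw a chain of samples using symmetric Gaussian random-walk proposals."""
    rng = random.Random(seed)
    theta = start
    log_p = log_target(theta)
    samples = []
    for _ in range(n_samples):
        proposal = theta + rng.gauss(0.0, step)
        log_p_new = log_target(proposal)
        # Accept with probability min(1, target(proposal) / target(theta)).
        accept_prob = math.exp(min(0.0, log_p_new - log_p))
        if rng.random() < accept_prob:
            theta, log_p = proposal, log_p_new
        samples.append(theta)  # a rejected proposal repeats the current state
    return samples


chain = random_walk_metropolis(log_posterior, start=0.0, step=1.0, n_samples=20000)
kept = chain[2000:]  # discard an assumed burn-in period
posterior_mean = sum(kept) / len(kept)
```

Because the proposal is symmetric, the acceptance ratio reduces to a ratio of target densities, which is why only the difference of log-posteriors appears in the code.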
-
Averages with the Gaussian divisor: Weighted Inequalities and the Pointwise Ergodic Theorem
Authors:
Christina Giannitsi,
Nazar Miheisi,
Hamed Mousavi
Abstract:
We discuss the Pointwise Ergodic Theorem for the Gaussian divisor function $d(n)$, that is, for a measure preserving $\mathbb Z[i]$ action $T$, the limit
$$\lim_{N\rightarrow \infty} \frac{1}{D(N)} \sum _{\mathscr{N} (n) \leq N} d(n) \,f(T^n x) $$ converges for every $f\in L^p$, where $\mathscr{N} (n) = n \bar{n}$, and $D(N) = \sum _{\mathscr{N} (n) \leq N} d(n) $, and $1<p\leq \infty$. To do so we study the averages
$$ A_N f (x) = \frac{1}{D(N)} \sum _{\mathscr{N} (n) \leq N} d(n) \,f(x-n) ,$$ and obtain improving and weighted maximal inequalities for our operator, in the process.
Submitted 19 February, 2024;
originally announced February 2024.
-
Approximation algorithms for noncommutative constraint satisfaction problems
Authors:
Eric Culf,
Hamoon Mousavi,
Taro Spirig
Abstract:
We study operator - or noncommutative - variants of constraint satisfaction problems (CSPs). These higher-dimensional variants are a core topic of investigation in quantum information, where they arise as nonlocal games and entangled multiprover interactive proof systems (MIP*). The idea of higher-dimensional relaxations of CSPs is also important in the classical literature. For example, since the celebrated work of Goemans and Williamson on Max-Cut, higher-dimensional vector relaxations have been central in the design of approximation algorithms for classical CSPs.
We introduce a framework for designing approximation algorithms for noncommutative CSPs. Prior to this work Max-$2$-Lin$(k)$ was the only family of noncommutative CSPs known to be efficiently solvable. This work is the first to establish approximation ratios for a broader class of noncommutative CSPs.
In the study of classical CSPs, $k$-ary decision variables are often represented by $k$-th roots of unity, which generalise to the noncommutative setting as order-$k$ unitary operators. In our framework, using representation theory, we develop a way of constructing unitary solutions from SDP relaxations, extending the pioneering work of Tsirelson on XOR games. Then, we introduce a novel rounding scheme to transform these solutions to order-$k$ unitaries. Our main technical innovation here is a theorem guaranteeing that, for any set of unitary operators, there exists a set of order-$k$ unitaries that closely mimics it. As an integral part of the rounding scheme, we prove a random matrix theory result that characterises the distribution of the relative angles between eigenvalues of random unitaries using tools from free probability.
Submitted 27 December, 2023;
originally announced December 2023.
-
Groups whose non-normal subgroups are either nilpotent or minimal non-nilpotent
Authors:
Nasrin Dastborhan,
Hamid Mousavi
Abstract:
Let $\mathfrak{Nil}$ be the class of nilpotent groups and $G$ be a group. We call $G$ a meta-$\mathfrak{Nil}$-Hamiltonian group if any of its non-$\mathfrak{Nil}$ subgroups is normal. Also, we call $G$ a para-$\mathfrak{Nil}$-Hamiltonian group if $G$ is a non-$\mathfrak{Nil}$ group and every non-normal subgroup of $G$ is either a $\mathfrak{Nil}$-group or a minimal non-$\mathfrak{Nil}$ group. In this paper we investigate the class of finitely generated meta-$\mathfrak{Nil}$-Hamiltonian and para-$\mathfrak{Nil}$-Hamiltonian groups.
Submitted 20 February, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Averages over the Gaussian Primes: Goldbach's Conjecture and Improving Estimates
Authors:
Christina Giannitsi,
Ben Krause,
Michael Lacey,
Hamed Mousavi,
Yaghoub Rahimi
Abstract:
We prove versions of Goldbach conjectures for Gaussian primes in arbitrary sectors. Fix an interval $ω\subset \mathbb{T}$. There is an integer $N_ω$, so that every odd integer $n$ with $N(n)>N_ω$ and $\text{dist}( \text{arg}(n) , \mathbb{T}\setminus ω) > (\log N(n)) ^{-B}$, is a sum of three Gaussian primes $n=p_1+p_2+p_3$, with $\text{arg}(p_j) \in ω$, for $j=1,2,3$. A density version of the binary Goldbach conjecture in a sector is also proved.
Submitted 20 March, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Bees Local Phase Quantization Feature Selection for RGB-D Facial Expressions Recognition
Authors:
Seyed Muhammad Hossein Mousavi,
Atiye Ilanloo
Abstract:
Feature selection can be defined as an optimization problem and solved by bio-inspired algorithms. The Bees Algorithm (BA) shows decent performance in feature selection optimization tasks. Local Phase Quantization (LPQ), in turn, is a frequency-domain feature which has excellent performance on Depth images. Here, after extracting LPQ features from RGB (colour) and Depth images in the Iranian Kinect Face Database (IKFDB), the Bees feature selection algorithm is applied to select the desired number of features for the final classification tasks. IKFDB is recorded with Kinect sensor V.2 and contains colour and depth images for facial and facial micro-expression recognition purposes. Five facial expressions (Anger, Joy, Surprise, Disgust and Fear) are used for final validation. The proposed Bees LPQ method is compared with Particle Swarm Optimization (PSO) LPQ, PCA LPQ, Lasso LPQ, and plain LPQ features for classification tasks with Support Vector Machines (SVM), K-Nearest Neighbourhood (KNN), Shallow Neural Network and Ensemble Subspace KNN. The results show a decent performance of the proposed algorithm (99% accuracy) in comparison with the others.
Submitted 3 August, 2023;
originally announced August 2023.
-
Introduction to Facial Micro Expressions Analysis Using Color and Depth Images: A Matlab Coding Approach (Second Edition, 2023)
Authors:
Seyed Muhammad Hossein Mousavi
Abstract:
This book offers a gentle introduction to the field of Facial Micro Expressions Recognition (FMER) using Color and Depth images, with the aid of the MATLAB programming environment. FMER is a subset of image processing and a multidisciplinary topic to analyze. It therefore requires familiarity with other topics of Artificial Intelligence (AI) such as machine learning, digital image processing, psychology and more. This makes it a great opportunity to write a book which covers all of these topics for beginner to professional readers in the field of AI, and even for readers without a background in AI. Our goal is to provide a standalone introduction to the field of FMER analysis in the form of theoretical descriptions for readers with no background in image processing, with reproducible MATLAB practical examples. We also describe the basic definitions for FMER analysis and the MATLAB libraries used in the text, which helps the reader to apply the experiments in real-world applications. We believe that this book is suitable for students, researchers, and professionals alike, who need to develop practical skills, along with a basic understanding of the field. We expect that, after reading this book, the reader feels comfortable with different key stages such as color and depth image processing, color and depth image representation, classification, machine learning, facial micro-expressions recognition, feature extraction and dimensionality reduction.
Submitted 19 June, 2023;
originally announced July 2023.
-
Victoria Amazonica Optimization (VAO): An Algorithm Inspired by the Giant Water Lily Plant
Authors:
Seyed Muhammad Hossein Mousavi
Abstract:
The Victoria Amazonica plant, often known as the Giant Water Lily, has the largest floating spherical leaf in the world, with a maximum leaf diameter of 3 meters. It spreads its leaves by the force of its spines and creates a large shadow underneath, killing any plants that require sunlight. These water tyrants use their formidable spines to compel each other to the surface and increase their strength to grab more space from the surface. As they spread throughout the pond or basin, with the earliest-growing leaves having more room to grow, each leaf gains a unique size. Its flowers are transsexual and when they bloom, Cyclocephala beetles are responsible for the pollination process, being attracted to the scent of the female flower. After entering the flower, the beetle becomes covered with pollen and transfers it to another flower for fertilization. After the beetle leaves, the flower turns into a male and changes color from white to pink. The male flower dies and sinks into the water, releasing its seed to help create a new generation. In this paper, the mathematical life cycle of this magnificent plant is introduced, and each leaf and blossom are treated as a single entity. The proposed bio-inspired algorithm is tested with 24 benchmark optimization test functions, such as Ackley, and compared to ten other famous algorithms, including the Genetic Algorithm. The proposed algorithm is tested on 10 optimization problems: Minimum Spanning Tree, Hub Location Allocation, Quadratic Assignment, Clustering, Feature Selection, Regression, Economic Dispatching, Parallel Machine Scheduling, Color Quantization, and Image Segmentation and compared to traditional and bio-inspired algorithms. Overall, the performance of the algorithm in all tasks is satisfactory.
Submitted 22 January, 2023;
originally announced March 2023.
-
Neural Gas Network Image Features and Segmentation for Brain Tumor Detection Using Magnetic Resonance Imaging Data
Authors:
S. Muhammad Hossein Mousavi
Abstract:
Accurate detection of brain tumors could save lots of lives and increasing the accuracy of this binary classification even as much as a few percent has high importance. Neural Gas Networks (NGN) is a fast, unsupervised algorithm that could be used in data clustering, image pattern recognition, and image segmentation. In this research, we used the metaheuristic Firefly Algorithm (FA) for image contrast enhancement as pre-processing and NGN weights for feature extraction and segmentation of Magnetic Resonance Imaging (MRI) data on two brain tumor datasets from the Kaggle platform. Also, tumor classification is conducted by Support Vector Machine (SVM) classification algorithms and compared with a deep learning technique plus other features in train and test phases. Additionally, NGN tumor segmentation is evaluated by famous performance metrics such as Accuracy, F-measure, Jaccard, and more versus ground truth data and compared with traditional segmentation techniques. The proposed method is fast and precise in both tasks of tumor classification and segmentation compared with other methods. A classification accuracy of 95.14 % and segmentation accuracy of 0.977 is achieved by the proposed method.
Submitted 28 January, 2023;
originally announced January 2023.
-
GTFLAT: Game Theory Based Add-On For Empowering Federated Learning Aggregation Techniques
Authors:
Hamidreza Mahini,
Hamid Mousavi,
Masoud Daneshtalab
Abstract:
GTFLAT, as a game theory-based add-on, addresses an important research question: How can a federated learning algorithm achieve better performance and training efficiency by setting more effective adaptive weights for averaging in the model aggregation phase? The main objectives for the ideal method of answering the question are: (1) empowering federated learning algorithms to reach better performance in fewer communication rounds, notably in the face of heterogeneous scenarios, and last but not least, (2) being easy to use alongside the state-of-the-art federated learning algorithms as a new module. To this end, GTFLAT models the averaging task as a strategic game among active users. Then it proposes a systematic solution based on the population game and evolutionary dynamics to find the equilibrium. In contrast with existing approaches that impose the weights on the participants, GTFLAT concludes a self-enforcement agreement among clients in a way that none of them is motivated to deviate from it individually. The results reveal that, on average, using GTFLAT increases the top-1 test accuracy by 1.38%, while it needs 21.06% fewer communication rounds to reach the accuracy.
Submitted 8 December, 2022;
originally announced December 2022.
-
The impact of the solubilizer of an element on the structure of a finite group
Authors:
Hamid Mousavi,
Mina Poozesh,
Yousef Zamani
Abstract:
Let $G$ be a finite group, and let $x$ be an element of $G$. Denote by $\Sol_G(x)$ the set of all $y \in G$ such that the group generated by $x$ and $y$ is soluble. We investigate the influence of $\Sol_G(x)$ on the structure of $G$.
Submitted 2 April, 2023; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Fast Updating the STBC Decoder Matrices in the Uplink of a Massive MIMO System
Authors:
Seyed Hosein Mousavi,
Jafar Pourrostam
Abstract:
Reducing the computational complexity of modern wireless communication systems such as massive Multiple-Input Multiple-Output (MIMO) configurations is of utmost interest. In this paper, we propose a new algorithm that can be used to accelerate matrix inversion in the decoding of space-time block codes (STBC) in the uplink of dynamic massive MIMO systems. A multi-user system in which the base station is equipped with a large number of antennas and each user has two antennas is considered. In addition, users can enter or exit the system dynamically. For a given space-time block coding/decoding scheme, the computational complexity of the receiver is significantly reduced when a user is added to or removed from the system by employing the proposed method. In the proposed scheme, the matrix inversion for zero-forcing (ZF) as well as minimum mean square error (MMSE) decoding is derived from the inverse of a partitioned matrix and the Woodbury matrix identity. Furthermore, the suggested technique can be utilized when the number of users is fixed but the channel estimate changes for a particular user. The mathematical equations for updating the inverse of the decoding matrices are derived and their complexity is compared to that of computing the inverse directly. Evaluations confirm the effectiveness of the proposed approach.
Submitted 22 August, 2022;
originally announced August 2022.
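The idea of updating an existing inverse instead of recomputing it can be sketched with the rank-one special case of the Woodbury matrix identity (the Sherman-Morrison formula). This is only an illustration under stated assumptions, not the paper's update equations: the matrices are small, real-valued and exact (the paper's decoding matrices are complex-valued and built from channel estimates), and the names `mat_inverse` and `sherman_morrison` are hypothetical.

```python
from fractions import Fraction


def mat_inverse(a):
    """Direct Gauss-Jordan inverse over exact fractions, used here for comparison.

    Assumes the matrix is nonsingular.
    """
    n = len(a)
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = 1 / aug[col][col]
        aug[col] = [x * scale for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def sherman_morrison(a_inv, u, v):
    """Inverse of (A + u v^T) from a known A^{-1}, using O(n^2) operations."""
    n = len(a_inv)
    a_inv_u = [sum(a_inv[i][k] * u[k] for k in range(n)) for i in range(n)]
    v_a_inv = [sum(v[k] * a_inv[k][j] for k in range(n)) for j in range(n)]
    denom = 1 + sum(v[k] * a_inv_u[k] for k in range(n))
    if denom == 0:
        raise ValueError("the rank-one update makes the matrix singular")
    return [[a_inv[i][j] - a_inv_u[i] * v_a_inv[j] / denom for j in range(n)]
            for i in range(n)]


# Assumed toy data: A stands in for a Gram-type matrix, u v^T for a small change.
A = [[4, 1, 0], [1, 3, 1], [0, 1, 2]]
u = [1, 0, 2]
v = [0, 1, 1]
A_inv = mat_inverse(A)
updated = [[A[i][j] + u[i] * v[j] for j in range(3)] for i in range(3)]
fast_inverse = sherman_morrison(A_inv, u, v)
direct_inverse = mat_inverse(updated)
```

The update costs on the order of n^2 operations against roughly n^3 for a fresh inversion, which is the kind of saving the abstract refers to when a single user's contribution changes.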
-
DASS: Differentiable Architecture Search for Sparse neural networks
Authors:
Hamid Mousavi,
Mohammad Loni,
Mina Alibeigi,
Masoud Daneshtalab
Abstract:
The deployment of Deep Neural Networks (DNNs) on edge devices is hindered by the substantial gap between performance requirements and available processing power. While recent research has made significant strides in developing pruning methods to build a sparse network for reducing the computing overhead of DNNs, there remains considerable accuracy loss, especially at high pruning ratios. We find that the architectures designed for dense networks by differentiable architecture search methods are ineffective when pruning mechanisms are applied to them. The main reason is that current methods do not support sparse architectures in their search space and use a search objective that is made for dense networks and does not pay any attention to sparsity. In this paper, we propose a new method to search for sparsity-friendly neural architectures. We do this by adding two new sparse operations to the search space and modifying the search objective. We propose two novel parametric SparseConv and SparseLinear operations in order to expand the search space to include sparse operations. In particular, these operations make a flexible search space due to using sparse parametric versions of linear and convolution operations. The proposed search objective lets us train the architecture based on the sparsity of the search space operations. Quantitative analyses demonstrate that our search architectures outperform those used in the state-of-the-art sparse networks on the CIFAR-10 and ImageNet datasets. In terms of performance and hardware effectiveness, DASS increases the accuracy of the sparse version of MobileNet-v2 from 73.44% to 81.35% (+7.91% improvement) with 3.87x faster inference time.
Submitted 12 September, 2023; v1 submitted 14 July, 2022;
originally announced July 2022.
-
On a conjecture of Graham on the p-divisibility of central binomial coefficients
Authors:
Ernie Croot,
Hamed Mousavi,
Maxie Schmidt
Abstract:
We show that for every $r \geq 1$, and all $r$ distinct (sufficiently large) primes $p_1,..., p_r > p_0(r)$, there exist infinitely many integers $n$ such that ${2n \choose n}$ is divisible by these primes to only low multiplicity. From a theorem of Kummer, an upper bound for the number of times that a prime $p_j$ can divide ${2n \choose n}$ is $1+\log n / \log p_j$; and our theorem shows that for every $\varepsilon > 0$, $r \geq 1$, and any sufficiently large primes $p_1,...,p_r > p_0(\varepsilon,r)$, we can find integers $n$ where for $j=1,...,r$, $p_j$ divides ${2n \choose n}$ with multiplicity at most $\varepsilon \log n/\log p_j$. We connect this result to a famous conjecture by R. L. Graham on whether there are infinitely many integers $n$ such that ${2n \choose n}$ is coprime to $105$.
Submitted 6 January, 2023; v1 submitted 26 January, 2022;
originally announced January 2022.
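Kummer's theorem, which the abstract above relies on, states that the exponent of a prime p in the binomial coefficient C(a+b, a) equals the number of carries when a and b are added in base p. A minimal sketch for the central case a = b = n follows; the function names are illustrative, and the search range of 300 is an arbitrary choice that says nothing about Graham's conjecture itself.

```python
from math import comb, gcd


def carries_doubling(n: int, p: int) -> int:
    """Count the carries produced when adding n + n in base p."""
    carries = 0
    carry = 0
    while n > 0:
        digit = n % p
        carry = 1 if 2 * digit + carry >= p else 0
        carries += carry
        n //= p
    return carries


def valuation(m: int, p: int) -> int:
    """Largest exponent e such that p**e divides the positive integer m."""
    e = 0
    while m % p == 0:
        m //= p
        e += 1
    return e


def central_binomial_coprime_to_105(n: int) -> bool:
    """True when no carries occur in bases 3, 5 and 7, so C(2n, n) avoids 3, 5, 7."""
    return all(carries_doubling(n, p) == 0 for p in (3, 5, 7))


# Small values of n for which C(2n, n) is coprime to 105 = 3 * 5 * 7.
small_examples = [n for n in range(1, 300) if central_binomial_coprime_to_105(n)]
```

Checking carries avoids computing the binomial coefficient at all, which is what makes the digit-based viewpoint practical for large n.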
-
Deep Curriculum Learning for PolSAR Image Classification
Authors:
Hamidreza Mousavi,
Maryam Imani,
Hassan Ghassemian
Abstract:
Following the great success of curriculum learning in the area of machine learning, a novel deep curriculum learning method, entitled DCL, is proposed in this paper, particularly for the classification of fully polarimetric synthetic aperture radar (PolSAR) data. This method utilizes the entropy-alpha target decomposition method to estimate the degree of complexity of each PolSAR image patch before applying it to the convolutional neural network (CNN). Also, an accumulative mini-batch pacing function is used to introduce more difficult patches to the CNN. Experiments on the widely used AIRSAR Flevoland data set reveal that the proposed curriculum learning method can not only increase classification accuracy but also lead to faster training convergence.
Submitted 26 December, 2021;
originally announced December 2021.
-
Improving and Maximal Inequalities for Primes in Progressions
Authors:
Christina Giannitsi,
Michael T. Lacey,
Hamed Mousavi,
Yaghoub Rahimi
Abstract:
Assume that $ y < N$ are integers, and that $ (b,y) =1$. Define an average along the primes in a progression of diameter $ y$, given by integer $ (b,y)=1 $. \begin{align*} A_{N,y,b} := \frac{φ(y)}{N} \sum _{\substack{n <N\\n\equiv b\pmod{y}}} Λ(n) f(x-n) \end{align*} Above, $Λ$ is the von Mangoldt function and $φ$ is the totient function. We establish improving and maximal inequalities for these averages. These bounds are uniform in the choice of progression. For instance, for $ 1< r < \infty $ there is an integer $N _{y, r}$ so that \begin{align*} \lVert \sup _{N>N _{y,r}} \lvert A_{N,y,b} f \rvert \rVert_{r}\ll \lVert f\rVert_{r}. \end{align*} The implied constant is only a function of $ r$. The uniformity over progressions imposes several novel elements on the proof.
Submitted 17 April, 2022; v1 submitted 14 December, 2021;
originally announced December 2021.
-
Nonlocal Games, Compression Theorems, and the Arithmetical Hierarchy
Authors:
Hamoon Mousavi,
Seyed Sajjad Nezhadi,
Henry Yuen
Abstract:
We investigate the connection between the complexity of nonlocal games and the arithmetical hierarchy, a classification of languages according to the complexity of arithmetical formulas defining them. It was recently shown by Ji, Natarajan, Vidick, Wright and Yuen that deciding whether the (finite-dimensional) quantum value of a nonlocal game is $1$ or at most $\frac{1}{2}$ is complete for the class $Σ_1$ (i.e., $\mathsf{RE}$). A result of Slofstra implies that deciding whether the commuting operator value of a nonlocal game is equal to $1$ is complete for the class $Π_1$ (i.e., $\mathsf{coRE}$). We prove that deciding whether the quantum value of a two-player nonlocal game is exactly equal to $1$ is complete for $Π_2$; this class is in the second level of the arithmetical hierarchy and corresponds to formulas of the form "$\forall x \, \exists y \, φ(x,y)$". This shows that exactly computing the quantum value is strictly harder than approximating it, and also strictly harder than computing the commuting operator value (either exactly or approximately). We explain how results about the complexity of nonlocal games all follow in a unified manner from a technique known as compression. At the core of our $Π_2$-completeness result is a new "gapless" compression theorem that holds for both quantum and commuting operator strategies. Our compression theorem yields as a byproduct an alternative proof of Slofstra's result that the set of quantum correlations is not closed. We also show how a "gap-preserving" compression theorem for commuting operator strategies would imply that approximating the commuting operator value is complete for $Π_1$.
Submitted 11 October, 2021; v1 submitted 9 October, 2021;
originally announced October 2021.
-
Synchronous Values of Games
Authors:
J. William Helton,
Hamoon Mousavi,
Seyed Sajjad Nezhadi,
Vern I. Paulsen,
Travis B. Russell
Abstract:
We study synchronous values of games, especially synchronous games. It is known that a synchronous game has a perfect strategy if and only if it has a perfect synchronous strategy. However, we give examples of synchronous games, in particular graph colouring games, with synchronous value that is strictly smaller than their ordinary value. Thus, the optimal strategy for a synchronous game need not be synchronous. We derive a formula for the synchronous value of an XOR game as an optimization problem over a spectrahedron involving a matrix related to the cost matrix. We give an example of a game such that the synchronous value of repeated products of the game is strictly increasing. We show that the synchronous quantum bias of the XOR of two XOR games is not multiplicative. Finally, we derive geometric and algebraic conditions that a set of projections that yields the synchronous value of a game must satisfy.
Submitted 22 August, 2023; v1 submitted 29 September, 2021;
originally announced September 2021.
-
Endpoint $ \ell ^{r}$ improving estimates for Prime averages
Authors:
Michael T. Lacey,
Hamed Mousavi,
Yaghoub Rahimi
Abstract:
Let $ Λ$ denote von Mangoldt's function, and consider the averages \begin{align*} A_N f (x) &=\frac{1}{N}\sum_{1\leq n \leq N}f(x-n)Λ(n) . \end{align*} We prove sharp $ \ell ^{p}$-improving for these averages, and sparse bounds for the maximal function. The simplest inequality is that for sets $ F, G\subset [0,N]$ there holds \begin{equation*} N ^{-1} \langle A_N \mathbf 1_{F} , \mathbf 1_{G} \rangle \ll \frac{\lvert F\rvert \cdot \lvert G\rvert} { N ^2 } \Bigl( \operatorname {Log} \frac{\lvert F\rvert \cdot \lvert G\rvert} { N ^2 } \Bigr) ^{t}, \end{equation*} where $ t=2$, or assuming the Generalized Riemann Hypothesis, $ t=1$. The corresponding sparse bound is proved for the maximal function $ \sup_N A_N \mathbf 1_{F}$. The inequalities for $ t=1$ are sharp. The proof depends upon the Circle Method, and an interpolation argument of Bourgain.
Submitted 1 May, 2023; v1 submitted 25 January, 2021;
originally announced January 2021.
-
On Reduced archimedean skew power series rings
Authors:
Hamed Mousavi,
Farzad Padashnik,
Ayesha Asloob Qureshi
Abstract:
In this paper, we prove that if $R$ is an Archimedean reduced ring satisfying the ACC on annihilators, then $R[[x]]$ is also an Archimedean reduced ring. More generally, we prove that if $R$ is a right Archimedean ring satisfying the \emph{ACC} on annihilators and $α$ is a rigid automorphism of $R$, then the skew power series ring $R[[x;α]]$ is a right Archimedean reduced ring. We also provide some examples to justify the assumptions we made to obtain the required result.
Submitted 3 September, 2020;
originally announced September 2020.
-
A Low Complexity Space-Time Block Codes Detection for Cell-Free Massive MIMO Systems
Authors:
A. Mazhari Saray,
J. Pourrostam,
S. H. Mousavi,
M. Mohassel Feghhi
Abstract:
The new generation of telecommunication systems must provide acceptable data rates and spectral efficiency for new applications. Recently, massive MIMO has been introduced as a key technique for the new generation of telecommunication systems. A cell-free massive MIMO system is not segmented into cells: the BS antennas are distributed throughout the environment, and each user is served by all BSs simultaneously.
In this paper, the performance of a multiuser cell-free massive MIMO system employing space-time block codes in the uplink with linear decoders is studied. An inverse-matrix approximation using the Neumann series is proposed to reduce the computational and hardware complexity of decoding at the receiver.
For this purpose, each user has two antennas, and space-time block codes are used in the uplink to improve the diversity gain. The Neumann series is then used to approximate the inverse matrix in the ZF and MMSE decoders, and its performance is evaluated in terms of BER and spectral efficiency.
In addition, we derive a lower bound for the throughput of the ZF decoder.
The simulation results show that the performance of the system, in terms of BER and spectral efficiency, is better than that of single-antenna users in the same system. Also, the BER performance of the proposed method is close to that of the exact method.
Submitted 1 April, 2020;
originally announced April 2020.
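The Neumann-series approximation of a matrix inverse mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the common diagonal-splitting form $A^{-1} \approx \sum_{k=0}^{K-1} (I - D^{-1}A)^k D^{-1}$, where $D$ is the diagonal of $A$, which converges when the spectral radius of $I - D^{-1}A$ is below 1 (for example, for strictly diagonally dominant matrices). The function name `neumann_inverse` and the test matrix are hypothetical.

```python
def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_add(a, b):
    """Add two matrices of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def identity(n):
    """Return the n-by-n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def neumann_inverse(a, terms):
    """Approximate inv(a) with `terms` terms of a Neumann series.

    Assumption (not from the paper): split a = D + R with D the diagonal
    of a, and use inv(a) ~= sum_{k=0}^{terms-1} (I - inv(D) a)^k inv(D).
    """
    n = len(a)
    d_inv = [[1.0 / a[i][i] if i == j else 0.0 for j in range(n)]
             for i in range(n)]
    da = mat_mul(d_inv, a)
    # Iteration matrix E = I - inv(D) a
    e = [[(1.0 if i == j else 0.0) - da[i][j] for j in range(n)]
         for i in range(n)]
    power = identity(n)  # E^0
    acc = identity(n)    # running sum of E^k
    for _ in range(terms - 1):
        power = mat_mul(power, e)
        acc = mat_add(acc, power)
    return mat_mul(acc, d_inv)


if __name__ == "__main__":
    # Hypothetical diagonally dominant matrix, standing in for a Gram matrix.
    a = [[4.0, 1.0], [1.0, 3.0]]
    for terms in (1, 2, 4, 8):
        print(terms, neumann_inverse(a, terms))
```

Each extra term costs one matrix product, so truncating the series early trades accuracy for lower complexity than an exact inversion.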
-
Generic Unsupervised Optimization for a Latent Variable Model With Exponential Family Observables
Authors:
Hamid Mousavi,
Jakob Drefs,
Florian Hirschberger,
Jörg Lücke
Abstract:
Latent variable models (LVMs) represent observed variables by parameterized functions of latent variables. Prominent examples of LVMs for unsupervised learning are probabilistic PCA or probabilistic SC which both assume a weighted linear summation of the latents to determine the mean of a Gaussian distribution for the observables. In many cases, however, observables do not follow a Gaussian distribution. For unsupervised learning, LVMs which assume specific non-Gaussian observables have therefore been considered. Already for specific choices of distributions, parameter optimization is challenging and only a few previous contributions considered LVMs with more generally defined observable distributions. Here, we consider LVMs that are defined for a range of different distributions, i.e., observables can follow any (regular) distribution of the exponential family. The novel class of LVMs presented is defined for binary latents, and it uses maximization in place of summation to link the latents to observables. To derive an optimization procedure, we follow an EM approach for maximum likelihood parameter estimation. We show that a set of very concise parameter update equations can be derived which feature the same functional form for all exponential family distributions. The derived generic optimization can consequently be applied to different types of metric data as well as to different types of discrete data. Also, the derived optimization equations can be combined with a recently suggested variational acceleration which is likewise generically applicable to the LVMs considered here. So, the combination maintains generic and direct applicability of the derived optimization procedure, but, crucially, enables efficient scalability. We numerically verify our analytical results and discuss some potential applications such as learning of variance structure, noise type estimation and denoising.
Submitted 15 December, 2023; v1 submitted 4 March, 2020;
originally announced March 2020.
-
On the complexity of zero gap MIP*
Authors:
Hamoon Mousavi,
Seyed Sajjad Nezhadi,
Henry Yuen
Abstract:
The class $\mathsf{MIP}^*$ is the set of languages decidable by multiprover interactive proofs with quantum entangled provers. It was recently shown by Ji, Natarajan, Vidick, Wright and Yuen that $\mathsf{MIP}^*$ is equal to $\mathsf{RE}$, the set of recursively enumerable languages. In particular this shows that the complexity of approximating the quantum value of a non-local game $G$ is equivalent to the complexity of the Halting problem.
In this paper we investigate the complexity of deciding whether the quantum value of a non-local game $G$ is exactly $1$. This problem corresponds to a complexity class that we call zero gap $\mathsf{MIP}^*$, denoted by $\mathsf{MIP}^*_0$, where there is no promise gap between the verifier's acceptance probabilities in the YES and NO cases. We prove that $\mathsf{MIP}^*_0$ extends beyond the first level of the arithmetical hierarchy (which includes $\mathsf{RE}$ and its complement $\mathsf{coRE}$), and in fact is equal to $Π_2^0$, the class of languages that can be decided by quantified formulas of the form $\forall y \, \exists z \, R(x,y,z)$.
Combined with the previously known result that $\mathsf{MIP}^{co}_0$ (the commuting operator variant of $\mathsf{MIP}^*_0$) is equal to $\mathsf{coRE}$, our result further highlights the fascinating connection between various models of quantum multiprover interactive proofs and different classes in computability theory.
Submitted 28 April, 2020; v1 submitted 24 February, 2020;
originally announced February 2020.
-
A generalization of CHSH and the algebraic structure of optimal strategies
Authors:
David Cui,
Arthur Mehta,
Hamoon Mousavi,
Seyed Sajjad Nezhadi
Abstract:
Self-testing has been a rich area of study in quantum information theory. It allows an experimenter to interact classically with a black box quantum system and to test that a specific entangled state was present and a specific set of measurements were performed. Recently, self-testing has been central to high-profile results in complexity theory as seen in the work on entangled games PCP of Natarajan and Vidick (FOCS 2018), iterated compression by Fitzsimons et al. (STOC 2019), and NEEXP in MIP* due to Natarajan and Wright (FOCS 2019).
In this work, we introduce an algebraic generalization of CHSH by viewing it as a linear constraint system (LCS) game, exhibiting self-testing properties that are qualitatively different. These provide the first example of non-local games that self-test non-Pauli operators, resolving an open question posed by Coladangelo and Stark (QIP 2017). Our games also provide a self-test for states other than the maximally entangled state, and hence resolve the open question posed by Cleve and Mittal (ICALP 2012). Additionally, our games have 1 bit question and $\log n$ bit answer lengths, making them suitable candidates for complexity-theoretic applications. This work is the first step towards a general theory of self-testing arbitrary groups. In order to obtain our results, we exploit connections between sum of squares proofs, non-commutative ring theory, and the Gowers-Hatami theorem from approximate representation theory. A crucial part of our analysis is to introduce a sum of squares framework that generalizes the \emph{solution group} of Cleve, Liu, and Slofstra (Journal of Mathematical Physics 2017) to the non-pseudo-telepathic regime. Finally, we give the first example of a game that is not a self-test. Our results suggest a richer landscape of self-testing phenomena than previously considered.
Submitted 21 September, 2021; v1 submitted 4 November, 2019;
originally announced November 2019.
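For context on the standard CHSH game that the abstract above generalizes, the sketch below brute-forces its classical value. It assumes the usual formulation, which is not spelled out in the abstract: uniformly random question bits $x, y$, answer bits $a, b$, and winning condition $a \oplus b = x \wedge y$. The function name `chsh_classical_value` is hypothetical, and the code covers only the classical game, not the generalized games of the paper.

```python
from itertools import product


def chsh_classical_value():
    """Best winning probability over deterministic classical strategies.

    Alice answers a[x] on question x and Bob answers b[y] on question y;
    shared randomness cannot beat the best deterministic strategy.
    """
    best = 0.0
    for a0, a1, b0, b1 in product((0, 1), repeat=4):
        a = (a0, a1)
        b = (b0, b1)
        # Count the question pairs (x, y) on which the players win.
        wins = sum(1 for x, y in product((0, 1), repeat=2)
                   if a[x] ^ b[y] == (x & y))
        best = max(best, wins / 4)
    return best


if __name__ == "__main__":
    print(chsh_classical_value())  # prints 0.75
```

Optimal quantum strategies exceed this classical bound, which is what makes the game usable for self-testing.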
-
Predictive Coding Networks Meet Action Recognition
Authors:
Xia Huang,
Hossein Mousavi,
Gemma Roig
Abstract:
Action recognition is a key problem in computer vision that labels videos with a set of predefined actions. Capturing both semantic content and motion along the video frames is key to achieving high accuracy on this task. Most of the state-of-the-art methods rely on RGB frames for extracting the semantics and pre-computed optical flow fields as a motion cue. Then, both are combined using deep neural networks. Yet, it has been argued that such models are not able to leverage the motion information extracted from the optical flow, but instead the optical flow allows for better recognition of people and objects in the video. This urges the need to explore different cues or models that can extract motion in a more informative fashion. To tackle this issue, we propose to explore the predictive coding network, the so-called PredNet, a recurrent neural network that propagates predictive coding errors across layers and time steps. We analyze whether PredNet can better capture motions in videos by estimating over time the representations extracted from pre-trained networks for action recognition. In this way, the model relies only on the video frames and does not need pre-processed optical flows as input. We report the effectiveness of our proposed model on the UCF101 and HMDB51 datasets.
Submitted 22 October, 2019;
originally announced October 2019.
-
On a Class of Sums with Unexpectedly High Cancellation, and its Applications
Authors:
Ernie Croot,
Hamed Mousavi
Abstract:
Following attempts at an analytic proof of the Pentagonal Number Theorem, we report on the discovery of a general principle leading to an unexpected cancellation of oscillating sums. After stating the motivation, and our theorem, we apply it to prove several results on the Prouhet-Tarry-Escott Problem, integer partitions, and the distribution of prime numbers. Regarding the Prouhet-Tarry-Escott problem, we show that \begin{align*} \sum_{|\ell|\leq x}(4x^2-4\ell^2)^{2r}-\sum_{|\ell|<x}(4x^2-(2\ell+1)^2)^{2r}=\text{polynomial w.r.t. } x \text{ with degree }2r-1. \end{align*} This can perhaps be proved using properties of Bernoulli polynomials, but the claim fell out of our method in a more natural and motivated way. Using this result, we solve an approximate version of the PTE Problem, and in doing so our work in the approximate case exceeds the bounds one can prove using a pigeonhole argument, which seems remarkable. Also, we prove that $$ \sum_{\ell^2 < n} (-1)^\ell p(n-\ell^2)\ \sim\ (-1)^n 2^{-3/4} n^{-1/4} \sqrt{p(n)}, $$ where $p(n)$ is the usual partition function. We get the following "Weak pentagonal number theorem", in which we can replace the partition function $p(n)$ with Chebyshev $Ψ$ function: $$ \sum_{0 < \ell < \sqrt{xT}/2} Ψ([e^{\sqrt{x - \frac{(2\ell)^2}{T}}},\ e^{\sqrt{x - \frac{(2\ell-1)^2}{T}}}])\ =Ψ(e^{\sqrt{x}})\left(\frac{1}{2} + O\left (e^{-0.196\sqrt{x}}\right)\right), $$ where $T=e^{0.786\sqrt{x}}$, where $Ψ([a,b]) := \sum_{n\in [a,b]} Λ(n)$ and $Ψ(x) = Ψ([1,x])$, where $Λ$ is the von Mangoldt function. Note that this last equation (sum over $\ell$) is stronger than one would get using a strong form of the Prime Number Theorem and also a naive use of the Riemann Hypothesis in each interval, since the widths of the intervals are smaller than $e^{\frac{1}{2} \sqrt{x}}$, making the RH estimate ``trivial".
Submitted 22 June, 2022; v1 submitted 26 September, 2019;
originally announced September 2019.
-
A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning
Authors:
Hossein K. Mousavi,
Guangyi Liu,
Weihang Yuan,
Martin Takáč,
Héctor Muñoz-Avila,
Nader Motee
Abstract:
We propose a planning and perception mechanism for a robot (agent), that can only observe the underlying environment partially, in order to solve an image classification problem. A three-layer architecture is suggested that consists of a meta-layer that decides the intermediate goals, an action-layer that selects local actions as the agent navigates towards a goal, and a classification-layer that evaluates the reward and makes a prediction. We design and implement these layers using deep reinforcement learning. A generalized policy gradient algorithm is utilized to learn the parameters of these layers to maximize the expected reward. Our proposed methodology is tested on the MNIST dataset of handwritten digits, which provides us with a level of explainability while interpreting the agent's intermediate goals and course of action.
Submitted 20 September, 2019;
originally announced September 2019.
-
Explicit Characterization of Performance of a Class of Networked Linear Control Systems
Authors:
Hossein K. Mousavi,
Nader Motee
Abstract:
We show that the steady-state variance as a performance measure for a class of networked linear control systems is expressible as the summation of a rational function over the Laplacian eigenvalues of the network graph. Moreover, we characterize the role of connectivity thresholds for the feedback (and observer) gain design of these networks. We use our framework to derive bounds and scaling laws for the performance of the dynamical network. Our approach generalizes and unifies the previous results on the performance measure of these networks for the case of arbitrary nodal dynamics. We bring extensions of our methodology for the case of decentralized observer-based output feedback as well as a class of composite networks. Numerous examples support our theoretical contributions.
Submitted 4 August, 2019;
originally announced August 2019.
-
Transdimensional epsilon-near-zero modes in planar plasmonic nanostructures
Authors:
Igor V. Bondarev,
Hamze Mousavi,
Vladimir M. Shalaev
Abstract:
We use quantum electrodynamics and the confinement-induced nonlocal dielectric response model based on the Keldysh-Rytova electron interaction potential to study the epsilon-near-zero modes of metallic films in the transdimensional regime. New peculiar effects are revealed such as the plasmon mode degeneracy lifting and the dipole emitter coupling to the split epsilon-near-zero modes, leading to thickness-controlled spontaneous decay with up to three-orders-of-magnitude increased rates.
Submitted 6 November, 2019; v1 submitted 1 August, 2019;
originally announced August 2019.
-
Multi-Agent Image Classification via Reinforcement Learning
Authors:
Hossein K. Mousavi,
Mohammadreza Nazari,
Martin Takáč,
Nader Motee
Abstract:
We investigate a classification problem using multiple mobile agents capable of collecting (partial) pose-dependent observations of an unknown environment. The objective is to classify an image over a finite time horizon. We propose a network architecture on how agents should form a local belief, take local actions, and extract relevant features from their raw partial observations. Agents are allowed to exchange information with their neighboring agents to update their own beliefs. It is shown how reinforcement learning techniques can be utilized to achieve decentralized implementation of the classification problem by running a decentralized consensus protocol. Our experimental results on the MNIST handwritten digit dataset demonstrate the effectiveness of our proposed framework.
Submitted 6 August, 2019; v1 submitted 12 May, 2019;
originally announced May 2019.
-
Adaptive Transform Domain Image Super-resolution Via Orthogonally Regularized Deep Networks
Authors:
Tiantong Guo,
Hojjat S. Mousavi,
Vishal Monga
Abstract:
Deep learning methods, in particular, trained Convolutional Neural Networks (CNN) have recently been shown to produce compelling results for single image Super-Resolution (SR). Invariably, a CNN is learned to map the Low Resolution (LR) image to its corresponding High Resolution (HR) version in the spatial domain. We propose a novel network structure for learning the SR mapping function in an image transform domain, specifically the Discrete Cosine Transform (DCT). As the first contribution, we show that DCT can be integrated into the network structure as a Convolutional DCT (CDCT) layer. With the CDCT layer, we construct the DCT Deep SR (DCT-DSR) network. We further extend the DCT-DSR to allow the CDCT layer to become trainable (i.e., optimizable). Because this layer represents an image transform, we enforce pairwise orthogonality constraints and newly formulated complexity order constraints on the individual basis functions/filters. This Orthogonally Regularized Deep SR network (ORDSR) simplifies the SR task by taking advantage of image transform domain while adapting the design of transform basis to the training image set. Experimental results show ORDSR achieves state-of-the-art SR image quality with fewer parameters than most of the deep CNN methods. A particular success of ORDSR is in overcoming the artifacts introduced by bicubic interpolation. A key burden of deep SR has been identified as the requirement of generous training LR and HR image pairs; ORDSR exhibits a much more graceful degradation as training size is reduced with significant benefits in the regime of limited training. Analysis of memory and computation requirements confirms that ORDSR can allow for a more efficient network with faster inference.
Submitted 22 April, 2019;
originally announced April 2019.
-
Private Inner Product Retrieval for Distributed Machine Learning
Authors:
Mohammad Hossein Mousavi,
Mohammad Ali Maddah-Ali,
Mahtab Mirmohseni
Abstract:
In this paper, we argue that in many basic algorithms for machine learning, including support vector machine (SVM) for classification, principal component analysis (PCA) for dimensionality reduction, and regression for dependency estimation, we need the inner products of the data samples, rather than the data samples themselves.
Motivated by the above observation, we introduce the problem of private inner product retrieval for distributed machine learning, where we have a system including a database of some files, duplicated across some non-colluding servers. A user intends to retrieve a subset of specific size of the inner products of the data files with minimum communication load, without revealing any information about the identity of the requested subset. For achievability, we use the algorithms for multi-message private information retrieval. For converse, we establish that as the length of the files becomes large, the set of all inner products converges to independent random variables with uniform distribution, and derive the rate of convergence. To prove that, we construct special dependencies among sequences of the sets of all inner products with different length, which forms a time-homogeneous irreducible Markov chain, without affecting the marginal distribution. We show that this Markov chain has a uniform distribution as its unique stationary distribution, with rate of convergence dominated by the second largest eigenvalue of the transition probability matrix. This allows us to develop a converse, which converges to a tight bound in some cases, as the size of the files becomes large. While this converse is based on the one in multi-message private information retrieval, due to the nature of retrieving inner products instead of data itself some changes are made to reach the desired result.
△ Less
Submitted 17 February, 2019;
originally announced February 2019.
-
Estimation with Fast Landmark Selection in Robot Visual Navigation
Authors:
Hossein K. Mousavi,
Nader Motee
Abstract:
We consider visual feature selection to improve the estimation quality required for the accurate navigation of a robot. We build upon a key property that asserts: contributions of trackable features (landmarks) appear linearly in the information matrix of the corresponding estimation problem. We utilize standard models for motion and vision system using a camera to formulate the feature selection problem over moving finite time horizons. A scalable randomized sampling algorithm is proposed to select more informative features (and ignore the rest) to achieve a superior position estimation quality. We provide probabilistic performance guarantees for our method. The time-complexity of our feature selection algorithm is linear in the number of candidate features, which is practically plausible and outperforms existing greedy methods that scale quadratically with the number of candidate features. Our numerical simulations confirm that not only is the execution time of our proposed method comparably less than that of the greedy method, but the resulting estimation quality is also very close to that of the greedy method.
Submitted 3 February, 2019;
originally announced February 2019.
-
Sparse Sensing, Communication, and Actuation via Self-Triggered Control Algorithms
Authors:
MirSaleh Bahavarnia,
Hossein K. Mousavi,
Nader Motee
Abstract:
We propose a self-triggered control algorithm to reduce onboard processor usage, communication bandwidth, and energy consumption across a linear time-invariant networked control system. We formulate an optimal control problem by penalizing the l0-measures of the feedback gain and the vector of control inputs and maximizing the dwell time between the consecutive triggering times. It is shown that the corresponding l1-relaxation of the optimal control problem is feasible and results in a stabilizing feedback control law with guaranteed performance bounds, while providing a sparse schedule for collecting samples from sensors, communication with other subsystems, and activating the input actuators.
Submitted 21 December, 2018;
originally announced December 2018.
-
Space-Time Sampling for Network Observability
Authors:
Hossein K. Mousavi,
Qiyu Sun,
Nader Motee
Abstract:
Designing sparse sampling strategies is an important component of resilient estimation and control in networked systems, as such strategies make network design problems more cost-effective due to their reduced sampling requirements and less fragile with respect to where and when samples are collected. We show under what conditions taking coarse samples from a network will contain the same amount of information as a finer set of samples. Our goal is to estimate the initial condition of linear time-invariant networks using a set of noisy measurements. The observability condition is reformulated as the frame condition, where one can easily trace the location and time stamp of each sample. We compare the estimation quality of various sampling strategies using estimation measures, which depend on the spectrum of the corresponding frame operators. Using properties of the minimal polynomial of the state matrix, deterministic and randomized methods are suggested to construct observability frames. Intrinsic tradeoffs assert that collecting samples from fewer subsystems dictates taking more samples (on average) per subsystem. Three scalable algorithms are developed to generate sparse space-time sampling strategies with explicit error bounds.
Submitted 18 July, 2019; v1 submitted 3 November, 2018;
originally announced November 2018.
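The tradeoff between sampling fewer subsystems and sampling them more often can be illustrated in the noise-free case. The sketch below is not the authors' algorithm: it assumes a hypothetical 3-node directed ring with state matrix `A`, stacks one row $e_i^T A^k$ per space-time sample (node $i$, time $k$), and checks whether those rows span the state space, which is the finite-dimensional form of a frame condition. The helper names `sampling_rows` and `rank` are illustrative only.

```python
from fractions import Fraction

# Hypothetical 3-node directed ring (an assumption for illustration).
A = [[0, 1, 0],
     [0, 0, 1],
     [1, 0, 0]]


def row_times_matrix(row, A):
    """Return the row vector row @ A."""
    n = len(A)
    return [sum(row[i] * A[i][j] for i in range(n)) for j in range(n)]


def sampling_rows(A, samples):
    """Stack one row e_i^T A^k for every space-time sample (node i, time k)."""
    n = len(A)
    rows = []
    for node, time in samples:
        row = [Fraction(int(j == node)) for j in range(n)]
        for _ in range(time):
            row = row_times_matrix(row, A)
        rows.append(row)
    return rows


def rank(rows):
    """Rank by exact Gaussian elimination over the rationals."""
    m = [r[:] for r in rows]
    rk = 0
    ncols = len(m[0]) if m else 0
    for col in range(ncols):
        pivot = next((r for r in range(rk, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(rk + 1, len(m)):
            factor = m[r][col] / m[rk][col]
            m[r] = [a - factor * b for a, b in zip(m[r], m[rk])]
        rk += 1
    return rk


# All three nodes sampled once, versus one node sampled at three times.
all_nodes_once = [(0, 0), (1, 0), (2, 0)]
one_node_thrice = [(0, 0), (0, 1), (0, 2)]
one_node_twice = [(0, 0), (0, 1)]
```

In this toy network, `all_nodes_once` and `one_node_thrice` both give full rank, so the initial condition is recoverable from either schedule, while `one_node_twice` does not: sampling a single subsystem requires more samples from it.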
-
Factorization Theorems for Relatively Prime Divisor Sums, GCD Sums and Generalized Ramanujan Sums
Authors:
Hamed Mousavi,
Maxie D. Schmidt
Abstract:
We generalize recent matrix-based factorization theorems for Lambert series generating functions generating the coefficients $(f \ast 1)(n)$ for some arithmetic function $f$. Our new factorization theorems provide analogs to these established expansions generating sums of the form $\sum_{d: (d,n)=1} f(d)$ (type I) and the Anderson-Apostol sums $\sum_{d|(m,n)} f(d) g(n/d)$ (type II) for any arithmetic functions $f$ and $g$. Our treatment of the type II sums includes a matrix-based factorization method relating the partition function $p(n)$ to arbitrary arithmetic functions $f$. We also conclude the last section of the article by directly expanding new formulas for an arithmetic function $g$ by the type II sums using discrete Fourier transforms for functions over inputs of greatest common divisors and by suitably defined orthogonal polynomial sequences whose weight function we can define by a discrete time Fourier transform (DTFT) involving the partition function $p(n)$. There are numerous applications and special cases of our new results which we are able to cite as examples in the article. Particular cases of the applications we give in the article include new identities for Euler's totient function, the Ramanujan sums $c_q(n)$, the generalized sum-of-divisors functions, the Mertens function which is the summatory function of the Möbius function, and the cyclotomic polynomials.
Submitted 19 September, 2019; v1 submitted 19 October, 2018;
originally announced October 2018.
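As a concrete instance of the type II sums, taking $f(d) = d$ and $g = \mu$ (the Möbius function) in $\sum_{d|(m,n)} f(d) g(n/d)$ gives the classical divisor-sum formula for the Ramanujan sum, written $c_q(n) = \sum_{d|(q,n)} d\,\mu(q/d)$ in the usual notation. The sketch below only checks this well-known identity numerically against the definition of $c_q(n)$ as a sum over primitive $q$-th roots of unity; it does not implement the paper's factorization theorems, and the function names are illustrative.

```python
import cmath
from math import gcd


def mobius(n):
    """Möbius function by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # n has a squared prime factor
            result = -result
        p += 1
    if n > 1:
        result = -result  # one remaining prime factor
    return result


def ramanujan_divisor_sum(q, n):
    """c_q(n) as the sum over d | gcd(q, n) of d * mu(q / d)."""
    g = gcd(q, n)
    return sum(d * mobius(q // d) for d in range(1, g + 1) if g % d == 0)


def ramanujan_exponential_sum(q, n):
    """c_q(n) from its definition as a sum of exp(2*pi*i*a*n/q), gcd(a, q) = 1."""
    total = sum(cmath.exp(2j * cmath.pi * a * n / q)
                for a in range(1, q + 1) if gcd(a, q) == 1)
    return round(total.real)
```

For example, both functions return -2 for `q = 4, n = 2`, and `ramanujan_divisor_sum(q, q)` reduces to Euler's totient function of `q`.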
-
Resilient Sparse Controller Design with Guaranteed Disturbance Attenuation
Authors:
MirSaleh Bahavarnia,
Hossein K. Mousavi
Abstract:
We design resilient sparse state-feedback controllers for a linear time-invariant (LTI) control system while attaining a pre-specified guarantee on ${\mathcal{H}}_\infty$ performance measure. We leverage a technique from non-fragile control theory to identify a region of resilient state-feedback controllers. Afterward, we explore the region to identify a sparse controller. To this end, we use two different techniques: the greedy method of sparsification, as well as the re-weighted $\ell_1$ norm minimization. Our approach highlights a tradeoff between the sparsity of the feedback gain, performance measure, and fragility of the design. To the best of our knowledge, this work is the first framework providing performance guarantees for sparse feedback gain design.
Submitted 26 September, 2019; v1 submitted 11 October, 2018;
originally announced October 2018.
-
Stability analysis of networked control systems with not necessarily UGES protocols
Authors:
Seyed Hossein Mousavi,
Navid Noroozi,
Anton H. J. de Ruiter,
Roman Geiselhart
Abstract:
This note studies (practical) asymptotic stability of nonlinear networked control systems whose protocols are not necessarily uniformly globally exponentially stable. In particular, we propose a Lyapunov-based approach to establish (practical) asymptotic stability of the networked control systems. Considering so-called modified Round Robin and Try-Once-Discard protocols, which are only uniformly globally asymptotically stable, we explicitly construct Lyapunov functions for these two protocols, which fit our proposed setting. In order to optimize the usage of communication resources, we exploit the following transmission policy: wait for a certain minimum amount of time after the last sampling instant and then check a state-dependent criterion. When the latter condition is violated, a transmission occurs. In that way, the existence of a minimum amount of time between two consecutive transmissions is established and the so-called Zeno phenomenon is therefore avoided. Finally, illustrative examples are given to verify the effectiveness of our results.
Submitted 9 October, 2018; v1 submitted 5 October, 2018;
originally announced October 2018.
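The two-stage transmission policy (enforce a minimum waiting time, then check a state-dependent criterion) can be sketched on a toy example. Everything concrete below is an assumption made for illustration and is not taken from the paper: a scalar linear plant `dx/dt = a*x + b*u`, the gain `k`, the relative-error criterion `|x_hat - x| <= sigma * |x|`, forward-Euler integration, and all parameter values. The paper itself treats nonlinear networked systems with scheduling protocols.

```python
def simulate(x0=1.0, a=1.0, b=1.0, k=3.0, sigma=0.2,
             dt=0.001, min_steps=50, total_steps=5000):
    """Simulate dx/dt = a*x + b*u with u = -k*x_hat under the two-stage policy.

    x_hat is the most recently transmitted state. Returns the final state and
    the list of step indices at which a transmission occurred.
    """
    x = x0
    x_hat = x0            # an initial transmission happens at step 0
    last_tx = 0
    tx_steps = [0]
    for step in range(1, total_steps + 1):
        u = -k * x_hat
        x += dt * (a * x + b * u)     # forward-Euler step of the plant
        # Stage 1: has the minimum waiting time elapsed?
        waited_enough = step - last_tx >= min_steps
        # Stage 2: hypothetical state-dependent criterion on the held-state error.
        criterion_holds = abs(x_hat - x) <= sigma * abs(x)
        if waited_enough and not criterion_holds:
            x_hat = x                 # transmit the current state
            last_tx = step
            tx_steps.append(step)
    return x, tx_steps


x_final, tx_steps = simulate()
gaps = [later - earlier for earlier, later in zip(tx_steps, tx_steps[1:])]
```

Because the criterion is only evaluated after `min_steps` steps, every entry of `gaps` is at least `min_steps` by construction, which is the mechanism that rules out Zeno behavior in this sketch.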
-
Koopman Performance Analysis of Nonlinear Consensus Networks
Authors:
Hossein K. Mousavi,
Christoforos Somarakis,
Qiyu Sun,
Nader Motee
Abstract:
Spectral decomposition of dynamical systems is a popular methodology to investigate the fundamental qualitative and quantitative properties of these systems and their solutions. In this chapter, we consider a class of nonlinear cooperative protocols, which consist of multiple agents that are coupled together via an undirected state-dependent graph. We develop a representation of the system solution by decomposing the nonlinear system utilizing ideas from the Koopman operator theory and its spectral analysis. We use recent results on the extensions of the well-known Hartman theorem for hyperbolic systems to establish a connection between the original nonlinear dynamics and the linearized dynamics in terms of Koopman spectral properties. The expected value of the output energy of the nonlinear protocol, which is related to the notions of coherence and robustness in dynamical networks, is evaluated and characterized in terms of Koopman eigenvalues, eigenfunctions, and modes. Spectral representation of the performance measure enables us to develop algorithmic methods to assess the performance of this class of nonlinear dynamical networks as a function of their graph topology. Finally, we propose a scalable computational method for approximation of the components of the Koopman mode decomposition, which is necessary to evaluate the systemic performance measure of the nonlinear dynamic network.
Submitted 19 April, 2019; v1 submitted 11 July, 2018;
originally announced July 2018.
-
Optical Response of Finite-Thickness Ultrathin Plasmonic Films
Authors:
Igor V. Bondarev,
Hamze Mousavi,
Vladimir M. Shalaev
Abstract:
We discuss the optical response peculiarities for ultrathin plasmonic films of finite lateral size. Due to their plasma frequency spatial dispersion caused by the spatial confinement of the electron motion, the film dielectric permittivity tensor is spatially dispersive as well and so nonlocal. Such a confinement induced nonlocality can result in peculiar magneto-optical effects. For example, the frequency dependence of the magnetic permeability of the film exhibits a sharp resonance structure shifting to the red as the film aspect ratio increases. The properly tuned ultrathin plasmonic films of finite lateral size can feature the negative refraction effect in the IR frequency range. We discuss how to control these magneto-optical properties and show that they can be tuned by adjusting the film chemical composition, plasmonic material quality, the aspect ratio, and the surroundings of the film.
Submitted 2 June, 2018;
originally announced June 2018.
-
Enhanced Signal Recovery via Sparsity Inducing Image Priors
Authors:
Hojjat Seyed Mousavi
Abstract:
Parsimony in signal representation is a topic of active research. Sparse signal processing and representation is the outcome of this line of research which has many applications in information processing and has shown significant improvement in real-world applications such as recovery, classification, clustering, super resolution, etc. This vast influence of sparse signal processing in real-world problems raises a significant need in developing novel sparse signal representation algorithms to obtain more robust systems. In such algorithms, a few open challenges remain in (a) efficiently posing sparsity on signals that can capture the structure of underlying signal and (b) the design of tractable algorithms that can recover signals under aforementioned sparse models.
Submitted 13 May, 2018;
originally announced May 2018.
-
Integral versions of input-to-state stability for dual-rate nonlinear sampled-data systems
Authors:
Navid Noroozi,
Seyed Hossein Mousavi,
Horacio J. Marquez
Abstract:
This paper presents versions of integral input-to-state stability and integral input-to-integral-state stability for nonlinear sampled-data systems, under the low measurement rate constraint. In particular, we compensate for the lack of measurements using an estimator approximately reconstructing the current state. Interestingly, under certain checkable conditions, we establish that a controller that semiglobally practically integral input-to-(integral-) state stabilizes an approximate discrete-time model of a single-rate nonlinear sampled-data system, also stabilizes the exact discrete-time model of the nonlinear sampled-data system in the same sense implemented in a dual-rate setting. Numerical simulations are given to illustrate the effectiveness of our results.
Submitted 22 April, 2018;
originally announced April 2018.
-
Deep Image Super Resolution via Natural Image Priors
Authors:
Hojjat S. Mousavi,
Tiantong Guo,
Vishal Monga
Abstract:
Single image super-resolution (SR) via deep learning has recently gained significant attention in the literature. Convolutional neural networks (CNNs) are typically learned to represent the mapping between low-resolution (LR) and high-resolution (HR) images/patches with the help of training examples. Most existing deep networks for SR produce high quality results when training data is abundant. However, their performance degrades sharply when training data is limited. We propose to regularize deep structures with prior knowledge about the images so that they can capture more structural information from the same limited data. In particular, we incorporate in a tractable fashion within the CNN framework, natural image priors which have been shown to have much recent success in imaging and vision inverse problems. Experimental results show that the proposed deep network with natural image priors is particularly effective in training starved regimes.
Submitted 8 February, 2018;
originally announced February 2018.
-
Orthogonally Regularized Deep Networks For Image Super-resolution
Authors:
Tiantong Guo,
Hojjat S. Mousavi,
Vishal Monga
Abstract:
Deep learning methods, in particular trained Convolutional Neural Networks (CNNs) have recently been shown to produce compelling state-of-the-art results for single image Super-Resolution (SR). Invariably, a CNN is learned to map the low resolution (LR) image to its corresponding high resolution (HR) version in the spatial domain. Aiming for faster inference and more efficient solutions than solving the SR problem in the spatial domain, we propose a novel network structure for learning the SR mapping function in an image transform domain, specifically the Discrete Cosine Transform (DCT). As a first contribution, we show that DCT can be integrated into the network structure as a Convolutional DCT (CDCT) layer. We further extend the network to allow the CDCT layer to become trainable (i.e. optimizable). Because this layer represents an image transform, we enforce pairwise orthogonality constraints on the individual basis functions/filters. This Orthogonally Regularized Deep SR network (ORDSR) simplifies the SR task by taking advantage of image transform domain while adapting the design of transform basis to the training image set.
Submitted 6 February, 2018;
originally announced February 2018.
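The idea of a pairwise orthogonality constraint on transform filters can be sketched with a one-dimensional DCT-II basis. This is an illustration under stated assumptions, not the ORDSR layer or its loss: the abstract does not give the exact regularizer, so the penalty below (the sum of squared inner products over all distinct filter pairs) is one plausible form, and the 1-D basis stands in for the 2-D convolutional filters a CDCT layer would use.

```python
import math


def dct_basis(n):
    """Rows of the orthonormal DCT-II matrix of size n x n."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def pairwise_orthogonality_penalty(filters):
    """Sum of squared inner products over all distinct pairs of filters."""
    penalty = 0.0
    for i in range(len(filters)):
        for j in range(i + 1, len(filters)):
            dot = sum(a * b for a, b in zip(filters[i], filters[j]))
            penalty += dot * dot
    return penalty


# The exact DCT basis incurs (numerically) no penalty.
basis = dct_basis(8)

# A perturbed copy, standing in for filters that drifted during training.
drifted = [row[:] for row in basis]
drifted[1] = [a + 0.1 * b for a, b in zip(basis[1], basis[0])]
```

Here `pairwise_orthogonality_penalty(basis)` is zero up to rounding, while the perturbed set `drifted` is penalized, which is the behavior a trainable transform layer would need from such a regularizer to stay close to an orthogonal basis.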
-
Skew cyclic codes over $\mathbb{F}_{p}+u\mathbb{F}_{p}$
Authors:
Reza Dastbasteh,
Seyyed Hamed Mousavi,
Taher Abualrub,
Nuh Aydin,
Javad Haghighat
Abstract:
In this paper, we study skew cyclic codes with arbitrary length over the ring $R=\mathbb{F}_{p}+u\mathbb{F}_{p}$ where $p$ is an odd prime and $u^{2}=0$. We characterize all skew cyclic codes of length $n$ as left $R[x;\theta]$-submodules of $R_{n}=R[x;\theta]/\langle x^{n}-1\rangle$. We find all generator polynomials for these codes and describe their minimal spanning sets. Moreover, an encoding and decoding algorithm is presented for skew cyclic codes over the ring $R$. Finally, based on the theory we developed in this paper, we provide examples of codes with good parameters over $\mathbb{F}_{p}$ for different odd primes $p$. In fact, example 25 in our paper is a new ternary code in the class of quasi-twisted codes. The other examples we provided are examples of optimal codes.
Submitted 20 December, 2017;
originally announced December 2017.
-
Femtosecond CDMA Using Dielectric Metasurfaces: Design Procedure and Challenges
Authors:
Taha Rajabzadeh,
Mohammad Hosein Mousavi,
Sajjad Abdollahramezani,
Mohammad Vahid Jamali,
Jawad A. Salehi
Abstract:
Inspired by the ever-increasing demand for higher data transmission rates and the tremendous attention toward all-optical signal processing based on miniaturized nanophotonics, in this paper, for the first time, we investigate the integrable design of coherent ultrashort light pulse code-division multiple-access (CDMA) technique, also known as femtosecond CDMA, using all-dielectric metasurfaces (MSs). In this technique, the data bits are firstly modulated using ultrashort femtosecond optical pulses generated by mode-locked lasers, and then by employing a unique phase metamask for each data stream, in order to provide the multiple access capability, the optical signals are spectrally encoded. This procedure spreads the optical signal in the temporal domain and generates low-intensity pseudo-noise bursts through random phase coding leading to minimized multiple access interference. This paper comprehensively presents the principles and design approach to realize fundamental components of a typical femtosecond CDMA encoder, including the grating, lens, and phase mask, by employing high-contrast CMOS-compatible MSs. By controlling the interference between the provided Mie and Fabry-Perot resonance modes, we tailor the spectral and spatial responses of the impinging light locally and independently. Accordingly, we design a MS-based grating with the highest possible refracted angle and, in the meantime, the maximized efficiency which results in a reasonable diameter for the subsequent lens. Moreover, to design our MS-based lens commensurate with the spot size and distance requirements of the pursuant phase mask, we leverage a new optimization method which splits the lens structure into central and peripheral parts, and then design the peripheral part using a collection of gratings converging the impinging light at the subsequent phase mask.
Submitted 3 December, 2017;
originally announced December 2017.
-
Lower Bounds on Regular Expression Size
Authors:
Hamoon Mousavi
Abstract:
We introduce linear programs encoding regular expressions of finite languages. We show that, given a language, the optimum value of the associated linear program is a lower bound on the size of any regular expression of the language. Moreover, we show that any regular expression can be turned into a dual feasible solution with an objective value that is equal to the size of the regular expression. For binomial languages we can relax the associated linear program using the duality theorem. We use this relaxation to prove lower bounds on the size of regular expressions of binomial and threshold languages.
Submitted 6 December, 2017; v1 submitted 3 December, 2017;
originally announced December 2017.
-
Dual-carrier Floquet circulator with time-modulated optical resonators
Authors:
Ian A. D. Williamson,
S. Hossein Mousavi,
Zheng Wang
Abstract:
Spatio-temporal modulation has shown great promise as a strong time-reversal symmetry breaking mechanism that enables integrated nonreciprocal devices and topological materials at optical frequencies. However, optical modulation has its own constraints in terms of modulation index and frequency, which limit the bandwidth and miniaturization of circulators and isolators, not unlike the magneto-optical schemes that it promises to replace. Here we propose and numerically demonstrate a Floquet circulator that leverages the untapped degrees of freedom unique to time-modulated resonators. Excited by sideband-selective waveguides, the system supports broadband nonreciprocal transmission without relying on the mirror or rotational symmetries required in conventional circulators. Cascading two resonators, we create a linear three-port circulator that exhibits complete and frequency-independent forward transmission between two of the ports. This approach enables wavelength-scale circulators that can rely on a variety of modulation mechanisms.
Submitted 14 August, 2017;
originally announced August 2017.
-
Near-field imaging of spin-locked edge states in all-dielectric topological metasurfaces
Authors:
A. Slobozhanyuk,
A. V. Shchelokova,
X. Ni,
S. H. Mousavi,
D. A. Smirnova,
P. A. Belov,
A. Alù,
Y. S. Kivshar,
A. B. Khanikaev
Abstract:
A new class of phenomena stemming from topological states of quantum matter has recently found a variety of analogies in classical systems. Spin-locking and one-way propagation have been shown to drastically alter our view on scattering of electromagnetic waves, thus offering an unprecedented robustness to defects and disorder. Despite these successes, bringing these new ideas to practical grounds meets a number of serious limitations. In photonics, when it is crucial to implement topological photonic devices on a chip, two major challenges are associated with electromagnetic dissipation into heat and out-of-plane radiation into free space. Both these mechanisms may destroy the topological state and seriously affect the device performance. Here we experimentally demonstrate that the topological order for light can be implemented in all-dielectric on-chip prototype metasurfaces, which mitigate the effect of Ohmic losses by using exclusively dielectric materials, and reveal that coupling of the system to the radiative continuum does not affect the topological properties. Spin-Hall effect of light for spin-polarized topological edge states is revealed through near-field spectroscopy measurements.
Submitted 22 May, 2017;
originally announced May 2017.