-
Topological asymptotic dimension
Authors:
Massoud Amini
Abstract:
We initiate a study of asymptotic dimension for locally compact groups. This notion extends the existing invariant for discrete groups and is shown to be finite for a large class of residually compact groups. Along the way, the notion of Hirsch length is extended to topological groups and classical results of Hirsch and Malcev are extended using a topological version of the Poincaré lemma. We show…
▽ More
We initiate a study of asymptotic dimension for locally compact groups. This notion extends the existing invariant for discrete groups and is shown to be finite for a large class of residually compact groups. Along the way, the notion of Hirsch length is extended to topological groups and classical results of Hirsch and Malcev are extended using a topological version of the Poincaré lemma. We show that polycyclic-by-compact groups and compactly generated, topologically virtually nilpotent groups are residually compact, and that compactly generated nilpotent groups are polycyclic-by-compact. We prove that for compactly generated, solvable-by-compact groups the asymptotic dimension is majorized by the Hirsch length, and equality holds for polycyclic-by-compact groups. We extend the class of elementary amenable groups beyond the discrete case and show that topologically elementary amenable groups with finite Hirsch length have finite asymptotic dimension. We prove that a topologically elementary amenable group of finite Hirsch length with no nontrivial locally elliptic normal closed subgroup is solvable-by-compact. Finally, we show that a totally disconnected, locally compact, second countable [SIN]-group has finite asymptotic dimension, if all of its discrete quotients are so.
△ Less
Submitted 16 April, 2024; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Enhanced thermoelectric efficiency of zigzag bilayer phosphorene nanoribbon; edge states engineering
Authors:
Shima Sodagar,
Hossein Karbaschi,
Morteza Soltani,
M. Amini
Abstract:
We theoretically investigate the thermoelectric properties of zigzag bilayer phosphorene nanoribbons (ZBPNR). We first, draw an analogy between the extended Su-Schrieffer-Heeger (SSH) ladder and ZBPNR edge states and obtain their corresponding band structure and wave functions analytically. Then, by applying the energy filtering method, we show that the electric power and thermoelectric efficiency…
▽ More
We theoretically investigate the thermoelectric properties of zigzag bilayer phosphorene nanoribbons (ZBPNR). We first, draw an analogy between the extended Su-Schrieffer-Heeger (SSH) ladder and ZBPNR edge states and obtain their corresponding band structure and wave functions analytically. Then, by applying the energy filtering method, we show that the electric power and thermoelectric efficiency of the ZBPNRs can be improved remarkably in the presence of mid-gap edge states. We also argue how to engineer the edge modes to further optimize thermoelectric power and efficiency of the system by applying periodic point potentials at the boundaries.
△ Less
Submitted 2 March, 2022;
originally announced March 2022.
-
Quasitriangular operator algebras
Authors:
Massoud Amini,
Mehdi Moradi,
Ismaeil Mousavi
Abstract:
We give characterizations of quasitriangular operator algebras along the line of Voiculescu's characterization of quasidiagonal $C^*$-algebras.
We give characterizations of quasitriangular operator algebras along the line of Voiculescu's characterization of quasidiagonal $C^*$-algebras.
△ Less
Submitted 24 October, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation (Extended Abstract)
Authors:
Aleksandra Burashnikova,
Yury Maximov,
Marianne Clausel,
Charlotte Laclau,
Franck Iutzeler,
Massih-Reza Amini
Abstract:
This paper is an extended version of [Burashnikova et al., 2021, arXiv: 2012.06910], where we proposed a theoretically supported sequential strategy for training a large-scale Recommender System (RS) over implicit feedback, mainly in the form of clicks. The proposed approach consists in minimizing pairwise ranking loss over blocks of consecutive items constituted by a sequence of non-clicked items…
▽ More
This paper is an extended version of [Burashnikova et al., 2021, arXiv: 2012.06910], where we proposed a theoretically supported sequential strategy for training a large-scale Recommender System (RS) over implicit feedback, mainly in the form of clicks. The proposed approach consists in minimizing pairwise ranking loss over blocks of consecutive items constituted by a sequence of non-clicked items followed by a clicked one for each user. We present two variants of this strategy where model parameters are updated using either the momentum method or a gradient-based approach. To prevent updating the parameters for an abnormally high number of clicks over some targeted items (mainly due to bots), we introduce an upper and a lower threshold on the number of updates for each user. These thresholds are estimated over the distribution of the number of blocks in the training set. They affect the decision of RS by shifting the distribution of items that are shown to the users. Furthermore, we provide a convergence analysis of both algorithms and demonstrate their practical efficiency over six large-scale collections with respect to various ranking measures.
△ Less
Submitted 26 February, 2022;
originally announced February 2022.
-
Development of a Model Predictive Airpath Controller for a Diesel Engine on a High-Fidelity Engine Model with Transient Thermal Dynamics
Authors:
Jiadi Zhang,
Mohammad Reza Amini,
Ilya Kolmanovsky,
Munechika Tsutsumi,
Hayato Nakada
Abstract:
This paper presents the results of a model predictive controller (MPC) development for diesel engine air-path regulation. The control objective is to track the intake manifold pressure and exhaust gas recirculation (EGR) rate targets by manipulating the EGR valve and variable geometry turbine (VGT) while satisfying state and control constraints. The MPC controller is designed and verified using a…
▽ More
This paper presents the results of a model predictive controller (MPC) development for diesel engine air-path regulation. The control objective is to track the intake manifold pressure and exhaust gas recirculation (EGR) rate targets by manipulating the EGR valve and variable geometry turbine (VGT) while satisfying state and control constraints. The MPC controller is designed and verified using a high-fidelity engine model in GT-Power. The controller exploits a low-order rate-based linear parameter-varying (LPV) model for prediction which is identified from transient response data generated by the GT-Power model. It is shown that transient engine thermal dynamics influence the airpath dynamics, specifically the intake manifold pressure response, however, MPC demonstrates robustness against inaccuracies in modeling these thermal dynamics. In particular, we show that MPC can be successfully implemented using a rate-based prediction model with two inputs (EGR and VGT positions) identified from data with steady-state wall temperature dynamics, however, closed-loop performance can be improved if a prediction model (i) is identified from data with transient thermal dynamics, and (ii) has the fuel injection rate as extra model input. Further, the MPC calibration process across the engine operating range to achieve improved performance is addressed. As the MPC calibration is shown to be sensitive to the operating conditions, a fast calibration process is proposed.
△ Less
Submitted 25 February, 2022;
originally announced February 2022.
-
Self-Training: A Survey
Authors:
Massih-Reza Amini,
Vasilii Feofanov,
Loic Pauletto,
Lies Hadjadj,
Emilie Devijver,
Yury Maximov
Abstract:
Semi-supervised algorithms aim to learn prediction functions from a small set of labeled observations and a large set of unlabeled observations. Because this framework is relevant in many applications, they have received a lot of interest in both academia and industry. Among the existing techniques, self-training methods have undoubtedly attracted greater attention in recent years. These models ar…
▽ More
Semi-supervised algorithms aim to learn prediction functions from a small set of labeled observations and a large set of unlabeled observations. Because this framework is relevant in many applications, they have received a lot of interest in both academia and industry. Among the existing techniques, self-training methods have undoubtedly attracted greater attention in recent years. These models are designed to find the decision boundary on low density regions without making additional assumptions about the data distribution, and use the unsigned output score of a learned classifier, or its margin, as an indicator of confidence. The working principle of self-training algorithms is to learn a classifier iteratively by assigning pseudo-labels to the set of unlabeled training samples with a margin greater than a certain threshold. The pseudo-labeled examples are then used to enrich the labeled training data and to train a new classifier in conjunction with the labeled training set. In this paper, we present self-training methods for binary and multi-class classification; as well as their variants and two related approaches, namely consistency-based approaches and transductive learning. We examine the impact of significant self-training features on various methods, using different general and image classification benchmarks, and we discuss our ideas for future research in self-training. To the best of our knowledge, this is the first thorough and complete survey on this subject.
△ Less
Submitted 27 May, 2024; v1 submitted 24 February, 2022;
originally announced February 2022.
-
The application of Evolutionary and Nature Inspired Algorithms in Data Science and Data Analytics
Authors:
Farid Ghareh Mohammadi,
Farzan Shenavarmasouleh,
Khaled Rasheed,
Thiab Taha,
M. Hadi Amini,
Hamid R. Arabnia
Abstract:
In the past 30 years, scientists have searched nature, including animals and insects, and biology in order to discover, understand, and model solutions for solving large-scale science challenges. The study of bionics reveals that how the biological structures, functions found in nature have improved our modern technologies. In this study, we present our discovery of evolutionary and nature-inspire…
▽ More
In the past 30 years, scientists have searched nature, including animals and insects, and biology in order to discover, understand, and model solutions for solving large-scale science challenges. The study of bionics reveals that how the biological structures, functions found in nature have improved our modern technologies. In this study, we present our discovery of evolutionary and nature-inspired algorithms applications in Data Science and Data Analytics in three main topics of pre-processing, supervised algorithms, and unsupervised algorithms. Among all applications, in this study, we aim to investigate four optimization algorithms that have been performed using the evolutionary and nature-inspired algorithms within data science and analytics. Feature selection optimization in pre-processing section, Hyper-parameter tuning optimization, and knowledge discovery optimization in supervised algorithms, and clustering optimization in the unsupervised algorithms.
△ Less
Submitted 6 February, 2022;
originally announced February 2022.
-
Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers
Authors:
Amir Ardalan Kalantari,
Mohammad Amini,
Sarath Chandar,
Doina Precup
Abstract:
Much of recent Deep Reinforcement Learning success is owed to the neural architecture's potential to learn and use effective internal representations of the world. While many current algorithms access a simulator to train with a large amount of data, in realistic settings, including while playing games that may be played against people, collecting experience can be quite costly. In this paper, we…
▽ More
Much of recent Deep Reinforcement Learning success is owed to the neural architecture's potential to learn and use effective internal representations of the world. While many current algorithms access a simulator to train with a large amount of data, in realistic settings, including while playing games that may be played against people, collecting experience can be quite costly. In this paper, we introduce a deep reinforcement learning architecture whose purpose is to increase sample efficiency without sacrificing performance. We design this architecture by incorporating advances achieved in recent years in the field of Natural Language Processing and Computer Vision. Specifically, we propose a visually attentive model that uses transformers to learn a self-attention mechanism on the feature maps of the state representation, while simultaneously optimizing return. We demonstrate empirically that this architecture improves sample complexity for several Atari environments, while also achieving better performance in some of the games.
△ Less
Submitted 1 February, 2022;
originally announced February 2022.
-
Self Semi Supervised Neural Architecture Search for Semantic Segmentation
Authors:
Loïc Pauletto,
Massih-Reza Amini,
Nicolas Winckler
Abstract:
In this paper, we propose a Neural Architecture Search strategy based on self supervision and semi-supervised learning for the task of semantic segmentation. Our approach builds an optimized neural network (NN) model for this task by jointly solving a jigsaw pretext task discovered with self-supervised learning over unlabeled training data, and, exploiting the structure of the unlabeled data with…
▽ More
In this paper, we propose a Neural Architecture Search strategy based on self supervision and semi-supervised learning for the task of semantic segmentation. Our approach builds an optimized neural network (NN) model for this task by jointly solving a jigsaw pretext task discovered with self-supervised learning over unlabeled training data, and, exploiting the structure of the unlabeled data with semi-supervised learning. The search of the architecture of the NN model is performed by dynamic routing using a gradient descent algorithm. Experiments on the Cityscapes and PASCAL VOC 2012 datasets demonstrate that the discovered neural network is more efficient than a state-of-the-art hand-crafted NN model with four times less floating operations.
△ Less
Submitted 31 January, 2022; v1 submitted 29 January, 2022;
originally announced January 2022.
-
Topological boundaries of covariant representations
Authors:
Massoud Amini,
Sajad Zavar
Abstract:
We associate a boundary $\mathcal B_{π,u}$ to each covariant representation $(π,u,H)$ of $C^*$-dynamical system $(G,A,α)$ and study the action of $G$ on $\mathcal B_{π,u}$ and its amenability properties. We relate rigidity properties of traces on the associated crossed product C*-algebra to faithfulness of action of the group on this boundary.
We associate a boundary $\mathcal B_{π,u}$ to each covariant representation $(π,u,H)$ of $C^*$-dynamical system $(G,A,α)$ and study the action of $G$ on $\mathcal B_{π,u}$ and its amenability properties. We relate rigidity properties of traces on the associated crossed product C*-algebra to faithfulness of action of the group on this boundary.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
OptABC: an Optimal Hyperparameter Tuning Approach for Machine Learning Algorithms
Authors:
Leila Zahedi,
Farid Ghareh Mohammadi,
M. Hadi Amini
Abstract:
Hyperparameter tuning in machine learning algorithms is a computationally challenging task due to the large-scale nature of the problem. In order to develop an efficient strategy for hyper-parameter tuning, one promising solution is to use swarm intelligence algorithms. Artificial Bee Colony (ABC) optimization lends itself as a promising and efficient optimization algorithm for this purpose. Howev…
▽ More
Hyperparameter tuning in machine learning algorithms is a computationally challenging task due to the large-scale nature of the problem. In order to develop an efficient strategy for hyper-parameter tuning, one promising solution is to use swarm intelligence algorithms. Artificial Bee Colony (ABC) optimization lends itself as a promising and efficient optimization algorithm for this purpose. However, in some cases, ABC can suffer from a slow convergence rate or execution time due to the poor initial population of solutions and expensive objective functions. To address these concerns, a novel algorithm, OptABC, is proposed to help ABC algorithm in faster convergence toward a near-optimum solution. OptABC integrates artificial bee colony algorithm, K-Means clustering, greedy algorithm, and opposition-based learning strategy for tuning the hyper-parameters of different machine learning models. OptABC employs these techniques in an attempt to diversify the initial population, and hence enhance the convergence ability without significantly decreasing the accuracy. In order to validate the performance of the proposed method, we compare the results with previous state-of-the-art approaches. Experimental results demonstrate the effectiveness of the OptABC compared to existing approaches in the literature.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Evidence of nodal superconductivity in monolayer 1H-TaS$_2$ with hidden order fluctuations
Authors:
Viliam Vaňo,
Somesh Chandra Ganguli,
Mohammad Amini,
Linghao Yan,
Maryam Khosravian,
Guangze Chen,
Shawulienu Kezilebieke,
Jose L. Lado,
Peter Liljeroth
Abstract:
Unconventional superconductors represent one of the fundamental directions in modern quantum materials research. In particular, nodal superconductors are known to appear naturally in strongly correlated systems, including cuprate superconductors and heavy-fermion systems. Van der Waals materials hosting superconducting states are well known, yet nodal monolayer van der Waals superconductors have r…
▽ More
Unconventional superconductors represent one of the fundamental directions in modern quantum materials research. In particular, nodal superconductors are known to appear naturally in strongly correlated systems, including cuprate superconductors and heavy-fermion systems. Van der Waals materials hosting superconducting states are well known, yet nodal monolayer van der Waals superconductors have remained elusive. Here, using low-temperature scanning tunneling microscopy (STM) and spectroscopy (STS) experiments, we show that pristine monolayer 1H-TaS$_2$ realizes a nodal superconducting state. By including non-magnetic disorder, we drive the nodal superconducting state to a conventional gapped s-wave state. Furthermore, we observe the emergence of many-body excitations close to the gap edge, signalling a potential unconventional pairing mechanism. Our results demonstrate the emergence of nodal superconductivity in a van der Waals monolayer, providing a building block for van der Waals heterostructures exploiting unconventional superconducting states.
△ Less
Submitted 21 August, 2023; v1 submitted 14 December, 2021;
originally announced December 2021.
-
Recommender systems: when memory matters
Authors:
Aleksandra Burashnikova,
Marianne Clausel,
Massih-Reza Amini,
Yury Maximov,
Nicolas Dante
Abstract:
In this paper, we study the effect of long memory in the learnability of a sequential recommender system including users' implicit feedback. We propose an online algorithm, where model parameters are updated user per user over blocks of items constituted by a sequence of unclicked items followed by a clicked one. We illustrate through thorough empirical evaluations that filtering users with respec…
▽ More
In this paper, we study the effect of long memory in the learnability of a sequential recommender system including users' implicit feedback. We propose an online algorithm, where model parameters are updated user per user over blocks of items constituted by a sequence of unclicked items followed by a clicked one. We illustrate through thorough empirical evaluations that filtering users with respect to the degree of long memory contained in their interactions with the system allows to substantially gain in performance with respect to MAP and NDCG, especially in the context of training large-scale Recommender Systems.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
Bilingual Topic Models for Comparable Corpora
Authors:
Georgios Balikas,
Massih-Reza Amini,
Marianne Clausel
Abstract:
Probabilistic topic models like Latent Dirichlet Allocation (LDA) have been previously extended to the bilingual setting. A fundamental modeling assumption in several of these extensions is that the input corpora are in the form of document pairs whose constituent documents share a single topic distribution. However, this assumption is strong for comparable corpora that consist of documents themat…
▽ More
Probabilistic topic models like Latent Dirichlet Allocation (LDA) have been previously extended to the bilingual setting. A fundamental modeling assumption in several of these extensions is that the input corpora are in the form of document pairs whose constituent documents share a single topic distribution. However, this assumption is strong for comparable corpora that consist of documents thematically similar to an extent only, which are, in turn, the most commonly available or easy to obtain. In this paper we relax this assumption by proposing for the paired documents to have separate, yet bound topic distributions. % a binding mechanism between the distributions of the paired documents. We suggest that the strength of the bound should depend on each pair's semantic similarity. To estimate the similarity of documents that are written in different languages we use cross-lingual word embeddings that are learned with shallow neural networks. We evaluate the proposed binding mechanism by extending two topic models: a bilingual adaptation of LDA that assumes bag-of-words inputs and a model that incorporates part of the text structure in the form of boundaries of semantically coherent segments. To assess the performance of the novel topic models we conduct intrinsic and extrinsic experiments on five bilingual, comparable corpora of English documents with French, German, Italian, Spanish and Portuguese documents. The results demonstrate the efficiency of our approach in terms of both topic coherence measured by the normalized point-wise mutual information, and generalization performance measured by perplexity and in terms of Mean Reciprocal Rank in a cross-lingual document retrieval task for each of the language pairs.
△ Less
Submitted 30 November, 2021;
originally announced November 2021.
-
Self-Training of Halfspaces with Generalization Guarantees under Massart Mislabeling Noise Model
Authors:
Lies Hadjadj,
Massih-Reza Amini,
Sana Louhichi,
Alexis Deschamps
Abstract:
We investigate the generalization properties of a self-training algorithm with halfspaces. The approach learns a list of halfspaces iteratively from labeled and unlabeled training data, in which each iteration consists of two steps: exploration and pruning. In the exploration phase, the halfspace is found sequentially by maximizing the unsigned-margin among unlabeled examples and then assigning ps…
▽ More
We investigate the generalization properties of a self-training algorithm with halfspaces. The approach learns a list of halfspaces iteratively from labeled and unlabeled training data, in which each iteration consists of two steps: exploration and pruning. In the exploration phase, the halfspace is found sequentially by maximizing the unsigned-margin among unlabeled examples and then assigning pseudo-labels to those that have a distance higher than the current threshold. The pseudo-labeled examples are then added to the training set, and a new classifier is learned. This process is repeated until no more unlabeled examples remain for pseudo-labeling. In the pruning phase, pseudo-labeled samples that have a distance to the last halfspace greater than the associated unsigned-margin are then discarded. We prove that the misclassification error of the resulting sequence of classifiers is bounded and show that the resulting semi-supervised approach never degrades performance compared to the classifier learned using only the initial labeled training set. Experiments carried out on a variety of benchmarks demonstrate the efficiency of the proposed approach compared to state-of-the-art methods.
△ Less
Submitted 15 February, 2022; v1 submitted 29 November, 2021;
originally announced November 2021.
-
A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling
Authors:
Eustache Diemert,
Artem Betlei,
Christophe Renaudin,
Massih-Reza Amini,
Théophane Gregoir,
Thibaud Rahier
Abstract:
Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collec…
▽ More
Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collection of 13.9 million samples collected from several randomized control trials, scaling up previously available datasets by a healthy 210x factor. We provide details on the data collection and perform sanity checks to validate the use of this data for causal inference tasks. First, we formalize the task of uplift modeling (UM) that can be performed with this data, along with the relevant evaluation metrics. Then, we propose synthetic response surfaces and heterogeneous treatment assignment providing a general set-up for ITE prediction. Finally, we report experiments to validate key characteristics of the dataset leveraging its size to evaluate and compare - with high statistical significance - a selection of baseline UM and ITE prediction methods.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
Genomic Data Analysis using a Two Stage Expectation Propagation Algorithm for Analysis of Sparse Bayesian High-Dimensional Instrumental Variables Regression
Authors:
Morteza Amini
Abstract:
Simultaneous analysis of gene expression data and genetic variants is highly of interest, especially when the number of gene expressions and genetic variants are both greater than the sample size. Association of both causal genes and effective SNPs makes the use of sparse modeling of such genetic data sets, highly important. The high-dimensional sparse instrumental variables models are one of such…
▽ More
Simultaneous analysis of gene expression data and genetic variants is highly of interest, especially when the number of gene expressions and genetic variants are both greater than the sample size. Association of both causal genes and effective SNPs makes the use of sparse modeling of such genetic data sets, highly important. The high-dimensional sparse instrumental variables models are one of such useful association models, which models the simultaneous relation of the gene expressions and genetic variants with complex traits. From a Bayesian viewpoint, the sparsity can be favored using sparsity-enforcing priors such as spike-and-slab priors. A two-stage modification of the expectation propagation (EP) algorithm is proposed and examined for approximate inference in high-dimensional sparse instrumental variables models with spike-and-slab priors. This method is an adoption of the classical two-stage least squares method, to be used with the Bayes context. A simulation study is performed to examine the performance of the methods. The proposed method is applied to analysis of the mouse obesity data.
△ Less
Submitted 6 October, 2021;
originally announced October 2021.
-
Multi-class Probabilistic Bounds for Self-learning
Authors:
Vasilii Feofanov,
Emilie Devijver,
Massih-Reza Amini
Abstract:
Self-learning is a classical approach for learning with both labeled and unlabeled observations which consists in giving pseudo-labels to unlabeled training instances with a confidence score over a predetermined threshold. At the same time, the pseudo-labeling technique is prone to error and runs the risk of adding noisy labels into unlabeled training data. In this paper, we present a probabilisti…
▽ More
Self-learning is a classical approach for learning with both labeled and unlabeled observations which consists in giving pseudo-labels to unlabeled training instances with a confidence score over a predetermined threshold. At the same time, the pseudo-labeling technique is prone to error and runs the risk of adding noisy labels into unlabeled training data. In this paper, we present a probabilistic framework for analyzing self-learning in the multi-class classification scenario with partially labeled data. First, we derive a transductive bound over the risk of the multi-class majority vote classifier. Based on this result, we propose to automatically choose the threshold for pseudo-labeling that minimizes the transductive bound. Then, we introduce a mislabeling error model to analyze the error of the majority vote classifier in the case of the pseudo-labeled data. We derive a probabilistic C-bound over the majority vote error when an imperfect label is given. Empirical results on different data sets show the effectiveness of our framework compared to several state-of-the-art semi-supervised approaches.
△ Less
Submitted 29 September, 2021;
originally announced September 2021.
-
Hhsmm: An R package for hidden hybrid Markov/semi-Markov models
Authors:
Morteza Amini,
Afarin Bayat,
Reza Salehian
Abstract:
This paper introduces the hhsmm R package, which involves functions for initializing, fitting, and predication of hidden hybrid Markov/semi-Markov models. These models are flexible models with both Markovian and semi-Markovian states, which are applied to situations where the model involves absorbing or macro-states. The left-to-right models and the models with series/parallel networks of states a…
▽ More
This paper introduces the hhsmm R package, which involves functions for initializing, fitting, and predication of hidden hybrid Markov/semi-Markov models. These models are flexible models with both Markovian and semi-Markovian states, which are applied to situations where the model involves absorbing or macro-states. The left-to-right models and the models with series/parallel networks of states are two models with Markovian and semi-Markovian states. The hhsmm also includes Markov/semi-Markov switching regression model as well as the auto-regressive HHSMM, the nonparametric estimation of the emission distribution using penalized B-splines, prediction of future states and the residual useful lifetime estimation in the predict function. The commercial modular aero-propulsion system simulation (C-MAPSS) data-set is also included in the package, which is used for illustration of the application of the package features. The application of the hhsmm package to the analysis and prediction of the Spain's energy demand is also presented.
△ Less
Submitted 28 May, 2022; v1 submitted 26 September, 2021;
originally announced September 2021.
-
Data Analytics for Smart cities: Challenges and Promises
Authors:
Farid Ghareh Mohammadi,
Farzan Shenavarmasouleh,
M. Hadi Amini,
Hamid R. Arabnia
Abstract:
The explosion of advancements in artificial intelligence, sensor technologies, and wireless communication activates ubiquitous sensing through distributed sensors. These sensors are various domains of networks that lead us to smart systems in healthcare, transportation, environment, and other relevant branches/networks. Having collaborative interaction among the smart systems connects end-user dev…
▽ More
The explosion of advancements in artificial intelligence, sensor technologies, and wireless communication activates ubiquitous sensing through distributed sensors. These sensors are various domains of networks that lead us to smart systems in healthcare, transportation, environment, and other relevant branches/networks. Having collaborative interaction among the smart systems connects end-user devices to each other which enables achieving a new integrated entity called Smart Cities. The goal of this study is to provide a comprehensive survey of data analytics in smart cities. In this paper, we aim to focus on one of the smart cities important branches, namely Smart Mobility, and its positive ample impact on the smart cities decision-making process. Intelligent decision-making systems in smart mobility offer many advantages such as saving energy, relaying city traffic, and more importantly, reducing air pollution by offering real-time useful information and imperative knowledge. Making a decision in smart cities in time is challenging due to various and high dimensional factors and parameters, which are not frequently collected. In this paper, we first address current challenges in smart cities and provide an overview of potential solutions to these challenges. Then, we offer a framework of these solutions, called universal smart cities decision making, with three main sections of data capturing, data analysis, and decision making to optimize the smart mobility within smart cities. With this framework, we elaborate on fundamental concepts of big data, machine learning, and deep leaning algorithms that have been applied to smart cities and discuss the role of these algorithms in decision making for smart mobility in smart cities.
△ Less
Submitted 12 September, 2021;
originally announced September 2021.
-
HyP-ABC: A Novel Automated Hyper-Parameter Tuning Algorithm Using Evolutionary Optimization
Authors:
Leila Zahedi,
Farid Ghareh Mohammadi,
M. Hadi Amini
Abstract:
Machine learning techniques lend themselves as promising decision-making and analytic tools in a wide range of applications. Different ML algorithms have various hyper-parameters. In order to tailor an ML model towards a specific application, a large number of hyper-parameters should be tuned. Tuning the hyper-parameters directly affects the performance (accuracy and run-time). However, for large-…
▽ More
Machine learning techniques lend themselves as promising decision-making and analytic tools in a wide range of applications. Different ML algorithms have various hyper-parameters. In order to tailor an ML model towards a specific application, a large number of hyper-parameters should be tuned. Tuning the hyper-parameters directly affects the performance (accuracy and run-time). However, for large-scale search spaces, efficiently exploring the ample number of combinations of hyper-parameters is computationally challenging. Existing automated hyper-parameter tuning techniques suffer from high time complexity. In this paper, we propose HyP-ABC, an automatic innovative hybrid hyper-parameter optimization algorithm using the modified artificial bee colony approach, to measure the classification accuracy of three ML algorithms, namely random forest, extreme gradient boosting, and support vector machine. Compared to the state-of-the-art techniques, HyP-ABC is more efficient and has a limited number of parameters to be tuned, making it worthwhile for real-world hyper-parameter optimization problems. We further compare our proposed HyP-ABC algorithm with state-of-the-art techniques. In order to ensure the robustness of the proposed method, the algorithm takes a wide range of feasible hyper-parameter values, and is tested using a real-world educational dataset.
△ Less
Submitted 11 September, 2021;
originally announced September 2021.
-
Simple tracially $\mathcal{Z}$-absorbing C*-algebras
Authors:
Massoud Amini,
Nasser Golestani,
Saeid Jamali,
N. Christopher Phillips
Abstract:
We define a notion of tracial $\mathcal{Z}$-absorption for simple not necessarily unital C*-algebras, study it systematically, and prove its permanence properties. This extends the notion defined by Hirshberg and Orovitz for unital C*-algebras. The Razak-Jacelon algebra, simple C*-algebras with tracial rank zero, and simple purely infinite C*-algebras are tracially $\mathcal{Z}$-absorbing. We obta…
▽ More
We define a notion of tracial $\mathcal{Z}$-absorption for simple not necessarily unital C*-algebras, study it systematically, and prove its permanence properties. This extends the notion defined by Hirshberg and Orovitz for unital C*-algebras. The Razak-Jacelon algebra, simple C*-algebras with tracial rank zero, and simple purely infinite C*-algebras are tracially $\mathcal{Z}$-absorbing. We obtain the first purely infinite examples of tracially $\mathcal{Z}$-absorbing C*-algebras which are not $\mathcal{Z}$-absorbing. We use techniques from reduced free products of von~Neumann algebras to construct these examples. A stably finite example was given by Z. Niu and Q. Wang in 2021. We study the Cuntz semigroup of a simple tracially $\mathcal{Z}$-absorbing C*-algebra and prove that it is almost unperforated and the algebra is weakly almost divisible.
△ Less
Submitted 24 March, 2022; v1 submitted 11 September, 2021;
originally announced September 2021.
-
Embodied AI-Driven Operation of Smart Cities: A Concise Review
Authors:
Farzan Shenavarmasouleh,
Farid Ghareh Mohammadi,
M. Hadi Amini,
Hamid R. Arabnia
Abstract:
A smart city can be seen as a framework, comprised of Information and Communication Technologies (ICT). An intelligent network of connected devices that collect data with their sensors and transmit them using cloud technologies in order to communicate with other assets in the ecosystem plays a pivotal role in this framework. Maximizing the quality of life of citizens, making better use of resource…
▽ More
A smart city can be seen as a framework, comprised of Information and Communication Technologies (ICT). An intelligent network of connected devices that collect data with their sensors and transmit them using cloud technologies in order to communicate with other assets in the ecosystem plays a pivotal role in this framework. Maximizing the quality of life of citizens, making better use of resources, cutting costs, and improving sustainability are the ultimate goals that a smart city is after. Hence, data collected from connected devices will continuously get thoroughly analyzed to gain better insights into the services that are being offered across the city; with this goal in mind that they can be used to make the whole system more efficient. Robots and physical machines are inseparable parts of a smart city. Embodied AI is the field of study that takes a deeper look into these and explores how they can fit into real-world environments. It focuses on learning through interaction with the surrounding environment, as opposed to Internet AI which tries to learn from static datasets. Embodied AI aims to train an agent that can See (Computer Vision), Talk (NLP), Navigate and Interact with its environment (Reinforcement Learning), and Reason (General Intelligence), all at the same time. Autonomous driving cars and personal companions are some of the examples that benefit from Embodied AI nowadays. In this paper, we attempt to do a concise review of this field. We will go through its definitions, its characteristics, and its current achievements along with different algorithms, approaches, and solutions that are being used in different components of it (e.g. Vision, NLP, RL). We will then explore all the available simulators and 3D interactable databases that will make the research in this area feasible. Finally, we will address its challenges and identify its potentials for future research.
△ Less
Submitted 22 August, 2021;
originally announced August 2021.
-
DRDrV3: Complete Lesion Detection in Fundus Images Using Mask R-CNN, Transfer Learning, and LSTM
Authors:
Farzan Shenavarmasouleh,
Farid Ghareh Mohammadi,
M. Hadi Amini,
Thiab Taha,
Khaled Rasheed,
Hamid R. Arabnia
Abstract:
Medical Imaging is one of the growing fields in the world of computer vision. In this study, we aim to address the Diabetic Retinopathy (DR) problem as one of the open challenges in medical imaging. In this research, we propose a new lesion detection architecture, comprising of two sub-modules, which is an optimal solution to detect and find not only the type of lesions caused by DR, their corresp…
▽ More
Medical Imaging is one of the growing fields in the world of computer vision. In this study, we aim to address the Diabetic Retinopathy (DR) problem as one of the open challenges in medical imaging. In this research, we propose a new lesion detection architecture, comprising of two sub-modules, which is an optimal solution to detect and find not only the type of lesions caused by DR, their corresponding bounding boxes, and their masks; but also the severity level of the overall case. Aside from traditional accuracy, we also use two popular evaluation criteria to evaluate the outputs of our models, which are intersection over union (IOU) and mean average precision (mAP). We hypothesize that this new solution enables specialists to detect lesions with high confidence and estimate the severity of the damage with high accuracy.
△ Less
Submitted 18 August, 2021;
originally announced August 2021.
-
Real-time Grid and DER Co-simulation Platform for Validating Large-scale DER Control Schemes
Authors:
Adil Khurram,
Mahraz Amini,
Luis A. Duffaut Espinosa,
Paul D. H. Hines,
Mads Almassalkhi
Abstract:
Distributed energy resources (DERs) such as responsive loads and energy storage systems are valuable resources available to grid operators for balancing supply-demand mismatches via load coordination. However, consumer acceptance of load coordination schemes depends on ensuring quality of service (QoS), which embodies device-level constraints. Since each device has its own internal energy state, t…
▽ More
Distributed energy resources (DERs) such as responsive loads and energy storage systems are valuable resources available to grid operators for balancing supply-demand mismatches via load coordination. However, consumer acceptance of load coordination schemes depends on ensuring quality of service (QoS), which embodies device-level constraints. Since each device has its own internal energy state, the effect of QoS on the fleet can be cast as fleet-wide energy limits within which the aggregate "state of charge" (SoC) must be actively maintained. This requires coordination of DERs that is cognizant of the SoC, responsive to grid conditions, and depends on fast communication networks. To that effect, this paper presents a novel real-time grid-and-DER co-simulation platform for validating advanced DER coordination schemes and characterizing the capability of such a DER fleet. In particular, we present how the co-simulation platform is suitable for: i) testing real-time performance of a large fleet of DERs in delivering advanced grid services, including frequency regulation; ii) online state estimation to characterize the corresponding SoC of a large fleet of DERs; and iii) incorporating practical limitations of DERs and communications and analyzing the effects on fleet-wide performance. To illustrate these benefits of the presented grid-DER co-simulation platform, we employ the advanced DER coordination scheme called packetized energy management (PEM), which is a novel device-driven, asynchronous, and randomizing control paradigm for DERs. A fleet of thousands of PEM-enabled DERs are then added to a realistic and dynamical model of the Vermont transmission system to complete validation of the co-simulation platform.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
A Survey on Optimal Transport for Machine Learning: Theory and Applications
Authors:
Luis Caicedo Torres,
Luiz Manella Pereira,
M. Hadi Amini
Abstract:
Optimal Transport (OT) theory has seen an increasing amount of attention from the computer science community due to its potency and relevance in modeling and machine learning. It introduces means that serve as powerful ways to compare probability distributions with each other, as well as producing optimal map**s to minimize cost functions. In this survey, we present a brief introduction and hist…
▽ More
Optimal Transport (OT) theory has seen an increasing amount of attention from the computer science community due to its potency and relevance in modeling and machine learning. It introduces means that serve as powerful ways to compare probability distributions with each other, as well as producing optimal map**s to minimize cost functions. In this survey, we present a brief introduction and history, a survey of previous work and propose directions of future study. We will begin by looking at the history of optimal transport and introducing the founders of this field. We then give a brief glance into the algorithms related to OT. Then, we will follow up with a mathematical formulation and the prerequisites to understand OT. These include Kantorovich duality, entropic regularization, KL Divergence, and Wassertein barycenters. Since OT is a computationally expensive problem, we then introduce the entropy-regularized version of computing optimal map**s, which allowed OT problems to become applicable in a wide range of machine learning problems. In fact, the methods generated from OT theory are competitive with the current state-of-the-art methods. We follow this up by breaking down research papers that focus on image processing, graph learning, neural architecture search, document representation, and domain adaptation. We close the paper with a small section on future research. Of the recommendations presented, three main problems are fundamental to allow OT to become widely applicable but rely strongly on its mathematical formulation and thus are hardest to answer. Since OT is a novel method, there is plenty of space for new research, and with more and more competitive methods (either on an accuracy level or computational speed level) being created, the future of applied optimal transport is bright as it has become pervasive in machine learning.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
Self-Learning for Received Signal Strength Map Reconstruction with Neural Architecture Search
Authors:
Aleksandra Malkova,
Loic Pauletto,
Christophe Villien,
Benoit Denis,
Massih-Reza Amini
Abstract:
In this paper, we present a Neural Network (NN) model based on Neural Architecture Search (NAS) and self-learning for received signal strength (RSS) map reconstruction out of sparse single-snapshot input measurements, in the case where data-augmentation by side deterministic simulations cannot be performed. The approach first finds an optimal NN architecture and simultaneously train the deduced mo…
▽ More
In this paper, we present a Neural Network (NN) model based on Neural Architecture Search (NAS) and self-learning for received signal strength (RSS) map reconstruction out of sparse single-snapshot input measurements, in the case where data-augmentation by side deterministic simulations cannot be performed. The approach first finds an optimal NN architecture and simultaneously train the deduced model over some ground-truth measurements of a given (RSS) map. These ground-truth measurements along with the predictions of the model over a set of randomly chosen points are then used to train a second NN model having the same architecture. Experimental results show that signal predictions of this second model outperforms non-learning based interpolation state-of-the-art techniques and NN models with no architecture search on five large-scale maps of RSS measurements.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
Rings with finite n-Weak injective dimension and (n,k)-Weak cotorsion modules
Authors:
Mostafa Amini,
Houda Amzil,
Driss Bennis
Abstract:
Let R be a ring and n,k be two non-negative integers. As an extension of several known notions, we introduce and study (n,k)-weak cotorsion modules using the class of right R-modules with n-weak flat dimensions at most k. Various examples and applications are also given.
Let R be a ring and n,k be two non-negative integers. As an extension of several known notions, we introduce and study (n,k)-weak cotorsion modules using the class of right R-modules with n-weak flat dimensions at most k. Various examples and applications are also given.
△ Less
Submitted 13 June, 2021; v1 submitted 2 May, 2021;
originally announced May 2021.
-
Search Algorithms for Automated Hyper-Parameter Tuning
Authors:
Leila Zahedi,
Farid Ghareh Mohammadi,
Shabnam Rezapour,
Matthew W. Ohland,
M. Hadi Amini
Abstract:
Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of machine learning models depends on selecting the proper hyper-parameters. However, it is not an easy task because it requires time and expertise to tune the hype…
▽ More
Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of machine learning models depends on selecting the proper hyper-parameters. However, it is not an easy task because it requires time and expertise to tune the hyper-parameters to fit the machine learning model. In this paper, we examine the effectiveness of automated hyper-parameter tuning techniques to the realm of students' success. Therefore, we develop two automated Hyper-Parameter Optimization methods, namely grid search and random search, to assess and improve a previous study's performance. The experiment results show that applying random search and grid search on machine learning algorithms improves accuracy. We empirically show automated methods' superiority on real-world educational data (MIDFIELD) for tuning HPs of conventional machine learning classifiers. This work emphasizes the effectiveness of automated hyper-parameter optimization while applying machine learning in the education field to aid faculties, directors', or non-expert users' decisions to improve students' success.
△ Less
Submitted 29 April, 2021;
originally announced April 2021.
-
Artificial heavy fermions in a van der Waals heterostructure
Authors:
Viliam Vaňo,
Mohammad Amini,
Somesh Chandra Ganguli,
Guangze Chen,
Jose L. Lado,
Shawulienu Kezilebieke,
Peter Liljeroth
Abstract:
Heavy fermion systems represent one of the paradigmatic strongly correlated states of matter. They have been used as a platform for investigating exotic behavior ranging from quantum criticality and non-Fermi liquid behavior to unconventional topological superconductivity. Heavy fermions arise from the exchange interaction between localized magnetic moments and conduction electrons that leads to t…
▽ More
Heavy fermion systems represent one of the paradigmatic strongly correlated states of matter. They have been used as a platform for investigating exotic behavior ranging from quantum criticality and non-Fermi liquid behavior to unconventional topological superconductivity. Heavy fermions arise from the exchange interaction between localized magnetic moments and conduction electrons that leads to the well-known Kondo effect. In a Kondo lattice, the interaction between the localized moments gives rise to a band with heavy effective mass. This intriguing phenomenology has so far only been realized in compounds containing rare-earth elements with 4f or 5f electrons. Here, we realize a designer van der Waals heterostructure where artificial heavy fermions emerge from the Kondo coupling between a lattice of localized magnetic moments and itinerant electrons in a 1T/1H-TaS$_2$ heterostructure. We study the heterostructure using scanning tunneling microscopy (STM) and spectroscopy (STS) and show that depending on the stacking order of the monolayers, we can either reveal the localized magnetic moments and the associated Kondo effect, or the conduction electrons with a heavy-fermion hybridization gap. Our experiments realize an ultimately tuneable platform for future experiments probing enhanced many-body correlations, dimensional tuning of quantum criticality, and unconventional superconductivity in two-dimensional artificial heavy-fermion systems.
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
Category of n-weak injective and n-weak flat modules with respect to special super presented modules
Authors:
Mostafa Amini,
Houda Amzil,
Driss Bennis
Abstract:
Let $R$ be a ring and $n$, $k$ two non-negative integers. In this paper, we introduce the concepts of $n$-weak injective and $n$-weak flat modules and via the notion of special super finitely presented modules, we obtain some characterizations of these modules. We also investigate two classes of modules with richer contents, namely $\mathcal{WI}_k^n(R)$ and $\mathcal{WF}_k^n(R^{op})$ which are lar…
▽ More
Let $R$ be a ring and $n$, $k$ two non-negative integers. In this paper, we introduce the concepts of $n$-weak injective and $n$-weak flat modules and via the notion of special super finitely presented modules, we obtain some characterizations of these modules. We also investigate two classes of modules with richer contents, namely $\mathcal{WI}_k^n(R)$ and $\mathcal{WF}_k^n(R^{op})$ which are larger than that of modules with weak injective and weak flat dimensions less than or equal to $k$. Then on any arbitrary ring, we study the existence of $\mathcal{WI}_k^n(R)$ and $\mathcal{WF}_k^n(R^{op})$ covers and preenvelopes
△ Less
Submitted 6 June, 2021; v1 submitted 16 February, 2021;
originally announced February 2021.
-
Thermoelectric properties of armchair phosphorene nanoribbons in the presence of vacancy-induced impurity band
Authors:
Mohsen Rezaei,
Hossein Karbaschi,
M. Amini,
M. Soltani,
Gholamreza Rashedi
Abstract:
Armchair phosphorene nanoribbons (APNRs) are known to be semiconductors with an indirect bandgap. Here, we propose to introduce new states in the gap of APNRs by creating a periodic structure of vacancies (antidots). Based on the tight-binding model, we show that a periodic array of vacancies or nanopores leads to the formation of an impurity band inside the gap region. We first present an analyti…
▽ More
Armchair phosphorene nanoribbons (APNRs) are known to be semiconductors with an indirect bandgap. Here, we propose to introduce new states in the gap of APNRs by creating a periodic structure of vacancies (antidots). Based on the tight-binding model, we show that a periodic array of vacancies or nanopores leads to the formation of an impurity band inside the gap region. We first present an analytical expression for the dispersion relation of an impurity band induced by hybridization of bound states associated with each single vacancy defect. Then, we increase the size of vacancy defects to include a bunch of atoms and theoretically investigate the effect of nanopores size and their spacing on electronic band structure, carrier transmission function, and thermoelectric properties. Our analysis of the power generation rate and thermoelectric efficiency of these structures reveals that an ANPR can be used as a superb thermoelectric power generation module.
△ Less
Submitted 3 February, 2021;
originally announced February 2021.
-
Experimental Validation of Eco-Driving and Eco-Heating Strategies for Connected and Automated HEVs
Authors:
Mohammad Reza Amini,
Qiuhao Hu,
Hao Wang,
Yiheng Feng,
Ilya Kolmanovsky,
**g Sun
Abstract:
This paper presents experimental results that validate eco-driving and eco-heating strategies developed for connected and automated vehicles (CAVs). By exploiting vehicle-to-infrastructure (V2I) communications, traffic signal timing, and queue length estimations, optimized and smoothed speed profiles for the ego-vehicle are generated to reduce energy consumption. Next, the planned eco-trajectories…
▽ More
This paper presents experimental results that validate eco-driving and eco-heating strategies developed for connected and automated vehicles (CAVs). By exploiting vehicle-to-infrastructure (V2I) communications, traffic signal timing, and queue length estimations, optimized and smoothed speed profiles for the ego-vehicle are generated to reduce energy consumption. Next, the planned eco-trajectories are incorporated into a real-time predictive optimization framework that coordinates the cabin thermal load (in cold weather) with the speed preview, i.e., eco-heating. To enable eco-heating, the engine coolant (as the only heat source for cabin heating) and the cabin air are leveraged as two thermal energy storages. Our eco-heating strategy stores thermal energy in the engine coolant and cabin air while the vehicle is driving at high speeds, and releases the stored energy slowly during the vehicle stops for cabin heating without forcing the engine to idle to provide the heating source. To test and validate these solutions, a power-split hybrid electric vehicle (HEV) has been instrumented for cabin thermal management, allowing to regulate heating, ventilation, and air conditioning (HVAC) system inputs (cabin temperature setpoint and blower flow rate) in real-time. Experiments were conducted to demonstrate the energy-saving benefits of eco-driving and eco-heating strategies over real-world city driving cycles at different cold ambient temperatures. The data confirmed average fuel savings of 14.5% and 4.7% achieved by eco-driving and eco-heating, respectively, offering a combined energy saving of more than 19% when comparing to the baseline vehicle driven by a human driver with a constant-heating strategy.
△ Less
Submitted 2 February, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
FedAR: Activity and Resource-Aware Federated Learning Model for Distributed Mobile Robots
Authors:
Ahmed Imteaj,
M. Hadi Amini
Abstract:
Smartphones, autonomous vehicles, and the Internet-of-things (IoT) devices are considered the primary data source for a distributed network. Due to a revolutionary breakthrough in internet availability and continuous improvement of the IoT devices capabilities, it is desirable to store data locally and perform computation at the edge, as opposed to share all local information with a centralized co…
▽ More
Smartphones, autonomous vehicles, and the Internet-of-things (IoT) devices are considered the primary data source for a distributed network. Due to a revolutionary breakthrough in internet availability and continuous improvement of the IoT devices capabilities, it is desirable to store data locally and perform computation at the edge, as opposed to share all local information with a centralized computation agent. A recently proposed Machine Learning (ML) algorithm called Federated Learning (FL) paves the path towards preserving data privacy, performing distributed learning, and reducing communication overhead in large-scale machine learning (ML) problems. This paper proposes an FL model by monitoring client activities and leveraging available local computing resources, particularly for resource-constrained IoT devices (e.g., mobile robots), to accelerate the learning process. We assign a trust score to each FL client, which is updated based on the client's activities. We consider a distributed mobile robot as an FL client with resource limitations either in memory, bandwidth, processor, or battery life. We consider such mobile robots as FL clients to understand their resource-constrained behavior in a real-world setting. We consider an FL client to be untrustworthy if the client infuses incorrect models or repeatedly gives slow responses during the FL process. After disregarding the ineffective and unreliable client, we perform local training on the selected FL clients. To further reduce the straggler issue, we enable an asynchronous FL mechanism by performing aggregation on the FL server without waiting for a long period to receive a particular client's response.
△ Less
Submitted 11 January, 2021;
originally announced January 2021.
-
Treatment Targeting by AUUC Maximization with Generalization Guarantees
Authors:
Artem Betlei,
Eustache Diemert,
Massih-Reza Amini
Abstract:
We consider the task of optimizing treatment assignment based on individual treatment effect prediction. This task is found in many applications such as personalized medicine or targeted advertising and has gained a surge of interest in recent years under the name of Uplift Modeling. It consists in targeting treatment to the individuals for whom it would be the most beneficial. In real life scenar…
▽ More
We consider the task of optimizing treatment assignment based on individual treatment effect prediction. This task is found in many applications such as personalized medicine or targeted advertising and has gained a surge of interest in recent years under the name of Uplift Modeling. It consists in targeting treatment to the individuals for whom it would be the most beneficial. In real life scenarios, when we do not have access to ground-truth individual treatment effect, the capacity of models to do so is generally measured by the Area Under the Uplift Curve (AUUC), a metric that differs from the learning objectives of most of the Individual Treatment Effect (ITE) models. We argue that the learning of these models could inadvertently degrade AUUC and lead to suboptimal treatment assignment. To tackle this issue, we propose a generalization bound on the AUUC and present a novel learning algorithm that optimizes a derivable surrogate of this bound, called AUUC-max. Finally, we empirically demonstrate the tightness of this generalization bound, its effectiveness for hyper-parameter tuning and show the efficiency of the proposed algorithm compared to a wide range of competitive baselines on two classical benchmarks.
△ Less
Submitted 17 December, 2020;
originally announced December 2020.
-
Learning over no-Preferred and Preferred Sequence of items for Robust Recommendation
Authors:
Aleksandra Burashnikova,
Marianne Clausel,
Charlotte Laclau,
Frack Iutzeller,
Yury Maximov,
Massih-Reza Amini
Abstract:
In this paper, we propose a theoretically founded sequential strategy for training large-scale Recommender Systems (RS) over implicit feedback, mainly in the form of clicks. The proposed approach consists in minimizing pairwise ranking loss over blocks of consecutive items constituted by a sequence of non-clicked items followed by a clicked one for each user. We present two variants of this strate…
▽ More
In this paper, we propose a theoretically founded sequential strategy for training large-scale Recommender Systems (RS) over implicit feedback, mainly in the form of clicks. The proposed approach consists in minimizing pairwise ranking loss over blocks of consecutive items constituted by a sequence of non-clicked items followed by a clicked one for each user. We present two variants of this strategy where model parameters are updated using either the momentum method or a gradient-based approach. To prevent from updating the parameters for an abnormally high number of clicks over some targeted items (mainly due to bots), we introduce an upper and a lower threshold on the number of updates for each user. These thresholds are estimated over the distribution of the number of blocks in the training set. The thresholds affect the decision of RS and imply a shift over the distribution of items that are shown to the users. Furthermore, we provide a convergence analysis of both algorithms and demonstrate their practical efficiency over six large-scale collections, both regarding different ranking measures and computational time.
△ Less
Submitted 12 December, 2020;
originally announced December 2020.
-
Malware Detection using Artificial Bee Colony Algorithm
Authors:
Farid Ghareh Mohammadi,
Farzan Shenavarmasouleh,
M. Hadi Amini,
Hamid R. Arabnia
Abstract:
Malware detection has become a challenging task due to the increase in the number of malware families. Universal malware detection algorithms that can detect all the malware families are needed to make the whole process feasible. However, the more universal an algorithm is, the higher number of feature dimensions it needs to work with, and that inevitably causes the emerging problem of Curse of Di…
▽ More
Malware detection has become a challenging task due to the increase in the number of malware families. Universal malware detection algorithms that can detect all the malware families are needed to make the whole process feasible. However, the more universal an algorithm is, the higher number of feature dimensions it needs to work with, and that inevitably causes the emerging problem of Curse of Dimensionality (CoD). Besides, it is also difficult to make this solution work due to the real-time behavior of malware analysis. In this paper, we address this problem and aim to propose a feature selection based malware detection algorithm using an evolutionary algorithm that is referred to as Artificial Bee Colony (ABC). The proposed algorithm enables researchers to decrease the feature dimension and as a result, boost the process of malware detection. The experimental results reveal that the proposed method outperforms the state-of-the-art.
△ Less
Submitted 1 December, 2020;
originally announced December 2020.
-
DRDr II: Detecting the Severity Level of Diabetic Retinopathy Using Mask RCNN and Transfer Learning
Authors:
Farzan Shenavarmasouleh,
Farid Ghareh Mohammadi,
M. Hadi Amini,
Hamid R. Arabnia
Abstract:
DRDr II is a hybrid of machine learning and deep learning worlds. It builds on the successes of its antecedent, namely, DRDr, that was trained to detect, locate, and create segmentation masks for two types of lesions (exudates and microaneurysms) that can be found in the eyes of the Diabetic Retinopathy (DR) patients; and uses the entire model as a solid feature extractor in the core of its pipeli…
▽ More
DRDr II is a hybrid of machine learning and deep learning worlds. It builds on the successes of its antecedent, namely, DRDr, that was trained to detect, locate, and create segmentation masks for two types of lesions (exudates and microaneurysms) that can be found in the eyes of the Diabetic Retinopathy (DR) patients; and uses the entire model as a solid feature extractor in the core of its pipeline to detect the severity level of the DR cases. We employ a big dataset with over 35 thousand fundus images collected from around the globe and after 2 phases of preprocessing alongside feature extraction, we succeed in predicting the correct severity levels with over 92% accuracy.
△ Less
Submitted 30 November, 2020;
originally announced November 2020.
-
Double-Fano resonance in a two-level quantum system coupled to zigzag Phosphorene nanoribbon
Authors:
Mohsen Amini,
Morteza Soltani,
Samira Baninajarian,
Mohsen Rezaei
Abstract:
Double-level quantum systems are good candidates for revealing coherent quantum transport properties. Here, we consider quantum interference effects due to the formation of a two-level system (TLS) coupled to the edge channel of a zigzag Phosphorene nanoribbon (ZPNR). Using the tight-binding approach, we first demonstrate the formation of a TLS in bulk Phosphorene sheet due to the existence of two…
▽ More
Double-level quantum systems are good candidates for revealing coherent quantum transport properties. Here, we consider quantum interference effects due to the formation of a two-level system (TLS) coupled to the edge channel of a zigzag Phosphorene nanoribbon (ZPNR). Using the tight-binding approach, we first demonstrate the formation of a TLS in bulk Phosphorene sheet due to the existence of two nearby vacancy impurities. Then, we show that such a TLS can couple to the quasi-one-dimensional continuum of the edge states in a ZPNR which results in the the appearance of two-dip Fano-type line shapes. To this end, we generalize the Lippmann-Schwinger approach to study the scattering of edge electrons in a ZPNR by two coupled impurity defects. We obtain an analytical expression of the transmission coefficient which shows that the positions and widths of the anti-resonances can be controlled by changing the intervacancy distance as well as their distance from the edge of the ribbon. This work constitutes a clear example of the multiple Fano resonances in mesoscopic transport.
△ Less
Submitted 22 September, 2020;
originally announced September 2020.
-
A simulation study of semiparametric estimation in copula models based on minimum Alpha-Divergence
Authors:
Morteza Mohammadi,
Mohammad Amini,
Mahdi Emadi
Abstract:
The purpose of this paper is to introduce two semiparametric methods for the estimation of copula parameter. These methods are based on minimum Alpha-Divergence between a non-parametric estimation of copula density using local likelihood probit transformation method and a true copula density function. A Monte Carlo study is performed to measure the performance of these methods based on Hellinger d…
▽ More
The purpose of this paper is to introduce two semiparametric methods for the estimation of copula parameter. These methods are based on minimum Alpha-Divergence between a non-parametric estimation of copula density using local likelihood probit transformation method and a true copula density function. A Monte Carlo study is performed to measure the performance of these methods based on Hellinger distance and Neyman divergence as special cases of Alpha-Divergence. Simulation results are compared to the Maximum Pseudo-Likelihood (MPL) estimation as a conventional estimation method in well-known bivariate copula models. These results show that the proposed method based on Minimum Pseudo Hellinger Distance estimation has a good performance in small sample size and weak dependency situations. The parameter estimation methods are applied to a real data set in Hydrology.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
Data-driven Inferences of Agency-level Risk and Response Communication on COVID-19 through Social Media based Interactions
Authors:
Md Ashraf Ahmed,
Arif Mohaimin Sadri,
M. Hadi Amini
Abstract:
Risk and response communication of public agencies through social media played a significant role in the emergence and spread of novel Coronavirus (COVID-19) and such interactions were echoed in other information outlets. This study collected time-sensitive online social media data and analyzed such communication patterns from public health (WHO, CDC), emergency (FEMA), and transportation (FDOT) a…
▽ More
Risk and response communication of public agencies through social media played a significant role in the emergence and spread of novel Coronavirus (COVID-19) and such interactions were echoed in other information outlets. This study collected time-sensitive online social media data and analyzed such communication patterns from public health (WHO, CDC), emergency (FEMA), and transportation (FDOT) agencies using data-driven methods. The scope of the work includes a detailed understanding of how agencies communicate risk information through social media during a pandemic and influence community response (i.e. timing of lockdown, timing of reopening) and disease outbreak indicators (i.e. number of confirmed cases, number of deaths). The data includes Twitter interactions from different agencies (2.15K tweets per agency on average) and crowdsourced data (i.e. Worldometer) on COVID-19 cases and deaths were observed between February 21, 2020 and June 06, 2020. Several machine learning techniques such as (i.e. topic mining and sentiment ratings over time) are applied here to identify the dynamics of emergent topics during this unprecedented time. Temporal infographics of the results captured the agency-levels variations over time in circulating information about the importance of face covering, home quarantine, social distancing and contact tracing. In addition, agencies showed differences in their discussions about community transmission, lack of personal protective equipment, testing and medical supplies, use of tobacco, vaccine, mental health issues, hospitalization, hurricane season, airports, construction work among others. Findings could support more efficient transfer of risk and response information as communities shift to new normal as well as in future pandemics.
△ Less
Submitted 9 August, 2020;
originally announced August 2020.
-
Dynamic asymptotic dimension for actions of virtually cyclic groups
Authors:
Massoud Amini,
Kang Li,
Damian Sawicki,
Ali Shakibazadeh
Abstract:
We show that the dynamic asymptotic dimension of a minimal free action of an infinite virtually cyclic group on a compact Hausdorff space is always one. This extends a well-known result of Guentner, Willett, and Yu for minimal free actions of infinite cyclic groups. Furthermore, the minimality assumption can be replaced by the marker property, and we prove the marker property for all free actions…
▽ More
We show that the dynamic asymptotic dimension of a minimal free action of an infinite virtually cyclic group on a compact Hausdorff space is always one. This extends a well-known result of Guentner, Willett, and Yu for minimal free actions of infinite cyclic groups. Furthermore, the minimality assumption can be replaced by the marker property, and we prove the marker property for all free actions of countable groups on finite dimensional compact Hausdorff spaces, generalising a result of Szabo in the metrisable setting.
△ Less
Submitted 24 February, 2021; v1 submitted 2 July, 2020;
originally announced July 2020.
-
PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks
Authors:
Mojtaba Faramarzi,
Mohammad Amini,
Akilesh Badrinaaraayanan,
Vikas Verma,
Sarath Chandar
Abstract:
Large capacity deep learning models are often prone to a high generalization gap when trained with a limited amount of labeled training data. A recent class of methods to address this problem uses various ways to construct a new training sample by mixing a pair (or more) of training samples. We propose PatchUp, a hidden state block-level regularization technique for Convolutional Neural Networks (…
▽ More
Large capacity deep learning models are often prone to a high generalization gap when trained with a limited amount of labeled training data. A recent class of methods to address this problem uses various ways to construct a new training sample by mixing a pair (or more) of training samples. We propose PatchUp, a hidden state block-level regularization technique for Convolutional Neural Networks (CNNs), that is applied on selected contiguous blocks of feature maps from a random pair of samples. Our approach improves the robustness of CNN models against the manifold intrusion problem that may occur in other state-of-the-art mixing approaches. Moreover, since we are mixing the contiguous block of features in the hidden space, which has more dimensions than the input space, we obtain more diverse samples for training towards different dimensions. Our experiments on CIFAR10/100, SVHN, Tiny-ImageNet, and ImageNet using ResNet architectures including PreActResnet18/34, WRN-28-10, ResNet101/152 models show that PatchUp improves upon, or equals, the performance of current state-of-the-art regularizers for CNNs. We also show that PatchUp can provide a better generalization to deformed samples and is more robust against adversarial attacks.
△ Less
Submitted 7 January, 2023; v1 submitted 14 June, 2020;
originally announced June 2020.
-
Marshall-Olkin exponential shock model covering all range of dependence
Authors:
H. A. Mohtashami-Borzadaran,
M. Amini,
H. Jabbari,
A. Dolati
Abstract:
In this paper, we present a new Marshall-Olkin exponential shock model. The new construction method gives the proposed model further ability to allocate the common joint shock on each of the components, making it suitable for application in fields like reliability and credit risk. The given model has a singular part and supports both positive and negative dependence structure. Main dependence prop…
▽ More
In this paper, we present a new Marshall-Olkin exponential shock model. The new construction method gives the proposed model further ability to allocate the common joint shock on each of the components, making it suitable for application in fields like reliability and credit risk. The given model has a singular part and supports both positive and negative dependence structure. Main dependence properties of the model is given and an analysis of stress-strength is presented. After a performance analysis on the estimator of parameters, a real data is studied. Finally, we give the multivariate version of the proposed model and its main properties.
△ Less
Submitted 12 June, 2020; v1 submitted 23 April, 2020;
originally announced April 2020.
-
Engine and Aftertreatment Co-Optimization of Connected HEVs via Multi-Range Vehicle Speed Planning and Prediction
Authors:
Qiuhao Hu,
Mohammad Reza Amini,
Yiheng Feng,
Zhen Yang,
Hao Wang,
Ilya Kolmanovsky,
**g Sun,
Ashley Wiese,
Zeng Qiu,
Julia Buckland Seeds
Abstract:
Connected vehicles (CVs) have situational awareness that can be exploited for control and optimization of the powertrain system. While extensive studies have been carried out for energy efficiency improvement of CVs via eco-driving and planning, the implication of such technologies on the thermal responses of CVs has not been fully investigated. One of the key challenges in leveraging connectivity…
▽ More
Connected vehicles (CVs) have situational awareness that can be exploited for control and optimization of the powertrain system. While extensive studies have been carried out for energy efficiency improvement of CVs via eco-driving and planning, the implication of such technologies on the thermal responses of CVs has not been fully investigated. One of the key challenges in leveraging connectivity for optimization-based thermal management of CVs is the relatively slow thermal dynamics, which necessitate the use of a long prediction horizon to achieve the best performance. Long-term prediction of the CV speed, unlike the V2V/V2I-based short-range prediction, is difficult and error-prone. The multiple timescales inherent to power and thermal systems call for a variable timescale optimization framework with access to short- and long-term vehicle speed preview. To this end, a model predictive controller (MPC) with a multi-range speed preview for integrated power and thermal management (iPTM) of connected hybrid electric vehicles (HEVs) is presented in this paper. The MPC is formulated to manage the power-split between the engine and the battery while enforcing the power and thermal (engine coolant and catalytic converter temperatures) constraints. The MPC exploits prediction and optimization over a shorter receding horizon and longer shrinking horizon. Over the longer shrinking horizon, the vehicle speed estimation is based on the data collected from the connected vehicles traveling on the same route as the ego-vehicle. Simulation results of applying the MPC over real-world urban driving cycles in Ann Arbor, MI are presented to demonstrate the effectiveness and fuel-saving potentials of the proposed iPTM strategy under the uncertainty associated with long-term predictions of the CV's speed.
△ Less
Submitted 20 March, 2020;
originally announced March 2020.
-
Integrated Power and Thermal Management of Connected HEVs via Multi-Horizon MPC
Authors:
Qiuhao Hu,
Mohammad Reza Amini,
Hao Wang,
Ilya Kolmanovsky,
**g Sun
Abstract:
In this paper, a multi-horizon model predictive controller (MH-MPC) is developed for integrated power and thermal management (iPTM) of a power-split hybrid electric vehicle (HEV). The proposed MH-MPC leverages an accurate short-horizon vehicle speed preview and an approximate forecast over a longer shrinking horizon till the end of the driving cycle. This multiple-horizon scheme is developed to co…
▽ More
In this paper, a multi-horizon model predictive controller (MH-MPC) is developed for integrated power and thermal management (iPTM) of a power-split hybrid electric vehicle (HEV). The proposed MH-MPC leverages an accurate short-horizon vehicle speed preview and an approximate forecast over a longer shrinking horizon till the end of the driving cycle. This multiple-horizon scheme is developed to cope with fast and slow dynamics associated with power and thermal responses. The main objective of the proposed MH-MPC is to minimize fuel consumption and enforce the power and thermal constraints on the battery state-of-charge and engine coolant temperature, while meeting the driving (traction) and cabin air conditioning (heating) demands. The proposed MH-MPC allows for exploiting the engine coolant as thermal energy storage, providing more flexibility for the HEV energy flow optimization. The simulation results show that the proposed MH-MPC provides near-optimal results in reference to the Dynamic Programming (DP) solution with an affordable computational cost. Moreover, compared with a more conventional MPC strategy, the MH-MPC can leverage the speed previews with different resolutions effectively to achieve the desired performance with satisfactory robustness.
△ Less
Submitted 19 March, 2020;
originally announced March 2020.
-
On Parameter Tuning in Meta-learning for Computer Vision
Authors:
Farid Ghareh Mohammadi,
M. Hadi Amini,
Hamid R. Arabnia
Abstract:
Learning to learn plays a pivotal role in meta-learning (MTL) to obtain an optimal learning model. In this paper, we investigate mage recognition for unseen categories of a given dataset with limited training information. We deploy a zero-shot learning (ZSL) algorithm to achieve this goal. We also explore the effect of parameter tuning on performance of semantic auto-encoder (SAE). We further addr…
▽ More
Learning to learn plays a pivotal role in meta-learning (MTL) to obtain an optimal learning model. In this paper, we investigate mage recognition for unseen categories of a given dataset with limited training information. We deploy a zero-shot learning (ZSL) algorithm to achieve this goal. We also explore the effect of parameter tuning on performance of semantic auto-encoder (SAE). We further address the parameter tuning problem for meta-learning, especially focusing on zero-shot learning. By combining different embedded parameters, we improved the accuracy of tuned-SAE. Advantages and disadvantages of parameter tuning and its application in image classification are also explored.
△ Less
Submitted 11 February, 2020;
originally announced March 2020.
-
MLIR: A Compiler Infrastructure for the End of Moore's Law
Authors:
Chris Lattner,
Mehdi Amini,
Uday Bondhugula,
Albert Cohen,
Andy Davis,
Jacques Pienaar,
River Riddle,
Tatiana Shpeisman,
Nicolas Vasilache,
Oleksandr Zinenko
Abstract:
This work presents MLIR, a novel approach to building reusable and extensible compiler infrastructure. MLIR aims to address software fragmentation, improve compilation for heterogeneous hardware, significantly reduce the cost of building domain specific compilers, and aid in connecting existing compilers together. MLIR facilitates the design and implementation of code generators, translators and o…
▽ More
This work presents MLIR, a novel approach to building reusable and extensible compiler infrastructure. MLIR aims to address software fragmentation, improve compilation for heterogeneous hardware, significantly reduce the cost of building domain specific compilers, and aid in connecting existing compilers together. MLIR facilitates the design and implementation of code generators, translators and optimizers at different levels of abstraction and also across application domains, hardware targets and execution environments. The contribution of this work includes (1) discussion of MLIR as a research artifact, built for extension and evolution, and identifying the challenges and opportunities posed by this novel design point in design, semantics, optimization specification, system, and engineering. (2) evaluation of MLIR as a generalized infrastructure that reduces the cost of building compilers-describing diverse use-cases to show research and educational opportunities for future programming languages, compilers, execution environments, and computer architecture. The paper also presents the rationale for MLIR, its original design principles, structures and semantics.
△ Less
Submitted 29 February, 2020; v1 submitted 25 February, 2020;
originally announced February 2020.
-
Federated Learning for Resource-Constrained IoT Devices: Panoramas and State-of-the-art
Authors:
Ahmed Imteaj,
Urmish Thakker,
Shiqiang Wang,
Jian Li,
M. Hadi Amini
Abstract:
Nowadays, devices are equipped with advanced sensors with higher processing/computing capabilities. Further, widespread Internet availability enables communication among sensing devices. As a result, vast amounts of data are generated on edge devices to drive Internet-of-Things (IoT), crowdsourcing, and other emerging technologies. The collected extensive data can be pre-processed, scaled, classif…
▽ More
Nowadays, devices are equipped with advanced sensors with higher processing/computing capabilities. Further, widespread Internet availability enables communication among sensing devices. As a result, vast amounts of data are generated on edge devices to drive Internet-of-Things (IoT), crowdsourcing, and other emerging technologies. The collected extensive data can be pre-processed, scaled, classified, and finally, used for predicting future events using machine learning (ML) methods. In traditional ML approaches, data is sent to and processed in a central server, which encounters communication overhead, processing delay, privacy leakage, and security issues. To overcome these challenges, each client can be trained locally based on its available data and by learning from the global model. This decentralized learning structure is referred to as Federated Learning (FL). However, in large-scale networks, there may be clients with varying computational resource capabilities. This may lead to implementation and scalability challenges for FL techniques. In this paper, we first introduce some recently implemented real-life applications of FL. We then emphasize on the core challenges of implementing the FL algorithms from the perspective of resource limitations (e.g., memory, bandwidth, and energy budget) of client clients. We finally discuss open issues associated with FL and highlight future directions in the FL area concerning resource-constrained devices.
△ Less
Submitted 24 February, 2020;
originally announced February 2020.
-
Efficient sideband cooling protocol for long trapped-ion chains
Authors:
J. -S. Chen,
K. Wright,
N. C. Pisenti,
D. Murphy,
K. M. Beck,
K. Landsman,
J. M. Amini,
Y. Nam
Abstract:
Trapped ions are a promising candidate for large scale quantum computation. Several systems have been built in both academic and industrial settings to implement modestly-sized quantum algorithms. Efficient cooling of the motional degrees of freedom is a key requirement for high-fidelity quantum operations using trapped ions. Here, we present a technique whereby individual ions are used to cool in…
▽ More
Trapped ions are a promising candidate for large scale quantum computation. Several systems have been built in both academic and industrial settings to implement modestly-sized quantum algorithms. Efficient cooling of the motional degrees of freedom is a key requirement for high-fidelity quantum operations using trapped ions. Here, we present a technique whereby individual ions are used to cool individual motional modes in parallel, reducing the time required to bring an ion chain to its motional ground state. We demonstrate this technique experimentally and develop a model to understand the efficiency of our parallel sideband cooling technique compared to more traditional methods. This technique is applicable to any system using resolved sideband cooling of co-trapped atomic species and only requires individual addressing of the trapped particles.
△ Less
Submitted 10 February, 2020;
originally announced February 2020.