Search | arXiv e-print repository

arXiv:2403.13611 [pdf, other]

Densify & Conquer: Densified, smaller base-stations can conquer the increasing carbon footprint problem in nextG wireless

Authors: Agrim Gupta, Adel Heidari, Jiaming **, Dinesh Bharadia

Abstract: Connectivity on-the-go has been one of the most impressive technological achievements in the 2010s decade. However, multiple studies show that this has come at an expense of increased carbon footprint, that also rivals the entire aviation sector's carbon footprint. The two major contributors of this increased footprint are (a) smartphone batteries which affect the embodied footprint and (b) base-s… ▽ More Connectivity on-the-go has been one of the most impressive technological achievements in the 2010s decade. However, multiple studies show that this has come at an expense of increased carbon footprint, that also rivals the entire aviation sector's carbon footprint. The two major contributors of this increased footprint are (a) smartphone batteries which affect the embodied footprint and (b) base-stations that occupy ever-increasing energy footprint to provide the last mile wireless connectivity to smartphones. The root-cause of both these turn out to be the same, which is communicating over the last-mile lossy wireless medium. We show in this paper, titled DensQuer, how base-station densification, which is to replace a single larger base-station with multiple smaller ones, reduces the effect of the last-mile wireless, and in effect conquers both these adverse sources of increased carbon footprint. Backed by a open-source ray-tracing computation framework (Sionna), we show how a strategic densification strategy can minimize the number of required smaller base-stations to practically achievable numbers, which lead to about 3x power-savings in the base-station network. Also, DensQuer is able to also reduce the required deployment height of base-stations to as low as 15m, that makes the smaller cells easily deployable on trees/street poles instead of requiring a dedicated tower. Further, by utilizing newly introduced hardware power rails in Google Pixel 7a and above phones, we also show that this strategic densified network leads to reduction in mobile transmit power by 10-15 dB, leading to about 3x reduction in total cellular power consumption, and about 50% increase in smartphone battery life when it communicates data via the cellular network. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: 12 pages, 14 figures

arXiv:2401.06649 [pdf, other]

Data-Efficient Interactive Multi-Objective Optimization Using ParEGO

Authors: Arash Heidari, Sebastian Rojas Gonzalez, Tom Dhaene, Ivo Couckuyt

Abstract: Multi-objective optimization is a widely studied problem in diverse fields, such as engineering and finance, that seeks to identify a set of non-dominated solutions that provide optimal trade-offs among competing objectives. However, the computation of the entire Pareto front can become prohibitively expensive, both in terms of computational resources and time, particularly when dealing with a lar… ▽ More Multi-objective optimization is a widely studied problem in diverse fields, such as engineering and finance, that seeks to identify a set of non-dominated solutions that provide optimal trade-offs among competing objectives. However, the computation of the entire Pareto front can become prohibitively expensive, both in terms of computational resources and time, particularly when dealing with a large number of objectives. In practical applications, decision-makers (DMs) will select a single solution of the Pareto front that aligns with their preferences to be implemented; thus, traditional multi-objective algorithms invest a lot of budget sampling solutions that are not interesting for the DM. In this paper, we propose two novel algorithms that employ Gaussian Processes and advanced discretization methods to efficiently locate the most preferred region of the Pareto front in expensive-to-evaluate problems. Our approach involves interacting with the decision-maker to guide the optimization process towards their preferred trade-offs. Our experimental results demonstrate that our proposed algorithms are effective in finding non-dominated solutions that align with the decision-maker's preferences while maintaining computational efficiency. △ Less

Submitted 12 January, 2024; originally announced January 2024.

Comments: This paper has been accepted at ECML PKDD 2023 workshop: Neuro-Explicit AI and Expert-informed Machine Learning for Engineering and Physical Sciences

arXiv:2310.00027 [pdf, ps, other]

Out-Of-Domain Unlabeled Data Improves Generalization

Authors: Amir Hossein Saberi, Amir Najafi, Alireza Heidari, Mohammad Hosein Movasaghinia, Abolfazl Motahari, Babak H. Khalaj

Abstract: We propose a novel framework for incorporating unlabeled data into semi-supervised classification problems, where scenarios involving the minimization of either i) adversarially robust or ii) non-robust loss functions have been considered. Notably, we allow the unlabeled samples to deviate slightly (in total variation sense) from the in-domain distribution. The core idea behind our framework is to… ▽ More We propose a novel framework for incorporating unlabeled data into semi-supervised classification problems, where scenarios involving the minimization of either i) adversarially robust or ii) non-robust loss functions have been considered. Notably, we allow the unlabeled samples to deviate slightly (in total variation sense) from the in-domain distribution. The core idea behind our framework is to combine Distributionally Robust Optimization (DRO) with self-supervised training. As a result, we also leverage efficient polynomial-time algorithms for the training stage. From a theoretical standpoint, we apply our framework on the classification problem of a mixture of two Gaussians in $\mathbb{R}^d$, where in addition to the $m$ independent and labeled samples from the true distribution, a set of $n$ (usually with $n\gg m$) out of domain and unlabeled samples are given as well. Using only the labeled data, it is known that the generalization error can be bounded by $\propto\left(d/m\right)^{1/2}$. However, using our method on both isotropic and non-isotropic Gaussian mixture models, one can derive a new set of analytically explicit and non-asymptotic bounds which show substantial improvement on the generalization error compared to ERM. Our results underscore two significant insights: 1) out-of-domain samples, even when unlabeled, can be harnessed to narrow the generalization gap, provided that the true data distribution adheres to a form of the ``cluster assumption", and 2) the semi-supervised learning paradigm can be regarded as a special case of our framework when there are no distributional shifts. We validate our claims through experiments conducted on a variety of synthetic and real-world datasets. △ Less

Submitted 15 February, 2024; v1 submitted 28 September, 2023; originally announced October 2023.

Comments: Published at ICLR 2024 (Spotlight), 29 pages, no figures

arXiv:2208.11965 [pdf, ps, other]

Parameter estimation of discretely observed interacting particle systems

Authors: Chiara Amorino, Akram Heidari, Vytautė Pilipauskaitė, Mark Podolskij

Abstract: In this paper, we consider the problem of joint parameter estimation for drift and diffusion coefficients of a stochastic McKean-Vlasov equation and for the associated system of interacting particles. The analysis is provided in a general framework, as both coefficients depend on the solution of the process and on the law of the solution itself. Starting from discrete observations of the interacti… ▽ More In this paper, we consider the problem of joint parameter estimation for drift and diffusion coefficients of a stochastic McKean-Vlasov equation and for the associated system of interacting particles. The analysis is provided in a general framework, as both coefficients depend on the solution of the process and on the law of the solution itself. Starting from discrete observations of the interacting particle system over a fixed interval $[0, T]$, we propose a contrast function based on a pseudo likelihood approach. We show that the associated estimator is consistent when the discretization step ($Δ_n$) and the number of particles ($N$) satisfy $Δ_n \rightarrow 0$ and $N \rightarrow \infty$, and asymptotically normal when additionally the condition $Δ_n N \rightarrow 0$ holds. △ Less

Submitted 23 June, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

arXiv:2008.10549 [pdf, other]

On sampling from data with duplicate records

Authors: Alireza Heidari, Shrinu Kushagra, Ihab F. Ilyas

Abstract: Data deduplication is the task of detecting records in a database that correspond to the same real-world entity. Our goal is to develop a procedure that samples uniformly from the set of entities present in the database in the presence of duplicates. We accomplish this by a two-stage process. In the first step, we estimate the frequencies of all the entities in the database. In the second step, we… ▽ More Data deduplication is the task of detecting records in a database that correspond to the same real-world entity. Our goal is to develop a procedure that samples uniformly from the set of entities present in the database in the presence of duplicates. We accomplish this by a two-stage process. In the first step, we estimate the frequencies of all the entities in the database. In the second step, we use rejection sampling to obtain a (approximately) uniform sample from the set of entities. However, efficiently estimating the frequency of all the entities is a non-trivial task and not attainable in the general case. Hence, we consider various natural properties of the data under which such frequency estimation (and consequently uniform sampling) is possible. Under each of those assumptions, we provide sampling algorithms and give proofs of the complexity (both statistical and computational) of our approach. We complement our study by conducting extensive experiments on both real and synthetic datasets. △ Less

Submitted 24 August, 2020; originally announced August 2020.

Comments: 21 pages, 5 figures

arXiv:2006.10208 [pdf, other]

Record fusion: A learning approach

Authors: Alireza Heidari, George Michalopoulos, Shrinu Kushagra, Ihab F. Ilyas, Theodoros Rekatsinas

Abstract: Record fusion is the task of aggregating multiple records that correspond to the same real-world entity in a database. We can view record fusion as a machine learning problem where the goal is to predict the "correct" value for each attribute for each entity. Given a database, we use a combination of attribute-level, recordlevel, and database-level signals to construct a feature vector for each ce… ▽ More Record fusion is the task of aggregating multiple records that correspond to the same real-world entity in a database. We can view record fusion as a machine learning problem where the goal is to predict the "correct" value for each attribute for each entity. Given a database, we use a combination of attribute-level, recordlevel, and database-level signals to construct a feature vector for each cell (or (row, col)) of that database. We use this feature vector alongwith the ground-truth information to learn a classifier for each of the attributes of the database. Our learning algorithm uses a novel stagewise additive model. At each stage, we construct a new feature vector by combining a part of the original feature vector with features computed by the predictions from the previous stage. We then learn a softmax classifier over the new feature space. This greedy stagewise approach can be viewed as a deep model where at each stage, we are adding more complicated non-linear transformations of the original feature vector. We show that our approach fuses records with an average precision of ~98% when source information of records is available, and ~94% without source information across a diverse array of real-world datasets. We compare our approach to a comprehensive collection of data fusion and entity consolidation methods considered in the literature. We show that our approach can achieve an average precision improvement of ~20%/~45% with/without source information respectively. △ Less

Submitted 17 June, 2020; originally announced June 2020.

Comments: 18 pages, 9 figures

arXiv:2005.08540 [pdf, ps, other]

Approximate Denial Constraints

Authors: Ester Livshits, Alireza Heidari, Ihab F. Ilyas, Benny Kimelfeld

Abstract: The problem of mining integrity constraints from data has been extensively studied over the past two decades for commonly used types of constraints including the classic Functional Dependencies (FDs) and the more general Denial Constraints (DCs). In this paper, we investigate the problem of mining approximate DCs (i.e., DCs that are "almost" satisfied) from data. Considering approximate constraint… ▽ More The problem of mining integrity constraints from data has been extensively studied over the past two decades for commonly used types of constraints including the classic Functional Dependencies (FDs) and the more general Denial Constraints (DCs). In this paper, we investigate the problem of mining approximate DCs (i.e., DCs that are "almost" satisfied) from data. Considering approximate constraints allows us to discover more accurate constraints in inconsistent databases, detect rules that are generally correct but may have a few exceptions, as well as avoid overfitting and obtain more general and less contrived constraints. We introduce the algorithm ADCMiner for mining approximate DCs. An important feature of this algorithm is that it does not assume any specific definition of an approximate DC, but takes the semantics as input. Since there is more than one way to define an approximate DC and different definitions may produce very different results, we do not focus on one definition, but rather on a general family of approximation functions that satisfies some natural axioms defined in this paper and captures commonly used definitions of approximate constraints. We also show how our algorithm can be combined with sampling to return results with high accuracy while significantly reducing the running time. △ Less

Submitted 18 May, 2020; originally announced May 2020.

arXiv:1907.00141 [pdf, other]

Approximate Inference in Structured Instances with Noisy Categorical Observations

Authors: Alireza Heidari, Ihab F. Ilyas, Theodoros Rekatsinas

Abstract: We study the problem of recovering the latent ground truth labeling of a structured instance with categorical random variables in the presence of noisy observations. We present a new approximate algorithm for graphs with categorical variables that achieves low Hamming error in the presence of noisy vertex and edge observations. Our main result shows a logarithmic dependency of the Hamming error to… ▽ More We study the problem of recovering the latent ground truth labeling of a structured instance with categorical random variables in the presence of noisy observations. We present a new approximate algorithm for graphs with categorical variables that achieves low Hamming error in the presence of noisy vertex and edge observations. Our main result shows a logarithmic dependency of the Hamming error to the number of categories of the random variables. Our approach draws connections to correlation clustering with a fixed number of clusters. Our results generalize the works of Globerson et al. (2015) and Foster et al. (2018), who study the hardness of structured prediction under binary labels, to the case of categorical labels. △ Less

Submitted 5 July, 2019; v1 submitted 29 June, 2019; originally announced July 2019.

Comments: UAI 2019, 33 pages

arXiv:1904.02285 [pdf, other]

doi 10.1145/3299869.3319888

HoloDetect: Few-Shot Learning for Error Detection

Authors: Alireza Heidari, Joshua McGrath, Ihab F. Ilyas, Theodoros Rekatsinas

Abstract: We introduce a few-shot learning framework for error detection. We show that data augmentation (a form of weak supervision) is key to training high-quality, ML-based error detection models that require minimal human involvement. Our framework consists of two parts: (1) an expressive model to learn rich representations that capture the inherent syntactic and semantic heterogeneity of errors; and (2… ▽ More We introduce a few-shot learning framework for error detection. We show that data augmentation (a form of weak supervision) is key to training high-quality, ML-based error detection models that require minimal human involvement. Our framework consists of two parts: (1) an expressive model to learn rich representations that capture the inherent syntactic and semantic heterogeneity of errors; and (2) a data augmentation model that, given a small seed of clean records, uses dataset-specific transformations to automatically generate additional training data. Our key insight is to learn data augmentation policies from the noisy input dataset in a weakly supervised manner. We show that our framework detects errors with an average precision of ~94% and an average recall of ~93% across a diverse array of datasets that exhibit different types and amounts of errors. We compare our approach to a comprehensive collection of error detection methods, ranging from traditional rule-based methods to ensemble-based and active learning approaches. We show that data augmentation yields an average improvement of 20 F1 points while it requires access to 3x fewer labeled examples compared to other ML approaches. △ Less

Submitted 3 April, 2019; originally announced April 2019.

Comments: 18 pages,

Journal ref: ACM SIGMOD 2019

arXiv:1901.07207 [pdf, ps, other]

Johnson graphs are panconnected

Authors: S. Morteza Mirafzal, A. Heidari

Abstract: For any given $n,m \in \mathbb{N}$ with $ m < n $, the Johnson graph $J(n,m)$ is defined as the graph whose vertex set is $V=\{v\mid v\subseteq [n]=\{1,...,n\}, |v|=m\}$, where two vertices $v$,$w$ are adjacent if and only if $|v\cap w|=m-1$. A graph $G$ of order $n > 2$ is panconnected if for every two vertices $u$ and $v$, there is a $u$-$v$ path of length $l$ for every integer $l$ with… ▽ More For any given $n,m \in \mathbb{N}$ with $ m < n $, the Johnson graph $J(n,m)$ is defined as the graph whose vertex set is $V=\{v\mid v\subseteq [n]=\{1,...,n\}, |v|=m\}$, where two vertices $v$,$w$ are adjacent if and only if $|v\cap w|=m-1$. A graph $G$ of order $n > 2$ is panconnected if for every two vertices $u$ and $v$, there is a $u$-$v$ path of length $l$ for every integer $l$ with $d(u,v) \leq l \leq n-1$. In this paper, we prove that the Johnson graph $J(n,m)$ is a panconnected graph. △ Less

Submitted 18 August, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

Comments: 6 pages, 1 figures

arXiv:1606.00879 [pdf]

ICTs Effect on Parents Feelings of Presence, Awareness, and Connectedness during a Childs Hospitalization

Authors: Abbas Heidari, Yahya Kazemzadeh, Greg Wadley

Abstract: This study evaluates how off-the-shelf commercial ICTs can contribute to creating a feeling of Presence, Connectedness, and Awareness between parents and their hospitalized child. Thematic analysis and descriptive statistics are used to analyse qualitative and quantitative data collected through a survey of thirty eight parents whose children were admitted to the Royal Childrens Hospital. Through… ▽ More This study evaluates how off-the-shelf commercial ICTs can contribute to creating a feeling of Presence, Connectedness, and Awareness between parents and their hospitalized child. Thematic analysis and descriptive statistics are used to analyse qualitative and quantitative data collected through a survey of thirty eight parents whose children were admitted to the Royal Childrens Hospital. Through analysis of data, Presence is found to be less facilitated through ICT than are Awareness and Connectedness. Although participants reported that voice call on mobile phones was the most common way of communication, their ideal was a video-chat application such as Skype, or a combination of Skype and TV to facilitate feeling of Presence. A strong desire to use rich media such as video-audio to help them have a greater feeling of Presence was identified in parents responses to the questionnaire. △ Less

Submitted 25 September, 2016; v1 submitted 28 May, 2016; originally announced June 2016.

Comments: ISBN# 978-0-646-95337-3 Presented at the Australasian Conference on Information Systems 2015 (arXiv:1605.01032)

Report number: ACIS/2015/76

arXiv:0905.1504

Signal Recovery in Pulsed Terahertz Integrated Circuits

Authors: Abdorreza Heidari, Mohammad Neshat, Daryoosh Saeedkia, Safieddin Safavi-Naeini

Abstract: In this article, a time-domain calibration procedure is proposed for pulsed Terahertz Integrated Circuits (TIC) used in on-chip applications, where the conventional calibration methods are not applicable. The proposed post-detection method removes the unwanted linear distortions, such as interfering echoes and frequency dispersion, by using only one single-port measurement. The method employs a… ▽ More In this article, a time-domain calibration procedure is proposed for pulsed Terahertz Integrated Circuits (TIC) used in on-chip applications, where the conventional calibration methods are not applicable. The proposed post-detection method removes the unwanted linear distortions, such as interfering echoes and frequency dispersion, by using only one single-port measurement. The method employs a wave-transfer model for analysis of the TIC, and the model parameters are obtained by a proposed blind estimation algorithm. A complete implementation of the method is demonstrated for a fabricated TIC, when used in an on-chip sensing application. The features of interest in the measured signal, such as absorption lines, can be masked or weakened by the distortion of the THz signal happening in a TIC. The proposed signal recovery approach improves the detection of those otherwise hidden features, and can significantly enhance the performance of existing TICs. To show the effectiveness of the proposed de-embedding method, numerical results are presented for simulated and measured signals. The method presented in this article is enabling for accurate TIC applications, and can be utilized to optimally design novel TIC structures for specific purposes. △ Less

Submitted 5 April, 2010; v1 submitted 10 May, 2009; originally announced May 2009.

Comments: This paper has been withdrawn by the authors.

Showing 1–12 of 12 results for author: Heidari, A