SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing

Authors: Nazmul Karim, Umar Khalid, Mohsen Joneidi, Chen Chen, Nazanin Rahnavard

Abstract: Text-to-Image (T2I) diffusion models have achieved remarkable success in synthesizing high-quality images conditioned on text prompts. Recent methods have tried to replicate the success by either training text-to-video (T2V) models on a very large number of text-video pairs or adapting T2I models on text-video pairs independently. Although the latter is computationally less expensive, it still takes a significant amount of time for per-video adaptation. To address this issue, we propose SAVE, a novel spectral-shift-aware adaptation framework, in which we fine-tune the spectral shift of the parameter space instead of the parameters themselves. Specifically, we take the spectral decomposition of the pre-trained T2I weights and only update the singular values while freezing the corresponding singular vectors. In addition, we introduce a spectral shift regularizer aimed at placing tighter constraints on larger singular values compared to smaller ones. This form of regularization enables the model to grasp finer details within the video that align with the provided textual descriptions. We also offer theoretical justification for our proposed regularization technique. Since we are only dealing with spectral shifts, the proposed method reduces the adaptation time significantly (approx. 10 times) and has fewer resource constraints for training. Such attributes position SAVE to be more suitable for real-world applications, e.g. editing undesirable content during video streaming. We validate the effectiveness of SAVE with an extensive experimental evaluation under different settings, e.g. style transfer, object replacement, privacy preservation, etc.

Submitted 1 December, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

Comments: 11 pages, 10 figures
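
Illustrative sketch (not taken from the paper): the SAVE abstract above describes decomposing a pre-trained weight matrix and updating only its singular values while the singular vectors stay frozen. The NumPy snippet below shows that idea on a toy matrix. The function names, the toy sizes, and in particular the form of `shift_penalty` are assumptions made for illustration; the abstract only says the regularizer constrains larger singular values more tightly and does not give its formula.

```python
import numpy as np


def spectral_shift_adapt(weight, delta):
    """Return U @ diag(s + delta) @ Vt, where weight = U @ diag(s) @ Vt.

    U and Vt (the singular vectors) are left untouched; only the
    singular values are shifted by `delta`.
    """
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    return u @ np.diag(s + delta) @ vt


def shift_penalty(singular_values, delta):
    """Hypothetical regularizer: squared shifts weighted by singular value.

    This weighting is an assumption chosen so that shifting a large
    singular value costs more than shifting a small one by the same amount.
    """
    return float(np.sum(singular_values * delta ** 2))


rng = np.random.default_rng(0)
weight = rng.standard_normal((4, 3))      # stand-in for a pre-trained weight
delta = np.array([0.0, 0.1, -0.05])       # one trainable shift per singular value
adapted = spectral_shift_adapt(weight, delta)

# 3 trainable values instead of 12 full weight entries.
print(delta.size, weight.size)
```

In this toy setting the number of trainable values equals the number of singular values rather than the number of weight entries, which is consistent with the reduced adaptation cost the abstract reports.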

arXiv:2210.07932 [pdf, other]

Neural Routing in Meta Learning

Authors: Jicang Cai, Saeed Vahidian, Weijia Wang, Mohsen Joneidi, Bill Lin

Abstract: Meta-learning, often referred to as learning-to-learn, is a promising notion raised to mimic human learning by exploiting the knowledge of prior tasks but being able to adapt quickly to novel tasks. A plethora of models has emerged in this context and improved the learning efficiency, robustness, etc. The question that arises here is: can we emulate other aspects of human learning and incorporate them into the existing meta learning algorithms? Inspired by the widely recognized finding in neuroscience that distinct parts of the brain are highly specialized for different types of tasks, we aim to improve the model performance of the current meta learning algorithms by selectively using only parts of the model conditioned on the input tasks. In this work, we describe an approach that investigates task-dependent dynamic neuron selection in deep convolutional neural networks (CNNs) by leveraging the scaling factor in the batch normalization (BN) layer associated with each convolutional layer. The problem is intriguing because the idea of helping different parts of the model to learn from different types of tasks may help us train better filters in CNNs, and improve the model generalization performance. We find that the proposed approach, neural routing in meta learning (NRML), outperforms one of the well-known existing meta learning baselines on few-shot classification tasks on the most widely used benchmark datasets.

Submitted 14 October, 2022; originally announced October 2022.

arXiv:2106.06983 [pdf, other]

Two-way Spectrum Pursuit for CUR Decomposition and Its Application in Joint Column/Row Subset Selection

Authors: Ashkan Esmaeili, Mohsen Joneidi, Mehrdad Salimitari, Umar Khalid, Nazanin Rahnavard

Abstract: The problem of simultaneous column and row subset selection is addressed in this paper. The column space and row space of a matrix are spanned by its left and right singular vectors, respectively. However, the singular vectors are not within actual columns/rows of the matrix. In this paper, an iterative approach is proposed to capture the most structural information of columns/rows via selecting a subset of actual columns/rows. This algorithm is referred to as two-way spectrum pursuit (TWSP) which provides us with an accurate solution for the CUR matrix decomposition. TWSP is applicable in a wide range of applications since it enjoys a linear complexity w.r.t. number of original columns/rows. We demonstrated the application of TWSP for joint channel and sensor selection in cognitive radio networks, informative users and contents detection, and efficient supervised data reduction.

Submitted 13 June, 2021; originally announced June 2021.
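
Illustrative sketch (not taken from the paper): the TWSP abstract above concerns choosing actual columns and rows for a CUR decomposition. The snippet below does not implement the TWSP selection procedure; it only shows, under the assumption that the column and row indices are already given, how a CUR approximation can be formed with the standard pseudo-inverse choice U = pinv(C) A pinv(R). The function name and the toy matrices are hypothetical.

```python
import numpy as np


def cur_reconstruct(matrix, col_idx, row_idx):
    """Build C @ U @ R from chosen columns and rows of `matrix`.

    C holds the selected columns, R the selected rows, and
    U = pinv(C) @ matrix @ pinv(R) links the two.
    """
    c = matrix[:, col_idx]
    r = matrix[row_idx, :]
    u = np.linalg.pinv(c) @ matrix @ np.linalg.pinv(r)
    return c @ u @ r


# Toy rank-2 matrix: two independent columns and rows carry all of it.
left = np.array([[1, 0], [0, 1], [1, 1], [2, -1], [0, 3], [1, 2]], dtype=float)
right = np.array([[1, 0, 2, 1, 0], [0, 1, 1, -1, 3]], dtype=float)
low_rank = left @ right

approx = cur_reconstruct(low_rank, col_idx=[0, 1], row_idx=[0, 1])
print(np.max(np.abs(approx - low_rank)))  # reconstruction error
```

When the selected columns span the column space and the selected rows span the row space, as in this toy case, the reconstruction matches the original matrix up to rounding; the quality of a CUR approximation on real data depends on which columns and rows are selected.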

arXiv:1906.08177 [pdf, other]

AI-enabled Blockchain: An Outlier-aware Consensus Protocol for Blockchain-based IoT Networks

Authors: Mehrdad Salimitari, Mohsen Joneidi, Mainak Chatterjee

Abstract: A new framework for a secure and robust consensus in blockchain-based IoT networks is proposed using machine learning. Hyperledger fabric, which is a blockchain platform developed as part of the Hyperledger project, though it looks very apt for IoT applications, has comparatively low tolerance for malicious activities in an untrustworthy environment. To that end, we propose AI-enabled blockchain (AIBC) with a 2-step consensus protocol that uses an outlier detection algorithm for consensus in an IoT network implemented on hyperledger fabric platform. The outlier-aware consensus protocol exploits a supervised machine learning algorithm which detects anomalous activities via a learned detector in the first step. Then, the data goes through the inherent Practical Byzantine Fault Tolerance (PBFT) consensus protocol in the hyperledger fabric for ledger update. We measure and report the performance of our framework with respect to the various delay components. Results reveal that our implemented AIBC network (2-step consensus protocol) improves hyperledger fabric performance in terms of fault tolerance by marginally compromising the delay performance.

Submitted 9 August, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

Comments: This paper is accepted in IEEE GLOBECOM 2019 for publication

arXiv:1905.08869 [pdf, other]

Source Localization and Tracking for Dynamic Radio Cartography using Directional Antennas

Authors: Mohsen Joneidi, Hassan Yazdani, Azadeh Vosoughi, Nazanin Rahnavard

Abstract: Utilization of directional antennas is a promising solution for efficient spectrum sensing and accurate source localization and tracking. Spectrum sensors equipped with directional antennas should constantly scan the space in order to track emitting sources and discover new activities in the area of interest. In this paper, we propose a new formulation that unifies received-signal-strength (RSS) and direction of arrival (DoA) in a compressive sensing (CS) framework. The underlying CS measurement matrix is a function of beamforming vectors of sensors and is referred to as the propagation matrix. Compared to the omni-directional antenna case, our employed propagation matrix provides more incoherent projections, an essential factor in the compressive sensing theory. Based on the new formulation, we optimize the antenna beams, enhance spectrum sensing efficiency, track active primary users accurately and monitor spectrum activities in an area of interest. In many practical scenarios there is no fusion center to integrate received data from spectrum sensors. We propose the distributed version of our algorithm for such cases. Experimental results show a significant improvement in source localization accuracy, compared with the scenario when sensors are equipped with omni-directional antennas. Applicability of the proposed framework for dynamic radio cartography is shown. Moreover, comparing the estimated dynamic RF map over time with the ground truth demonstrates the effectiveness of our proposed method for accurate signal estimation and recovery.

Submitted 21 May, 2019; originally announced May 2019.

Comments: SECON 2019 workshop on Edge Computing for Cyber Physical Systems

arXiv:1905.04392 [pdf, other]

Large-Scale Spectrum Occupancy Learning via Tensor Decomposition and LSTM Networks

Authors: Mohsen Joneidi, Ismail Alkhouri, Nazanin Rahnavard

Abstract: A new paradigm for large-scale spectrum occupancy learning based on long short-term memory (LSTM) recurrent neural networks is proposed. Studies have shown that spectrum usage is a highly correlated time series. Moreover, there is a correlation for occupancy of spectrum between different frequency channels. Therefore, revealing all these correlations using learning and prediction of one-dimensional time series is not a trivial task. In this paper, we introduce a new framework for representing the spectrum measurements in a tensor format. Next, a time-series prediction method based on CANDECOMP/PARFAC (CP) tensor decomposition and LSTM recurrent neural networks is proposed. The proposed method is computationally efficient and is able to capture different types of correlation within the measured spectrum. Moreover, it is robust against noise and missing entries of sensed spectrum. The superiority of the proposed method is evaluated over a large-scale synthetic dataset in terms of prediction accuracy and computational efficiency.

Submitted 10 May, 2019; originally announced May 2019.

Comments: Submitted to the 2019 IEEE Global Communications Conference (GLOBECOM)

arXiv:1905.04284 [pdf, other]

Primary User Localization and Online Radio Cartography via Structured Tensor Decomposition

Authors: Mohsen Joneidi, Nazanin Rahnavard

Abstract: Source localization and radio cartography using multi-way representation of spectrum is the subject of study in this paper. A joint matrix factorization and tensor decomposition problem is proposed and solved using an iterative algorithm. The multi-way measured spectrum is organized in a tensor and it is modeled by multiplication of a propagation tensor and a channel gain matrix. The tensor indicates the propagating power from each location and each frequency over time and the channel matrix links the propagating tensor to the sensed spectrum. We utilize sparsity and other intrinsic characteristics of spectrum to identify the solution of the proposed problem. Moreover, the online implementation of the proposed framework results in online radio cartography which is a powerful tool for efficient spectrum awareness and utilization. The simulation results show that our algorithm is a promising technique for dynamic primary user localization and online radio cartography.

Submitted 10 May, 2019; originally announced May 2019.

Comments: Submitted to the 2019 IEEE Global Communications Conference (GLOBECOM)

arXiv:1811.12326 [pdf, other]

Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision

Authors: Mohsen Joneidi, Alireza Zaeemzadeh, Nazanin Rahnavard, Mubarak Shah

Abstract: The goal of data selection is to capture the most structural information from a set of data. This paper presents a fast and accurate data selection method, in which the selected samples are optimized to span the subspace of all data. We propose a new selection algorithm, referred to as iterative projection and matching (IPM), with linear complexity w.r.t. the number of data, and without any parameter to be tuned. In our algorithm, at each iteration, the maximum information from the structure of the data is captured by one selected sample, and the captured information is neglected in the next iterations by projection on the null-space of previously selected samples. The computational efficiency and the selection accuracy of our proposed algorithm outperform those of the conventional methods. Furthermore, the superiority of the proposed algorithm is shown on active learning for video action recognition dataset on UCF-101; learning using representatives on ImageNet; training a generative adversarial network (GAN) to generate multi-view images from a single-view input on CMU Multi-PIE dataset; and video summarization on UTE Egocentric dataset.

Submitted 29 November, 2018; originally announced November 2018.

Comments: 11 pages, 5 figures, 5 tables
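
Illustrative sketch (not taken from the paper): the IPM abstract above describes selecting one sample per iteration and then projecting the data onto the null space of the samples already selected. The snippet below is one possible reading of that loop. The matching rule used here (pick the row best aligned with the leading right singular vector of the residual) and all names are assumptions, and the full SVD call is used for brevity, so this sketch does not reproduce the linear complexity the abstract claims.

```python
import numpy as np


def select_representatives(data, k, tol=1e-12):
    """Pick up to k row indices of `data` by repeated matching and projection."""
    residual = np.array(data, dtype=float)
    selected = []
    for _ in range(k):
        norms = np.linalg.norm(residual, axis=1)
        if not np.any(norms > tol):
            break  # every row is already spanned by the selected rows
        # Matching step (assumed rule): leading right singular vector of the
        # residual, then the row with the largest normalized correlation.
        _, _, vt = np.linalg.svd(residual, full_matrices=False)
        direction = vt[0]
        safe_norms = np.where(norms > tol, norms, np.inf)  # skip spanned rows
        scores = np.abs(residual @ direction) / safe_norms
        best = int(np.argmax(scores))
        selected.append(best)
        # Projection step: remove the chosen row's direction from every row.
        unit = residual[best] / norms[best]
        residual = residual - np.outer(residual @ unit, unit)
    return selected


samples = np.array([
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [1.1, -0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.9, 0.1],
])
print(select_representatives(samples, k=2))
```

Because each new pick is taken from what remains after projection, the selected rows are linearly independent, and the loop stops early once the selected rows span all of the data.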

arXiv:1703.03340 [pdf, other]

Adaptive Non-uniform Compressive Sampling for Time-varying Signals

Authors: Alireza Zaeemzadeh, Mohsen Joneidi, Nazanin Rahnavard

Abstract: In this paper, adaptive non-uniform compressive sampling (ANCS) of time-varying signals, which are sparse in a proper basis, is introduced. ANCS employs the measurements of previous time steps to distribute the sensing energy among coefficients more intelligently. To this aim, a Bayesian inference method is proposed that does not require any prior knowledge of importance levels of coefficients or sparsity of the signal. Our numerical simulations show that ANCS is able to achieve the desired non-uniform recovery of the signal. Moreover, if the signal is sparse in canonical basis, ANCS can reduce the number of required measurements significantly.

Submitted 9 March, 2017; originally announced March 2017.

Comments: 6 pages, 8 figures, Conference on Information Sciences and Systems (CISS 2017) Baltimore, Maryland

arXiv:1701.03420 [pdf, other]

Joint Dictionary Learning for Example-based Image Super-resolution

Authors: Mojtaba Sahraee-Ardakan, Mohsen Joneidi

Abstract: In this paper, we propose a new joint dictionary learning method for example-based image super-resolution (SR), using sparse representation. The low-resolution (LR) dictionary is trained from a set of LR sample image patches. Using the sparse representation coefficients of these LR patches over the LR dictionary, the high-resolution (HR) dictionary is trained by minimizing the reconstruction error of HR sample patches. The error criterion used here is the mean square error. In this way we guarantee that the HR patches have the same sparse representation over the HR dictionary as the LR patches over the LR dictionary, and at the same time, these sparse representations can well reconstruct the HR patches. Simulation results show the effectiveness of our method compared to the state-of-the-art SR algorithms.

Submitted 12 January, 2017; originally announced January 2017.

Comments: 5 pages, 1 figure, 1 table

arXiv:1508.07269 [pdf, ps, other]

doi 10.1109/MILCOM.2015.7357449

Missing Spectrum-Data Recovery in Cognitive Radio Networks Using Piecewise Constant Nonnegative Matrix Factorization

Authors: Alireza Zaeemzadeh, Mohsen Joneidi, Behzad Shahrasbi, Nazanin Rahnavard

Abstract: In this paper, we propose a missing spectrum data recovery technique for cognitive radio (CR) networks using Nonnegative Matrix Factorization (NMF). It is shown that the spectrum measurements collected from secondary users (SUs) can be factorized as the product of a channel gain matrix and an activation matrix. Then, an NMF method with piecewise constant activation coefficients is introduced to analyze the measurements and estimate the missing spectrum data. The proposed optimization problem is solved by a Majorization-Minimization technique. The numerical simulation verifies that the proposed technique is able to accurately estimate the missing spectrum data in the presence of noise and fading.

Submitted 28 August, 2015; originally announced August 2015.

Comments: 6 pages, 6 figures, Accepted for presentation in MILCOM'15 Conference

arXiv:1501.01106 [pdf]

A Study on Clustering for Clustering Based Image De-Noising

Authors: Hossein Bakhshi Golestani, Mohsen Joneidi, Mostafa Sadeghi

Abstract: In this paper, the problem of de-noising of an image contaminated with Additive White Gaussian Noise (AWGN) is studied. This subject has been an open problem in signal processing for more than 50 years. Local methods suggested in recent years have obtained better results than global methods. However, by more intelligent training, in such a way that first, important data is more effective for training, and second, clustering is done so that training blocks lie in low-rank subspaces, we can design a dictionary applicable for image de-noising and obtain results near the state-of-the-art local methods. In the present paper, we suggest a method based on global clustering of image constructing blocks. As the type of clustering plays an important role in clustering-based de-noising methods, we address two questions about the clustering. First, which parts of the data should be considered for clustering? Second, what data clustering method is suitable for de-noising? Then clustering is exploited to learn an over complete dictionary. By obtaining sparse decomposition of the noisy image blocks in terms of the dictionary atoms, the de-noised version is achieved. In addition to our framework, 7 popular dictionary learning methods are simulated and compared. The results are compared based on two major factors: (1) de-noising performance and (2) execution time. Experimental results show that our dictionary learning framework outperforms its competitors in terms of both factors.

Submitted 6 January, 2015; originally announced January 2015.

Comments: 9 pages, 8 figures, Journal of Information Systems and Telecommunications (JIST)

Journal ref: Journal of Information Systems and Telecommunications (JIST), vol. 2, no. 4, pp. 196-204, December 2014

arXiv:1412.6125 [pdf, ps, other]

Matrix Coherency Graph: A Tool for Improving Sparse Coding Performance

Authors: Mohsen Joneidi, Mahdi Barzegar Khalilsarai, Alireza Zaeemzadeh, Nazanin Rahnavard

Abstract: Exact recovery of a sparse solution for an underdetermined system of linear equations implies full search among all possible subsets of the dictionary, which is computationally intractable, while l1 minimization will do the job when a Restricted Isometry Property holds for the dictionary. Yet, practical sparse recovery algorithms may fail to recover the vector of coefficients even when the dictionary deviates from the RIP only slightly. To enjoy l1 minimization guarantees in a wider sense, a method based on a combination of full-search and l1 minimization is presented. The idea is based on partitioning the dictionary into atoms which are in some sense well-conditioned and those which are ill-conditioned. Inspired by that, a matrix coherency graph is introduced which is a tool extracted from the structure of the dictionary. This tool can be used for decreasing the greediness of sparse coding algorithms so that recovery will be more reliable. We have modified the IRLS algorithm by applying the proposed method to it, and simulation results show that the modified version performs considerably better than the original algorithm.

Submitted 30 November, 2014; originally announced December 2014.

Comments: 5 pages, 8 figures, going to be submitted to SampTA 2015

arXiv:1307.7521 [pdf, ps, other]

Union of Low-Rank Subspaces Detector

Authors: Mohsen Joneidi, Parvin Ahmadi, Mostafa Sadeghi, Nazanin Rahnavard

Abstract: The problem of signal detection using a flexible and general model is considered. Due to applicability and flexibility of sparse signal representation and approximation, it has attracted a lot of attention in many signal processing areas. In this paper, we propose a new detection method based on sparse decomposition in a union of subspaces (UoS) model. Our proposed detector uses a dictionary that can be interpreted as a bank of matched subspaces. This improves the performance of signal detection, as it is a generalization for detectors. Low-rank assumption for the desired signals implies that the representations of these signals in terms of some proper bases would be sparse. Our proposed detector exploits sparsity in its decision rule. We demonstrate the high efficiency of our method in the cases of voice activity detection in speech processing.

Submitted 16 February, 2016; v1 submitted 29 July, 2013; originally announced July 2013.

arXiv:1306.3317 [pdf]

Sparse Auto-Regressive: Robust Estimation of AR Parameters

Authors: Mohsen Joneidi

Abstract: In this paper I present a new approach for regression of time series using their own samples. This is a celebrated problem known as Auto-Regression. Dealing with outlier or missed samples in a time series makes the problem of estimation difficult, so it should be robust against them. Moreover, for coding purposes, I will show that it is desirable for the residual of auto-regression to be sparse. To these aims, I first assume a multivariate Gaussian prior on the residual and then obtain the estimation. Two simple simulations have been done on spectrum estimation and speech coding.

Submitted 18 August, 2015; v1 submitted 14 June, 2013; originally announced June 2013.

Comments: 4 pages, 4 figures

arXiv:1306.2967

Optimization of Clustering for Clustering-based Image Denoising

Authors: Mohsen Joneidi, Mostafa Sadeghi

Abstract: In this paper, the problem of de-noising of an image contaminated with additive white Gaussian noise (AWGN) is studied. This subject has continued to be an open problem in signal processing for more than 50 years. In the present paper, we suggest a method based on global clustering of image constructing blocks. Noting that the type of clustering plays an important role in clustering-based de-noising methods, we address two questions about the clustering. First, which parts of data should be considered for clustering? Second, what data clustering method is suitable for de-noising? Clustering is exploited to learn an over complete dictionary. By obtaining sparse decomposition of the noisy image blocks in terms of the dictionary atoms, the de-noised version is achieved. Experimental results show that our dictionary learning framework outperforms traditional dictionary learning methods such as K-SVD.

Submitted 28 October, 2013; v1 submitted 12 June, 2013; originally announced June 2013.

Comments: The paper has some problems and needs to be rewritten. It has been withdrawn.

Showing 1–16 of 16 results for author: Joneidi, M