Soft Synergies: Model Order Reduction of Hybrid Soft-Rigid Robots via Optimal Strain Parameterization

Authors: Abdulaziz Y. Alkayas, Anup Teejo Mathew, Daniel Feliu-Talegon, ** Deng, Thomas George Thuruthel, Federico Renda

Abstract: Soft robots offer remarkable adaptability and safety advantages over rigid robots, but modeling their complex, nonlinear dynamics remains challenging. Strain-based models have recently emerged as a promising candidate to describe such systems; however, they tend to be high-dimensional and time-consuming. This paper presents a novel model order reduction approach for soft and hybrid robots by combining strain-based modeling with Proper Orthogonal Decomposition (POD). The method identifies optimal coupled strain basis functions -- or mechanical synergies -- from simulation data, enabling the description of soft robot configurations with a minimal number of generalized coordinates. The reduced order model (ROM) achieves substantial dimensionality reduction while preserving accuracy. Rigorous testing demonstrates the interpolation and extrapolation capabilities of the ROM for soft manipulators under static and dynamic conditions. The approach is further validated on a snake-like hyper-redundant rigid manipulator and a closed-chain system with soft and rigid components, illustrating its broad applicability. Finally, the approach is leveraged for shape estimation of a real six-actuator soft manipulator using only two position markers, showcasing its practical utility. This POD-based ROM offers significant computational speed-ups, paving the way for real-time simulation and control of complex soft and hybrid robots.

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.10357 [pdf, other]

RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods

Authors: Xin Qiao, Matteo Poggi, Pengchao Deng, Hao Wei, Chenyang Ge, Stefano Mattoccia

Abstract: Integrating an RGB camera into a ToF imaging system has become a significant technique for perceiving the real world. The RGB guided ToF imaging system is crucial to several applications, including face anti-spoofing, saliency detection, and trajectory prediction. Depending on the distance of the working range, the implementation schemes of the RGB guided ToF imaging systems are different. Specifically, ToF sensors with a uniform field of illumination, which can output dense depth but have low resolution, are typically used for close-range measurements. In contrast, LiDARs, which emit laser pulses and can only capture sparse depth, are usually employed for long-range detection. In the two cases, depth quality improvement for RGB guided ToF imaging corresponds to two sub-tasks: guided depth super-resolution and guided depth completion. In light of the recent significant boost to the field provided by deep learning, this paper comprehensively reviews the works related to RGB guided ToF imaging, including network structures, learning strategies, evaluation metrics, benchmark datasets, and objective functions. Besides, we present quantitative comparisons of state-of-the-art methods on widely used benchmark datasets. Finally, we discuss future trends and the challenges in real applications for further research.

Submitted 16 May, 2024; originally announced May 2024.

Comments: To appear in the International Journal of Computer Vision (IJCV)

arXiv:2404.09165 [pdf, ps, other]

Private Multiple Linear Computation: A Flexible Communication-Computation Tradeoff

Authors: **bao Zhu, Lan** Li, Xiaohu Tang, ** Deng

Abstract: We consider the problem of private multiple linear computation (PMLC) over a replicated storage system with colluding and unresponsive constraints. In this scenario, the user wishes to privately compute $P$ linear combinations of $M$ files from a set of $N$ replicated servers without revealing any information about the coefficients of these linear combinations to any $T$ colluding servers, in the presence of $S$ unresponsive servers that do not provide any information in response to user queries. Our focus is on more general performance metrics where the communication and computational overheads incurred by the user are not neglected. Additionally, the communication and computational overheads for servers are also taken into consideration. Unlike most previous literature that primarily focused on download cost from servers as a performance metric, we propose a novel PMLC scheme to establish a flexible tradeoff between communication costs and computational complexities.

Submitted 14 April, 2024; originally announced April 2024.

Comments: Accepted by IEEE ISIT 2024

arXiv:2303.09824 [pdf, other]

doi 10.1109/TIV.2023.3274536

Motion Planning for Autonomous Driving: The State of the Art and Future Perspectives

Authors: Siyu Teng, Xuemin Hu, Peng Deng, Bai Li, Yuchen Li, Dongsheng Yang, Yunfeng Ai, Lingxi Li, Zhe Xuanyuan, Fenghua Zhu, Long Chen

Abstract: Intelligent vehicles (IVs) have gained worldwide attention due to their increased convenience, safety advantages, and potential commercial value. Despite predictions of commercial deployment by 2025, implementation remains limited to small-scale validation, with precise tracking controllers and motion planners being essential prerequisites for IVs. This paper reviews state-of-the-art motion planning methods for IVs, including pipeline planning and end-to-end planning methods. The study examines the selection, expansion, and optimization operations in a pipeline method, while it investigates training approaches and validation scenarios for driving tasks in end-to-end methods. Experimental platforms are reviewed to assist readers in choosing suitable training and validation strategies. A side-by-side comparison of the methods is provided to highlight their strengths and limitations, aiding system-level design choices. Current challenges and future perspectives are also discussed in this survey.

Submitted 10 May, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: 21 pages, 15 figures and 5 tables, in IEEE Transactions on Intelligent Vehicles

arXiv:2301.08249 [pdf, other]

Causal conditional hidden Markov model for multimodal traffic prediction

Authors: Yu Zhao, Pan Deng, Junting Liu, Xiaofeng Jia, Mulan Wang

Abstract: Multimodal traffic flow can reflect the health of the transportation system, and its prediction is crucial to urban traffic management. Recent works overemphasize spatio-temporal correlations of traffic flow, ignoring the physical concepts that lead to the generation of observations and their causal relationship. Spatio-temporal correlations are considered unstable under the influence of different conditions, and spurious correlations may exist in observations. In this paper, we analyze the physical concepts affecting the generation of multimode traffic flow from the perspective of the observation generation principle and propose a Causal Conditional Hidden Markov Model (CCHMM) to predict multimodal traffic flow. In the latent variables inference stage, a posterior network disentangles the causal representations of the concepts of interest from conditional information and observations, and a causal propagation module mines their causal relationship. In the data generation stage, a prior network samples the causal latent variables from the prior distribution and feeds them into the generator to generate multimodal traffic flow. We use a mutually supervised training method for the prior and posterior to enhance the identifiability of the model. Experiments on real-world datasets show that CCHMM can effectively disentangle causal representations of concepts of interest and identify causality, and accurately predict multimodal traffic flow.

Submitted 18 January, 2023; originally announced January 2023.

Comments: 8 pages, 5 figures

arXiv:2301.07843 [pdf, other]

Spatio-temporal neural structural causal models for bike flow prediction

Authors: Pan Deng, Yu Zhao, Junting Liu, Xiaofeng Jia, Mulan Wang

Abstract: As a representative of public transportation, the fundamental issue of managing bike-sharing systems is bike flow prediction. Recent methods overemphasize the spatio-temporal correlations in the data, ignoring the effects of contextual conditions on the transportation system and the inter-regional time-varying causality. In addition, due to the disturbance of incomplete observations in the data, random contextual conditions lead to spurious correlations between data and features, making the prediction of the model ineffective in special scenarios. To overcome this issue, we propose a Spatio-temporal Neural Structure Causal Model (STNSCM) from the perspective of causality. First, we build a causal graph to describe the traffic prediction, and further analyze the causal relationship between the input data, contextual conditions, spatio-temporal states, and prediction results. Second, we propose to apply the front-door criterion to eliminate confounding biases in the feature extraction process. Finally, we propose a counterfactual representation reasoning module to extrapolate the spatio-temporal state under the factual scenario to future counterfactual scenarios to improve the prediction performance. Experiments on real-world datasets demonstrate the superior performance of our model, especially its resistance to fluctuations caused by the external environment. The source code and data will be released.

Submitted 18 January, 2023; originally announced January 2023.

Comments: 8 pages, 4 figures

arXiv:2209.13866 [pdf, other]

Rethinking Blur Synthesis for Deep Real-World Image Deblurring

Authors: Hao Wei, Chenyang Ge, Xin Qiao, Pengchao Deng

Abstract: In this paper, we examine the problem of real-world image deblurring and take into account two key factors for improving the performance of the deep image deblurring model, namely, training data synthesis and network architecture design. Deblurring models trained on existing synthetic datasets perform poorly on real blurry images due to domain shift. To reduce the domain gap between synthetic and real domains, we propose a novel realistic blur synthesis pipeline to simulate the camera imaging process. As a result of our proposed synthesis method, existing deblurring models could be made more robust to handle real-world blur. Furthermore, we develop an effective deblurring model that captures non-local dependencies and local context in the feature domain simultaneously. Specifically, we introduce the multi-path transformer module to UNet architecture for enriched multi-scale features learning. A comprehensive experiment on three real-world datasets shows that the proposed deblurring model performs better than state-of-the-art methods.

Submitted 28 September, 2022; originally announced September 2022.

Comments: 9 pages, 7 figures

arXiv:2209.06158 [pdf, other]

Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design

Authors: Kehan Wu, Yingce Xia, Yang Fan, Pan Deng, Haiguang Liu, Lijun Wu, Shufang Xie, Tong Wang, Tao Qin, Tie-Yan Liu

Abstract: Structure-based drug design is drawing growing attention in computer-aided drug discovery. Compared with the virtual screening approach, where a pre-defined library of compounds is computationally screened, de novo drug design based on the structure of a target protein can provide novel drug candidates. In this paper, we present a generative solution named TamGent (Target-aware molecule generator with Transformer) that can directly generate candidate drugs from scratch for a given target, overcoming the limits imposed by existing compound libraries. Following the Transformer framework (a state-of-the-art framework in deep learning), we design a variant of Transformer encoder to process 3D geometric information of targets and pre-train the Transformer decoder on 10 million compounds from PubChem for candidate drug generation. Systematic evaluation on candidate compounds generated for targets from DrugBank shows that both binding affinity and drugability are largely improved. TamGent outperforms previous baselines in terms of both effectiveness and efficiency. The method is further verified by generating candidate compounds for the SARS-CoV-2 main protease and the oncogenic mutant KRAS G12C. The results show that our method not only re-discovers previously verified drug molecules, but also generates novel molecules with better docking scores, expanding the compound pool and potentially leading to the discovery of novel drugs.

Submitted 30 August, 2022; originally announced September 2022.

arXiv:2209.02276 [pdf, other]

Zero-shot Aspect-level Sentiment Classification via Explicit Utilization of Aspect-to-Document Sentiment Composition

Authors: Pengfei Deng, Jianhua Yuan, Yanyan Zhao, Bing Qin

Abstract: As aspect-level sentiment labels are expensive and labor-intensive to acquire, zero-shot aspect-level sentiment classification is proposed to learn classifiers applicable to new domains without using any annotated aspect-level data. In contrast, document-level sentiment data with ratings are more easily accessible. In this work, we achieve zero-shot aspect-level sentiment classification by only using document-level reviews. Our key intuition is that the sentiment representation of a document is composed of the sentiment representations of all the aspects of that document. Based on this, we propose the AF-DSC method to explicitly model such sentiment composition in reviews. AF-DSC first learns sentiment representations for all potential aspects and then aggregates aspect-level sentiments into a document-level one to perform document-level sentiment classification. In this way, we obtain the aspect-level sentiment classifier as the by-product of the document-level sentiment classifier. Experimental results on aspect-level sentiment classification benchmarks demonstrate the effectiveness of explicit utilization of sentiment composition in document-level sentiment classification. Our model with only 30k training data outperforms previous work utilizing millions of data.

Submitted 6 September, 2022; originally announced September 2022.

arXiv:2203.11660 [pdf, other]

Channel Self-Supervision for Online Knowledge Distillation

Authors: Shixiao Fan, Xuan Cheng, Xiaomin Wang, Chun Yang, Pan Deng, Minghui Liu, Jiali Deng, Ming Liu

Abstract: Recently, researchers have shown an increased interest in online knowledge distillation. Adopting a one-stage and end-to-end training fashion, online knowledge distillation uses aggregated intermediate predictions of multiple peer models for training. However, the absence of a powerful teacher model may result in the homogeneity problem between group peers, adversely affecting the effectiveness of group distillation. In this paper, we propose a novel online knowledge distillation method, Channel Self-Supervision for Online Knowledge Distillation (CSS), which structures diversity in terms of input, target, and network to alleviate the homogenization problem. Specifically, we construct a dual-network multi-branch structure and enhance inter-branch diversity through self-supervised learning, adopting the feature-level transformation and augmenting the corresponding labels. Meanwhile, the dual network structure has a larger space of independent parameters to resist the homogenization problem during distillation. Extensive quantitative experiments on CIFAR-100 illustrate that our method provides greater diversity than OKDDip, and we also achieve a notable performance improvement, even over the state-of-the-art such as PCL. The results on three fine-grained datasets (StanfordDogs, StanfordCars, CUB-200-2011) also show the significant generalization capability of our approach.

Submitted 23 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

arXiv:2110.15527 [pdf, other]

Pre-training Co-evolutionary Protein Representation via A Pairwise Masked Language Model

Authors: Liang He, Shizhuo Zhang, Lijun Wu, Huanhuan Xia, Fusong Ju, He Zhang, Siyuan Liu, Yingce Xia, Jianwei Zhu, Pan Deng, Bin Shao, Tao Qin, Tie-Yan Liu

Abstract: Understanding protein sequences is vital and urgent for biology, healthcare, and medicine. Labeling approaches are expensive and time-consuming, while the amount of unlabeled data is increasing much faster than that of the labeled data due to low-cost, high-throughput sequencing methods. In order to extract knowledge from these unlabeled data, representation learning is of significant value for protein-related tasks and has great potential for helping us learn more about protein functions and structures. The key problem in protein sequence representation learning is to capture the co-evolutionary information reflected by the inter-residue co-variation in the sequences. Instead of leveraging multiple sequence alignment as is usually done, we propose a novel method to capture this information directly by pre-training via a dedicated language model, i.e., Pairwise Masked Language Model (PMLM). In a conventional masked language model, the masked tokens are modeled by conditioning on the unmasked tokens only, but processed independently of each other. However, our proposed PMLM takes the dependency among masked tokens into consideration, i.e., the probability of a token pair is not equal to the product of the probability of the two tokens. By applying this model, the pre-trained encoder is able to generate a better representation for protein sequences.
Our result shows that the proposed method can effectively capture the inter-residue correlations and improves the performance of contact prediction by up to 9% compared to the MLM baseline under the same setting. The proposed model also significantly outperforms the MSA baseline by more than 7% on the TAPE contact prediction benchmark when pre-trained on a subset of the sequence database which the MSA is generated from, revealing the potential of the sequence pre-training method to surpass MSA-based methods in general.

Submitted 29 October, 2021; originally announced October 2021.

arXiv:2106.03148 [pdf]

Assessing Attendance by Peer Information

Authors: Pan Deng, Jianjun Zhou, **g Lyu, Zitong Zhao

Abstract: Attendance rate is an important indicator of students' study motivation, behavior and psychological status; however, the heterogeneous nature of student attendance rates due to the course registration difference or the online/offline difference in a blended learning environment makes it challenging to compare attendance rates. In this paper, we propose a novel method called Relative Attendance Index (RAI) to measure attendance rates, which reflects students' efforts on attending courses. While traditional attendance focuses on the record of a single person or course, relative attendance emphasizes peer attendance information of relevant individuals or courses, making the comparisons of attendance more justified. Experimental results on real-life data show that RAI can indeed better reflect student engagement.

Submitted 6 June, 2021; originally announced June 2021.

Journal ref: Proceedings of EDM 2021

arXiv:2011.05850 [pdf, other]

Detecting Adversarial Patches with Class Conditional Reconstruction Networks

Authors: Perry Deng, Mohammad Saidur Rahman, Matthew Wright

Abstract: Defending against physical adversarial attacks is a rapidly growing topic in deep learning and computer vision. Prominent forms of physical adversarial attacks, such as overlaid adversarial patches and objects, share similarities with digital attacks, but are easy for humans to notice. This leads us to explore the hypothesis that adversarial detection methods, which have been shown to be ineffective against adaptive digital adversarial examples, can be effective against these physical attacks. We use one such detection method based on autoencoder architectures, and perform adversarial patching experiments on MNIST, SVHN, and CIFAR10 against a CNN architecture and two CapsNet architectures. We also propose two modifications to the EM-Routed CapsNet architecture, Affine Voting and Matrix Capsule Dropout, to improve its classification performance. Our investigation shows that the detector retains some of its effectiveness even against adaptive adversarial patch attacks. In addition, detection performance tends to decrease among all the architectures with the increase of dataset complexity.

Submitted 11 November, 2020; v1 submitted 11 November, 2020; originally announced November 2020.

arXiv:2010.04382 [pdf, other]

doi 10.1109/ISI49825.2020.9280538

Weaponizing Unicodes with Deep Learning -- Identifying Homoglyphs with Weakly Labeled Data

Authors: Perry Deng, Cooper Linsky, Matthew Wright

Abstract: Visually similar characters, or homoglyphs, can be used to perform social engineering attacks or to evade spam and plagiarism detectors. It is thus important to understand the capabilities of an attacker to identify homoglyphs -- particularly ones that have not been previously spotted -- and leverage them in attacks. We investigate a deep-learning model using embedding learning, transfer learning, and augmentation to determine the visual similarity of characters and thereby identify potential homoglyphs. Our approach uniquely takes advantage of weak labels that arise from the fact that most characters are not homoglyphs. Our model drastically outperforms the Normalized Compression Distance approach on pairwise homoglyph identification, for which we achieve an average precision of 0.97. We also present the first attempt at clustering homoglyphs into sets of equivalence classes, which is more efficient than pairwise information for security practitioners to quickly look up homoglyphs or to normalize confusable string encodings. To measure clustering performance, we propose a metric (mBIOU) building on the classic Intersection-Over-Union (IOU) metric. Our clustering method achieves 0.592 mBIOU, compared to 0.430 for the naive baseline. We also use our model to predict over 8,000 previously unknown homoglyphs, and find good early indications that many of these may be true positives. Source code and list of predicted homoglyphs are uploaded to GitHub: https://github.com/PerryXDeng/weaponizing_unicode

Submitted 22 December, 2020; v1 submitted 9 October, 2020; originally announced October 2020.

Comments: Updated DOI

arXiv:1911.02549 [pdf, other]

MLPerf Inference Benchmark

Authors: Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee, et al. (22 additional authors not shown)

Abstract: Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark's flexibility and adaptability.

Submitted 9 May, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: ISCA 2020

arXiv:1701.01156 [pdf]

Adaptive Real-Time Software Defined MIMO Visible Light Communications using Spatial Multiplexing and Spatial Diversity

Authors: Peng Deng, Mohsen Kavehrad

Abstract: In this paper, we experimentally demonstrate a real-time software defined multiple input multiple output (MIMO) visible light communication (VLC) system employing link adaptation of spatial multiplexing and spatial diversity. Real-time MIMO signal processing is implemented by using the Field Programmable Gate Array (FPGA) based Universal Software Radio Peripheral (USRP) devices. Software defined implementation of MIMO VLC can assist in enabling an adaptive and reconfigurable communication system without hardware changes. We measured the error vector magnitude (EVM), bit error rate (BER) and spectral efficiency performance for single carrier M-QAM MIMO VLC using spatial diversity and spatial multiplexing. Results show that spatial diversity MIMO VLC improves error performance at the cost of spectral efficiency that spatial multiplexing should enhance. We propose an adaptive MIMO solution in which both the modulation scheme and the MIMO scheme are dynamically adapted to the changing channel conditions to enhance the error performance and spectral efficiency. The average error-free spectral efficiency of adaptive 2x2 MIMO VLC achieved 12 b/s/Hz over 2 meters indoor dynamic transmission.

Submitted 26 December, 2016; originally announced January 2017.

Comments: 6 pages, 7 figures, Accepted by WiSEE_2016. arXiv admin note: substantial text overlap with arXiv:1612.05402

arXiv:1612.08477 [pdf]

Effect of white LED DC-bias on modulation speed for visible light communications

Authors: Peng Deng, Mohsen Kavehrad

Abstract: The light emitting diode (LED) nonlinearities distortion induced degradation in the performance of visible light communication (VLC) systems can be controlled by optimizing the DC bias point of the LED. In this paper, we theoretically analyze and experimentally demonstrate the effect of white LED DC bias on nonlinear modulation bandwidth and dynamic range of the VLC system. The linear dynamic range is enhanced by using series-connected LED chips, and the modulation bandwidth is extended to 40 MHz by post-equalization without using a blue filter. The experimental results well match the theoretical model of LED nonlinear modulation characteristics. The results show that the modulation bandwidth increases and saturates with an increase in LED DC bias current due to nonlinear effect of carrier lifetime and junction capacitance. The optimized DC-bias current that corresponds to the minimum BER increases with the increase of data rate. A 60-Mbps NRZ transmission can be achieved with BER threshold of 10^-3 by properly adjusting LED DC bias point.

Submitted 26 December, 2016; originally announced December 2016.

Comments: 11 pages, 11 figures

arXiv:1612.05402 [pdf]

Software Defined Adaptive MIMO Visible Light Communications after an Obstruction

Authors: Peng Deng, Mohsen Kavehrad

Abstract: We experimentally demonstrate a software-defined 2x2 MIMO VLC system employing link adaptation of spatial multiplexing and diversity. The average error-free spectral efficiency of 12 b/s/Hz is achieved over 2 meters indoor transmission after an obstruction.

Submitted 16 December, 2016; originally announced December 2016.

Comments: 3 pages, 3 figures, accepted by OFC 2017

arXiv:1504.01320 [pdf]

doi 10.1109/ICNSURV.2015.7121217

Robust Timing Synchronization for AC-OFDM Based Optical Wireless Communications

Authors: Bilal A. Ranjha, Mohammadreza A. Kashani, Mohsen Kavehrad, Peng Deng

Abstract: Visible light communications (VLC) have recently attracted a growing interest and can be a potential solution to realize indoor wireless communication with high bandwidth capacity for RF-restricted environments such as airplanes and hospitals. Optical based orthogonal frequency division multiplexing (OFDM) systems have been proposed in the literature to combat multipath distortion and intersymbol interference (ISI) caused by multipath signal propagation. In this paper, we present a robust timing synchronization scheme suitable for asymmetrically clipped (AC) OFDM based optical intensity modulated direct detection (IM/DD) wireless systems. Our proposed method works perfectly for ACO-OFDM, Pulse amplitude modulated discrete multitone (PAM-DMT) and discrete Hartley transform (DHT) based optical OFDM systems. In contrast to existing OFDM timing synchronization methods which are either not suitable for AC OFDM techniques due to unipolar nature of output signal or perform poorly, our proposed method is suitable for AC OFDM schemes and outperforms all other available techniques. Both numerical and experimental results confirm the accuracy of the proposed method. Our technique is also computationally efficient as it requires very few computations as compared to conventional methods in order to achieve good accuracy.

Submitted 6 April, 2015; originally announced April 2015.

Comments: Accepted for publication in IEEE ICNS 2015, 10 Pages, 7 figs

Showing 1–19 of 19 results for author: Deng, P