-
A multi-frequency spaceborne radar perspective of deep convection
Authors:
Randy J. Chase,
Brenda Dolan,
Kristen L. Rasmussen,
Richard M. Schulte,
Graeme Stephens,
F. Joe Turk,
Susan C. van den Heever
Abstract:
Global numerical weather models are starting to resolve atmospheric moist convection which comes with a critical need for observational constraints. One avenue for such constraints is spaceborne radar which tend to operate at three wavelengths, Ku-, Ka- and W-band. Many studies of deep convection in the past have primarily leveraged Ku-band because it is less affected by attenuation and multiple s…
▽ More
Global numerical weather models are starting to resolve atmospheric moist convection which comes with a critical need for observational constraints. One avenue for such constraints is spaceborne radar which tend to operate at three wavelengths, Ku-, Ka- and W-band. Many studies of deep convection in the past have primarily leveraged Ku-band because it is less affected by attenuation and multiple scattering. However, future spaceborne radar missions might not contain a Ku-band radar and thus considering the view of convection from Ka-band or W-band compared to the Ku-band would be useful. This study examines a coincident dataset between the Global Precipitation Measurement (GPM) Mission and CloudSat as well as the entire GPM record to compare convective characteristics across various wavelengths within deep convection. We find that W-band reflectivity (Z) tends to maximize near the Ku-band defined echo-top while Ka-band often maximizes 4-5 km below. The height of the maximum Z above the melting level for W-band and Ka-band do not linearly relate to the Ku-band maximum. However, using the full GPM record the Ka-band 30 dBZ echo-tops can be linearly related to the Ku-band 40 dBZ echo-top with an $R^2$ of 0.62 and a root mean squared error of about 1 km. The spatial distribution of echo-tops from Ka-band corresponds well to the Ku-band echo-tops, highlighting regions of relatively large ice water path. This paper suggests that Ka-band only missions, like NASA's Investigation for Convective Updrafts, should be able to characterize global convection in a similar manner to a Ku-band system.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
AD-BERT: Using Pre-trained contextualized embeddings to Predict the Progression from Mild Cognitive Impairment to Alzheimer's Disease
Authors:
Chengsheng Mao,
Jie Xu,
Luke Rasmussen,
Yikuan Li,
Prakash Adekkanattu,
Jennifer Pacheco,
Borna Bonakdarpour,
Robert Vassar,
Guoqian Jiang,
Fei Wang,
Jyotishman Pathak,
Yuan Luo
Abstract:
Objective: We develop a deep learning framework based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model using unstructured clinical notes from electronic health records (EHRs) to predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD). Materials and Methods: We identified 3657 patients diagnosed with MCI togethe…
▽ More
Objective: We develop a deep learning framework based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model using unstructured clinical notes from electronic health records (EHRs) to predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD). Materials and Methods: We identified 3657 patients diagnosed with MCI together with their progress notes from Northwestern Medicine Enterprise Data Warehouse (NMEDW) between 2000-2020. The progress notes no later than the first MCI diagnosis were used for the prediction. We first preprocessed the notes by deidentification, cleaning and splitting, and then pretrained a BERT model for AD (AD-BERT) based on the publicly available Bio+Clinical BERT on the preprocessed notes. The embeddings of all the sections of a patient's notes processed by AD-BERT were combined by MaxPooling to compute the probability of MCI-to-AD progression. For replication, we conducted a similar set of experiments on 2563 MCI patients identified at Weill Cornell Medicine (WCM) during the same timeframe. Results: Compared with the 7 baseline models, the AD-BERT model achieved the best performance on both datasets, with Area Under receiver operating characteristic Curve (AUC) of 0.8170 and F1 score of 0.4178 on NMEDW dataset and AUC of 0.8830 and F1 score of 0.6836 on WCM dataset. Conclusion: We developed a deep learning framework using BERT models which provide an effective solution for prediction of MCI-to-AD progression using clinical note analysis.
△ Less
Submitted 6 November, 2022;
originally announced December 2022.
-
Partitioning a Polygon Into Small Pieces
Authors:
Mikkel Abrahamsen,
Nichlas Langhoff Rasmussen
Abstract:
We study the problem of partitioning a given simple polygon $P$ into a minimum number of polygonal pieces, each of which has bounded size. We give algorithms for seven notions of `bounded size,' namely that each piece has bounded area, perimeter, straight-line diameter, geodesic diameter, or that each piece must be contained in a unit disk, an axis-aligned unit square or an arbitrarily rotated uni…
▽ More
We study the problem of partitioning a given simple polygon $P$ into a minimum number of polygonal pieces, each of which has bounded size. We give algorithms for seven notions of `bounded size,' namely that each piece has bounded area, perimeter, straight-line diameter, geodesic diameter, or that each piece must be contained in a unit disk, an axis-aligned unit square or an arbitrarily rotated unit square.
A more general version of the area problem has already been studied. Here we are, in addition to $P$, given positive real values $a_1,\ldots,a_k$ such that the sum $\sum_{i=1}^k a_i$ equals the area of $P$. The goal is to partition $P$ into exactly $k$ pieces $Q_1,\ldots,Q_k$ such that the area of $Q_i$ is $a_i$. Such a partition always exists, and an algorithm with running time $O(nk)$ has previously been described, where $n$ is the number of corners of $P$. We give an algorithm with optimal running time $O(n+k)$. For polygons with holes, we get running time $O(n\log n+k)$.
For the other problems, it seems out of reach to compute optimal partitions for simple polygons; for most of them, even in extremely restricted cases such as when $P$ is a square. We therefore develop $O(1)$-approximation algorithms for these problems, which means that the number of pieces in the produced partition is at most a constant factor larger than the cardinality of a minimum partition. Existing algorithms do not allow Steiner points, which means that all corners of the produced pieces must also be corners of $P$. This has the disappointing consequence that a partition does often not exist, whereas our algorithms always produce useful partitions. Furthermore, an optimal partition without Steiner points may require $Ω(n)$ pieces for polygons where a partition consisting of just $2$ pieces exists when Steiner points are allowed.
△ Less
Submitted 2 November, 2022;
originally announced November 2022.
-
Evolutionary Time-Use Optimization for Improving Children's Health Outcomes
Authors:
Yue Xie,
Aneta Neumann,
Ty Stanford,
Charlotte Lund Rasmussen,
Dorothea Dumuid,
Frank Neumann
Abstract:
How someone allocates their time is important to their health and well-being. In this paper, we show how evolutionary algorithms can be used to promote health and well-being by optimizing time usage. Based on data from a large population-based child cohort, we design fitness functions to explain health outcomes and introduce constraints for viable time plans. We then investigate the performance of…
▽ More
How someone allocates their time is important to their health and well-being. In this paper, we show how evolutionary algorithms can be used to promote health and well-being by optimizing time usage. Based on data from a large population-based child cohort, we design fitness functions to explain health outcomes and introduce constraints for viable time plans. We then investigate the performance of evolutionary algorithms to optimize time use for four individual health outcomes with hypothetical children with different day structures. As the four health outcomes are competing for time allocations, we study how to optimize multiple health outcomes simultaneously in the form of a multi-objective optimization problem. We optimize one-week time-use plans using evolutionary multi-objective algorithms and point out the trade-offs achievable with respect to different health outcomes.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
$α$ Belief Propagation for Approximate Inference
Authors:
Dong Liu,
Minh Thành Vu,
Zuxing Li,
Lars K. Rasmussen
Abstract:
Belief propagation (BP) algorithm is a widely used message-passing method for inference in graphical models. BP on loop-free graphs converges in linear time. But for graphs with loops, BP's performance is uncertain, and the understanding of its solution is limited. To gain a better understanding of BP in general graphs, we derive an interpretable belief propagation algorithm that is motivated by m…
▽ More
Belief propagation (BP) algorithm is a widely used message-passing method for inference in graphical models. BP on loop-free graphs converges in linear time. But for graphs with loops, BP's performance is uncertain, and the understanding of its solution is limited. To gain a better understanding of BP in general graphs, we derive an interpretable belief propagation algorithm that is motivated by minimization of a localized $α$-divergence. We term this algorithm as $α$ belief propagation ($α$-BP). It turns out that $α$-BP generalizes standard BP. In addition, this work studies the convergence properties of $α$-BP. We prove and offer the convergence conditions for $α$-BP. Experimental simulations on random graphs validate our theoretical results. The application of $α$-BP to practical problems is also demonstrated.
△ Less
Submitted 27 June, 2020;
originally announced June 2020.
-
Region-based Energy Neural Network for Approximate Inference
Authors:
Dong Liu,
Ragnar Thobaben,
Lars K. Rasmussen
Abstract:
Region-based free energy was originally proposed for generalized belief propagation (GBP) to improve loopy belief propagation (loopy BP). In this paper, we propose a neural network based energy model for inference in general Markov random fields (MRFs), which directly minimizes the region-based free energy defined on region graphs. We term our model Region-based Energy Neural Network (RENN). Unlik…
▽ More
Region-based free energy was originally proposed for generalized belief propagation (GBP) to improve loopy belief propagation (loopy BP). In this paper, we propose a neural network based energy model for inference in general Markov random fields (MRFs), which directly minimizes the region-based free energy defined on region graphs. We term our model Region-based Energy Neural Network (RENN). Unlike message-passing algorithms, RENN avoids iterative message propagation and is faster. Also different from recent deep neural network based models, inference by RENN does not require sampling, and RENN works on general MRFs. RENN can also be employed for MRF learning. Our experiments on marginal distribution estimation, partition function estimation, and learning of MRFs show that RENN outperforms the mean field method, loopy BP, GBP, and the state-of-the-art neural network based model.
△ Less
Submitted 17 June, 2020;
originally announced June 2020.
-
Will Scale-free Popularity Develop Scale-free Geo-social Networks?
Authors:
Dong Liu,
Viktoria Fodor,
Lars K. Rasmussen
Abstract:
Empirical results show that spatial factors such as distance, population density and communication range affect our social activities, also reflected by the development of ties in social networks. This motivates the need for social network models that take these spatial factors into account. Therefore, in this paper we propose a gravity-low-based geo-social network model, where connections develop…
▽ More
Empirical results show that spatial factors such as distance, population density and communication range affect our social activities, also reflected by the development of ties in social networks. This motivates the need for social network models that take these spatial factors into account. Therefore, in this paper we propose a gravity-low-based geo-social network model, where connections develop according to the popularity of the individuals, but are constrained through their geographic distance and the surrounding population density. Specifically, we consider a power-law distributed popularity, and random node positions governed by a Poisson point process. We evaluate the characteristics of the emerging networks, considering the degree distribution, the average degree of neighbors and the local clustering coefficient. These local metrics reflect the robustness of the network, the information dissemination speed and the communication locality. We show that unless the communication range is strictly limited, the emerging networks are scale-free, with a rank exponent affected by the spatial factors. Even the average neighbor degree and the local clustering coefficient show tendencies known in non-geographic scale-free networks, at least when considering individuals with low popularity. At high-popularity values, however, the spatial constraints lead to popularity-independent average neighbor degrees and clustering coefficients.
△ Less
Submitted 3 March, 2020;
originally announced March 2020.
-
On Dominant Interference in Random Networks and Communication Reliability
Authors:
Dong Liu,
Baptiste Cavarec,
Lars K. Rasmussen,
**g Yue
Abstract:
In this paper, we study the characteristics of dominant interference power with directional reception in a random network modelled by a Poisson Point Process. Additionally, the Laplace functional of cumulative interference excluding the $n$ dominant interferers is also derived, which turns out to be a generalization of omni-directional reception and complete accumulative interference. As an applic…
▽ More
In this paper, we study the characteristics of dominant interference power with directional reception in a random network modelled by a Poisson Point Process. Additionally, the Laplace functional of cumulative interference excluding the $n$ dominant interferers is also derived, which turns out to be a generalization of omni-directional reception and complete accumulative interference. As an application of these results, we study the impact of directional receivers in random networks in terms of outage probability and error probability with queue length constraint.
△ Less
Submitted 3 March, 2020;
originally announced March 2020.
-
Design Principles Developed through User-Centered and Socio-Technical Methods Improve Clinician Satisfaction, Speed, and Confidence in Pharmacogenomic Clinical Decision Support
Authors:
Timothy M. Herr,
Therese A. Nelson,
Luke V. Rasmussen,
Yinan Zheng,
Nicola Lancki,
MPH,
Justin B. Starren
Abstract:
OBJECTIVE: To design and evaluate new pharmacogenomic (PGx) clinical decision support (CDS) alerts, built to adhere to PGx CDS design principles developed through socio-technical approaches.
MATERIALS AND METHODS: Based on previously identified design principles, we created 11 new PGx CDS alert designs and developed an interactive web application containing realistic clinical scenarios and user…
▽ More
OBJECTIVE: To design and evaluate new pharmacogenomic (PGx) clinical decision support (CDS) alerts, built to adhere to PGx CDS design principles developed through socio-technical approaches.
MATERIALS AND METHODS: Based on previously identified design principles, we created 11 new PGx CDS alert designs and developed an interactive web application containing realistic clinical scenarios and user workflows that mimicked a real-world EHR system. We recruited General Internal Medicine and Cardiology clinicians from Northwestern Medicine and recorded their interactions with the original and new designs. We measured clinician response, satisfaction, speed, and confidence through questionnaires and analysis of the recordings.
RESULTS: The study included 12 clinicians. Participants were significantly more satisfied (p=0.0000001), faster (p=0.009), and more confident (p<.05) with the new designs than the original ones. The study lacked statistical power to determine whether prescribing accuracy was improved, but participants were no less accurate, and clinical actions were more concordant with alert interactions (p=0.004) with the new designs. We found a significant learning curve associated with the original designs, which was eliminated with the new designs.
DISCUSSION: This study successfully demonstrates that socio-technical and user-centered design techniques can improve PGx CDS alert designs. Best practices for PGx CDS design are limited in the literature, with few effectiveness studies available. These results can help guide future PGx CDS implementations to be more clinician friendly and less time-consuming.
CONCLUSION: The results of this study support the PGx CDS design principles we proposed in previous work. As a next step, the new designs should be implemented in a live setting for further validation.
△ Less
Submitted 31 January, 2020;
originally announced February 2020.
-
Powering Hidden Markov Model by Neural Network based Generative Models
Authors:
Dong Liu,
Antoine Honoré,
Saikat Chatterjee,
Lars K. Rasmussen
Abstract:
Hidden Markov model (HMM) has been successfully used for sequential data modeling problems. In this work, we propose to power the modeling capacity of HMM by bringing in neural network based generative models. The proposed model is termed as GenHMM. In the proposed GenHMM, each HMM hidden state is associated with a neural network based generative model that has tractability of exact likelihood and…
▽ More
Hidden Markov model (HMM) has been successfully used for sequential data modeling problems. In this work, we propose to power the modeling capacity of HMM by bringing in neural network based generative models. The proposed model is termed as GenHMM. In the proposed GenHMM, each HMM hidden state is associated with a neural network based generative model that has tractability of exact likelihood and provides efficient likelihood computation. A generative model in GenHMM consists of mixture of generators that are realized by flow models. A learning algorithm for GenHMM is proposed in expectation-maximization framework. The convergence of the learning GenHMM is analyzed. We demonstrate the efficiency of GenHMM by classification tasks on practical sequential data. Code available at https://github.com/FirstHandScientist/genhmm.
△ Less
Submitted 24 May, 2020; v1 submitted 13 October, 2019;
originally announced October 2019.
-
$α$ Belief Propagation as Fully Factorized Approximation
Authors:
Dong Liu,
Nima N. Moghadam,
Lars K. Rasmussen,
**liang Huang,
Saikat Chatterjee
Abstract:
Belief propagation (BP) can do exact inference in loop-free graphs, but its performance could be poor in graphs with loops, and the understanding of its solution is limited. This work gives an interpretable belief propagation rule that is actually minimization of a localized $α$-divergence. We term this algorithm as $α$ belief propagation ($α$-BP). The performance of $α$-BP is tested in MAP (maxim…
▽ More
Belief propagation (BP) can do exact inference in loop-free graphs, but its performance could be poor in graphs with loops, and the understanding of its solution is limited. This work gives an interpretable belief propagation rule that is actually minimization of a localized $α$-divergence. We term this algorithm as $α$ belief propagation ($α$-BP). The performance of $α$-BP is tested in MAP (maximum a posterior) inference problems, where $α$-BP can outperform (loopy) BP by a significant margin even in fully-connected graphs.
△ Less
Submitted 23 August, 2019;
originally announced August 2019.
-
Neural Network based Explicit Mixture Models and Expectation-maximization based Learning
Authors:
Dong Liu,
Minh Thành Vu,
Saikat Chatterjee,
Lars K. Rasmussen
Abstract:
We propose two neural network based mixture models in this article. The proposed mixture models are explicit in nature. The explicit models have analytical forms with the advantages of computing likelihood and efficiency of generating samples. Computation of likelihood is an important aspect of our models. Expectation-maximization based algorithms are developed for learning parameters of the propo…
▽ More
We propose two neural network based mixture models in this article. The proposed mixture models are explicit in nature. The explicit models have analytical forms with the advantages of computing likelihood and efficiency of generating samples. Computation of likelihood is an important aspect of our models. Expectation-maximization based algorithms are developed for learning parameters of the proposed models. We provide sufficient conditions to realize the expectation-maximization based learning. The main requirements are invertibility of neural networks that are used as generators and Jacobian computation of functional form of the neural networks. The requirements are practically realized using a flow-based neural network. In our first mixture model, we use multiple flow-based neural networks as generators. Naturally the model is complex. A single latent variable is used as the common input to all the neural networks. The second mixture model uses a single flow-based neural network as a generator to reduce complexity. The single generator has a latent variable input that follows a Gaussian mixture distribution. We demonstrate efficiency of proposed mixture models through extensive experiments for generating samples and maximum likelihood based classification.
△ Less
Submitted 24 May, 2020; v1 submitted 31 July, 2019;
originally announced July 2019.
-
Evaluating the Portability of an NLP System for Processing Echocardiograms: A Retrospective, Multi-site Observational Study
Authors:
Prakash Adekkanattu,
Guoqian Jiang,
Yuan Luo,
Paul R. Kingsbury,
Zhenxing Xu,
Luke V. Rasmussen,
Jennifer A. Pacheco,
Richard C. Kiefer,
Daniel J. Stone,
Pascal S. Brandt,
Liang Yao,
Yizhen Zhong,
Yu Deng,
Fei Wang,
Jessica S. Ancker,
Thomas R. Campion,
Jyotishman Pathak
Abstract:
While natural language processing (NLP) of unstructured clinical narratives holds the potential for patient care and clinical research, portability of NLP approaches across multiple sites remains a major challenge. This study investigated the portability of an NLP system developed initially at the Department of Veterans Affairs (VA) to extract 27 key cardiac concepts from free-text or semi-structu…
▽ More
While natural language processing (NLP) of unstructured clinical narratives holds the potential for patient care and clinical research, portability of NLP approaches across multiple sites remains a major challenge. This study investigated the portability of an NLP system developed initially at the Department of Veterans Affairs (VA) to extract 27 key cardiac concepts from free-text or semi-structured echocardiograms from three academic medical centers: Weill Cornell Medicine, Mayo Clinic and Northwestern Medicine. While the NLP system showed high precision and recall measurements for four target concepts (aortic valve regurgitation, left atrium size at end systole, mitral valve regurgitation, tricuspid valve regurgitation) across all sites, we found moderate or poor results for the remaining concepts and the NLP system performance varied between individual sites.
△ Less
Submitted 1 April, 2019;
originally announced May 2019.
-
Identifying Sub-Phenotypes of Acute Kidney Injury using Structured and Unstructured Electronic Health Record Data with Memory Networks
Authors:
Zhenxing Xu,
**gyuan Chou,
Xi Sheryl Zhang,
Yuan Luo,
Tamara Isakova,
Prakash Adekkanattu,
Jessica S. Ancker,
Guoqian Jiang,
Richard C. Kiefer,
Jennifer A. Pacheco,
Luke V. Rasmussen,
Jyotishman Pathak,
Fei Wang
Abstract:
Acute Kidney Injury (AKI) is a common clinical syndrome characterized by the rapid loss of kidney excretory function, which aggravates the clinical severity of other diseases in a large number of hospitalized patients. Accurate early prediction of AKI can enable in-time interventions and treatments. However, AKI is highly heterogeneous, thus identification of AKI sub-phenotypes can lead to an impr…
▽ More
Acute Kidney Injury (AKI) is a common clinical syndrome characterized by the rapid loss of kidney excretory function, which aggravates the clinical severity of other diseases in a large number of hospitalized patients. Accurate early prediction of AKI can enable in-time interventions and treatments. However, AKI is highly heterogeneous, thus identification of AKI sub-phenotypes can lead to an improved understanding of the disease pathophysiology and development of more targeted clinical interventions. This study used a memory network-based deep learning approach to discover AKI sub-phenotypes using structured and unstructured electronic health record (EHR) data of patients before AKI diagnosis. We leveraged a real world critical care EHR corpus including 37,486 ICU stays. Our approach identified three distinct sub-phenotypes: sub-phenotype I is with an average age of 63.03$ \pm 17.25 $ years, and is characterized by mild loss of kidney excretory function (Serum Creatinine (SCr) $1.55\pm 0.34$ mg/dL, estimated Glomerular Filtration Rate Test (eGFR) $107.65\pm 54.98$ mL/min/1.73$m^2$). These patients are more likely to develop stage I AKI. Sub-phenotype II is with average age 66.81$ \pm 10.43 $ years, and was characterized by severe loss of kidney excretory function (SCr $1.96\pm 0.49$ mg/dL, eGFR $82.19\pm 55.92$ mL/min/1.73$m^2$). These patients are more likely to develop stage III AKI. Sub-phenotype III is with average age 65.07$ \pm 11.32 $ years, and was characterized moderate loss of kidney excretory function and thus more likely to develop stage II AKI (SCr $1.69\pm 0.32$ mg/dL, eGFR $93.97\pm 56.53$ mL/min/1.73$m^2$). Both SCr and eGFR are significantly different across the three sub-phenotypes with statistical testing plus postdoc analysis, and the conclusion still holds after age adjustment.
△ Less
Submitted 22 December, 2019; v1 submitted 9 April, 2019;
originally announced April 2019.
-
Entropy-regularized Optimal Transport Generative Models
Authors:
Dong Liu,
Minh Thành Vu,
Saikat Chatterjee,
Lars K. Rasmussen
Abstract:
We investigate the use of entropy-regularized optimal transport (EOT) cost in develo** generative models to learn implicit distributions. Two generative models are proposed. One uses EOT cost directly in an one-shot optimization problem and the other uses EOT cost iteratively in an adversarial game. The proposed generative models show improved performance over contemporary models for image gener…
▽ More
We investigate the use of entropy-regularized optimal transport (EOT) cost in develo** generative models to learn implicit distributions. Two generative models are proposed. One uses EOT cost directly in an one-shot optimization problem and the other uses EOT cost iteratively in an adversarial game. The proposed generative models show improved performance over contemporary models for image generation on MNSIT.
△ Less
Submitted 16 November, 2018;
originally announced November 2018.
-
Characterizing Design Patterns of EHR-Driven Phenotype Extraction Algorithms
Authors:
Yizhen Zhong,
Luke Rasmussen,
Yu Deng,
Jennifer Pacheco,
Maureen Smith,
Justin Starren,
Wei-Qi Wei,
Peter Speltz,
Joshua Denny,
Nephi Walton,
George Hripcsak,
Christopher G Chute,
Yuan Luo
Abstract:
The automatic development of phenotype algorithms from Electronic Health Record data with machine learning (ML) techniques is of great interest given the current practice is very time-consuming and resource intensive. The extraction of design patterns from phenotype algorithms is essential to understand their rationale and standard, with great potential to automate the development process. In this…
▽ More
The automatic development of phenotype algorithms from Electronic Health Record data with machine learning (ML) techniques is of great interest given the current practice is very time-consuming and resource intensive. The extraction of design patterns from phenotype algorithms is essential to understand their rationale and standard, with great potential to automate the development process. In this pilot study, we perform network visualization on the design patterns and their associations with phenotypes and sites. We classify design patterns using the fragments from previously annotated phenotype algorithms as the ground truth. The classification performance is used as a proxy for coherence at the attribution level. The bag-of-words representation with knowledge-based features generated a good performance in the classification task (0.79 macro-f1 scores). Good classification accuracy with simple features demonstrated the attribution coherence and the feasibility of automatic identification of design patterns. Our results point to both the feasibility and challenges of automatic identification of phenoty** design patterns, which would power the automatic development of phenotype algorithms.
△ Less
Submitted 15 November, 2018;
originally announced November 2018.
-
Develo** a Portable Natural Language Processing Based Phenoty** System
Authors:
Himanshu Sharma,
Chengsheng Mao,
Yizhen Zhang,
Haleh Vatani,
Liang Yao,
Yizhen Zhong,
Luke Rasmussen,
Guoqian Jiang,
Jyotishman Pathak,
Yuan Luo
Abstract:
This paper presents a portable phenoty** system that is capable of integrating both rule-based and statistical machine learning based approaches. Our system utilizes UMLS to extract clinically relevant features from the unstructured text and then facilitates portability across different institutions and data systems by incorporating OHDSI's OMOP Common Data Model (CDM) to standardize necessary d…
▽ More
This paper presents a portable phenoty** system that is capable of integrating both rule-based and statistical machine learning based approaches. Our system utilizes UMLS to extract clinically relevant features from the unstructured text and then facilitates portability across different institutions and data systems by incorporating OHDSI's OMOP Common Data Model (CDM) to standardize necessary data elements. Our system can also store the key components of rule-based systems (e.g., regular expression matches) in the format of OMOP CDM, thus enabling the reuse, adaptation and extension of many existing rule-based clinical NLP systems. We experimented with our system on the corpus from i2b2's Obesity Challenge as a pilot study. Our system facilitates portable phenoty** of obesity and its 15 comorbidities based on the unstructured patient discharge summaries, while achieving a performance that often ranked among the top 10 of the challenge participants. This standardization enables a consistent application of numerous rule-based and machine learning based classification techniques downstream.
△ Less
Submitted 17 July, 2018;
originally announced July 2018.
-
Locally Convex Sparse Learning over Networks
Authors:
Ahmed Zaki,
Saikat Chatterjee,
Partha P. Mitra,
Lars K. Rasmussen
Abstract:
We consider a distributed learning setup where a sparse signal is estimated over a network. Our main interest is to save communication resource for information exchange over the network and reduce processing time. Each node of the network uses a convex optimization based algorithm that provides a locally optimum solution for that node. The nodes exchange their signal estimates over the network in…
▽ More
We consider a distributed learning setup where a sparse signal is estimated over a network. Our main interest is to save communication resource for information exchange over the network and reduce processing time. Each node of the network uses a convex optimization based algorithm that provides a locally optimum solution for that node. The nodes exchange their signal estimates over the network in order to refine their local estimates. At a node, the optimization algorithm is based on an $\ell_1$-norm minimization with appropriate modifications to promote sparsity as well as to include influence of estimates from neighboring nodes. Our expectation is that local estimates in each node improve fast and converge, resulting in a limited demand for communication of estimates between nodes and reducing the processing time. We provide restricted-isometry-property (RIP)-based theoretical analysis on estimation quality. In the scenario of clean observation, it is shown that the local estimates converge to the exact sparse signal under certain technical conditions. Simulation results show that the proposed algorithms show competitive performance compared to a globally optimum distributed LASSO algorithm in the sense of convergence speed and estimation error.
△ Less
Submitted 31 March, 2018;
originally announced April 2018.
-
Golden Angle Modulation: Approaching the AWGN Capacity
Authors:
Peter Larsson,
Lars K. Rasmussen,
Mikael Skoglund
Abstract:
In this work, targeting, e.g., future generation cellular, microwave-links, or optical fiber systems, we propose a new geometric sha** design for golden angle modulation (GAM) based on a (double) truncated Gaussian input distribution. The design improves the mutual information (MI), and the peak-to-average power ratio, over the full signal-to-noise ratio (SNR) range relative to two key GAM schem…
▽ More
In this work, targeting, e.g., future generation cellular, microwave-links, or optical fiber systems, we propose a new geometric sha** design for golden angle modulation (GAM) based on a (double) truncated Gaussian input distribution. The design improves the mutual information (MI), and the peak-to-average power ratio, over the full signal-to-noise ratio (SNR) range relative to two key GAM schemes introduced in [1],[2]. Inspired by the proposed geometric sha**, a simpler, SNR-dependent, design is also suggested. The performance is numerically evaluated with respect to MI and compared with classical modulation schemes. With the proposed design, the SNR can be decreased relative to classical quadrature amplitude modulation, even for relatively modest target spectral efficiencies. As the GAM design can approach the Gaussian channel capacity, the power/energy efficiency is expected to improve.
△ Less
Submitted 27 February, 2018;
originally announced February 2018.
-
Estimate Exchange over Network is Good for Distributed Hard Thresholding Pursuit
Authors:
Ahmed Zaki,
Partha P. Mitra,
Lars K. Rasmussen,
Saikat Chatterjee
Abstract:
We investigate an existing distributed algorithm for learning sparse signals or data over networks. The algorithm is iterative and exchanges intermediate estimates of a sparse signal over a network. This learning strategy using exchange of intermediate estimates over the network requires a limited communication overhead for information transmission. Our objective in this article is to show that th…
▽ More
We investigate an existing distributed algorithm for learning sparse signals or data over networks. The algorithm is iterative and exchanges intermediate estimates of a sparse signal over a network. This learning strategy using exchange of intermediate estimates over the network requires a limited communication overhead for information transmission. Our objective in this article is to show that the strategy is good for learning in spite of limited communication. In pursuit of this objective, we first provide a restricted isometry property (RIP)-based theoretical analysis on convergence of the iterative algorithm. Then, using simulations, we show that the algorithm provides competitive performance in learning sparse signals vis-a-vis an existing alternate distributed algorithm. The alternate distributed algorithm exchanges more information including observations and system parameters.
△ Less
Submitted 22 September, 2017;
originally announced September 2017.
-
The Golden Quantizer: The Complex Gaussian Random Variable Case
Authors:
Peter Larsson,
Lars K. Rasmussen,
Mikael Skoglund
Abstract:
The problem of quantizing a circularly-symmetric complex Gaussian random variable is considered. For this purpose, we design two non-uniform quantizers, a high-rate-, and a Lloyd-Max-, quantizer that are both based on the (golden angle) spiral-phyllotaxis packing principle. We find that the proposed schemes have lower mean-square error distortion compared to (non)-uniform polar/rectangular-quantiz…
▽ More
The problem of quantizing a circularly-symmetric complex Gaussian random variable is considered. For this purpose, we design two non-uniform quantizers, a high-rate-, and a Lloyd-Max-, quantizer that are both based on the (golden angle) spiral-phyllotaxis packing principle. We find that the proposed schemes have lower mean-square error distortion compared to (non)-uniform polar/rectangular-quantizers, and near-identical to the best performing trained vector quantizers. The proposed quantizer scheme offers a structured design, a simple natural index ordering, and allow for any number of centroids.
△ Less
Submitted 10 September, 2017;
originally announced September 2017.
-
The Matrix Exponential Distribution - A Tool for Wireless System Performance Analysis
Authors:
Peter Larsson,
Lars K. Rasmussen,
Mikael Skoglund
Abstract:
In [1], we introduced a new, matrix algebraic, performance analysis framework for wireless systems with fading channels based on the matrix exponential distribution. The main idea was to use the compact, powerful, and easy-to-use, matrix exponential (ME)-distribution for i) modeling the unprocessed channel signal to noise ratio (SNR), ii) exploiting the closure property of the ME-distribution for…
▽ More
In [1], we introduced a new, matrix algebraic, performance analysis framework for wireless systems with fading channels based on the matrix exponential distribution. The main idea was to use the compact, powerful, and easy-to-use, matrix exponential (ME)-distribution for i) modeling the unprocessed channel signal to noise ratio (SNR), ii) exploiting the closure property of the ME-distribution for SNR processing operations to give the effective channel random variable (r.v.) on ME-distribution form, and then to iii) express the performance measure in a closed-form based on ME-distribution matrix/vector parameters only. In this work, we aim to more clearly present, formalize, refine and develop this unified bottom-up analysis framework, show its versatility to handle important communication cases, performance evaluation levels, and performance metrics. The bivariate ME-distribution is introduced here as yet another useful ME-tool, e.g. to account for dependency among two r.v.s. We propose that the ME-distribution may, in addition to fading, also characterize the pdf of discrete-time signal r.v.s, thus extending the ME-distribution matrix form to new generalized 1D/2D-Gaussian-, and Rayleigh-, distribution-like matrix forms. Our findings here, strengthen the observation from [1], [2], and indicates that the ME-distribution can be a promising tool for wireless system modeling and performance analysis.
△ Less
Submitted 20 December, 2016;
originally announced December 2016.
-
Effective Capacity of Retransmission Schemes - A Recurrence Relation Approach
Authors:
Peter Larsson,
James Gross,
Hussein Al-Zubaidy,
Lars K. Rasmussen,
Mikael Skoglund
Abstract:
We consider the effective capacity performance measure of persistent- and truncated-retransmission schemes that can involve any combination of multiple transmissions per packet, multiple communication modes, or multiple packet communication. We present a structured unified analytical approach, based on a random walk model and recurrence relation formulation, and give exact effective capacity expre…
▽ More
We consider the effective capacity performance measure of persistent- and truncated-retransmission schemes that can involve any combination of multiple transmissions per packet, multiple communication modes, or multiple packet communication. We present a structured unified analytical approach, based on a random walk model and recurrence relation formulation, and give exact effective capacity expressions for persistent hybrid automatic repeat request (HARQ) and for truncated-retransmission schemes. For the latter, effective capacity expressions are given for systems with finite (infinite) time horizon on an algebraic (spectral radius-based) form of a special block companion matrix. In contrast to prior HARQ models, assuming infinite time horizon, the proposed method does not involve a non-trivial per case modeling step. We give effective capacity expressions for several important cases that have not been addressed before, e.g. persistent-HARQ, truncated-HARQ, network-coded ARQ (NC-ARQ), two-mode-ARQ, and multilayer-ARQ. We propose an alternative QoS parameter (instead of the commonly used moment generating function parameter) that represents explicitly the target delay and the delay violation probability. This also enables closed-form expressions for many of the studied systems. Moreover, we use the recently proposed matrix-exponential distributed (MED) modeling of wireless fading channels to provide the basis for numerous new effective capacity results for HARQ.
△ Less
Submitted 17 August, 2016; v1 submitted 28 January, 2016;
originally announced January 2016.
-
Cooperative Communication Using Network Coding
Authors:
Nan Li,
Lars K. Rasmussen,
Ming Xiao
Abstract:
We consider a cognitive radio network scenario where a primary transmitter and a secondary transmitter, respectively, communicate a message to their respective primary receiver and secondary receiver over a packet-based wireless link, using a joint automatic-repeat-request (ARQ) error control scheme. The secondary transmitter assists in the retransmission of the primary message, which improves the…
▽ More
We consider a cognitive radio network scenario where a primary transmitter and a secondary transmitter, respectively, communicate a message to their respective primary receiver and secondary receiver over a packet-based wireless link, using a joint automatic-repeat-request (ARQ) error control scheme. The secondary transmitter assists in the retransmission of the primary message, which improves the primary performance, and is granted limited access to the transmission resources. Conventional ARQ, as well as two network-coding schemes are investigated for application in the retransmission phase; namely the static network-coding (SNC) scheme and the adaptive network-coding (ANC) scheme. For each scheme we analyze the transmission process by investigating the distribution of the number of transmission attempts and approximate it by normal distributions. Considering both the cases of an adaptive frame size and a truncated frame size, we derive analytical results on packet throughput and infer that the ANC scheme outperforms the SNC scheme.
△ Less
Submitted 10 December, 2015; v1 submitted 27 November, 2015;
originally announced November 2015.
-
The EChO science case
Authors:
Giovanna Tinetti,
Pierre Drossart,
Paul Eccleston,
Paul Hartogh,
Kate Isaak,
Martin Linder,
Christophe Lovis,
Giusi Micela,
Marc Ollivier,
Ludovic Puig,
Ignasi Ribas,
Ignas Snellen,
Bruce Swinyard. France Allard,
Joanna Barstow,
James Cho,
Athena Coustenis,
Charles Cockell,
Alexandre Correia,
Leen Decin,
Remco de Kok,
Pieter Deroo,
Therese Encrenaz,
Francois Forget,
Alistair Glasse,
Caitlin Griffith
, et al. (326 additional authors not shown)
Abstract:
The discovery of almost 2000 exoplanets has revealed an unexpectedly diverse planet population. Observations to date have shown that our Solar System is certainly not representative of the general population of planets in our Milky Way. The key science questions that urgently need addressing are therefore: What are exoplanets made of? Why are planets as they are? What causes the exceptional divers…
▽ More
The discovery of almost 2000 exoplanets has revealed an unexpectedly diverse planet population. Observations to date have shown that our Solar System is certainly not representative of the general population of planets in our Milky Way. The key science questions that urgently need addressing are therefore: What are exoplanets made of? Why are planets as they are? What causes the exceptional diversity observed as compared to the Solar System?
EChO (Exoplanet Characterisation Observatory) has been designed as a dedicated survey mission for transit and eclipse spectroscopy capable of observing a large and diverse planet sample within its four-year mission lifetime. EChO can target the atmospheres of super-Earths, Neptune-like, and Jupiter-like planets, in the very hot to temperate zones (planet temperatures of 300K-3000K) of F to M-type host stars. Over the next ten years, several new ground- and space-based transit surveys will come on-line (e.g. NGTS, CHEOPS, TESS, PLATO), which will specifically focus on finding bright, nearby systems. The current rapid rate of discovery would allow the target list to be further optimised in the years prior to EChO's launch and enable the atmospheric characterisation of hundreds of planets. Placing the satellite at L2 provides a cold and stable thermal environment, as well as a large field of regard to allow efficient time-critical observation of targets randomly distributed over the sky. A 1m class telescope is sufficiently large to achieve the necessary spectro-photometric precision. The spectral coverage (0.5-11 micron, goal 16 micron) and SNR to be achieved by EChO, thanks to its high stability and dedicated design, would enable a very accurate measurement of the atmospheric composition and structure of hundreds of exoplanets.
△ Less
Submitted 19 February, 2015;
originally announced February 2015.
-
On the Transmit Beamforming for MIMO Wiretap Channels: Large-System Analysis
Authors:
Maksym A. Girnyk,
Frédéric Gabry,
Mikko Vehkaperä,
Lars K. Rasmussen,
Mikael Skoglund
Abstract:
With the growth of wireless networks, security has become a fundamental issue in wireless communications due to the broadcast nature of these networks. In this work, we consider MIMO wiretap channels in a fast fading environment, for which the overall performance is characterized by the ergodic MIMO secrecy rate. Unfortunately, the direct solution to finding ergodic secrecy rates is prohibitive du…
▽ More
With the growth of wireless networks, security has become a fundamental issue in wireless communications due to the broadcast nature of these networks. In this work, we consider MIMO wiretap channels in a fast fading environment, for which the overall performance is characterized by the ergodic MIMO secrecy rate. Unfortunately, the direct solution to finding ergodic secrecy rates is prohibitive due to the expectations in the rates expressions in this setting. To overcome this difficulty, we invoke the large-system assumption, which allows a deterministic approximation to the ergodic mutual information. Leveraging results from random matrix theory, we are able to characterize the achievable ergodic secrecy rates. Based on this characterization, we address the problem of covariance optimization at the transmitter. Our numerical results demonstrate a good match between the large-system approximation and the actual simulated secrecy rates, as well as some interesting features of the precoder optimization.
△ Less
Submitted 1 November, 2014;
originally announced November 2014.
-
On the Optimal Precoding for MIMO Gaussian Wire-Tap Channels
Authors:
Arash Khabbazibasmenj,
Maksym A. Girnyk,
Sergiy A. Vorobyov,
Mikko Vehkaperä,
Lars K. Rasmussen
Abstract:
We consider the problem of finding secrecy rate of a multiple-input multiple-output (MIMO) wire-tap channel. A transmitter, a legitimate receiver, and an eavesdropper are all equipped with multiple antennas. The channel states from the transmitter to the legitimate user and to the eavesdropper are assumed to be known at the transmitter. In this contribution, we address the problem of finding the o…
▽ More
We consider the problem of finding secrecy rate of a multiple-input multiple-output (MIMO) wire-tap channel. A transmitter, a legitimate receiver, and an eavesdropper are all equipped with multiple antennas. The channel states from the transmitter to the legitimate user and to the eavesdropper are assumed to be known at the transmitter. In this contribution, we address the problem of finding the optimal precoder/transmit covariance matrix maximizing the secrecy rate of the given wiretap channel. The problem formulation is shown to be equivalent to a difference of convex functions programming problem and an efficient algorithm for addressing this problem is developed.
△ Less
Submitted 1 November, 2014;
originally announced November 2014.
-
Asymptotic Performance Analysis of a K-Hop Amplify-and-Forward Relay MIMO Channel
Authors:
Maksym A. Girnyk,
Mikko Vehkaperä,
Lars K. Rasmussen
Abstract:
The present paper studies the asymptotic performance of multi-hop amplify-and-forward relay multiple-antenna communication channels. Each multi-antenna terminal in the network amplifies the received signal, sent by a source, and retransmits it upstream towards a destination. Achievable ergodic rates under both jointly optimal detection and decoding and practical separate decoding schemes for arbit…
▽ More
The present paper studies the asymptotic performance of multi-hop amplify-and-forward relay multiple-antenna communication channels. Each multi-antenna terminal in the network amplifies the received signal, sent by a source, and retransmits it upstream towards a destination. Achievable ergodic rates under both jointly optimal detection and decoding and practical separate decoding schemes for arbitrary signaling schemes, along with the average bit error rate for various receiver structures are derived in the regime where the number of antennas at each terminal grows large without a bound. To overcome the difficulty of averaging over channel realizations we apply large-system analysis based on the replica method from statistical physics. The validity of the large-system analysis is further verified through Monte Carlo simulations of realistic finite-sized systems.
△ Less
Submitted 9 March, 2016; v1 submitted 21 October, 2014;
originally announced October 2014.
-
Buffer-Based Distributed LT Codes
Authors:
Iqbal Hussain,
Ming Xiao,
Lars K. Rasmussen
Abstract:
We focus on the design of distributed Luby transform (DLT) codes for erasure networks with multiple sources and multiple relays, communicating to a single destination. The erasure-floor performance of DLT codes improves with the maximum degree of the relay-degree distribution. However, for conventional DLT codes, the maximum degree is upper-bounded by the number of sources. An additional constrain…
▽ More
We focus on the design of distributed Luby transform (DLT) codes for erasure networks with multiple sources and multiple relays, communicating to a single destination. The erasure-floor performance of DLT codes improves with the maximum degree of the relay-degree distribution. However, for conventional DLT codes, the maximum degree is upper-bounded by the number of sources. An additional constraint is that the sources are required to have the same information block length. We introduce a $D$-bit buffer for each source-relay link, which allows the relay to select multiple encoded bits from the same source for the relay-encoding process; thus, the number of sources no longer limits the maximum degree at the relay. Furthermore, the introduction of buffers facilitates the use of different information block sizes across sources. Based on density evolution we develop an asymptotic analytical framework for optimization of the relay-degree distribution. We further integrate techniques for unequal erasure protection into the optimization framework. The proposed codes are considered for both lossless and lossy source-relay links. Numerical examples show that there is no loss in erasure performance for transmission over lossy source-relay links as compared to lossless links. Additional delays, however, may occur. The design framework and our contributions are demonstrated by a number of illustrative examples, showing the improvements obtained by the proposed buffer-based DLT codes.
△ Less
Submitted 1 October, 2014;
originally announced October 2014.
-
Rateless Codes for the Multi-Way Relay Channel
Authors:
Iqbal Hussain,
Ming Xiao,
Lars K. Rasmussen
Abstract:
We consider distributed Luby transform (DLT) codes for efficient packet transmission in a multi-way relay network, where the links are modeled as erasure channels. Density evolution is applied for asymptotic performance analysis, and subsequently used in a linear-programming design framework for optimizing the degree distribution at the relay in terms of overhead. Moreover a buffer is introduced a…
▽ More
We consider distributed Luby transform (DLT) codes for efficient packet transmission in a multi-way relay network, where the links are modeled as erasure channels. Density evolution is applied for asymptotic performance analysis, and subsequently used in a linear-programming design framework for optimizing the degree distribution at the relay in terms of overhead. Moreover a buffer is introduced at the relay to enable efficient downlink transmission even if packets are lost during uplink transmission. Performance losses in terms of delay and/or erasure rates caused by link erasures during uplink transmission are thus alleviated. The proposed DLT codes provide significant improvements in overhead and decoded erasure rates. Numerical results for finite-length codes follow closely the asymptotic analysis. Our results demonstrate that the proposed buffer-based DLT codes outperform its counterparts for lossy uplink transmission.
△ Less
Submitted 1 October, 2014;
originally announced October 2014.
-
Causal/Predictive Imperfect Channel State Information in Block-Fading Channels
Authors:
Khoa D. Nguyen,
Nick Letzepis,
Albert Guillen i Fabregas,
Lars K. Rasmussen
Abstract:
We consider a multi-input multi-output (MIMO) block-fading channel with a general model for channel state information at the transmitter (CSIT). The model covers systems with causal CSIT, where only CSIT of past fading blocks is available, and predictive CSIT, where CSIT of some future fading blocks is available. The optimal diversity-multiplexing tradeoff (DMT) and rate-diversity tradeoff (RDT) o…
▽ More
We consider a multi-input multi-output (MIMO) block-fading channel with a general model for channel state information at the transmitter (CSIT). The model covers systems with causal CSIT, where only CSIT of past fading blocks is available, and predictive CSIT, where CSIT of some future fading blocks is available. The optimal diversity-multiplexing tradeoff (DMT) and rate-diversity tradeoff (RDT) of the channel are studied under long-term power constraints. The impact of imperfect (mismatched) CSIT on the optimal DMT and RDT is also investigated. Our results show the outage diversity gain obtained by providing imperfect causal/predictive CSIT, leading to new insights into system design and analysis.
△ Less
Submitted 25 September, 2014;
originally announced September 2014.
-
SEK: Sparsity exploiting $k$-mer-based estimation of bacterial community composition
Authors:
Saikat Chatterjee,
David Koslicki,
Siyuan Dong,
Nicolas Innocenti,
Lu Cheng,
Yueheng Lan,
Mikko Vehkaperä,
Mikael Skoglund,
Lars K. Rasmussen,
Erik Aurell,
Jukka Corander
Abstract:
Motivation: Estimation of bacterial community composition from a high-throughput sequenced sample is an important task in metagenomics applications. Since the sample sequence data typically harbors reads of variable lengths and different levels of biological and technical noise, accurate statistical analysis of such data is challenging. Currently popular estimation methods are typically very time…
▽ More
Motivation: Estimation of bacterial community composition from a high-throughput sequenced sample is an important task in metagenomics applications. Since the sample sequence data typically harbors reads of variable lengths and different levels of biological and technical noise, accurate statistical analysis of such data is challenging. Currently popular estimation methods are typically very time consuming in a desktop computing environment.
Results: Using sparsity enforcing methods from the general sparse signal processing field (such as compressed sensing), we derive a solution to the community composition estimation problem by a simultaneous assignment of all sample reads to a pre-processed reference database. A general statistical model based on kernel density estimation techniques is introduced for the assignment task and the model solution is obtained using convex optimization tools. Further, we design a greedy algorithm solution for a fast solution. Our approach offers a reasonably fast community composition estimation method which is shown to be more robust to input data variation than a recently introduced related method.
Availability: A platform-independent Matlab implementation of the method is freely available at http://www.ee.kth.se/ctsoftware; source code that does not require access to Matlab is currently being tested and will be made available later through the above website.
△ Less
Submitted 1 July, 2014;
originally announced July 2014.
-
Load Forecasting of Supermarket Refrigeration
Authors:
Lisa Buth Rasmussen,
Peder Bacher,
Henrik Madsen,
Henrik Aalborg Nielsen,
Christian Heerup,
Torben Green
Abstract:
This paper presents a study of models for forecasting the electrical load for supermarket refrigeration. The data used for building the models consists of load measurements, local climate measurements and weather forecasts. The load measurements are from a supermarket located in a village in Denmark. Every hour the hourly electrical load for refrigeration is forecasted for the following 42 hours.…
▽ More
This paper presents a study of models for forecasting the electrical load for supermarket refrigeration. The data used for building the models consists of load measurements, local climate measurements and weather forecasts. The load measurements are from a supermarket located in a village in Denmark. Every hour the hourly electrical load for refrigeration is forecasted for the following 42 hours. The forecast models are adaptive linear time series models. The model has two regimes; one for opening hours and one for closing hours, this is modelled by a regime switching model and two different methods for predicting the regimes are tested. The dynamic relation between the weather and the load is modelled by simple transfer functions and the non-linearities are described using spline functions. The results are thoroughly evaluated and it is shown that the spline functions are suitable for handling the non-linear relations and that after applying an auto-regressive noise model the one-step ahead residuals do not contain further significant information.
△ Less
Submitted 23 June, 2014;
originally announced June 2014.
-
Asymptotic Analysis of SU-MIMO Channels With Transmitter Noise and Mismatched Joint Decoding
Authors:
Mikko Vehkaperä,
Taneli Riihonen,
Maksym A. Girnyk,
Emil Björnson,
Mérouane Debbah,
Lars K. Rasmussen,
Risto Wichman
Abstract:
Hardware impairments in radio-frequency components of a wireless system cause unavoidable distortions to transmission that are not captured by the conventional linear channel model. In this paper, a 'binoisy' single-user multiple-input multiple-output (SU-MIMO) relation is considered where the additional distortions are modeled via an additive noise term at the transmit side. Through this extended…
▽ More
Hardware impairments in radio-frequency components of a wireless system cause unavoidable distortions to transmission that are not captured by the conventional linear channel model. In this paper, a 'binoisy' single-user multiple-input multiple-output (SU-MIMO) relation is considered where the additional distortions are modeled via an additive noise term at the transmit side. Through this extended SU-MIMO channel model, the effects of transceiver hardware impairments on the achievable rate of multi-antenna point-to-point systems are studied. Channel input distributions encompassing practical discrete modulation schemes, such as, QAM and PSK, as well as Gaussian signaling are covered. In addition, the impact of mismatched detection and decoding when the receiver has insufficient information about the non-idealities is investigated. The numerical results show that for realistic system parameters, the effects of transmit-side noise and mismatched decoding become significant only at high modulation orders.
△ Less
Submitted 22 October, 2014; v1 submitted 19 June, 2014;
originally announced June 2014.
-
Large-System Analysis of Correlated MIMO Multiple Access Channels with Arbitrary Signaling in the Presence of Interference
Authors:
Maksym A. Girnyk,
Mikko Vehkaperä,
Lars K. Rasmussen
Abstract:
Presence of multiple antennas on both sides of a communication channel promises significant improvements in system throughput and power efficiency. In effect, a new class of large multiple-input multiple-output (MIMO) communication systems has recently emerged and attracted both scientific and industrial attention. To analyze these systems in realistic scenarios, one has to include such aspects as…
▽ More
Presence of multiple antennas on both sides of a communication channel promises significant improvements in system throughput and power efficiency. In effect, a new class of large multiple-input multiple-output (MIMO) communication systems has recently emerged and attracted both scientific and industrial attention. To analyze these systems in realistic scenarios, one has to include such aspects as co-channel interference, multiple access and spatial correlation. In this paper, we study the properties of correlated MIMO multiple-access channels in the presence of external interference. Using the replica method from statistical physics, we derive the ergodic sum-rate of the communication for arbitrary signal constellations when the numbers of antennas at both ends of the channel grow large. Based on these asymptotic expressions, we also address the problem of sum-rate maximization using statistical channel information and linear precoding. The numerical results demonstrate that when the interfering terminals use discrete constellations, the resulting interference becomes easier to handle compared to Gaussian signals. Thus, it may be possible to accommodate more interfering transmitter-receiver pairs within the same area as compared to the case of Gaussian signals. In addition, we demonstrate numerically for the Gaussian and QPSK signaling schemes that it is possible to design precoder matrices that significantly improve the achievable rates at low-to-mid range of signal-to-noise ratios when compared to isotropic precoding.
△ Less
Submitted 26 January, 2014; v1 submitted 21 May, 2013;
originally announced May 2013.
-
Analysis of Sparse Representations Using Bi-Orthogonal Dictionaries
Authors:
Mikko Vehkaperä,
Yoshiyuki Kabashima,
Saikat Chatterjee,
Erik Aurell,
Mikael Skoglund,
Lars Rasmussen
Abstract:
The sparse representation problem of recovering an N dimensional sparse vector x from M < N linear observations y = Dx given dictionary D is considered. The standard approach is to let the elements of the dictionary be independent and identically distributed (IID) zero-mean Gaussian and minimize the l1-norm of x under the constraint y = Dx. In this paper, the performance of l1-reconstruction is an…
▽ More
The sparse representation problem of recovering an N dimensional sparse vector x from M < N linear observations y = Dx given dictionary D is considered. The standard approach is to let the elements of the dictionary be independent and identically distributed (IID) zero-mean Gaussian and minimize the l1-norm of x under the constraint y = Dx. In this paper, the performance of l1-reconstruction is analyzed, when the dictionary is bi-orthogonal D = [O1 O2], where O1,O2 are independent and drawn uniformly according to the Haar measure on the group of orthogonal M x M matrices. By an application of the replica method, we obtain the critical conditions under which perfect l1-recovery is possible with bi-orthogonal dictionaries.
△ Less
Submitted 10 July, 2012; v1 submitted 18 April, 2012;
originally announced April 2012.
-
MIMO ARQ with Multi-bit Feedback: Outage Analysis
Authors:
Khoa D. Nguyen,
Lars K. Rasmussen,
Albert Guillen i Fabregas,
Nick Letzepis
Abstract:
We study the asymptotic outage performance of incremental redundancy automatic repeat request (INR-ARQ) transmission over the multiple-input multiple-output (MIMO) block-fading channels with discrete input constellations. We first show that transmission with random codes using a discrete signal constellation across all transmit antennas achieves the optimal outage diversity given by the Singleton…
▽ More
We study the asymptotic outage performance of incremental redundancy automatic repeat request (INR-ARQ) transmission over the multiple-input multiple-output (MIMO) block-fading channels with discrete input constellations. We first show that transmission with random codes using a discrete signal constellation across all transmit antennas achieves the optimal outage diversity given by the Singleton bound. We then analyze the optimal SNR-exponent and outage diversity of INR-ARQ transmission over the MIMO block-fading channel. We show that a significant gain in outage diversity is obtained by providing more than one bit feedback at each ARQ round. Thus, the outage performance of INR-ARQ transmission can be remarkably improved with minimal additional overhead. A suboptimal feedback and power adaptation rule, which achieves the optimal outage diversity, is proposed for MIMO INR-ARQ, demonstrating the benefits provided by multi-bit feedback.
△ Less
Submitted 10 June, 2010; v1 submitted 6 June, 2010;
originally announced June 2010.
-
Bounds on Thresholds Related to Maximum Satisfiability of Regular Random Formulas
Authors:
Vishwambhar Rathi,
Erik Aurell,
Lars Rasmussen,
Mikael Skoglund
Abstract:
We consider the regular balanced model of formula generation in conjunctive normal form (CNF) introduced by Boufkhad, Dubois, Interian, and Selman. We say that a formula is $p$-satisfying if there is a truth assignment satisfying $1-2^{-k}+p 2^{-k}$ fraction of clauses. Using the first moment method we determine upper bound on the threshold clause density such that there are no $p$-satisfying assi…
▽ More
We consider the regular balanced model of formula generation in conjunctive normal form (CNF) introduced by Boufkhad, Dubois, Interian, and Selman. We say that a formula is $p$-satisfying if there is a truth assignment satisfying $1-2^{-k}+p 2^{-k}$ fraction of clauses. Using the first moment method we determine upper bound on the threshold clause density such that there are no $p$-satisfying assignments with high probability above this upper bound. There are two aspects in deriving the lower bound using the second moment method. The first aspect is, given any $p \in (0,1)$ and $k$, evaluate the lower bound on the threshold. This evaluation is numerical in nature. The second aspect is to derive the lower bound as a function of $p$ for large enough $k$. We address the first aspect and evaluate the lower bound on the $p$-satisfying threshold using the second moment method. We observe that as $k$ increases the lower bound seems to converge to the asymptotically derived lower bound for uniform model of formula generation by Achlioptas, Naor, and Peres.
△ Less
Submitted 14 April, 2010;
originally announced April 2010.
-
Bounds on Threshold of Regular Random $k$-SAT
Authors:
Vishwambhar Rathi,
Erik Aurell,
Lars Rasmussen,
Mikael Skoglund
Abstract:
We consider the regular model of formula generation in conjunctive normal form (CNF) introduced by Boufkhad et. al. We derive an upper bound on the satisfiability threshold and NAE-satisfiability threshold for regular random $k$-SAT for any $k \geq 3$. We show that these bounds matches with the corresponding bound for the uniform model of formula generation.
We derive lower bound on the thresh…
▽ More
We consider the regular model of formula generation in conjunctive normal form (CNF) introduced by Boufkhad et. al. We derive an upper bound on the satisfiability threshold and NAE-satisfiability threshold for regular random $k$-SAT for any $k \geq 3$. We show that these bounds matches with the corresponding bound for the uniform model of formula generation.
We derive lower bound on the threshold by applying the second moment method to the number of satisfying assignments. For large $k$, we note that the obtained lower bounds on the threshold of a regular random formula converges to the lower bound obtained for the uniform model. Thus, we answer the question posed in \cite{AcM06} regarding the performance of the second moment method for regular random formulas.
△ Less
Submitted 23 April, 2010; v1 submitted 5 February, 2010;
originally announced February 2010.
-
Adaptive Decoding of LDPC Codes with Binary Messages
Authors:
Ingmar Land,
Gottfried Lechner,
Lars K. Rasmussen
Abstract:
A novel adaptive binary decoding algorithm for LDPC codes is proposed, which reduces the decoding complexity while having a comparable or even better performance than corresponding non-adaptive alternatives. In each iteration the variable node decoders use the binary check node decoders multiple times; each single use is referred to as a sub-iteration. To process the sequences of binary messages…
▽ More
A novel adaptive binary decoding algorithm for LDPC codes is proposed, which reduces the decoding complexity while having a comparable or even better performance than corresponding non-adaptive alternatives. In each iteration the variable node decoders use the binary check node decoders multiple times; each single use is referred to as a sub-iteration. To process the sequences of binary messages in each iteration, the variable node decoders employ pre-computed look-up tables. These look-up tables as well as the number of sub-iterations per iteration are dynamically adapted during the decoding process based on the decoder state, represented by the mutual information between the current messages and the syndrome bits. The look-up tables and the number of sub-iterations per iteration are determined and optimized using density evolution. The performance and the complexity of the proposed adaptive decoding algorithm is exemplified by simulations.
△ Less
Submitted 23 April, 2009; v1 submitted 18 February, 2009;
originally announced February 2009.
-
Power Allocation for Fading Channels with Peak-to-Average Power Constraints
Authors:
Khoa D. Nguyen,
Albert Guillen i Fabregas,
Lars K. Rasmussen
Abstract:
Power allocation with peak-to-average power ratio constraints is investigated for transmission over Nakagami-m fading channels with arbitrary input distributions. In the case of delay-limited block-fading channels, we find the solution to the minimum outage power allocation scheme with peak-to-average power constraints and arbitrary input distributions, and show that the signal-to-noise ratio ex…
▽ More
Power allocation with peak-to-average power ratio constraints is investigated for transmission over Nakagami-m fading channels with arbitrary input distributions. In the case of delay-limited block-fading channels, we find the solution to the minimum outage power allocation scheme with peak-to-average power constraints and arbitrary input distributions, and show that the signal-to-noise ratio exponent for any finite peak-to-average power ratio is the same as that of the peak-power limited problem, resulting in an error floor. In the case of the ergodic fully-interleaved channel, we find the power allocation rule that yields the maximal information rate for an arbitrary input distribution and show that capacities with peak-to-average power ratio constraints, even for small ratios, are very close to capacities without peak-power restrictions.
△ Less
Submitted 25 February, 2008;
originally announced February 2008.
-
A Tight Lower Bound to the Outage Probability of Discrete-Input Block-Fading Channels
Authors:
Khoa D. Nguyen,
Albert Guillen i Fabregas,
Lars K. Rasmussen
Abstract:
In this correspondence, we propose a tight lower bound to the outage probability of discrete-input Nakagami-m block-fading channels. The approach permits an efficient method for numerical evaluation of the bound, providing an additional tool for system design. The optimal rate-diversity trade-off for the Nakagami-m block-fading channel is also derived and a tight upper bound is obtained for the…
▽ More
In this correspondence, we propose a tight lower bound to the outage probability of discrete-input Nakagami-m block-fading channels. The approach permits an efficient method for numerical evaluation of the bound, providing an additional tool for system design. The optimal rate-diversity trade-off for the Nakagami-m block-fading channel is also derived and a tight upper bound is obtained for the optimal coding gain constant.
△ Less
Submitted 11 July, 2007;
originally announced July 2007.
-
Power Allocation for Discrete-Input Delay-Limited Fading Channels
Authors:
Khoa D. Nguyen,
Albert Guillen i Fabregas,
Lars K. Rasmussen
Abstract:
We consider power allocation algorithms for fixed-rate transmission over Nakagami-m non-ergodic block-fading channels with perfect transmitter and receiver channel state information and discrete input signal constellations, under both short- and long-term power constraints. Optimal power allocation schemes are shown to be direct applications of previous results in the literature. We show that th…
▽ More
We consider power allocation algorithms for fixed-rate transmission over Nakagami-m non-ergodic block-fading channels with perfect transmitter and receiver channel state information and discrete input signal constellations, under both short- and long-term power constraints. Optimal power allocation schemes are shown to be direct applications of previous results in the literature. We show that the SNR exponent of the optimal short-term scheme is given by m times the Singleton bound. We also illustrate the significant gains available by employing long-term power constraints. In particular, we analyze the optimal long-term solution, showing that zero outage can be achieved provided that the corresponding short-term SNR exponent with the same system parameters is strictly greater than one. Conversely, if the short-term SNR exponent is smaller than one, we show that zero outage cannot be achieved. In this case, we derive the corresponding long-term SNR exponent as a function of the Singleton bound. Due to the nature of the expressions involved, the complexity of optimal schemes may be prohibitive for system implementation. We therefore propose simple sub-optimal power allocation schemes whose outage probability performance is very close to the minimum outage probability obtained by optimal schemes. We also show the applicability of these techniques to practical systems employing orthogonal frequency division multiplexing.
△ Less
Submitted 13 June, 2007;
originally announced June 2007.
-
Power Allocation for Discrete-Input Non-Ergodic Block-Fading Channels
Authors:
Khoa D. Nguyen,
Albert Guillen i Fabregas,
Lars K. Rasmussen
Abstract:
We consider power allocation algorithms for fixed-rate transmission over Nakagami-m non-ergodic block-fading channels with perfect transmitter and receiver channel state information and discrete input signal constellations under both short- and long-term power constraints. Optimal power allocation schemes are shown to be direct applications of previous results in the literature. We show that the…
▽ More
We consider power allocation algorithms for fixed-rate transmission over Nakagami-m non-ergodic block-fading channels with perfect transmitter and receiver channel state information and discrete input signal constellations under both short- and long-term power constraints. Optimal power allocation schemes are shown to be direct applications of previous results in the literature. We show that the SNR exponent of the optimal short-term scheme is given by the Singleton bound. We also illustrate the significant gains available by employing long-term power constraints. Due to the nature of the expressions involved, the complexity of optimal schemes may be prohibitive for system implementation. We propose simple sub-optimal power allocation schemes whose outage probability performance is very close to the minimum outage probability obtained by optimal schemes.
△ Less
Submitted 4 July, 2007; v1 submitted 2 May, 2007;
originally announced May 2007.
-
Optimal Throughput-Diversity-Delay Tradeoff in MIMO ARQ Block-Fading Channels
Authors:
Allen Chuang,
Albert Guillen i Fabregas,
Lars K. Rasmussen,
Iain B. Collings
Abstract:
In this paper, we consider an automatic-repeat-request (ARQ) retransmission protocol signaling over a block-fading multiple-input, multiple-output (MIMO) channel. Unlike previous work, we allow for multiple fading blocks within each transmission (ARQ round), and we constrain the transmitter to fixed rate codes constructed over complex signal constellations. In particular, we examine the general…
▽ More
In this paper, we consider an automatic-repeat-request (ARQ) retransmission protocol signaling over a block-fading multiple-input, multiple-output (MIMO) channel. Unlike previous work, we allow for multiple fading blocks within each transmission (ARQ round), and we constrain the transmitter to fixed rate codes constructed over complex signal constellations. In particular, we examine the general case of average input-power-constrained constellations as well as the practically important case of finite discrete constellations. This scenario is a suitable model for practical wireless communications systems employing orthogonal frequency division multiplexing techniques over a MIMO ARQ channel. Two cases of fading dynamics are considered, namely short-term static fading where channel fading gains change randomly for each ARQ round, and long-term static fading where channel fading gains remain constant over all ARQ rounds pertaining to a given message. As our main result, we prove that for the block-fading MIMO ARQ channel with discrete input signal constellation satisfying a short-term power constraint, the optimal signal-to-noise ratio (SNR) exponent is given by a modified Singleton bound, relating all the system parameters. To demonstrate the practical significance of the theoretical analysis, we present numerical results showing that practical Singleton-bound-achieving maximum distance separable codes achieve the optimal SNR exponent.
△ Less
Submitted 19 January, 2007;
originally announced January 2007.
-
Design of Rate-Compatible Serially Concatenated Convolutional Codes
Authors:
Alexandre Graell i Amat,
Fredrik Brannstrom,
Lars K. Rasmussen
Abstract:
Recently a powerful class of rate-compatible serially concatenated convolutional codes (SCCCs) have been proposed based on minimizing analytical upper bounds on the error probability in the error floor region. Here this class of codes is further investigated by combining analytical upper bounds with extrinsic information transfer charts analysis. Following this approach, we construct a family of…
▽ More
Recently a powerful class of rate-compatible serially concatenated convolutional codes (SCCCs) have been proposed based on minimizing analytical upper bounds on the error probability in the error floor region. Here this class of codes is further investigated by combining analytical upper bounds with extrinsic information transfer charts analysis. Following this approach, we construct a family of rate-compatible SCCCs with good performance in both the error floor and the waterfall regions over a broad range of code rates.
△ Less
Submitted 14 January, 2006;
originally announced January 2006.
-
Non-Data-Aided Parameter Estimation in an Additive White Gaussian Noise Channel
Authors:
Fredrik Brannstrom,
Lars K. Rasmussen
Abstract:
Non-data-aided (NDA) parameter estimation is considered for binary-phase-shift-keying transmission in an additive white Gaussian noise channel. Cramer-Rao lower bounds (CRLBs) for signal amplitude, noise variance, channel reliability constant and bit-error rate are derived and it is shown how these parameters relate to the signal-to-noise ratio (SNR). An alternative derivation of the iterative m…
▽ More
Non-data-aided (NDA) parameter estimation is considered for binary-phase-shift-keying transmission in an additive white Gaussian noise channel. Cramer-Rao lower bounds (CRLBs) for signal amplitude, noise variance, channel reliability constant and bit-error rate are derived and it is shown how these parameters relate to the signal-to-noise ratio (SNR). An alternative derivation of the iterative maximum likelihood (ML) SNR estimator is presented together with a novel, low complexity NDA SNR estimator. The performance of the proposed estimator is compared to previously suggested estimators and the CRLB. The results show that the proposed estimator performs close to the iterative ML estimator at significantly lower computational complexity.
△ Less
Submitted 3 September, 2005;
originally announced September 2005.
-
Nonlinear MMSE Multiuser Detection Based on Multivariate Gaussian Approximation
Authors:
Peng Hui Tan,
Lars K. Rasmussen
Abstract:
In this paper, a class of nonlinear MMSE multiuser detectors are derived based on a multivariate Gaussian approximation of the multiple access interference. This approach leads to expressions identical to those describing the probabilistic data association (PDA) detector, thus providing an alternative analytical justification for this structure. A simplification to the PDA detector based on appr…
▽ More
In this paper, a class of nonlinear MMSE multiuser detectors are derived based on a multivariate Gaussian approximation of the multiple access interference. This approach leads to expressions identical to those describing the probabilistic data association (PDA) detector, thus providing an alternative analytical justification for this structure. A simplification to the PDA detector based on approximating the covariance matrix of the multivariate Gaussian distribution is suggested, resulting in a soft interference cancellation scheme. Corresponding multiuser soft-input, soft-output detectors delivering extrinsic log-likelihood ratios are derived for application in iterative multiuser decoders. Finally, a large system performance analysis is conducted for the simplified PDA, showing that the bit error rate performance of this detector can be accurately predicted and related to the replica method analysis for the optimal detector. Methods from statistical neuro-dynamics are shown to provide a closely related alternative large system prediction. Numerical results demonstrate that for large systems, the bit error rate is accurately predicted by the analysis and found to be close to optimal performance.
△ Less
Submitted 14 February, 2005;
originally announced February 2005.
-
Deep far infrared ISOPHOT survey in "Selected Area 57", I. Observations and source counts
Authors:
M. J. D. Linden-Voernle,
H. U. Noergaard-Nielsen,
H. E. Joergensen,
L. Hansen,
M. Hass,
U. Klaas,
P. Abraham,
D. Lemke,
I. Lundgaard Rasmussen,
H. W. Schnopper
Abstract:
We present here the results of a deep survey in a 0.4 sq.deg. blank field in Selected Area 57 conducted with the ISOPHOT instrument aboard ESAs Infrared Space Observatory (ISO) at both 60 um and 90 um. The resulting sky maps have a spatial resolution of 15 x 23 sq.arcsec. per pixel which is much higher than the 90 x 90 sq.arcsec. pixels of the IRAS All Sky Survey. We describe the main instrument…
▽ More
We present here the results of a deep survey in a 0.4 sq.deg. blank field in Selected Area 57 conducted with the ISOPHOT instrument aboard ESAs Infrared Space Observatory (ISO) at both 60 um and 90 um. The resulting sky maps have a spatial resolution of 15 x 23 sq.arcsec. per pixel which is much higher than the 90 x 90 sq.arcsec. pixels of the IRAS All Sky Survey. We describe the main instrumental effects encountered in our data, outline our data reduction and analysis scheme and present astrometry and photometry of the detected point sources. With a formal signal to noise ratio of 6.75 we have source detection limits of 90 mJy at 60 um and 50 mJy at 90 um. To these limits we find cumulated number densities of 5+-3.5 per sq.deg. at 60 um and 14.8+-5.0 per sq.deg.at 90 um. These number densities of sources are found to be lower than previously reported results from ISO but the data do not allow us to discriminate between no-evolution scenarios and various evolutionary models.
△ Less
Submitted 17 May, 2000;
originally announced May 2000.