Search | arXiv e-print repository

Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction

Authors: Shariq Nadeem Malik, Min Hao Chee, Dayan Mario Anthony Perera, Chern Hong Lim

Abstract: This paper aims to review and determine the feasibility of using variations of NeRF models in order to reconstruct crime scenes given input videos of the scene. We focus on three main innovations of NeRF when it comes to reconstructing crime scenes: Multi-object Synthesis, Deformable Synthesis, and Lighting. From there, we analyse its innovation progress against the requirements to be met in order… ▽ More This paper aims to review and determine the feasibility of using variations of NeRF models in order to reconstruct crime scenes given input videos of the scene. We focus on three main innovations of NeRF when it comes to reconstructing crime scenes: Multi-object Synthesis, Deformable Synthesis, and Lighting. From there, we analyse its innovation progress against the requirements to be met in order to be able to reconstruct crime scenes with given videos of such scenes. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 4 pages, 1 table

arXiv:2404.00120 [pdf]

Dynamics of nano-scale assemblies of amphiphilic PEG-PDMS-PEG copolymers

Authors: Sudipta Gupta, Rasangi M. Perera, Christopher J. Van Leeuwen, Tianyu Li, Laura Stingaciu, Markus Bleuel, Kunlun Hong, Gerald J. Schneider

Abstract: Micelles and vesicles are promising candidates in targeted drug/gene delivery, bioreactors, and templates for nanoparticle synthesis. We investigated the morphology and dynamics of PEG-PDMS-PEG triblock copolymer nano-scale assemblies regarding the membrane dynamics because the molecular dynamics of the membrane govern mechanical properties like the stability of a membrane. We studied the structur… ▽ More Micelles and vesicles are promising candidates in targeted drug/gene delivery, bioreactors, and templates for nanoparticle synthesis. We investigated the morphology and dynamics of PEG-PDMS-PEG triblock copolymer nano-scale assemblies regarding the membrane dynamics because the molecular dynamics of the membrane govern mechanical properties like the stability of a membrane. We studied the structure by cryogenic transmission electron microscopy, small-angle neutron scattering, and the dynamics by dynamic light scattering and neutron spin echo spectroscopy. We changed the length of the hydrophilic block to obtain micellar and vesicular systems. The vesicle has a membrane rigidity, $κ_η= 16 \pm 2 k_B T$, the same order of magnitude as the corresponding liposome value but one order of magnitude higher than polymeric interfaces in microemulsions. Hence, the height-height fluctuations of polymers in a polymersome seem much less than those measured for surfactants at an oil-water interface. Therefore, the polymersome is substantially more stable. The value is very close to liposomes, indicating a similar stability. △ Less

Submitted 29 March, 2024; originally announced April 2024.

arXiv:2403.19593 [pdf, other]

Frame by Familiar Frame: Understanding Replication in Video Diffusion Models

Authors: Aimon Rahman, Malsha V. Perera, Vishal M. Patel

Abstract: Building on the momentum of image generation diffusion models, there is an increasing interest in video-based diffusion models. However, video generation poses greater challenges due to its higher-dimensional nature, the scarcity of training data, and the complex spatiotemporal relationships involved. Image generation models, due to their extensive data requirements, have already strained computat… ▽ More Building on the momentum of image generation diffusion models, there is an increasing interest in video-based diffusion models. However, video generation poses greater challenges due to its higher-dimensional nature, the scarcity of training data, and the complex spatiotemporal relationships involved. Image generation models, due to their extensive data requirements, have already strained computational resources to their limits. There have been instances of these models reproducing elements from the training samples, leading to concerns and even legal disputes over sample replication. Video diffusion models, which operate with even more constrained datasets and are tasked with generating both spatial and temporal content, may be more prone to replicating samples from their training sets. Compounding the issue, these models are often evaluated using metrics that inadvertently reward replication. In our paper, we present a systematic investigation into the phenomenon of sample replication in video diffusion models. We scrutinize various recent diffusion models for video synthesis, assessing their tendency to replicate spatial and temporal content in both unconditional and conditional generation scenarios. Our study identifies strategies that are less likely to lead to replication. Furthermore, we propose new evaluation strategies that take replication into account, offering a more accurate measure of a model's ability to generate the original content. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.01653 [pdf, other]

Day-ahead regional solar power forecasting with hierarchical temporal convolutional neural networks using historical power generation and weather data

Authors: Maneesha Perera, Julian De Hoog, Kasun Bandara, Damith Senanayake, Saman Halgamuge

Abstract: Regional solar power forecasting, which involves predicting the total power generation from all rooftop photovoltaic systems in a region holds significant importance for various stakeholders in the energy sector. However, the vast amount of solar power generation and weather time series from geographically dispersed locations that need to be considered in the forecasting process makes accurate reg… ▽ More Regional solar power forecasting, which involves predicting the total power generation from all rooftop photovoltaic systems in a region holds significant importance for various stakeholders in the energy sector. However, the vast amount of solar power generation and weather time series from geographically dispersed locations that need to be considered in the forecasting process makes accurate regional forecasting challenging. Therefore, previous work has limited the focus to either forecasting a single time series (i.e., aggregated time series) which is the addition of all solar generation time series in a region, disregarding the location-specific weather effects or forecasting solar generation time series of each PV site (i.e., individual time series) independently using location-specific weather data, resulting in a large number of forecasting models. In this work, we propose two deep-learning-based regional forecasting methods that can effectively leverage both types of time series (aggregated and individual) with weather data in a region. We propose two hierarchical temporal convolutional neural network architectures (HTCNN) and two strategies to adapt HTCNNs for regional solar power forecasting. At first, we explore generating a regional forecast using a single HTCNN. Next, we divide the region into multiple sub-regions based on weather information and train separate HTCNNs for each sub-region; the forecasts of each sub-region are then added to generate a regional forecast. The proposed work is evaluated using a large dataset collected over a year from 101 locations across Western Australia to provide a day ahead forecast. We compare our approaches with well-known alternative methods and show that the sub-region HTCNN requires fewer individual networks and achieves a forecast skill score of 40.2% reducing a statistically significant error by 6.5% compared to the best counterpart. △ Less

Submitted 3 March, 2024; originally announced March 2024.

Comments: 37 pages, 16 figures, Accepted to the journal of Applied Energy

arXiv:2306.13877 [pdf]

A Novel Handover Mechanism for Visible Light Communication Network

Authors: M. A. N. Perera, N. G. K. R. Wijewardhana, A. A. D. T Nissanka, S. A. H. A. Suraweera, G. M. R. I. Godaliyadda

Abstract: Visible light communication (VLC) is an emerging technology and considered as an alternative to overcome some of the disadvantages of radio frequency communication technology in an indoor environment. However, the line of sight nature of the technology limits the user mobility and create new challenges to provides seamless network coverage under user mobility scenarios. In VLC multi access points… ▽ More Visible light communication (VLC) is an emerging technology and considered as an alternative to overcome some of the disadvantages of radio frequency communication technology in an indoor environment. However, the line of sight nature of the technology limits the user mobility and create new challenges to provides seamless network coverage under user mobility scenarios. In VLC multi access points and multi cell based network, co channel interference (CCI) between neighbor cells limits the overall performance. Therefore, by following the already existing VLC handover systems, this statistical parameter based novel handover method is designed. This paper proposes a handover mechanism for indoor VLC systems by introducing cell ID bits and statistical kurtosis values of the cell ID waveforms as metric for the handover initiation. As advantage of the method, effects from the CCI can be eliminate when measuring signal strengths in the signal overlap** cell boundary area. Also the handover criteria are adoptive to the different ambient lighting conditions compared to existing pre-configured light intensity threshold based handover systems. With the use of bit error rate (BER), the experiment results showed that Kurtosis value of the cell ID waveforms can be used as metric to initiate network handover in indoor VLC systems. △ Less

Submitted 24 June, 2023; originally announced June 2023.

arXiv:2305.19663 [pdf, other]

Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary Domains

Authors: Levi Lingsch, Mike Y. Michelis, Emmanuel de Bezenac, Sirani M. Perera, Robert K. Katzschmann, Siddhartha Mishra

Abstract: The computational efficiency of many neural operators, widely used for learning solutions of PDEs, relies on the fast Fourier transform (FFT) for performing spectral computations. As the FFT is limited to equispaced (rectangular) grids, this limits the efficiency of such neural operators when applied to problems where the input and output functions need to be processed on general non-equispaced po… ▽ More The computational efficiency of many neural operators, widely used for learning solutions of PDEs, relies on the fast Fourier transform (FFT) for performing spectral computations. As the FFT is limited to equispaced (rectangular) grids, this limits the efficiency of such neural operators when applied to problems where the input and output functions need to be processed on general non-equispaced point distributions. Leveraging the observation that a limited set of Fourier (Spectral) modes suffice to provide the required expressivity of a neural operator, we propose a simple method, based on the efficient direct evaluation of the underlying spectral transformation, to extend neural operators to arbitrary domains. An efficient implementation* of such direct spectral evaluations is coupled with existing neural operator models to allow the processing of data on arbitrary non-equispaced distributions of points. With extensive empirical evaluation, we demonstrate that the proposed method allows us to extend neural operators to arbitrary point distributions with significant gains in training speed over baselines while retaining or improving the accuracy of Fourier neural operators (FNOs) and related neural operators. △ Less

Submitted 20 May, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

Comments: 20 pages, 12 figures

arXiv:2305.06402 [pdf, ps, other]

Analyzing Bias in Diffusion-based Face Generation Models

Authors: Malsha V. Perera, Vishal M. Patel

Abstract: Diffusion models are becoming increasingly popular in synthetic data generation and image editing applications. However, these models can amplify existing biases and propagate them to downstream applications. Therefore, it is crucial to understand the sources of bias in their outputs. In this paper, we investigate the presence of bias in diffusion-based face generation models with respect to attri… ▽ More Diffusion models are becoming increasingly popular in synthetic data generation and image editing applications. However, these models can amplify existing biases and propagate them to downstream applications. Therefore, it is crucial to understand the sources of bias in their outputs. In this paper, we investigate the presence of bias in diffusion-based face generation models with respect to attributes such as gender, race, and age. Moreover, we examine how dataset size affects the attribute composition and perceptual quality of both diffusion and Generative Adversarial Network (GAN) based face generation models across various attribute classes. Our findings suggest that diffusion models tend to worsen distribution bias in the training data for various attributes, which is heavily influenced by the size of the dataset. Conversely, GAN models trained on balanced datasets with a larger number of samples show less bias across different attributes. △ Less

Submitted 10 May, 2023; originally announced May 2023.

arXiv:2303.05454 [pdf, other]

Teleoperation of Soft Modular Robots: Study on Real-time Stability and Gait Control

Authors: Dulanjana M. Perera, Dimuthu D. K. Arachchige, Sanjaya Mallikarachchi, Talal Ghafoor, Iyad Kanj, Yue Chen, Isuru S. Godage

Abstract: Soft robotics holds tremendous potential for various applications, especially in unstructured environments such as search and rescue operations. However, the lack of autonomy and teleoperability, limited capabilities, absence of gait diversity and real-time control, and onboard sensors to sense the surroundings are some of the common issues with soft-limbed robots. To overcome these limitations, w… ▽ More Soft robotics holds tremendous potential for various applications, especially in unstructured environments such as search and rescue operations. However, the lack of autonomy and teleoperability, limited capabilities, absence of gait diversity and real-time control, and onboard sensors to sense the surroundings are some of the common issues with soft-limbed robots. To overcome these limitations, we propose a spatially symmetric, topologically-stable, soft-limbed tetrahedral robot that can perform multiple locomotion gaits. We introduce a kinematic model, derive locomotion trajectories for different gaits, and design a teleoperation mechanism to enable real-time human-robot collaboration. We use the kinematic model to map teleoperation inputs and ensure smooth transitions between gaits. Additionally, we leverage the passive compliance and natural stability of the robot for toppling and obstacle navigation. Through experimental tests, we demonstrate the robot's ability to tackle various locomotion challenges, adapt to different situations, and navigate obstructed environments via teleoperation. △ Less

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2303.02291 [pdf, other]

Dynamic Modeling and Validation of Soft Robotic Snake Locomotion

Authors: Dimuthu D. K. Arachchige, Dulanjana M. Perera, Sanjaya Mallikarachchi, Iyad Kanj, Yue Chen, Hunter B. Gilbert, Isuru S. Godage

Abstract: Soft robotic snakes made of compliant materials can continuously deform their bodies and, therefore, mimic the biological snakes' flexible and agile locomotion gaits better than their rigid-bodied counterparts. Without wheel support, to date, soft robotic snakes are limited to emulating planar locomotion gaits, which are derived via kinematic modeling and tested on robotic prototypes. Given that t… ▽ More Soft robotic snakes made of compliant materials can continuously deform their bodies and, therefore, mimic the biological snakes' flexible and agile locomotion gaits better than their rigid-bodied counterparts. Without wheel support, to date, soft robotic snakes are limited to emulating planar locomotion gaits, which are derived via kinematic modeling and tested on robotic prototypes. Given that the snake locomotion results from the reaction forces due to the distributed contact between their skin and the ground, it is essential to investigate the locomotion gaits through efficient dynamic models capable of accommodating distributed contact forces. We present a complete spatial dynamic model that utilizes a floating-base kinematic model with distributed contact dynamics for a pneumatically powered soft robotic snake. We numerically evaluate the feasibility of the planar and spatial rolling gaits utilizing the proposed model and experimentally validate the corresponding locomotion gait trajectories on a soft robotic snake prototype. We qualitatively and quantitatively compare the numerical and experimental results which confirm the validity of the proposed dynamic model. △ Less

Submitted 3 March, 2023; originally announced March 2023.

Comments: This paper has been accepted to 2023 IEEE International Conference on Control, Automation and Robotics (ICCAR)

arXiv:2303.02285 [pdf, other]

Wheelless Soft Robotic Snake Locomotion: Study on Sidewinding and Helical Rolling Gaits

Authors: Dimuthu D. K. Arachchige, Dulanjana M. Perera, Sanjaya Mallikarachchi, Iyad Kanj, Yue Chen, Isuru S. Godage

Abstract: Soft robotic snakes (SRSs) have a unique combination of continuous and compliant properties that allow them to imitate the complex movements of biological snakes. Despite the previous attempts to develop SRSs, many have been limited to planar movements or use wheels to achieve locomotion, which restricts their ability to imitate the full range of biological snake movements. We propose a new design… ▽ More Soft robotic snakes (SRSs) have a unique combination of continuous and compliant properties that allow them to imitate the complex movements of biological snakes. Despite the previous attempts to develop SRSs, many have been limited to planar movements or use wheels to achieve locomotion, which restricts their ability to imitate the full range of biological snake movements. We propose a new design for the SRSs that is wheelless and powered by pneumatics, relying solely on spatial bending to achieve its movements. We derive a kinematic model of the proposed SRS and utilize it to achieve two snake locomotion trajectories, namely sidewinding and helical rolling. These movements are experimentally evaluated under different gait parameters on our SRS prototype. The results demonstrate that the SRS can successfully mimic the proposed spatial locomotion trajectories. This is a significant improvement over the previous designs, which were either limited to planar movements or relied on wheels for locomotion. The ability of the SRS to effectively mimic the complex movements of biological snakes opens up new possibilities for its use in various applications. △ Less

Submitted 3 March, 2023; originally announced March 2023.

Comments: This paper has been accepted to 2023 IEEE-RAS International Conference on Soft Robotics (RoboSoft)

arXiv:2211.09455 [pdf, other]

Consultation Checklists: Standardising the Human Evaluation of Medical Note Generation

Authors: Aleksandar Savkov, Francesco Moramarco, Alex Papadopoulos Korfiatis, Mark Perera, Anya Belz, Ehud Reiter

Abstract: Evaluating automatically generated text is generally hard due to the inherently subjective nature of many aspects of the output quality. This difficulty is compounded in automatic consultation note generation by differing opinions between medical experts both about which patient statements should be included in generated notes and about their respective importance in arriving at a diagnosis. Previ… ▽ More Evaluating automatically generated text is generally hard due to the inherently subjective nature of many aspects of the output quality. This difficulty is compounded in automatic consultation note generation by differing opinions between medical experts both about which patient statements should be included in generated notes and about their respective importance in arriving at a diagnosis. Previous real-world evaluations of note-generation systems saw substantial disagreement between expert evaluators. In this paper we propose a protocol that aims to increase objectivity by grounding evaluations in Consultation Checklists, which are created in a preliminary step and then used as a common point of reference during quality assessment. We observed good levels of inter-annotator agreement in a first evaluation study using the protocol; further, using Consultation Checklists produced in the study as reference for automatic metrics such as ROUGE or BERTScore improves their correlation with human judgements compared to using the original human note. △ Less

Submitted 17 November, 2022; originally announced November 2022.

Comments: Accepted for publication at EMNLP 2022

arXiv:2206.10795 [pdf, other]

doi 10.1016/j.eswa.2022.117690

Multi-Resolution, Multi-Horizon Distributed Solar PV Power Forecasting with Forecast Combinations

Authors: Maneesha Perera, Julian De Hoog, Kasun Bandara, Saman Halgamuge

Abstract: Distributed, small-scale solar photovoltaic (PV) systems are being installed at a rapidly increasing rate. This can cause major impacts on distribution networks and energy markets. As a result, there is a significant need for improved forecasting of the power generation of these systems at different time resolutions and horizons. However, the performance of forecasting models depends on the resolu… ▽ More Distributed, small-scale solar photovoltaic (PV) systems are being installed at a rapidly increasing rate. This can cause major impacts on distribution networks and energy markets. As a result, there is a significant need for improved forecasting of the power generation of these systems at different time resolutions and horizons. However, the performance of forecasting models depends on the resolution and horizon. Forecast combinations (ensembles), that combine the forecasts of multiple models into a single forecast may be robust in such cases. Therefore, in this paper, we provide comparisons and insights into the performance of five state-of-the-art forecast models and existing forecast combinations at multiple resolutions and horizons. We propose a forecast combination approach based on particle swarm optimization (PSO) that will enable a forecaster to produce accurate forecasts for the task at hand by weighting the forecasts produced by individual models. Furthermore, we compare the performance of the proposed combination approach with existing forecast combination approaches. A comprehensive evaluation is conducted using a real-world residential PV power data set measured at 25 houses located in three locations in the United States. The results across four different resolutions and four different horizons show that the PSO-based forecast combination approach outperforms the use of any individual forecast model and other forecast combination counterparts, with an average Mean Absolute Scaled Error reduction by 3.81% compared to the best performing individual model. Our approach enables a solar forecaster to produce accurate forecasts for their application regardless of the forecast resolution or horizon. △ Less

Submitted 21 June, 2022; originally announced June 2022.

Journal ref: Expert Systems with Applications 205 (2022)

arXiv:2206.04514 [pdf, ps, other]

doi 10.1109/LGRS.2023.3270799

SAR Despeckling using a Denoising Diffusion Probabilistic Model

Authors: Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

Abstract: Speckle is a multiplicative noise which affects all coherent imaging modalities including Synthetic Aperture Radar (SAR) images. The presence of speckle degrades the image quality and adversely affects the performance of SAR image understanding applications such as automatic target recognition and change detection. Thus, SAR despeckling is an important problem in remote sensing. In this paper, we… ▽ More Speckle is a multiplicative noise which affects all coherent imaging modalities including Synthetic Aperture Radar (SAR) images. The presence of speckle degrades the image quality and adversely affects the performance of SAR image understanding applications such as automatic target recognition and change detection. Thus, SAR despeckling is an important problem in remote sensing. In this paper, we introduce SAR-DDPM, a denoising diffusion probabilistic model for SAR despeckling. The proposed method comprises of a Markov chain that transforms clean images to white Gaussian noise by repeatedly adding random noise. The despeckled image is recovered by a reverse process which iteratively predicts the added noise using a noise predictor which is conditioned on the speckled image. In addition, we propose a new inference strategy based on cycle spinning to improve the despeckling performance. Our experiments on both synthetic and real SAR images demonstrate that the proposed method achieves significant improvements in both quantitative and qualitative results over the state-of-the-art despeckling methods. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: Our code is available at https://github.com/malshaV/SAR_DDPM

arXiv:2206.00779 [pdf, other]

doi 10.1109/ACCESS.2020.2970342

Radix-2 Self-Recursive Sparse Factorizations of Delay Vandermonde Matrices for Wideband Multi-Beam Antenna Arrays

Authors: S. M. Perera, A. Madanayake, R. J. Cintra

Abstract: This paper presents a self-contained factorization for the Vandermonde matrices associated with true-time delay based wideband analog multi-beam beamforming using antenna arrays. The proposed factorization contains sparse and orthogonal matrices. Novel self-recursive radix-2 algorithms for Vandermonde matrices associated with true time delay based delay-sum filterbanks are presented to reduce the… ▽ More This paper presents a self-contained factorization for the Vandermonde matrices associated with true-time delay based wideband analog multi-beam beamforming using antenna arrays. The proposed factorization contains sparse and orthogonal matrices. Novel self-recursive radix-2 algorithms for Vandermonde matrices associated with true time delay based delay-sum filterbanks are presented to reduce the circuit complexity of multi-beam analog beamforming systems. The proposed algorithms for Vandermonde matrices by a vector attain $\mathcal{O}(N \log N)$ delay-amplifier circuit counts. Error bounds for the Vandermode matrices associated with true-time delay are established and then analyzed for numerical stability. The potential for real-world circuit implementation of the proposed algorithms will be shown through signal flow graphs that are the starting point for high-frequency analog circuit realizations. △ Less

Submitted 1 June, 2022; originally announced June 2022.

Comments: 20 pages, 1 figure

Journal ref: IEEE Access, vol. 8, 2020

arXiv:2206.00778 [pdf, other]

doi 10.1109/OJSP.2020.2991586

Efficient and Self-Recursive Delay Vandermonde Algorithm for Multi-Beam Antenna Arrays

Authors: S. M. Perera, A. Madanayake, R. J. Cintra

Abstract: This paper presents a self-contained factorization for the delay Vandermonde matrix (DVM), which is the super class of the discrete Fourier transform, using sparse and companion matrices. An efficient DVM algorithm is proposed to reduce the complexity of radio-frequency (RF) $N$-beam analog beamforming systems. There exist applications for wideband multi-beam beamformers in wireless communication… ▽ More This paper presents a self-contained factorization for the delay Vandermonde matrix (DVM), which is the super class of the discrete Fourier transform, using sparse and companion matrices. An efficient DVM algorithm is proposed to reduce the complexity of radio-frequency (RF) $N$-beam analog beamforming systems. There exist applications for wideband multi-beam beamformers in wireless communication networks such as 5G/6G systems, system capacity can be improved by exploiting the improvement of the signal to noise ratio (SNR) using coherent summation of propagating waves based on their directions of propagation. The presence of a multitude of RF beams allows multiple independent wireless links to be established at high SNR, or used in conjunction with multiple-input multiple-output (MIMO) wireless systems, with the overall goal of improving system SNR and therefore capacity. To realize such multi-beam beamformers at acceptable analog circuit complexities, we use sparse factorization of the DVM in order to derive a low arithmetic complexity DVM algorithm. The paper also establishes an error bound and stability analysis of the proposed DVM algorithm. The proposed efficient DVM algorithm is aimed at implementation using analog realizations. For purposes of evaluation, the algorithm can be realized using both digital hardware as well as software defined radio platforms. △ Less

Submitted 1 June, 2022; originally announced June 2022.

Comments: 25 pages, 2 figures

Journal ref: IEEE Open Journal of Signal Processing, vol. 1, 2020

arXiv:2205.15906 [pdf, ps, other]

SAR Despeckling Using Overcomplete Convolutional Networks

Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

Abstract: Synthetic Aperture Radar (SAR) despeckling is an important problem in remote sensing as speckle degrades SAR images, affecting downstream tasks like detection and segmentation. Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods. Traditional CNNs try to increase the receptive field size as the network goes deeper, thus extracting global features. H… ▽ More Synthetic Aperture Radar (SAR) despeckling is an important problem in remote sensing as speckle degrades SAR images, affecting downstream tasks like detection and segmentation. Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods. Traditional CNNs try to increase the receptive field size as the network goes deeper, thus extracting global features. However,speckle is relatively small, and increasing receptive field does not help in extracting speckle features. This study employs an overcomplete CNN architecture to focus on learning low-level features by restricting the receptive field. The proposed network consists of an overcomplete branch to focus on the local structures and an undercomplete branch that focuses on the global structures. We show that the proposed network improves despeckling performance compared to recent despeckling methods on synthetic and real SAR images. △ Less

Submitted 31 May, 2022; originally announced May 2022.

Comments: Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at https://github.com/malshaV/sar_overcomplete

arXiv:2205.02549 [pdf, other]

User-Driven Research of Medical Note Generation Software

Authors: Tom Knoll, Francesco Moramarco, Alex Papadopoulos Korfiatis, Rachel Young, Claudia Ruffini, Mark Perera, Christian Perstl, Ehud Reiter, Anya Belz, Aleksandar Savkov

Abstract: A growing body of work uses Natural Language Processing (NLP) methods to automatically generate medical notes from audio recordings of doctor-patient consultations. However, there are very few studies on how such systems could be used in clinical practice, how clinicians would adjust to using them, or how system design should be influenced by such considerations. In this paper, we present three ro… ▽ More A growing body of work uses Natural Language Processing (NLP) methods to automatically generate medical notes from audio recordings of doctor-patient consultations. However, there are very few studies on how such systems could be used in clinical practice, how clinicians would adjust to using them, or how system design should be influenced by such considerations. In this paper, we present three rounds of user studies, carried out in the context of develo** a medical note generation system. We present, analyse and discuss the participating clinicians' impressions and views of how the system ought to be adapted to be of value to them. Next, we describe a three-week test run of the system in a live telehealth clinical practice. Major findings include (i) the emergence of five different note-taking behaviours; (ii) the importance of the system generating notes in real time during the consultation; and (iii) the identification of a number of clinical use cases that could prove challenging for automatic note generation systems. △ Less

Submitted 6 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

Comments: Accepted for publication at NAACL 2022

arXiv:2205.02071 [pdf, other]

ANUBIS: Skeleton Action Recognition Dataset, Review, and Benchmark

Authors: Zhenyue Qin, Yang Liu, Madhawa Perera, Tom Gedeon, Pan Ji, Dongwoo Kim, Saeed Anwar

Abstract: Skeleton-based action recognition, as a subarea of action recognition, is swiftly accumulating attention and popularity. The task is to recognize actions performed by human articulation points. Compared with other data modalities, 3D human skeleton representations have extensive unique desirable characteristics, including succinctness, robustness, racial-impartiality, and many more. We aim to prov… ▽ More Skeleton-based action recognition, as a subarea of action recognition, is swiftly accumulating attention and popularity. The task is to recognize actions performed by human articulation points. Compared with other data modalities, 3D human skeleton representations have extensive unique desirable characteristics, including succinctness, robustness, racial-impartiality, and many more. We aim to provide a roadmap for new and existing researchers a on the landscapes of skeleton-based action recognition for new and existing researchers. To this end, we present a review in the form of a taxonomy on existing works of skeleton-based action recognition. We partition them into four major categories: (1) datasets; (2) extracting spatial features; (3) capturing temporal patterns; (4) improving signal quality. For each method, we provide concise yet informatively-sufficient descriptions. To promote more fair and comprehensive evaluation on existing approaches of skeleton-based action recognition, we collect ANUBIS, a large-scale human skeleton dataset. Compared with previously collected dataset, ANUBIS are advantageous in the following four aspects: (1) employing more recently released sensors; (2) containing novel back view; (3) encouraging high enthusiasm of subjects; (4) including actions of the COVID pandemic era. Using ANUBIS, we comparably benchmark performance of current skeleton-based action recognizers. At the end of this paper, we outlook future development of skeleton-based action recognition by listing several new technical problems. We believe they are valuable to solve in order to commercialize skeleton-based action recognition in the near future. The dataset of ANUBIS is available at: http://hcc-workshop.anu.edu.au/webs/anu101/home. △ Less

Submitted 8 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

arXiv:2204.00447 [pdf, other]

Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation

Authors: Francesco Moramarco, Alex Papadopoulos Korfiatis, Mark Perera, Damir Juric, Jack Flann, Ehud Reiter, Anya Belz, Aleksandar Savkov

Abstract: In recent years, machine learning models have rapidly become better at generating clinical consultation notes; yet, there is little work on how to properly evaluate the generated consultation notes to understand the impact they may have on both the clinician using them and the patient's clinical safety. To address this we present an extensive human evaluation study of consultation notes where 5 cl… ▽ More In recent years, machine learning models have rapidly become better at generating clinical consultation notes; yet, there is little work on how to properly evaluate the generated consultation notes to understand the impact they may have on both the clinician using them and the patient's clinical safety. To address this we present an extensive human evaluation study of consultation notes where 5 clinicians (i) listen to 57 mock consultations, (ii) write their own notes, (iii) post-edit a number of automatically generated notes, and (iv) extract all the errors, both quantitative and qualitative. We then carry out a correlation study with 18 automatic quality metrics and the human judgements. We find that a simple, character-based Levenshtein distance metric performs on par if not better than common model-based metrics like BertScore. All our findings and annotations are open-sourced. △ Less

Submitted 1 April, 2022; originally announced April 2022.

Comments: To be published in proceedings of ACL 2022

arXiv:2201.09355 [pdf, ps, other]

Transformer-based SAR Image Despeckling

Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

Abstract: Synthetic Aperture Radar (SAR) images are usually degraded by a multiplicative noise known as speckle which makes processing and interpretation of SAR images difficult. In this paper, we introduce a transformer-based network for SAR image despeckling. The proposed despeckling network comprises of a transformer-based encoder which allows the network to learn global dependencies between different im… ▽ More Synthetic Aperture Radar (SAR) images are usually degraded by a multiplicative noise known as speckle which makes processing and interpretation of SAR images difficult. In this paper, we introduce a transformer-based network for SAR image despeckling. The proposed despeckling network comprises of a transformer-based encoder which allows the network to learn global dependencies between different image regions - aiding in better despeckling. The network is trained end-to-end with synthetically generated speckled images using a composite loss function. Experiments show that the proposed method achieves significant improvements over traditional and convolutional neural network-based despeckling methods on both synthetic and real SAR images. △ Less

Submitted 23 January, 2022; originally announced January 2022.

Comments: Submitted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at https://github.com/malshaV/sar_transformer

arXiv:2108.10130 [pdf, other]

No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees

Authors: R. Malinga Perera, Bastian Oetomo, Benjamin I. P. Rubinstein, Renata Borovica-Gajic

Abstract: Automating physical database design has remained a long-term interest in database research due to substantial performance gains afforded by optimised structures. Despite significant progress, a majority of today's commercial solutions are highly manual, requiring offline invocation by database administrators (DBAs) who are expected to identify and supply representative training workloads. Even the… ▽ More Automating physical database design has remained a long-term interest in database research due to substantial performance gains afforded by optimised structures. Despite significant progress, a majority of today's commercial solutions are highly manual, requiring offline invocation by database administrators (DBAs) who are expected to identify and supply representative training workloads. Even the latest advancements like query stores provide only limited support for dynamic environments. This status quo is untenable: identifying representative static workloads is no longer realistic; and physical design tools remain susceptible to the query optimiser's cost misestimates. Furthermore, modern application environments such as hybrid transactional and analytical processing (HTAP) systems render analytical modelling next to impossible. We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead learns the benefits of viable structures through strategic exploration and direct performance observation. We view the problem as one of sequential decision making under uncertainty, specifically within the bandit learning setting. Multi-armed bandits balance exploration and exploitation to provably guarantee average performance that converges to policies that are optimal with perfect hindsight. Our comprehensive empirical evaluation against a state-of-the-art commercial tuning tool demonstrates up to 75% speed-up on shifting and ad-hoc workloads and up to 28% speed-up on static workloads in analytical processing environments. In HTAP environments, our solution provides up to 59% speed-up on shifting and 51% speed-up on static workloads. Furthermore, our bandit framework outperforms deep reinforcement learning (RL) in terms of convergence speed and performance volatility (providing up to 58% speed-up). △ Less

Submitted 23 August, 2021; originally announced August 2021.

Comments: 25 pages, 20 figures, 5 tables. arXiv admin note: substantial text overlap with arXiv:2010.09208

arXiv:2107.01179 [pdf, ps, other]

Google COVID-19 Vaccination Search Insights: Anonymization Process Description

Authors: Shailesh Bavadekar, Adam Boulanger, John Davis, Damien Desfontaines, Evgeniy Gabrilovich, Krishna Gadepalli, Badih Ghazi, Tague Griffith, Jai Gupta, Chaitanya Kamath, Dennis Kraft, Ravi Kumar, Akim Kumok, Yael Mayer, Pasin Manurangsi, Arti Patankar, Irippuge Milinda Perera, Chris Scott, Tomer Shekel, Benjamin Miller, Karen Smith, Charlotte Stanton, Mimi Sun, Mark Young, Gregory Wellenius

Abstract: This report describes the aggregation and anonymization process applied to the COVID-19 Vaccination Search Insights (published at http://goo.gle/covid19vaccinationinsights), a publicly available dataset showing aggregated and anonymized trends in Google searches related to COVID-19 vaccination. The applied anonymization techniques protect every user's daily search activity related to COVID-19 vacc… ▽ More This report describes the aggregation and anonymization process applied to the COVID-19 Vaccination Search Insights (published at http://goo.gle/covid19vaccinationinsights), a publicly available dataset showing aggregated and anonymized trends in Google searches related to COVID-19 vaccination. The applied anonymization techniques protect every user's daily search activity related to COVID-19 vaccinations with $(\varepsilon, δ)$-differential privacy for $\varepsilon = 2.19$ and $δ= 10^{-5}$. △ Less

Submitted 7 July, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

arXiv:2106.07893 [pdf, ps, other]

A General Purpose Transpiler for Fully Homomorphic Encryption

Authors: Shruthi Gorantala, Rob Springer, Sean Purser-Haskell, William Lam, Royce Wilson, Asra Ali, Eric P. Astor, Itai Zukerman, Sam Ruth, Christoph Dibak, Phillipp Schoppmann, Sasha Kulankhina, Alain Forget, David Marn, Cameron Tew, Rafael Misoczki, Bernat Guillen, Xinyu Ye, Dennis Kraft, Damien Desfontaines, Aishe Krishnamurthy, Miguel Guevara, Irippuge Milinda Perera, Yurii Sushko, Bryant Gipson

Abstract: Fully homomorphic encryption (FHE) is an encryption scheme which enables computation on encrypted data without revealing the underlying data. While there have been many advances in the field of FHE, develo** programs using FHE still requires expertise in cryptography. In this white paper, we present a fully homomorphic encryption transpiler that allows developers to convert high-level code (e.g.… ▽ More Fully homomorphic encryption (FHE) is an encryption scheme which enables computation on encrypted data without revealing the underlying data. While there have been many advances in the field of FHE, develo** programs using FHE still requires expertise in cryptography. In this white paper, we present a fully homomorphic encryption transpiler that allows developers to convert high-level code (e.g., C++) that works on unencrypted data into high-level code that operates on encrypted data. Thus, our transpiler makes transformations possible on encrypted data. Our transpiler builds on Google's open-source XLS SDK (https://github.com/google/xls) and uses an off-the-shelf FHE library, TFHE (https://tfhe.github.io/tfhe/), to perform low-level FHE operations. The transpiler design is modular, which means the underlying FHE library as well as the high-level input and output languages can vary. This modularity will help accelerate FHE research by providing an easy way to compare arbitrary programs in different FHE schemes side-by-side. We hope this lays the groundwork for eventual easy adoption of FHE by software developers. As a proof-of-concept, we are releasing an experimental transpiler (https://github.com/google/fully-homomorphic-encryption/tree/main/transpiler) as open-source software. △ Less

Submitted 15 June, 2021; originally announced June 2021.

arXiv:2101.00435 [pdf, ps, other]

A Thickness Sensitive Vessel Extraction Framework for Retinal and Conjunctival Vascular Tortuosity Analysis

Authors: Ashwin De Silva, Malsha V. Perera, Navodini Wijethilake, Saroj Jayasinghe, Nuwan D. Nanayakkara, Anjula De Silva

Abstract: Systemic diseases such as diabetes, hypertension, atherosclerosis are among the leading causes of annual human mortality rate. It is suggested that retinal and conjunctival vascular tortuosity is a potential biomarker for such systemic diseases. Most importantly, it is observed that the tortuosity depends on the thickness of these vessels. Therefore, selective calculation of tortuosity within spec… ▽ More Systemic diseases such as diabetes, hypertension, atherosclerosis are among the leading causes of annual human mortality rate. It is suggested that retinal and conjunctival vascular tortuosity is a potential biomarker for such systemic diseases. Most importantly, it is observed that the tortuosity depends on the thickness of these vessels. Therefore, selective calculation of tortuosity within specific vessel thicknesses is required depending on the disease being analysed. In this paper, we propose a thickness sensitive vessel extraction framework that is primarily applicable for studies related to retinal and conjunctival vascular tortuosity. The framework uses a Convolutional Neural Network based on the IterNet architecture to obtain probability maps of the entire vasculature. They are then processed by a multi-scale vessel enhancement technique that exploits both fine and coarse structural vascular details of these probability maps in order to extract vessels of specified thicknesses. We evaluated the proposed framework on four datasets including DRIVE and SBVPI, and obtained Matthew's Correlation Coefficient values greater than 0.71 for all the datasets. In addition, the proposed framework was utilized to determine the association of diabetes with retinal and conjunctival vascular tortuosity. We observed that retinal vascular tortuosity (Eccentricity based Tortuosity Index) of the diabetic group was significantly higher (p < .05) than that of the non-diabetic group and that conjunctival vascular tortuosity (Total Curvature normalized by Arc Length) of diabetic group was significantly lower (p < .05) than that of the non-diabetic group. These observations were in agreement with the literature, strengthening the suitability of the proposed framework. △ Less

Submitted 2 January, 2021; originally announced January 2021.

Comments: Submitted for Reviewing

arXiv:2010.13268 [pdf, ps, other]

A Joint Convolutional and Spatial Quad-Directional LSTM Network for Phase Unwrap**

Authors: Malsha V. Perera, Ashwin De Silva

Abstract: Phase unwrap** is a classical ill-posed problem which aims to recover the true phase from wrapped phase. In this paper, we introduce a novel Convolutional Neural Network (CNN) that incorporates a Spatial Quad-Directional Long Short Term Memory (SQD-LSTM) for phase unwrap**, by formulating it as a regression problem. Incorporating SQD-LSTM can circumvent the typical CNNs' inherent difficulty of… ▽ More Phase unwrap** is a classical ill-posed problem which aims to recover the true phase from wrapped phase. In this paper, we introduce a novel Convolutional Neural Network (CNN) that incorporates a Spatial Quad-Directional Long Short Term Memory (SQD-LSTM) for phase unwrap**, by formulating it as a regression problem. Incorporating SQD-LSTM can circumvent the typical CNNs' inherent difficulty of learning global spatial dependencies which are vital when recovering the true phase. Furthermore, we employ a problem specific composite loss function to train this network. The proposed network is found to be performing better than the existing methods under severe noise conditions (Normalized Root Mean Square Error of 1.3 % at SNR = 0 dB) while spending a significantly less computational time (0.054 s). The network also does not require a large scale dataset during training, thus making it ideal for applications with limited data that require fast and accurate phase unwrap**. △ Less

Submitted 25 October, 2020; originally announced October 2020.

Comments: Under Review

arXiv:2010.09208 [pdf, other]

DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees

Authors: R. Malinga Perera, Bastian Oetomo, Benjamin I. P. Rubinstein, Renata Borovica-Gajic

Abstract: Automating physical database design has remained a long-term interest in database research due to substantial performance gains afforded by optimised structures. Despite significant progress, a majority of today's commercial solutions are highly manual, requiring offline invocation by database administrators (DBAs) who are expected to identify and supply representative training workloads. Unfortun… ▽ More Automating physical database design has remained a long-term interest in database research due to substantial performance gains afforded by optimised structures. Despite significant progress, a majority of today's commercial solutions are highly manual, requiring offline invocation by database administrators (DBAs) who are expected to identify and supply representative training workloads. Unfortunately, the latest advancements like query stores provide only limited support for dynamic environments. This status quo is untenable: identifying representative static workloads is no longer realistic; and physical design tools remain susceptible to the query optimiser's cost misestimates (stemming from unrealistic assumptions such as attribute value independence and uniformity of data distribution). We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead learns the benefits of viable structures through strategic exploration and direct performance observation. We view the problem as one of sequential decision making under uncertainty, specifically within the bandit learning setting. Multi-armed bandits balance exploration and exploitation to provably guarantee average performance that converges to a fixed policy that is optimal with perfect hindsight. Our comprehensive empirical results demonstrate up to 75% speed-up on shifting and ad-hoc workloads and 28% speed-up on static workloads compared against a state-of-the-art commercial tuning tool. △ Less

Submitted 19 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

Comments: 12 pages, 8 figures

arXiv:2009.02575 [pdf, ps, other]

doi 10.1109/SMC42975.2020.9283285

Low-cost Active Dry-Contact Surface EMG Sensor for Bionic Arms

Authors: Asma M. Naim, Kithmin Wickramasinghe, Ashwin De Silva, Malsha V. Perera, Thilina Dulantha Lalitharatne, Simon L. Kappel

Abstract: Surface electromyography (sEMG) is a popular bio-signal used for controlling prostheses and finger gesture recognition mechanisms. Myoelectric prostheses are costly, and most commercially available sEMG acquisition systems are not suitable for real-time gesture recognition. In this paper, a method of acquiring sEMG signals using novel low-cost, active, dry-contact, flexible sensors has been propos… ▽ More Surface electromyography (sEMG) is a popular bio-signal used for controlling prostheses and finger gesture recognition mechanisms. Myoelectric prostheses are costly, and most commercially available sEMG acquisition systems are not suitable for real-time gesture recognition. In this paper, a method of acquiring sEMG signals using novel low-cost, active, dry-contact, flexible sensors has been proposed. Since the active sEMG sensor was developed to be used along with a bionic arm, the sensor was tested for its ability to acquire sEMG signals that could be used for real-time classification of five selected gestures. In a study of 4 subjects, the average classification accuracy for real-time gesture classification using the active sEMG sensor system was 85%. The common-mode rejection ratio of the sensor was measured to 59 dB, and thus the sensor's performance was not substantially limited by its active circuitry. The proposed sensors can be interfaced with a variety of amplifiers to perform fully wearable sEMG acquisition. This satisfies the need for a low-cost sEMG acquisition system for prostheses. △ Less

Submitted 9 September, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

Comments: Paper accepted to IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2020

arXiv:2009.01265 [pdf, ps, other]

Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (version 1.0)

Authors: Shailesh Bavadekar, Andrew Dai, John Davis, Damien Desfontaines, Ilya Eckstein, Katie Everett, Alex Fabrikant, Gerardo Flores, Evgeniy Gabrilovich, Krishna Gadepalli, Shane Glass, Rayman Huang, Chaitanya Kamath, Dennis Kraft, Akim Kumok, Hinali Marfatia, Yael Mayer, Benjamin Miller, Adam Pearce, Irippuge Milinda Perera, Venky Ramachandran, Karthik Raman, Thomas Roessler, Izhak Shafran, Tomer Shekel , et al. (5 additional authors not shown)

Abstract: This report describes the aggregation and anonymization process applied to the initial version of COVID-19 Search Trends symptoms dataset (published at https://goo.gle/covid19symptomdataset on September 2, 2020), a publicly available dataset that shows aggregated, anonymized trends in Google searches for symptoms (and some related topics). The anonymization process is designed to protect the daily… ▽ More This report describes the aggregation and anonymization process applied to the initial version of COVID-19 Search Trends symptoms dataset (published at https://goo.gle/covid19symptomdataset on September 2, 2020), a publicly available dataset that shows aggregated, anonymized trends in Google searches for symptoms (and some related topics). The anonymization process is designed to protect the daily symptom search activity of every user with $\varepsilon$-differential privacy for $\varepsilon$ = 1.68. △ Less

Submitted 2 September, 2020; originally announced September 2020.

arXiv:2004.04869 [pdf]

Influence of salt on membrane rigidity of neu-tral DOPC vesicles

Authors: Judith U. De Mel, Sudipta Gupta, Rasangi M. Perera, Ly Ngo, Piotr Zolnierczuk, Markus Bleuel, Sai Venkatesh **ali, Gerald J. Schneider

Abstract: Salt is a very common molecule in aqueous environments but the question of whether the interactions of monovalent ions Na^+ and Cl^- ,with the neutral heads of phospholipids are impactful enough to change the membrane rigidity is still a mystery. To provide a resolution to this long simmering debate, we investigated the dynamics of DOPC vesicles in the fluid phase with increasing external salt con… ▽ More Salt is a very common molecule in aqueous environments but the question of whether the interactions of monovalent ions Na^+ and Cl^- ,with the neutral heads of phospholipids are impactful enough to change the membrane rigidity is still a mystery. To provide a resolution to this long simmering debate, we investigated the dynamics of DOPC vesicles in the fluid phase with increasing external salt concentration. At higher salt concentrations, we observe an increase in bending rigidity from neutron spin echo spectroscopy (NSE) and an increase in bilayer thickness from small-angle X-ray scattering (SAXS). We compared different models to distinguish membrane undulations, lipid tail motions and the translational diffusion of the vesicles. All the models indicate an increase in bending rigidity by a factor of 1.3 to 3.6. We demonstrate that even for t > 10 ns, and for Q > 0.07 1/Å the observed NSE relaxation spectra is clearly influenced by the translational diffusion of the vesicles. For t < 5 ns, the lipid tail motions dominate the intermediate dynamic structure factor. As the salt concentration increases this contribution diminishes. We introduced a new time-dependent analysis for the bending rigidity that highlights only a limited Zilman-Granek time window where the rigidity is physically meaningful. △ Less

Submitted 9 April, 2020; originally announced April 2020.

Comments: 7 figures, 32 pages, submitted to ACS Langmuir

arXiv:2002.03159 [pdf, other]

doi 10.1109/ICASSP40776.2020.9054227

Real-Time Hand Gesture Recognition Using Temporal Muscle Activation Maps of Multi-Channel sEMG Signals

Authors: Ashwin De Silva, Malsha V. Perera, Kithmin Wickramasinghe, Asma M. Naim, Thilina Dulantha Lalitharatne, Simon L. Kappel

Abstract: Accurate and real-time hand gesture recognition is essential for controlling advanced hand prostheses. Surface Electromyography (sEMG) signals obtained from the forearm are widely used for this purpose. Here, we introduce a novel hand gesture representation called Temporal Muscle Activation (TMA) maps which captures information about the activation patterns of muscles in the forearm. Based on thes… ▽ More Accurate and real-time hand gesture recognition is essential for controlling advanced hand prostheses. Surface Electromyography (sEMG) signals obtained from the forearm are widely used for this purpose. Here, we introduce a novel hand gesture representation called Temporal Muscle Activation (TMA) maps which captures information about the activation patterns of muscles in the forearm. Based on these maps, we propose an algorithm that can recognize hand gestures in real-time using a Convolution Neural Network. The algorithm was tested on 8 healthy subjects with sEMG signals acquired from 8 electrodes placed along the circumference of the forearm. The average classification accuracy of the proposed method was 94%, which is comparable to state-of-the-art methods. The average computation time of a prediction was 5.5ms, making the algorithm ideal for the real-time gesture recognition applications. △ Less

Submitted 8 February, 2020; originally announced February 2020.

Comments: Paper accepted to IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020

arXiv:1911.03756 [pdf, ps, other]

doi 10.1007/s40315-020-00345-6

Pluripotential Theory and Convex Bodies: A Siciak-Zaharjuta theorem

Authors: T. Bayraktar, S. Hussung, N. Levenberg, M. Perera

Abstract: We work in the setting of weighted pluripotential theory arising from polynomials associated to a convex body $P$ in $({\bf R}^+)^d$. We define the {\it logarithmic indicator function} on ${\bf C}^d$: $$H_P(z):=\sup_{ J\in P} \log |z^{ J}|:=\sup_{ J\in P} \log[|z_1|^{ j_1}\cdots |z_d|^{ j_d}]$$ and an associated class of plurisubharmonic (psh) functions:… ▽ More We work in the setting of weighted pluripotential theory arising from polynomials associated to a convex body $P$ in $({\bf R}^+)^d$. We define the {\it logarithmic indicator function} on ${\bf C}^d$: $$H_P(z):=\sup_{ J\in P} \log |z^{ J}|:=\sup_{ J\in P} \log[|z_1|^{ j_1}\cdots |z_d|^{ j_d}]$$ and an associated class of plurisubharmonic (psh) functions: $$L_P:=\{u\in PSH({\bf C}^d): u(z)- H_P(z) =0(1), \ |z| \to \infty \}.$$ We first show that $L_P$ is not closed under standard smoothing operations. However, utilizing a continuous regularization due to Ferrier which preserves $L_P$, we prove a general Siciak-Zaharjuta type-result in our $P-$setting: the weighted $P-$extremal function $$V_{P,K,Q}(z):=\sup \{u(z):u\in L_P, \ u\leq Q \ \hbox{on} \ K\}$$ associated to a compact set $K$ and an admissible weight $Q$ on $K$ can be obtained using the subclass of $L_P$ arising from functions of the form $\frac{1}{deg_P(p)}\log |p|$ (appropriately normalized). △ Less

Submitted 9 November, 2019; originally announced November 2019.

MSC Class: 32U15

Journal ref: Computational Methods and Function Theory, 20 (2020) no. 3-4, 571-590

arXiv:1902.07500 [pdf, other]

A Note on Bounding Regret of the C$^2$UCB Contextual Combinatorial Bandit

Authors: Bastian Oetomo, Malinga Perera, Renata Borovica-Gajic, Benjamin I. P. Rubinstein

Abstract: We revisit the proof by Qin et al. (2014) of bounded regret of the C$^2$UCB contextual combinatorial bandit. We demonstrate an error in the proof of volumetric expansion of the moment matrix, used in upper bounding a function of context vector norms. We prove a relaxed inequality that yields the originally-stated regret bound. We revisit the proof by Qin et al. (2014) of bounded regret of the C$^2$UCB contextual combinatorial bandit. We demonstrate an error in the proof of volumetric expansion of the moment matrix, used in upper bounding a function of context vector norms. We prove a relaxed inequality that yields the originally-stated regret bound. △ Less

Submitted 20 February, 2019; originally announced February 2019.

Comments: 3 pages

arXiv:1809.07356 [pdf, other]

doi 10.1109/JSEN.2019.2958210

A Single-Channel Consumer-Grade EEG Device for Brain-Computer Interface: Enhancing Detection of SSVEP and Its Amplitude Modulation

Authors: Phairot Autthasan, Xiangqian Du, Jetsada Arnin, Sirakorn Lamyai, Maneesha Perera, Sirawaj Itthipuripat, Tohru Yagi, Poramate Manoonpong, Theerawit Wilaiprasitporn

Abstract: Brain-Computer interfaces (BCIs) play a significant role in easing neuromuscular patients on controlling computers and prosthetics. Due to their high signal-to-noise ratio, steady-state visually evoked potentials (SSVEPs) has been widely used to build BCIs. However, currently developed algorithms do not predict the modulation of SSVEP amplitude, which is known to change as a function of stimulus l… ▽ More Brain-Computer interfaces (BCIs) play a significant role in easing neuromuscular patients on controlling computers and prosthetics. Due to their high signal-to-noise ratio, steady-state visually evoked potentials (SSVEPs) has been widely used to build BCIs. However, currently developed algorithms do not predict the modulation of SSVEP amplitude, which is known to change as a function of stimulus luminance contrast. In this study, we aim to develop an integrated approach to simultaneously estimate the frequency and contrast-related amplitude modulations of the SSVEP signal. To achieve that, we developed a behavioral task in which human participants focused on a visual flicking target which the luminance contrast can change through time in several ways. SSVEP signals from 16 subjects were then recorded from electrodes placed at the central occipital site using a low-cost, consumer-grade EEG. Our results demonstrate that the filter bank canonical correlation analysis (FBCCA) performed well in SSVEP frequency recognition, while the support vector regression (SVR) outperformed the other supervised machine learning algorithms in predicting the contrast-dependent amplitude modulations of the SSVEPs. These findings indicate the applicability and strong performance of our integrated method at simultaneously predicting both frequency and amplitude of visually evoked signals, and have proven to be useful for advancing SSVEP-based applications. △ Less

Submitted 4 December, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

Comments: IEEE Sensors (Accepted)

Journal ref: IEEE Sensor Journal, 2019

arXiv:1806.06426 [pdf, ps, other]

A global domination principle for P-pluripotential theory

Authors: Norm Levenberg, Menuja Perera

Abstract: We prove a global domination principle in the setting of P-pluripotential theory. This has many applications including a general product property for P-extremal functions. The key ingredient is the proof of the existence of a strictly plurisubharmonic P-potential. We prove a global domination principle in the setting of P-pluripotential theory. This has many applications including a general product property for P-extremal functions. The key ingredient is the proof of the existence of a strictly plurisubharmonic P-potential. △ Less

Submitted 17 June, 2018; originally announced June 2018.

MSC Class: 32U15; 32U20; 31C15

arXiv:1805.10685 [pdf, other]

Legal Document Retrieval using Document Vector Embeddings and Deep Learning

Authors: Keet Sugathadasa, Buddhi Ayesha, Nisansa de Silva, Amal Shehan Perera, Vindula Jayawardana, Dimuthu Lakmal, Madhavi Perera

Abstract: Domain specific information retrieval process has been a prominent and ongoing research in the field of natural language processing. Many researchers have incorporated different techniques to overcome the technical and domain specificity and provide a mature model for various domains of interest. The main bottleneck in these studies is the heavy coupling of domain experts, that makes the entire pr… ▽ More Domain specific information retrieval process has been a prominent and ongoing research in the field of natural language processing. Many researchers have incorporated different techniques to overcome the technical and domain specificity and provide a mature model for various domains of interest. The main bottleneck in these studies is the heavy coupling of domain experts, that makes the entire process to be time consuming and cumbersome. In this study, we have developed three novel models which are compared against a golden standard generated via the on line repositories provided, specifically for the legal domain. The three different models incorporated vector space representations of the legal domain, where document vector generation was done in two different mechanisms and as an ensemble of the above two. This study contains the research being carried out in the process of representing legal case documents into different vector spaces, whilst incorporating semantic word measures and natural language processing techniques. The ensemble model built in this study, shows a significantly higher accuracy level, which indeed proves the need for incorporation of domain specific semantic similarity measures into the information retrieval process. This study also shows, the impact of varying distribution of the word similarity measures, against varying document vector dimensions, which can lead to improvements in the process of legal information retrieval. △ Less

Submitted 27 May, 2018; originally announced May 2018.

arXiv:1709.02911 [pdf, other]

doi 10.1109/ICTER.2017.8257822

Semi-Supervised Instance Population of an Ontology using Word Vector Embeddings

Authors: Vindula Jayawardana, Dimuthu Lakmal, Nisansa de Silva, Amal Shehan Perera, Keet Sugathadasa, Buddhi Ayesha, Madhavi Perera

Abstract: In many modern day systems such as information extraction and knowledge management agents, ontologies play a vital role in maintaining the concept hierarchies of the selected domain. However, ontology population has become a problematic process due to its nature of heavy coupling with manual human intervention. With the use of word embeddings in the field of natural language processing, it became… ▽ More In many modern day systems such as information extraction and knowledge management agents, ontologies play a vital role in maintaining the concept hierarchies of the selected domain. However, ontology population has become a problematic process due to its nature of heavy coupling with manual human intervention. With the use of word embeddings in the field of natural language processing, it became a popular topic due to its ability to cope up with semantic sensitivity. Hence, in this study, we propose a novel way of semi-supervised ontology population through word embeddings as the basis. We built several models including traditional benchmark models and new types of models which are based on word embeddings. Finally, we ensemble them together to come up with a synergistic model with better accuracy. We demonstrate that our ensemble model can outperform the individual models. △ Less

Submitted 9 September, 2017; originally announced September 2017.

arXiv:1706.01967 [pdf, other]

doi 10.1109/ICIINFS.2017.8300343

Synergistic Union of Word2Vec and Lexicon for Domain Specific Semantic Similarity

Authors: Keet Sugathadasa, Buddhi Ayesha, Nisansa de Silva, Amal Shehan Perera, Vindula Jayawardana, Dimuthu Lakmal, Madhavi Perera

Abstract: Semantic similarity measures are an important part in Natural Language Processing tasks. However Semantic similarity measures built for general use do not perform well within specific domains. Therefore in this study we introduce a domain specific semantic similarity measure that was created by the synergistic union of word2vec, a word embedding method that is used for semantic similarity calculat… ▽ More Semantic similarity measures are an important part in Natural Language Processing tasks. However Semantic similarity measures built for general use do not perform well within specific domains. Therefore in this study we introduce a domain specific semantic similarity measure that was created by the synergistic union of word2vec, a word embedding method that is used for semantic similarity calculation and lexicon based (lexical) semantic similarity methods. We prove that this proposed methodology out performs word embedding methods trained on generic corpus and methods trained on domain specific corpus but do not use lexical semantic similarity methods to augment the results. Further, we prove that text lemmatization can improve the performance of word embedding methods. △ Less

Submitted 8 June, 2017; v1 submitted 6 June, 2017; originally announced June 2017.

Comments: 6 Pages, 3 figures

arXiv:1602.01790 [pdf]

doi 10.1021/acs.nanolett.5b05066

Low-Resistance 2D/2D Ohmic Contacts: A Universal Approach to High-Performance WSe2, MoS2, and MoSe2 Transistors

Authors: Hsun-Jen Chuang, Bhim Chamlagain, Michael Koehler, Meeghage Madusanka Perera, Jiaqiang Yan, David Mandrus, David Tomanek, Zhixian Zhou

Abstract: We report a new strategy for fabricating 2D/2D low-resistance ohmic contacts for a variety of transition metal dichalcogenides (TMDs) using van der Waals assembly of substitutionally doped TMDs as drain/source contacts and TMDs with no intentional do** as channel materials. We demonstrate that few-layer WSe2 field-effect transistors (FETs) with 2D/2D contacts exhibit low contact resistances of ~… ▽ More We report a new strategy for fabricating 2D/2D low-resistance ohmic contacts for a variety of transition metal dichalcogenides (TMDs) using van der Waals assembly of substitutionally doped TMDs as drain/source contacts and TMDs with no intentional do** as channel materials. We demonstrate that few-layer WSe2 field-effect transistors (FETs) with 2D/2D contacts exhibit low contact resistances of ~ 0.3 k ohm.um, high on/off ratios up to > 109, and high drive currents exceeding 320 uA um-1. These favorable characteristics are combined with a two-terminal field-effect hole mobility ~ 2x102 cm2 V-1 s-1 at room temperature, which increases to >2x103 cm2 V-1 s-1 at cryogenic temperatures. We observe a similar performance also in MoS2 and MoSe2 FETs with 2D/2D drain and source contacts. The 2D/2D low-resistance ohmic contacts presented here represent a new device paradigm that overcomes a significant bottleneck in the performance of TMDs and a wide variety of other 2D materials as the channel materials in post-silicon electronics. △ Less

Submitted 4 February, 2016; originally announced February 2016.

Comments: 23 pages, 4 figures, Nano Lett. Accepted

arXiv:1601.04662 [pdf, other]

Signal Flow Graph Approach to Efficient DST I-IV Algorithms

Authors: Sirani M. Perera

Abstract: In this paper, fast and efficient discrete sine transformation (DST) algorithms are presented based on the factorization of sparse, scaled orthogonal, rotation, rotation-reflection, and butterfly matrices. These algorithms are completely recursive and solely based on DST I-IV. The presented algorithms have low arithmetic cost compared to the known fast DST algorithms. Furthermore, the language of… ▽ More In this paper, fast and efficient discrete sine transformation (DST) algorithms are presented based on the factorization of sparse, scaled orthogonal, rotation, rotation-reflection, and butterfly matrices. These algorithms are completely recursive and solely based on DST I-IV. The presented algorithms have low arithmetic cost compared to the known fast DST algorithms. Furthermore, the language of signal flow graph representation of digital structures is used to describe these efficient and recursive DST algorithms having $(n-1)$ points signal flow graph for DST-I and $n$ points signal flow graphs for DST II-IV. △ Less

Submitted 18 January, 2016; originally announced January 2016.

MSC Class: 15A23; 15B10; 65F50; 65T50; 65Y05; 65Y20; 94A12

arXiv:1509.04535 [pdf, ps, other]

A criterion for p-henselianity in characteristic p

Authors: Zoé Chatzidakis, Milan Perera

Abstract: Let $p$ be a prime. In this paper we give a proof of the followingresult: A valued field $(K,v)$ of characteristic $p \textgreater{} 0$ is$p$-henselian if and only if every element of strictly positivevaluation if of the form $x^p - x$ for some $x \in K$. Let $p$ be a prime. In this paper we give a proof of the followingresult: A valued field $(K,v)$ of characteristic $p \textgreater{} 0$ is$p$-henselian if and only if every element of strictly positivevaluation if of the form $x^p - x$ for some $x \in K$. △ Less

Submitted 15 September, 2015; originally announced September 2015.

arXiv:1503.04106 [pdf, other]

Signal Processing based on Stable radix-2 DCT Algorithms having Orthogonal Factors

Authors: Sirani M. Perera

Abstract: This paper presents stable, radix-2, completely recursive discrete cosine transformation algorithms DCT-I and DCT-III solely based on DCT-I, DCT-II, DCT-III, and DCT-IV having sparse and orthogonal factors. Error bounds for computing the completely recursive DCT-I, DCT-II, DCT-III, and DCT-IV algorithms having sparse and orthogonal factors are addressed. Image compression results are presented bas… ▽ More This paper presents stable, radix-2, completely recursive discrete cosine transformation algorithms DCT-I and DCT-III solely based on DCT-I, DCT-II, DCT-III, and DCT-IV having sparse and orthogonal factors. Error bounds for computing the completely recursive DCT-I, DCT-II, DCT-III, and DCT-IV algorithms having sparse and orthogonal factors are addressed. Image compression results are presented based on the recursive 2D DCT-II and DCT-IV algorithms for image size $512 \times 512$ pixels with transfer block sizes $8 \times 8$, $16 \times 16$, and $32 \times 32$ with $93.75\%$ absence of coefficients in each transfer block. Finally signal flow graphs are demonstrated based on the completely recursive DCT-I, DCT-II, DCT-III, and DCT-IV algorithms having orthogonal factors. △ Less

Submitted 7 August, 2015; v1 submitted 13 March, 2015; originally announced March 2015.

Comments: Older email removed

arXiv:1405.5437 [pdf]

doi 10.1021/nl501275p

High Mobility WSe2 p- and n-Type Field Effect Transistors Contacted by Highly Doped Graphene for Low-Resistance Contacts

Authors: Hsun-Jen Chuang, Xuebin Tan, Nirmal Jeevi Ghimire, Meeghage Madusanka Perera, Bhim Chamlagain, Mark Ming-Cheng Cheng, Jiaqiang Yan, David Mandrus, David Tománek, Zhixian Zhou

Abstract: We report the fabrication of both n-type and p-type WSe2 field effect transistors with hexagonal boron nitride passivated channels and ionic-liquid (IL)-gated graphene contacts. Our transport measurements reveal intrinsic channel properties including a metal-insulator transition at a characteristic conductivity close to the quantum conductance e2/h, a high ON/OFF ratio of >107 at 170 K, and large… ▽ More We report the fabrication of both n-type and p-type WSe2 field effect transistors with hexagonal boron nitride passivated channels and ionic-liquid (IL)-gated graphene contacts. Our transport measurements reveal intrinsic channel properties including a metal-insulator transition at a characteristic conductivity close to the quantum conductance e2/h, a high ON/OFF ratio of >107 at 170 K, and large electron and hole mobility of ~200 cm2V-1s-1 at 160 K. Decreasing the temperature to 77 K increases mobility of electrons to ~330 cm2V-1s-1 and that of holes to ~270 cm2V-1s-1. We attribute our ability to observe the intrinsic, phonon limited conduction in both the electron and hole channels to the drastic reduction of the Schottky barriers between the channel and the graphene contact electrodes using IL gating. We elucidate this process by studying a Schottky diode consisting of a single graphene/WSe2 Schottky junction. Our results indicate the possibility to utilize chemically or electrostatically highly doped graphene for versatile, flexible and transparent low-resistance Ohmic contacts to a wide range of quasi-2D semiconductors. KEYWORDS: MoS2, WSe2, field-effect transistors, graphene, Schottky barrier, ionic-liquid gate △ Less

Submitted 21 May, 2014; originally announced May 2014.

Comments: 28 pages, 6 figures, accepted for publication in Nano Lett. 2014

arXiv:1404.3762 [pdf]

Mobility Improvement and Temperature Dependence in MoSe2 Field-Effect Transistors on Parylene-C Substrate

Authors: Bhim Chamlagain, Qing Li, Nirmal Jeevi Ghimire, Hsun-Jen Chuang, Meeghage Madusanka Perera, Honggen Tu, Yong Xu, Minghu Pan, Di Xaio, Jiaqiang Yan, David Mandrus, Zhixian Zhou

Abstract: We report low temperature scanning tunneling microscopy characterization of MoSe2 crystals, and the fabrication and electrical characterization of MoSe2 field-effect transistors on both SiO2 and parylene-C substrates. We find that the multilayer MoSe2 devices on parylene-C show a room temperature mobility close to the mobility of bulk MoSe2 (100 cm2V-1s-1 - 160 cm2V-1s-1), which is significantly h… ▽ More We report low temperature scanning tunneling microscopy characterization of MoSe2 crystals, and the fabrication and electrical characterization of MoSe2 field-effect transistors on both SiO2 and parylene-C substrates. We find that the multilayer MoSe2 devices on parylene-C show a room temperature mobility close to the mobility of bulk MoSe2 (100 cm2V-1s-1 - 160 cm2V-1s-1), which is significantly higher than that on SiO2 substrate (~50 cm2V-1s-1). The room temperature mobility on both types of substrates are nearly thickness independent. Our variable temperature transport measurements reveal a metal-insulator transition at a characteristic conductivity of e2/h. The mobility of MoSe2 devices extracted from the metallic region on both SiO2 and parylene-C increases up to ~ 500 cm2V-1s-1 as the temperature decreases to ~ 100 K, with the mobility of MoSe2 on SiO2 increasing more rapidly. In spite of the notable variation of charged impurities as indicated by the strongly sample dependent low temperature mobility, the mobility of all MoSe2 devices on SiO2 converges above 200 K, indicating that the high temperature (> 200 K) mobility in these devices is nearly independent of the charged impurities. Our atomic force microscopy study of SiO2 and parylene-C substrates further rule out the surface roughness scattering as a major cause of the substrate dependent mobility. We attribute the observed substrate dependence of MoSe2 mobility primarily to the surface polar optical phonon scattering originating from the SiO2 substrate, which is nearly absent in MoSe2 devices on parylene-C substrate. △ Less

Submitted 14 April, 2014; originally announced April 2014.

Comments: 28 pages, 6 figures, accepted for publication in ACS Nano, 2014

arXiv:1401.1874 [pdf, ps, other]

A Fast Algorithm for the Inversion of Quasiseparable Vandermonde-like Matrices

Authors: Sirani M. Perera, Grigory Bonik, Vadim Olshevsky

Abstract: The results on Vandermonde-like matrices were introduced as a generalization of polynomial Vandermonde matrices, and the displacement structure of these matrices was used to derive an inversion formula. In this paper we first present a fast Gaussian elimination algorithm for the polynomial Vandermonde-like matrices. Later we use the said algorithm to derive fast inversion algorithms for quasisepar… ▽ More The results on Vandermonde-like matrices were introduced as a generalization of polynomial Vandermonde matrices, and the displacement structure of these matrices was used to derive an inversion formula. In this paper we first present a fast Gaussian elimination algorithm for the polynomial Vandermonde-like matrices. Later we use the said algorithm to derive fast inversion algorithms for quasiseparable, semiseparable and well-free Vandermonde-like matrices having $\mathcal{O}(n^2)$ complexity. To do so we identify structures of displacement operators in terms of generators and the recurrence relations(2-term and 3-term) between the columns of the basis transformation matrices for quasiseparable, semiseparable and well-free polynomials. Finally we present an $\mathcal{O}(n^2)$ algorithm to compute the inversion of quasiseparable Vandermonde-like matrices. △ Less

Submitted 8 January, 2014; originally announced January 2014.

MSC Class: 15A09; 15B05; 65Y04

arXiv:1304.4669 [pdf]

Improved Carrier Mobility in Few-Layer MoS2 Field-Effect Transistors with Ionic-Liquid Gating

Authors: Meeghage Madusanka Perera, Ming-Wei Lin, Hsun-Jen Chuang, Bhim Prasad Chamlagain, Chongyu Wang, Xuebin Tan, Mark Ming-Cheng Cheng, David Tománek, Zhixian Zhou

Abstract: We report the fabrication of ionic liquid (IL) gated field-effect transistors (FETs) consisting of bilayer and few-layer MoS2. Our transport measurements indicate that the electron mobility about 60 cm2V-1s-1 at 250 K in ionic liquid gated devices exceeds significantly that of comparable back-gated devices. IL-FETs display a mobility increase from about 100 cm2V-1s-1 at 180 K to about 220 cm2V-1s-… ▽ More We report the fabrication of ionic liquid (IL) gated field-effect transistors (FETs) consisting of bilayer and few-layer MoS2. Our transport measurements indicate that the electron mobility about 60 cm2V-1s-1 at 250 K in ionic liquid gated devices exceeds significantly that of comparable back-gated devices. IL-FETs display a mobility increase from about 100 cm2V-1s-1 at 180 K to about 220 cm2V-1s-1 at 77 K in good agreement with the true channel mobility determined from four-terminal measurements, ambipolar behavior with a high ON/OFF ratio >107 (104) for electrons (holes), and a near ideal sub-threshold swing of about 50 mV/dec at 250 K. We attribute the observed performance enhancement, specifically the increased carrier mobility that is limited by phonons, to the reduction of the Schottky barrier at the source and drain electrode by band bending caused by the ultrathin ionic-liquid dielectric layer. △ Less

Submitted 16 April, 2013; originally announced April 2013.

Comments: 29 pages, 7 figures, ACS Nano, ASAP, 2013

Showing 1–45 of 45 results for author: Perera, M