-
Vision Transformer Computation and Resilience for Dynamic Inference
Authors:
Kavya Sreedhar,
Jason Clemons,
Rangharajan Venkatesan,
Stephen W. Keckler,
Mark Horowitz
Abstract:
State-of-the-art deep learning models for computer vision tasks are based on the transformer architecture and often deployed in real-time applications. In this scenario, the resources available for every inference can vary, so it is useful to be able to dynamically adapt execution to trade accuracy for efficiency. To create dynamic models, we leverage the resilience of vision transformers to pruni…
▽ More
State-of-the-art deep learning models for computer vision tasks are based on the transformer architecture and often deployed in real-time applications. In this scenario, the resources available for every inference can vary, so it is useful to be able to dynamically adapt execution to trade accuracy for efficiency. To create dynamic models, we leverage the resilience of vision transformers to pruning and switch between different scaled versions of a model. Surprisingly, we find that most FLOPs are generated by convolutions, not attention. These relative FLOP counts are not a good predictor of GPU performance since GPUs have special optimizations for convolutions. Some models are fairly resilient and their model execution can be adapted without retraining, while all models achieve better accuracy with retraining alternative execution paths. These insights mean that we can leverage CNN accelerators and these alternative execution paths to enable efficient and dynamic vision transformer inference. Our analysis shows that leveraging this type of dynamic execution can lead to saving 28\% of energy with a 1.4\% accuracy drop for SegFormer (63 GFLOPs), with no additional training, and 53\% of energy for ResNet-50 (4 GFLOPs) with a 3.3\% accuracy drop by switching between pretrained Once-For-All models.
△ Less
Submitted 15 April, 2024; v1 submitted 5 December, 2022;
originally announced December 2022.
-
Optimal Clip** and Magnitude-aware Differentiation for Improved Quantization-aware Training
Authors:
Charbel Sakr,
Steve Dai,
Rangharajan Venkatesan,
Brian Zimmer,
William J. Dally,
Brucek Khailany
Abstract:
Data clip** is crucial in reducing noise in quantization operations and improving the achievable accuracy of quantization-aware training (QAT). Current practices rely on heuristics to set clip** threshold scalars and cannot be shown to be optimal. We propose Optimally Clipped Tensors And Vectors (OCTAV), a recursive algorithm to determine MSE-optimal clip** scalars. Derived from the fast New…
▽ More
Data clip** is crucial in reducing noise in quantization operations and improving the achievable accuracy of quantization-aware training (QAT). Current practices rely on heuristics to set clip** threshold scalars and cannot be shown to be optimal. We propose Optimally Clipped Tensors And Vectors (OCTAV), a recursive algorithm to determine MSE-optimal clip** scalars. Derived from the fast Newton-Raphson method, OCTAV finds optimal clip** scalars on the fly, for every tensor, at every iteration of the QAT routine. Thus, the QAT algorithm is formulated with provably minimum quantization noise at each step. In addition, we reveal limitations in common gradient estimation techniques in QAT and propose magnitude-aware differentiation as a remedy to further improve accuracy. Experimentally, OCTAV-enabled QAT achieves state-of-the-art accuracy on multiple tasks. These include training-from-scratch and retraining ResNets and MobileNets on ImageNet, and Squad fine-tuning using BERT models, where OCTAV-enabled QAT consistently preserves accuracy at low precision (4-to-6-bits). Our results require no modifications to the baseline training recipe, except for the insertion of quantization operations where appropriate.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
PDL Impact on Linearly Coded Digital Phase Conjugation Techniques in CO-OFDM Systems
Authors:
O. S. Sunish Kumar,
A. Amari,
O. A. Dobre,
R. Venkatesan
Abstract:
We investigate the impact of polarization-dependent loss (PDL) on the linearly coded digital phase conjugation (DPC) techniques in coherent optical orthogonal frequency division multiplexing (CO-OFDM) superchannel systems. We consider two DPC approaches: one uses orthogonal polarizations to transmit the linearly coded signal and its phase conjugate, while the other uses two orthogonal time slots o…
▽ More
We investigate the impact of polarization-dependent loss (PDL) on the linearly coded digital phase conjugation (DPC) techniques in coherent optical orthogonal frequency division multiplexing (CO-OFDM) superchannel systems. We consider two DPC approaches: one uses orthogonal polarizations to transmit the linearly coded signal and its phase conjugate, while the other uses two orthogonal time slots of the same polarization. We compare the performances of these DPC approaches by considering both aligned- and statistical-PDL models. The investigation with aligned-PDL model indicates that the latter approach is more tolerant to PDL-induced distortions when compared to the former. Furthermore, the study using statistical-PDL model shows that the outage probability of the latter approach tends to zero at a root mean square PDL value of 3.6 dB. On the other hand, the former shows an outage probability of 0.63 for the same PDL value.
△ Less
Submitted 27 June, 2021;
originally announced June 2021.
-
Second-Order Perturbation Theory-Based Digital Predistortion for Fiber Nonlinearity Compensation
Authors:
O. S. Sunish Kumar,
A. Amari,
O. A. Dobre,
R. Venkatesan
Abstract:
The first-order (FO) perturbation theory-based nonlinearity compensation (PB-NLC) technique has been widely investigated to combat the detrimental effects of the intra-channel Kerr nonlinearity in polarization-multiplexed (Pol-Mux) optical fiber communication systems. However, the NLC performance of the FO-PB-NLC technique is significantly limited in highly nonlinear regimes of the Pol-Mux long-ha…
▽ More
The first-order (FO) perturbation theory-based nonlinearity compensation (PB-NLC) technique has been widely investigated to combat the detrimental effects of the intra-channel Kerr nonlinearity in polarization-multiplexed (Pol-Mux) optical fiber communication systems. However, the NLC performance of the FO-PB-NLC technique is significantly limited in highly nonlinear regimes of the Pol-Mux long-haul optical transmission systems. In this paper, we extend the FO theory to second-order (SO) to improve the NLC performance. This technique is referred to as the SO-PB-NLC. A detailed theoretical analysis is performed to derive the SO perturbative field for a Pol-Mux optical transmission system. Following that, we investigate a few simplifying assumptions to reduce the implementation complexity of the SO-PB-NLC technique. The numerical simulations for a single-channel system show that the SO-PB-NLC technique provides an improved bit-error-rate performance and increases the transmission reach, in comparison with the FO-PB-NLC technique. The complexity analysis demonstrates that the proposed SO-PB-NLC technique has a reduced computational complexity when compared to the digital back-propagation with one step per span.
△ Less
Submitted 27 June, 2021;
originally announced June 2021.
-
A Joint Technique for Nonlinearity Compensation in CO-OFDM Superchannel Systems
Authors:
O. S. Sunish Kumar,
A. Amari,
O. A. Dobre,
R. Venkatesan,
S. K. Wilson
Abstract:
We propose a technique combining the singlechannel digital-back-propagation (SC-DBP) with phaseconjugated-twin-wave (PCTW) to compensate nonlinearities in CO-OFDM superchannel systems. This exhibits a similar performance as multi-channel DBP while providing increased transmission reach compared to SC-DBP, PCTW, and linear dispersion compensation (LDC).
We propose a technique combining the singlechannel digital-back-propagation (SC-DBP) with phaseconjugated-twin-wave (PCTW) to compensate nonlinearities in CO-OFDM superchannel systems. This exhibits a similar performance as multi-channel DBP while providing increased transmission reach compared to SC-DBP, PCTW, and linear dispersion compensation (LDC).
△ Less
Submitted 27 June, 2021;
originally announced June 2021.
-
A Spectrally Efficient Linear Polarization Coding Scheme for Fiber Nonlinearity Compensation in CO-OFDM Systems
Authors:
O. S. Sunish Kumar,
O. A. Dobre,
R. Venkatesan,
S. K. Wilson,
O. Omomukuyo,
A. Amari,
D. Chang
Abstract:
In this paper, we propose a linear polarization coding scheme (LPC) combined with the phase conjugated twin signals (PCTS) technique, referred to as LPC-PCTS, for fiber nonlinearity mitigation in coherent optical orthogonal frequency division multiplexing (CO-OFDM) systems. The LPC linearly combines the data symbols on the adjacent subcarriers of the OFDM symbol, one at full amplitude and the othe…
▽ More
In this paper, we propose a linear polarization coding scheme (LPC) combined with the phase conjugated twin signals (PCTS) technique, referred to as LPC-PCTS, for fiber nonlinearity mitigation in coherent optical orthogonal frequency division multiplexing (CO-OFDM) systems. The LPC linearly combines the data symbols on the adjacent subcarriers of the OFDM symbol, one at full amplitude and the other at half amplitude. The linearly coded data is then transmitted as phase conjugate pairs on the same subcarriers of the two OFDM symbols on the two orthogonal polarizations. The nonlinear distortions added to these subcarriers are essentially anti-correlated, since they carry phase conjugate pairs of data. At the receiver, the coherent superposition of the information symbols received on these pairs of subcarriers eventually leads to the cancellation of the nonlinear distortions. We conducted numerical simulation of a single channel 200 Gb/s CO-OFDM system employing the LPCPCTS technique. The results show that a Q-factor improvement of 2.3 dB and 1.7 dB with and without the dispersion symmetry, respectively, when compared to the recently proposed phase conjugated subcarrier coding (PCSC) technique, at an average launch power of 3 dBm. In addition, our proposed LPCPCTS technique shows a significant performance improvement when compared to the 16-quadrature amplitude modulation (QAM) with phase conjugated twin waves (PCTW) scheme, at the same spectral efficiency, for an uncompensated transmission distance of 2800 km.
△ Less
Submitted 27 June, 2021;
originally announced June 2021.
-
LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
Authors:
Jiawei Zhao,
Steve Dai,
Rangharajan Venkatesan,
Brian Zimmer,
Mustafa Ali,
Ming-Yu Liu,
Brucek Khailany,
Bill Dally,
Anima Anandkumar
Abstract:
Representing deep neural networks (DNNs) in low-precision is a promising approach to enable efficient acceleration and memory reduction. Previous methods that train DNNs in low-precision typically keep a copy of weights in high-precision during the weight updates. Directly training with low-precision weights leads to accuracy degradation due to complex interactions between the low-precision number…
▽ More
Representing deep neural networks (DNNs) in low-precision is a promising approach to enable efficient acceleration and memory reduction. Previous methods that train DNNs in low-precision typically keep a copy of weights in high-precision during the weight updates. Directly training with low-precision weights leads to accuracy degradation due to complex interactions between the low-precision number systems and the learning algorithms. To address this issue, we develop a co-designed low-precision training framework, termed LNS-Madam, in which we jointly design a logarithmic number system (LNS) and a multiplicative weight update algorithm (Madam). We prove that LNS-Madam results in low quantization error during weight updates, leading to stable performance even if the precision is limited. We further propose a hardware design of LNS-Madam that resolves practical challenges in implementing an efficient datapath for LNS computations. Our implementation effectively reduces energy overhead incurred by LNS-to-integer conversion and partial sum accumulation. Experimental results show that LNS-Madam achieves comparable accuracy to full-precision counterparts with only 8 bits on popular computer vision and natural language tasks. Compared to FP32 and FP8, LNS-Madam reduces the energy consumption by over 90% and 55%, respectively.
△ Less
Submitted 23 August, 2022; v1 submitted 25 June, 2021;
originally announced June 2021.
-
Softermax: Hardware/Software Co-Design of an Efficient Softmax for Transformers
Authors:
Jacob R. Stevens,
Rangharajan Venkatesan,
Steve Dai,
Brucek Khailany,
Anand Raghunathan
Abstract:
Transformers have transformed the field of natural language processing. This performance is largely attributed to the use of stacked self-attention layers, each of which consists of matrix multiplies as well as softmax operations. As a result, unlike other neural networks, the softmax operation accounts for a significant fraction of the total run-time of Transformers. To address this, we propose S…
▽ More
Transformers have transformed the field of natural language processing. This performance is largely attributed to the use of stacked self-attention layers, each of which consists of matrix multiplies as well as softmax operations. As a result, unlike other neural networks, the softmax operation accounts for a significant fraction of the total run-time of Transformers. To address this, we propose Softermax, a hardware-friendly softmax design. Softermax consists of base replacement, low-precision softmax computations, and an online normalization calculation. We show Softermax results in 2.35x the energy efficiency at 0.90x the size of a comparable baseline, with negligible impact on network accuracy.
△ Less
Submitted 16 March, 2021;
originally announced March 2021.
-
Verifying High-Level Latency-Insensitive Designs with Formal Model Checking
Authors:
Steve Dai,
Alicia Klinefelter,
Haoxing Ren,
Rangharajan Venkatesan,
Ben Keller,
Nathaniel Pinckney,
Brucek Khailany
Abstract:
Latency-insensitive design mitigates increasing interconnect delay and enables productive component reuse in complex digital systems. This design style has been adopted in high-level design flows because untimed functional blocks connected through latency-insensitive interfaces provide a natural communication abstraction. However, latency-insensitive design with high-level languages also introduce…
▽ More
Latency-insensitive design mitigates increasing interconnect delay and enables productive component reuse in complex digital systems. This design style has been adopted in high-level design flows because untimed functional blocks connected through latency-insensitive interfaces provide a natural communication abstraction. However, latency-insensitive design with high-level languages also introduces a unique set of verification challenges that jeopardize functional correctness. In particular, bugs due to invalid consumption of inputs and deadlocks can be difficult to detect and debug with dynamic simulation methods. To tackle these two classes of bugs, we propose formal model checking methods to guarantee that a high-level latency-insensitive design is unaffected by invalid input data and is free of deadlock. We develop a well-structured verification wrapper for each property to automatically construct the corresponding formal model for checking. Our experiments demonstrate that the formal checks are effective in realistic bug scenarios from high-level designs.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
Authors:
Steve Dai,
Rangharajan Venkatesan,
Haoxing Ren,
Brian Zimmer,
William J. Dally,
Brucek Khailany
Abstract:
Quantization enables efficient acceleration of deep neural networks by reducing model memory footprint and exploiting low-cost integer math hardware units. Quantization maps floating-point weights and activations in a trained model to low-bitwidth integer values using scale factors. Excessive quantization, reducing precision too aggressively, results in accuracy degradation. When scale factors are…
▽ More
Quantization enables efficient acceleration of deep neural networks by reducing model memory footprint and exploiting low-cost integer math hardware units. Quantization maps floating-point weights and activations in a trained model to low-bitwidth integer values using scale factors. Excessive quantization, reducing precision too aggressively, results in accuracy degradation. When scale factors are shared at a coarse granularity across many dimensions of each tensor, effective precision of individual elements within the tensor are limited. To reduce quantization-related accuracy loss, we propose using a separate scale factor for each small vector of ($\approx$16-64) elements within a single dimension of a tensor. To achieve an efficient hardware implementation, the per-vector scale factors can be implemented with low-bitwidth integers when calibrated using a two-level quantization scheme. We find that per-vector scaling consistently achieves better inference accuracy at low precision compared to conventional scaling techniques for popular neural networks without requiring retraining. We also modify a deep learning accelerator hardware design to study the area and energy overheads of per-vector scaling support. Our evaluation demonstrates that per-vector scaled quantization with 4-bit weights and activations achieves 37% area saving and 24% energy saving while maintaining over 75% accuracy for ResNet50 on ImageNet. 4-bit weights and 8-bit activations achieve near-full-precision accuracy for both BERT-base and BERT-large on SQuAD while reducing area by 26% compared to an 8-bit baseline.
△ Less
Submitted 8 February, 2021;
originally announced February 2021.
-
Evaluating the Effectiveness of Efficient Neural Architecture Search for Sentence-Pair Tasks
Authors:
Ansel MacLaughlin,
Jwala Dhamala,
Anoop Kumar,
Sriram Venkatapathy,
Ragav Venkatesan,
Rahul Gupta
Abstract:
Neural Architecture Search (NAS) methods, which automatically learn entire neural model or individual neural cell architectures, have recently achieved competitive or state-of-the-art (SOTA) performance on variety of natural language processing and computer vision tasks, including language modeling, natural language inference, and image classification. In this work, we explore the applicability of…
▽ More
Neural Architecture Search (NAS) methods, which automatically learn entire neural model or individual neural cell architectures, have recently achieved competitive or state-of-the-art (SOTA) performance on variety of natural language processing and computer vision tasks, including language modeling, natural language inference, and image classification. In this work, we explore the applicability of a SOTA NAS algorithm, Efficient Neural Architecture Search (ENAS) (Pham et al., 2018) to two sentence pair tasks, paraphrase detection and semantic textual similarity. We use ENAS to perform a micro-level search and learn a task-optimized RNN cell architecture as a drop-in replacement for an LSTM. We explore the effectiveness of ENAS through experiments on three datasets (MRPC, SICK, STS-B), with two different models (ESIM, BiLSTM-Max), and two sets of embeddings (Glove, BERT). In contrast to prior work applying ENAS to NLP tasks, our results are mixed -- we find that ENAS architectures sometimes, but not always, outperform LSTMs and perform similarly to random architecture search.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Intra-Channel Nonlinearity Compensation Based on Second-Order Perturbation Theory
Authors:
O. S. Sunish Kumar,
A. Amari,
O. A. Dobre,
R. Venkatesan
Abstract:
The first-order (FO) perturbation theory has been widely investigated to design the digital nonlinearity compensation (NLC) technique to deal with the intra-channel fiber nonlinearity effect in coherent optical communication systems. The main advantages of the perturbation theory-based approach are the possibility of the implementation on a single stage for the entire fiber link and one sample per…
▽ More
The first-order (FO) perturbation theory has been widely investigated to design the digital nonlinearity compensation (NLC) technique to deal with the intra-channel fiber nonlinearity effect in coherent optical communication systems. The main advantages of the perturbation theory-based approach are the possibility of the implementation on a single stage for the entire fiber link and one sample per symbol operation. In this paper, we propose to extend the FO perturbation theory-based NLC (FO-PB-NLC) technique to the second-order (SO), referred to as the SO-PB-NLC, to enhance the NLC performance. We present a comprehensive theoretical analysis for the derivation of the SO nonlinear distortion field, which is the foundation for the SO-PB-NLC technique. Through numerical simulations, we show that the proposed SO-PB-NLC technique significantly enhances the NLC performance and the maximum transmission reach when compared to the FO-PB-NLC technique. Then, the performance of the SO-PB-NLC technique is compared with that of the benchmark digital back-propagation (DBP).
△ Less
Submitted 3 May, 2020;
originally announced May 2020.
-
Out-of-the-box channel pruned networks
Authors:
Ragav Venkatesan,
Gurumurthy Swaminathan,
Xiong Zhou,
Anna Luo
Abstract:
In the last decade convolutional neural networks have become gargantuan. Pre-trained models, when used as initializers are able to fine-tune ever larger networks on small datasets. Consequently, not all the convolutional features that these fine-tuned models detect are requisite for the end-task. Several works of channel pruning have been proposed to prune away compute and memory from models that…
▽ More
In the last decade convolutional neural networks have become gargantuan. Pre-trained models, when used as initializers are able to fine-tune ever larger networks on small datasets. Consequently, not all the convolutional features that these fine-tuned models detect are requisite for the end-task. Several works of channel pruning have been proposed to prune away compute and memory from models that were trained already. Typically, these involve policies that decide which and how many channels to remove from each layer leading to channel-wise and/or layer-wise pruning profiles, respectively. In this paper, we conduct several baseline experiments and establish that profiles from random channel-wise pruning policies are as good as metric-based ones. We also establish that there may exist profiles from some layer-wise pruning policies that are measurably better than common baselines. We then demonstrate that the top layer-wise pruning profiles found using an exhaustive random search from one datatset are also among the top profiles for other datasets. This implies that we could identify out-of-the-box layer-wise pruning profiles using benchmark datasets and use these directly for new datasets. Furthermore, we develop a Reinforcement Learning (RL) policy-based search algorithm with a direct objective of finding transferable layer-wise pruning profiles using many models for the same architecture. We use a novel reward formulation that drives this RL search towards an expected compression while maximizing accuracy. Our results show that our transferred RL-based profiles are as good or better than best profiles found on the original dataset via exhaustive search. We then demonstrate that if we found the profiles using a mid-sized dataset such as Cifar10/100, we are able to transfer them to even a large dataset such as Imagenet.
△ Less
Submitted 30 April, 2020;
originally announced April 2020.
-
Fiber Nonlinearity Mitigation via the Parzen Window Classifier for Dispersion Managed and Unmanaged Links
Authors:
Abdelkerim Amari,
Xiang Lin,
Octavia A. Dobre,
Ramachandran Venkatesan,
Alex Alvarado
Abstract:
Machine learning techniques have recently received significant attention as promising approaches to deal with the optical channel impairments, and in particular, the nonlinear effects. In this work, a machine learning-based classification technique, known as the Parzen window (PW) classifier, is applied to mitigate the nonlinear effects in the optical channel. The PW classifier is used as a detect…
▽ More
Machine learning techniques have recently received significant attention as promising approaches to deal with the optical channel impairments, and in particular, the nonlinear effects. In this work, a machine learning-based classification technique, known as the Parzen window (PW) classifier, is applied to mitigate the nonlinear effects in the optical channel. The PW classifier is used as a detector with improved nonlinear decision boundaries more adapted to the nonlinear fiber channel. Performance improvement is observed when applying the PW in the context of dispersion managed and dispersion unmanaged systems.
△ Less
Submitted 17 September, 2019;
originally announced September 2019.
-
$d$-SNE: Domain Adaptation using Stochastic Neighborhood Embedding
Authors:
Xiang Xu,
Xiong Zhou,
Ragav Venkatesan,
Gurumurthy Swaminathan,
Orchid Majumder
Abstract:
Deep neural networks often require copious amount of labeled-data to train their scads of parameters. Training larger and deeper networks is hard without appropriate regularization, particularly while using a small dataset. Laterally, collecting well-annotated data is expensive, time-consuming and often infeasible. A popular way to regularize these networks is to simply train the network with more…
▽ More
Deep neural networks often require copious amount of labeled-data to train their scads of parameters. Training larger and deeper networks is hard without appropriate regularization, particularly while using a small dataset. Laterally, collecting well-annotated data is expensive, time-consuming and often infeasible. A popular way to regularize these networks is to simply train the network with more data from an alternate representative dataset. This can lead to adverse effects if the statistics of the representative dataset are dissimilar to our target. This predicament is due to the problem of domain shift. Data from a shifted domain might not produce bespoke features when a feature extractor from the representative domain is used. In this paper, we propose a new technique ($d$-SNE) of domain adaptation that cleverly uses stochastic neighborhood embedding techniques and a novel modified-Hausdorff distance. The proposed technique is learnable end-to-end and is therefore, ideally suited to train neural networks. Extensive experiments demonstrate that $d$-SNE outperforms the current states-of-the-art and is robust to the variances in different datasets, even in the one-shot and semi-supervised learning settings. $d$-SNE also demonstrates the ability to generalize to multiple domains concurrently.
△ Less
Submitted 29 May, 2019;
originally announced May 2019.
-
Detection of excited state absorption cross-section of porphyrin through cw and femto-second laser pump-probe technique
Authors:
A. Srinivasa Rao,
Alok Sharan,
N Venkatramaiah,
R Venkatesan
Abstract:
We report on direct detection of excited states absorption cross-section using dual wavelength pump-probe technique. Also, we experimentally demonstrate using porphyrin composite molecules (porphyrin derivatives such as 5,10,15,20-meso-tetrakis phenyl porphyrin (H2TPP), 5,10,15,20 - meso-tetrakis(4-hydroxyphenyl) porphyrin (H2TPP(OH)4)). The cw laser at 761 nm wavelength is used as a pump to maint…
▽ More
We report on direct detection of excited states absorption cross-section using dual wavelength pump-probe technique. Also, we experimentally demonstrate using porphyrin composite molecules (porphyrin derivatives such as 5,10,15,20-meso-tetrakis phenyl porphyrin (H2TPP), 5,10,15,20 - meso-tetrakis(4-hydroxyphenyl) porphyrin (H2TPP(OH)4)). The cw laser at 761 nm wavelength is used as a pump to maintain excited state population. Changes in the population of excited states lead to the change in transmission are monitored using femto-second probe pulses of 130 fs width and repeated at a 1kHz rate with central wavelength around 800 nm. Transmittance changes due to excited state population are modeled using rate equation approach. The effect of the absorption on the transmitted pulse shape has been discussed as a function of fluence. Obtained excited state absorption cross-sections of H2TPP and H2TPP(OH)4 doped boric acid glass (BAG) films are 4.9X10-18 cm2 and 1.2X10-17 cm2 respectively.
△ Less
Submitted 5 March, 2019;
originally announced March 2019.
-
A Machine Learning-Based Detection Technique for Optical Fiber Nonlinearity Mitigation
Authors:
Abdelkerim Amari,
Xiang Lin,
Octavia A. Dobre,
Ramachandran Venkatesan,
Alex Alvarado
Abstract:
We investigate the performance of a machine learning classification technique, called the Parzen window, to mitigate the fiber nonlinearity in the context of dispersion managed and dispersion unmanaged systems. The technique is applied for detection at the receiver side, and deals with the non-Gaussian nonlinear effects by designing improved decision boundaries. We also propose a two-stage mitigat…
▽ More
We investigate the performance of a machine learning classification technique, called the Parzen window, to mitigate the fiber nonlinearity in the context of dispersion managed and dispersion unmanaged systems. The technique is applied for detection at the receiver side, and deals with the non-Gaussian nonlinear effects by designing improved decision boundaries. We also propose a two-stage mitigation technique using digital back propagation and Parzen window for dispersion unmanaged systems. In this case, digital back propagation compensates for the deterministic nonlinearity and the Parzen window deals with the stochastic nonlinear signal-noise interactions, which are not taken into account by digital back propagation. A performance improvement up to 0:4 dB in terms of Q factor is observed.
△ Less
Submitted 6 March, 2019; v1 submitted 27 February, 2019;
originally announced March 2019.
-
Bright magnetic dipole radiation from two-dimensional lead-halide perovskites
Authors:
Ryan A. DeCrescent,
Naveen R. Venkatesan,
Clayton J. Dahlman,
Rhys M. Kennard,
Xie Zhang,
Wenhao Li,
Xinhong Du,
Michael L. Chabinyc,
Rashid Zia,
Jon A. Schuller
Abstract:
Light-matter interactions in semiconductor systems are uniformly treated within the electric dipole (ED) approximation, as multipolar interactions are considered "forbidden". Here, we demonstrate that this approximation inadequately describes light emission in novel two-dimensional hybrid organic-inorganic perovskite materials (2D HOIPs) --- a class of solution processable layered semiconductor wi…
▽ More
Light-matter interactions in semiconductor systems are uniformly treated within the electric dipole (ED) approximation, as multipolar interactions are considered "forbidden". Here, we demonstrate that this approximation inadequately describes light emission in novel two-dimensional hybrid organic-inorganic perovskite materials (2D HOIPs) --- a class of solution processable layered semiconductor with promising optoelectronic properties. Consequently, photoluminescence (PL) spectra become strongly dependent on the experimental geometry, a fact that is often overlooked, though critical for correct optical characterization of materials. Using energy-momentum and time-resolved spectroscopies, we experimentally demonstrate that low-energy sideband emission in 2D HOIPs exhibits a highly unusual, multipolar polarization and angle dependence. Using combined electromagnetic and quantum-mechanical analyses, we attribute this radiation pattern to an out-of-plane oriented magnetic dipole transition arising from the 2D character of the excited and ground state orbitals. Symmetry arguments point toward the presence of significant inversion symmetry-breaking mechanisms that are currently under great debate. These results provide a new perspective on the origins of unexpected sideband emission in HOIPs, clarify discrepancies in previous literature, and generally challenge the paradigm of ED-dominated light-matter interactions in novel optoelectronic materials.
△ Less
Submitted 15 April, 2019; v1 submitted 16 January, 2019;
originally announced January 2019.
-
Rigorous Analysis of a Randomised Number Field Sieve
Authors:
Jonathan Lee,
Ramarathnam Venkatesan
Abstract:
Factorisation of integers $n$ is of number theoretic and cryptographic significance. The Number Field Sieve (NFS) introduced circa 1990, is still the state of the art algorithm, but no rigorous proof that it halts or generates relationships is known. We propose and analyse an explicitly randomised variant. For each $n$, we show that these randomised variants of the NFS and Coppersmith's multiple p…
▽ More
Factorisation of integers $n$ is of number theoretic and cryptographic significance. The Number Field Sieve (NFS) introduced circa 1990, is still the state of the art algorithm, but no rigorous proof that it halts or generates relationships is known. We propose and analyse an explicitly randomised variant. For each $n$, we show that these randomised variants of the NFS and Coppersmith's multiple polynomial sieve find congruences of squares in expected times matching the best-known heuristic estimates.
△ Less
Submitted 22 May, 2018;
originally announced May 2018.
-
Nonlinear Spectroscopic Study of Porphyrin Under cw and Femto-second Laser Pulse Excitation
Authors:
A. Srinivasa Rao,
Alok Sharan,
N Venkatramaiah,
R Venkatesan
Abstract:
Single Beam Transmittance (SBT) was used as nonlinear spectroscopic tool to investigate the absorption cross-sections and lifetimes of Tetra Phenyl porphyrin (H2TPP) and its OH- group derivative (H2TPP(OH)4) doped in boric acid glass (BAG). We have used 671 nm wavelength as exciting wavelength for both CW (incident intensity up to 1010 W/cm2) and femto-second laser pulse (up to fluence of 102 mJ/c…
▽ More
Single Beam Transmittance (SBT) was used as nonlinear spectroscopic tool to investigate the absorption cross-sections and lifetimes of Tetra Phenyl porphyrin (H2TPP) and its OH- group derivative (H2TPP(OH)4) doped in boric acid glass (BAG). We have used 671 nm wavelength as exciting wavelength for both CW (incident intensity up to 1010 W/cm2) and femto-second laser pulse (up to fluence of 102 mJ/cm2). Under cw laser excitation, H2TPP doped BAG demonstrates Double Saturable Absorber (DSA) behavior whereas H2TPP(OH)4 doped BAG act as Revere Saturable Absorber (RSA). Rate equation model espouses to extract the spectroscopic parameters from the experimental data. Excited state life times and absorption cross-sections were obtained as parameters for theoretical fit on SBT data. Porphyrin molecules act as four level systems under cw laser excitation, whereas in the presence of femto-second laser excitation they act as a two level system. We have derived the equations for transmitted energy through the material in the presence of femto-second laser illumination. Both systems viz., H2TPP and H2TPP(OH)4 doped BAG behaves as saturable absorbers when excited by femto-second laser pulses.
△ Less
Submitted 1 April, 2018;
originally announced April 2018.
-
Simple sampling clock synchronisation scheme for reduced-guard-interval coherent optical OFDM systems
Authors:
Oluyemi Omomukuyo,
Deyuan Chang,
Octavia A. Dobre,
Ramachandran Venkatesan,
Telex M. N. Ngatched
Abstract:
A simple data-aided scheme for sampling clock synchronisation in reduced-guard-interval coherent optical orthogonal frequency division multiplexing (RGI-CO-OFDM) systems is proposed. In the proposed scheme, the sampling clock offset (SCO) is estimated by using the training symbols reserved for channel estimation, thus avoiding extra training overhead. The SCO is then compensated by resampling, usi…
▽ More
A simple data-aided scheme for sampling clock synchronisation in reduced-guard-interval coherent optical orthogonal frequency division multiplexing (RGI-CO-OFDM) systems is proposed. In the proposed scheme, the sampling clock offset (SCO) is estimated by using the training symbols reserved for channel estimation, thus avoiding extra training overhead. The SCO is then compensated by resampling, using a time-domain interpolation filter. The feasibility of the proposed scheme is demonstrated by means of numerical simulations in a 32-Gbaud 16-QAM dual-polarisation RGI-CO-OFDM system.
△ Less
Submitted 4 January, 2018;
originally announced January 2018.
-
Joint timing and frequency synchronization based on weighted CAZAC sequences for reduced-guard-interval CO-OFDM systems
Authors:
Oluyemi Omomukuyo,
Deyuan Chang,
**gwen Zhu,
Octavia Dobre,
Ramachandran Venkatesan,
Telex Ngatched,
Chuck Rumbolt
Abstract:
A novel joint symbol timing and carrier frequency offset (CFO) estimation algorithm is proposed for reduced-guard-interval coherent optical orthogonal frequency-division multiplexing (RGI-CO-OFDM) systems. The proposed algorithm is based on a constant amplitude zero autocorrelation (CAZAC) sequence weighted by a pseudo-random noise (PN) sequence. The symbol timing is accomplished by using only one…
▽ More
A novel joint symbol timing and carrier frequency offset (CFO) estimation algorithm is proposed for reduced-guard-interval coherent optical orthogonal frequency-division multiplexing (RGI-CO-OFDM) systems. The proposed algorithm is based on a constant amplitude zero autocorrelation (CAZAC) sequence weighted by a pseudo-random noise (PN) sequence. The symbol timing is accomplished by using only one training symbol of two identical halves, with the weighting applied to the second half. The special structure of the training symbol is also utilized in estimating the CFO. The performance of the proposed algorithm is demonstrated by means of numerical simulations in a 115.8-Gb/s 16-QAM RGI-CO-OFDM system.
△ Less
Submitted 4 January, 2018;
originally announced January 2018.
-
Robust Frame and Frequency Synchronization Based on Alamouti Coding for RGI-CO-OFDM
Authors:
Oluyemi Omomukuyo,
Deyuan Chang,
Octavia Dobre,
Ramachandran Venkatesan,
Telex M. N. Ngatched
Abstract:
We propose an algorithm for carrying out joint frame and frequency synchronization in reduced-guard-interval coherent optical orthogonal frequency division multiplexing (RGI-CO-OFDM) systems. The synchronization is achieved by using the same training symbols (TS) employed for training-aided channel estimation (TA-CE), thereby avoiding additional training overhead. The proposed algorithm is designe…
▽ More
We propose an algorithm for carrying out joint frame and frequency synchronization in reduced-guard-interval coherent optical orthogonal frequency division multiplexing (RGI-CO-OFDM) systems. The synchronization is achieved by using the same training symbols (TS) employed for training-aided channel estimation (TA-CE), thereby avoiding additional training overhead. The proposed algorithm is designed for polarization division multiplexing (PDM) RGI-CO-OFDM systems that use the Alamouti-type polarization-time coding for TA-CE. Due to their optimal TA-CE performance, Golay complementary sequences have been used as the TS in the proposed algorithm. The frame synchronization is accomplished by exploiting the cross-correlation between the received TS from the two orthogonal polarizations. The arrangement of the TS is also used to estimate the carrier frequency offset. Simulation results of a PDM RGI-CO-OFDM system operating at 238.1 Gb/s data rate (197.6-Gb/s after coding), with a total overhead of 9.2% (31.6% after coding), show that the proposed scheme has accurate synchronization, and is robust to linear fiber impairments.
△ Less
Submitted 4 January, 2018;
originally announced January 2018.
-
Bandwidth-Efficient Synchronization for Fiber Optic Transmission: System Performance Measurements
Authors:
Oluyemi Omomukuyo,
Octavia A. Dobre,
Ramachandran Venkatesan,
Telex M. N. Ngatched
Abstract:
In this article, we first provide a brief overview of optical transmission systems and some of their performance specifications. We then present a simple, robust, and bandwidth-efficient OFDM synchronization method, and carry out measurements to validate the presented synchronization method with the aid of an experimental setup.
In this article, we first provide a brief overview of optical transmission systems and some of their performance specifications. We then present a simple, robust, and bandwidth-efficient OFDM synchronization method, and carry out measurements to validate the presented synchronization method with the aid of an experimental setup.
△ Less
Submitted 4 January, 2018;
originally announced January 2018.
-
Discrete FRFT-Based Frame and Frequency Synchronization for Coherent Optical Systems
Authors:
Oluyemi Omomukuyo,
Shu Zhang,
Octavia Dobre,
Ramachandran Venkatesan,
Telex M. N. Ngatched
Abstract:
A joint frame and carrier frequency synchronization algorithm for coherent optical systems, based on the digital computation of the fractional Fourier transform (FRFT), is proposed. The algorithm utilizes the characteristics of energy centralization of chirp signals in the FRFT domain, together with the time and phase shift properties of the FRFT. Chirp signals are used to construct a training seq…
▽ More
A joint frame and carrier frequency synchronization algorithm for coherent optical systems, based on the digital computation of the fractional Fourier transform (FRFT), is proposed. The algorithm utilizes the characteristics of energy centralization of chirp signals in the FRFT domain, together with the time and phase shift properties of the FRFT. Chirp signals are used to construct a training sequence (TS), and fractional cross-correlation is employed to define a detection metric for the TS, from which a set of equations can be obtained. Estimates of both the timing offset and carrier frequency offset (CFO) are obtained by solving these equations. This TS is later employed in a phase-dependent decision-directed least-mean square algorithm for adaptive equalization. Simulation results of a 32-Gbaud coherent polarization division multiplexed Nyquist system show that the proposed scheme has a wide CFO estimation range and accurate synchronization performance even in poor optical signal-to-noise ratio conditions.
△ Less
Submitted 4 January, 2018;
originally announced January 2018.
-
Robust Faster-than-Nyquist PDM-mQAM Systems with Tomlinson-Harashima Precoding
Authors:
Deyuan Chang,
Oluyemi Omomukuyo,
Xiang Lin,
Shu Zhang,
Octavia A. Dobre,
Ramachandran Venkatesan
Abstract:
A training-based channel estimation algorithm is proposed for the faster-than-Nyquist PDM-mQAM (m = 4, 16, 64) systems with Tomlinson-Harashima precoding (THP). This is robust to the convergence failure phenomenon suffered by the existing algorithm, yet remaining format-transparent. Simulation results show that the proposed algorithm requires a reduced optical signal-to-noise ratio (OSNR) to achie…
▽ More
A training-based channel estimation algorithm is proposed for the faster-than-Nyquist PDM-mQAM (m = 4, 16, 64) systems with Tomlinson-Harashima precoding (THP). This is robust to the convergence failure phenomenon suffered by the existing algorithm, yet remaining format-transparent. Simulation results show that the proposed algorithm requires a reduced optical signal-to-noise ratio (OSNR) to achieve a certain bit error rate (BER) in the presence of first-order polarization mode dispersion and phase noise introduced by the laser linewidth.
△ Less
Submitted 4 January, 2018;
originally announced January 2018.
-
Training Symbol-Based Equalization for Quadrature Duobinary PDM-FTN Systems
Authors:
S. Zhang,
D. Chang,
O. A. Dobre,
O. Omomukuyo,
X. Lin,
R. Venkatesan
Abstract:
A training symbol-based equalization algorithm is proposed for polarization de-multiplexing in quadrature duobinary (QDB) modulated polarization division multiplexedfaster-than-Nyquist (FTN) coherent optical systems. The proposed algorithm is based on the least mean square algorithm, and multiple location candidates of a symbol are considered in order to make use of the training symbols with QDB m…
▽ More
A training symbol-based equalization algorithm is proposed for polarization de-multiplexing in quadrature duobinary (QDB) modulated polarization division multiplexedfaster-than-Nyquist (FTN) coherent optical systems. The proposed algorithm is based on the least mean square algorithm, and multiple location candidates of a symbol are considered in order to make use of the training symbols with QDB modulation.Results show that an excellent convergence performance is obtained using the proposed algorithm under different polarization alignment scenarios. The optical signal-to-noise ratio required to attain a bit error rate of 2*10-2 is reduced by 1.7 and 1.8 dB using the proposed algorithm, compared to systems using the constant modulus algorithm with differential coding for 4-ary quadrature amplitude modulation(4-QAM) and 16-QAM systems with symbol-by-symbol detection, respectively.Furthermore, comparisons with the Tomlinson-Harashima precoding-based FTN systems illustrate that QDB is preferable when 4-QAM is utilized.
△ Less
Submitted 3 January, 2018;
originally announced January 2018.
-
Coppersmith's lattices and "focus groups": an attack on small-exponent RSA
Authors:
Stephen D. Miller,
Bhargav Narayanan,
Ramarathnam Venkatesan
Abstract:
We present a principled technique for reducing the lattice and matrix size in some applications of Coppersmith's lattice method for finding roots of modular polynomial equations. Motivated by ideas from machine learning, it relies on extrapolating patterns from the actual behavior of Coppersmith's attack for smaller parameter sizes, which can be thought of as "focus group" testing. When applied to…
▽ More
We present a principled technique for reducing the lattice and matrix size in some applications of Coppersmith's lattice method for finding roots of modular polynomial equations. Motivated by ideas from machine learning, it relies on extrapolating patterns from the actual behavior of Coppersmith's attack for smaller parameter sizes, which can be thought of as "focus group" testing. When applied to the small-exponent RSA problem, our technique reduces lattice dimensions and consequently running times, and hence can be applied to a wider range of exponents. Moreover, in many difficult examples our attack is not only faster but also more successful in recovering the RSA secret key. We include a discussion of subtleties concerning whether or not existing metrics (such as enabling condition bounds) are decisive in predicting the true efficacy of attacks based on Coppersmith's method. Finally, indications are given which suggest certain lattice basis reduction algorithms (such as Nguyen-Stehlé's L2) may be particularly well-suited for Coppersmith's method.
△ Less
Submitted 16 December, 2020; v1 submitted 30 August, 2017;
originally announced August 2017.
-
A survey on fiber nonlinearity compensation for 400 Gbps and beyond optical communication systems
Authors:
Abdelkerim Amari,
Octavia A. Dobre,
Ramachandran Venkatesan,
O. S. Sunish Kumar,
Philippe Ciblat,
Yves Jaouën
Abstract:
Optical communication systems represent the backbone of modern communication networks. Since their deployment, different fiber technologies have been used to deal with optical fiber impairments such as dispersion-shifted fibers and dispersion-compensation fibers. In recent years, thanks to the introduction of coherent detection based systems, fiber impairments can be mitigated using digital signal…
▽ More
Optical communication systems represent the backbone of modern communication networks. Since their deployment, different fiber technologies have been used to deal with optical fiber impairments such as dispersion-shifted fibers and dispersion-compensation fibers. In recent years, thanks to the introduction of coherent detection based systems, fiber impairments can be mitigated using digital signal processing (DSP) algorithms. Coherent systems are used in the current 100 Gbps wavelength-division multiplexing (WDM) standard technology. They allow the increase of spectral efficiency by using multi-level modulation formats, and are combined with DSP techniques to combat the linear fiber distortions. In addition to linear impairments, the next generation 400 Gbps/1 Tbps WDM systems are also more affected by the fiber nonlinearity due to the Kerr effect. At high input power, the fiber nonlinear effects become more important and their compensation is required to improve the transmission performance. Several approaches have been proposed to deal with the fiber nonlinearity. In this paper, after a brief description of the Kerr-induced nonlinear effects, a survey on the fiber nonlinearity compensation (NLC) techniques is provided. We focus on the well-known NLC techniques and discuss their performance, as well as their implementation and complexity. An extension of the inter-subcarrier nonlinear interference canceler approach is also proposed. A performance evaluation of the well-known NLC techniques and the proposed approach is provided in the context of Nyquist and super-Nyquist superchannel systems.
△ Less
Submitted 21 August, 2017;
originally announced August 2017.
-
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
Authors:
Angshuman Parashar,
Minsoo Rhu,
Anurag Mukkara,
Antonio Puglielli,
Rangharajan Venkatesan,
Brucek Khailany,
Joel Emer,
Stephen W. Keckler,
William J. Dally
Abstract:
Convolutional Neural Networks (CNNs) have emerged as a fundamental technology for machine learning. High performance and extreme energy efficiency are critical for deployments of CNNs in a wide range of situations, especially mobile platforms such as autonomous vehicles, cameras, and electronic personal assistants. This paper introduces the Sparse CNN (SCNN) accelerator architecture, which improve…
▽ More
Convolutional Neural Networks (CNNs) have emerged as a fundamental technology for machine learning. High performance and extreme energy efficiency are critical for deployments of CNNs in a wide range of situations, especially mobile platforms such as autonomous vehicles, cameras, and electronic personal assistants. This paper introduces the Sparse CNN (SCNN) accelerator architecture, which improves performance and energy efficiency by exploiting the zero-valued weights that stem from network pruning during training and zero-valued activations that arise from the common ReLU operator applied during inference. Specifically, SCNN employs a novel dataflow that enables maintaining the sparse weights and activations in a compressed encoding, which eliminates unnecessary data transfers and reduces storage requirements. Furthermore, the SCNN dataflow facilitates efficient delivery of those weights and activations to the multiplier array, where they are extensively reused. In addition, the accumulation of multiplication products are performed in a novel accumulator array. Our results show that on contemporary neural networks, SCNN can improve both performance and energy by a factor of 2.7x and 2.3x, respectively, over a comparably provisioned dense CNN accelerator.
△ Less
Submitted 23 May, 2017;
originally announced August 2017.
-
graph2vec: Learning Distributed Representations of Graphs
Authors:
Annamalai Narayanan,
Mahinthan Chandramohan,
Rajasekar Venkatesan,
Lihui Chen,
Yang Liu,
Shantanu Jaiswal
Abstract:
Recent works on representation learning for graph structured data predominantly focus on learning distributed representations of graph substructures such as nodes and subgraphs. However, many graph analytics tasks such as graph classification and clustering require representing entire graphs as fixed length feature vectors. While the aforementioned approaches are naturally unequipped to learn such…
▽ More
Recent works on representation learning for graph structured data predominantly focus on learning distributed representations of graph substructures such as nodes and subgraphs. However, many graph analytics tasks such as graph classification and clustering require representing entire graphs as fixed length feature vectors. While the aforementioned approaches are naturally unequipped to learn such representations, graph kernels remain as the most effective way of obtaining them. However, these graph kernels use handcrafted features (e.g., shortest paths, graphlets, etc.) and hence are hampered by problems such as poor generalization. To address this limitation, in this work, we propose a neural embedding framework named graph2vec to learn data-driven distributed representations of arbitrary sized graphs. graph2vec's embeddings are learnt in an unsupervised manner and are task agnostic. Hence, they could be used for any downstream task such as graph classification, clustering and even seeding supervised representation learning approaches. Our experiments on several benchmark and large real-world datasets show that graph2vec achieves significant improvements in classification and clustering accuracies over substructure representation learning approaches and are competitive with state-of-the-art graph kernels.
△ Less
Submitted 17 July, 2017;
originally announced July 2017.
-
A Strategy for an Uncompromising Incremental Learner
Authors:
Ragav Venkatesan,
Hemanth Venkateswara,
Sethuraman Panchanathan,
Baoxin Li
Abstract:
Multi-class supervised learning systems require the knowledge of the entire range of labels they predict. Often when learnt incrementally, they suffer from catastrophic forgetting. To avoid this, generous leeways have to be made to the philosophy of incremental learning that either forces a part of the machine to not learn, or to retrain the machine again with a selection of the historic data. Whi…
▽ More
Multi-class supervised learning systems require the knowledge of the entire range of labels they predict. Often when learnt incrementally, they suffer from catastrophic forgetting. To avoid this, generous leeways have to be made to the philosophy of incremental learning that either forces a part of the machine to not learn, or to retrain the machine again with a selection of the historic data. While these hacks work to various degrees, they do not adhere to the spirit of incremental learning. In this article, we redefine incremental learning with stringent conditions that do not allow for any undesirable relaxations and assumptions. We design a strategy involving generative models and the distillation of dark knowledge as a means of hallucinating data along with appropriate targets from past distributions. We call this technique, phantom sampling.We show that phantom sampling helps avoid catastrophic forgetting during incremental learning. Using an implementation based on deep neural networks, we demonstrate that phantom sampling dramatically avoids catastrophic forgetting. We apply these strategies to competitive multi-class incremental learning of deep neural networks. Using various benchmark datasets and through our strategy, we demonstrate that strict incremental learning could be achieved. We further put our strategy to test on challenging cases, including cross-domain increments and incrementing on a novel label space. We also propose a trivial extension to unbounded-continual learning and identify potential for future development.
△ Less
Submitted 17 July, 2017; v1 submitted 1 May, 2017;
originally announced May 2017.
-
Classification of Diabetic Retinopathy Images Using Multi-Class Multiple-Instance Learning Based on Color Correlogram Features
Authors:
Ragav Venkatesan,
Parag S. Chandakkar,
Baoxin Li
Abstract:
All people with diabetes have the risk of develo** diabetic retinopathy (DR), a vision-threatening complication. Early detection and timely treatment can reduce the occurrence of blindness due to DR. Computer-aided diagnosis has the potential benefit of improving the accuracy and speed in DR detection. This study is concerned with automatic classification of images with microaneurysm (MA) and ne…
▽ More
All people with diabetes have the risk of develo** diabetic retinopathy (DR), a vision-threatening complication. Early detection and timely treatment can reduce the occurrence of blindness due to DR. Computer-aided diagnosis has the potential benefit of improving the accuracy and speed in DR detection. This study is concerned with automatic classification of images with microaneurysm (MA) and neovascularization (NV), two important DR clinical findings. Together with normal images, this presents a 3-class classification problem. We propose a modified color auto-correlogram feature (AutoCC) with low dimensionality that is spectrally tuned towards DR images. Recognizing the fact that the images with or without MA or NV are generally different only in small, localized regions, we propose to employ a multi-class, multiple-instance learning framework for performing the classification task using the proposed feature. Extensive experiments including comparison with a few state-of-art image classification approaches have been performed and the results suggest that the proposed approach is promising as it outperforms other methods by a large margin.
△ Less
Submitted 5 April, 2017;
originally announced April 2017.
-
Fisher Information Framework for Time Series Modeling
Authors:
R. C. Venkatesan,
A. Plastino
Abstract:
A robust prediction model invoking the Takens embedding theorem, whose \textit{working hypothesis} is obtained via an inference procedure based on the minimum Fisher information principle, is presented. The coefficients of the ansatz, central to the \textit{working hypothesis} satisfy a time independent Schrödinger-like equation in a vector setting. The inference of i) the probability density func…
▽ More
A robust prediction model invoking the Takens embedding theorem, whose \textit{working hypothesis} is obtained via an inference procedure based on the minimum Fisher information principle, is presented. The coefficients of the ansatz, central to the \textit{working hypothesis} satisfy a time independent Schrödinger-like equation in a vector setting. The inference of i) the probability density function of the coefficients of the \textit{working hypothesis} and ii) the establishing of constraint driven pseudo-inverse condition for the modeling phase of the prediction scheme, is made, for the case of normal distributions, with the aid of the quantum mechanical virial theorem. The well-known reciprocity relations and the associated Legendre transform structure for the Fisher information measure (FIM, hereafter)-based model in a vector setting (with least square constraints) are self-consistently derived. These relations are demonstrated to yield an intriguing form of the FIM for the modeling phase, which defines the \textit{working hypothesis}, solely in terms of the observed data. Cases for prediction employing time series' obtained from the: $(i)$ the Mackey-Glass delay-differential equation, $(ii)$ one ECG sample from the MIT-Beth Israel Deaconess Hospital (MIT-BIH) cardiac arrhythmia database, and $(iii)$ one ECG from the Creighton University ventricular tachyarrhythmia database. The ECG samples were obtained from the Physionet online repository. These examples demonstrate the efficiency of the prediction model. Numerical examples for exemplary cases are provided.
△ Less
Submitted 2 December, 2016; v1 submitted 14 November, 2016;
originally announced November 2016.
-
Balancing sums of random vectors
Authors:
Juhan Aru,
Bhargav Narayanan,
Alex Scott,
Ramarathnam Venkatesan
Abstract:
We study a higher-dimensional 'balls-into-bins' problem. An infinite sequence of i.i.d. random vectors is revealed to us one vector at a time, and we are required to partition these vectors into a fixed number of bins in such a way as to keep the sums of the vectors in the different bins close together; how close can we keep these sums almost surely? This question, our primary focus in this paper,…
▽ More
We study a higher-dimensional 'balls-into-bins' problem. An infinite sequence of i.i.d. random vectors is revealed to us one vector at a time, and we are required to partition these vectors into a fixed number of bins in such a way as to keep the sums of the vectors in the different bins close together; how close can we keep these sums almost surely? This question, our primary focus in this paper, is closely related to the classical problem of partitioning a sequence of vectors into balanced subsequences, in addition to having applications to some problems in computer science.
△ Less
Submitted 10 March, 2018; v1 submitted 17 October, 2016;
originally announced October 2016.
-
A Novel Progressive Multi-label Classifier for Classincremental Data
Authors:
Mihika Dave,
Sahil Tapiawala,
Meng Joo Er,
Rajasekar Venkatesan
Abstract:
In this paper, a progressive learning algorithm for multi-label classification to learn new labels while retaining the knowledge of previous labels is designed. New output neurons corresponding to new labels are added and the neural network connections and parameters are automatically restructured as if the label has been introduced from the beginning. This work is the first of the kind in multi-l…
▽ More
In this paper, a progressive learning algorithm for multi-label classification to learn new labels while retaining the knowledge of previous labels is designed. New output neurons corresponding to new labels are added and the neural network connections and parameters are automatically restructured as if the label has been introduced from the beginning. This work is the first of the kind in multi-label classifier for class-incremental learning. It is useful for real-world applications such as robotics where streaming data are available and the number of labels is often unknown. Based on the Extreme Learning Machine framework, a novel universal classifier with plug and play capabilities for progressive multi-label classification is developed. Experimental results on various benchmark synthetic and real datasets validate the efficiency and effectiveness of our proposed algorithm.
△ Less
Submitted 22 September, 2016;
originally announced September 2016.
-
An Online Universal Classifier for Binary, Multi-class and Multi-label Classification
Authors:
Meng Joo Er,
Rajasekar Venkatesan,
Ning Wang
Abstract:
Classification involves the learning of the map** function that associates input samples to corresponding target label. There are two major categories of classification problems: Single-label classification and Multi-label classification. Traditional binary and multi-class classifications are sub-categories of single-label classification. Several classifiers are developed for binary, multi-class…
▽ More
Classification involves the learning of the map** function that associates input samples to corresponding target label. There are two major categories of classification problems: Single-label classification and Multi-label classification. Traditional binary and multi-class classifications are sub-categories of single-label classification. Several classifiers are developed for binary, multi-class and multi-label classification problems, but there are no classifiers available in the literature capable of performing all three types of classification. In this paper, a novel online universal classifier capable of performing all the three types of classification is proposed. Being a high speed online classifier, the proposed technique can be applied to streaming data applications. The performance of the developed classifier is evaluated using datasets from binary, multi-class and multi-label problems. The results obtained are compared with state-of-the-art techniques from each of the classification types.
△ Less
Submitted 3 September, 2016;
originally announced September 2016.
-
A novel online multi-label classifier for high-speed streaming data applications
Authors:
Rajasekar Venkatesan,
Meng Joo Er,
Mihika Dave,
Mahardhika Pratama,
Shiqian Wu
Abstract:
In this paper, a high-speed online neural network classifier based on extreme learning machines for multi-label classification is proposed. In multi-label classification, each of the input data sample belongs to one or more than one of the target labels. The traditional binary and multi-class classification where each sample belongs to only one target class forms the subset of multi-label classifi…
▽ More
In this paper, a high-speed online neural network classifier based on extreme learning machines for multi-label classification is proposed. In multi-label classification, each of the input data sample belongs to one or more than one of the target labels. The traditional binary and multi-class classification where each sample belongs to only one target class forms the subset of multi-label classification. Multi-label classification problems are far more complex than binary and multi-class classification problems, as both the number of target labels and each of the target labels corresponding to each of the input samples are to be identified. The proposed work exploits the high-speed nature of the extreme learning machines to achieve real-time multi-label classification of streaming data. A new threshold-based online sequential learning algorithm is proposed for high speed and streaming data classification of multi-label problems. The proposed method is experimented with six different datasets from different application domains such as multimedia, text, and biology. The hamming loss, accuracy, training time and testing time of the proposed technique is compared with nine different state-of-the-art methods. Experimental studies shows that the proposed technique outperforms the existing multi-label classifiers in terms of performance and speed.
△ Less
Submitted 31 August, 2016;
originally announced September 2016.
-
A Novel Progressive Learning Technique for Multi-class Classification
Authors:
Rajasekar Venkatesan,
Meng Joo Er
Abstract:
In this paper, a progressive learning technique for multi-class classification is proposed. This newly developed learning technique is independent of the number of class constraints and it can learn new classes while still retaining the knowledge of previous classes. Whenever a new class (non-native to the knowledge learnt thus far) is encountered, the neural network structure gets remodeled autom…
▽ More
In this paper, a progressive learning technique for multi-class classification is proposed. This newly developed learning technique is independent of the number of class constraints and it can learn new classes while still retaining the knowledge of previous classes. Whenever a new class (non-native to the knowledge learnt thus far) is encountered, the neural network structure gets remodeled automatically by facilitating new neurons and interconnections, and the parameters are calculated in such a way that it retains the knowledge learnt thus far. This technique is suitable for real-world applications where the number of classes is often unknown and online learning from real-time data is required. The consistency and the complexity of the progressive learning technique are analyzed. Several standard datasets are used to evaluate the performance of the developed technique. A comparative study shows that the developed technique is superior.
△ Less
Submitted 22 January, 2017; v1 submitted 31 August, 2016;
originally announced September 2016.
-
A Novel Online Real-time Classifier for Multi-label Data Streams
Authors:
Rajasekar Venkatesan,
Meng Joo Er,
Shiqian Wu,
Mahardhika Pratama
Abstract:
In this paper, a novel extreme learning machine based online multi-label classifier for real-time data streams is proposed. Multi-label classification is one of the actively researched machine learning paradigm that has gained much attention in the recent years due to its rapidly increasing real world applications. In contrast to traditional binary and multi-class classification, multi-label class…
▽ More
In this paper, a novel extreme learning machine based online multi-label classifier for real-time data streams is proposed. Multi-label classification is one of the actively researched machine learning paradigm that has gained much attention in the recent years due to its rapidly increasing real world applications. In contrast to traditional binary and multi-class classification, multi-label classification involves association of each of the input samples with a set of target labels simultaneously. There are no real-time online neural network based multi-label classifier available in the literature. In this paper, we exploit the inherent nature of high speed exhibited by the extreme learning machines to develop a novel online real-time classifier for multi-label data streams. The developed classifier is experimented with datasets from different application domains for consistency, performance and speed. The experimental studies show that the proposed method outperforms the existing state-of-the-art techniques in terms of speed and accuracy and can classify multi-label data streams in real-time.
△ Less
Submitted 31 August, 2016;
originally announced August 2016.
-
A High Speed Multi-label Classifier based on Extreme Learning Machines
Authors:
Meng Joo Er,
Rajasekar Venkatesan,
Ning Wang
Abstract:
In this paper a high speed neural network classifier based on extreme learning machines for multi-label classification problem is proposed and dis-cussed. Multi-label classification is a superset of traditional binary and multi-class classification problems. The proposed work extends the extreme learning machine technique to adapt to the multi-label problems. As opposed to the single-label problem…
▽ More
In this paper a high speed neural network classifier based on extreme learning machines for multi-label classification problem is proposed and dis-cussed. Multi-label classification is a superset of traditional binary and multi-class classification problems. The proposed work extends the extreme learning machine technique to adapt to the multi-label problems. As opposed to the single-label problem, both the number of labels the sample belongs to, and each of those target labels are to be identified for multi-label classification resulting in in-creased complexity. The proposed high speed multi-label classifier is applied to six benchmark datasets comprising of different application areas such as multi-media, text and biology. The training time and testing time of the classifier are compared with those of the state-of-the-arts methods. Experimental studies show that for all the six datasets, our proposed technique have faster execution speed and better performance, thereby outperforming all the existing multi-label clas-sification methods.
△ Less
Submitted 31 August, 2016;
originally announced August 2016.
-
Multi-Label Classification Method Based on Extreme Learning Machines
Authors:
Rajasekar Venkatesan,
Meng Joo Er
Abstract:
In this paper, an Extreme Learning Machine (ELM) based technique for Multi-label classification problems is proposed and discussed. In multi-label classification, each of the input data samples belongs to one or more than one class labels. The traditional binary and multi-class classification problems are the subset of the multi-label problem with the number of labels corresponding to each sample…
▽ More
In this paper, an Extreme Learning Machine (ELM) based technique for Multi-label classification problems is proposed and discussed. In multi-label classification, each of the input data samples belongs to one or more than one class labels. The traditional binary and multi-class classification problems are the subset of the multi-label problem with the number of labels corresponding to each sample limited to one. The proposed ELM based multi-label classification technique is evaluated with six different benchmark multi-label datasets from different domains such as multimedia, text and biology. A detailed comparison of the results is made by comparing the proposed method with the results from nine state of the arts techniques for five different evaluation metrics. The nine methods are chosen from different categories of multi-label methods. The comparative results shows that the proposed Extreme Learning Machine based multi-label classification technique is a better alternative than the existing state of the art methods for multi-label problems.
△ Less
Submitted 30 August, 2016;
originally announced August 2016.
-
Neural Dataset Generality
Authors:
Ragav Venkatesan,
Vijetha Gattupalli,
Baoxin Li
Abstract:
Often the filters learned by Convolutional Neural Networks (CNNs) from different datasets appear similar. This is prominent in the first few layers. This similarity of filters is being exploited for the purposes of transfer learning and some studies have been made to analyse such transferability of features. This is also being used as an initialization technique for different tasks in the same dat…
▽ More
Often the filters learned by Convolutional Neural Networks (CNNs) from different datasets appear similar. This is prominent in the first few layers. This similarity of filters is being exploited for the purposes of transfer learning and some studies have been made to analyse such transferability of features. This is also being used as an initialization technique for different tasks in the same dataset or for the same task in similar datasets. Off-the-shelf CNN features have capitalized on this idea to promote their networks as best transferable and most general and are used in a cavalier manner in day-to-day computer vision tasks.
It is curious that while the filters learned by these CNNs are related to the atomic structures of the images from which they are learnt, all datasets learn similar looking low-level filters. With the understanding that a dataset that contains many such atomic structures learn general filters and are therefore useful to initialize other networks with, we propose a way to analyse and quantify generality among datasets from their accuracies on transferred filters. We applied this metric on several popular character recognition, natural image and a medical image dataset, and arrived at some interesting conclusions. On further experimentation we also discovered that particular classes in a dataset themselves are more general than others.
△ Less
Submitted 13 May, 2016;
originally announced May 2016.
-
Information Flows in Encrypted Databases
Authors:
Kapil Vaswani,
Ravi Ramamurthy,
Ramarathnam Venkatesan
Abstract:
In encrypted databases, sensitive data is protected from an untrusted server by encrypting columns using partially homomorphic encryption schemes, and storing encryption keys in a trusted client. However, encrypting columns and protecting encryption keys does not ensure confidentiality - sensitive data can leak during query processing due to information flows through the trusted client. In this pa…
▽ More
In encrypted databases, sensitive data is protected from an untrusted server by encrypting columns using partially homomorphic encryption schemes, and storing encryption keys in a trusted client. However, encrypting columns and protecting encryption keys does not ensure confidentiality - sensitive data can leak during query processing due to information flows through the trusted client. In this paper, we propose SecureSQL, an encrypted database that partitions query processing between an untrusted server and a trusted client while ensuring the absence of information flows. Our evaluation based on OLTP benchmarks suggests that SecureSQL can protect against explicit flows with low overheads (< 30%). However, protecting against implicit flows can be expensive because it precludes the use of key databases optimizations and introduces additional round trips between client and server.
△ Less
Submitted 3 May, 2016;
originally announced May 2016.
-
Diving deeper into mentee networks
Authors:
Ragav Venkatesan,
Baoxin Li
Abstract:
Modern computer vision is all about the possession of powerful image representations. Deeper and deeper convolutional neural networks have been built using larger and larger datasets and are made publicly available. A large swath of computer vision scientists use these pre-trained networks with varying degrees of successes in various tasks. Even though there is tremendous success in copying these…
▽ More
Modern computer vision is all about the possession of powerful image representations. Deeper and deeper convolutional neural networks have been built using larger and larger datasets and are made publicly available. A large swath of computer vision scientists use these pre-trained networks with varying degrees of successes in various tasks. Even though there is tremendous success in copying these networks, the representational space is not learnt from the target dataset in a traditional manner. One of the reasons for opting to use a pre-trained network over a network learnt from scratch is that small datasets provide less supervision and require meticulous regularization, smaller and careful tweaking of learning rates to even achieve stable learning without weight explosion. It is often the case that large deep networks are not portable, which necessitates the ability to learn mid-sized networks from scratch.
In this article, we dive deeper into training these mid-sized networks on small datasets from scratch by drawing additional supervision from a large pre-trained network. Such learning also provides better generalization accuracies than networks trained with common regularization techniques such as l2, l1 and dropouts. We show that features learnt thus, are more general than those learnt independently. We studied various characteristics of such networks and found some interesting behaviors.
△ Less
Submitted 27 April, 2016;
originally announced April 2016.
-
Trending Chic: Analyzing the Influence of Social Media on Fashion Brands
Authors:
Lydia Manikonda,
Ragav Venkatesan,
Subbarao Kambhampati,
Baoxin Li
Abstract:
Social media platforms are popular venues for fashion brand marketing and advertising. With the introduction of native advertising, users don't have to endure banner ads that hold very little saliency and are unattractive. Using images and subtle text overlays, even in a world of ever-depreciating attention span, brands can retain their audience and have a capacious creative potential. While an as…
▽ More
Social media platforms are popular venues for fashion brand marketing and advertising. With the introduction of native advertising, users don't have to endure banner ads that hold very little saliency and are unattractive. Using images and subtle text overlays, even in a world of ever-depreciating attention span, brands can retain their audience and have a capacious creative potential. While an assortment of marketing strategies are conjectured, the subtle distinctions between various types of marketing strategies remain under-explored. This paper presents a qualitative analysis on the influence of social media platforms on different behaviors of fashion brand marketing. We employ both linguistic and computer vision techniques while comparing and contrasting strategic idiosyncrasies. We also analyze brand audience retention and social engagement hence providing suggestions in adapting advertising and marketing strategies over Twitter and Instagram.
△ Less
Submitted 8 March, 2016; v1 submitted 3 December, 2015;
originally announced December 2015.
-
Spatial self-phase modulation in the H2TPP(OH)4 doped in Boric Acid Glass
Authors:
Srinivasa Rao Allam,
Mudasir H Dar,
N Venkatramaiah,
R Venkatesan,
Alok Sharan
Abstract:
Self-diffraction rings or spatial self-phase modulation (SSPM) was observed in tetra-phenyl porphyrin derivative 5,10,15,20 - meso-tetrakis (4-hydroxyphenyl) porphyrin (H2TPP(OH)4) doped in boric acid glass (BAG) at 671 nm excitation wave-length lying within the absorption band of sample with TEM00 mode profile. Intensity modulated Z-scan was performed on these systems to study the thermal diffusi…
▽ More
Self-diffraction rings or spatial self-phase modulation (SSPM) was observed in tetra-phenyl porphyrin derivative 5,10,15,20 - meso-tetrakis (4-hydroxyphenyl) porphyrin (H2TPP(OH)4) doped in boric acid glass (BAG) at 671 nm excitation wave-length lying within the absorption band of sample with TEM00 mode profile. Intensity modulated Z-scan was performed on these systems to study the thermal diffusion and to estimate the thermo-optic coefficients. The results obtained from self-diffraction rings experiment and modulated Z-scan are compared and analyzed for different concentration.
△ Less
Submitted 3 November, 2015;
originally announced November 2015.
-
Non-Abelian Analogs of Lattice Rounding
Authors:
Evgeni Begelfor,
Stephen D. Miller,
Ramarathnam Venkatesan
Abstract:
Lattice rounding in Euclidean space can be viewed as finding the nearest point in the orbit of an action by a discrete group, relative to the norm inherited from the ambient space. Using this point of view, we initiate the study of non-abelian analogs of lattice rounding involving matrix groups. In one direction, we give an algorithm for solving a normed word problem when the inputs are random pro…
▽ More
Lattice rounding in Euclidean space can be viewed as finding the nearest point in the orbit of an action by a discrete group, relative to the norm inherited from the ambient space. Using this point of view, we initiate the study of non-abelian analogs of lattice rounding involving matrix groups. In one direction, we give an algorithm for solving a normed word problem when the inputs are random products over a basis set, and give theoretical justification for its success. In another direction, we prove a general inapproximability result which essentially rules out strong approximation algorithms (i.e., whose approximation factors depend only on dimension) analogous to LLL in the general case.
△ Less
Submitted 13 January, 2015;
originally announced January 2015.
-
Hellmann-Feynman connection for the relative Fisher information
Authors:
R. C. Venkatesan,
A. Plastino
Abstract:
The $(i)$ reciprocity relations for the relative Fisher information (RFI, hereafter) and $(ii)$ a generalized RFI-Euler theorem, are self-consistently derived from the Hellmann-Feynman theorem. These new reciprocity relations generalize the RFI-Euler theorem and constitute the basis for building up a mathematical Legendre transform structure (LTS, hereafter), akin to that of thermodynamics, that u…
▽ More
The $(i)$ reciprocity relations for the relative Fisher information (RFI, hereafter) and $(ii)$ a generalized RFI-Euler theorem, are self-consistently derived from the Hellmann-Feynman theorem. These new reciprocity relations generalize the RFI-Euler theorem and constitute the basis for building up a mathematical Legendre transform structure (LTS, hereafter), akin to that of thermodynamics, that underlies the RFI scenario. This demonstrates the possibility of translating the entire mathematical structure of thermodynamics into a RFI-based theoretical framework. Virial theorems play a prominent role in this endeavor, as a Schrödinger-like equation can be associated to the RFI. Lagrange multipliers are determined invoking the RFI-LTS link and the quantum mechanical virial theorem. An appropriate ansatz allows for the inference of probability density functions (pdf's, hereafter) and energy-eigenvalues of the above mentioned Schrödinger-like equation. The energy-eigenvalues obtained here via inference are benchmarked against established theoretical and numerical results. A principled theoretical basis to reconstruct the RFI-framework from the FIM framework is established. Numerical examples for exemplary cases are provided.
△ Less
Submitted 10 May, 2015; v1 submitted 6 December, 2014;
originally announced December 2014.
-
Legendre transform structure and extremal properties of the relative Fisher information
Authors:
R. C. Venkatesan,
A. Plastino
Abstract:
Variational extremization of the relative Fisher information (RFI, hereafter) is performed. Reciprocity relations, akin to those of thermodynamics are derived, employing the extremal results of the RFI expressed in terms of probability amplitudes. A time independent Schrödinger-like equation (Schrödinger-like link) for the RFI is derived. The concomitant Legendre transform structure (LTS, hereafte…
▽ More
Variational extremization of the relative Fisher information (RFI, hereafter) is performed. Reciprocity relations, akin to those of thermodynamics are derived, employing the extremal results of the RFI expressed in terms of probability amplitudes. A time independent Schrödinger-like equation (Schrödinger-like link) for the RFI is derived. The concomitant Legendre transform structure (LTS, hereafter) is developed by utilizing a generalized RFI-Euler theorem, which shows that the entire mathematical structure of thermodynamics translates into the RFI framework, both for equilibrium and non-equilibrium cases. The qualitatively distinct nature of the present results \textit{vis-á-vis} those of prior studies utilizing the Shannon entropy and/or the Fisher information measure (FIM, hereafter) is discussed. A principled relationship between the RFI and the FIM frameworks is derived. The utility of this relationship is demonstrated by an example wherein the energy eigenvalues of the Schrödinger-like link for the RFI is inferred solely using the quantum mechanical virial theorem and the LTS of the RFI.
△ Less
Submitted 22 March, 2014; v1 submitted 16 December, 2013;
originally announced December 2013.