Skip to main content

Showing 1–20 of 20 results for author: Culurciello, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17927  [pdf, other

    cs.AI cs.CL cs.CV cs.LG eess.AS

    The Evolution of Multimodal Model Architectures

    Authors: Shakti N. Wadekar, Abhishek Chaurasia, Aman Chadha, Eugenio Culurciello

    Abstract: This work uniquely identifies and characterizes four prevalent multimodal model architectural patterns in the contemporary multimodal landscape. Systematically categorizing models by architecture type facilitates monitoring of developments in the multimodal domain. Distinct from recent survey papers that present general information on multimodal architectures, this research conducts a comprehensiv… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 30 pages, 6 tables, 7 figures

  2. arXiv:2305.07167  [pdf, other

    cs.CV cs.CL cs.LG eess.IV

    OneCAD: One Classifier for All image Datasets using multimodal learning

    Authors: Shakti N. Wadekar, Eugenio Culurciello

    Abstract: Vision-Transformers (ViTs) and Convolutional neural networks (CNNs) are widely used Deep Neural Networks (DNNs) for classification task. These model architectures are dependent on the number of classes in the dataset it was trained on. Any change in number of classes leads to change (partial or full) in the model's architecture. This work addresses the question: Is it possible to create a number-o… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures

  3. arXiv:2205.13675  [pdf, other

    cs.AR cs.AI cs.LG

    Reinforcement Learning Approach for Map** Applications to Dataflow-Based Coarse-Grained Reconfigurable Array

    Authors: Andre Xian Ming Chang, Parth Khopkar, Bashar Romanous, Abhishek Chaurasia, Patrick Estep, Skyler Windh, Doug Vanesko, Sheik Dawood Beer Mohideen, Eugenio Culurciello

    Abstract: The Streaming Engine (SE) is a Coarse-Grained Reconfigurable Array which provides programming flexibility and high-performance with energy efficiency. An application program to be executed on the SE is represented as a combination of Synchronous Data Flow (SDF) graphs, where every instruction is represented as a node. Each node needs to be mapped to the right slot and array in the SE to ensure the… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 10 pages, 12 figures

  4. Capsule Network Performance with Autonomous Navigation

    Authors: Thomas Molnar, Eugenio Culurciello

    Abstract: Capsule Networks (CapsNets) have been proposed as an alternative to Convolutional Neural Networks (CNNs). This paper showcases how CapsNets are more capable than CNNs for autonomous agent exploration of realistic scenarios. In real world navigation, rewards external to agents may be rare. In turn, reinforcement learning algorithms can struggle to form meaningful policy functions. This paper's appr… ▽ More

    Submitted 8 February, 2020; originally announced February 2020.

    Comments: In IJAIA Vol.11, No.1 for January 2020; 15 pages, 9 figures

    Journal ref: International Journal of Artificial Intelligence and Applications (IJAIA), Vol. 11, No. 1, January 2020

  5. arXiv:1905.10112  [pdf, other

    cs.LG cs.CV stat.ML

    Continual Reinforcement Learning in 3D Non-stationary Environments

    Authors: Vincenzo Lomonaco, Karan Desai, Eugenio Culurciello, Davide Maltoni

    Abstract: High-dimensional always-changing environments constitute a hard challenge for current reinforcement learning techniques. Artificial agents, nowadays, are often trained off-line in very static and controlled conditions in simulation such that training observations can be thought as sampled i.i.d. from the entire observations space. However, in real world settings, the environment is often non-stati… ▽ More

    Submitted 21 April, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Accepted in the CLVision Workshop at CVPR2020: 13 pages, 4 figures, 5 tables

  6. arXiv:1805.07526  [pdf, other

    cs.CV

    Deep Predictive Coding Network with Local Recurrent Processing for Object Recognition

    Authors: Kuan Han, Haiguang Wen, Yizhen Zhang, Di Fu, Eugenio Culurciello, Zhongming Liu

    Abstract: Inspired by "predictive coding" - a theory in neuroscience, we develop a bi-directional and dynamic neural network with local recurrent processing, namely predictive coding network (PCN). Unlike feedforward-only convolutional neural networks, PCN includes both feedback connections, which carry top-down predictions, and feedforward connections, which carry bottom-up errors of prediction. Feedback a… ▽ More

    Submitted 25 October, 2018; v1 submitted 19 May, 2018; originally announced May 2018.

    Comments: 12 pages, 3 figures

  7. arXiv:1802.04762  [pdf

    cs.CV

    Deep Predictive Coding Network for Object Recognition

    Authors: Haiguang Wen, Kuan Han, Junxing Shi, Yizhen Zhang, Eugenio Culurciello, Zhongming Liu

    Abstract: Based on the predictive coding theory in neuroscience, we designed a bi-directional and recurrent neural net, namely deep predictive coding networks (PCN). It has feedforward, feedback, and recurrent connections. Feedback connections from a higher layer carry the prediction of its lower-layer representation; feedforward connections carry the prediction errors to its higher-layer. Given image input… ▽ More

    Submitted 29 July, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: 10 pages, 5 figures, 4 tables

  8. arXiv:1708.02579  [pdf, other

    cs.AR

    Snowflake: A Model Agnostic Accelerator for Deep Convolutional Neural Networks

    Authors: Vinayak Gokhale, Aliasger Zaidy, Andre Xian Ming Chang, Eugenio Culurciello

    Abstract: Deep convolutional neural networks (CNNs) are the deep learning model of choice for performing object detection, classification, semantic segmentation and natural language processing tasks. CNNs require billions of operations to process a frame. This computational complexity, combined with the inherent parallelism of the convolution operation make CNNs an excellent target for custom accelerators.… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

  9. arXiv:1708.00117  [pdf, other

    cs.DC cs.LG

    Compiling Deep Learning Models for Custom Hardware Accelerators

    Authors: Andre Xian Ming Chang, Aliasger Zaidy, Vinayak Gokhale, Eugenio Culurciello

    Abstract: Convolutional neural networks (CNNs) are the core of most state-of-the-art deep learning algorithms specialized for object detection and classification. CNNs are both computationally complex and embarrassingly parallel. Two properties that leave room for potential software and hardware optimizations for embedded systems. Given a programmable hardware accelerator with a CNN oriented custom instruct… ▽ More

    Submitted 10 December, 2017; v1 submitted 31 July, 2017; originally announced August 2017.

    Comments: 8 pages

  10. LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation

    Authors: Abhishek Chaurasia, Eugenio Culurciello

    Abstract: Pixel-wise semantic segmentation for visual scene understanding not only needs to be accurate, but also efficient in order to find any use in real-time application. Existing algorithms even though are accurate but they do not focus on utilizing the parameters of neural network efficiently. As a result they are huge in terms of parameters and number of operations; hence slow too. In this paper, we… ▽ More

    Submitted 14 June, 2017; originally announced July 2017.

    Comments: 5 pages, 5 figures, GitHub: https://github.com/e-lab/LinkNet

  11. arXiv:1706.02735  [pdf, other

    cs.CV

    CortexNet: a Generic Network Family for Robust Visual Temporal Representations

    Authors: Alfredo Canziani, Eugenio Culurciello

    Abstract: In the past five years we have observed the rise of incredibly well performing feed-forward neural networks trained supervisedly for vision related tasks. These models have achieved super-human performance on object recognition, localisation, and detection in still images. However, there is a need to identify the best strategy to employ these networks with temporal visual inputs and obtain a robus… ▽ More

    Submitted 14 June, 2017; v1 submitted 8 June, 2017; originally announced June 2017.

    Comments: 8 pages, 4 figures. Edit: 4.2 - define n = t - 1; fix grammar/meaning in last sentence. 5.2 - add Open Images data set ref

  12. arXiv:1606.02147  [pdf, other

    cs.CV

    ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

    Authors: Adam Paszke, Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello

    Abstract: The ability to perform pixel-wise semantic segmentation in real-time is of paramount importance in mobile applications. Recent deep neural networks aimed at this task have the disadvantage of requiring a large number of floating point operations and have long run-times that hinder their usability. In this paper, we propose a novel deep neural network architecture named ENet (efficient neural netwo… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

  13. arXiv:1605.07678  [pdf, other

    cs.CV

    An Analysis of Deep Neural Network Models for Practical Applications

    Authors: Alfredo Canziani, Adam Paszke, Eugenio Culurciello

    Abstract: Since the emergence of Deep Neural Networks (DNNs) as a prominent technique in the field of computer vision, the ImageNet classification challenge has played a major role in advancing the state-of-the-art. While accuracy figures have steadily increased, the resource utilisation of winning models has not been properly taken into account. In this work, we present a comprehensive analysis of importan… ▽ More

    Submitted 14 April, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: 7 pages, 10 figures, legend for Figure 2 got lost :/

  14. arXiv:1511.06306  [pdf, other

    cs.LG cs.CV

    Robust Convolutional Neural Networks under Adversarial Noise

    Authors: Jonghoon **, Aysegul Dundar, Eugenio Culurciello

    Abstract: Recent studies have shown that Convolutional Neural Networks (CNNs) are vulnerable to a small perturbation of input called "adversarial examples". In this work, we propose a new feedforward CNN that improves robustness in the presence of adversarial noise. Our model uses stochastic additive noise added to the input image and to the CNN models. The proposed model operates in conjunction with a CNN… ▽ More

    Submitted 25 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 8 pages

  15. arXiv:1511.06241  [pdf, other

    cs.LG cs.CV

    Convolutional Clustering for Unsupervised Learning

    Authors: Aysegul Dundar, Jonghoon **, Eugenio Culurciello

    Abstract: The task of labeling data for training deep neural networks is daunting and tedious, requiring millions of labels to achieve the current state-of-the-art results. Such reliance on large amounts of labeled data can be relaxed by exploiting hierarchical features via unsupervised learning techniques. In this work, we propose to train a deep convolutional network based on an enhanced version of the k-… ▽ More

    Submitted 16 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 11 pages

  16. arXiv:1511.05552  [pdf, other

    cs.NE

    Recurrent Neural Networks Hardware Implementation on FPGA

    Authors: Andre Xian Ming Chang, Berin Martini, Eugenio Culurciello

    Abstract: Recurrent Neural Networks (RNNs) have the ability to retain memory and learn data sequences. Due to the recurrent nature of RNNs, it is sometimes hard to parallelize all its computations on conventional hardware. CPUs do not currently offer large parallelism, while GPUs offer limited parallelism due to sequential components of RNN models. In this paper we present a hardware implementation of Long-… ▽ More

    Submitted 4 March, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: 7 pages, 8 figures, changed format, added figures, added references, modified introduction

  17. arXiv:1412.5474  [pdf, other

    cs.NE cs.LG

    Flattened Convolutional Neural Networks for Feedforward Acceleration

    Authors: Jonghoon **, Aysegul Dundar, Eugenio Culurciello

    Abstract: We present flattened convolutional neural networks that are designed for fast feedforward execution. The redundancy of the parameters, especially weights of the convolutional filters in convolutional neural networks has been extensively studied and different heuristics have been proposed to construct a low rank basis of the filters after training. In this work, we train flattened networks that con… ▽ More

    Submitted 20 November, 2015; v1 submitted 17 December, 2014; originally announced December 2014.

    Comments: International Conference on Learning Representations (ICLR) 2015

  18. arXiv:1306.0152  [pdf, other

    cs.CV

    An Analysis of the Connections Between Layers of Deep Neural Networks

    Authors: Eugenio Culurciello, Jonghoon **, Aysegul Dundar, Jordan Bates

    Abstract: We present an analysis of different techniques for selecting the connection be- tween layers of deep neural networks. Traditional deep neural networks use ran- dom connection tables between layers to keep the number of connections small and tune to different image features. This kind of connection performs adequately in supervised deep networks because their values are refined during the training.… ▽ More

    Submitted 1 June, 2013; originally announced June 2013.

  19. arXiv:1301.2820  [pdf, other

    cs.CV

    Clustering Learning for Robotic Vision

    Authors: Eugenio Culurciello, Jordan Bates, Aysegul Dundar, Jose Carrasco, Clement Farabet

    Abstract: We present the clustering learning technique applied to multi-layer feedforward deep neural networks. We show that this unsupervised learning technique can compute network filters with only a few minutes and a much reduced set of parameters. The goal of this paper is to promote the technique for general-purpose robotic vision systems. We report its use in static image datasets and object tracking… ▽ More

    Submitted 13 March, 2013; v1 submitted 13 January, 2013; originally announced January 2013.

    Comments: Code for this paper is available here: https://github.com/culurciello/CL_paper1_code

  20. arXiv:1209.2696  [pdf, ps, other

    cs.CV cs.RO

    Visual Tracking with Similarity Matching Ratio

    Authors: Aysegul Dundar, Jonghoon **, Eugenio Culurciello

    Abstract: This paper presents a novel approach to visual tracking: Similarity Matching Ratio (SMR). The traditional approach of tracking is minimizing some measures of the difference between the template and a patch from the frame. This approach is vulnerable to outliers and drastic appearance changes and an extensive study is focusing on making the approach more tolerant to them. However, this often result… ▽ More

    Submitted 12 September, 2012; originally announced September 2012.