Search | arXiv e-print repository

A research infrastructure for generating and sharing diversity-aware data

Authors: Matteo Busso, Ronal Chenu Abente Acosta, Amalia de Götzen

Abstract: The intensive flow of personal data associated with the trend of computerizing aspects of people's diversity in their daily lives is associated with issues concerning not only people protection and their trust in new technologies, but also bias in the analysis of data and problems in their management and reuse. Faced with a complex problem, the strategies adopted, including technologies and servic… ▽ More The intensive flow of personal data associated with the trend of computerizing aspects of people's diversity in their daily lives is associated with issues concerning not only people protection and their trust in new technologies, but also bias in the analysis of data and problems in their management and reuse. Faced with a complex problem, the strategies adopted, including technologies and services, often focus on individual aspects, which are difficult to integrate into a broader framework, which can be of effective support for researchers and developers. Therefore, we argue for the development of an end-to-end research infrastructure (RI) that enables trustworthy diversity-aware data within a citizen science community. △ Less

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2207.07489 [pdf]

doi 10.1038/s41467-022-35356-5

How ornithopters can perch autonomously on a branch

Authors: Raphael Zufferey, Jesus Tormo Barbero, Daniel Feliu Talegon, Saeed Rafee Nekoo, Jose Angel Acosta, Anibal Ollero

Abstract: Flap** wings are a bio-inspired method to produce lift and thrust in aerial robots, leading to quiet and efficient motion. The advantages of this technology are safety and maneuverability, and physical interaction with the environment, humans, and animals. However, to enable substantial applications, these robots must perch and land. Despite recent progress in the perching field, flap**-wing v… ▽ More Flap** wings are a bio-inspired method to produce lift and thrust in aerial robots, leading to quiet and efficient motion. The advantages of this technology are safety and maneuverability, and physical interaction with the environment, humans, and animals. However, to enable substantial applications, these robots must perch and land. Despite recent progress in the perching field, flap**-wing vehicles, or ornithopters, are to this day unable to stop their flight on a branch. In this paper, we present a novel method that defines a process to reliably and autonomously land an ornithopter on a branch. This method describes the joint operation of a flap**-flight controller, a close-range correction system and a passive claw appendage. Flight is handled by a triple pitch-yaw-altitude controller and integrated body electronics, permitting perching at 3 m/s. The close-range correction system, with fast optical branch sensing compensates for position misalignment when landing. This is complemented by a passive bistable claw design can lock and hold 2 Nm of torque, grasp within 25 ms and can re-open thanks to an integrated tendon actuation. The perching method is supplemented by a four-step experimental development process which optimizes for a successful design. We validate this method with a 700 g ornithopter and demonstrate the first autonomous perching flight of a flap**-wing robot on a branch, a result replicated with a second robot. This work paves the way towards the application of flap**-wing robots for long-range missions, bird observation, manipulation, and outdoor flight. △ Less

Submitted 15 July, 2022; originally announced July 2022.

Journal ref: Nat Commun 13, 7713 (2022)

arXiv:2104.04731 [pdf, other]

Joint Program and Layout Transformations to enable Convolutional Operators on Specialized Hardware based on Constraint Programming

Authors: Dennis Rieber, Axel Acosta, Holger Fröning

Abstract: The success of Deep Artificial Neural Networks (DNNs) in many domains created a rich body of research concerned with hardware accelerators for compute-intensive DNN operators. However, implementing such operators efficiently with complex hardware intrinsics such as matrix multiply is a task not yet automated gracefully. Solving this task often requires joint program and data layout transformations… ▽ More The success of Deep Artificial Neural Networks (DNNs) in many domains created a rich body of research concerned with hardware accelerators for compute-intensive DNN operators. However, implementing such operators efficiently with complex hardware intrinsics such as matrix multiply is a task not yet automated gracefully. Solving this task often requires joint program and data layout transformations. First solutions to this problem have been proposed, such as TVM, UNIT or ISAMIR, which work on a loop-level representation of operators and specify data layout and possible program transformations before the embedding into the operator is performed. This top-down approach creates a tension between exploration range and search space complexity, especially when also exploring data layout transformations such as im2col, channel packing or padding. In this work, we propose a new approach to this problem. We created a bottom-up method that allows the joint transformation of both compuation and data layout based on the found embedding. By formulating the embedding as a constraint satisfaction problem over the scalar dataflow, every possible embedding solution is contained in the search space. Adding additional constraints and optmization targets to the solver generates the subset of preferable solutions. An evaluation using the VTA hardware accelerator with the Baidu DeepBench inference benchmark shows that our approach can automatically generate code competitive to reference implementations. Further, we show that dynamically determining the data layout based on intrinsic and workload is beneficial for hardware utilization and performance. In cases where the reference implementation has low hardware utilization due to its fixed deployment strategy, we achieve a geomean speedup of up to x2.813, while individual operators can improve as much as x170. △ Less

Submitted 26 August, 2021; v1 submitted 10 April, 2021; originally announced April 2021.

Comments: 25 Pages

arXiv:2104.03780 [pdf, other]

Enabling Cross-Domain Communication: How to Bridge the Gap between AI and HW Engineers

Authors: Michael J. Klaiber, Axel J. Acosta, Ingo Feldner, Falk Rehm

Abstract: A key issue in system design is the lack of communication between hardware, software and domain expert. Recent research work shows progress in automatic HW/SW co-design flows of neural accelerators that seems to make this kind of communication obsolete. Most real-world systems, however, are a composition of multiple processing units, communication networks and memories. A HW/SW co-design process o… ▽ More A key issue in system design is the lack of communication between hardware, software and domain expert. Recent research work shows progress in automatic HW/SW co-design flows of neural accelerators that seems to make this kind of communication obsolete. Most real-world systems, however, are a composition of multiple processing units, communication networks and memories. A HW/SW co-design process of (reconfigurable) neural accelerators, therefore, is an important sub-problem towards a common co-design methodology. The ultimate challenge is to define the constraints for the design space exploration on system level - a task which requires deep knowledge and understanding of hardware architectures, map** of workloads onto hardware and the application domain, e.g. artificial intelligence. For most projects, these skills are distributed among several people or even different teams which is one of the major reasons why there is no established end-to-end development methodology for digital systems. This position paper discusses possibilities how to establish such a methodology for systems that include (reconfigurable) dedicated accelerators and outlines the central role that languages and tools play in the process. △ Less

Submitted 8 April, 2021; originally announced April 2021.

Comments: LATTE 2021 Workshop on Languages, Tools, and Techniques for Accelerator Design

arXiv:2011.15019 [pdf, other]

doi 10.1109/ACCESS.2022.3159695

Burning graphs through farthest-first traversal

Authors: Jesús García Díaz, Julio César Pérez Sansalvador, Lil María Xibai Rodríguez Henríquez, José Alejandro Cornejo Acosta

Abstract: The graph burning problem is an NP-hard combinatorial optimization problem that helps quantify the vulnerability of a graph to contagion. This paper introduces a simple farthest-first traversal-based approximation algorithm for this problem over general graphs. We refer to this proposal as the Burning Farthest-First (BFF) algorithm. BFF runs in $O(n^3)$ steps and has an approximation factor of… ▽ More The graph burning problem is an NP-hard combinatorial optimization problem that helps quantify the vulnerability of a graph to contagion. This paper introduces a simple farthest-first traversal-based approximation algorithm for this problem over general graphs. We refer to this proposal as the Burning Farthest-First (BFF) algorithm. BFF runs in $O(n^3)$ steps and has an approximation factor of $3-2/b(G)$, where $b(G)$ is the size of an optimal solution. Despite its simplicity, BFF tends to generate near-optimal solutions when tested over some benchmark datasets; in fact, it returns similar solutions to those returned by much more elaborated heuristics from the literature. △ Less

Submitted 13 December, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

MSC Class: 05C85

arXiv:1910.11632 [pdf, other]

doi 10.1145/3372394.3372396

An End-to-End HW/SW Co-Design Methodology to Design Efficient Deep Neural Network Systems using Virtual Models

Authors: Michael J. Klaiber, Sebastian Vogel, Axel Acosta, Robert Korn, Leonardo Ecco, Kristine Back, Andre Guntoro, Ingo Feldner

Abstract: End-to-end performance estimation and measurement of deep neural network (DNN) systems become more important with increasing complexity of DNN systems consisting of hardware and software components. The methodology proposed in this paper aims at a reduced turn-around time for evaluating different design choices of hardware and software components of DNN systems. This reduction is achieved by movin… ▽ More End-to-end performance estimation and measurement of deep neural network (DNN) systems become more important with increasing complexity of DNN systems consisting of hardware and software components. The methodology proposed in this paper aims at a reduced turn-around time for evaluating different design choices of hardware and software components of DNN systems. This reduction is achieved by moving the performance estimation from the implementation phase to the concept phase by employing virtual hardware models instead of gathering measurement results from physical prototypes. Deep learning compilers introduce hardware-specific transformations and are, therefore, considered a part of the design flow of virtual system models to extract end-to-end performance estimations. To validate the run-time accuracy of the proposed methodology, a system processing the DilatedVGG DNN is realized both as virtual system model and as hardware implementation. The results show that up to 92 % accuracy can be reached in predicting the processing time of the DNN inference. △ Less

Submitted 18 November, 2019; v1 submitted 25 October, 2019; originally announced October 2019.

Journal ref: Embedded Systems Week 2019, INTelligent Embedded Systems Architectures and Applications Workshop 2019

arXiv:1711.06045 [pdf, other]

Frame Interpolation with Multi-Scale Deep Loss Functions and Generative Adversarial Networks

Authors: Joost van Amersfoort, Wenzhe Shi, Alejandro Acosta, Francisco Massa, Johannes Totz, Zehan Wang, Jose Caballero

Abstract: Frame interpolation attempts to synthesise frames given one or more consecutive video frames. In recent years, deep learning approaches, and notably convolutional neural networks, have succeeded at tackling low- and high-level computer vision problems including frame interpolation. These techniques often tackle two problems, namely algorithm efficiency and reconstruction quality. In this paper, we… ▽ More Frame interpolation attempts to synthesise frames given one or more consecutive video frames. In recent years, deep learning approaches, and notably convolutional neural networks, have succeeded at tackling low- and high-level computer vision problems including frame interpolation. These techniques often tackle two problems, namely algorithm efficiency and reconstruction quality. In this paper, we present a multi-scale generative adversarial network for frame interpolation (\mbox{FIGAN}). To maximise the efficiency of our network, we propose a novel multi-scale residual estimation module where the predicted flow and synthesised frame are constructed in a coarse-to-fine fashion. To improve the quality of synthesised intermediate video frames, our network is jointly supervised at different levels with a perceptual loss function that consists of an adversarial and two content losses. We evaluate the proposed approach using a collection of 60fps videos from YouTube-8m. Our results improve the state-of-the-art accuracy and provide subjective visual quality comparable to the best performing interpolation method at x47 faster runtime. △ Less

Submitted 26 February, 2019; v1 submitted 16 November, 2017; originally announced November 2017.

arXiv:1611.05250 [pdf, other]

Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

Authors: Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi

Abstract: Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while m… ▽ More Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency. △ Less

Submitted 10 April, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

Comments: Changes: * Uploaded Vid4 results (footnote 1). * Added references [14, 29] as spatial-transformer prior art. * Fixed typos

arXiv:1609.04802 [pdf, other]

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Authors: Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi

Abstract: Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. R… ▽ More Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. Recent work has largely focused on minimizing the mean squared reconstruction error. The resulting estimates have high peak signal-to-noise ratios, but they are often lacking high-frequency details and are perceptually unsatisfying in the sense that they fail to match the fidelity expected at the higher resolution. In this paper, we present SRGAN, a generative adversarial network (GAN) for image super-resolution (SR). To our knowledge, it is the first framework capable of inferring photo-realistic natural images for 4x upscaling factors. To achieve this, we propose a perceptual loss function which consists of an adversarial loss and a content loss. The adversarial loss pushes our solution to the natural image manifold using a discriminator network that is trained to differentiate between the super-resolved images and original photo-realistic images. In addition, we use a content loss motivated by perceptual similarity instead of similarity in pixel space. Our deep residual network is able to recover photo-realistic textures from heavily downsampled images on public benchmarks. An extensive mean-opinion-score (MOS) test shows hugely significant gains in perceptual quality using SRGAN. The MOS scores obtained with SRGAN are closer to those of the original high-resolution images than to those obtained with any state-of-the-art method. △ Less

Submitted 25 May, 2017; v1 submitted 15 September, 2016; originally announced September 2016.

Comments: 19 pages, 15 figures, 2 tables, accepted for oral presentation at CVPR, main paper + some supplementary material

arXiv:1101.2711 [pdf]

doi 10.3989/redc.2013.1.876

A Proposal to Classify Latinamerican Scientific Journals using Citation Indicators: Case Study in Colombia

Authors: Mauricio Romero-Torres, Maria Alejandra Tejada, Alberto Acosta

Abstract: Colombian scientific journals are poorly represented in international digital libraries; however, through Google Scholar (GS) it is possible to determine their use by the community. Between the years of 2003 and 2007 a classification of 185 Colombian journals indexed in the Colombian National Bibliographical Index (IBNP) was performed using the information provided by GS, basing categorization on… ▽ More Colombian scientific journals are poorly represented in international digital libraries; however, through Google Scholar (GS) it is possible to determine their use by the community. Between the years of 2003 and 2007 a classification of 185 Colombian journals indexed in the Colombian National Bibliographical Index (IBNP) was performed using the information provided by GS, basing categorization on size indicators, indexation and citation. The indicators were analyzed by grou** the journals in two general areas: sciences and social sciences. In each area, the indicators provided by the digital libraries Scopus, Redalyc and Scielo were compared. Additionally, the indicators provided by IBNP journals categories (A1, A2, B and C) were also compared. The sciences and social sciences had a similar pattern in their indicators. The existence of positive correlations was established between some indicators and they predicted that the number of citations per journal in GS and the h index depends on its visibility in GS and Scopus. We put forward that the current IBNP categories (A1, A2, B or C) faintly reflect the use of journals by the community and we propose a classification based on the h index as an infometric indicator, which reflects not only its visibility in Google Scholar, but also its inclusion in certain international digital libraries, particularly Scopus. Our results may be applied to the creation of public policies regarding science and technology in Colombia and in develo** countries. △ Less

Submitted 13 January, 2011; originally announced January 2011.

Comments: 27 pages, 8 tables

MSC Class: 68

Showing 1–10 of 10 results for author: Acosta, A