Skip to main content

Showing 1–50 of 56 results for author: De Souza, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04596  [pdf, ps, other

    cs.SE

    Teaching and Learning Ethnography for Software Engineering Contexts

    Authors: Yvonne Dittrich, Helen Sharp, Cleidson de Souza

    Abstract: Ethnography has become one of the established methods for empirical research on software engineering. Although there is a wide variety of introductory books available, there has been no material targeting software engineering students particularly, until now. In this chapter we provide an introduction to teaching and learning ethnography for faculty teaching ethnography to software engineering gra… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 38 pages, to be published in: Daniel Mendez, Paris Avgeriou, Marcos Kalinowski, and Nauman bin Ali (eds.) Teaching Empirical Research Methods in Software Engineering, Springer

  2. arXiv:2406.00049  [pdf, other

    cs.CL cs.LG

    QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

    Authors: Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins

    Abstract: An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

  3. arXiv:2402.17733  [pdf, other

    cs.CL

    Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

    Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

    Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2402.17420  [pdf, other

    cs.CV cs.AI

    PANDAS: Prototype-based Novel Class Discovery and Detection

    Authors: Tyler L. Hayes, César R. de Souza, Namil Kim, Jiwon Kim, Riccardo Volpi, Diane Larlus

    Abstract: Object detectors are typically trained once and for all on a fixed set of classes. However, this closed-world assumption is unrealistic in practice, as new classes will inevitably emerge after the detector is deployed in the wild. In this work, we look at ways to extend a detector trained for a set of base classes so it can i) spot the presence of novel classes, and ii) automatically enrich its re… ▽ More

    Submitted 30 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to the Conference on Lifelong Learning Agents (CoLLAs 2024)

  5. arXiv:2312.08472  [pdf, other

    cs.NE cs.LG math.NA

    AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

    Authors: Esteban Real, Yao Chen, Mirko Rossini, Connal de Souza, Manav Garg, Akhil Verghese, Moritz Firsching, Quoc V. Le, Ekin Dogus Cubuk, David H. Park

    Abstract: Computers calculate transcendental functions by approximating them through the composition of a few limited-precision instructions. For example, an exponential can be calculated with a Taylor series. These approximation methods were developed over the centuries by mathematicians, who emphasized the attainability of arbitrary precision. Computers, however, operate on few limited precision types, su… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    ACM Class: I.2.2; I.2.6; G.1.2

  6. arXiv:2311.18452  [pdf, other

    cs.SE

    Developer Experiences with a Contextualized AI Coding Assistant: Usability, Expectations, and Outcomes

    Authors: Gustavo Pinto, Cleidson de Souza, Thayssa Rocha, Igor Steinmacher, Alberto de Souza, Edward Monteiro

    Abstract: In the rapidly advancing field of artificial intelligence, software development has emerged as a key area of innovation. Despite the plethora of general-purpose AI assistants available, their effectiveness diminishes in complex, domain-specific scenarios. Noting this limitation, both the academic community and industry players are relying on contextualized coding AI assistants. These assistants su… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  7. arXiv:2311.18450  [pdf, other

    cs.SE

    Lessons from Building StackSpot AI: A Contextualized AI Coding Assistant

    Authors: Gustavo Pinto, Cleidson de Souza, João Batista Neto, Alberto de Souza, Tarcísio Gotto, Edward Monteiro

    Abstract: With their exceptional natural language processing capabilities, tools based on Large Language Models (LLMs) like ChatGPT and Co-Pilot have swiftly become indispensable resources in the software developer's toolkit. While recent studies suggest the potential productivity gains these tools can unlock, users still encounter drawbacks, such as generic or incorrect answers. Additionally, the pursuit o… ▽ More

    Submitted 4 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  8. arXiv:2310.13448  [pdf, other

    cs.CL

    Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

    Authors: Duarte M. Alves, Nuno M. Guerreiro, João Alves, José Pombal, Ricardo Rei, José G. C. de Souza, Pierre Colombo, André F. T. Martins

    Abstract: Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capa… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 - Findings

  9. arXiv:2310.11430  [pdf, other

    cs.CL

    An Empirical Study of Translation Hypothesis Ensembling with Large Language Models

    Authors: António Farinhas, José G. C. de Souza, André F. T. Martins

    Abstract: Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and A… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (main conference)

  10. arXiv:2309.11925  [pdf, other

    cs.CL

    Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

    Abstract: We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks,… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  11. ProWis: A Visual Approach for Building, Managing, and Analyzing Weather Simulation Ensembles at Runtime

    Authors: Carolina Veiga Ferreira de Souza, Suzanna Maria Bonnet, Daniel de Oliveira, Marcio Cataldi, Fabio Miranda, Marcos Lage

    Abstract: Weather forecasting is essential for decision-making and is usually performed using numerical modeling. Numerical weather models, in turn, are complex tools that require specialized training and laborious setup and are challenging even for weather experts. Moreover, weather simulations are data-intensive computations and may take hours to days to complete. When the simulation is finished, the expe… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted at IEEE VIS 2023

    Journal ref: Published in: IEEE Transactions on Visualization and Computer Graphics ( Volume: 30, Issue: 1, January 2024)

  12. arXiv:2305.00955  [pdf, other

    cs.CL cs.AI cs.LG

    Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

    Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

    Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod… ▽ More

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Work in Progress

  13. arXiv:2302.05488  [pdf

    cs.LG cs.AI cs.CV

    Element-Wise Attention Layers: an option for optimization

    Authors: Giovanni Araujo Bacochina, Rodrigo Clemente Thom de Souza

    Abstract: The use of Attention Layers has become a trend since the popularization of the Transformer-based models, being the key element for many state-of-the-art models that have been developed through recent years. However, one of the biggest obstacles in implementing these architectures - as well as many others in Deep Learning Field - is the enormous amount of optimizing parameters they possess, which m… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  14. arXiv:2302.05433  [pdf, other

    cs.LG cs.NE

    Unified Functional Hashing in Automatic Machine Learning

    Authors: Ryan Gillard, Stephen Jonany, Yingjie Miao, Michael Munn, Connal de Souza, Jonathan Dungay, Chen Liang, David R. So, Quoc V. Le, Esteban Real

    Abstract: The field of Automatic Machine Learning (AutoML) has recently attained impressive results, including the discovery of state-of-the-art machine learning solutions, such as neural image classifiers. This is often done by applying an evolutionary search method, which samples multiple candidate solutions from a large space and evaluates the quality of each candidate through a long training process. As… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    ACM Class: I.2.2; I.2.6

  15. arXiv:2209.12985  [pdf, other

    cs.CR

    A Bibliometrics Analysis on 28 years of Authentication and Threat Model Area

    Authors: Wesley dos Reis Bezerra, Cristiano Antônio de Souza, Carla Merkle Westphall, Carlos Becker Westphall

    Abstract: The large volume of publications in any research area can make it difficult for researchers to track their research areas' trends, challenges, and characteristics. Bibliometrics solves this problem by bringing statistical tools to help the analysis of selected publications from an online database. Although there are different works in security, our study aims to fill the bibliometric gap in the au… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  16. arXiv:2209.12984  [pdf, other

    cs.CR cs.SE

    Characteristics and Main Threats about Multi-Factor Authentication: A Survey

    Authors: Wesley dos Reis Bezerra, Cristiano Antônio de Souza, Carla Merkle Westphall, Carlos Becker Westphall

    Abstract: This work reports that the Systematic Literature Review process is responsible for providing theoretical support to research in the Threat Model and Multi-Factor Authentication. However, different from the related works, this study aims to evaluate the main characteristics of authentication solutions and their threat model. Also, it intends to list characteristics, threats, and related content to… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  17. arXiv:2209.06243  [pdf, other

    cs.CL cs.LG

    CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

    Abstract: We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equip** it w… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: WMT 2022 Quality Estimation shared task

  18. arXiv:2205.00978  [pdf, other

    cs.CL

    Quality-Aware Decoding for Neural Machine Translation

    Authors: Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, André F. T. Martins

    Abstract: Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: NAACL2022

  19. arXiv:2203.05446  [pdf, other

    cs.DS

    Algorithms for the Maximum Eulerian Cycle Decomposition Problem

    Authors: Pedro O. Pinheiro, Alexsandro Oliveira Alexandrino, Andre R. Oliveira, Cid C. de Souza, Zanoni Dias

    Abstract: Given an Eulerian graph G, in the Maximum Eulerian Cycle Decomposition problem, we are interested in finding a collection of edge-disjoint cycles {E_1, E_2, ..., E_k} in G such that all edges of G are in exactly one cycle and k is maximum. We present an algorithm to solve the pricing problem of a column generation Integer Linear Programming (ILP) model introduced by Lancia and Serafini (2016). Fur… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Journal ref: LIII S. Brasileiro de Pesquisa Operacional (SBPO 2021), Galoa, 2021. v. 53. p. 139228

  20. arXiv:2201.05658  [pdf, other

    cs.AI cs.CL

    Sequence-to-Sequence Models for Extracting Information from Registration and Legal Documents

    Authors: Ramon Pires, Fábio C. de Souza, Guilherme Rosa, Roberto A. Lotufo, Rodrigo Nogueira

    Abstract: A typical information extraction pipeline consists of token- or span-level classification models coupled with a series of pre- and post-processing scripts. In a production pipeline, requirements often change, with classes being added and removed, which leads to nontrivial modifications to the source code and the possible introduction of bugs. In this work, we evaluate sequence-to-sequence models a… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  21. arXiv:2012.03381  [pdf, other

    cs.CG

    Solving the Minimum Convex Partition of Point Sets with Integer Programming

    Authors: Allan Sapucaia, Pedro J. de Rezende, Cid C. de Souza

    Abstract: The partition of a problem into smaller sub-problems satisfying certain properties is often a key ingredient in the design of divide-and-conquer algorithms. For questions related to location, the partition problem can be modeled, in geometric terms, as finding a subdivision of a planar map -- which represents, say, a geographical area -- into regions subject to certain conditions while optimizing… ▽ More

    Submitted 6 December, 2020; originally announced December 2020.

    Comments: 28 pages, 14 figures, submitted for publication

  22. arXiv:2010.11677  [pdf, other

    cs.CY

    Second layer data governance for permissioned blockchains: the privacy management challenge

    Authors: Paulo Henrique Alves, Isabella Z. Frajhof, Fernando A. Correia, Clarisse de Souza, Helio Lopes

    Abstract: Data privacy is a trending topic in the internet era. Given such importance, many challenges emerged in order to collect, manage, process, and publish data. In this sense, personal data have got attention, and many regulations emerged, such as GDPR in the European Union and LGPD in Brazil. This regulation model aims to protect users' data from misusage and leakage and allow users to request an exp… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  23. arXiv:2007.13867  [pdf, other

    cs.CV cs.LG

    Robust Image Retrieval-based Visual Localization using Kapture

    Authors: Martin Humenberger, Yohann Cabon, Nicolas Guerin, Julien Morat, Vincent Leroy, Jérôme Revaud, Philippe Rerole, Noé Pion, Cesar de Souza, Gabriela Csurka

    Abstract: Visual localization tackles the challenge of estimating the camera pose from images by using correspondence analysis between query images and a map. This task is computation and data intensive which poses challenges on thorough evaluation of methods on various datasets. However, in order to further advance in the field, we claim that robust visual localization algorithms should be evaluated on mul… ▽ More

    Submitted 7 January, 2022; v1 submitted 27 July, 2020; originally announced July 2020.

  24. arXiv:2007.10816  [pdf, ps, other

    math.NT cs.DM eess.SP math.GR

    Infinite Sequences, Series Convergence and the Discrete Time Fourier Transform over Finite Fields

    Authors: R. M. Campello de Souza, M. M. Campello de Souza, H. M. de Oliveira, M. M. Vasconcelos

    Abstract: Digital Transforms have important applications on subjects such as channel coding, cryptography and digital signal processing. In this paper, two Fourier Transforms are considered, the discrete time Fourier transform (DTFT) and the finite field Fourier transform (FFFT). A finite field version of the DTFT is introduced and the FFFT is redefined with a complex kernel, which makes it a more appropria… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 8 pages. arXiv admin note: text overlap with arXiv:1502.03371

    MSC Class: 12E20; 11F80; 11F80; 40G05; 40A05 ACM Class: G.2

  25. Academic viewpoints and concerns on CSCW education and training in Latin America

    Authors: Francisco J. Gutierrez, Yazmin Magallanes, Laura S. Gaytán-Lugo, Claudia López, Cleidson R. B. de Souza

    Abstract: Computer-Supported Cooperative Work, or simply CSCW, is the research area that studies the design and use of socio-technical technology for supporting group work. CSCW has a long tradition in interdisciplinary work exploring technical, social, and theoretical challenges for the design of technologies to support cooperative and collaborative work and life activities. However, most of the research t… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: https://dl.acm.org/doi/abs/10.1145/3358961.3358971

  26. arXiv:1910.06699  [pdf, other

    cs.CV cs.LG cs.MM

    Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models

    Authors: César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Naila Murray, Antonio Manuel López

    Abstract: Deep video action recognition models have been highly successful in recent years but require large quantities of manually annotated data, which are expensive and laborious to obtain. In this work, we investigate the generation of synthetic training data for video action recognition, as synthetic data have been successfully used to supervise models for a variety of other computer vision tasks. We p… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

    Comments: Pre-print of the article accepted for publication in the Special Issue on Generating Realistic Visual Data of Human Behavior of the International Journal of Computer Vision (IJCV). arXiv admin note: substantial text overlap with arXiv:1612.00881

  27. arXiv:1907.07178  [pdf

    cs.AI cs.HC cs.LG

    Mediation Challenges and Socio-Technical Gaps for Explainable Deep Learning Applications

    Authors: Rafael Brandão, Joel Carbonera, Clarisse de Souza, Juliana Ferreira, Bernardo Gonçalves, Carla Leitão

    Abstract: The presumed data owners' right to explanations brought about by the General Data Protection Regulation in Europe has shed light on the social challenges of explainable artificial intelligence (XAI). In this paper, we present a case study with Deep Learning (DL) experts from a research and development laboratory focused on the delivery of industrial-strength AI technologies. Our aim was to investi… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 39 pages

  28. arXiv:1906.07589  [pdf, other

    cs.CV

    Learning with Average Precision: Training Image Retrieval with a Listwise Loss

    Authors: Jerome Revaud, Jon Almazan, Rafael Sampaio de Rezende, Cesar Roberto de Souza

    Abstract: Image retrieval can be formulated as a ranking problem where the goal is to order database images by decreasing similarity to the query. Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain. First, rather than directly optimizing the global ranking, they minimize an upper-… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

  29. arXiv:1906.06195  [pdf, other

    cs.CV

    R2D2: Repeatable and Reliable Detector and Descriptor

    Authors: Jerome Revaud, Philippe Weinzaepfel, César De Souza, Noe Pion, Gabriela Csurka, Yohann Cabon, Martin Humenberger

    Abstract: Interest point detection and local feature description are fundamental steps in many computer vision applications. Classical methods for these tasks are based on a detect-then-describe paradigm where separate handcrafted methods are used to first identify repeatable keypoints and then represent them with a local descriptor. Neural networks trained with metric learning losses have recently caught u… ▽ More

    Submitted 17 June, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  30. arXiv:1808.08138  [pdf

    cs.HC

    SigniFYI-CDN: merged communicability and usability methods to evaluate notation-intensive interaction

    Authors: Juliana Soares Jansen Ferreira, Clarisse Sieckenius de Souza, Rafael Rossi de Mello Brandão, Carla Faria Leitão

    Abstract: We present SigniFYI-CDN, an inspection method built from previously proposed methods combining Semiotic Engineering and the Cognitive Dimensions of Notations. Compared to its predecessors, SigniFYI-CDN simplifies procedural steps and supports them with more analytic scaffolds. It is especially fit for the study of interaction with technologies where notations are created and used by various people… ▽ More

    Submitted 25 August, 2022; v1 submitted 24 August, 2018; originally announced August 2018.

  31. arXiv:1808.05891  [pdf

    cs.SE

    The Case for API Communicability Evaluation: Introducing API-SI with Examples from Keras

    Authors: Luiz Marques Afonso, João Antonio Marcondes Dutra Bastos, Clarisse Sieckenius de Souza, Renato Fontoura de Gusmão Cerqueira

    Abstract: In addition to their vital role in professional software development, Application Programming Interfaces (APIs) are now increasingly used by non-professional programmers, including end users, scientists and experts from other domains. Therefore, good APIs must meet old and new user requirements. Most of the re-search on API evaluation and design derives from user-centered, cognitive perspectives o… ▽ More

    Submitted 17 August, 2018; originally announced August 2018.

  32. The Hamming and Golay Number-Theoretic Transforms

    Authors: A. J. A. Paschoal, R. M. Campello de Souza, H. M. de Oliveira

    Abstract: New number-theoretic transforms are derived from known linear block codes over finite fields. In particular, two new such transforms are built from perfect codes, namely the \textit {Hamming number-theoretic transform} and the \textit {Golay number-theoretic transform}. A few properties of these new transforms are presented.

    Submitted 25 September, 2018; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: 5 pages, 2 figures

    Report number: XXXVI Simp\'osio Brasileiro de Telecomunica\c{c}\~oes SBrT 2018 MSC Class: 11Txx; 11Yxx; 11H71; 11D04; 15Bxx

  33. arXiv:1702.01793  [pdf, ps, other

    cs.IT stat.AP

    Multiuser Communication Based on the DFT Eigenstructure

    Authors: R. M. Campello de Souza, H. M. de Oliveira, R. J. Cintra

    Abstract: The eigenstructure of the discrete Fourier transform (DFT) is examined and new systematic procedures to generate eigenvectors of the unitary DFT are proposed. DFT eigenvectors are suggested as user signatures for data communication over the real adder channel (RAC). The proposed multiuser communication system over the 2-user RAC is detailed.

    Submitted 6 February, 2017; originally announced February 2017.

    Comments: 5 pages, 2 figures, 3 tables

    MSC Class: 94A05; 94A11; 94A40

  34. arXiv:1612.00881  [pdf, other

    cs.CV

    Procedural Generation of Videos to Train Deep Action Recognition Networks

    Authors: César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López Peña

    Abstract: Deep learning for human action recognition in videos is making significant progress, but is slowed down by its dependency on expensive manual labeling of large video collections. In this work, we investigate the generation of synthetic training data for action recognition, as it has recently shown promising results for a variety of other computer vision tasks. We propose an interpretable parametri… ▽ More

    Submitted 19 July, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

    Comments: Accepted for publication at CVPR 2017. http://adas.cvc.uab.es/phav/

  35. arXiv:1608.07138  [pdf, other

    cs.CV

    Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition

    Authors: César Roberto de Souza, Adrien Gaidon, Eleonora Vig, Antonio Manuel López

    Abstract: Action recognition in videos is a challenging task due to the complexity of the spatio-temporal patterns to model and the difficulty to acquire and learn on large quantities of video data. Deep learning, although a breakthrough for image classification and showing promise for videos, has still not clearly superseded action recognition methods using hand-crafted features, even when training on mass… ▽ More

    Submitted 25 August, 2016; originally announced August 2016.

    Comments: Accepted for publication in the 14th European Conference on Computer Vision (ECCV), Amsterdam, 2016, plus supplementary material

  36. arXiv:1506.03865  [pdf, ps, other

    cs.CG

    Counterexample for the 2-approximation of finding partitions of rectilinear polygons with minimum stabbing number

    Authors: Breno Piva, Cid C. de Souza

    Abstract: This paper presents a counterexample for the approximation algorithm proposed by Durocher and Mehrabi [1] for the general problem of finding a rectangular partition of a rectilinear polygon with minimum stabbing number.

    Submitted 11 June, 2015; originally announced June 2015.

  37. arXiv:1505.04140  [pdf, other

    cs.IT

    Efficient Multiplex for Band-Limited Channels: Galois-Field Division Multiple Access

    Authors: H. M. de Oliveira, R. M. Campello de Souza, A. N. Kauffman

    Abstract: A new Efficient-bandwidth code-division-multiple-access (CDMA) for band-limited channels is introduced which is based on finite field transforms. A multilevel code division multiplex exploits orthogonality properties of nonbinary sequences defined over a complex finite field. Galois-Fourier transforms contain some redundancy and just cyclotomic coefficients are needed to be transmitted yielding co… ▽ More

    Submitted 15 May, 2015; originally announced May 2015.

    Comments: 6 pages, 5 figures, in: Workshop on Coding and Cryptography, INRIA, 1999, Paris. pp.235-241. arXiv admin note: text overlap with arXiv:1502.05881

  38. arXiv:1503.08109  [pdf

    cs.IT math.NT

    Spread-Spectrum Based on Finite Field Fourier Transforms

    Authors: H. M. de Oliveira, J. P. C. L. Miranda, R. M. Campello de Souza

    Abstract: Spread-spectrum systems are presented, which are based on Finite Field Fourier Transforms. Orthogonal spreading sequences defined over a finite field are derived. New digital multiplex schemes based on such spread-spectrum systems are also introduced, which are multilevel Coding Division Multiplex. These schemes termed Galois-field Division Multiplex (GDM) offer compact bandwidth requirements beca… ▽ More

    Submitted 12 February, 2015; originally announced March 2015.

    Comments: 6 pages, 7 figures. Int. Conf. on System Engineering, Comm. and. Info. Technol., Punta Arenas, Chile, 2001

  39. arXiv:1503.07551  [pdf

    cs.MM cs.CR

    A Low-throughput Wavelet-based Steganography Audio Scheme

    Authors: P. Carrion, H. M. de Oliveira, R. M. Campello de Souza

    Abstract: This paper presents the preliminary of a novel scheme of steganography, and introduces the idea of combining two secret keys in the operation. The first secret key encrypts the text using a standard cryptographic scheme (e.g. IDEA, SAFER+, etc.) prior to the wavelet audio decomposition. The way in which the cipher text is embedded in the file requires another key, namely a stego-key, which is asso… ▽ More

    Submitted 4 February, 2015; originally announced March 2015.

    Comments: 2 pages, 1 figure, conference: 8th Brazilian Symposium on Information and Computer System Security, 2008, Gramado, RS, Brazil

  40. arXiv:1503.03794  [pdf

    cs.DM

    Radix-2 Fast Hartley Transform Revisited

    Authors: H. M. de Oliveira, V. L. Sousa, H. A. N., R. M. Campello de Souza

    Abstract: A Fast algorithm for the Discrete Hartley Transform (DHT) is presented, which resembles radix-2 fast Fourier Transform (FFT). Although fast DHTs are already known, this new approach bring some light about the deep relationship between fast DHT algorithms and a multiplication-free fast algorithm for the Hadamard Transform.

    Submitted 12 March, 2015; originally announced March 2015.

    Comments: 5 pages, 4 figures: Anais do I Congresso de Informática da Amazônia, 2001. v.1.pp.285-292

  41. arXiv:1503.03763  [pdf

    cs.DM cs.DS eess.SP

    The Discrete Cosine Transform over Prime Finite Fields

    Authors: M. M. Campello de Souza, H. M. de Oliveira, R. M. Campello de Souza, M. M. Vasconcelos

    Abstract: This paper examines finite field trigonometry as a tool to construct trigonometric digital transforms. In particular, by using properties of the k-cosine function over GF(p), the Finite Field Discrete Cosine Transform (FFDCT) is introduced. The FFDCT pair in GF(p) is defined, having blocklengths that are divisors of (p+1)/2. A special case is the Mersenne FFDCT, defined when p is a Mersenne prime.… ▽ More

    Submitted 12 March, 2015; originally announced March 2015.

    Comments: 5 pages, 1 table, Lecture Notes in Computer Science, LNCS 3124, Heidelberg: Springer Verlag, 2004, vol.1, pp.482-487, 2004

  42. arXiv:1503.03293  [pdf

    cs.IT cs.DM

    Fourier Codes

    Authors: R. M. Campello de Souza, E. S. V. Freire, H. M. de Oliveira

    Abstract: A new family of error-correcting codes, called Fourier codes, is introduced. The code parity-check matrix, dimension and an upper bound on its minimum distance are obtained from the eigenstructure of the Fourier number theoretic transform. A decoding technique for such codes is proposed.

    Submitted 11 March, 2015; originally announced March 2015.

    Comments: 6 pages, 2 tables. In: 10th International Symposium on Communication Theory and Applications 2009, Ambleside, Lake District, UK

  43. arXiv:1503.02577  [pdf, ps, other

    cs.DM cs.DS eess.SP stat.ME

    New Algorithms for Computing a Single Component of the Discrete Fourier Transform

    Authors: G. Jerônimo da Silva Jr., R. M. Campello de Souza, H. M. de Oliveira

    Abstract: This paper introduces the theory and hardware implementation of two new algorithms for computing a single component of the discrete Fourier transform. In terms of multiplicative complexity, both algorithms are more efficient, in general, than the well known Goertzel Algorithm.

    Submitted 9 March, 2015; originally announced March 2015.

    Comments: 4 pages, 3 figures, 1 table. In: 10th International Symposium on Communication Theory and Applications, Ambleside, UK

  44. arXiv:1503.02536  [pdf

    q-bio.OT cs.CE

    Genomic Imaging Based on Codongrams and a^2grams

    Authors: E. A. Bouton, H. M. de Oliveira, R. M. Campello de Souza, N. S. Santos-Magalhaes

    Abstract: This paper introduces new tools for genomic signal processing, which can assist for genomic attribute extracting or describing biologically meaningful features embedded in a DNA. The codongrams and a2grams are offered as an alternative to spectrograms and scalograms. Twenty different a^2grams are defined for a genome, one for each amino acid (valgram is an a^2gram for valine; alagram is an a^2gram… ▽ More

    Submitted 5 March, 2015; originally announced March 2015.

    Comments: 7 pages, 3 figures

    Journal ref: WSEAS Trans. on Biology and Biomedicine, vol.1, n.2, pp.255-260, April 2004

  45. arXiv:1502.05881  [pdf

    cs.IT

    Orthogonal Multilevel Spreading Sequence Design

    Authors: H. M. de Oliveira, R. M. Campello de Souza

    Abstract: Finite field transforms are offered as a new tool of spreading sequence design. This approach exploits orthogonality properties of synchronous non-binary sequences defined over a complex finite field. It is promising for channels supporting a high signal-to-noise ratio. New digital multiplex schemes based on such sequences have also been introduced, which are multilevel Code Division Multiplex. Th… ▽ More

    Submitted 20 February, 2015; originally announced February 2015.

    Comments: 9 pages, 5 figures. In: Coding, Communication and Broadcasting.1 ed.Hertfordshire: Reseach Studies Press (RSP), 2000. ISBN 0-86380-259-1

  46. arXiv:1502.05880  [pdf

    math.NA cs.DM eess.SP

    A Flexible Implementation of a Matrix Laurent Series-Based 16-Point Fast Fourier and Hartley Transforms

    Authors: R. C. de Oliveira, H. M. de Oliveira, R. M. Campello de Souza, E. J. P. Santos

    Abstract: This paper describes a flexible architecture for implementing a new fast computation of the discrete Fourier and Hartley transforms, which is based on a matrix Laurent series. The device calculates the transforms based on a single bit selection operator. The hardware structure and synthesis are presented, which handled a 16-point fast transform in 65 nsec, with a Xilinx SPARTAN 3E device.

    Submitted 20 February, 2015; originally announced February 2015.

    Comments: 4 pages, 4 figures. IEEE VI Southern Programmable Logic Conference 2010

  47. arXiv:1502.04670  [pdf

    math.NT cs.DM

    The Hartley Transform in a Finite Field

    Authors: R. M. Campello de Souza, H. M. de Oliveira, A. N. Kauffman

    Abstract: The k-trigonometric functions over the Galois Field GF(q) are introduced and their main properties derived. This leads to the definition of the cask(.) function over GF(q), which in turn leads to a finite field Hartley Transform. The main properties of this new discrete transform are presented and areas for possible applications are mentioned.

    Submitted 16 February, 2015; originally announced February 2015.

    Comments: 7 pages, IEEE/SBT International Telecommunication Symposium, ITS, 1998, Sao Paulo, Brazil

    Journal ref: Journal of Communication and Information Systems, vol.14, N.1, 1999

  48. A Full Frequency Masking Vocoder for Legal Eavesdrop** Conversation Recording

    Authors: R. F. B. Sotero Filho, H. M. de Oliveira, R. M. Campello de Souza

    Abstract: This paper presents a new approach for a vocoder design based on full frequency masking by octaves in addition to a technique for spectral filling via beta probability distribution. Some psycho-acoustic characteristics of human hearing - inaudibility masking in frequency and phase - are used as a basis for the proposed algorithm. The results confirm that this technique may be useful to save bandwi… ▽ More

    Submitted 11 February, 2015; originally announced February 2015.

    Comments: 7 pages, 3 figures, 3 tables, XXXV Cong. Nac. de Matematica Aplicada e Computacional, Natal, RN, Brazil 2014

  49. arXiv:1502.02489  [pdf

    cs.IT

    Fourier Codes and Hartley Codes

    Authors: H. M. de Oliveira, C. M. F. Barros, R. M. Campello de Souza

    Abstract: Real-valued block codes are introduced, which are derived from Discrete Fourier Transforms (DFT) and Discrete Hartley Transforms (DHT). These algebraic structures are built from the eigensequences of the transforms. Generator and parity check matrices were computed for codes up to block length N=24. They can be viewed as lattices codes so the main parameters (dimension, minimal norm, area of the V… ▽ More

    Submitted 9 February, 2015; originally announced February 2015.

    Comments: 5 pages, 4 tables, 1 appedix. conference: XXV Simposio Brasileiro de Telecomunicacoes, SBrT'07, Recife, PE, Brazil, 2007

  50. arXiv:1502.02168  [pdf, other

    cs.DS cs.DM

    Multilayer Hadamard Decomposition of Discrete Hartley Transforms

    Authors: H. M. de Oliveira, R. J. Cintra, R. M. Campello de Souza

    Abstract: Discrete transforms such as the discrete Fourier transform (DFT) or the discrete Hartley transform (DHT) furnish an indispensable tool in signal processing. The successful application of transform techniques relies on the existence of the so-called fast transforms. In this paper some fast algorithms are derived which meet the lower bound on the multiplicative complexity of the DFT/DHT. The approac… ▽ More

    Submitted 26 August, 2015; v1 submitted 7 February, 2015; originally announced February 2015.

    Comments: Fixed several typos. 7 pages, 5 figures, XVIII Simpósio Brasileiro de Telecomunicações, 2000, Gramado, RS, Brazil