Skip to main content

Showing 1–22 of 22 results for author: Miranda, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10118  [pdf, other

    cs.CL

    SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

    Authors: Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse , et al. (36 additional authors not shown)

    Abstract: Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due t… ▽ More

    Submitted 8 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: https://github.com/SEACrowd

  2. arXiv:2405.09559  [pdf, other

    eess.SP cs.LG

    KID-PPG: Knowledge Informed Deep Learning for Extracting Heart Rate from a Smartwatch

    Authors: Christodoulos Kechris, Jonathan Dan, Jose Miranda, David Atienza

    Abstract: Accurate extraction of heart rate from photoplethysmography (PPG) signals remains challenging due to motion artifacts and signal degradation. Although deep learning methods trained as a data-driven inference problem offer promising solutions, they often underutilize existing knowledge from the medical and signal processing community. In this paper, we address three shortcomings of deep learning mo… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  3. arXiv:2404.12503  [pdf, other

    cs.AR

    STRELA: STReaming ELAstic CGRA Accelerator for Embedded Systems

    Authors: Daniel Vazquez, Jose Miranda, Alfonso Rodriguez, Andres Otero, Pascuale Davide Schiavone, David Atienza

    Abstract: Reconfigurable computing offers a good balance between flexibility and energy efficiency. When combined with software-programmable devices such as CPUs, it is possible to obtain higher performance by spatially distributing the parallelizable sections of an application throughout the reconfigurable device while the CPU is in charge of control-intensive sections. This work introduces an elastic Coar… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 14 pages, 11 figures

  4. arXiv:2311.07171  [pdf, other

    cs.CL

    calamanCy: A Tagalog Natural Language Processing Toolkit

    Authors: Lester James V. Miranda

    Abstract: We introduce calamanCy, an open-source toolkit for constructing natural language processing (NLP) pipelines for Tagalog. It is built on top of spaCy, enabling easy experimentation and integration with other frameworks. calamanCy addresses the development gap by providing a consistent API for building NLP applications and offering general-purpose multitask models with out-of-the-box support for dep… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: To be published in The Third Workshop for NLP-OSS at EMNLP 2023

  5. arXiv:2311.07161  [pdf, other

    cs.CL

    Develo** a Named Entity Recognition Dataset for Tagalog

    Authors: Lester James V. Miranda

    Abstract: We present the development of a Named Entity Recognition (NER) dataset for Tagalog. This corpus helps fill the resource gap present in Philippine languages today, where NER resources are scarce. The texts were obtained from a pretraining corpora containing news reports, and were labeled by native speakers in an iterative fashion. The resulting dataset contains ~7.8k documents across three entity t… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: To be published in The First Workshop for Southeast Asian Language Processing 2023 at IJCNLP-AACL

  6. arXiv:2309.13464  [pdf, other

    cs.HC cs.AI cs.LG

    Personalised and Adjustable Interval Type-2 Fuzzy-Based PPG Quality Assessment for the Edge

    Authors: Jose A. Miranda, Celia López-Ongil, Javier Andreu-Perez

    Abstract: Most of today's wearable technology provides seamless cardiac activity monitoring. Specifically, the vast majority employ Photoplethysmography (PPG) sensors to acquire blood volume pulse information, which is further analysed to extract useful and physiologically related features. Nevertheless, PPG-based signal reliability presents different challenges that strongly affect such data processing. Th… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Journal ref: 2023 IEEE International Conference on Fuzzy Systems (FUZZ 2023)

  7. A Game-Based Learning Application to Help Learners to Practice Mathematical Patterns and Structures

    Authors: Adrian S. Lozano, Reister Justine B. Canlas, Kimberly M. Coronel, Justin M. Canlas, Jerico G. Duya, Regina C. Macapagal, Ericson M. Dungca, John Paul P. Miranda

    Abstract: Purpose - The purpose of this study is to develop a game-based mobile application to help learners practice mathematical patterns and structures. Method - The study followed a mixed-method research design and prototy** methodology to guide the study in develo** the mobile application. An instrument based on the Octalysis framework was developed as an evaluation tool for the study. Results… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 15 pages, 5 figures, journal article, 2 tables

    Journal ref: International Journal of Computing Sciences Research 7 (2023) 1-15

  8. arXiv:2212.09255  [pdf, other

    cs.CL

    Multi hash embeddings in spaCy

    Authors: Lester James Miranda, Ákos Kádár, Adriane Boyd, Sofie Van Landeghem, Anders Søgaard, Matthew Honnibal

    Abstract: The distributed representation of symbols is one of the key technologies in machine learning systems today, playing a pivotal role in modern natural language processing. Traditional word embeddings associate a separate vector with each word. While this approach is simple and leads to good performance, it requires a lot of memory for representing a large vocabulary. To reduce the memory footprint,… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    ACM Class: I.2.7

  9. Datasets of Fire and Crime Incidents in Pampanga, Philippines

    Authors: John Paul P. Miranda, Julieta M. Umali, Aileen P. de Leon

    Abstract: The fire and crime incident datasets were requested and collected from two Philippine regional agencies (i.e., the Bureau of Fire Protection and the Philippine National Police). The datasets were used to initially analyze and map both fire and crime incidents within the province of Pampanga for a specific time frame. Several data preparation, normalization, and data cleaning steps were implemented… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 10 pages, 10 citations, 5 figures, 1 table, journal article, peer-reviewed

    Journal ref: International Journal of Computing Sciences Research, 2022

  10. Development of Augmented Reality Application for Made-to-Order Furniture Industry in Pampanga, Philippines

    Authors: Jaymark A. Yambao, John Paul P. Miranda, Earl Lawrence B. Pelayo

    Abstract: The focus of the study was to develop a mobile application utilizing marker-less augmented reality for specific made-to-order products to support furniture and fixtures businesses. The study implemented mixed-methodology to properly identify the various stakeholders' considerations in develo** the application. Interviews with key informants were conducted to ensure that the features were appropr… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: 11 pages, 3 figures, journal article, short paper

    Journal ref: International Journal of Computing Sciences Research 6 (2022) 1-11

  11. arXiv:2203.00456  [pdf, other

    cs.HC eess.SP

    WEMAC: Women and Emotion Multi-modal Affective Computing dataset

    Authors: Jose A. Miranda, Esther Rituerto-González, Laura Gutiérrez-Martín, Clara Luis-Mingueza, Manuel F. Canabal, Alberto Ramírez Bárcenas, Jose M. Lanza-Gutiérrez, Carmen Peláez-Moreno, Celia López-Ongil

    Abstract: Among the seventeen Sustainable Development Goals (SDGs) proposed within the 2030 Agenda and adopted by all the United Nations member states, the Fifth SDG is a call for action to turn Gender Equality into a fundamental human right and an essential foundation for a better world. It includes the eradication of all types of violence against women. Within this context, the UC3M4Safety research team a… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 March, 2022; originally announced March 2022.

  12. Dataset of Philippine Presidents Speeches from 1935 to 2016

    Authors: John Paul P. Miranda

    Abstract: The dataset was collected to examine and identify possible key topics within these texts. Data preparation such as data cleaning, transformation, tokenization, removal of stop words from both English and Filipino, and word stemming was employed in the dataset before feeding it to sentiment analysis and the LDA model. The topmost occurring word within the dataset is "development" and there are thre… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 11 pages, 4 figures, 4 tables, dataset

    Journal ref: International Journal of Computing Sciences Research 5 (2021) 1-11

  13. Towards the Development of 3D Engine Assembly Simulation Learning Module for Senior High School

    Authors: John Paul P. Miranda, Jaymark A. Yambao, Jhon Asley M. Marcelo, Christopher Robert N. Gonzales, Vee-jay T. Mungcal

    Abstract: The focus of the study is to develop a 3D engine assembly simulation learning module to address the lack of equipment in one senior high school in the Philippines. The study used mixed-method to determine the considerations needed in develo** an application for educational use particularly among laboratory/practical subjects like engine assembly. The study used ISO 25010 quality standards in eva… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 13 pages, 10 figures

    Journal ref: International Journal of Computing Sciences Research (ISSN print: 2546-0552; ISSN online: 2546-115X) Vol. 5, No. 1, 2021

  14. arXiv:1910.05571  [pdf, other

    cs.LG

    Geomancer: An Open-Source Framework for Geospatial Feature Engineering

    Authors: Lester James V. Miranda, Mark Steve Samson, Alfiero K. Orden II, Bianca S. Silmaro, Ram K. De Guzman III, Stephanie S. Sy

    Abstract: This paper presents Geomancer, an open-source framework for geospatial feature engineering. It simplifies the acquisition of geospatial attributes for downstream, large-scale machine learning tasks. Geomancer leverages any geospatial dataset stored in a data warehouse, users need only to define the features (Spells) they want to create, and cast them on any spatial dataset. In addition, these feat… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

  15. arXiv:1806.03889  [pdf, other

    q-bio.NC cond-mat.stat-mech cs.IT nlin.AO nlin.CD

    Fractal and Multifractal Properties of Electrographic Recordings of Human Brain Activity: Toward Its Use as a Signal Feature for Machine Learning in Clinical Applications

    Authors: Lucas G. S. França, José G. V. Miranda, Marco Leite, Niraj K. Sharma, Matthew C. Walker, Louis Lemieux, Yujiang Wang

    Abstract: The brain is a system operating on multiple time scales, and characterisation of dynamics across time scales remains a challenge. One framework to study such dynamics is that of fractal geometry. However, currently there exists no established method for the study of brain dynamics using fractal geometry, due to the many challenges in the conceptual and technical understanding of the methods. We ai… ▽ More

    Submitted 11 December, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

    Comments: Final version published at Frontiers in Physiology. https://doi.org/10.3389/fphys.2018.01767

    Journal ref: França, LGS et al., (2018) Fractal and Multifractal Properties of Electrographic Recordings of Human Brain Activity: Toward Its Use as a Signal Feature for Machine Learning in Clinical Applications. Front. Physiol. 9:1767

  16. arXiv:1602.04513  [pdf

    q-bio.QM cs.CV physics.med-ph

    Validity and reliability of free software for bidimensional gait analysis

    Authors: Ana Paula Quixadá, Andrea Naomi Onodera, Norberto Peña, José Garcia Vivas Miranda, Katia Nunes Sá

    Abstract: Despite the evaluation systems of human movement that have been advancing in recent decades, their use are not feasible for clinical practice because it has a high cost and scarcity of trained operators to interpret their results. An ideal videogrammetry system should be easy to use, low cost, with minimal equipment, and fast realization. The CvMob is a free tool for dynamic evaluation of human mo… ▽ More

    Submitted 14 February, 2016; originally announced February 2016.

  17. arXiv:1503.08109  [pdf

    cs.IT math.NT

    Spread-Spectrum Based on Finite Field Fourier Transforms

    Authors: H. M. de Oliveira, J. P. C. L. Miranda, R. M. Campello de Souza

    Abstract: Spread-spectrum systems are presented, which are based on Finite Field Fourier Transforms. Orthogonal spreading sequences defined over a finite field are derived. New digital multiplex schemes based on such spread-spectrum systems are also introduced, which are multilevel Coding Division Multiplex. These schemes termed Galois-field Division Multiplex (GDM) offer compact bandwidth requirements beca… ▽ More

    Submitted 12 February, 2015; originally announced March 2015.

    Comments: 6 pages, 7 figures. Int. Conf. on System Engineering, Comm. and. Info. Technol., Punta Arenas, Chile, 2001

  18. arXiv:1503.02192  [pdf, other

    cs.IT

    Uplink Performance Evaluation of Massive MU-MIMO Systems

    Authors: Felipe A. P. de Figueiredo, Joao Paulo Miranda, Fabricio L. Figueiredo, Fabbryccio A. C. M. Cardoso

    Abstract: The present paper deals with an OFDM-based uplink within a multi-user MIMO (MU-MIMO) system where a massive MIMO approach is employed. In this context, the linear detectors Minimum Mean-Squared Error (MMSE), Zero Forcing (ZF) and Maximum Ratio Combining (MRC) are considered and assessed. This papers includes Bit Error Rate (BER) results for uncoded QPSK/OFDM transmissions through a flat Rayleigh f… ▽ More

    Submitted 7 March, 2015; originally announced March 2015.

  19. arXiv:1502.03698  [pdf

    cs.IT cs.DM

    On Galois-Division Multiple Access Systems: Figures of Merit and Performance Evaluation

    Authors: J. P. C. L. Miranda, H. M. de Oliveira

    Abstract: A new approach to multiple access based on finite field transforms is investigated. These schemes, termed Galois-Division Multiple Access (GDMA), offer compact bandwidth requirements. A new digital transform, the Finite Field Hartley Transform (FFHT) requires to deal with fields of characteristic p, p \neq 2. A binary-to-p-ary (p \neq 2) map** based on the opportunistic secondary channel is intr… ▽ More

    Submitted 12 February, 2015; originally announced February 2015.

    Comments: 6 pages, 4 figures. In: XIX Simposio Brasileiro de Telecomunicacoes, 2001, Fortaleza, CE, Brazil

  20. arXiv:1501.00305  [pdf, ps, other

    cs.IT

    Massive MIMO and Waveform Design for 5th Generation Wireless Communication Systems

    Authors: Arman Farhang, Nicola Marchetti, Fabricio Figueiredo, Joao Paulo Miranda

    Abstract: This article reviews existing related work and identifies the main challenges in the key 5G area at the intersection of waveform design and large-scale multiple antenna systems, also known as Massive MIMO. The property of self-equalization is introduced for Filter Bank Multicarrier (FBMC)-based Massive MIMO, which can reduce the number of subcarriers required by the system. It is also shown that t… ▽ More

    Submitted 23 September, 2016; v1 submitted 1 January, 2015; originally announced January 2015.

    Comments: 6 pages, 2 figures, 1st International Conference on 5G for Ubiquitous Connectivity

  21. arXiv:1307.0155  [pdf

    physics.ins-det cs.GR physics.pop-ph

    Free Instrument for Movement Measure

    Authors: Norberto Peña, Bruno Cecílio Credidio, Lorena Peixoto Nogueira Rodriguez Martinez Salles Corrêa, Lucas Gabriel Souza França, Marcelo do Vale Cunha, Marcos Cavalcanti de Sousa, João Paulo Bomfim Cruz Vieira, José Garcia Vivas Miranda

    Abstract: This paper presents the validation of a computational tool that serves to obtain continuous measurements of moving objects. The software uses techniques of computer vision, pattern recognition and optical flow, to enable tracking of objects in videos, generating data trajectory, velocity, acceleration and angular movement. The program was applied to track a ball around a simple pendulum. The metho… ▽ More

    Submitted 29 June, 2013; originally announced July 2013.

    Comments: Accepted for publication at the RBEF - Revista Brasileira de Ensino de Física

  22. arXiv:1306.2537  [pdf, ps, other

    physics.soc-ph cs.SI nlin.AO

    Analysis of communities in a mythological social network

    Authors: Pedro J. Miranda, Murilo S. Baptista, Sandro E. de S. Pinto

    Abstract: The intriguing nature of classical Homeric narratives has always fascinated the occidental culture contributing to philosophy, history, mythology and straight forwardly to literature. However what would be so intriguing about Homer's narratives' At a first gaze we shall recognize the very literal appeal and aesthetic pleasure presented on every page across Homer's chants in Odyssey and rhapsodies… ▽ More

    Submitted 19 June, 2013; v1 submitted 11 June, 2013; originally announced June 2013.