-
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Authors:
Holy Lovenia,
Rahmad Mahendra,
Salsabil Maulana Akbar,
Lester James V. Miranda,
Jennifer Santoso,
Elyanah Aco,
Akhdan Fadhilah,
Jonibek Mansurov,
Joseph Marvin Imperial,
Onno P. Kampman,
Joel Ruben Antony Moniz,
Muhammad Ravi Shulthan Habibi,
Frederikus Hudi,
Railey Montalan,
Ryan Ignatius,
Joanito Agili Lopo,
William Nixon,
Börje F. Karlsson,
James Jaya,
Ryandito Diandaru,
Yuze Gao,
Patrick Amadeus,
Bin Wang,
Jan Christian Blaise Cruz,
Chenxi Whitehouse
, et al. (36 additional authors not shown)
Abstract:
Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due t…
▽ More
Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due to the scarcity of high-quality datasets, compounded by the dominance of English training data, raising concerns about potential cultural misrepresentation. To address these challenges, we introduce SEACrowd, a collaborative initiative that consolidates a comprehensive resource hub that fills the resource gap by providing standardized corpora in nearly 1,000 SEA languages across three modalities. Through our SEACrowd benchmarks, we assess the quality of AI models on 36 indigenous languages across 13 tasks, offering valuable insights into the current AI landscape in SEA. Furthermore, we propose strategies to facilitate greater AI advancements, maximizing potential utility and resource equity for the future of AI in SEA.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Authors:
David Romero,
Chenyang Lyu,
Haryo Akbarianto Wibowo,
Teresa Lynn,
Injy Hamed,
Aditya Nanda Kishore,
Aishik Mandal,
Alina Dragonetti,
Artem Abzaliev,
Atnafu Lambebo Tonja,
Bontu Fufa Balcha,
Chenxi Whitehouse,
Christian Salamea,
Dan John Velasco,
David Ifeoluwa Adelani,
David Le Meur,
Emilio Villa-Cueva,
Fajri Koto,
Fauzan Farooqui,
Frederico Belcavello,
Ganzorig Batnasan,
Gisela Vallejo,
Grainne Caulfield,
Guido Ivetta,
Haiyue Song
, et al. (50 additional authors not shown)
Abstract:
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen…
▽ More
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recent efforts have tried to increase the number of languages covered on VQA datasets, they still lack diversity in low-resource languages. More importantly, although these datasets often extend their linguistic range via translation or some other approaches, they usually keep images the same, resulting in narrow cultural representation. To address these limitations, we construct CVQA, a new Culturally-diverse multilingual Visual Question Answering benchmark, designed to cover a rich set of languages and cultures, where we engage native speakers and cultural experts in the data collection process. As a result, CVQA includes culturally-driven images and questions from across 28 countries on four continents, covering 26 languages with 11 scripts, providing a total of 9k questions. We then benchmark several Multimodal Large Language Models (MLLMs) on CVQA, and show that the dataset is challenging for the current state-of-the-art models. This benchmark can serve as a probing evaluation suite for assessing the cultural capability and bias of multimodal models and hopefully encourage more research efforts toward increasing cultural awareness and linguistic diversity in this field.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Learning Difference Equations with Structured Grammatical Evolution for Postprandial Glycaemia Prediction
Authors:
Daniel Parra,
David Joedicke,
J. Manuel Velasco,
Gabriel Kronberger,
J. Ignacio Hidalgo
Abstract:
People with diabetes must carefully monitor their blood glucose levels, especially after eating. Blood glucose regulation requires a proper combination of food intake and insulin boluses. Glucose prediction is vital to avoid dangerous post-meal complications in treating individuals with diabetes. Although traditional methods, such as artificial neural networks, have shown high accuracy rates, some…
▽ More
People with diabetes must carefully monitor their blood glucose levels, especially after eating. Blood glucose regulation requires a proper combination of food intake and insulin boluses. Glucose prediction is vital to avoid dangerous post-meal complications in treating individuals with diabetes. Although traditional methods, such as artificial neural networks, have shown high accuracy rates, sometimes they are not suitable for develo** personalised treatments by physicians due to their lack of interpretability. In this study, we propose a novel glucose prediction method emphasising interpretability: Interpretable Sparse Identification by Grammatical Evolution. Combined with a previous clustering stage, our approach provides finite difference equations to predict postprandial glucose levels up to two hours after meals. We divide the dataset into four-hour segments and perform clustering based on blood glucose values for the twohour window before the meal. Prediction models are trained for each cluster for the two-hour windows after meals, allowing predictions in 15-minute steps, yielding up to eight predictions at different time horizons. Prediction safety was evaluated based on Parkes Error Grid regions. Our technique produces safe predictions through explainable expressions, avoiding zones D (0.2% average) and E (0%) and reducing predictions on zone C (6.2%). In addition, our proposal has slightly better accuracy than other techniques, including sparse identification of non-linear dynamics and artificial neural networks. The results demonstrate that our proposal provides interpretable solutions without sacrificing prediction accuracy, offering a promising approach to glucose prediction in diabetes management that balances accuracy, interpretability, and computational efficiency.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
Patterns Detection in Glucose Time Series by Domain Transformations and Deep Learning
Authors:
J. Alvarado,
J. Manuel Velasco,
F. Chávez,
J. Ignacio Hidalgo,
F. Fernández de Vega
Abstract:
People with diabetes have to manage their blood glucose level to keep it within an appropriate range. Predicting whether future glucose values will be outside the healthy threshold is of vital importance in order to take corrective actions to avoid potential health damage. In this paper we describe our research with the aim of predicting the future behavior of blood glucose levels, so that hypogly…
▽ More
People with diabetes have to manage their blood glucose level to keep it within an appropriate range. Predicting whether future glucose values will be outside the healthy threshold is of vital importance in order to take corrective actions to avoid potential health damage. In this paper we describe our research with the aim of predicting the future behavior of blood glucose levels, so that hypoglycemic events may be anticipated. The approach of this work is the application of transformation functions on glucose time series, and their use in convolutional neural networks. We have tested our proposed method using real data from 4 different diabetes patients with promising results.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings
Authors:
Dan John Velasco,
Axel Alba,
Trisha Gail Pelagio,
Bryce Anthony Ramirez,
Unisse Chua,
Briane Paul Samson,
Jan Christian Blaise Cruz,
Charibeth Cheng
Abstract:
Wordnets are indispensable tools for various natural language processing applications. Unfortunately, wordnets get outdated, and producing or updating wordnets can be slow and costly in terms of time and resources. This problem intensifies for low-resource languages. This study proposes a method for word sense induction and synset induction using only two linguistic resources, namely, an unlabeled…
▽ More
Wordnets are indispensable tools for various natural language processing applications. Unfortunately, wordnets get outdated, and producing or updating wordnets can be slow and costly in terms of time and resources. This problem intensifies for low-resource languages. This study proposes a method for word sense induction and synset induction using only two linguistic resources, namely, an unlabeled corpus and a sentence embeddings-based language model. The resulting sense inventory and synonym sets can be used in automatically creating a wordnet. We applied this method on a corpus of Filipino text. The sense inventory and synsets were evaluated by matching them with the sense inventory of the machine translated Princeton WordNet, as well as comparing the synsets to the Filipino WordNet. This study empirically shows that the 30% of the induced word senses are valid and 40% of the induced synsets are valid in which 20% are novel synsets.
△ Less
Submitted 19 October, 2023; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets
Authors:
Jan Christian Blaise Cruz,
Jose Kristian Resabal,
James Lin,
Dan John Velasco,
Charibeth Cheng
Abstract:
Transformers represent the state-of-the-art in Natural Language Processing (NLP) in recent years, proving effective even in tasks done in low-resource languages. While pretrained transformers for these languages can be made, it is challenging to measure their true performance and capacity due to the lack of hard benchmark datasets, as well as the difficulty and cost of producing them. In this pape…
▽ More
Transformers represent the state-of-the-art in Natural Language Processing (NLP) in recent years, proving effective even in tasks done in low-resource languages. While pretrained transformers for these languages can be made, it is challenging to measure their true performance and capacity due to the lack of hard benchmark datasets, as well as the difficulty and cost of producing them. In this paper, we present three contributions: First, we propose a methodology for automatically producing Natural Language Inference (NLI) benchmark datasets for low-resource languages using published news articles. Through this, we create and release NewsPH-NLI, the first sentence entailment benchmark dataset in the low-resource Filipino language. Second, we produce new pretrained transformers based on the ELECTRA technique to further alleviate the resource scarcity in Filipino, benchmarking them on our dataset against other commonly-used transfer learning techniques. Lastly, we perform analyses on transfer learning techniques to shed light on their true performance when operating in low-data domains through the use of degradation tests.
△ Less
Submitted 13 August, 2021; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Pagsusuri ng RNN-based Transfer Learning Technique sa Low-Resource Language
Authors:
Dan John Velasco
Abstract:
Low-resource languages such as Filipino suffer from data scarcity which makes it challenging to develop NLP applications for Filipino language. The use of Transfer Learning (TL) techniques alleviates this problem in low-resource setting. In recent years, transformer-based models are proven to be effective in low-resource tasks but faces challenges in accessibility due to its high compute and memor…
▽ More
Low-resource languages such as Filipino suffer from data scarcity which makes it challenging to develop NLP applications for Filipino language. The use of Transfer Learning (TL) techniques alleviates this problem in low-resource setting. In recent years, transformer-based models are proven to be effective in low-resource tasks but faces challenges in accessibility due to its high compute and memory requirements. For this reason, there's a need for a cheaper but effective alternative. This paper has three contributions. First, release a pre-trained AWD-LSTM language model for Filipino language. Second, benchmark AWD-LSTM in the Hate Speech classification task and show that it performs on par with transformer-based models. Third, analyze the the performance of AWD-LSTM in low-resource setting using degradation test and compare it with transformer-based models.
-----
Ang mga low-resource languages tulad ng Filipino ay gipit sa accessible na datos kaya't mahirap gumawa ng mga applications sa wikang ito. Ang mga Transfer Learning (TL) techniques ay malaking tulong para sa low-resource setting o mga pagkakataong gipit sa datos. Sa mga nagdaang taon, nanaig ang mga transformer-based TL techniques pagdating sa low-resource tasks ngunit ito ay mataas na compute and memory requirements kaya nangangailangan ng mas mura pero epektibong alternatibo. Ang papel na ito ay may tatlong kontribusyon. Una, maglabas ng pre-trained AWD-LSTM language model sa wikang Filipino upang maging tuntungan sa pagbuo ng mga NLP applications sa wikang Filipino. Pangalawa, mag benchmark ng AWD-LSTM sa Hate Speech classification task at ipakita na kayang nitong makipagsabayan sa mga transformer-based models. Pangatlo, suriin ang performance ng AWD-LSTM sa low-resource setting gamit ang degradation test at ikumpara ito sa mga transformer-based models.
△ Less
Submitted 14 October, 2020; v1 submitted 13 October, 2020;
originally announced October 2020.
-
Internet of things-based (IoT) inventory monitoring refrigerator using arduino sensor network
Authors:
Jessica Velasco,
Leandro Alberto,
Henrick Dave Ambatali,
Marlon Canilang,
Vincent Daria,
Jerome Bryan Liwanag,
Gilfred Allen Madrigal
Abstract:
This study presents a system that combines a conventional refrigerator, microcontrollers and a smart phone to create an inventory monitoring that can monitor the stocks inside the refrigerator wirelessly by accessing an Android application. The developed refrigerator uses a sensor network system that is installed in a respective compartment inside the refrigerator. Each sensor will transmit data t…
▽ More
This study presents a system that combines a conventional refrigerator, microcontrollers and a smart phone to create an inventory monitoring that can monitor the stocks inside the refrigerator wirelessly by accessing an Android application. The developed refrigerator uses a sensor network system that is installed in a respective compartment inside the refrigerator. Each sensor will transmit data to the microcontrollers, such as Arduino Yun and Arduino Uno, which are interconnected by the I2C communications. All data and images will be processed to provide the user an Internet of Things application through the cloud-based website Temboo. Temboo will have access to send data to the Dropbox. A smartphone is connected to the Dropbox where all the data and images are stored. The user can monitor the stocks or contents of the refrigerator wirelessly using an Android Application.
△ Less
Submitted 25 November, 2019;
originally announced November 2019.
-
A Smartphone-Based Skin Disease Classification Using MobileNet CNN
Authors:
Jessica Velasco,
Cherry Pascion,
Jean Wilmar Alberio,
Jonathan Apuang,
John Stephen Cruz,
Mark Angelo Gomez,
Benjamin Jr. Molina,
Lyndon Tuala,
August Thio-ac,
Romeo Jr. Jorda
Abstract:
The MobileNet model was used by applying transfer learning on the 7 skin diseases to create a skin disease classification system on Android application. The proponents gathered a total of 3,406 images and it is considered as imbalanced dataset because of the unequal number of images on its classes. Using different sampling method and preprocessing of input data was explored to further improved the…
▽ More
The MobileNet model was used by applying transfer learning on the 7 skin diseases to create a skin disease classification system on Android application. The proponents gathered a total of 3,406 images and it is considered as imbalanced dataset because of the unequal number of images on its classes. Using different sampling method and preprocessing of input data was explored to further improved the accuracy of the MobileNet. Using under-sampling method and the default preprocessing of input data achieved an 84.28% accuracy. While, using imbalanced dataset and default preprocessing of input data achieved a 93.6% accuracy. Then, researchers explored oversampling the dataset and the model attained a 91.8% accuracy. Lastly, by using oversampling technique and data augmentation on preprocessing the input data provide a 94.4% accuracy and this model was deployed on the developed Android application.
△ Less
Submitted 13 November, 2019;
originally announced November 2019.
-
Blockchain-based System Evaluation: The Effectiveness of Blockchain on E-Procurements
Authors:
August Thio-ac,
Alfred Keanu Serut,
Rayn Louise Torrejos,
Keenan Dave Rivo,
Jessica Velasco
Abstract:
Electronic systems tend to simplify the tedious traditional scheme and basically focuses on the platform design and process organization. The integrity of the output of an automated system is not left behind but the possibility of internal manipulation is still high. This paper presents the current issues in company procurements and the solution in the form of blockchain technology. Several indivi…
▽ More
Electronic systems tend to simplify the tedious traditional scheme and basically focuses on the platform design and process organization. The integrity of the output of an automated system is not left behind but the possibility of internal manipulation is still high. This paper presents the current issues in company procurements and the solution in the form of blockchain technology. Several individuals and professionals were asked to evaluate a blockchain-based procurement system in comparison to the current electronic (e-procurement) system. A blockchain-based system has the capability to hold transactional data with complete decentralization and eliminate the growing number of fraud cases in companies and organizations. This paper mainly focuses on the effectiveness of a blockchain-based system in company procurements.
△ Less
Submitted 13 November, 2019;
originally announced November 2019.
-
Development of a Secure and Private Electronic Procurement System based on Blockchain Implementation
Authors:
August Thio-ac,
Erwin John Domingo,
Ricca May Reyes,
Nilo Arago,
Romeo Jr. Jorda,
Jessica Velasco
Abstract:
This paper presents the development of an online procurement system and the integration of blockchain technology. Various tools such as PHP, JavaScript, HTML, CSS, and jQuery were used in designing the graphical, programming logic, and blockchain aspect of the system. Every page and function will have their respective construction and result. In addition, the proposed system's flow of process and…
▽ More
This paper presents the development of an online procurement system and the integration of blockchain technology. Various tools such as PHP, JavaScript, HTML, CSS, and jQuery were used in designing the graphical, programming logic, and blockchain aspect of the system. Every page and function will have their respective construction and result. In addition, the proposed system's flow of process and the methods on the testing and hosting of the site as well as the different web development languages used in every part of the development and design process were presented. The proposed system was successfully and functionally developed starting from the execution of procurement proper, to the placement of procured items or goods, and up to the signing of contracts by the winner and the procurer. Lastly, features were added such as user profiles of the bidder and procurer.
△ Less
Submitted 13 November, 2019;
originally announced November 2019.
-
Automated Smart Wick System-Based Microfarm Using Internet of Things
Authors:
R. Jorda, Jr.,
C. Alcabasa,
A. Buhay,
E. C. Dela Cruz,
J. P. Mendoza,
A. Tolentino,
L. K. Tolentino,
E. Fernandez,
A. Thio-ac,
J. Velasco,
N. Arago
Abstract:
This paper presents a study conducted to allow urban farmers to remotely monitor their farm through the design and development of an Internet of Things-based (IoT) microfarm prototype which utilized wick system as planting method. The system involves the detection of three environmental parameters namely, light intensity, soil moisture and temperature through the use of respective sensors which we…
▽ More
This paper presents a study conducted to allow urban farmers to remotely monitor their farm through the design and development of an Internet of Things-based (IoT) microfarm prototype which utilized wick system as planting method. The system involves the detection of three environmental parameters namely, light intensity, soil moisture and temperature through the use of respective sensors which were connected to the Arduino microcontroller, the sensor node of the system. Irregularities in the aforementioned parameters were neutralized through the use of parameter regulators such as LED growlight strips, water pump and air cooler. The data collected by these sensors were gathered by the Arduino microcontroller and were sent to the Web database through the IoT gateway which was the Raspberry Pi computer chip. These data were also sent to an Android unit installed with the Microfarm Companion application which was capable of monitoring and controlling the environmental parameters observed in the microfarm. The application allows the user to view the current value of the parameter involved and to choose whether to control the parameter regulators automatically or manually. The microfarm system runs autonomously which reduces the labor required to produce healthy plants and crops. Mustard greens samples were used in testing the system. After a month of monitoring the height of the samples, it was observed that the average height of the samples is about 0.23 cm taller than the standard height. The proponents has also tested the system functionality by evaluating the sensor data log that provides the values gathered by the sensors and the turn-on times of the parameter regulators. From these data, it can be observed that whenever the values obtained by the sensors fall outside the threshold range, the parameter regulators turns on, indicating that the system is working properly.
△ Less
Submitted 30 October, 2019;
originally announced November 2019.
-
TDOA Matrices: Algebraic Properties and their Application to Robust Denoising with Missing Data
Authors:
Jose Velasco,
Daniel Pizarro,
Javier Macias-Guarasa,
Afsaneh Asaei
Abstract:
Measuring the Time delay of Arrival (TDOA) between a set of sensors is the basic setup for many applications, such as localization or signal beamforming. This paper presents the set of TDOA matrices, which are built from noise-free TDOA measurements, not requiring knowledge of the sensor array geometry. We prove that TDOA matrices are rank-two and have a special SVD decomposition that leads to a c…
▽ More
Measuring the Time delay of Arrival (TDOA) between a set of sensors is the basic setup for many applications, such as localization or signal beamforming. This paper presents the set of TDOA matrices, which are built from noise-free TDOA measurements, not requiring knowledge of the sensor array geometry. We prove that TDOA matrices are rank-two and have a special SVD decomposition that leads to a compact linear parametric representation. Properties of TDOA matrices are applied in this paper to perform denoising, by finding the TDOA matrix closest to the matrix composed with noisy measurements. The paper shows that this problem admits a closed-form solution for TDOA measurements contaminated with Gaussian noise which extends to the case of having missing data. The paper also proposes a novel robust denoising method resistant to outliers, missing data and inspired in recent advances in robust low-rank estimation. Experiments in synthetic and real datasets show TDOA-based localization, both in terms of TDOA accuracy estimation and localization error.
△ Less
Submitted 24 May, 2016; v1 submitted 18 January, 2016;
originally announced January 2016.
-
Measuring Verifiability in Online Information
Authors:
Reed H. Harder,
Alfredo J. Velasco,
Michael S. Evans,
Daniel N. Rockmore
Abstract:
The verifiability of online information is important, but difficult to assess systematically. We examine verifiability in the case of Wikipedia, one of the world's largest and most consulted online information sources. We extend prior work about quality of Wikipedia articles, knowledge production, and sources to consider the quality of Wikipedia references. We propose a multidimensional measure of…
▽ More
The verifiability of online information is important, but difficult to assess systematically. We examine verifiability in the case of Wikipedia, one of the world's largest and most consulted online information sources. We extend prior work about quality of Wikipedia articles, knowledge production, and sources to consider the quality of Wikipedia references. We propose a multidimensional measure of verifiability that takes into account technical accuracy and practical accessibility of sources. We calculate article verifiability scores for a sample of 5,000 articles and 295,800 citations, and compare differently weighted models to illustrate effects of emphasizing particular elements of verifiability over others. We find that, while the quality of references in the overall sample is reasonably high, verifiability varies significantly by article, particularly when emphasizing the use of standard digital identifiers and taking into account the practical availability of referenced sources. We discuss the implications of these findings for measuring verifiability in online information more generally.
△ Less
Submitted 16 November, 2015; v1 submitted 18 September, 2015;
originally announced September 2015.
-
Well-posedness of a nonlinear integro-differential problem and its rearranged formulation
Authors:
Gonzalo Galiano,
Emanuele Schiavi,
Julián Velasco
Abstract:
We study the existence and uniqueness of solutions of a nonlinear integro-differential problem which we reformulate introducing the notion of the decreasing rearrangement of the solution. A dimensional reduction of the problem is obtained and a detailed analysis of the properties of the solutions of the model is provided. Finally, a fast numerical method is devised and implemented to show the perf…
▽ More
We study the existence and uniqueness of solutions of a nonlinear integro-differential problem which we reformulate introducing the notion of the decreasing rearrangement of the solution. A dimensional reduction of the problem is obtained and a detailed analysis of the properties of the solutions of the model is provided. Finally, a fast numerical method is devised and implemented to show the performance of the model when typical image processing tasks such as filtering and segmentation are performed.
△ Less
Submitted 8 April, 2016; v1 submitted 7 June, 2015;
originally announced June 2015.
-
On a fast bilateral filtering formulation using functional rearrangements
Authors:
Gonzalo Galiano,
Julián Velasco
Abstract:
We introduce an exact reformulation of a broad class of neighborhood filters, among which the bilateral filters, in terms of two functional rearrangements: the decreasing and the relative rearrangements.
Independently of the image spatial dimension (one-dimensional signal, image, volume of images, etc.), we reformulate these filters as integral operators defined in a one-dimensional space corres…
▽ More
We introduce an exact reformulation of a broad class of neighborhood filters, among which the bilateral filters, in terms of two functional rearrangements: the decreasing and the relative rearrangements.
Independently of the image spatial dimension (one-dimensional signal, image, volume of images, etc.), we reformulate these filters as integral operators defined in a one-dimensional space corresponding to the level sets measures.
We prove the equivalence between the usual pixel-based version and the rearranged version of the filter. When restricted to the discrete setting, our reformulation of bilateral filters extends previous results for the so-called fast bilateral filtering. We, in addition, prove that the solution of the discrete setting, understood as constant-wise interpolators, converges to the solution of the continuous setting.
Finally, we numerically illustrate computational aspects concerning quality approximation and execution time provided by the rearranged formulation.
△ Less
Submitted 3 May, 2015;
originally announced May 2015.
-
On a new formulation of nonlocal image filters involving the relative rearrangement
Authors:
Gonzalo Galiano,
Julián Velasco
Abstract:
Nonlocal filters are simple and powerful techniques for image denoising. In this paper we study the reformulation of a broad class of nonlocal filters in terms of two functional rearrangements: the decreasing and the relative rearrangements.
Independently of the dimension of the image, we reformulate these filters as integral operators defined in a one-dimensional space corresponding to the leve…
▽ More
Nonlocal filters are simple and powerful techniques for image denoising. In this paper we study the reformulation of a broad class of nonlocal filters in terms of two functional rearrangements: the decreasing and the relative rearrangements.
Independently of the dimension of the image, we reformulate these filters as integral operators defined in a one-dimensional space corresponding to the level sets measures.
We prove the equivalency between the original and the rearranged versions of the filters and propose a discretization in terms of constant-wise interpolators, which we prove to be convergent to the solution of the continuous setting.
For some particular cases, this new formulation allows us to perform a detailed analysis of the filtering properties. Among others, we prove that the filtered image is a contrast change of the original image, and that the filtering procedure behaves asymptotically as a shock filter combined with a border diffusive term, responsible for the staircaising effect and the loss of contrast.
△ Less
Submitted 27 June, 2014;
originally announced June 2014.
-
On a non-local spectrogram for denoising one-dimensional signals
Authors:
Gonzalo Galiano,
Julián Velasco
Abstract:
In previous works, we investigated the use of local filters based on partial differential equations (PDE) to denoise one-dimensional signals through the image processing of time-frequency representations, such as the spectrogram. In this image denoising algorithms, the particularity of the image was hardly taken into account. We turn, in this paper, to study the performance of non-local filters, l…
▽ More
In previous works, we investigated the use of local filters based on partial differential equations (PDE) to denoise one-dimensional signals through the image processing of time-frequency representations, such as the spectrogram. In this image denoising algorithms, the particularity of the image was hardly taken into account. We turn, in this paper, to study the performance of non-local filters, like Neighborhood or Yaroslavsky filters, in the same problem. We show that, for certain iterative schemes involving the Neighborhood filter, the computational time is drastically reduced with respect to Yaroslavsky or nonlinear PDE based filters, while the outputs of the filtering processes are similar. This is heuristically justified by the connection between the (fast) Neighborhood filter applied to a spectrogram and the corresponding Nonlocal Means filter (accurate) applied to the Wigner-Ville distribution of the signal. This correspondence holds only for time-frequency representations of one-dimensional signals, not to usual images, and in this sense the particularity of the image is exploited. We compare though a series of experiments on synthetic and biomedical signals the performance of local and non-local filters.
△ Less
Submitted 13 November, 2013;
originally announced November 2013.
-
Neighborhood filters and the decreasing rearrangement
Authors:
Gonzalo Galiano,
Julián Velasco
Abstract:
Nonlocal filters are simple and powerful techniques for image denoising. In this paper, we give new insights into the analysis of one kind of them, the Neighborhood filter, by using a classical although not very common transformation: the decreasing rearrangement of a function (the image). Independently of the dimension of the image, we reformulate the Neighborhood filter and its iterative variant…
▽ More
Nonlocal filters are simple and powerful techniques for image denoising. In this paper, we give new insights into the analysis of one kind of them, the Neighborhood filter, by using a classical although not very common transformation: the decreasing rearrangement of a function (the image). Independently of the dimension of the image, we reformulate the Neighborhood filter and its iterative variants as an integral operator defined in a one-dimensional space. The simplicity of this formulation allows to perform a detailed analysis of its properties. Among others, we prove that the filter behaves asymptotically as a shock filter combined with a border diffusive term, responsible for the staircaising effect and the loss of contrast.
△ Less
Submitted 27 June, 2014; v1 submitted 9 November, 2013;
originally announced November 2013.
-
An estimation of distribution algorithm with adaptive Gibbs sampling for unconstrained global optimization
Authors:
Jonás Velasco,
Mario A. Saucedo-Espinosa,
Hugo Jair Escalante,
Karlo Mendoza,
César Emilio Villarreal-Rodríguez,
Óscar L. Chacón-Mondragón,
Adrián Rodríguez,
Arturo Berrones
Abstract:
In this paper is proposed a new heuristic approach belonging to the field of evolutionary Estimation of Distribution Algorithms (EDAs). EDAs builds a probability model and a set of solutions is sampled from the model which characterizes the distribution of such solutions. The main framework of the proposed method is an estimation of distribution algorithm, in which an adaptive Gibbs sampling is us…
▽ More
In this paper is proposed a new heuristic approach belonging to the field of evolutionary Estimation of Distribution Algorithms (EDAs). EDAs builds a probability model and a set of solutions is sampled from the model which characterizes the distribution of such solutions. The main framework of the proposed method is an estimation of distribution algorithm, in which an adaptive Gibbs sampling is used to generate new promising solutions and, in combination with a local search strategy, it improves the individual solutions produced in each iteration. The Estimation of Distribution Algorithm with Adaptive Gibbs Sampling we are proposing in this paper is called AGEDA. We experimentally evaluate and compare this algorithm against two deterministic procedures and several stochastic methods in three well known test problems for unconstrained global optimization. It is empirically shown that our heuristic is robust in problems that involve three central aspects that mainly determine the difficulty of global optimization problems, namely high-dimensionality, multi-modality and non-smoothness.
△ Less
Submitted 29 May, 2013; v1 submitted 11 July, 2011;
originally announced July 2011.
-
JANUS: an FPGA-based System for High Performance Scientific Computing
Authors:
F. Belletti,
M. Cotallo,
A. Cruz,
L. A. Fernández,
A. Gordillo,
M. Guidetti,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martín-Mayor,
A. Muñoz-Sudupe,
D. Navarro,
G. Parisi,
S. Pérez-Gaviro,
M. Rossi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
D. Sciretti,
A. Tarancón,
R. Tripiccione,
J. L. Velasco
Abstract:
This paper describes JANUS, a modular massively parallel and reconfigurable FPGA-based computing system. Each JANUS module has a computational core and a host. The computational core is a 4x4 array of FPGA-based processing elements with nearest-neighbor data links. Processors are also directly connected to an I/O node attached to the JANUS host, a conventional PC. JANUS is tailored for, but not…
▽ More
This paper describes JANUS, a modular massively parallel and reconfigurable FPGA-based computing system. Each JANUS module has a computational core and a host. The computational core is a 4x4 array of FPGA-based processing elements with nearest-neighbor data links. Processors are also directly connected to an I/O node attached to the JANUS host, a conventional PC. JANUS is tailored for, but not limited to, the requirements of a class of hard scientific applications characterized by regular code structure, unconventional data manipulation instructions and not too large data-base size. We discuss the architecture of this configurable machine, and focus on its use on Monte Carlo simulations of statistical mechanics. On this class of application JANUS achieves impressive performances: in some cases one JANUS processing element outperfoms high-end PCs by a factor ~ 1000. We also discuss the role of JANUS on other classes of scientific applications.
△ Less
Submitted 8 April, 2008; v1 submitted 18 October, 2007;
originally announced October 2007.
-
Simulating spin systems on IANUS, an FPGA-based computer
Authors:
F. Belletti,
M. Cotallo,
A. Cruz,
L. A. Fernández,
A. Gordillo,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martín-Mayor,
A. Muñoz-Sudupe,
D. Navarro,
S. Pérez-Gaviro,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
D. Sciretti,
A. Tarancón,
R. Tripiccione,
J. L. Velasco
Abstract:
We describe the hardwired implementation of algorithms for Monte Carlo simulations of a large class of spin models. We have implemented these algorithms as VHDL codes and we have mapped them onto a dedicated processor based on a large FPGA device. The measured performance on one such processor is comparable to O(100) carefully programmed high-end PCs: it turns out to be even better for some sele…
▽ More
We describe the hardwired implementation of algorithms for Monte Carlo simulations of a large class of spin models. We have implemented these algorithms as VHDL codes and we have mapped them onto a dedicated processor based on a large FPGA device. The measured performance on one such processor is comparable to O(100) carefully programmed high-end PCs: it turns out to be even better for some selected spin models. We describe here codes that we are currently executing on the IANUS massively parallel FPGA-based system.
△ Less
Submitted 26 April, 2007;
originally announced April 2007.