Skip to main content

Showing 1–43 of 43 results for author: Cruz, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10118  [pdf, other

    cs.CL

    SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

    Authors: Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse , et al. (36 additional authors not shown)

    Abstract: Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due t… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: https://github.com/SEACrowd

  2. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2404.02565  [pdf, other

    cs.HC

    Spatial Summation of Localized Pressure for Haptic Sensory Prostheses

    Authors: Sreela Kodali, Cihualpilli Camino Cruz, Thomas C. Bulea, Kevin S. Rao Diana Bharucha-Goebel, Alexander T. Chesler, Carsten G. Bonnemann, Allison M. Okamura

    Abstract: A host of medical conditions, including amputations, diabetes, stroke, and genetic disease, result in loss of touch sensation. Because most types of sensory loss have no pharmacological treatment or rehabilitative therapy, we propose a haptic sensory prosthesis that provides substitutive feedback. The wrist and forearm are compelling locations for feedback due to available skin area and not occlud… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 2 pages, 2 figures, 2024 IEEE Haptics Symposium Work-in-Progress Paper

  4. arXiv:2403.07769  [pdf

    cs.AI cs.CL cs.CY cs.MA

    Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations

    Authors: Carlos Jose Xavier Cruz

    Abstract: This article explores the dynamic influence of computational entities based on multi-agent systems theory (SMA) combined with large language models (LLM), which are characterized by their ability to simulate complex human interactions, as a possibility to revolutionize human user interaction from the use of specialized artificial agents to support everything from operational organizational process… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  5. arXiv:2401.06161  [pdf

    cs.CY cs.AI

    Trustworthy human-centric based Automated Decision-Making Systems

    Authors: Marcelino Cabrera, Carlos Cruz, Pavel Novoa-Hernández, David A. Pelta, José Luis Verdegay

    Abstract: Automated Decision-Making Systems (ADS) have become pervasive across various fields, activities, and occupations, to enhance performance. However, this widespread adoption introduces potential risks, including the misuse of ADS. Such misuse may manifest when ADS is employed in situations where it is unnecessary or when essential requirements, conditions, and terms are overlooked, leading to uninte… ▽ More

    Submitted 22 December, 2023; originally announced January 2024.

    Comments: 16 pages, 1 Table

  6. arXiv:2310.16322  [pdf, other

    cs.CL

    Samsung R&D Institute Philippines at WMT 2023

    Authors: Jan Christian Blaise Cruz

    Abstract: In this paper, we describe the constrained MT systems submitted by Samsung R&D Institute Philippines to the WMT 2023 General Translation Task for two directions: en$\rightarrow$he and he$\rightarrow$en. Our systems comprise of Transformer-based sequence-to-sequence models that are trained with a mix of best practices: comprehensive data preprocessing pipelines, synthetic backtranslated data, and t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: To appear in Proceedings of the Eighth Conference on Machine Translation 2023 (WMT)

  7. arXiv:2308.05609  [pdf, ps, other

    cs.CL cs.IR cs.PF

    LASIGE and UNICAGE solution to the NASA LitCoin NLP Competition

    Authors: Pedro Ruas, Diana F. Sousa, André Neves, Carlos Cruz, Francisco M. Couto

    Abstract: Biomedical Natural Language Processing (NLP) tends to become cumbersome for most researchers, frequently due to the amount and heterogeneity of text to be processed. To address this challenge, the industry is continuously develo** highly efficient tools and creating more flexible engineering solutions. This work presents the integration between industry data engineering solutions for efficient d… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  8. arXiv:2307.10296  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Automated Semantic Segmentation in Mammography Images

    Authors: Cesar A. Sierra-Franco, Jan Hurtado, Victor de A. Thomaz, Leonardo C. da Cruz, Santiago V. Silva, Alberto B. Raposo

    Abstract: Mammography images are widely used to detect non-palpable breast lesions or nodules, preventing cancer and providing the opportunity to plan interventions when necessary. The identification of some structures of interest is essential to make a diagnosis and evaluate image adequacy. Thus, computer-aided detection systems can be helpful in assisting medical interpretation by automatically segmenting… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 6 pages

  9. arXiv:2307.01548  [pdf, other

    cs.AI

    Knowledge Graph for NLG in the context of conversational agents

    Authors: Hussam Ghanem, Massinissa Atmani, Christophe Cruz

    Abstract: The use of knowledge graphs (KGs) enhances the accuracy and comprehensiveness of the responses provided by a conversational agent. While generating answers during conversations consists in generating text from these KGs, it is still regarded as a challenging task that has gained significant attention in recent years. In this document, we provide a review of different architectures used for knowled… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Journal ref: French Regional Conference on Complex Systems (FRCCS 2023), May 2023, Le Havre, France

  10. Pseudo-Labeling Enhanced by Privileged Information and Its Application to In Situ Sequencing Images

    Authors: Marzieh Haghighi, Mario C. Cruz, Erin Weisbart, Beth A. Cimini, Avtar Singh, Julia Bauman, Maria E. Lozada, Sanam L. Kavari, James T. Neal, Paul C. Blainey, Anne E. Carpenter, Shantanu Singh

    Abstract: Various strategies for label-scarce object detection have been explored by the computer vision research community. These strategies mainly rely on assumptions that are specific to natural images and not directly applicable to the biological and biomedical vision domains. For example, most semi-supervised learning strategies rely on a small set of labeled data as a confident source of ground truth.… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted for publication at IJCAI 2023

    Journal ref: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI), Main Track, Pages 4775-4784, 2023

  11. arXiv:2306.10034  [pdf

    cs.IR cs.LG

    Unlocking Insights into Business Trajectories with Transformer-based Spatio-temporal Data Analysis

    Authors: Muhammad Arslan, Christophe Cruz

    Abstract: The world of business is constantly evolving and staying ahead of the curve requires a deep understanding of market trends and performance. This article addresses this requirement by modeling business trajectories using news articles data.

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Presented in the conference Spatial Analysis and GEOmatics 2023 SAGEO

  12. arXiv:2306.07046  [pdf

    cs.IR cs.LG

    Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces

    Authors: Muhammad Arslan, Christophe Cruz

    Abstract: In this study, we compared the performance of four different methods for multi label text classification using a specific imbalanced business dataset. The four methods we evaluated were fine tuned BERT, Binary Relevance, Classifier Chains, and Label Powerset. The results show that fine tuned BERT outperforms the other three methods by a significant margin, achieving high values of accuracy, F1 Sco… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Journal ref: https://easychair.org/smart-program/FRCCS2023/2023-06-01.html

  13. arXiv:2305.14235  [pdf, other

    cs.CL cs.AI

    Multilingual Large Language Models Are Not (Yet) Code-Switchers

    Authors: Ruochen Zhang, Samuel Cahyawijaya, Jan Christian Blaise Cruz, Genta Indra Winata, Alham Fikri Aji

    Abstract: Multilingual Large Language Models (LLMs) have recently shown great capabilities in a wide range of tasks, exhibiting state-of-the-art performance through zero-shot or few-shot prompting methods. While there have been extensive studies on their abilities in monolingual tasks, the investigation of their potential in the context of code-switching (CSW), the practice of alternating languages within a… ▽ More

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

  14. arXiv:2303.13592  [pdf, other

    cs.CL cs.AI

    Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages

    Authors: Zheng-Xin Yong, Ruochen Zhang, Jessica Zosa Forde, Skyler Wang, Arjun Subramonian, Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Lintang Sutawika, Jan Christian Blaise Cruz, Yin Lin Tan, Long Phan, Rowena Garcia, Thamar Solorio, Alham Fikri Aji

    Abstract: While code-mixing is a common linguistic practice in many parts of the world, collecting high-quality and low-cost code-mixed data remains a challenge for natural language processing (NLP) research. The recent proliferation of Large Language Models (LLMs) compels one to ask: how capable are these systems in generating code-mixed data? In this paper, we explore prompting multilingual LLMs in a zero… ▽ More

    Submitted 12 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Updating Authors

  15. arXiv:2301.05122  [pdf, other

    quant-ph cs.CC cs.DS

    Quantum algorithm for finding minimum values in a Quantum Random Access Memory

    Authors: Anton S. Albino, Lucas Q. Galvão, Ethan Hansen, Mauro Q. Nooblath Neto, Clebson Cruz

    Abstract: Finding the minimum value in an unordered database is a common and fundamental task in computer science. However, the optimal classical deterministic algorithm can find the minimum value with a time complexity that grows linearly with the number of elements in the database. In this paper, we present the proposal of a quantum algorithm for finding the minimum value of a database, which is quadratic… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  16. arXiv:2212.13656  [pdf, other

    cs.DC

    Smart meter data processing: a showcase for simple and efficient textual processing

    Authors: Miguel Ferreira, André Neves, Rodrigo Gorjão, Carlos Cruz, Miguel L. Pardal

    Abstract: The increase in the production and collection of data from devices is an ongoing trend due to the roll-out of more cyber-physical applications. Smart meters, because of their importance in power grids, are a class of such devices whose produced data requires meticulous processing. In this paper, we use Unicage, a data processing system based on classic Unix shell scripting, that delivers excellent… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 11 pages, 5 figures, 1 table, 9 listings. Accepted after review for the 1st Workshop on High-Performance and Reliable Big Data (HPBD 2021), which was held virtually on September 20th 2021, and was co-located with the 40th International Symposium on Reliable Distributed Systems (SRDS 2021)

  17. arXiv:2205.00952  [pdf, other

    cs.CV

    Leaf Tar Spot Detection Using RGB Images

    Authors: Sriram Baireddy, Da-Young Lee, Carlos Gongora-Canul, Christian D. Cruz, Edward J. Delp

    Abstract: Tar spot disease is a fungal disease that appears as a series of black circular spots containing spores on corn leaves. Tar spot has proven to be an impactful disease in terms of reducing crop yield. To quantify disease progression, experts usually have to visually phenotype leaves from the plant. This process is very time-consuming and is difficult to incorporate in any high-throughput phenotypin… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  18. arXiv:2204.03251  [pdf, other

    cs.CL

    Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings

    Authors: Dan John Velasco, Axel Alba, Trisha Gail Pelagio, Bryce Anthony Ramirez, Unisse Chua, Briane Paul Samson, Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Wordnets are indispensable tools for various natural language processing applications. Unfortunately, wordnets get outdated, and producing or updating wordnets can be slow and costly in terms of time and resources. This problem intensifies for low-resource languages. This study proposes a method for word sense induction and synset induction using only two linguistic resources, namely, an unlabeled… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in SEALP 2023. Formerly titled "Automatic WordNet Construction using Word Sense Induction through Sentence Embeddings"

  19. arXiv:2204.02653  [pdf, ps, other

    cs.CL

    Using Synthetic Data for Conversational Response Generation in Low-resource Settings

    Authors: Gabriel Louis Tan, Adrian Paule Ty, Schuyler Ng, Denzel Adrian Co, Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Response generation is a task in natural language processing (NLP) where a model is trained to respond to human statements. Conversational response generators take this one step further with the ability to respond within the context of previous responses. While there are existing techniques for training such models, they all require an abundance of conversational data which are not always availabl… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  20. arXiv:2111.10513  [pdf, other

    cs.CL

    Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21

    Authors: Lintang Sutawika, Jan Christian Blaise Cruz

    Abstract: In this paper, we describe the submission of the joint Samsung Research Philippines-Konvergen AI team for the WMT'21 Large Scale Multilingual Translation Task - Small Track 2. We submit a standard Seq2Seq Transformer model to the shared task without any training or architecture tricks, relying mainly on the strength of our data preprocessing techniques to boost performance. Our final submission mo… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: In Proceedings of the Sixth Conference on Machine Translation (WMT)

  21. arXiv:2111.06053  [pdf, other

    cs.CL

    Improving Large-scale Language Models and Resources for Filipino

    Authors: Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: In this paper, we improve on existing language resources for the low-resource Filipino language in two ways. First, we outline the construction of the TLUnified dataset, a large-scale pretraining corpus that serves as an improvement over smaller existing pretraining datasets for the language in terms of scale and topic variety. Second, we pretrain new Transformer language models following the RoBE… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: Resources are available at blaisecruz.com/resources

  22. arXiv:2105.12949  [pdf, other

    cs.HC

    A Survey on Interactive Reinforcement Learning: Design Principles and Open Challenges

    Authors: Christian Arzate Cruz, Takeo Igarashi

    Abstract: Interactive reinforcement learning (RL) has been successfully used in various applications in different fields, which has also motivated HCI researchers to contribute in this area. In this paper, we survey interactive RL to empower human-computer interaction (HCI) researchers with the technical background in RL needed to design new interaction techniques and propose new applications. We elucidate… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  23. arXiv:2105.12944  [pdf, other

    cs.HC

    MarioMix: Creating Aligned Playstyles for Bots with Interactive Reinforcement Learning

    Authors: Christian Arzate Cruz, Takeo Igarashi

    Abstract: In this paper, we propose a generic framework that enables game developers without knowledge of machine learning to create bot behaviors with playstyles that align with their preferences. Our framework is based on interactive reinforcement learning (RL), and we used it to create a behavior authoring tool called MarioMix. This tool enables non-experts to create bots with varied playstyles for the g… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  24. arXiv:2105.12938  [pdf, other

    cs.HC

    Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors

    Authors: Christian Arzate Cruz, Takeo Igarashi

    Abstract: Reinforcement learning techniques successfully generate convincing agent behaviors, but it is still difficult to tailor the behavior to align with a user's specific preferences. What is missing is a communication method for the system to explain the behavior and for the user to repair it. In this paper, we present a novel interaction method that uses interactive explanations using templates of nat… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  25. arXiv:2010.11574  [pdf, other

    cs.CL

    Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets

    Authors: Jan Christian Blaise Cruz, Jose Kristian Resabal, James Lin, Dan John Velasco, Charibeth Cheng

    Abstract: Transformers represent the state-of-the-art in Natural Language Processing (NLP) in recent years, proving effective even in tasks done in low-resource languages. While pretrained transformers for these languages can be made, it is challenging to measure their true performance and capacity due to the lack of hard benchmark datasets, as well as the difficulty and cost of producing them. In this pape… ▽ More

    Submitted 13 August, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: To appear in PRICAI 2021. Formerly titled "Investigating the True Performance of Transformers in Low-Resource Languages: A Case Study in Automatic Corpus Creation." Code and data available at https://github.com/jcblaisecruz02/Filipino-Text-Benchmarks

  26. arXiv:2005.02068  [pdf, other

    cs.CL

    Establishing Baselines for Text Classification in Low-Resource Languages

    Authors: Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: While transformer-based finetuning techniques have proven effective in tasks that involve low-resource, low-data environments, a lack of properly established baselines and benchmark datasets make it hard to compare different approaches that are aimed at tackling the low-resource setting. In this work, we provide three contributions. First, we introduce two previously unreleased datasets as benchma… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: We release all our models, finetuning code, and data at https://github.com/jcblaisecruz02/Filipino-Text-Benchmarks

  27. arXiv:2005.01107  [pdf, other

    cs.CL

    Simplifying Paragraph-level Question Generation via Transformer Language Models

    Authors: Luis Enrico Lopez, Diane Kathryn Cruz, Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Question generation (QG) is a natural language generation task where a model is trained to ask questions corresponding to some input text. Most recent approaches frame QG as a sequence-to-sequence problem and rely on additional features and mechanisms to increase performance; however, these often increase model complexity, and can rely on auxiliary data unavailable in practical use. A single Trans… ▽ More

    Submitted 13 August, 2021; v1 submitted 3 May, 2020; originally announced May 2020.

    Comments: To appear in PRICAI 2021. Formerly titled "Transformer-based End-to-End Question Generation."

  28. arXiv:2003.00762  [pdf, other

    eess.IV cs.LG

    Flashlight CNN Image Denoising

    Authors: Pham Huu Thanh Binh, Cristóvão Cruz, Karen Egiazarian

    Abstract: This paper proposes a learning-based denoising method called FlashLight CNN (FLCNN) that implements a deep neural network for image denoising. The proposed approach is based on deep residual networks and inception networks and it is able to leverage many more parameters than residual networks alone for denoising grayscale images corrupted by additive white Gaussian noise (AWGN). FlashLight CNN dem… ▽ More

    Submitted 2 July, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  29. arXiv:1911.01279  [pdf

    cs.CY eess.SP

    Automated Smart Wick System-Based Microfarm Using Internet of Things

    Authors: R. Jorda, Jr., C. Alcabasa, A. Buhay, E. C. Dela Cruz, J. P. Mendoza, A. Tolentino, L. K. Tolentino, E. Fernandez, A. Thio-ac, J. Velasco, N. Arago

    Abstract: This paper presents a study conducted to allow urban farmers to remotely monitor their farm through the design and development of an Internet of Things-based (IoT) microfarm prototype which utilized wick system as planting method. The system involves the detection of three environmental parameters namely, light intensity, soil moisture and temperature through the use of respective sensors which we… ▽ More

    Submitted 30 October, 2019; originally announced November 2019.

    Journal ref: Lecture Notes on Research and Innovation in Computer Engineering and Computer Sciences, 2019

  30. Localization of Fake News Detection via Multitask Transfer Learning

    Authors: Jan Christian Blaise Cruz, Julianne Agatha Tan, Charibeth Cheng

    Abstract: The use of the internet as a fast medium of spreading fake news reinforces the need for computational tools that combat it. Techniques that train fake news classifiers exist, but they all assume an abundance of resources including large labeled datasets and expert-curated corpora, which low-resource languages may not have. In this work, we make two main contributions: First, we alleviate resource… ▽ More

    Submitted 15 May, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Published in the LREC 2020 Proceedings. Models and data available at https://github.com/jcblaisecruz02/Tagalog-fake-news

    Journal ref: In Proceedings of The 12th Language Resources and Evaluation Conference, pp.2589-2597 (2020)

  31. arXiv:1907.07286  [pdf, other

    math.CO cs.DM

    Vertex arboricity of cographs

    Authors: Sebastián González Hermosillo de la Maza, Pavol Hell, César Hernández Cruz, Seyyed Aliasghar Hosseini, Payam Valadkhan

    Abstract: Arboricity is a graph parameter akin to chromatic number, in that it seeks to partition the vertices into the smallest number of sparse subgraphs. Where for the chromatic number we are partitioning the vertices into independent sets, for the arboricity we want to partition the vertices into cycle-free subsets (i.e., forests). Arboricity is NP-hard in general, and our focus is on the arboricity of… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 14 pages, 1 figure

    MSC Class: 05C70; 05C75

  32. Evaluating Language Model Finetuning Techniques for Low-resource Languages

    Authors: Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Unlike mainstream languages (such as English and French), low-resource languages often suffer from a lack of expert-annotated corpora and benchmark resources that make it hard to apply state-of-the-art techniques directly. In this paper, we alleviate this scarcity problem for the low-resourced Filipino language in two ways. First, we introduce a new benchmark language modeling dataset in Filipino… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

    Comments: Pretrained models and datasets available at https://github.com/jcblaisecruz02/Tagalog-BERT

  33. Nonlocality-Reinforced Convolutional Neural Networks for Image Denoising

    Authors: Cristóvão Cruz, Alessandro Foi, Vladimir Katkovnik, Karen Egiazarian

    Abstract: We introduce a paradigm for nonlocal sparsity reinforced deep convolutional neural network denoising. It is a combination of a local multiscale denoising by a convolutional neural network (CNN) based denoiser and a nonlocal denoising based on a nonlocal filter (NLF) exploiting the mutual similarities between groups of patches. CNN models are leveraged with noise levels that progressively decrease… ▽ More

    Submitted 21 June, 2018; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: Accepted for publication in IEEE SPL

  34. Single Image Super-Resolution based on Wiener Filter in Similarity Domain

    Authors: Cristóvão Cruz, Rakesh Mehta, Vladimir Katkovnik, Karen Egiazarian

    Abstract: Single image super resolution (SISR) is an ill-posed problem aiming at estimating a plausible high resolution (HR) image from a single low resolution (LR) image. Current state-of-the-art SISR methods are patch-based. They use either external data or internal self-similarity to learn a prior for a HR image. External data based methods utilize large number of patches from the training data, while se… ▽ More

    Submitted 29 November, 2017; v1 submitted 13 April, 2017; originally announced April 2017.

    Comments: Paper accepted for publication on IEEE Transactions on Image Processing

  35. arXiv:1412.0854  [pdf, other

    cs.AI

    Semantic HMC for Big Data Analysis

    Authors: Thomas Hassan, Rafael Peixoto, Christophe Cruz, Aurlie Bertaux, Nuno Silva

    Abstract: Analyzing Big Data can help corporations to im-prove their efficiency. In this work we present a new vision to derive Value from Big Data using a Semantic Hierarchical Multi-label Classification called Semantic HMC based in a non-supervised Ontology learning process. We also proposea Semantic HMC process, using scalable Machine-Learning techniques and Rule-based reasoning.

    Submitted 2 December, 2014; originally announced December 2014.

  36. arXiv:1301.5349  [pdf

    cs.CG cs.AI

    Toward the Automatic Generation of a Semantic VRML Model from Unorganized 3D Point Clouds

    Authors: Helmi Ben Hmida, Christophe Cruz, Christophe Nicolle, Frank Boochs

    Abstract: This paper presents our experience regarding the creation of 3D semantic facility model out of unorganized 3D point clouds. Thus, a knowledge-based detection approach of objects using the OWL ontology language is presented. This knowledge is used to define SWRL detection rules. In addition, the combination of 3D processing built-ins and topological Built-Ins in SWRL rules aims at combining geometr… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1301.4991, arXiv:1301.4783

    Journal ref: The Fifth International Conference on Advances in Semantic Processing, Lisbon : Portugal (2011)

  37. arXiv:1301.4992  [pdf

    cs.AI

    From 9-IM Topological Operators to Qualitative Spatial Relations using 3D Selective Nef Complexes and Logic Rules for bodies

    Authors: Helmi Ben Hmida, Christophe Cruz, Frank Boochs, Christophe Nicolle

    Abstract: This paper presents a method to compute automatically topological relations using SWRL rules. The calculation of these rules is based on the definition of a Selective Nef Complexes Nef Polyhedra structure generated from standard Polyhedron. The Selective Nef Complexes is a data model providing a set of binary Boolean operators such as Union, Difference, Intersection and Symmetric difference, and u… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1301.4780

    Journal ref: International Conference on Knowledge Engineering and Ontology Development, Barcelone : Spain (2012)

  38. arXiv:1301.4991  [pdf

    cs.AI

    Knowledge Base Approach for 3D Objects Detection in Point Clouds Using 3D Processing and Specialists Knowledge

    Authors: Helmi Ben Hmida, Christophe Cruz, Frank Boochs, Christophe Nicolle

    Abstract: This paper presents a knowledge-based detection of objects approach using the OWL ontology language, the Semantic Web Rule Language, and 3D processing built-ins aiming at combining geometrical analysis of 3D point clouds and specialist's knowledge. Here, we share our experience regarding the creation of 3D semantic facility model out of unorganized 3D point clouds. Thus, a knowledge-based detectio… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Comments: ISSN: 1942-2679. arXiv admin note: text overlap with arXiv:1301.4783

    Journal ref: International Journal On Advances in Intelligent Systems 5, 1 et 2 (2012) 1-14

  39. Integration of knowledge to support automatic object reconstruction from images and 3D data

    Authors: Frank Boochs, Andreas Marbs, Hung Truong, Helmi Ben Hmida, Ashish Karmacharya, Christophe Cruz, Adlane Habed, Yvon Voisin, Christophe Nicolle

    Abstract: Object reconstruction is an important task in many fields of application as it allows to generate digital representations of our physical world used as base for analysis, planning, construction, visualization or other aims. A reconstruction itself normally is based on reliable data (images, 3D point clouds for example) expressing the object in his complete extent. This data then has to be compiled… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Journal ref: Systems, Signals and Devices (SSD), 2011 8th International Multi-Conference on, Chemnitz : Germany (2011)

  40. arXiv:1301.4783  [pdf

    cs.CG cs.AI

    From 3D Point Clouds To Semantic Objects An Ontology-Based Detection Approach

    Authors: Helmi Ben Hmida, Christophe Cruz, Frank Boochs, Christophe Nicolle

    Abstract: This paper presents a knowledge-based detection of objects approach using the OWL ontology language, the Semantic Web Rule Language, and 3D processing built-ins aiming at combining geometrical analysis of 3D point clouds and specialist's knowledge. This combination allows the detection and the annotation of objects contained in point clouds. The context of the study is the detection of railway obj… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Journal ref: International Conference on Knowledge Engineering and Ontology Development, Paris : France (2011)

  41. arXiv:1301.4781  [pdf

    cs.IR cs.DL

    Ontology-based Recommender System of Economic Articles

    Authors: David Werner, Christophe Cruz, Christophe Nicolle

    Abstract: Decision makers need economical information to drive their decisions. The Company Actualis SARL is specialized in the production and distribution of a press review about French regional economic actors. This economic review represents for a client a prospecting tool on partners and competitors. To reduce the overload of useless information, the company is moving towards a customized review for eac… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Journal ref: 8th International Conference on Web Information Systems and Technologies, Porto : Portugal (2013)

  42. From Quantitative Spatial Operator to Qualitative Spatial Relation Using Constructive Solid Geometry, Logic Rules and Optimized 9-IM Model, A Semantic Based Approach

    Authors: Helmi Ben Hmida, Christophe Cruz, Frank Boochs, Christophe Nicolle

    Abstract: The Constructive Solid Geometry (CSG) is a data model providing a set of binary Boolean operators such as Union, Difference and Intersection. In this work, these operators are used to compute topological relations between objects defined by the constraints of the nine Intersection Model (9-IM) from Egenhofer. With the help of these constraints, we define a procedure to compute the topological rela… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Journal ref: IEEE International Conference on Computer Science and Automation Engineering (CSAE),, Zhangjiajie : China (2012)

  43. arXiv:1208.1750  [pdf

    cs.SE cs.AI

    Guidelines for a Dynamic Ontology - Integrating Tools of Evolution and Versioning in Ontology

    Authors: Perrine Pittet, Christophe Nicolle, Christophe Cruz

    Abstract: Ontologies are built on systems that conceptually evolve over time. In addition, techniques and languages for building ontologies evolve too. This has led to numerous studies in the field of ontology versioning and ontology evolution. This paper presents a new way to manage the lifecycle of an ontology incorporating both versioning tools and evolution process. This solution, called VersionGraph, i… ▽ More

    Submitted 8 August, 2012; originally announced August 2012.

    Journal ref: KMIS 2011 - International Conference on Knowledge Management and Information Sharing is part of 3rd International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management., Paris : France (2011)