
Showing 1–14 of 14 results for author: Ramírez, C

Searching in archive cs.
  1. arXiv:2406.14864  [pdf, other]

    cs.LG stat.AP stat.ML

    A review of feature selection strategies utilizing graph data structures and knowledge graphs

    Authors: Sisi Shao, Pedro Henrique Ribeiro, Christina Ramirez, Jason H. Moore

    Abstract: Feature selection in Knowledge Graphs (KGs) is increasingly utilized in diverse domains, including biomedical research, Natural Language Processing (NLP), and personalized recommendation systems. This paper delves into the methodologies for feature selection within KGs, emphasizing their roles in enhancing machine learning (ML) model efficacy, hypothesis generation, and interpretability. Through…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.20452  [pdf, other]

    cs.LG cs.IT stat.ML

    Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures

    Authors: Jorge F. Silva, Victor Faraggi, Camilo Ramirez, Alvaro Egana, Eduardo Pavez

    Abstract: We present new results to model and understand the role of encoder-decoder design in machine learning (ML) from an information-theoretic angle. We use two main information concepts, information sufficiency (IS) and mutual information loss (MIL), to represent predictive structures in machine learning. Our first main result provides a functional expression that characterizes the class of probabilist…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.04657  [pdf, other]

    cs.LG cs.AI q-bio.BM

    ACEGEN: Reinforcement learning of generative chemical agents for drug discovery

    Authors: Albert Bou, Morgan Thomas, Sebastian Dittert, Carles Navarro Ramírez, Maciej Majewski, Ye Wang, Shivam Patel, Gary Tresadern, Mazen Ahmad, Vincent Moens, Woody Sherman, Simone Sciabola, Gianni De Fabritiis

    Abstract: In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties. However, striking a balance between capabilities, flexibility, reliability, and efficiency remains challenging due to the complexity of advanced RL algorithms and the significant reliance on specialized code. In this work, we…

    Submitted 3 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  4. arXiv:2405.03667  [pdf, other]

    eess.SP cs.IT cs.LG

    Fault Detection and Monitoring using an Information-Driven Strategy: Method, Theory, and Application

    Authors: Camilo Ramírez, Jorge F. Silva, Ferhat Tamssaouet, Tomás Rojas, Marcos E. Orchard

    Abstract: The ability to detect when a system undergoes an incipient fault is of paramount importance in preventing a critical failure. In this work, we propose an information-driven fault detection method based on a novel concept drift detector. The method is tailored to identifying drifts in input-output relationships of additive noise models (i.e., model drifts) and is based on a distribution-free mutual…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 28 pages, 11 figures

  5. arXiv:2404.07814  [pdf, ps, other]

    cs.CL

    MultiLS-SP/CA: Lexical Complexity Prediction and Lexical Simplification Resources for Catalan and Spanish

    Authors: Stefan Bott, Horacio Saggion, Nelson Peréz Rojas, Martin Solis Salazar, Saul Calderon Ramirez

    Abstract: Automatic lexical simplification is a task to substitute lexical items that may be unfamiliar and difficult to understand with easier and more common words. This paper presents MultiLS-SP/CA, a novel dataset for lexical simplification in Spanish and Catalan. This dataset represents the first of its kind in Catalan and a substantial addition to the sparse data on automatic lexical simplification wh…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Submitted to the 40th edition of the SEPLN Conference. Under Revision

  6. Performance Analysis of Matrix Multiplication for Deep Learning on the Edge

    Authors: Cristian Ramírez, Adrián Castelló, Héctor Martínez, Enrique S. Quintana-Ortí

    Abstract: The devices designed for the Internet-of-Things encompass a large variety of distinct processor architectures, forming a highly heterogeneous zoo. In order to tackle this, we employ a simulator to estimate the performance of the matrix-matrix multiplication (GEMM) kernel on processors designed to operate at the edge. Our simulator adheres to the modern implementations of GEMM, advocated by GotoBLA…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 12 pages, 2 Tables, 6 Figures

    Journal ref: High Performance Computing. ISC High Performance 2022 International Workshops. ISC High Performance 2022. Lecture Notes in Computer Science, vol 13387. Springer, Cham

  7. arXiv:2402.09021  [pdf, other]

    cs.LO

    Unified Opinion Dynamic Modeling as Concurrent Set Relations in Rewriting Logic

    Authors: Carlos Olarte, Carlos Ramírez, Camilo Rocha, Frank Valencia

    Abstract: Social media platforms have played a key role in weaponizing the polarization of social, political, and democratic processes. This is, mainly, because they are a medium for opinion formation. Opinion dynamic models are a tool for understanding the role of specific social factors on the acceptance/rejection of opinions because they can be used to analyze certain assumptions on human behaviors. This…

    Submitted 14 February, 2024; originally announced February 2024.

  8. arXiv:2401.17434  [pdf]

    cs.CY cs.AI cs.HC

    Integrating Generative AI in Hackathons: Opportunities, Challenges, and Educational Implications

    Authors: Ramteja Sajja, Carlos Erazo Ramirez, Zhouyayan Li, Bekir Z. Demiray, Yusuf Sermet, Ibrahim Demir

    Abstract: Hackathons and software competitions, increasingly pivotal in the software industry, serve as vital catalysts for innovation and skill development for both organizations and students. These platforms enable companies to prototype ideas swiftly, while students gain enriched learning experiences, enhancing their practical skills. Over the years, hackathons have transitioned from mere competitive eve…

    Submitted 1 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 8491 words, 23 pages, 12 figures

  9. arXiv:2212.14177  [pdf, other]

    cs.AI cs.CY eess.IV

    Current State of Community-Driven Radiological AI Deployment in Medical Imaging

    Authors: Vikash Gupta, Barbaros Selnur Erdal, Carolina Ramirez, Ralf Floca, Laurence Jackson, Brad Genereaux, Sidney Bryson, Christopher P Bridge, Jens Kleesiek, Felix Nensa, Rickmer Braren, Khaled Younis, Tobias Penzkofer, Andreas Michael Bucher, Ming Melvin Qin, Gigon Bae, Hyeonhoon Lee, M. Jorge Cardoso, Sebastien Ourselin, Eric Kerfoot, Rahul Choudhury, Richard D. White, Tessa Cook, David Bericat, Matthew Lungren, et al. (2 additional authors not shown)

    Abstract: Artificial Intelligence (AI) has become commonplace to solve routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. We project that the gap between the number of imaging exams and the number of expert radiologist readers required to cover this increase will continue to expand, consequently introd…

    Submitted 8 May, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: 21 pages; 5 figures

    MSC Class: eess.IV

  10. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  11. arXiv:2103.15710  [pdf, other]

    cs.LO

    Representation of a vehicular traffic model using hybrid systems

    Authors: Miguel Andres Velasquez, Carlos Ernesto Ramirez

    Abstract: There is a great diversity of formal models to understand the dynamics of transport and vehicular flow on a road. Many of these models are inspired by the dynamics of flows governed by partial differential equations. However, it is possible to simplify these models to ordinary equations by considering constant variations in some of the input variables in this type of models. However, given that th…

    Submitted 29 March, 2021; originally announced March 2021.

  12. arXiv:2006.09693  [pdf, other]

    stat.ML cs.LG

    FREEtree: A Tree-based Approach for High Dimensional Longitudinal Data With Correlated Features

    Authors: Yuancheng Xu, Athanasse Zafirov, R. Michael Alvarez, Dan Kojis, Min Tan, Christina M. Ramirez

    Abstract: This paper proposes FREEtree, a tree-based method for high dimensional longitudinal data with correlated features. Popular machine learning approaches, like Random Forests, commonly used for variable selection do not perform well when there are correlated features and do not account for data observed over time. FREEtree deals with longitudinal data by using a piecewise random effects model. It als…

    Submitted 17 June, 2020; originally announced June 2020.

  13. arXiv:1303.1232  [pdf]

    cs.CL cs.AI

    Japanese-Spanish Thesaurus Construction Using English as a Pivot

    Authors: Jessica Ramírez, Masayuki Asahara, Yuji Matsumoto

    Abstract: We present the results of research with the goal of automatically creating a multilingual thesaurus based on the freely available resources of Wikipedia and WordNet. Our goal is to increase resources for natural language processing tasks such as machine translation targeting the Japanese-Spanish language pair. Given the scarcity of resources, we use existing English resources as a pivot for creati…

    Submitted 5 March, 2013; originally announced March 2013.

    Journal ref: In Proceeding of The Third International Joint Conference on Natural Language Processing (IJCNLP-08), Hyderabad, India. pages 473-480, 2008

  14. arXiv:1211.4488  [pdf]

    cs.CL cs.AI

    A Rule-Based Approach For Aligning Japanese-Spanish Sentences From A Comparable Corpora

    Authors: Jessica C. Ramírez, Yuji Matsumoto

    Abstract: The performance of a Statistical Machine Translation (SMT) system is directly proportional to the quality and length of the parallel corpus it uses. However, for some language pairs there is a considerable lack of such corpora. The long-term goal is to construct a Japanese-Spanish parallel corpus to be used for SMT, since there is a lack of useful Japanese-Spanish parallel corpora. To addres…

    Submitted 19 November, 2012; originally announced November 2012.

    Comments: International Journal on Natural Language Computing (IJNLC) Vol.1, No.3, October 2012