Skip to main content

Showing 1–37 of 37 results for author: Gonzalez, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10890  [pdf

    cs.AI cs.HC cs.IR

    Exploring Augmentation and Cognitive Strategies for AI based Synthetic Personae

    Authors: Rafael Arias Gonzalez, Steve DiPaola

    Abstract: Large language models (LLMs) hold potential for innovative HCI research, including the creation of synthetic personae. However, their black-box nature and propensity for hallucinations pose challenges. To address these limitations, this position paper advocates for using LLMs as data augmentation systems rather than zero-shot generators. We further propose the development of robust cognitive and m… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: This paper was accepted for publication: Proceedings of ACM Conf on Human Factors in Computing Systems (CHI 24), Rafael Arias Gonzalez, Steve DiPaola. Exploring Augmentation and Cognitive Strategies for Synthetic Personae. ACM SigCHI, in Challenges and Opportunities of LLM-Based Synthetic Personae and Data in HCI Workshop, 2024

    ACM Class: I.2.7

  2. arXiv:2404.08523  [pdf, other

    cs.LG cs.AI

    Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement

    Authors: Lucas Murray, Tatiana Castillo, Jaime Carrasco, Andrés Weintraub, Richard Weber, Isaac Martín de Diego, José Ramón González, Jordi García-Gonzalo

    Abstract: Over the past decades, the increase in both frequency and intensity of large-scale wildfires due to climate change has emerged as a significant natural threat. The pressing need to design resilient landscapes capable of withstanding such disasters has become paramount, requiring the development of advanced decision-support tools. Existing methodologies, including Mixed Integer Programming, Stochas… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 20 pages, 15 figures

  3. arXiv:2403.15604  [pdf

    cs.HC cs.AI

    Investigating Use Cases of AI-Powered Scene Description Applications for Blind and Low Vision People

    Authors: Ricardo Gonzalez, Jazmin Collins, Shiri Azenkot, Cynthia Bennett

    Abstract: "Scene description" applications that describe visual content in a photo are useful daily tools for blind and low vision (BLV) people. Researchers have studied their use, but they have only explored those that leverage remote sighted assistants; little is known about applications that use AI to generate their descriptions. Thus, to investigate their use cases, we conducted a two-week diary study w… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 21 pages, 18 figures, 5 tables, to appear CHI2024

  4. arXiv:2401.06649  [pdf, other

    cs.NE

    Data-Efficient Interactive Multi-Objective Optimization Using ParEGO

    Authors: Arash Heidari, Sebastian Rojas Gonzalez, Tom Dhaene, Ivo Couckuyt

    Abstract: Multi-objective optimization is a widely studied problem in diverse fields, such as engineering and finance, that seeks to identify a set of non-dominated solutions that provide optimal trade-offs among competing objectives. However, the computation of the entire Pareto front can become prohibitively expensive, both in terms of computational resources and time, particularly when dealing with a lar… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted at ECML PKDD 2023 workshop: Neuro-Explicit AI and Expert-informed Machine Learning for Engineering and Physical Sciences

  5. Performance of externally validated machine learning models based on histopathology images for the diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer: A systematic review

    Authors: Ricardo Gonzalez, Peyman Nejat, Ashirbani Saha, Clinton J. V. Campbell, Andrew P. Norgan, Cynthia Lokker

    Abstract: Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome predi… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Journal ref: Journal of Pathology Informatics. 2023;15:100348

  6. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities

    Authors: Ricardo Gonzalez, Ashirbani Saha, Clinton J. V. Campbell, Peyman Nejat, Cynthia Lokker, Andrew P. Norgan

    Abstract: This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and t… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Journal ref: Journal of Pathology Informatics 15 (2024) 100347

  7. arXiv:2308.11162  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    A Preliminary Investigation into Search and Matching for Tumour Discrimination in WHO Breast Taxonomy Using Deep Networks

    Authors: Abubakr Shafique, Ricardo Gonzalez, Liron Pantanowitz, Puay Hoon Tan, Alberto Machado, Ian A Cree, Hamid R. Tizhoosh

    Abstract: Breast cancer is one of the most common cancers affecting women worldwide. They include a group of malignant neoplasms with a variety of biological, clinical, and histopathological characteristics. There are more than 35 different histological forms of breast lesions that can be classified and diagnosed histologically according to cell morphology, growth, and architecture patterns. Recently, deep… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  8. arXiv:2307.10214  [pdf, other

    cs.CR cs.LG

    Time for aCTIon: Automated Analysis of Cyber Threat Intelligence in the Wild

    Authors: Giuseppe Siracusano, Davide Sanvito, Roberto Gonzalez, Manikantan Srinivasan, Sivakaman Kamatchi, Wataru Takahashi, Masaru Kawakita, Takahiro Kakumaru, Roberto Bifulco

    Abstract: Cyber Threat Intelligence (CTI) plays a crucial role in assessing risks and enhancing security for organizations. However, the process of extracting relevant information from unstructured text sources can be expensive and time-consuming. Our empirical experience shows that existing tools for automated structured CTI extraction have performance limitations. Furthermore, the community lacks a common… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  9. Immunohistochemistry Biomarkers-Guided Image Search for Histopathology

    Authors: Abubakr Shafique, Morteza Babaie, Ricardo Gonzalez, H. R. Tizhoosh

    Abstract: Medical practitioners use a number of diagnostic tests to make a reliable diagnosis. Traditionally, Haematoxylin and Eosin (H&E) stained glass slides have been used for cancer diagnosis and tumor detection. However, recently a variety of immunohistochemistry (IHC) stained slides can be requested by pathologists to examine and confirm diagnoses for determining the subtype of a tumor when this is di… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  10. Composite Biomarker Image for Advanced Visualization in Histopathology

    Authors: Abubakr Shafique, Morteza Babaie, Ricardo Gonzalez, Adrian Batten, Soma Sikdar, H. R. Tizhoosh

    Abstract: Immunohistochemistry (IHC) biomarkers are essential tools for reliable cancer diagnosis and subty**. It requires cross-staining comparison among Whole Slide Images (WSIs) of IHCs and hematoxylin and eosin (H&E) slides. Currently, pathologists examine the visually co-localized areas across IHC and H&E glass slides for a final diagnosis, which is a tedious and challenging task. Moreover, visually… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  11. arXiv:2302.01310  [pdf, other

    stat.ML cs.LG math.OC

    Bayesian Optimization of Multiple Objectives with Different Latencies

    Authors: Jack M. Buckingham, Sebastian Rojas Gonzalez, Juergen Branke

    Abstract: Multi-objective Bayesian optimization aims to find the Pareto front of optimal trade-offs between a set of expensive objectives while collecting as few samples as possible. In some cases, it is possible to evaluate the objectives separately, and a different latency or evaluation cost can be associated with each objective. This presents an opportunity to learn the Pareto front faster by evaluating… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 25 pages

  12. arXiv:2209.03919  [pdf, other

    stat.ML cs.LG

    Bi-objective Ranking and Selection Using Stochastic Kriging

    Authors: Sebastian Rojas Gonzalez, Juergen Branke, Inneke van Nieuwenhuyse

    Abstract: We consider bi-objective ranking and selection problems, where the goal is to correctly identify the Pareto optimal solutions among a finite set of candidates for which the two objective outcomes have been observed with uncertainty (e.g., after running a multiobjective stochastic simulation optimization procedure). When identifying these solutions, the noise perturbing the observed performance may… ▽ More

    Submitted 28 March, 2024; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: 33 pages, 14 figures

  13. arXiv:2208.05910  [pdf, other

    physics.soc-ph cs.LG

    Machine learning in front of statistical methods for prediction spread SARS-CoV-2 in Colombia

    Authors: A. Estupiñán, J. Acuña, A. Rodriguez, A. Ayala, C. Estupiñán, Ramon E. R. Gonzalez, D. A. Triana-Camacho, K. L. Cristiano-Rodríguez, Carlos Andrés Collazos Morales

    Abstract: An analytical study of the disease COVID-19 in Colombia was carried out using mathematical models such as Susceptible-Exposed-Infectious-Removed (SEIR), Logistic Regression (LR), and a machine learning method called Polynomial Regression Method. Previous analysis has been performed on the daily number of cases, deaths, infected people, and people who were exposed to the virus, all of them in a tim… ▽ More

    Submitted 27 September, 2022; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: 15 pages, 15 figures

  14. arXiv:2203.15324  [pdf, other

    cs.LG cs.DC cs.OS

    syslrn: Learning What to Monitor for Efficient Anomaly Detection

    Authors: Davide Sanvito, Giuseppe Siracusano, Sharan Santhanam, Roberto Gonzalez, Roberto Bifulco

    Abstract: While monitoring system behavior to detect anomalies and failures is important, existing methods based on log-analysis can only be as good as the information contained in the logs, and other approaches that look at the OS-level software state introduce high overheads. We tackle the problem with syslrn, a system that first builds an understanding of a target system offline, and then tailors the onl… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  15. arXiv:2112.08760  [pdf, other

    cs.NE cs.LG

    Constrained multi-objective optimization of process design parameters in settings with scarce data: an application to adhesive bonding

    Authors: Alejandro Morales-Hernández, Sebastian Rojas Gonzalez, Inneke Van Nieuwenhuyse, Ivo Couckuyt, Jeroen Jordens, Maarten Witters, Bart Van Doninck

    Abstract: Adhesive joints are increasingly used in industry for a wide variety of applications because of their favorable characteristics such as high strength-to-weight ratio, design flexibility, limited stress concentrations, planar force transfer, good damage tolerance, and fatigue resistance. Finding the optimal process parameters for an adhesive bonding process is challenging: the optimization is inher… ▽ More

    Submitted 10 April, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

  16. arXiv:2112.06769  [pdf, other

    cs.LG

    Multi-objective simulation optimization of the adhesive bonding process of materials

    Authors: Alejandro Morales-Hernández, Inneke Van Nieuwenhuyse, Sebastian Rojas Gonzalez, Jeroen Jordens, Maarten Witters, Bart Van Doninck

    Abstract: Automotive companies are increasingly looking for ways to make their products lighter, using novel materials and novel bonding processes to join these materials together. Finding the optimal process parameters for such adhesive bonding process is challenging. In this research, we successfully applied Bayesian optimization using Gaussian Process Regression and Logistic Regression, to efficiently (i… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted on Winter Simulation Conference (WSC21)

  17. arXiv:2111.13755  [pdf, other

    cs.LG cs.AI math.OC

    A survey on multi-objective hyperparameter optimization algorithms for Machine Learning

    Authors: Alejandro Morales-Hernández, Inneke Van Nieuwenhuyse, Sebastian Rojas Gonzalez

    Abstract: Hyperparameter optimization (HPO) is a necessary step to ensure the best possible performance of Machine Learning (ML) algorithms. Several methods have been developed to perform HPO; most of these are focused on optimizing one performance measure (usually an error-based measure), and the literature on such single-objective HPO problems is vast. Recently, though, algorithms have appeared that focus… ▽ More

    Submitted 15 November, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

  18. Simulating Crowds and Autonomous Vehicles

    Authors: John Charlton, Luis Rene Montana Gonzalez, Steve Maddock, Paul Richmond

    Abstract: Understanding how people view and interact with autonomous vehicles is important to guide future directions of research. One such way of aiding understanding is through simulations of virtual environments involving people and autonomous vehicles. We present a simulation model that incorporates people and autonomous vehicles in a shared urban space. The model is able to simulate many thousands of p… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: 15 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:1908.10107

    Journal ref: Transactions on Computational Science XXXVII, 2020, 129-143

  19. arXiv:2005.11098  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning Based Detection and Localization of Intracranial Aneurysms in Computed Tomography Angiography

    Authors: Dufan Wu, Daniel Montes, Ziheng Duan, Yangsibo Huang, Javier M. Romero, Ramon Gilberto Gonzalez, Quanzheng Li

    Abstract: Purpose: To develop CADIA, a supervised deep learning model based on a region proposal network coupled with a false-positive reduction module for the detection and localization of intracranial aneurysms (IA) from computed tomography angiography (CTA), and to assess our model's performance to a similar detection network. Methods: In this retrospective study, we evaluated 1,216 patients from two sep… ▽ More

    Submitted 14 December, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

  20. arXiv:2002.07722  [pdf, other

    cs.CR

    Image encryption based on flexible computing of chaotic systems

    Authors: R. C. Gonzalez, E. G. Nepomuceno

    Abstract: The increase in data traffic on the internet has significantly increased the relevance of data and image encryption. Among the techniques most used in cryptography, chaotic systems have received great attention due to their easy implementation. However, it has recently been observed that these systems can lose their chaotic properties due to the finite precision of computers. In this work, we inte… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: DINCON 2019 - Conferencia Brasileira de Dinamica, Controle e Aplicacoes. Sao Carlos (SP). Brazil. 7 pages. In Portuguese

  21. arXiv:1912.12362  [pdf, other

    cs.MM cs.CL cs.SD eess.AS

    Structural characterization of musical harmonies

    Authors: Maria Rojo González, Simone Santini

    Abstract: Understanding the structural characteristics of harmony is essential for an effective use of music as a communication medium. Of the three expressive axes of music (melody, rhythm, harmony), harmony is the foundation on which the emotional content is built, and its understanding is important in areas such as multimedia and affective computing. The common tool for studying this kind of structure in… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

  22. arXiv:1912.08103  [pdf, ps, other

    stat.ML cs.LG eess.SP math.ST

    A Finite-Sample Deviation Bound for Stable Autoregressive Processes

    Authors: Rodrigo A. González, Cristian R. Rojas

    Abstract: In this paper, we study non-asymptotic deviation bounds of the least squares estimator in Gaussian AR($n$) processes. By relying on martingale concentration inequalities and a tail-bound for $χ^2$ distributed variables, we provide a concentration bound for the sample covariance matrix of the process output. With this, we present a problem-dependent finite-time bound on the deviation probability of… ▽ More

    Submitted 25 May, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: 15 pages

  23. Fast Simulation of Crowd Collision Avoidance

    Authors: John Charlton, Luis Rene Montana Gonzalez, Steve Maddock, Paul Richmond

    Abstract: Real-time large-scale crowd simulations with realistic behavior, are important for many application areas. On CPUs, the ORCA pedestrian steering model is often used for agent-based pedestrian simulations. This paper introduces a technique for running the ORCA pedestrian steering model on the GPU. Performance improvements of up to 30 times greater than a multi-core CPU model are demonstrated. This… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: 12 pages, 6 figures, 36th Computer Graphics International Conference (CGI 2019)

    Journal ref: CGI 2019: Advances in Computer Graphics, 36, pp 266-277

  24. arXiv:1902.05062  [pdf, ps, other

    cs.LG

    Machine Learning of Time Series Using Time-delay Embedding and Precision Annealing

    Authors: Alexander J. A. Ty, Zheng Fang, Rivver A. Gonzalez, Paul J. Rozdeba, Henry D. I. Abarbanel

    Abstract: Tasking machine learning to predict segments of a time series requires estimating the parameters of a ML model with input/output pairs from the time series. Using the equivalence between statistical data assimilation and supervised machine learning, we revisit this task. The training method for the machine utilizes a precision annealing approach to identifying the global minimum of the action (-lo… ▽ More

    Submitted 14 June, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

  25. arXiv:1807.10215  [pdf, other

    cs.CV cs.LG

    DeepSPINE: Automated Lumbar Vertebral Segmentation, Disc-level Designation, and Spinal Stenosis Grading Using Deep Learning

    Authors: Jen-Tang Lu, Stefano Pedemonte, Bernardo Bizzo, Sean Doyle, Katherine P. Andriole, Mark H. Michalski, R. Gilberto Gonzalez, Stuart R. Pomerantz

    Abstract: The high prevalence of spinal stenosis results in a large volume of MRI imaging, yet interpretation can be time-consuming with high inter-reader variability even among the most specialized radiologists. In this paper, we develop an efficient methodology to leverage the subject-matter-expertise stored in large-scale archival reporting and image data for a deep-learning approach to fully-automated l… ▽ More

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: Accepted as spotlight talk at Machine Learning for Healthcare (MLHC) 2018. Supplementary Video: https://bit.ly/DeepSPINE

  26. arXiv:1806.07379  [pdf, other

    cs.CV cs.RO

    DeepTerramechanics: Terrain Classification and Slip Estimation for Ground Robots via Deep Learning

    Authors: Ramon Gonzalez, Karl Iagnemma

    Abstract: Terramechanics plays a critical role in the areas of ground vehicles and ground mobile robots since understanding and estimating the variables influencing the vehicle-terrain interaction may mean the success or the failure of an entire mission. This research applies state-of-the-art algorithms in deep learning to two key problems: estimating wheel slip and classifying the terrain being traversed b… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 22 pages, 23 figures

  27. arXiv:1801.10095  [pdf, other

    cs.IR cs.CL

    TransRev: Modeling Reviews as Translations from Users to Items

    Authors: Alberto Garcia-Duran, Roberto Gonzalez, Daniel Onoro-Rubio, Mathias Niepert, Hui Li

    Abstract: The text of a review expresses the sentiment a customer has towards a particular product. This is exploited in sentiment analysis where machine learning models are used to predict the review score from the text of the review. Furthermore, the products costumers have purchased in the past are indicative of the products they will purchase in the future. This is what recommender systems exploit by le… ▽ More

    Submitted 18 April, 2018; v1 submitted 30 January, 2018; originally announced January 2018.

  28. arXiv:1710.05604  [pdf, other

    cs.HC cs.CY

    Collaboration Spheres: a Visual Metaphor to Share and Reuse Research Objects

    Authors: Mariano Rico, José Manuel Gómez-Pérez, Rafael Gonzalez, Aleix Garrido, Oscar Corcho

    Abstract: Research Objects (ROs) are semantically enhanced aggregations of resources associated to scientific experiments, such as data, provenance of these data, the scientific workflow used to run the experiment, intermediate results, logs and the interpretation of the results. As the number of ROs increases, it is becoming difficult to find ROs to be used, reused or re-purposed. New search and retrieval… ▽ More

    Submitted 16 October, 2017; originally announced October 2017.

    Comments: The URL to the web app does not work

  29. arXiv:1709.02314  [pdf, other

    cs.LG cs.AI

    Answering Visual-Relational Queries in Web-Extracted Knowledge Graphs

    Authors: Daniel Oñoro-Rubio, Mathias Niepert, Alberto García-Durán, Roberto González, Roberto J. López-Sastre

    Abstract: A visual-relational knowledge graph (KG) is a multi-relational graph whose entities are associated with images. We explore novel machine learning approaches for answering visual-relational queries in web-extracted knowledge graphs. To this end, we have created ImageGraph, a KG with 1,330 relation types, 14,870 entities, and 829,931 images crawled from the web. With visual-relational KGs such as Im… ▽ More

    Submitted 3 May, 2019; v1 submitted 7 September, 2017; originally announced September 2017.

    Journal ref: AKBC2019

  30. arXiv:1705.03881  [pdf, other

    cs.NI cs.LG

    Net2Vec: Deep Learning for the Network

    Authors: Roberto Gonzalez, Filipe Manco, Alberto Garcia-Duran, Jose Mendes, Felipe Huici, Saverio Niccolini, Mathias Niepert

    Abstract: We present Net2Vec, a flexible high-performance platform that allows the execution of deep learning algorithms in the communication network. Net2Vec is able to capture data from the network at more than 60Gbps, transform it into meaningful tuples and apply predictions over the tuples in real time. This platform can be used for different purposes ranging from traffic classification to network perfo… ▽ More

    Submitted 10 May, 2017; originally announced May 2017.

  31. arXiv:1705.00548  [pdf, other

    cs.NI cs.MM

    Understanding the evolution of multimedia content in the Internet through BitTorrent glasses

    Authors: Reza Farahbakhsh, Angel Cuevas, Ruben Cuevas, Roberto Gonzalez, Noel Crespi

    Abstract: Today's Internet traffic is mostly dominated by multimedia content and the prediction is that this trend will intensify in the future. Therefore, main Internet players, such as ISPs, content delivery platforms (e.g. Youtube, Bitorrent, Netflix, etc) or CDN operators, need to understand the evolution of multimedia content availability and popularity in order to adapt their infrastructures and resou… ▽ More

    Submitted 1 May, 2017; originally announced May 2017.

    Comments: Farahbakhsh, Reza, et al. "Understanding the evolution of multimedia content in the internet through bittorrent glasses." IEEE Network 27.6 (2013): 80-88

    Journal ref: IEEE Network 27.6 (2013): 80-88

  32. Understanding the Detection of View Fraud in Video Content Portals

    Authors: Miriam Marciel, Ruben Cuevas, Albert Banchs, Roberto Gonzalez, Stefano Traverso, Mohamed Ahmed, Arturo Azcorra

    Abstract: While substantial effort has been devoted to understand fraudulent activity in traditional online advertising (search and banner), more recent forms such as video ads have received little attention. The understanding and identification of fraudulent activity (i.e., fake views) in video ads for advertisers, is complicated as they rely exclusively on the detection mechanisms deployed by video hostin… ▽ More

    Submitted 5 February, 2016; v1 submitted 31 July, 2015; originally announced July 2015.

    Comments: To appear in WWW 2016, Montréal, Québec, Canada. Please cite the conference version of this paper

  33. arXiv:1309.1416  [pdf, other

    cs.CR

    Automated Password Extraction Attack on Modern Password Managers

    Authors: Raul Gonzalez, Eric Y. Chen, Collin Jackson

    Abstract: To encourage users to use stronger and more secure passwords, modern web browsers offer users password management services, allowing users to save previously entered passwords locally onto their hard drives. We present Lupin, a tool that automatically extracts these saved passwords without the user's knowledge. Lupin allows a network adversary to obtain passwords as long as the login form appears… ▽ More

    Submitted 5 September, 2013; originally announced September 2013.

    Comments: 7 pages

  34. arXiv:1205.5662  [pdf, ps, other

    cs.SI cs.NI

    Google+ or Google-?: Dissecting the Evolution of the New OSN in its First Year

    Authors: Roberto Gonzalez, Ruben Cuevas, Reza Motamedi, Reza Rejaie, Angel Cuevas

    Abstract: In the era when Facebook and Twitter dominate the market for social media, Google has introduced Google+ (G+) and reported a significant growth in its size while others called it a ghost town. This begs the question that "whether G+ can really attract a significant number of connected and active users despite the dominance of Facebook and Twitter?". This paper tackles the above question by prese… ▽ More

    Submitted 26 March, 2013; v1 submitted 25 May, 2012; originally announced May 2012.

    Comments: WWW 2013

  35. arXiv:1105.3682  [pdf, ps, other

    cs.SI physics.soc-ph

    Where are my followers? Understanding the Locality Effect in Twitter

    Authors: Roberto Gonzalez, Ruben Cuevas, Angel Cuevas, Carmen Guerrero

    Abstract: Twitter is one of the most used applications in the current Internet with more than 200M accounts created so far. As other large-scale systems Twitter can obtain enefit by exploiting the Locality effect existing among its users. In this paper we perform the first comprehensive study of the Locality effect of Twitter. For this purpose we have collected the geographical location of around 1M Twitter… ▽ More

    Submitted 18 May, 2011; originally announced May 2011.

  36. arXiv:1105.3671  [pdf, ps, other

    cs.CR cs.NI

    TorrentGuard: stop** scam and malware distribution in the BitTorrent ecosystem

    Authors: Michal Kryczka, Ruben Cuevas, Roberto Gonzalez, Angel Cuevas, Arturo Azcorra

    Abstract: In this paper we conduct a large scale measurement study in order to analyse the fake content publishing phenomenon in the BitTorrent Ecosystem. Our results reveal that fake content represents an important portion (35%) of those files shared in BitTorrent and just a few tens of users are responsible for 90% of this content. Furthermore, more than 99% of the analysed fake files are linked to either… ▽ More

    Submitted 19 April, 2012; v1 submitted 18 May, 2011; originally announced May 2011.

  37. arXiv:0712.3360  [pdf, ps, other

    cs.DS

    Compressed Text Indexes:From Theory to Practice!

    Authors: Paolo Ferragina, Rodrigo Gonzalez, Gonzalo Navarro, Rossano Venturini

    Abstract: A compressed full-text self-index represents a text in a compressed form and still answers queries efficiently. This technology represents a breakthrough over the text indexing techniques of the previous decade, whose indexes required several times the size of the text. Although it is relatively new, this technology has matured up to a point where theoretical research is giving way to practical… ▽ More

    Submitted 20 December, 2007; originally announced December 2007.

    ACM Class: F.2.2; H.2.1; H.3.2; H.3.3