Search | arXiv e-print repository

Targeted Image Data Augmentation Increases Basic Skills Captioning Robustness

Authors: Valentin Barriere, Felipe del Rio, Andres Carvallo De Ferari, Carlos Aspillaga, Eugenio Herrera-Berg, Cristian Buc Calderon

Abstract: Artificial neural networks typically struggle in generalizing to out-of-context examples. One reason for this limitation is that datasets incorporate only partial information regarding the potential correlational structure of the world. In this work, we propose TIDA (Targeted Image-editing Data Augmentation), a targeted data augmentation method focused on improving models' human-like abilities (e.g., gender recognition) by filling the correlational structure gap using a text-to-image generative model. More specifically, TIDA identifies specific skills in captions describing images (e.g., the presence of a specific gender in the image), changes the caption (e.g., "woman" to "man"), and then uses a text-to-image model to edit the image in order to match the novel caption (e.g., uniquely changing a woman to a man while maintaining the context identical). Based on the Flickr30K benchmark, we show that, compared with the original data set, a TIDA-enhanced dataset related to gender, color, and counting abilities induces better performance in several image captioning metrics. Furthermore, on top of relying on the classical BLEU metric, we conduct a fine-grained analysis of the improvements of our models against the baseline in different ways. We compared text-to-image generative models and found different behaviors of the image captioning models in terms of visual encoding and textual decoding.

Submitted 17 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

arXiv:2306.09890 [pdf, other]

Studying Generalization on Memory-Based Methods in Continual Learning

Authors: Felipe del Rio, Julio Hurtado, Cristian Buc, Alvaro Soto, Vincenzo Lomonaco

Abstract: One of the objectives of Continual Learning is to learn new concepts continually over a stream of experiences and at the same time avoid catastrophic forgetting. To mitigate complete knowledge overwriting, memory-based methods store a percentage of previous data distributions to be used during training. Although these methods produce good results, few studies have tested their out-of-distribution generalization properties, as well as whether these methods overfit the replay memory. In this work, we show that although these methods can help in traditional in-distribution generalization, they can strongly impair out-of-distribution generalization by learning spurious features and correlations. Using a controlled environment, the Synbol benchmark generator (Lacoste et al., 2020), we demonstrate that this lack of out-of-distribution generalization mainly occurs in the linear classifier.

Submitted 20 June, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

arXiv:2303.02075 [pdf, ps, other]

Adaptive Interventions for Global Health: A Case Study of Malaria

Authors: África Periáñez, Andrew Trister, Madhav Nekkar, Ana Fernández del Río, Pedro L. Alonso

Abstract: Malaria can be prevented, diagnosed, and treated; however, every year, there are more than 200 million cases and 200,000 preventable deaths. Malaria remains a pressing public health concern in low- and middle-income countries, especially in sub-Saharan Africa. We describe how, by means of mobile health applications, machine-learning-based adaptive interventions can strengthen malaria surveillance and treatment adherence, increase testing, measure provider skills and quality of care, improve public health by supporting front-line workers and patients (e.g., by capacity building and encouraging behavioral changes, like using bed nets), reduce test stockouts in pharmacies and clinics, and inform public health for policy intervention.

Submitted 17 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

Comments: Accepted for ICLR 2023 Workshop on Machine Learning and Global Health

arXiv:2303.01954 [pdf, other]

Synthetic Data Generator for Adaptive Interventions in Global Health

Authors: Aditya Rastogi, Juan Francisco Garamendi, Ana Fernández del Río, Anna Guitart, Moiz Hassan Khan, Dexian Tang, África Periáñez

Abstract: Artificial Intelligence and digital health have the potential to transform global health. However, having access to representative data to test and validate algorithms in realistic production environments is essential. We introduce HealthSyn, an open-source synthetic data generator of user behavior for testing reinforcement learning algorithms in the context of mobile health interventions. The generator utilizes Markov processes to generate diverse user actions, with individual user behavioral patterns that can change in reaction to personalized interventions (i.e., reminders, recommendations, and incentives). These actions are translated into actual logs using an ML-purposed data schema specific to the mobile health application functionality included with HealthKit, an open-source SDK. The logs can be fed to pipelines to obtain user metrics. The generated data, which is based on real-world behaviors and simulation techniques, can be used to develop, test, and evaluate both ML algorithms in research and end-to-end operational RL-based intervention delivery frameworks.

Submitted 27 April, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

arXiv:2211.06525 [pdf, other]

Actionable Recourse via GANs for Mobile Health

Authors: Jennifer Chien, Anna Guitart, Ana Fernandez del Rio, Africa Perianez, Lauren Bellhouse

Abstract: Mobile health apps provide a unique means of collecting data that can be used to deliver adaptive interventions. The predicted outcomes considerably influence the selection of such interventions. Recourse via counterfactuals provides tangible mechanisms to modify user predictions. By identifying plausible actions that increase the likelihood of a desired prediction, stakeholders are afforded agency over their predictions. Furthermore, recourse mechanisms enable counterfactual reasoning that can help provide insights into candidates for causal interventional features. We demonstrate the feasibility of GAN-generated recourse for mobile health applications on ensemble-survival-analysis-based prediction of medium-term engagement in the Safe Delivery App, a digital training tool for skilled birth attendants.

Submitted 11 November, 2022; originally announced November 2022.

Comments: 16 pages, formatted for extended abstract requirements

arXiv:2206.08178 [pdf, other]

doi 10.1145/3534678.3542681

User Engagement in Mobile Health Applications

Authors: Babaniyi Yusuf Olaniyi, Ana Fernández del Río, África Periáñez, Lauren Bellhouse

Abstract: Mobile health apps are revolutionizing the healthcare ecosystem by improving communication, efficiency, and quality of service. In low- and middle-income countries, they also play a unique role as a source of information about health outcomes and behaviors of patients and healthcare workers, while providing a suitable channel to deliver both personalized and collective policy interventions. We propose a framework to study user engagement with mobile health, focusing on healthcare workers and digital health apps designed to support them in resource-poor settings. The behavioral logs produced by these apps can be transformed into daily time series characterizing each user's activity. We use probabilistic and survival analysis to build multiple personalized measures of meaningful engagement, which could serve to tailor content and digital interventions suiting each health worker's specific needs. Special attention is given to the problem of detecting churn, understood as a marker of complete disengagement. We discuss the application of our methods to the Indian and Ethiopian users of the Safe Delivery App, a capacity-building tool for skilled birth attendants. This work represents an important step towards a full characterization of user engagement in mobile health applications, which can significantly enhance the abilities of health workers and, ultimately, save lives.

Submitted 23 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: Accepted at KDD 2022 Health Day, will appear in the KDD 2022 proceedings as a full paper

arXiv:2111.01786 [pdf, other]

A Recommendation System to Enhance Midwives' Capacities in Low-Income Countries

Authors: Anna Guitart, Afsaneh Heydari, Eniola Olaleye, Jelena Ljubicic, Ana Fernández del Río, África Periáñez, Lauren Bellhouse

Abstract: Maternal and child mortality is a public health problem that disproportionately affects low- and middle-income countries. Every day, 800 women and 6,700 newborns die from complications related to pregnancy or childbirth. And for every maternal death, about 20 women suffer serious birth injuries. However, nearly all of these deaths and negative health outcomes are preventable. Midwives are key to reversing this situation, and thus it is essential to strengthen their capacities and the quality of their education. This is the aim of the Safe Delivery App, a digital job aid and learning tool to enhance the knowledge, confidence and skills of health practitioners. Here, we use the behavioral logs of the App to implement a recommendation system that presents each midwife with suitable content to continue gaining expertise. We focus on predicting the click-through rate, the probability that a given user will click on a recommended piece of content. We evaluate four deep learning models and show that all of them produce highly accurate predictions.

Submitted 4 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

Journal ref: NeurIPS Machine Learning in Public Health workshop (MLPH 2021), 2021

arXiv:2109.11745 [pdf, other]

DACT-BERT: Differentiable Adaptive Computation Time for an Efficient BERT Inference

Authors: Cristóbal Eyzaguirre, Felipe del Río, Vladimir Araujo, Álvaro Soto

Abstract: Large-scale pre-trained language models have shown remarkable results in diverse NLP applications. Unfortunately, these performance gains have been accompanied by a significant increase in computation time and model size, stressing the need to develop new or complementary strategies to increase the efficiency of these models. In this paper we propose DACT-BERT, a differentiable adaptive computation time strategy for BERT-like models. DACT-BERT adds an adaptive computational mechanism to BERT's regular processing pipeline, which controls the number of Transformer blocks that need to be executed at inference time. By doing this, the model learns to combine the most appropriate intermediate representations for the task at hand. Our experiments demonstrate that our approach, when compared to the baselines, excels on a reduced computational regime and is competitive in other less restrictive ones.

Submitted 24 September, 2021; originally announced September 2021.

arXiv:2107.02480 [pdf, other]

doi 10.1016/j.artmed.2023.102511

Midwifery Learning and Forecasting: Predicting Content Demand with User-Generated Logs

Authors: Anna Guitart, Ana Fernández del Río, África Periáñez, Lauren Bellhouse

Abstract: Every day, 800 women and 6,700 newborns die from complications related to pregnancy or childbirth. A well-trained midwife can prevent most of these maternal and newborn deaths. Data science models together with logs generated by users of online learning applications for midwives can help to improve their learning competencies. The goal is to use these rich behavioral data to push digital learning towards personalized content and to provide an adaptive learning journey. In this work, we evaluate various forecasting methods to determine the interest of future users in the different kinds of content available in the app, broken down by profession and region.

Submitted 17 August, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

Journal ref: Artificial Intelligence in Medicine Volume 138, April 2023, 102511

arXiv:2010.05630 [pdf]

The Wireless Train Communication Network: Roll2Rail vision

Authors: Juan Moreno García-Loygorri, Javier Goikoetxea, Eneko Echeverría, Aitor Arriola, Iñaki Val, Stephan Sand, Paul Unterhuber, Francisco del Río

Abstract: This paper explains the main results obtained from the research carried out in work package 2 (WP2) of the Roll2Rail (R2R) project. This project aims to develop key technologies and to remove already identified blocking points for radical innovation in the field of railway vehicles, to increase their operational reliability and to reduce life-cycle costs. This project started in May 2015 and has been funded by the Horizon 2020 program of the European Commission. The goal for WP2 is to research both technologies and architectures to develop a new wireless Train Communication Network (TCN) within the IEC 61375 standard series. This TCN is today entirely wired and is used for Train Control and Monitoring System (TCMS) functions (some of them safety-related), operator-oriented services and customer-oriented services. This paradigm shift from wired to wireless implies, among other benefits, a significant reduction of life-cycle costs due to the removal of cables and a simplification of the train coupling procedure.

Submitted 5 October, 2020; originally announced October 2020.

Journal ref: IEEE Vehicular Technology Magazine, 2018

arXiv:2009.04426 [pdf, other]

CuratorNet: Visually-aware Recommendation of Art Images

Authors: Pablo Messina, Manuel Cartagena, Patricio Cerda-Mardini, Felipe del Rio, Denis Parra

Abstract: Although there are several visually-aware recommendation models in domains like fashion or even movies, the art domain lacks the same level of research attention, despite the recent growth of the online artwork market. To reduce this gap, in this article we introduce CuratorNet, a neural network architecture for visually-aware recommendation of art images. CuratorNet is designed at the core with the goal of maximizing generalization: the network has a fixed set of parameters that only need to be trained once, and thereafter the model is able to generalize to new users or items never seen before, without further training. This is achieved by leveraging visual content: items are mapped to item vectors through visual embeddings, and users are mapped to user vectors by aggregating the visual content of items they have consumed. Besides the model architecture, we also introduce novel triplet sampling strategies to build a training set for rank learning in the art domain, resulting in more effective learning than naive random sampling. With an evaluation over a real-world dataset of physical paintings, we show that CuratorNet achieves the best performance among several baselines, including the state-of-the-art model VBPR. CuratorNet is motivated and evaluated in the art domain, but its architecture and training scheme could be adapted to recommend images in other areas.

Submitted 30 September, 2020; v1 submitted 9 September, 2020; originally announced September 2020.

arXiv:2006.10256 [pdf, other]

doi 10.1038/s41586-020-2649-2

Array Programming with NumPy

Authors: Charles R. Harris, K. Jarrod Millman, Stéfan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fernández del Río, Mark Wiebe, Pearu Peterson, Pierre Gérard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, et al. (1 additional author not shown)

Abstract: Array programming provides a powerful, compact, expressive syntax for accessing, manipulating, and operating on data in vectors, matrices, and higher-dimensional arrays. NumPy is the primary array programming library for the Python language. It plays an essential role in research analysis pipelines in fields as diverse as physics, chemistry, astronomy, geoscience, biology, psychology, material science, engineering, finance, and economics. For example, in astronomy, NumPy was an important part of the software stack used in the discovery of gravitational waves and the first imaging of a black hole. Here we show how a few fundamental array concepts lead to a simple and powerful programming paradigm for organizing, exploring, and analyzing scientific data. NumPy is the foundation upon which the entire scientific Python universe is constructed. It is so pervasive that several projects, targeting audiences with specialized needs, have developed their own NumPy-like interfaces and array objects. Because of its central position in the ecosystem, NumPy increasingly plays the role of an interoperability layer between these new array computation libraries.

Submitted 17 June, 2020; originally announced June 2020.

Journal ref: Nature 585, 357 (2020)

arXiv:2003.10287 [pdf, ps, other]

doi 10.3233/IDA-194940

A Time Series Approach To Player Churn and Conversion in Videogames

Authors: Ana Fernández del Río, Anna Guitart, África Periáñez

Abstract: Players of a free-to-play game are divided into three main groups: non-paying active users, paying active users and inactive users. A State Space time series approach is then used to model the daily conversion rates between the different groups, i.e., the probability of transitioning from one group to another. This allows, not only for predictions on how these rates are to evolve, but also for a deeper understanding of the impact that in-game planning and calendar effects have. It is also used in this work for the detection of marketing and promotion campaigns about which no information is available. In particular, two different State Space formulations are considered and compared: an Autoregressive Integrated Moving Average process and an Unobserved Components approach, in both cases with a linear regression to explanatory variables. Both yield very close estimations for covariate parameters, producing forecasts with similar performances for most transition rates. While the Unobserved Components approach is more robust and needs less human intervention in regards to model definition, it produces significantly worse forecasts for non-paying user abandonment probability. More critically, it also fails to detect a plausible marketing and promotion campaign scenario.

Submitted 13 March, 2020; originally announced March 2020.

Comments: Accepted for publication in IOS Press Intelligent Data Analysis

Journal ref: Intelligent Data Analysis, vol. 25, no. 1, pp. 177-203, 2021

arXiv:1907.03947 [pdf, other]

Understanding Player Engagement and In-Game Purchasing Behavior with Ensemble Learning

Authors: Anna Guitart, Ana Fernández del Río, África Periáñez

Abstract: As video games attract more and more players, the major challenge for game studios is to retain them. We present a deep behavioral analysis of churn (game abandonment) and what we called "purchase churn" (the transition from paying to non-paying user). A series of churning behavior profiles are identified, which allows a classification of churners in terms of whether they eventually return to the game (false churners)--or start purchasing again (false purchase churners)--and their subsequent behavior. The impact of excluding some or all of these churners from the training sample is then explored in several churn and purchase churn prediction models. Our results suggest that discarding certain combinations of "zombies" (players whose activity is extremely sporadic) and false churners has a significant positive impact in all models considered.

Submitted 8 July, 2019; originally announced July 2019.

Comments: Churn Prediction, Ensemble Methods, Survival Analysis, Online Games, User Behavior

Journal ref: Proceedings of GAME-ON'2019 AI and Simulation in Games, September 2019, Breda, the Netherlands. ISBN 978-94-92859-08-2

arXiv:1907.03870 [pdf, other]

doi 10.1109/CIG.2019.8848074

Profiling Players with Engagement Predictions

Authors: Ana Fernández del Río, Pei Pei Chen, África Periáñez

Abstract: The possibility of using player engagement predictions to profile high-spending video game users is explored. In particular, individual-player survival curves in terms of days after first login, game level reached and accumulated playtime are used to classify players into different groups. Lifetime value predictions for each player, generated using a deep learning method based on long short-term memory, are also included in the analysis, and the relations between all these variables are thoroughly investigated. Our results suggest this constitutes a promising approach to user profiling.

Submitted 8 July, 2019; originally announced July 2019.

Comments: Accepted for IEEE Conference on Games (CoG) 2019

Journal ref: 2019 IEEE Conference on Games (CoG)

arXiv:1906.10320 [pdf, other]

doi 10.1145/3337722.3341855

From Non-Paying to Premium: Predicting User Conversion in Video Games with Ensemble Learning

Authors: Anna Guitart, Shi Hui Tan, Ana Fernández del Río, Pei Pei Chen, África Periáñez

Abstract: Retaining premium players is key to the success of free-to-play games, but most of them do not start purchasing right after joining the game. By exploiting the exceptionally rich datasets recorded by modern video games--which provide information on the individual behavior of each and every player--survival analysis techniques can be used to predict what players are more likely to become paying (or even premium) users and when, both in terms of time and game level, the conversion will take place. Here we show that a traditional semi-parametric model (Cox regression), a random survival forest (RSF) technique and a method based on conditional inference survival ensembles all yield very promising results. However, the last approach has the advantage of being able to correct the inherent bias in RSF models by dividing the procedure into two steps: first selecting the best predictor to perform the splitting and then the best split point for that covariate. The proposed conditional inference survival ensembles method could be readily used in operational environments for early identification of premium players and the parts of the game that may prompt them to become paying users. Such knowledge would allow developers to induce their conversion and, more generally, to better understand the needs of their players and provide them with a personalized experience, thereby increasing their engagement and paving the way to higher monetization.

Submitted 30 June, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

Comments: social games, conversion prediction, ensemble methods, survival analysis, online games, user behavior

Journal ref: ACM Foundations of Digital Games (FDG'2019), 97, 9, 2019

arXiv:1811.12799 [pdf, other]

doi 10.1109/BigData.2018.8622151

Customer Lifetime Value in Video Games Using Deep Learning and Parametric Models

Authors: Pei Pei Chen, Anna Guitart, Ana Fernández del Río, África Periáñez

Abstract: Nowadays, video game developers record every virtual action performed by their players. As each player can remain in the game for years, this results in an exceptionally rich dataset that can be used to understand and predict player behavior. In particular, this information may serve to identify the most valuable players and foresee the amount of money they will spend in in-app purchases during their lifetime. This is crucial in free-to-play games, where up to 50% of the revenue is generated by just around 2% of the players, the so-called whales. To address this challenge, we explore how deep neural networks can be used to predict customer lifetime value in video games, and compare their performance to parametric models such as Pareto/NBD. Our results suggest that convolutional neural network structures are the most efficient in predicting the economic value of individual players. They not only perform better in terms of accuracy, but also scale to big data and significantly reduce computational time, as they can work directly with raw sequential data and thus do not require any feature engineering process. This becomes important when datasets are very large, as is often the case with video game logs. Moreover, convolutional neural networks are particularly well suited to identify potential whales. Such an early identification is of paramount importance for business purposes, as it would allow developers to implement in-game actions aimed at retaining big spenders and maximizing their lifetime, which would ultimately translate into increased revenue.

Submitted 28 November, 2018; originally announced November 2018.

Journal ref: IEEE International Conference on Big Data (Big Data), p. 2134-2140, 2018

arXiv:1807.09870 [pdf, other]

Do Better ImageNet Models Transfer Better... for Image Recommendation?

Authors: Felipe del Rio, Pablo Messina, Vicente Dominguez, Denis Parra

Abstract: Visual embeddings from Convolutional Neural Networks (CNN) trained on the ImageNet dataset for the ILSVRC challenge have shown consistently good performance for transfer learning and are widely used in several tasks, including image recommendation. However, some important questions have not yet been answered in order to use these embeddings for a larger scope of recommendation domains: a) Are CNNs that perform better on ImageNet also better for transfer learning in content-based image recommendation? b) Does fine-tuning help to improve performance? and c) Which is the best way to perform the fine-tuning? In this paper we compare several CNN models pre-trained with ImageNet to evaluate their transfer learning performance to an artwork image recommendation task. Our results indicate that models with better performance in the ImageNet challenge do not always imply better transfer learning for recommendation tasks (e.g. NASNet vs. ResNet). Our results also show that fine-tuning can be helpful even with a small dataset, but not every fine-tuning works. Our results can inform other researchers and practitioners on how to train their CNNs for better transfer learning towards image recommendation systems.

Submitted 25 September, 2018; v1 submitted 25 July, 2018; originally announced July 2018.

Comments: Submitted to KTL Workshop co-located at RecSys

arXiv:1204.2240 [pdf, ps, other]

doi 10.1002/cplx.21397

Interdependent binary choices under social influence: phase diagram for homogeneous unbiased populations

Authors: Ana Fernández del Río, Elka Korutcheva, Javier de la Rubia

Abstract: Coupled Ising models are studied in a discrete choice theory framework, where they can be understood to represent interdependent choice making processes for homogeneous populations under social influence. Two different coupling schemes are considered. The nonlocal or group interdependence model is used to study two interrelated groups making the same binary choice. The local or individual interdependence model represents a single group where agents make two binary choices which depend on each other. For both models, phase diagrams, and their implications in socioeconomic contexts, are described and compared in the absence of private deterministic utilities (zero opinion fields).

Submitted 10 April, 2012; originally announced April 2012.

Comments: 17 pages, 3 figures. This is the pre-peer reviewed version of the following article: Ana Fernández del Río, Elka Korutcheva and Javier de la Rubia, Interdependent binary choices under social influence, Wiley's Complexity, 2012; which has been published in final form at http://onlinelibrary.wiley.com/doi/10.1002/cplx.21397/abstract

Journal ref: Complexity, Wiley, 2012

arXiv:1104.4887 [pdf, ps, other]

Coupled Ising models and interdependent discrete choices under social influence in homogeneous populations

Authors: Ana Fernández del Río

Abstract: The use of statistical physics to study problems of social sciences is motivated and its current state of the art briefly reviewed, in particular for the case of discrete choice making. The coupling of two binary choices is studied in some detail, using an Ising model for each of the decision variables (the opinion or choice moments or spins, socioeconomic equivalents to the magnetic moments or spins). Toy models for two different types of coupling are studied analytically and numerically in the mean field (infinite range) approximation. This is equivalent to considering a social influence effect proportional to the fraction of adopters or average magnetisation. In the nonlocal case, the two spin variables are coupled through a Weiss mean field type term. In a socioeconomic context, this can be useful when studying individuals of two different groups, making the same decision under social influence of their own group, when their outcome is affected by the fraction of adopters of the other group. In the local case, the two spin variables are coupled only through each individual. This amounts to considering individuals of a single group each making two different choices which affect each other. In both cases, only constant (intra- and inter-) couplings and external fields are considered, i.e., only completely homogeneous populations. Most of the results presented are for the zero field case, i.e., no externalities or private utilities. Phase diagrams and their interpretation in a socioeconomic context are discussed and compared to the uncoupled case.
The two systems share many common features, including the existence of both first and second order phase transitions, metastability and hysteresis. To conclude, some general remarks pointing out the limitations of these models and suggesting further improvements are given.

Submitted 26 April, 2011; originally announced April 2011.

Comments: Master thesis, 100 pages, 39 figures

Showing 1–20 of 20 results for author: del Rio, F