Skip to main content

Showing 1–19 of 19 results for author: Daniel, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.13896  [pdf, other

    cs.LG q-fin.ST

    Comparative Evaluation of Anomaly Detection Methods for Fraud Detection in Online Credit Card Payments

    Authors: Hugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Liên Doan, Fabrice Daniel

    Abstract: This study explores the application of anomaly detection (AD) methods in imbalanced learning tasks, focusing on fraud detection using real online credit card payment data. We assess the performance of several recent AD methods and compare their effectiveness against standard supervised learning methods. Offering evidence of distribution shift within our dataset, we analyze its impact on the tested… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted at ICICT 2024

  2. Benchmarking Robustness of Deep Reinforcement Learning approaches to Online Portfolio Management

    Authors: Marc Velay, Bich-Liên Doan, Arpad Rimmel, Fabrice Popineau, Fabrice Daniel

    Abstract: Deep Reinforcement Learning approaches to Online Portfolio Selection have grown in popularity in recent years. The sensitive nature of training Reinforcement Learning agents implies a need for extensive efforts in market representation, behavior objectives, and training processes, which have often been lacking in previous works. We propose a training and evaluation process to assess the performanc… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Submitted to INISTA 2023

  3. arXiv:2206.13152  [pdf, ps, other

    cs.LG

    Evaluating resampling methods on a real-life highly imbalanced online credit card payments dataset

    Authors: François de la Bourdonnaye, Fabrice Daniel

    Abstract: Various problems of any credit card fraud detection based on machine learning come from the imbalanced aspect of transaction datasets. Indeed, the number of frauds compared to the number of regular transactions is tiny and has been shown to damage learning performances, e.g., at worst, the algorithm can learn to classify all the transactions as regular. Resampling methods and cost-sensitive approa… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  4. TracInAD: Measuring Influence for Anomaly Detection

    Authors: Hugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Liên Doan, Fabrice Daniel

    Abstract: As with many other tasks, neural networks prove very effective for anomaly detection purposes. However, very few deep-learning models are suited for detecting anomalies on tabular datasets. This paper proposes a novel methodology to flag anomalies based on TracIn, an influence measure initially introduced for explicability purposes. The proposed methods can serve to augment any unsupervised deep a… ▽ More

    Submitted 30 January, 2024; v1 submitted 3 May, 2022; originally announced May 2022.

    Journal ref: 2022 International Joint Conference on Neural Networks (IJCNN)

  5. arXiv:2112.12024  [pdf, other

    cs.LG

    Evaluating categorical encoding methods on a real credit card fraud detection database

    Authors: François de la Bourdonnaye, Fabrice Daniel

    Abstract: Correctly dealing with categorical data in a supervised learning context is still a major issue. Furthermore, though some machine learning methods embody builtin methods to deal with categorical features, it is unclear whether they bring some improvements and how do they compare with usual categorical encoding methods. In this paper, we describe several well-known categorical encoding methods that… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  6. arXiv:2104.07440  [pdf, ps, other

    cs.LG

    Bayesian and Dempster-Shafer models for combining multiple sources of evidence in a fraud detection system

    Authors: Fabrice Daniel

    Abstract: Combining evidence from different sources can be achieved with Bayesian or Dempster-Shafer methods. The first requires an estimate of the priors and likelihoods while the second only needs an estimate of the posterior probabilities and enables reasoning with uncertain information due to imprecision of the sources and with the degree of conflict between them. This paper describes the two methods an… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  7. arXiv:2012.03743  [pdf, other

    cs.CY cs.AI cs.HC cs.SE

    Bringing Cognitive Augmentation to Web Browsing Accessibility

    Authors: Alessandro Pina, Marcos Baez, Florian Daniel

    Abstract: In this paper we explore the opportunities brought by cognitive augmentation to provide a more natural and accessible web browsing experience. We explore these opportunities through \textit{conversational web browsing}, an emerging interaction paradigm for the Web that enables blind and visually impaired users (BVIP), as well as regular users, to access the contents and features of websites throug… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  8. Chatbot integration in few patterns

    Authors: Marcos Baez, Florian Daniel, Fabio Casati, Boualem Benatallah

    Abstract: Chatbots are software agents that are able to interact with humans in natural language. Their intuitive interaction paradigm is expected to significantly reshape the software landscape of tomorrow, while already today chatbots are invading a multitude of scenarios and contexts. This article takes a developer's perspective, identifies a set of architectural patterns that capture different chatbot i… ▽ More

    Submitted 18 September, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: prior version was an incomplete early draft; current version includes changes in references, appropriate acknowledgement; and minor revisions

  9. arXiv:2008.12097  [pdf, other

    cs.CY cs.AI

    Automatic Generation of Chatbots for Conversational Web Browsing

    Authors: Pietro Chittò, Marcos Baez, Florian Daniel, Boualem Benatallah

    Abstract: In this paper, we describe the foundations for generating a chatbot out of a website equipped with simple, bot-specific HTML annotations. The approach is part of what we call conversational web browsing, i.e., a dialog-based, natural language interaction with websites. The goal is to enable users to use content and functionality accessible through rendered UIs by "talking to websites" instead of b… ▽ More

    Submitted 21 October, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: typos corrected, metadata fixed

  10. arXiv:1907.03010  [pdf, other

    q-fin.ST cs.LG stat.ML

    Financial Time Series Data Processing for Machine Learning

    Authors: Fabrice Daniel

    Abstract: This article studies the financial time series data processing for machine learning. It introduces the most frequent scaling methods, then compares the resulting stationarity and preservation of useful information for trend forecasting. It proposes an empirical test based on the capability to learn simple data relationship with simple models. It also speaks about the data split method specific to… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

  11. arXiv:1808.00423  [pdf, other

    cs.LG cs.CL stat.ML

    Seq2Seq and Multi-Task Learning for joint intent and content extraction for domain specific interpreters

    Authors: Marc Velay, Fabrice Daniel

    Abstract: This study evaluates the performances of an LSTM network for detecting and extracting the intent and content of com- mands for a financial chatbot. It presents two techniques, sequence to sequence learning and Multi-Task Learning, which might improve on the previous task.

    Submitted 1 August, 2018; originally announced August 2018.

  12. arXiv:1808.00418  [pdf, other

    cs.LG stat.ML

    Stock Chart Pattern recognition with Deep Learning

    Authors: Marc Velay, Fabrice Daniel

    Abstract: This study evaluates the performances of CNN and LSTM for recognizing common charts patterns in a stock historical data. It presents two common patterns, the method used to build the training set, the neural networks architectures and the accuracies obtained.

    Submitted 1 August, 2018; originally announced August 2018.

  13. arXiv:1806.09533  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Using NLP on news headlines to predict index trends

    Authors: Marc Velay, Fabrice Daniel

    Abstract: This paper attempts to provide a state of the art in trend prediction using news headlines. We present the research done on predicting DJIA trends using Natural Language Processing. We will explain the different algorithms we have used as well as the various embedding techniques attempted. We rely on statistical and deep learning models in order to extract information from the corpuses.

    Submitted 22 June, 2018; originally announced June 2018.

  14. arXiv:1805.12346  [pdf, other

    cs.HC cs.AI

    Crowdsourcing for Reminiscence Chatbot Design

    Authors: Svetlana Nikitina, Florian Daniel, Marcos Baez, Fabio Casati

    Abstract: In this work-in-progress paper we discuss the challenges in identifying effective and scalable crowd-based strategies for designing content, conversation logic, and meaningful metrics for a reminiscence chatbot targeted at older adults. We formalize the problem and outline the main research questions that drive the research agenda in chatbot design for reminiscence and for relational agents for ol… ▽ More

    Submitted 31 May, 2018; originally announced May 2018.

  15. Quality Control in Crowdsourcing: A Survey of Quality Attributes, Assessment Techniques and Assurance Actions

    Authors: Florian Daniel, Pavel Kucherbaev, Cinzia Cappiello, Boualem Benatallah, Mohammad Allahbakhsh

    Abstract: Crowdsourcing enables one to leverage on the intelligence and wisdom of potentially large groups of individuals toward solving problems. Common problems approached with crowdsourcing are labeling images, translating or transcribing text, providing opinions or ideas, and similar - all tasks that computers are not good at or where they may even fail altogether. The introduction of humans into comput… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

    Comments: 40 pages main paper, 5 pages appendix

    Journal ref: ACM Comput. Surv. 51, 1, Article 7 (January 2018)

  16. arXiv:1709.03541  [pdf, other

    astro-ph.IM cs.IT

    Robust period estimation using mutual information for multi-band light curves in the synoptic survey era

    Authors: Pablo Huijse, Pablo A. Estevez, Francisco Forster, Scott F. Daniel, Andrew J. Connolly, Pavlos Protopapas, Rodrigo Carrasco, Jose C. Principe

    Abstract: The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a part… ▽ More

    Submitted 11 September, 2017; originally announced September 2017.

    Comments: Accepted for publication ApJ Supplement Series: Special Issue on Solar/Stellar Astronomy Big Data

  17. arXiv:1704.03610  [pdf, other

    cs.SE

    Blockchains for Business Process Management - Challenges and Opportunities

    Authors: Jan Mendling, Ingo Weber, Wil van der Aalst, Jan vom Brocke, Cristina Cabanillas, Florian Daniel, Soren Debois, Claudio Di Ciccio, Marlon Dumas, Schahram Dustdar, Avigdor Gal, Luciano Garcia-Banuelos, Guido Governatori, Richard Hull, Marcello La Rosa, Henrik Leopold, Frank Leymann, Jan Recker, Manfred Reichert, Hajo A. Reijers, Stefanie Rinderle-Ma, Andreas Rogge-Solti, Michael Rosemann, Stefan Schulte, Munindar P. Singh , et al. (7 additional authors not shown)

    Abstract: Blockchain technology promises a sizable potential for executing inter-organizational business processes without requiring a central party serving as a single point of trust (and failure). This paper analyzes its impact on business process management (BPM). We structure the discussion using two BPM frameworks, namely the six BPM core capabilities and the BPM lifecycle. This paper provides research… ▽ More

    Submitted 31 January, 2018; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: Preprint for ACM TMIS

  18. arXiv:1609.01070  [pdf, ps, other

    cs.HC

    Toward Crowdsourced User Studies for Software Evaluation

    Authors: Florian Daniel, Pavel Kucherbaev

    Abstract: This work-in-progress paper describes a vision, i.e., that of fast and reliable software user experience studies conducted with the help from the crowd. Commonly, user studies are controlled in-lab activities that require the instruction, monitoring, interviewing and compensation of a number of participants that are typically hard to recruit. The goal of this work is to study which user study meth… ▽ More

    Submitted 5 September, 2016; originally announced September 2016.

    Comments: Works-in-Progress paper of HCOMP 2016, Austin, Texas

  19. arXiv:1607.01752  [pdf, other

    cs.HC

    CrowdCafe - Mobile Crowdsourcing Platform

    Authors: Pavel Kucherbaev, Azad Abad, Stefano Tranquillini, Florian Daniel, Maurizio Marchese, Fabio Casati

    Abstract: In this paper we present a mobile crowdsourcing platform CrowdCafe, where people can perform microtasks using their smartphones while they ride a bus, travel by train, stand in a queue or wait for an appointment. These microtasks are executed in exchange for rewards provided by local stores, such as coffee, desserts and bus tickets. We present the concept, the implementation and the evaluation by… ▽ More

    Submitted 6 July, 2016; originally announced July 2016.

    Comments: Was published before as a part of the phd thesis by Pavel Kucherbaev http://eprints-phd.biblio.unitn.it/1716/