Showing 1–48 of 48 results for author: Rosa, M

Searching in archive cs.
  1. arXiv:2403.18173  [pdf, ps, other]

    cs.HC cs.IR

    LLMs in HCI Data Work: Bridging the Gap Between Information Retrieval and Responsible Research Practices

    Authors: Neda Taghizadeh Serajeh, Iman Mohammadi, Vittorio Fuccella, Mattia De Rosa

    Abstract: Efficient and accurate information extraction from scientific papers is significant in the rapidly developing human-computer interaction research in the literature review process. Our paper introduces and analyses a new information retrieval system using state-of-the-art Large Language Models (LLMs) in combination with structured text analysis techniques to extract experimental data from HCI liter…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 5 pages, CHI2024 Workshop on LLMs as Research Tools: Applications and Evaluations in HCI Data Work

  2. The Hitchhiker's Guide to Malicious Third-Party Dependencies

    Authors: Piergiorgio Ladisa, Merve Sahin, Serena Elisa Ponta, Marco Rosa, Matias Martinez, Olivier Barais

    Abstract: The increasing popularity of certain programming languages has spurred the creation of ecosystem-specific package repositories and package managers. Such repositories (e.g., npm, PyPI) serve as public databases that users can query to retrieve packages for various functionalities, whereas package managers automatically handle dependency resolution and package installation on the client side. These…

    Submitted 6 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Proceedings of the 2023 Workshop on Software Supply Chain Offensive Research and Ecosystem Defenses (SCORED '23), November 30, 2023, Copenhagen, Denmark

  3. arXiv:2303.03572  [pdf, other]

    cs.LG cs.AI stat.ME

    Learning When to Treat Business Processes: Prescriptive Process Monitoring with Causal Inference and Reinforcement Learning

    Authors: Zahra Dasht Bozorgi, Marlon Dumas, Marcello La Rosa, Artem Polyvyanyy, Mahmoud Shoush, Irene Teinemaa

    Abstract: Increasing the success rate of a process, i.e. the percentage of cases that end in a positive outcome, is a recurrent process improvement goal. At runtime, there are often certain actions (a.k.a. treatments) that workers may execute to lift the probability that a case ends in a positive outcome. For example, in a loan origination process, a possible treatment is to issue multiple loan offers to in…

    Submitted 6 March, 2023; originally announced March 2023.

  4. Web-based Database Courses E-Learning Application

    Authors: Aaron Paul M. Dela Rosa, Luigi Miguel M. Villanueva, John Mardy R. San Miguel, John Emmanuel B. Quinto

    Abstract: This study was focused on the development of a web e-learning application for the database courses taken by Information Technology (IT) students at the College of Information and Communications Technology (CICT) of Bulacan State University (BulSU). The research methodology used in this project was the cross-sectional developmental approach. The Agile Software Development methodology was followed p…

    Submitted 23 November, 2022; originally announced December 2022.

    Journal ref: International Journal of Computing Sciences Research (ISSN print: 2546-0552; ISSN online: 2546-115X) Vol. 6, August 20, 2022

  5. Web-based Management Information System of Cases Filed with the National Labor Relations Commission

    Authors: Aaron Paul M. Dela Rosa

    Abstract: This study was developed to describe the daily operations and encountered problems of the National Labor Relations Commission Regional Arbitration Branch No. IV (NLRC RAB IV) through conducted observations and interviews. These problems were addressed and analyzed to be the features of the developed web-based management information system (MIS) for cases. The research methodology utilized in this…

    Submitted 23 November, 2022; originally announced November 2022.

    Journal ref: International Journal of Computing Sciences Research (ISSN print: 2546-0552; ISSN online: 2546-115X), Vol. 6, August 13, 2022

  6. Effectiveness of an Online Course in Programming in a State University in the Philippines

    Authors: Aaron Paul M. Dela Rosa

    Abstract: Online courses, as a pedagogical approach to teaching, boomed during this Coronavirus Disease 2019 pandemic era. Universities shifted from traditional face to face classes to online distance learning due to the cause of the pandemic. This study aimed to determine how effective an online course is in learning a programming course. The study utilized mixed method research applied through a validated…

    Submitted 23 November, 2022; originally announced November 2022.

    Journal ref: International Journal of Computing Sciences Research (ISSN print: 2546-0552; ISSN online: 2546-115X) Vol. 6, November 23, 2022

  7. Monitoring Fog Computing: a Review, Taxonomy and Open Challenges

    Authors: Breno Costa, Joao Bachiega Jr, Leonardo Reboucas de Carvalho, Michel Rosa, Aleteia Araujo

    Abstract: Fog computing is a distributed paradigm that provides computational resources in the users' vicinity. Fog orchestration is a set of functionalities that coordinate the dynamic infrastructure and manage the services to guarantee the Service Level Agreements. Monitoring is an orchestration functionality of prime importance. It is the basis for resource management actions, collecting status of resour…

    Submitted 16 June, 2022; v1 submitted 13 May, 2022; originally announced June 2022.

  8. arXiv:2206.02873  [pdf, other]

    cs.IR cs.CL cs.PF

    No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval

    Authors: Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Recent work has shown that small distilled language models are strong competitors to models that are orders of magnitude larger and slower in a wide range of information retrieval tasks. This has made distilled and dense models, due to latency constraints, the go-to choice for deployment in real-world retrieval applications. In this work, we question this practice by showing that the number of par…

    Submitted 12 December, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

  9. arXiv:2205.15172  [pdf, ps, other]

    cs.CL

    Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

    Authors: Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Recent work has shown that language models scaled to billions of parameters, such as GPT-3, perform remarkably well in zero-shot and few-shot scenarios. In this work, we experiment with zero-shot models in the legal case entailment task of the COLIEE 2022 competition. Our experiments show that scaling the number of parameters in a language model improves the F1 score of our previous zero-shot resu…

    Submitted 30 May, 2022; originally announced May 2022.

  10. arXiv:2203.14079  [pdf, other]

    cs.AI

    Generalization in Automated Process Discovery: A Framework based on Event Log Patterns

    Authors: Daniel Reißner, Abel Armas-Cervantes, Marcello La Rosa

    Abstract: The importance of quality measures in process mining has increased. One of the key quality aspects, generalization, is concerned with measuring the degree of overfitting of a process model w.r.t. an event log, since the recorded behavior is just an example of the true behavior of the underlying business process. Existing generalization measures exhibit several shortcomings that severely hinder the…

    Submitted 26 March, 2022; originally announced March 2022.

  11. To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment

    Authors: Guilherme Moraes Rosa, Ruan Chaves Rodrigues, Roberto de Alencar Lotufo, Rodrigo Nogueira

    Abstract: There has been mounting evidence that pretrained language models fine-tuned on large and diverse supervised datasets can transfer well to a variety of out-of-domain tasks. In this work, we investigate this transfer ability to the legal domain. For that, we participated in the legal case entailment task of COLIEE 2021, in which we use such models with no adaptations to the target domain. Our submis…

    Submitted 7 February, 2022; originally announced February 2022.

  12. arXiv:2201.12855  [pdf, ps, other]

    cs.AI cs.SE

    AI-Augmented Business Process Management Systems: A Research Manifesto

    Authors: Marlon Dumas, Fabiana Fournier, Lior Limonad, Andrea Marrella, Marco Montali, Jana-Rebecca Rehse, Rafael Accorsi, Diego Calvanese, Giuseppe De Giacomo, Dirk Fahland, Avigdor Gal, Marcello La Rosa, Hagen Völzer, Ingo Weber

    Abstract: AI-Augmented Business Process Management Systems (ABPMSs) are an emerging class of process-aware information systems, empowered by trustworthy AI technology. An ABPMS enhances the execution of business processes with the aim of making these processes more adaptable, proactive, explainable, and context-sensitive. This manifesto presents a vision for ABPMSs and discusses research challenges that nee…

    Submitted 4 November, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: 19 pages, 1 figure

    Journal ref: ACM Transactions on Management Information Systems, 31 January 2023 Volume 14, Issue 1, Article No.: 11, pp 1-19

  13. Packaging research artefacts with RO-Crate

    Authors: Stian Soiland-Reyes, Peter Sefton, Mercè Crosas, Leyla Jael Castro, Frederik Coppens, José M. Fernández, Daniel Garijo, Björn Grüning, Marco La Rosa, Simone Leo, Eoghan Ó Carragáin, Marc Portier, Ana Trisovic, RO-Crate Community, Paul Groth, Carole Goble

    Abstract: An increasing number of researchers support reproducibility by including pointers to and descriptions of datasets, software and methods in their publications. However, scientific articles may be ambiguous, incomplete and difficult to process by automated systems. In this paper we introduce RO-Crate, an open, community-driven, and lightweight approach to packaging research artefacts along with thei…

    Submitted 6 December, 2021; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: 44 pages. Accepted for Data Science

    ACM Class: H.1.1; H.3.2

    Journal ref: Data Science 2022

  14. arXiv:2106.15398  [pdf, other]

    cs.AI cs.FL

    Automated Repair of Process Models with Non-Local Constraints Using State-Based Region Theory

    Authors: Anna Kalenkova, Josep Carmona, Artem Polyvyanyy, Marcello La Rosa

    Abstract: State-of-the-art process discovery methods construct free-choice process models from event logs. Consequently, the constructed models do not take into account indirect dependencies between events. Whenever the input behaviour is not free-choice, these methods fail to provide a precise model. In this paper, we propose a novel approach for enhancing free-choice process models by adding non-free-choi…

    Submitted 13 December, 2021; v1 submitted 26 June, 2021; originally announced June 2021.

    Journal ref: Fundamenta Informaticae, Volume 183, Issues 3-4: Petri Nets 2020 (December 23, 2021) fi:7634

  15. arXiv:2106.13446  [pdf, other]

    cs.SE

    Discovering executable routine specifications from user interaction logs

    Authors: Volodymyr Leno, Adriano Augusto, Marlon Dumas, Marcello La Rosa, Fabrizio Maria Maggi, Artem Polyvyanyy

    Abstract: Robotic Process Automation (RPA) is a technology to automate routine work such as copying data across applications or filling in document templates using data from multiple applications. RPA tools allow organizations to automate a wide range of routines. However, identifying and scoping routines that can be automated using RPA tools is time consuming. Manual identification of candidate routines vi…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 41 pages, 6 figures, 10 tables. arXiv admin note: text overlap with arXiv:2008.05782

  16. arXiv:2105.07111  [pdf, other]

    cs.LG cs.AI

    Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction

    Authors: Zahra Dasht Bozorgi, Irene Teinemaa, Marlon Dumas, Marcello La Rosa, Artem Polyvyanyy

    Abstract: Reducing cycle time is a recurrent concern in the field of business process management. Depending on the process, various interventions may be triggered to reduce the cycle time of a case, for example, using a faster shipping service in an order-to-delivery process or giving a phone call to a customer to obtain missing information rather than waiting passively. Each of these interventions comes wi…

    Submitted 14 September, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  17. arXiv:2105.06813  [pdf, other]

    cs.CL cs.IR cs.LG

    A cost-benefit analysis of cross-lingual transfer methods

    Authors: Guilherme Moraes Rosa, Luiz Henrique Bonifacio, Leandro Rodrigues de Souza, Roberto Lotufo, Rodrigo Nogueira

    Abstract: An effective method for cross-lingual transfer is to fine-tune a bilingual or multilingual model on a supervised dataset in one language and evaluating it on another language in a zero-shot manner. Translating examples at training time or inference time are also viable alternatives. However, there are costs associated with these methods that are rarely addressed in the literature. In this work, we…

    Submitted 14 December, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  18. arXiv:2105.06016  [pdf, other]

    cs.SE

    Automated Discovery of Process Models with True Concurrency and Inclusive Choices

    Authors: Adriano Augusto, Marlon Dumas, Marcello La Rosa

    Abstract: Enterprise information systems allow companies to maintain detailed records of their business process executions. These records can be extracted in the form of event logs, which capture the execution of activities across multiple instances of a business process. Event logs may be used to analyze business processes at a fine level of detail using process mining techniques. Among other things, proce…

    Submitted 12 May, 2021; originally announced May 2021.

  19. arXiv:2105.05686  [pdf, ps, other]

    cs.IR cs.CL cs.LG

    Yes, BM25 is a Strong Baseline for Legal Case Retrieval

    Authors: Guilherme Moraes Rosa, Ruan Chaves Rodrigues, Roberto Lotufo, Rodrigo Nogueira

    Abstract: We describe our single submission to task 1 of COLIEE 2021. Our vanilla BM25 got second place, well above the median of submissions. Code is available at https://github.com/neuralmind-ai/coliee.

    Submitted 25 October, 2021; v1 submitted 26 April, 2021; originally announced May 2021.

  20. arXiv:2104.03404  [pdf, other]

    cs.AI cs.MA cs.NE

    Bootstrapping of memetic from genetic evolution via inter-agent selection pressures

    Authors: Nicholas Guttenberg, Marek Rosa

    Abstract: We create an artificial system of agents (attention-based neural networks) which selectively exchange messages with each-other in order to study the emergence of memetic evolution and how memetic evolutionary pressures interact with genetic evolution of the network weights. We observe that the ability of agents to exert selection pressures on each-other is essential for memetic evolution to bootst…

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: 9 pages, 3 figures, submitted to ALife 2021

  21. Experimental Body-input Three-stage DC offset Calibration Scheme for Memristive Crossbar

    Authors: Charanraj Mohan, L. A. Camuñas-Mesa, Elisa Vianello, Carlo Reita, José M. de la Rosa, Teresa Serrano-Gotarredona, Bernabé Linares-Barranco

    Abstract: Reading several ReRAMs simultaneously in a neuromorphic circuit increases power consumption and limits scalability. Applying small inference read pulses is a vain attempt when offset voltages of the read-out circuit are decisively more. This paper presents an experimental validation of a three-stage calibration scheme to calibrate the DC offset voltage across the rows of the memristive crossbar. T…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 5 pages, 9 figures, conference paper published in ISCAS20

    ACM Class: B.7

  22. Implementation of binary stochastic STDP learning using chalcogenide-based memristive devices

    Authors: C. Mohan, L. A. Camuñas-Mesa, J. M. de la Rosa, T. Serrano-Gotarredona, B. Linares-Barranco

    Abstract: The emergence of nano-scale memristive devices encouraged many different research areas to exploit their use in multiple applications. One of the proposed applications was to implement synaptic connections in bio-inspired neuromorphic systems. Large-scale neuromorphic hardware platforms are being developed with increasing number of neurons and synapses, having a critical bottleneck in the online l…

    Submitted 1 March, 2021; originally announced March 2021.

    Journal ref: 2021 IEEE International Symposium on Circuits and Systems (ISCAS), 2021, pp. 1-5

  23. arXiv:2102.07298  [pdf, other]

    cs.LG cs.AI

    A Deep Adversarial Model for Suffix and Remaining Time Prediction of Event Sequences

    Authors: Farbod Taymouri, Marcello La Rosa, Sarah M. Erfani

    Abstract: Event suffix and remaining time prediction are sequence to sequence learning tasks. They have wide applications in different areas such as economics, digital health, business process management and IT infrastructure monitoring. Timestamped event sequences contain ordered events which carry at least two attributes: the event's label and its timestamp. Suffix and remaining time prediction are about…

    Submitted 14 February, 2021; originally announced February 2021.

  24. arXiv:2009.01561  [pdf, other]

    cs.LG stat.ML

    Process Mining Meets Causal Machine Learning: Discovering Causal Rules from Event Logs

    Authors: Zahra Dasht Bozorgi, Irene Teinemaa, Marlon Dumas, Marcello La Rosa, Artem Polyvyanyy

    Abstract: This paper proposes an approach to analyze an event log of a business process in order to generate case-level recommendations of treatments that maximize the probability of a given outcome. Users classify the attributes in the event log into controllable and non-controllable, where the former correspond to attributes that can be altered during an execution of the process (the possible treatments).…

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: 8 pages, 4 figures, conference

  25. arXiv:2008.05782  [pdf, other]

    cs.SE

    Identifying candidate routines for Robotic Process Automation from unsegmented UI logs

    Authors: V. Leno, A. Augusto, M. Dumas, M. La Rosa, F. Maggi, A. Polyvyanyy

    Abstract: Robotic Process Automation (RPA) is a technology to develop software bots that automate repetitive sequences of interactions between users and software applications (a.k.a. routines). To take full advantage of this technology, organizations need to identify and to scope their routines. This is a challenging endeavor in large organizations, as routines are usually not concentrated in a handful of p…

    Submitted 26 August, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: International Conference on Process Mining 2020

  26. arXiv:2007.16030  [pdf, other]

    cs.LG stat.ML

    Encoder-Decoder Generative Adversarial Nets for Suffix Generation and Remaining Time Prediction of Business Process Models

    Authors: Farbod Taymouri, Marcello La Rosa

    Abstract: This paper proposes an encoder-decoder architecture grounded on Generative Adversarial Networks (GANs), that generates a sequence of activities and their timestamps in an end-to-end way. GANs work well with differentiable data such as images. However, a suffix is a sequence of categorical items. To this end, we use the Gumbel-Softmax distribution to get a differentiable continuous approximation. T…

    Submitted 19 October, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2003.11268

  27. Detecting sudden and gradual drifts in business processes from execution traces

    Authors: Abderrahmane Maaradji, Marlon Dumas, Marcello La Rosa, Alireza Ostovar

    Abstract: Business processes are prone to unexpected changes, as process workers may suddenly or gradually start executing a process differently in order to adjust to changes in workload, season, or other external factors. Early detection of business process changes enables managers to identify and act upon changes that may otherwise affect process performance. Business process drift detection refers to a f…

    Submitted 7 May, 2020; originally announced May 2020.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering 29, no. 10 (2017)

  28. arXiv:2004.01781  [pdf, other]

    cs.SE cs.AI

    Efficient Conformance Checking using Approximate Alignment Computation with Tandem Repeats

    Authors: Daniel Reißner, Abel Armas-Cervantes, Marcello La Rosa

    Abstract: Conformance checking encompasses a body of process mining techniques which aim to find and describe the differences between a process model capturing the expected process behavior and a corresponding event log recording the observed behavior. Alignments are an established technique to compute the distance between a trace in the event log and the closest execution trace of a corresponding process m…

    Submitted 26 March, 2022; v1 submitted 1 April, 2020; originally announced April 2020.

  29. arXiv:2003.11268  [pdf, other]

    cs.LG stat.ML

    Predictive Business Process Monitoring via Generative Adversarial Nets: The Case of Next Event Prediction

    Authors: Farbod Taymouri, Marcello La Rosa, Sarah Erfani, Zahra Dasht Bozorgi, Ilya Verenich

    Abstract: Predictive process monitoring aims to predict future characteristics of an ongoing process case, such as case outcome or remaining timestamp. Recently, several predictive process monitoring methods based on deep learning such as Long Short-Term Memory or Convolutional Neural Network have been proposed to address the problem of next event prediction. However, due to insufficient training data or su…

    Submitted 1 April, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

  30. arXiv:2001.01007  [pdf, other]

    cs.AI

    Automated Discovery of Data Transformations for Robotic Process Automation

    Authors: Volodymyr Leno, Marlon Dumas, Marcello La Rosa, Fabrizio Maria Maggi, Artem Polyvyanyy

    Abstract: Robotic Process Automation (RPA) is a technology for automating repetitive routines consisting of sequences of user interactions with one or more applications. In order to fully exploit the opportunities opened by RPA, companies need to discover which specific routines may be automated, and how. In this setting, this paper addresses the problem of analyzing User Interaction (UI) logs in order to d…

    Submitted 3 January, 2020; originally announced January 2020.

    Comments: 8 pages, 5 figures. To be published in proceedings of AAAI-20 workshop on Intelligent Process Automation

  31. arXiv:1912.10598  [pdf, other]

    cs.LG stat.ML

    Business Process Variant Analysis based on Mutual Fingerprints of Event Logs

    Authors: Farbod Taymouri, Marcello La Rosa, Josep Carmona

    Abstract: Comparing business process variants using event logs is a common use case in process mining. Existing techniques for process variant analysis detect statistically-significant differences between variants at the level of individual entities (such as process activities) and their relationships (e.g. directly-follows relations between activities). This may lead to a proliferation of differences due t…

    Submitted 1 April, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

  32. arXiv:1912.01513  [pdf]

    cs.AI cs.LG cs.MA

    BADGER: Learning to (Learn [Learning Algorithms] through Multi-Agent Communication)

    Authors: Marek Rosa, Olga Afanasjeva, Simon Andersson, Joseph Davidson, Nicholas Guttenberg, Petr Hlubuček, Martin Poliak, Jaroslav Vítku, Jan Feyereisl

    Abstract: In this work, we propose a novel memory-based multi-agent meta-learning architecture and learning procedure that allows for learning of a shared communication policy that enables the emergence of rapid adaptation to new and unseen environments by learning to learn learning algorithms through communication. Behavior, adaptation and learning to adapt emerges from the interactions of homogeneous expe…

    Submitted 3 December, 2019; originally announced December 2019.

  33. arXiv:1911.07582  [pdf, other]

    cs.OH

    Business Process Variant Analysis: Survey and Classification

    Authors: Farbod Taymouri, Marcello La Rosa, Marlon Dumas, Fabrizio Maria Maggi

    Abstract: Process variant analysis aims at identifying and addressing the differences existing in a set of process executions enacted by the same process model. A process model can be executed differently in different situations for various reasons, e.g., the process could run in different locations or seasons, which gives rise to different behaviors. Having intuitions about the discrepancies in process beh…

    Submitted 22 December, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

  34. arXiv:1910.09767  [pdf, other]

    cs.SE

    Scalable Alignment of Process Models and Event Logs: An Approach Based on Automata and S-Components

    Authors: Daniel Reißner, Abel Armas-Cervantes, Raffaele Conforti, Marlon Dumas, Dirk Fahland, Marcello La Rosa

    Abstract: Given a model of the expected behavior of a business process and an event log recording its observed behavior, the problem of business process conformance checking is that of identifying and describing the differences between the model and the log. A desirable feature of a conformance checking technique is to identify a minimal yet complete set of differences. Existing conformance checking techniq…

    Submitted 4 March, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

  35. arXiv:1909.09543  [pdf, other]

    cs.PL cs.SE

    Process Query Language: Design, Implementation, and Evaluation

    Authors: Artem Polyvyanyy, Arthur H. M. ter Hofstede, Marcello La Rosa, Chun Ouyang, Anastasiia Pika

    Abstract: Organizations can benefit from the use of practices, techniques, and tools from the area of business process management. Through the focus on processes, they create process models that require management, including support for versioning, refactoring and querying. Querying thus far has primarily focused on structural properties of models rather than on exploiting behavioral properties capturing as…

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: 83 pages

  36. ToyArchitecture: Unsupervised Learning of Interpretable Models of the World

    Authors: Jaroslav Vítků, Petr Dluhoš, Joseph Davidson, Matěj Nikl, Simon Andersson, Přemysl Paška, Jan Šinkora, Petr Hlubuček, Martin Stránský, Martin Hyben, Martin Poliak, Jan Feyereisl, Marek Rosa

    Abstract: Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improvements in narrow AI domains, or on universal theoretical frameworks which are usually uncomputable, incompatible with theories of biological intelligence, or lack practical implementations. The goal of this work is to combine the main advantages of the two: to follow a big picture view, while providi…

    Submitted 9 September, 2020; v1 submitted 20 March, 2019; originally announced March 2019.

    Comments: Revision: changed the pdftitle

  37. arXiv:1805.02896  [pdf, other]

    cs.AI cs.LG

    Survey and cross-benchmark comparison of remaining time prediction methods in business process monitoring

    Authors: Ilya Verenich, Marlon Dumas, Marcello La Rosa, Fabrizio Maggi, Irene Teinemaa

    Abstract: Predictive business process monitoring methods exploit historical process execution logs to generate predictions about running instances (called cases) of a business process, such as the prediction of the outcome, next activity or remaining cycle time of a given process case. These insights could be used to support operational managers in taking remedial actions as business processes unfold, e.g.…

    Submitted 10 May, 2018; v1 submitted 8 May, 2018; originally announced May 2018.

  38. arXiv:1804.02704  [pdf, other]

    cs.LG stat.ML

    Discovering Process Maps from Event Streams

    Authors: Volodymyr Leno, Abel Armas-Cervantes, Marlon Dumas, Marcello La Rosa, Fabrizio M. Maggi

    Abstract: Automated process discovery is a class of process mining methods that allow analysts to extract business process models from event logs. Traditional process discovery methods extract process models from a snapshot of an event log stored in its entirety. In some scenarios, however, events keep coming with a high arrival rate to the extent that it is impractical to store the entire event log and to…

    Submitted 8 April, 2018; originally announced April 2018.

  39. arXiv:1707.06766  [pdf, other]

    cs.AI

    Outcome-Oriented Predictive Process Monitoring: Review and Benchmark

    Authors: Irene Teinemaa, Marlon Dumas, Marcello La Rosa, Fabrizio Maria Maggi

    Abstract: Predictive business process monitoring refers to the act of making predictions about the future state of ongoing cases of a business process, based on their incomplete execution traces and logs of historical (completed) traces. Motivated by the increasingly pervasive availability of fine-grained event data about business process executions, the problem of predictive process monitoring has received…

    Submitted 23 October, 2018; v1 submitted 21 July, 2017; originally announced July 2017.

  40. arXiv:1705.02288  [pdf, other]

    cs.SE

    Automated Discovery of Process Models from Event Logs: Review and Benchmark

    Authors: Adriano Augusto, Raffaele Conforti, Marlon Dumas, Marcello La Rosa, Fabrizio Maria Maggi, Andrea Marrella, Massimo Mecella, Allar Soo

    Abstract: Process mining allows analysts to exploit logs of historical executions of business processes to extract insights regarding the actual performance of these processes. One of the most widely studied process mining operations is automated process discovery. An automated process discovery method takes as input an event log, and produces as output a business process model that captures the control-flo…

    Submitted 29 January, 2018; v1 submitted 5 May, 2017; originally announced May 2017.

  41. arXiv:1704.03610  [pdf, other]

    cs.SE

    Blockchains for Business Process Management - Challenges and Opportunities

    Authors: Jan Mendling, Ingo Weber, Wil van der Aalst, Jan vom Brocke, Cristina Cabanillas, Florian Daniel, Soren Debois, Claudio Di Ciccio, Marlon Dumas, Schahram Dustdar, Avigdor Gal, Luciano Garcia-Banuelos, Guido Governatori, Richard Hull, Marcello La Rosa, Henrik Leopold, Frank Leymann, Jan Recker, Manfred Reichert, Hajo A. Reijers, Stefanie Rinderle-Ma, Andreas Rogge-Solti, Michael Rosemann, Stefan Schulte, Munindar P. Singh, et al. (7 additional authors not shown)

    Abstract: Blockchain technology promises a sizable potential for executing inter-organizational business processes without requiring a central party serving as a single point of trust (and failure). This paper analyzes its impact on business process management (BPM). We structure the discussion using two BPM frameworks, namely the six BPM core capabilities and the BPM lifecycle. This paper provides research…

    Submitted 31 January, 2018; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: Preprint for ACM TMIS

  42. arXiv:1612.02130  [pdf, other]

    stat.AP cs.DB cs.LG cs.NE stat.ML

    Predictive Business Process Monitoring with LSTM Neural Networks

    Authors: Niek Tax, Ilya Verenich, Marcello La Rosa, Marlon Dumas

    Abstract: Predictive business process monitoring methods exploit logs of completed cases of a process in order to make predictions about running cases thereof. Existing methods in this space are tailor-made for specific prediction tasks. Moreover, their relative accuracy is highly sensitive to the dataset at hand, thus requiring users to engage in trial-and-error and tuning when applying them in a specific…

    Submitted 16 May, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: Accepted at the International Conference on Advanced Information Systems Engineering (CAiSE) 2017

    Journal ref: Lecture Notes in Computer Science, 10253 (2017) 477-492

  43. arXiv:1611.00685  [pdf, other]

    cs.AI

    A Framework for Searching for General Artificial Intelligence

    Authors: Marek Rosa, Jan Feyereisl, The GoodAI Collective

    Abstract: There is a significant lack of unified approaches to building generally intelligent machines. The majority of current artificial intelligence research operates within a very narrow field of focus, frequently without considering the importance of the 'big picture'. In this document, we seek to describe and unify principles that guide the basis of our development of general artificial intelligence.…

    Submitted 2 November, 2016; originally announced November 2016.

  44. arXiv:1608.08252  [pdf, ps, other]

    cs.AI cs.DB

    Business Process Deviance Mining: Review and Evaluation

    Authors: Hoang Nguyen, Marlon Dumas, Marcello La Rosa, Fabrizio Maria Maggi, Suriadi Suriadi

    Abstract: Business process deviance refers to the phenomenon whereby a subset of the executions of a business process deviate, in a negative or positive way, with respect to its expected or desirable outcomes. Deviant executions of a business process include those that violate compliance rules, or executions that undershoot or exceed performance targets. Deviance mining is concerned with uncovering the reas…

    Submitted 29 August, 2016; originally announced August 2016.

  45. arXiv:1111.4570  [pdf, other]

    cs.SI physics.soc-ph

    Four Degrees of Separation

    Authors: Lars Backstrom, Paolo Boldi, Marco Rosa, Johan Ugander, Sebastiano Vigna

    Abstract: Frigyes Karinthy, in his 1929 short story "Láncszemek" ("Chains") suggested that any two persons are distanced by at most six friendship links. (The exact wording of the story is slightly ambiguous: "He bet us that, using no more than five individuals, one of whom is a personal acquaintance, he could contact the selected individual [...]". It is not completely clear whether the selected individua…

    Submitted 5 January, 2012; v1 submitted 19 November, 2011; originally announced November 2011.

  46. arXiv:1110.4474  [pdf, other]

    cs.SI physics.soc-ph

    Robustness of Social Networks: Comparative Results Based on Distance Distributions

    Authors: Paolo Boldi, Marco Rosa, Sebastiano Vigna

    Abstract: Given a social network, which of its nodes have a stronger impact in determining its structure? More formally: which node-removal order has the greatest impact on the network structure? We approach this well-known problem for the first time in a setting that combines both web graphs and social networks, using datasets that are orders of magnitude larger than those appearing in the previous literat…

    Submitted 20 October, 2011; originally announced October 2011.

  47. arXiv:1011.5599  [pdf, other]

    cs.DS cs.SI physics.soc-ph

    HyperANF: Approximating the Neighbourhood Function of Very Large Graphs on a Budget

    Authors: Paolo Boldi, Marco Rosa, Sebastiano Vigna

    Abstract: The neighbourhood function N(t) of a graph G gives, for each t, the number of pairs of nodes <x, y> such that y is reachable from x in less than t hops. The neighbourhood function provides a wealth of information about the graph (e.g., it easily allows one to compute its diameter), but it is very expensive to compute it exactly. Recently, the ANF algorithm (approximate neighbourhood function) has…

    Submitted 26 January, 2011; v1 submitted 25 November, 2010; originally announced November 2010.

  48. arXiv:1011.5425  [pdf, other]

    cs.DS cs.SI physics.soc-ph

    Layered Label Propagation: A MultiResolution Coordinate-Free Ordering for Compressing Social Networks

    Authors: Paolo Boldi, Marco Rosa, Massimo Santini, Sebastiano Vigna

    Abstract: We continue the line of research on graph compression started with WebGraph, but we move our focus to the compression of social networks in a proper sense (e.g., LiveJournal): the approaches that have been used for a long time to compress web graphs rely on a specific ordering of the nodes (lexicographical URL ordering) whose extension to general social networks is not trivial. In this paper, we p…

    Submitted 14 October, 2011; v1 submitted 24 November, 2010; originally announced November 2010.