Skip to main content

Showing 1–20 of 20 results for author: Fournier-Viger, P

.
  1. arXiv:2311.09667  [pdf, other

    cs.DB

    Repetitive nonoverlap** sequential pattern mining

    Authors: Meng Geng, Youxi Wu, Yan Li, **g Liu, Philippe Fournier-Viger, Xingquan Zhu, Xindong Wu

    Abstract: Sequential pattern mining (SPM) is an important branch of knowledge discovery that aims to mine frequent sub-sequences (patterns) in a sequential database. Various SPM methods have been investigated, and most of them are classical SPM methods, since these methods only consider whether or not a given pattern occurs within a sequence. Classical SPM can only find the common features of sequences, but… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  2. arXiv:2310.02612  [pdf, other

    cs.DB

    Top-k contrast order-preserving pattern mining

    Authors: Youxi Wu, Yufei Meng, Yan Li, Lei Guo, Xingquan Zhu, Philippe Fournier-Viger, Xindong Wu

    Abstract: Recently, order-preserving pattern (OPP) mining, a new sequential pattern mining method, has been proposed to mine frequent relative orders in a time series. Although frequent relative orders can be used as features to classify a time series, the mined patterns do not reflect the differences between two classes of time series well. To effectively discover the differences between time series, this… ▽ More

    Submitted 7 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  3. arXiv:2308.05463  [pdf, other

    cs.LG

    $\mathcal{G}^2Pxy$: Generative Open-Set Node Classification on Graphs with Proxy Unknowns

    Authors: Qin Zhang, Zelin Shi, Xiaolin Zhang, Xiaojun Chen, Philippe Fournier-Viger, Shirui Pan

    Abstract: Node classification is the task of predicting the labels of unlabeled nodes in a graph. State-of-the-art methods based on graph neural networks achieve excellent performance when all labels are available during training. But in real-life, models are often applied on data with new classes, which can lead to massive misclassification and thus significantly degrade performance. Hence, develo** open… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 8 pages, 1 figure

  4. arXiv:2304.10254  [pdf, other

    cs.CV

    Image-text Retrieval via Preserving Main Semantics of Vision

    Authors: Xu Zhang, Xinzheng Niu, Philippe Fournier-Viger, Xudong Dai

    Abstract: Image-text retrieval is one of the major tasks of cross-modal retrieval. Several approaches for this task map images and texts into a common space to create correspondences between the two modalities. However, due to the content (semantics) richness of an image, redundant secondary information in an image may cause false matches. To address this issue, this paper presents a semantic optimization a… ▽ More

    Submitted 28 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 6 pages, 3 figures, accepted by ICME2023

  5. arXiv:2303.14510  [pdf, other

    cs.DB

    Targeted Mining of Top-k High Utility Itemsets

    Authors: Shan Huang, Wensheng Gan, **bao Miao, Xuming Han, Philippe Fournier-Viger

    Abstract: Finding high-importance patterns in data is an emerging data mining task known as High-utility itemset mining (HUIM). Given a minimum utility threshold, a HUIM algorithm extracts all the high-utility itemsets (HUIs) whose utility values are not less than the threshold. This can reveal a wealth of useful information, but the precise needs of users are not well taken into account. In particular, use… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Preprint. 5 figures, 5 tables

  6. arXiv:2302.11426  [pdf, other

    cs.DB cs.IR

    Mining compact high utility sequential patterns

    Authors: Tai Dinh, Philippe Fournier-Viger, Huynh Van Hong

    Abstract: High utility sequential pattern mining (HUSPM) aims to mine all patterns that yield a high utility (profit) in a sequence dataset. HUSPM is useful for several applications such as market basket analysis, marketing, and website clickstream analysis. In these applications, users may also consider high utility patterns frequently appearing in the dataset to obtain more fruitful information. However,… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Nippon (Japan) Applied Informatics Society Journal

  7. arXiv:2209.08932  [pdf, other

    cs.DB

    OPR-Miner: Order-preserving rule mining for time series

    Authors: Youxi Wu, Xiaoqian Zhao, Yan Li, Lei Guo, Xingquan Zhu, Philippe Fournier-Viger, Xindong Wu

    Abstract: Discovering frequent trends in time series is a critical task in data mining. Recently, order-preserving matching was proposed to find all occurrences of a pattern in a time series, where the pattern is a relative order (regarded as a trend) and an occurrence is a sub-time series whose relative order coincides with the pattern. Inspired by the order-preserving matching, the existing order-preservi… ▽ More

    Submitted 4 December, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

  8. arXiv:2206.06157  [pdf, other

    cs.DB cs.AI

    Towards Target High-Utility Itemsets

    Authors: **bao Miao, Wensheng Gan, Shicheng Wan, Yongdong Wu, Philippe Fournier-Viger

    Abstract: For applied intelligence, utility-driven pattern discovery algorithms can identify insightful and useful patterns in databases. However, in these techniques for pattern discovery, the number of patterns can be huge, and the user is often only interested in a few of those patterns. Hence, targeted high-utility itemset mining has emerged as a key research topic, where the aim is to find a subset of… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Preprint. 6 figures, 5 tables

  9. arXiv:2204.12704  [pdf, other

    cs.AI

    Discovering Representative Attribute-stars via Minimum Description Length

    Authors: Jiahong Liu, Min Zhou, Philippe Fournier-Viger, Menglin Yang, Lujia Pan, Mourad Nouioua

    Abstract: Graphs are a popular data type found in many domains. Numerous techniques have been proposed to find interesting patterns in graphs to help understand the data and support decision-making. However, there are generally two limitations that hinder their practical use: (1) they have multiple parameters that are hard to set but greatly influence results, (2) and they generally focus on identifying com… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: 14pages.Accepted by ICDE 2022

  10. arXiv:2202.13041  [pdf, other

    cs.AI

    Towards Revenue Maximization with Popular and Profitable Products

    Authors: Wensheng Gan, Guoting Chen, Hongzhi Yin, Philippe Fournier-Viger, Chien-Ming Chen, Philip S. Yu

    Abstract: Economic-wise, a common goal for companies conducting marketing is to maximize the return revenue/profit by utilizing the various effective marketing strategies. Consumer behavior is crucially important in economy and targeted marketing, in which behavioral economics can provide valuable insights to identify the biases and profit from customers. Finding credible and reliable information on product… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: ACM/IMS Transactions on Data Science. 4 figures, 5 tables

  11. arXiv:2201.08554  [pdf, other

    cs.LG

    Enhancing Hyperbolic Graph Embeddings via Contrastive Learning

    Authors: Jiahong Liu, Menglin Yang, Min Zhou, Shanshan Feng, Philippe Fournier-Viger

    Abstract: Recently, hyperbolic space has risen as a promising alternative for semi-supervised graph representation learning. Many efforts have been made to design hyperbolic versions of neural network operations. However, the inspiring geometric properties of this unique geometry have not been fully explored yet. The potency of graph models powered by the hyperbolic space is still largely underestimated. Be… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: Accepted by NeurIPS'21@2nd Workshop on Self-Supervised Learning

  12. Utility-Driven Mining of Trend Information for Intelligent System

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Han-Chieh Chao, Philippe Fournier-Viger, Xuan Wang, Philip S. Yu

    Abstract: Useful knowledge, embedded in a database, is likely to change over time. Identifying recent changes in temporal databases can provide valuable up-to-date information to decision-makers. Nevertheless, techniques for mining high-utility patterns (HUPs) seldom consider recency as a criterion to discover patterns. Thus, the traditional utility mining framework is inadequate for obtaining up-to-date in… ▽ More

    Submitted 25 December, 2019; originally announced December 2019.

    Comments: Accepted by ACM Trans. on Management Information Systems, 26 pages, 8 figures

    Journal ref: ACM Transactions on Management Information Systems, 2020

  13. arXiv:1904.12248  [pdf, other

    cs.DB cs.DS

    Fast Utility Mining on Complex Sequences

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, Philippe Fournier-Viger, Han-Chieh Chao, Philip S. Yu

    Abstract: High-utility sequential pattern mining is an emerging topic in the field of Knowledge Discovery in Databases. It consists of discovering subsequences having a high utility (importance) in sequences, referred to as high-utility sequential patterns (HUSPs). HUSPs can be applied to many real-life applications, such as market basket analysis, E-commerce recommendation, click-stream analysis and scenic… ▽ More

    Submitted 27 April, 2019; originally announced April 2019.

    Comments: Under review in IEEE TKDE, 15 pages

  14. Beyond Frequency: Utility Mining with Varied Item-Specific Minimum Utility

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Philip S Yu

    Abstract: Utility-oriented mining which integrates utility theory and data mining is a useful tool for understanding economic consumer behavior. Traditional algorithms for mining high-utility patterns (HUPs) applies a single/uniform minimum high-utility threshold (minutil) to obtain the set of HUPs, but in some real-life circumstances, some specific products may bring lower utilities compared with others, b… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: Under review in ACM Trans. on Data Science, 31 pages

    Journal ref: ACM Transactions on Internet Technology, 2021

  15. arXiv:1902.09582  [pdf, other

    cs.DB

    Utility Mining Across Multi-Dimensional Sequences

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, Hongzhi Yin, Philippe Fournier-Viger, Han-Chieh Chao, Philip S. Yu

    Abstract: Knowledge extraction from database is the fundamental task in database and data mining community, which has been applied to a wide range of real-world applications and situations. Different from the support-based mining models, the utility-oriented mining framework integrates the utility theory to provide more informative and useful patterns. Time-dependent sequence data is commonly seen in real l… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: Under review in IEEE TKDE, 14 pages

  16. HUOPM: High Utility Occupancy Pattern Mining

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Philip S. Yu

    Abstract: Mining useful patterns from varied types of databases is an important research topic, which has many real-life applications. Most studies have considered the frequency as sole interestingness measure for identifying high quality patterns. However, each object is different in nature. The relative importance of objects is not equal, in terms of criteria such as the utility, risk, or interest. Beside… ▽ More

    Submitted 28 December, 2018; originally announced December 2018.

    Comments: Accepted by IEEE Transactions on Cybernetics, 14 pages

    Journal ref: IEEE Transactions on Cybernetics, 2019

  17. A Survey of Parallel Sequential Pattern Mining

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Philip S. Yu

    Abstract: With the growing popularity of shared resources, large volumes of complex data of different types are collected automatically. Traditional data mining algorithms generally have problems and challenges including huge memory cost, low processing speed, and inadequate hard disk space. As a fundamental task of data mining, sequential pattern mining (SPM) is used in a wide variety of real-life applicat… ▽ More

    Submitted 4 April, 2019; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: Accepted by ACM Trans. on Knowl. Discov. Data, 33 pages

    Journal ref: ACM Transactions on Knowledge Discovery from Data, 2019

  18. A Survey of Utility-Oriented Pattern Mining

    Authors: Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, Vincent S. Tseng, Philip S. Yu

    Abstract: The main purpose of data mining and analytics is to find novel, potentially useful patterns that can be utilized in real-world applications to derive beneficial knowledge. For identifying and evaluating the usefulness of different kinds of patterns, many techniques and constraints have been proposed, such as support, confidence, sequence order, and utility parameters (e.g., weight, price, profit,… ▽ More

    Submitted 16 September, 2019; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: Survey paper, accepted by IEEE TKDE, 20 pages

    Journal ref: IEEE Transactions on Knowledge and Data Engineering, 2021

  19. arXiv:0901.4963  [pdf

    cs.AI

    How Emotional Mechanism Helps Episodic Learning in a Cognitive Agent

    Authors: Usef Faghihi, Philippe Fournier-Viger, Roger Nkambou, Pierre Poirier, Andre Mayers

    Abstract: In this paper we propose the CTS (Concious Tutoring System) technology, a biologically plausible cognitive agent based on human brain functions.This agent is capable of learning and remembering events and any related information such as corresponding procedures, stimuli and their emotional valences. Our proposed episodic memory and episodic learning mechanism are closer to the current multiple-t… ▽ More

    Submitted 30 January, 2009; originally announced January 2009.

  20. A Knowledge Discovery Framework for Learning Task Models from User Interactions in Intelligent Tutoring Systems

    Authors: P. Fournier-Viger, R. Nkambou, E. Mephu Nguifo

    Abstract: Domain experts should provide relevant domain knowledge to an Intelligent Tutoring System (ITS) so that it can guide a learner during problemsolving learning activities. However, for many ill-defined domains, the domain knowledge is hard to define explicitly. In previous works, we showed how sequential pattern mining can be used to extract a partial problem space from logged user interactions, a… ▽ More

    Submitted 29 January, 2009; originally announced January 2009.

    Comments: Proceedings of the 7th Mexican International Conference on Artificial Intelligence (MICAI 2008), Springer, pp. 765-778