Search | arXiv e-print repository

Adaptable Logical Control for Large Language Models

Authors: Honghua Zhang, Po-Nien Kung, Masahiro Yoshida, Guy Van den Broeck, Nanyun Peng

Abstract: Despite the success of Large Language Models (LLMs) on various tasks following human instructions, controlling model generation at inference time poses a persistent challenge. In this paper, we introduce Ctrl-G, an adaptable framework that facilitates tractable and flexible control of LLM generation to reliably follow logical constraints. Ctrl-G combines any production-ready LLM with a Hidden Mark… ▽ More Despite the success of Large Language Models (LLMs) on various tasks following human instructions, controlling model generation at inference time poses a persistent challenge. In this paper, we introduce Ctrl-G, an adaptable framework that facilitates tractable and flexible control of LLM generation to reliably follow logical constraints. Ctrl-G combines any production-ready LLM with a Hidden Markov Model, enabling LLM outputs to adhere to logical constraints represented as deterministic finite automata. We show that Ctrl-G, when applied to a TULU2-7B model, outperforms GPT3.5 and GPT4 on the task of interactive text editing: specifically, for the task of generating text insertions/continuations following logical constraints, Ctrl-G achieves over 30% higher satisfaction rate in human evaluation compared to GPT4. When applied to medium-size language models (e.g., GPT2-large), Ctrl-G also beats its counterparts for constrained generation by large margins on standard benchmarks. Additionally, as a proof-of-concept study, we experiment Ctrl-G on the Grade School Math benchmark to assist LLM reasoning, foreshadowing the application of Ctrl-G, as well as other constrained generation approaches, beyond traditional language generation tasks. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2312.08235 [pdf, other]

Analysis of Psychographic Indicators via LIWC and Their Correlation with CTR for Instagram Ads

Authors: Kenjiro Inoue, Mitsuo Yoshida

Abstract: The online advertising industry continues to grow and accounts for over 40% of global advertising spending. Online display advertising consists of images and text, and advertisers maximize sales revenue by contacting consumers through advertisements and encouraging them to make purchases. In today's society, where products are becoming more homogenized and needs are diversifying, appealing to cons… ▽ More The online advertising industry continues to grow and accounts for over 40% of global advertising spending. Online display advertising consists of images and text, and advertisers maximize sales revenue by contacting consumers through advertisements and encouraging them to make purchases. In today's society, where products are becoming more homogenized and needs are diversifying, appealing to consumer psychology through advertisements is becoming increasingly important. However, it is not sufficiently clear what kind of appeal influences consumer psychology. In this study, we quantified the appeal of the text in advertisements for health products and cosmetics, which were actually delivered in Instagram advertisements (one of display advertisements), by applying linguistic inquiry and word count (LIWC). The correlation between click-through rate (CTR) and the text was analyzed. The results showed that negative appeals that arouse consumer anxiety and a sense of crisis were related to CTR. △ Less

Submitted 13 December, 2023; originally announced December 2023.

Comments: WI-IAT 2023 Workshop: The 8th International Workshop on Application of Big Data for Computational Social Science (ABCSS 2023)

arXiv:2305.06141 [pdf, ps, other]

Active Semantic Localization with Graph Neural Embedding

Authors: Mitsuki Yoshida, Kanji Tanaka, Ryogo Yamamoto, Daiki Iwata

Abstract: Semantic localization, i.e., robot self-localization with semantic image modality, is critical in recently emerging embodied AI applications (e.g., point-goal navigation, object-goal navigation, vision language navigation) and topological map** applications (e.g., graph neural SLAM, ego-centric topological map). However, most existing works on semantic localization focus on passive vision tasks… ▽ More Semantic localization, i.e., robot self-localization with semantic image modality, is critical in recently emerging embodied AI applications (e.g., point-goal navigation, object-goal navigation, vision language navigation) and topological map** applications (e.g., graph neural SLAM, ego-centric topological map). However, most existing works on semantic localization focus on passive vision tasks without viewpoint planning, or rely on additional rich modalities (e.g., depth measurements). Thus, the problem is largely unsolved. In this work, we explore a lightweight, entirely CPU-based, domain-adaptive semantic localization framework, called graph neural localizer. Our approach is inspired by two recently emerging technologies: (1) Scene graph, which combines the viewpoint- and appearance- invariance of local and global features; (2) Graph neural network, which enables direct learning/recognition of graph data (i.e., non-vector data). Specifically, a graph convolutional neural network is first trained as a scene graph classifier for passive vision, and then its knowledge is transferred to a reinforcement-learning planner for active vision. Experiments on two scenarios, self-supervised learning and unsupervised domain adaptation, using a photo-realistic Habitat simulator validate the effectiveness of the proposed method. △ Less

Submitted 26 December, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

Comments: ACPR2023 (extended version)

Journal ref: Pattern Recognition. ACPR 2023. Lecture Notes in Computer Science, vol 14406. Springer, Cham

arXiv:2211.04024 [pdf, other]

Comparing Two Counting Methods for Estimating the Probabilities of Strings

Authors: Ayaka Takamoto, Mitsuo Yoshida, Kyoji Umemura

Abstract: There are two methods for counting the number of occurrences of a string in another large string. One is to count the number of places where the string is found. The other is to determine how many pieces of string can be extracted without overlap**. The difference between the two becomes apparent when the string is part of a periodic pattern. This research reports that the difference is signific… ▽ More There are two methods for counting the number of occurrences of a string in another large string. One is to count the number of places where the string is found. The other is to determine how many pieces of string can be extracted without overlap**. The difference between the two becomes apparent when the string is part of a periodic pattern. This research reports that the difference is significant in estimating the occurrence probability of a pattern. In this study, the strings used in the experiments are approximated from time-series data. The task involves classifying strings by estimating the probability or computing the information quantity. First, the frequencies of all substrings of a string are computed. Each counting method may sometimes produce different frequencies for an identical string. Second, the probability of the most probable segmentation is selected. The probability of the string is the product of all probabilities of substrings in the selected segmentation. The classification results demonstrate that the difference in counting methods is statistically significant, and that the method without overlap** is better. △ Less

Submitted 8 November, 2022; originally announced November 2022.

arXiv:2210.13874 [pdf, other]

Follower--Followee Ratio Category and User Vector for Analyzing Following Behavior

Authors: Hayato Oshimo, Shiori Hironaka, Mitsuo Yoshida, Kyoji Umemura

Abstract: Analyzing following behavior is important in many applications. Following behavior may depend on the main intention of the follower. Users may either follow their friends or they may follow celebrities to know more about them. It is difficult to estimate users' intention from their following relationships. In this paper, we propose an approach to analyze following relationships. First, we investig… ▽ More Analyzing following behavior is important in many applications. Following behavior may depend on the main intention of the follower. Users may either follow their friends or they may follow celebrities to know more about them. It is difficult to estimate users' intention from their following relationships. In this paper, we propose an approach to analyze following relationships. First, we investigated the similarity between users. Similar followers and followees are likely to be friends. However, when the follower and followee are not similar, it is likely that follower seeks to obtain more information on the followee. Second, we categorized users by the network structure. We then proposed analysis of following behavior based on similarity and category of users estimated from tweets and user data. We confirmed the feasibility of the proposed method through experiments. Finally, we examined users in different categories and analyzed their following behavior. △ Less

Submitted 25 October, 2022; originally announced October 2022.

Comments: 2022 9th International Conference on Advanced Informatics: Concepts, Theory and Applications

arXiv:2210.10221 [pdf, other]

doi 10.1109/ICIP46576.2022.9898014

Non-iterative optimization of pseudo-labeling thresholds for training object detection models from multiple datasets

Authors: Yuki Tanaka, Shuhei M. Yoshida, Makoto Terao

Abstract: We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain th… ▽ More We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain the best result, however, thresholds for prediction confidence must be adjusted. This process typically involves iterative search and repeated training of student models and is time-consuming. Therefore, we develop a method to optimize the thresholds without iterative optimization by maximizing the $F_β$-score on a validation dataset, which measures the quality of pseudo labels and can be measured without training a student model. We experimentally demonstrate that our proposed method achieves an mAP comparable to that of grid search on the COCO and VOC datasets. △ Less

Submitted 18 October, 2022; originally announced October 2022.

Comments: ICIP2022

Journal ref: 2022 IEEE International Conference on Image Processing (ICIP), 2022, pp. 1676-1680

arXiv:2204.12089 [pdf, other]

Acquiring a Dynamic Light Field through a Single-Shot Coded Image

Authors: Ryoya Mizuno, Keita Takahashi, Michitaka Yoshida, Chihiro Tsutake, Toshiaki Fujii, Hajime Nagahara

Abstract: We propose a method for compressively acquiring a dynamic light field (a 5-D volume) through a single-shot coded image (a 2-D measurement). We designed an imaging model that synchronously applies aperture coding and pixel-wise exposure coding within a single exposure time. This coding scheme enables us to effectively embed the original information into a single observed image. The observed image i… ▽ More We propose a method for compressively acquiring a dynamic light field (a 5-D volume) through a single-shot coded image (a 2-D measurement). We designed an imaging model that synchronously applies aperture coding and pixel-wise exposure coding within a single exposure time. This coding scheme enables us to effectively embed the original information into a single observed image. The observed image is then fed to a convolutional neural network (CNN) for light-field reconstruction, which is jointly trained with the camera-side coding patterns. We also developed a hardware prototype to capture a real 3-D scene moving over time. We succeeded in acquiring a dynamic light field with 5x5 viewpoints over 4 temporal sub-frames (100 views in total) from a single observed image. Repeating capture and reconstruction processes over time, we can acquire a dynamic light field at 4x the frame rate of the camera. To our knowledge, our method is the first to achieve a finer temporal resolution than the camera itself in compressive light-field acquisition. Our software is available from our project webpage △ Less

Submitted 26 April, 2022; originally announced April 2022.

arXiv:2204.10497 [pdf, ps, other]

Active Domain-Invariant Self-Localization Using Ego-Centric and World-Centric Maps

Authors: Kanya Kurauchi, Kanji Tanaka, Ryogo Yamamoto, Mitsuki Yoshida

Abstract: The training of a next-best-view (NBV) planner for visual place recognition (VPR) is a fundamentally important task in autonomous robot navigation, for which a typical approach is the use of visual experiences that are collected in the target domain as training data. However, the collection of a wide variety of visual experiences in everyday navigation is costly and prohibitive for real-time robot… ▽ More The training of a next-best-view (NBV) planner for visual place recognition (VPR) is a fundamentally important task in autonomous robot navigation, for which a typical approach is the use of visual experiences that are collected in the target domain as training data. However, the collection of a wide variety of visual experiences in everyday navigation is costly and prohibitive for real-time robotic applications. We address this issue by employing a novel {\it domain-invariant} NBV planner. A standard VPR subsystem based on a convolutional neural network (CNN) is assumed to be available, and its domain-invariant state recognition ability is proposed to be transferred to train the domain-invariant NBV planner. Specifically, we divide the visual cues that are available from the CNN model into two types: the output layer cue (OLC) and intermediate layer cue (ILC). The OLC is available at the output layer of the CNN model and aims to estimate the state of the robot (e.g., the robot viewpoint) with respect to the world-centric view coordinate system. The ILC is available within the middle layers of the CNN model as a high-level description of the visual content (e.g., a saliency image) with respect to the ego-centric view. In our framework, the ILC and OLC are mapped to a state vector and subsequently used to train a multiview NBV planner via deep reinforcement learning. Experiments using the public NCLT dataset validate the effectiveness of the proposed method. △ Less

Submitted 28 July, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

Comments: 13 pages, 4 figures, draft version of a manuscript submitted to CVMI2022

arXiv:2112.08073 [pdf, other]

doi 10.1145/3486622.3493947

Analysis of Leading Communities Contributing to arXiv Information Distribution on Twitter

Authors: Kyosuke Shimada, Kazuhiro Kazama, Mitsuo Yoshida, Ikki Ohmukai, Sho Sato

Abstract: To analyze the impact that arXiv is having on the world, in this paper we propose an arXiv information distribution model on Twitter, which has a three-layer structure: arXiv papers, information spreaders, and information collectors. First, we use the HITS algorithm to analyze the arXiv information diffusion network with users as nodes, which is created from three types of behavior on Twitter rega… ▽ More To analyze the impact that arXiv is having on the world, in this paper we propose an arXiv information distribution model on Twitter, which has a three-layer structure: arXiv papers, information spreaders, and information collectors. First, we use the HITS algorithm to analyze the arXiv information diffusion network with users as nodes, which is created from three types of behavior on Twitter regarding arXiv papers: tweeting, retweeting, and liking. Next, we extract communities from the network of information spreaders with positive authority and hub degrees using the Louvain method, and analyze the relationship and roles of information spreaders in communities using research field, linguistic, and temporal characteristics. From our analysis using the tweet and arXiv datasets, we found that information about arXiv papers circulates on Twitter from information spreaders to information collectors, and that multiple communities of information spreaders are formed according to their research fields. It was also found that different communities were formed in the same research field, depending on the research or cultural background of the information spreaders. We were able to identify two types of key persons: information spreaders who lead the relevant field in the international community and information spreaders who bridge the regional and international communities using English and their native language. In addition, we found that it takes some time to gain trust as an information spreader. △ Less

Submitted 15 December, 2021; originally announced December 2021.

Comments: The 20th IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT '21)

arXiv:2112.07230 [pdf, other]

doi 10.1145/3486622.3493982

Do you trust experts on Twitter?: Successful correction of COVID-19-related misinformation

Authors: Dongwoo Lim, Fujio Toriumi, Mitsuo Yoshida

Abstract: This study focuses on how scientifically-correct information is disseminated through social media, and how misinformation can be corrected. We have identified examples on Twitter where scientific terms that have been misused have been rectified and replaced by scientifically-correct terms through the interaction of users. The results show that the percentage of correct terms ("variant" or "COVID-1… ▽ More This study focuses on how scientifically-correct information is disseminated through social media, and how misinformation can be corrected. We have identified examples on Twitter where scientific terms that have been misused have been rectified and replaced by scientifically-correct terms through the interaction of users. The results show that the percentage of correct terms ("variant" or "COVID-19 variant") being used instead of the incorrect terms ("strain") on Twitter has already increased since the end of December 2020. This was about a month before the release of an official statement by the Japanese Association for Infectious Diseases regarding the correct terminology, and the use of terms on social media was faster than it was in television. Some Twitter users who quickly started using the correct term were more likely to retweet messages sent by leading influencers on Twitter, rather than messages sent by traditional media or portal sites. However, a few Twitter users continued to use wrong terms even after March 2021, even though the use of the correct terms was widespread. Further analysis of their tweets revealed that they were quoting sources that differed from that of other users. This study empirically verified that self-correction occurs even on Twitter, which is often known as a "hotbed for spreading rumors." The results of this study also suggest that influencers with expertise can influence the direction of public opinion on social media and that the media that users usually cite can also affect the possibility of behavioral changes. △ Less

Submitted 14 December, 2021; originally announced December 2021.

Comments: The 20th IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT '21)

arXiv:2111.03350 [pdf, other]

doi 10.1109/ICAICTA53211.2021.9640293

Feature Selective Likelihood Ratio Estimator for Low- and Zero-frequency N-grams

Authors: Masato Kikuchi, Mitsuo Yoshida, Kyoji Umemura, Tadachika Ozono

Abstract: In natural language processing (NLP), the likelihood ratios (LRs) of N-grams are often estimated from the frequency information. However, a corpus contains only a fraction of the possible N-grams, and most of them occur infrequently. Hence, we desire an LR estimator for low- and zero-frequency N-grams. One way to achieve this is to decompose the N-grams into discrete values, such as letters and wo… ▽ More In natural language processing (NLP), the likelihood ratios (LRs) of N-grams are often estimated from the frequency information. However, a corpus contains only a fraction of the possible N-grams, and most of them occur infrequently. Hence, we desire an LR estimator for low- and zero-frequency N-grams. One way to achieve this is to decompose the N-grams into discrete values, such as letters and words, and take the product of the LRs for the values. However, because this method deals with a large number of discrete values, the running time and memory usage for estimation are problematic. Moreover, use of unnecessary discrete values causes deterioration of the estimation accuracy. Therefore, this paper proposes combining the aforementioned method with the feature selection method used in document classification, and shows that our estimator provides effective and efficient estimation results for low- and zero-frequency N-grams. △ Less

Submitted 5 November, 2021; originally announced November 2021.

Comments: The 2021 International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA 2021)

arXiv:2110.13410 [pdf, other]

doi 10.1109/ICAICTA53211.2021.9640257

Comparison of Indicators of Location Homophily Using Twitter Follow Graph

Authors: Shiori Hironaka, Mitsuo Yoshida, Kyoji Umemura

Abstract: Location homophily is a tendency of Twitter users whose followers tend to be in the same or nearby areas. Intuitively, although users with a higher number of follower relationships might have negative homophily indicators, it is worth consulting actual Twitter data. Moreover, there may be certain functions regarding the numbers of friends and followers that are more directly correlated to the homo… ▽ More Location homophily is a tendency of Twitter users whose followers tend to be in the same or nearby areas. Intuitively, although users with a higher number of follower relationships might have negative homophily indicators, it is worth consulting actual Twitter data. Moreover, there may be certain functions regarding the numbers of friends and followers that are more directly correlated to the homophily. In this study, the ratio of the number of friends to the number of followers is shown to be a more effective negative indicator of homophily, and the results for 10 different countries are verified. △ Less

Submitted 26 October, 2021; originally announced October 2021.

Comments: The 2021 International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA 2021)

arXiv:2110.00946 [pdf, other]

doi 10.1587/transfun.2020EAP1088

Unified Likelihood Ratio Estimation for High- to Zero-frequency N-grams

Authors: Masato Kikuchi, Kento Kawakami, Kazuho Watanabe, Mitsuo Yoshida, Kyoji Umemura

Abstract: Likelihood ratios (LRs), which are commonly used for probabilistic data processing, are often estimated based on the frequency counts of individual elements obtained from samples. In natural language processing, an element can be a continuous sequence of $N$ items, called an $N$-gram, in which each item is a word, letter, etc. In this paper, we attempt to estimate LRs based on $N$-gram frequency i… ▽ More Likelihood ratios (LRs), which are commonly used for probabilistic data processing, are often estimated based on the frequency counts of individual elements obtained from samples. In natural language processing, an element can be a continuous sequence of $N$ items, called an $N$-gram, in which each item is a word, letter, etc. In this paper, we attempt to estimate LRs based on $N$-gram frequency information. A naive estimation approach that uses only $N$-gram frequencies is sensitive to low-frequency (rare) $N$-grams and not applicable to zero-frequency (unobserved) $N$-grams; these are known as the low- and zero-frequency problems, respectively. To address these problems, we propose a method for decomposing $N$-grams into item units and then applying their frequencies along with the original $N$-gram frequencies. Our method can obtain the estimates of unobserved $N$-grams by using the unit frequencies. Although using only unit frequencies ignores dependencies between items, our method takes advantage of the fact that certain items often co-occur in practice and therefore maintains their dependencies by using the relevant $N$-gram frequencies. We also introduce a regularization to achieve robust estimation for rare $N$-grams. Our experimental results demonstrate that our method is effective at solving both problems and can effectively control dependencies. △ Less

Submitted 3 October, 2021; originally announced October 2021.

Comments: 17 pages, 8 figures

Journal ref: IEICE Trans. Fundamentals, vol.E104-A, no.8, pp.1059-1074, Aug. 2021

arXiv:2109.04569 [pdf, ps, other]

Highly Compressive Visual Self-localization Using Sequential Semantic Scene Graph and Graph Convolutional Neural Network

Authors: Mitsuki Yoshida, Ryogo Yamamoto, Kanji Tanaka

Abstract: In this paper, we address the problem of image sequence-based self-localization from a new highly compressive scene representation called sequential semantic scene graph (S3G). Highly-compressive scene representation is essential for robots to perform long-term and huge-numbers of VPR tasks in virtual-training and real-deploy environments. Recent developments in deep graph convolutional neural net… ▽ More In this paper, we address the problem of image sequence-based self-localization from a new highly compressive scene representation called sequential semantic scene graph (S3G). Highly-compressive scene representation is essential for robots to perform long-term and huge-numbers of VPR tasks in virtual-training and real-deploy environments. Recent developments in deep graph convolutional neural networks (GCNs) have enabled a highly compressive visual place classifier (VPC) that can use a scene graph as the input modality. However, in such a highly compressive application, the amount of information lost in the image-to-graph map** is significant and can damage the classification performance. To address this issue, we propose a pair of similarity-preserving map**s, image-to-nodes and image-to-edges, such that the nodes and edges act as absolute and relative features, respectively, that complement each other. Moreover, the proposed GCN-VPC is applied to a new task of viewpoint planning of the query image sequence, which contributes to further improvement in the VPC performance. Experiments using the public NCLT dataset validated the effectiveness of the proposed method. △ Less

Submitted 29 October, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

Comments: 6 pages, 5 figures, Draft version of a paper presented at the 13th IROS Workshop on Planning, Perception, Navigation for Intelligent Vehicle (PPNIV2022)

arXiv:2103.02893 [pdf, other]

Lower-Bounded Proper Losses for Weakly Supervised Classification

Authors: Shuhei M. Yoshida, Takashi Takenouchi, Masashi Sugiyama

Abstract: This paper discusses the problem of weakly supervised classification, in which instances are given weak labels that are produced by some label-corruption process. The goal is to derive conditions under which loss functions for weak-label learning are proper and lower-bounded -- two essential requirements for the losses used in class-probability estimation. To this end, we derive a representation t… ▽ More This paper discusses the problem of weakly supervised classification, in which instances are given weak labels that are produced by some label-corruption process. The goal is to derive conditions under which loss functions for weak-label learning are proper and lower-bounded -- two essential requirements for the losses used in class-probability estimation. To this end, we derive a representation theorem for proper losses in supervised learning, which dualizes the Savage representation. We use this theorem to characterize proper weak-label losses and find a condition for them to be lower-bounded. From these theoretical findings, we derive a novel regularization scheme called generalized logit squeezing, which makes any proper weak-label loss bounded from below, without losing properness. Furthermore, we experimentally demonstrate the effectiveness of our proposed approach, as compared to improper or unbounded losses. The results highlight the importance of properness and lower-boundedness. △ Less

Submitted 11 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

Comments: ICML2021 camera ready, code available at https://github.com/yoshum/lower-bounded-proper-losses

arXiv:2101.09665 [pdf, other]

Corrective Information Does Not Necessarily Curb Social Disruption

Authors: Ryusuke Iizuka, Fujio Toriumi, Mao Nishiguchi, Masanori Takano, Mitsuo Yoshida

Abstract: The spread of misinformation can cause social confusion. The authenticity of information on a social networking service (SNS) is unknown, and false information can be easily spread. Consequently, many studies have been conducted on methods to control the spread of misinformation on social networking sites. However, few studies have examined the impact of the spread of misinformation and its correc… ▽ More The spread of misinformation can cause social confusion. The authenticity of information on a social networking service (SNS) is unknown, and false information can be easily spread. Consequently, many studies have been conducted on methods to control the spread of misinformation on social networking sites. However, few studies have examined the impact of the spread of misinformation and its corrections on society. This study models the impact of the reduction of misinformation and the diffusion of corrective information on social disruption, and it identifies the features of this impact. In this study, we analyzed misinformation regarding the shortage of toilet paper during the 2020 COVID-19 epidemic, its corrections, and the excessive purchasing caused by this information. First, we analyze the amount of misinformation and corrective information spread on SNS, and we create a regression model to estimate the real-world impact of misinformation and its correction. This model is used to analyze the change in real-world impact corresponding to the change in the diffusion of misinformation and corrective information. Our analysis shows that the corrective information was spread to a much greater extent than the misinformation. In addition, our model reveals that the corrective information was what caused the excessive purchasing behavior. As a result of our further analysis, we found that the amount of diffusion of corrective information required to minimize the impact on the real world depends on the amount of the diffusion of misinformation. △ Less

Submitted 24 January, 2021; originally announced January 2021.

arXiv:2012.13992 [pdf, other]

doi 10.1109/WIIAT50758.2020.00033

Analysis of Short Dwell Time in Relation to User Interest in a News Application

Authors: Ryosuke Homma, Yoshifumi Seki, Mitsuo Yoshida, Kyoji Umemura

Abstract: Dwell time has been widely used in various fields to evaluate content quality and user engagement. Although many studies shown that content with long dwell time is good quality, contents with short dwell time have not been discussed in detail. We hypothesize that content with short dwell time is not always low quality and does not always have low user engagement, but is instead related to user int… ▽ More Dwell time has been widely used in various fields to evaluate content quality and user engagement. Although many studies shown that content with long dwell time is good quality, contents with short dwell time have not been discussed in detail. We hypothesize that content with short dwell time is not always low quality and does not always have low user engagement, but is instead related to user interest. The purpose of this study is to clarify the meanings of short dwell time browsing in mobile news application. First, we analyze the relation of short dwell time to user interest using large scale user behavior logs from a mobile news application. This analysis was conducted on a vector space based on users click histories and then users and articles were mapped in the same space. The users with short dwell time are concentrated on a specific position in this space; thus, the length of dwell time is related to their interest. Moreover, we also analyze the characteristics of short dwell time browsing by excluding these browses from their click histories. Surprisingly, excluding short dwell time click history, it was found that short dwell time click history included some aspect of user interest in 30.87% of instances where the cluster of users changed. These findings demonstrate that short dwell time does not always indicate a low level of user engagement, but also level of user interest. △ Less

Submitted 27 December, 2020; originally announced December 2020.

Comments: The 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT '20), Best in Practice Paper Award

arXiv:2012.13990 [pdf, other]

doi 10.1109/WIIAT50758.2020.00084

The metrics of keywords to understand the difference between Retweet and Like in each category

Authors: Kenshin Sekimoto, Yoshifumi Seki, Mitsuo Yoshida, Kyoji Umemura

Abstract: The purpose of this study is to clarify what kind of news is easily retweeted and what kind of news is easily Liked. We believe these actions, retweeting and Liking, have different meanings for users. Understanding this difference is important for understanding people's interest in Twitter. To analyze the difference between retweets (RT) and Likes on Twitter in detail, we focus on word appearances… ▽ More The purpose of this study is to clarify what kind of news is easily retweeted and what kind of news is easily Liked. We believe these actions, retweeting and Liking, have different meanings for users. Understanding this difference is important for understanding people's interest in Twitter. To analyze the difference between retweets (RT) and Likes on Twitter in detail, we focus on word appearances in news titles. First, we calculate basic statistics and confirm that tweets containing news URLs have different RT and Like tendencies compared to other tweets. Next, we compared RTs and Likes for each category and confirmed that the tendency of categories is different. Therefore, we propose metrics for clarifying the differences in each action for each category used in the $χ$-square test in order to perform an analysis focusing on the topic. The proposed metrics are more useful than simple counts and TF-IDF for extracting meaningful words to understand the difference between RTs and Likes. We analyzed each category using the proposed metrics and quantitatively confirmed that the difference in the role of retweeting and Liking appeared in the content depending on the category. Moreover, by aggregating tweets chronologically, the results showed the trend of RT and Like as a list of words and clarified how the characteristic words of each week were related to current events for retweeting and Liking. △ Less

Submitted 27 December, 2020; originally announced December 2020.

Comments: The 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT '20)

arXiv:2008.09830 [pdf]

Brushing Feature Values in Immersive Graph Visualization Environment

Authors: Hinako Sassa, Maxime Cordeil, Mitsuo Yoshida, Takayuki Itoh

Abstract: There are a variety of graphs where multidimensional feature values are assigned to the nodes. Visualization of such datasets is not an easy task since they are complex and often huge. Immersive Analytics is a powerful approach to support the interactive exploration of such large and complex data. Many recent studies on graph visualization have applied immersive analytics frameworks. However, ther… ▽ More There are a variety of graphs where multidimensional feature values are assigned to the nodes. Visualization of such datasets is not an easy task since they are complex and often huge. Immersive Analytics is a powerful approach to support the interactive exploration of such large and complex data. Many recent studies on graph visualization have applied immersive analytics frameworks. However, there have been few studies on immersive analytics for visualization of multidimensional attributes associated with the input graphs. This paper presents a new immersive analytics system that supports the interactive exploration of multidimensional feature values assigned to the nodes of input graphs. The presented system displays label-axes corresponding to the dimensions of feature values, and label-edges that connect label-axes and corresponding to the nodes. The system supports brushing operations which controls the display of edges that connect a label-axis and nodes of the graph. This paper introduces visualization examples with a graph dataset of Twitter users and reviews by experts on graph data analysis. △ Less

Submitted 22 August, 2020; originally announced August 2020.

Comments: 5 pages, 7 figures

arXiv:2008.03711 [pdf]

Agricultural Knowledge Management Using Smart Voice Messaging Systems: Combination of Physical and Human Sensors

Authors: Naoshi Uchihira, Masami Yoshida

Abstract: The use of the Internet of Things (IoT) in agricultural knowledge management systems is one of the most promising approaches to increasing the efficiency of agriculture. However, the existing physical sensors in agriculture are limited for monitoring various changes in the characteristics of crops and may be expensive for the average farmer. We propose a combination of physical and human sensors (… ▽ More The use of the Internet of Things (IoT) in agricultural knowledge management systems is one of the most promising approaches to increasing the efficiency of agriculture. However, the existing physical sensors in agriculture are limited for monitoring various changes in the characteristics of crops and may be expensive for the average farmer. We propose a combination of physical and human sensors (the five human senses). By using their own eyes, ears, noses, tongues, and fingers, farmers could check the various changes in the characteristics and conditions (colors of leaves, diseases, pests, faulty or malfunctioning equipment) of their crops and equipment, verbally describe their observations, and capture the descriptions with audio recording devices, such as smartphones. The voice recordings could be transcribed into text by web servers. The data captured by the physical and human sensors (voice messages) are analyzed by data and text mining to create and improve agricultural knowledge. An agricultural knowledge management system using physical and human sensors encourages to share and transfer knowledge among farmers for the purpose of improving the efficiency and productivity of agriculture. We applied one such agricultural knowledge management system (smart voice messaging system) to a greenhouse vegetable farm in Hokkaido. A qualitative analysis of accumulated voice messages and an interview with the farmer demonstrated the effectiveness of this system. The contributions of this study include a new and practical approach to an "agricultural Internet of Everything (IoE)" and evidence of its effectiveness as a result of our trial experiment at a real vegetable farm. △ Less

Submitted 9 August, 2020; originally announced August 2020.

arXiv:1910.13195 [pdf, other]

doi 10.1145/3358695.3360930

User's Centrality Analysis for Home Location Estimation

Authors: Shiori Hironaka, Mitsuo Yoshida, Kyoji Umemura

Abstract: User attributes, such as home location, are useful for many applications. Many researchers have been tackling how to estimate users' home locations using relationships among users. It is known that the home locations of certain users, such as celebrities, are hard to estimate using relationships. However, because estimating the home locations of all celebrities is not actually hard, it is importan… ▽ More User attributes, such as home location, are useful for many applications. Many researchers have been tackling how to estimate users' home locations using relationships among users. It is known that the home locations of certain users, such as celebrities, are hard to estimate using relationships. However, because estimating the home locations of all celebrities is not actually hard, it is important to clarify the characteristics of users whose home locations are hard to estimate. We analyze whether centralities, which represent users' characteristics, and the tendency to have the same home locations as friends are related. The results indicate that PageRank and HITS scores are related to whether users have the same home location as friends, and that users with higher HITS scores have the same home location as their friends less often. This result indicates that there are two types of users whose home locations are difficult to estimate: hub users who follow many celebrities and authority users who are celebrities. △ Less

Submitted 29 October, 2019; originally announced October 2019.

Comments: published at ABCSS 2019 on WI 2019

arXiv:1909.11341 [pdf, other]

doi 10.1109/ICAICTA.2019.8904173

Usefulness of Instructor Annotations on Flipped Learning Preparation Video System

Authors: Shintaro Uchiyama, Hayato Okumoto, Mitsuo Yoshida, Yuko Ichikawa, Kyoji Umemura

Abstract: Flipped learning is a method that flips in/out class activities to make lectures learner-centered. In flipped learning, comments from learners on preparation material are useful information for instructors to consider before deciding in-class topics. Thus, we arrive at the notion that receiving comments from instructors will be effective for learners watching the video. By including annotations fr… ▽ More Flipped learning is a method that flips in/out class activities to make lectures learner-centered. In flipped learning, comments from learners on preparation material are useful information for instructors to consider before deciding in-class topics. Thus, we arrive at the notion that receiving comments from instructors will be effective for learners watching the video. By including annotations from instructors, we propose to improve the quality of content for learners and thus enhance learners' motivation and study satisfaction. To achieve this, we introduced "Steering Mark," a tool that enables learners to easily grasp the overall structure of a video, to the video learning system. We examined the effectiveness and influence of Steering Mark through an experiment with 34 undergraduate learners. As a result, Steering Mark was found to be useful in improving the quality of video content for learners. △ Less

Submitted 25 September, 2019; originally announced September 2019.

Comments: The 2019 International Conference on Advanced Informatics: Concepts, Theory and Applications

arXiv:1909.00554 [pdf, other]

doi 10.1109/bigdata.2018.8622482

Analysis of Bias in Gathering Information Between User Attributes in News Application

Authors: Yoshifumi Seki, Mitsuo Yoshida

Abstract: In the process of information gathering on the web, confirmation bias is known to exist, exemplified in phenomena such as echo chambers and filter bubbles. Our purpose is to reveal how people consume news and discuss these phenomena. In web services, we are able to use action logs of a service to investigate these phenomena. However, many existing studies about these phenomena are conducted via qu… ▽ More In the process of information gathering on the web, confirmation bias is known to exist, exemplified in phenomena such as echo chambers and filter bubbles. Our purpose is to reveal how people consume news and discuss these phenomena. In web services, we are able to use action logs of a service to investigate these phenomena. However, many existing studies about these phenomena are conducted via questionnaires, and there are few studies using action logs. In this paper, we attempt to discover biases of information gathering due to differences in user demographic attributes, such as age and gender, from the behavior log of the news distribution service. First, we summarized the actions in the service for each user attribute and showed the difference of user behavior depending on the attributes. Next, the degree of correlation between the attributes was measured using the correlation coefficient, and a strong correlation was found to exist in the browsing tendency of the news articles between the attributes. Then, the bias of keywords between attributes was discovered, keywords with bias in behavior among the attributes were found using parameters of regression analysis. Since these discovered keywords are almost explainable by big news, our proposed method is effective in detecting biased keywords. △ Less

Submitted 2 September, 2019; originally announced September 2019.

Comments: 8 pages, 13 figure, IEEE BigData 2018 Workshop : The 3rd International Workshop on Application of Big Data for Computational Social Science (ABCSS2018)

arXiv:1908.08690 [pdf, other]

doi 10.1109/WI.2018.000-3

Analysis of User Dwell Time by Category in News Application

Authors: Yoshifumi Seki, Mitsuo Yoshida

Abstract: Dwell time indicates how long a user looked at a page, and this is used especially in fields where ratings from users such as search engines, recommender systems, and advertisements are important. Despite the importance of this index, however, its characteristics are not well known. In this paper, we analyze the dwell time of news pages according to category in smartphone application. Our aim is t… ▽ More Dwell time indicates how long a user looked at a page, and this is used especially in fields where ratings from users such as search engines, recommender systems, and advertisements are important. Despite the importance of this index, however, its characteristics are not well known. In this paper, we analyze the dwell time of news pages according to category in smartphone application. Our aim is to clarify the characteristics of dwell time and the relation between length of news page and dwell time, for each category. The results indicated different dwell time trends for each category. For example, the social category had fewer news pages with shorter dwell time than peaks, compared to other categories, and there were a few news pages with remarkably short dwell time. We also found a large difference by category in the correlation value between dwell time and length of news page. Specifically, political news had the highest correlation value and technology news had the lowest. In addition, we found that a user tends to get sufficient information about the news content from the news title in short dwell times. △ Less

Submitted 23 August, 2019; originally announced August 2019.

Comments: 4 pages, 3 figures, WI 2018 Workshop : The International Workshop on Web Personalization, Recommender Systems, and Social Media (WPRSM2018)

arXiv:1906.04655 [pdf, ps, other]

doi 10.23919/APSIPA.2018.8659765

Journal Name Extraction from Japanese Scientific News Articles

Authors: Masato Kikuchi, Mitsuo Yoshida, Kyoji Umemura

Abstract: In Japanese scientific news articles, although the research results are described clearly, the article's sources tend to be uncited. This makes it difficult for readers to know the details of the research. In this paper, we address the task of extracting journal names from Japanese scientific news articles. We hypothesize that a journal name is likely to occur in a specific context. To support the… ▽ More In Japanese scientific news articles, although the research results are described clearly, the article's sources tend to be uncited. This makes it difficult for readers to know the details of the research. In this paper, we address the task of extracting journal names from Japanese scientific news articles. We hypothesize that a journal name is likely to occur in a specific context. To support the hypothesis, we construct a character-based method and extract journal names using this method. This method only uses the left and right context features of journal names. The results of the journal name extractions suggest that the distribution hypothesis plays an important role in identifying the journal names. △ Less

Submitted 11 June, 2019; originally announced June 2019.

Comments: The Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018)

arXiv:1903.01704 [pdf, other]

doi 10.1109/BigData.2018.8622239

Analysis of the Influence of Internet TV Station on Wikipedia Page Views

Authors: Hiroshi Hayano, Masanori Takano, Soichiro Morishita, Mitsuo Yoshida, Kyoji Umemura

Abstract: We aim to investigate the influence of television on the web; if the influence is strong, a viral effect may be expected. In this paper, we focus on the Internet TV station and on Wikipedia use as exploratory behavior on the web. We analyzed the influence of Internet TV station on Wikipedia page views. Our aim is to clarify the characteristics of page views as related to Internet TV station in ord… ▽ More We aim to investigate the influence of television on the web; if the influence is strong, a viral effect may be expected. In this paper, we focus on the Internet TV station and on Wikipedia use as exploratory behavior on the web. We analyzed the influence of Internet TV station on Wikipedia page views. Our aim is to clarify the characteristics of page views as related to Internet TV station in order to index outward impact and develop a prediction model. The results indicate that there is a correlation between TV viewership and page views. Moreover we find that the time lag between TV and web gradually reduce as broadcasts begin after 9:00; after 23:00, page views tend to be maximized during the broadcast itself. We also differentiate between page views on PC and on mobile and find that PC pages tend to be accessed more during the daytime. In addition, we consider the number of broadcasts per program, and observe that viewership tends to stabilize as the number of broadcasts increases but that page views tend to decrease. △ Less

Submitted 5 March, 2019; originally announced March 2019.

Comments: The 3rd International Workshop on Application of Big Data for Computational Social Science (ABCSS2018)

arXiv:1903.00213 [pdf, other]

doi 10.1109/BigData.2018.8621950

Analysis of User Dwell Time on Non-News Pages

Authors: Ryosuke Homma, Keiichi Soejima, Mitsuo Yoshida, Kyoji Umemura

Abstract: There is dwell time as one of the indicators of user's behavior, and this indicates how long a user looked at a page. Dwell time is especially useful in fields where user ratings are important, such as search engines, recommender systems, and advertisements are important. Despite the importance of this index, however, its characteristics are not well known. In this paper, we analyze the dwell time… ▽ More There is dwell time as one of the indicators of user's behavior, and this indicates how long a user looked at a page. Dwell time is especially useful in fields where user ratings are important, such as search engines, recommender systems, and advertisements are important. Despite the importance of this index, however, its characteristics are not well known. In this paper, we analyze the dwell times of various websites by desktop and mobile devices using data of one year. Our aim is to clarify the characteristics of dwell time on non-news websites in order to discover which features are effective for predicting the dwell time. In this analysis, we focus on device types, access times, behavior on the website, and scroll depth. The results indicated that the number of sessions decreased as the dwell time increased, for both desktop and mobile devices. We also found that hour and month greatly affected the dwell time, but day of the week had little effect. Moreover, we discovered that inside and click users tended to have longer dwell times than outside and non-click users. However, we can not find a relationship between dwell time and scroll depth. This is because even if a user browsed the bottom of the page, the user might not necessarily have read the entire page. △ Less

Submitted 1 March, 2019; originally announced March 2019.

Comments: IEEE BigData 2018 Workshop : The 3rd International Workshop on Application of Big Data for Computational Social Science (ABCSS2018). 2018

arXiv:1812.06698 [pdf, other]

doi 10.1109/WI.2018.000-2

Analysis of Political Party Twitter Accounts' Retweeters During Japan's 2017 Election

Authors: Mitsuo Yoshida, Fujio Toriumi

Abstract: In modern election campaigns, political parties utilize social media to advertise their policies and candidates and to communicate to the electorate. In Japan's latest general election in 2017, the 48th general election for the Lower House, social media, especially Twitter, was actively used. In this paper, we analyze the users who retweeted tweets of political parties on Twitter during the electi… ▽ More In modern election campaigns, political parties utilize social media to advertise their policies and candidates and to communicate to the electorate. In Japan's latest general election in 2017, the 48th general election for the Lower House, social media, especially Twitter, was actively used. In this paper, we analyze the users who retweeted tweets of political parties on Twitter during the election. Our aim is to clarify what kinds of users are diffusing (retweeting) tweets of political parties. The results indicate that the characteristics of retweeters of the largest ruling party (Liberal Democratic Party of Japan) and the largest opposition party (The Constitutional Democratic Party of Japan) were similar, even though the retweeters did not overlap each other. We also found that a particular opposition party (Japanese Communist Party) had quite different characteristics from other political parties. △ Less

Submitted 17 December, 2018; originally announced December 2018.

Comments: WI 2018 Workshop : The International Workshop on Web Personalization, Recommender Systems, and Social Media (WPRSM2018)

arXiv:1809.09514 [pdf, other]

doi 10.1007/978-3-030-01159-8_32

Information Diffusion Power of Political Party Twitter Accounts During Japan's 2017 Election

Authors: Mitsuo Yoshida, Fujio Toriumi

Abstract: In modern election campaigns, political parties utilize social media to advertise their policies and candidates and to communicate to electorates. In Japan's latest general election in 2017, the 48th general election for the Lower House, social media, especially Twitter, was actively used. In this paper, we perform a detailed analysis of social graphs and users who retweeted tweets of political pa… ▽ More In modern election campaigns, political parties utilize social media to advertise their policies and candidates and to communicate to electorates. In Japan's latest general election in 2017, the 48th general election for the Lower House, social media, especially Twitter, was actively used. In this paper, we perform a detailed analysis of social graphs and users who retweeted tweets of political parties during the election. Our aim is to obtain accurate information regarding the diffusion power for each party rather than just the number of followers. The results indicate that a user following a user who follows a political party account tended to also follow the account. This means that it does not increase diversity because users who follow each other tend to share similar values. We also find that followers of a specific party frequently retweeted the tweets. However, since users following the user who follow a political party account are not diverse, political parties delivered the information only to a few political detachment users. △ Less

Submitted 25 September, 2018; originally announced September 2018.

Comments: The 10th International Conference on Social Informatics (SocInfo 2018)

arXiv:1808.07227 [pdf, other]

doi 10.1109/ICAICTA.2018.8541338

Response Collector: A Video Learning System for Flipped Classrooms

Authors: Hayato Okumoto, Mitsuo Yoshida, Kyoji Umemura, Yuko Ichikawa

Abstract: The flipped classroom has become famous as an effective educational method that flips the purpose of classroom study and homework. In this paper, we propose a video learning system for flipped classrooms, called Response Collector, which enables students to record their responses to preparation videos. Our system provides response visualization for teachers and students to understand what they hav… ▽ More The flipped classroom has become famous as an effective educational method that flips the purpose of classroom study and homework. In this paper, we propose a video learning system for flipped classrooms, called Response Collector, which enables students to record their responses to preparation videos. Our system provides response visualization for teachers and students to understand what they have acquired and questioned. We performed a practical user study of our system in a flipped classroom setup. The results show that students preferred to use the proposed method as the inputting method, rather than naive methods. Moreover, sharing responses among students was helpful for resolving individual students' questions, and students were satisfied with the use of our system. △ Less

Submitted 22 August, 2018; originally announced August 2018.

Comments: The 2018 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2018)

arXiv:1806.10173 [pdf, other]

Do Political Detachment Users Receive Various Political Information on Social Media?

Authors: Mitsuo Yoshida, Fujio Toriumi

Abstract: In the election, political parties communicate political information to people through social media. The followers receive the information, but can users who are not followers, political detachment users, receive the information? We focus on political detachment users who do not follow any political parties, and tackle the following research question: do political detachment users receive various… ▽ More In the election, political parties communicate political information to people through social media. The followers receive the information, but can users who are not followers, political detachment users, receive the information? We focus on political detachment users who do not follow any political parties, and tackle the following research question: do political detachment users receive various political information during the election period? The results indicate that the answer is No. We determined that the political detachment users only receive the information of a few political parties. △ Less

Submitted 26 June, 2018; originally announced June 2018.

Comments: AAAI ICWSM 2018 Workshop : The 3rd International Workshop on Event Analytics using Social Media Data (EASM 2018)

arXiv:1804.05486 [pdf, other]

doi 10.1109/ICAICTA.2017.8090990

Computing Information Quantity as Similarity Measure for Music Classification Task

Authors: Ayaka Takamoto, Mitsuo Yoshida, Kyoji Umemura, Yuko Ichikawa

Abstract: This paper proposes a novel method that can replace compression-based dissimilarity measure (CDM) in composer estimation task. The main features of the proposed method are clarity and scalability. First, since the proposed method is formalized by the information quantity, reproduction of the result is easier compared with the CDM method, where the result depends on a particular compression program… ▽ More This paper proposes a novel method that can replace compression-based dissimilarity measure (CDM) in composer estimation task. The main features of the proposed method are clarity and scalability. First, since the proposed method is formalized by the information quantity, reproduction of the result is easier compared with the CDM method, where the result depends on a particular compression program. Second, the proposed method has a lower computational complexity in terms of the number of learning data compared with the CDM method. The number of correct results was compared with that of the CDM for the composer estimation task of five composers of 75 piano musical scores. The proposed method performed better than the CDM method that uses the file size compressed by a particular program. △ Less

Submitted 15 April, 2018; originally announced April 2018.

Comments: The 2017 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2017)

arXiv:1711.09251 [pdf, other]

doi 10.1109/BigData.2017.8258287

When Do Users Change Their Profile Information on Twitter?

Authors: **sei Shima, Mitsuo Yoshida, Kyoji Umemura

Abstract: We can see profile information such as name, description and location in order to know the user on social media. However, this profile information is not always fixed. If there is a change in the user's life, the profile information will be changed. In this study, we focus on user's profile information changes and analyze the timing and reasons for these changes on Twitter. The results indicate th… ▽ More We can see profile information such as name, description and location in order to know the user on social media. However, this profile information is not always fixed. If there is a change in the user's life, the profile information will be changed. In this study, we focus on user's profile information changes and analyze the timing and reasons for these changes on Twitter. The results indicate that the peak of profile information change occurs in April among Japanese users, but there was no such trend observed for English users throughout the year. Our analysis also shows that English users most frequently change their names on their birthdays, while Japanese users change their names as their Twitter engagement and activities decrease over time. △ Less

Submitted 25 November, 2017; originally announced November 2017.

Comments: IEEE BigData 2017 Workshop : The 2nd International Workshop on Application of Big Data for Computational Social Science (accepted)

arXiv:1710.01446 [pdf, other]

Improving Compression Based Dissimilarity Measure for Music Score Analysis

Authors: Ayaka Takamoto, Mayu Umemura, Mitsuo Yoshida, Kyoji Umemura

Abstract: In this paper, we propose a way to improve the compression based dissimilarity measure, CDM. We propose to use a modified value of the file size, where the original CDM uses an unmodified file size. Our application is a music score analysis. We have chosen piano pieces from five different composers. We have selected 75 famous pieces (15 pieces for each composer). We computed the distances among al… ▽ More In this paper, we propose a way to improve the compression based dissimilarity measure, CDM. We propose to use a modified value of the file size, where the original CDM uses an unmodified file size. Our application is a music score analysis. We have chosen piano pieces from five different composers. We have selected 75 famous pieces (15 pieces for each composer). We computed the distances among all pieces by using the modified CDM. We use the K-nearest neighbor method when we estimate the composer of each piece of music. The modified CDM shows improved accuracy. The difference is statistically significant. △ Less

Submitted 3 October, 2017; originally announced October 2017.

Comments: The 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2016)

arXiv:1709.08858 [pdf, other]

doi 10.1109/KST.2017.7886073

Polysemy Detection in Distributed Representation of Word Sense

Authors: Kana Oomoto, Haruka Oikawa, Eiko Yamamoto, Mitsuo Yoshida, Masayuki Okabe, Kyoji Umemura

Abstract: In this paper, we propose a statistical test to determine whether a given word is used as a polysemic word or not. The statistic of the word in this test roughly corresponds to the fluctuation in the senses of the neighboring words a nd the word itself. Even though the sense of a word corresponds to a single vector, we discuss how polysemy of the words affects the position of vectors. Finally, we… ▽ More In this paper, we propose a statistical test to determine whether a given word is used as a polysemic word or not. The statistic of the word in this test roughly corresponds to the fluctuation in the senses of the neighboring words a nd the word itself. Even though the sense of a word corresponds to a single vector, we discuss how polysemy of the words affects the position of vectors. Finally, we also explain the method to detect this effect. △ Less

Submitted 26 September, 2017; originally announced September 2017.

Comments: The 9th International Conference on Knowledge and Smart Technology (KST-2017)

arXiv:1709.08340 [pdf, other]

doi 10.1109/ICAICTA.2016.7803120

Realizing Half-Diminished Reality from Video Stream of Manipulating Objects

Authors: Hayato Okumoto, Mitsuo Yoshida, Kyoji Umemura

Abstract: When we watch a video, in which human hands manipulate objects, these hands may obscure some parts of those objects. We are willing to make clear how the objects are manipulated by making the image of hands semi-transparent, and showing the complete images of the hands and the object. By carefully choosing a Half-Diminished Reality method, this paper proposes a method that can process the video in… ▽ More When we watch a video, in which human hands manipulate objects, these hands may obscure some parts of those objects. We are willing to make clear how the objects are manipulated by making the image of hands semi-transparent, and showing the complete images of the hands and the object. By carefully choosing a Half-Diminished Reality method, this paper proposes a method that can process the video in real time and verifies that the proposed method works well. △ Less

Submitted 25 September, 2017; originally announced September 2017.

Comments: The 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2016)

arXiv:1709.08314 [pdf]

doi 10.1109/ICAICTA.2015.7335387

Confidence Interval of Probability Estimator of Laplace Smoothing

Authors: Masato Kikuchi, Mitsuo Yoshida, Masayuki Okabe, Kyoji Umemura

Abstract: Sometimes, we do not use a maximum likelihood estimator of a probability but it's a smoothed estimator in order to cope with the zero frequency problem. This is often the case when we use the Naive Bayes classifier. Laplace smoothing is a popular choice with the value of Laplace smoothing estimator being the expected value of posterior distribution of the probability where we assume that the prior… ▽ More Sometimes, we do not use a maximum likelihood estimator of a probability but it's a smoothed estimator in order to cope with the zero frequency problem. This is often the case when we use the Naive Bayes classifier. Laplace smoothing is a popular choice with the value of Laplace smoothing estimator being the expected value of posterior distribution of the probability where we assume that the prior is uniform distribution. In this paper, we investigate the confidence intervals of the estimator of Laplace smoothing. We show that the likelihood function for this confidence interval is the same as the likelihood of a maximum likelihood estimated value of a probability of Bernoulli trials. Although the confidence interval of the maximum likelihood estimator of the Bernoulli trial probability has been studied well, and although the approximate formulas for the confidence interval are well known, we cannot use the interval of maximum likelihood estimator since the interval contains the value 0, which is not suitable for the Naive Bayes classifier. We are also interested in the accuracy of existing approximation methods since these approximation methods are frequently used but their accuracy is not well discussed. Thus, we obtain the confidence interval by numerically integrating the likelihood function. In this paper, we report the difference between the confidence interval that we computed and the confidence interval by approximate formulas. Finally, we include a URL, where all of the intervals that we computed are available. △ Less

Submitted 25 September, 2017; originally announced September 2017.

Comments: The 2015 International Conference on Advanced Informatics: Concepts, Theory and Application (ICAICTA2015)

arXiv:1709.08309 [pdf, other]

doi 10.1109/ICAICTA.2016.7803098

Using Conservative Estimation for Conditional Probability instead of Ignoring Infrequent Case

Authors: Masato Kikuchi, Eiko Yamamoto, Mitsuo Yoshida, Masayuki Okabe, Kyoji Umemura

Abstract: There are several estimators of conditional probability from observed frequencies of features. In this paper, we propose using the lower limit of confidence interval on posterior distribution determined by the observed frequencies to ascertain conditional probability. In our experiments, this method outperformed other popular estimators. There are several estimators of conditional probability from observed frequencies of features. In this paper, we propose using the lower limit of confidence interval on posterior distribution determined by the observed frequencies to ascertain conditional probability. In our experiments, this method outperformed other popular estimators. △ Less

Submitted 25 September, 2017; originally announced September 2017.

Comments: The 2016 International Conference on Advanced Informatics: Concepts, Theory and Application (ICAICTA2016)

arXiv:1709.00714 [pdf, other]

doi 10.1109/ICAICTA.2017.8090972

Home Location Estimation Using Weather Observation Data

Authors: Yuki Kondo, Masatsugu Hangyo, Mitsuo Yoshida, Kyoji Umemura

Abstract: We can extract useful information from social media data by adding the user's home location. However, since the user's home location is generally not publicly available, many researchers have been attempting to develop a more accurate home location estimation. In this study, we propose a method to estimate a Twitter user's home location by using weather observation data from AMeDAS. In our method,… ▽ More We can extract useful information from social media data by adding the user's home location. However, since the user's home location is generally not publicly available, many researchers have been attempting to develop a more accurate home location estimation. In this study, we propose a method to estimate a Twitter user's home location by using weather observation data from AMeDAS. In our method, we first estimate the weather of the area posted by an estimation target user by using the tweet, Next, we check out the estimated weather against weather observation data, and narrow down the area posted by the user. Finally, the user's home location is estimated as which areas the user frequently posts from. In our experiments, the results indicate that our method functions effectively and also demonstrate that accuracy improves under certain conditions. △ Less

Submitted 3 September, 2017; originally announced September 2017.

Comments: The 2017 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2017)

arXiv:1608.08331 [pdf, other]

doi 10.1109/ICAICTA.2016.7803100

Analysis of Home Location Estimation with Iteration on Twitter Following Relationship

Authors: Shiori Hironaka, Mitsuo Yoshida, Kyoji Umemura

Abstract: User's home locations are used by numerous social media applications, such as social media analysis. However, since the user's home location is not generally open to the public, many researchers have been attempting to develop a more accurate home location estimation. A social network that expresses relationships between users is used to estimate the users' home locations. The network-based home l… ▽ More User's home locations are used by numerous social media applications, such as social media analysis. However, since the user's home location is not generally open to the public, many researchers have been attempting to develop a more accurate home location estimation. A social network that expresses relationships between users is used to estimate the users' home locations. The network-based home location estimation method with iteration, which propagates the estimated locations, is used to estimate more users' home locations. In this study, we analyze the function of network-based home location estimation with iteration while using the social network based on following relationships on Twitter. The results indicate that the function that selects the most frequent location among the friends' location has the best accuracy. Our analysis also shows that the 88% of users, who are in the social network based on following relationships, has at least one correct home location within one-hop (friends and friends of friends). According to this characteristic of the social network, we indicate that twice is sufficient for iteration. △ Less

Submitted 30 August, 2016; originally announced August 2016.

Comments: The 2016 International Conference on Advanced Informatics: Concepts, Theory and Application (ICAICTA2016)

arXiv:1509.02218 [pdf]

doi 10.1145/2786451.2786495

Wikipedia Page View Reflects Web Search Trend

Authors: Mitsuo Yoshida, Yuki Arase, Takaaki Tsunoda, Mikio Yamamoto

Abstract: The frequency of a web search keyword generally reflects the degree of public interest in a particular subject matter. Search logs are therefore useful resources for trend analysis. However, access to search logs is typically restricted to search engine providers. In this paper, we investigate whether search frequency can be estimated from a different resource such as Wikipedia page views of open… ▽ More The frequency of a web search keyword generally reflects the degree of public interest in a particular subject matter. Search logs are therefore useful resources for trend analysis. However, access to search logs is typically restricted to search engine providers. In this paper, we investigate whether search frequency can be estimated from a different resource such as Wikipedia page views of open data. We found frequently searched keywords to have remarkably high correlations with Wikipedia page views. This suggests that Wikipedia page views can be an effective tool for determining popular global web search trends. △ Less

Submitted 7 September, 2015; originally announced September 2015.

Comments: 2 pages, 4 figures, The 2015 ACM Web Science Conference (WebSci15)

ACM Class: H.3.3; H.3.5

Showing 1–41 of 41 results for author: Yoshida, M