Search | arXiv e-print repository

Token-based Decision Criteria Are Suboptimal in In-context Learning

Authors: Hakaze Cho, Yoshihiro Sakai, Mariko Kato, Kenshiro Tanaka, Akira Ishii, Naoya Inoue

Abstract: In-Context Learning (ICL) typically utilizes classification criteria from probabilities of manually selected label tokens. However, we argue that such token-based classification criteria lead to suboptimal decision boundaries, despite delicate calibrations through translation and constrained rotation. To address this problem, we propose Hidden Calibration, which renounces token probabilities and uses the nearest centroid classifier on the LM's last hidden states. In detail, we use the nearest centroid classification on the hidden states, assigning the category of the nearest centroid previously observed from a few-shot calibration set to the test sample as the predicted label. Our experiments on 3 models and 10 classification datasets indicate that Hidden Calibration consistently outperforms current token-based calibrations by about 20%. Our further analysis demonstrates that Hidden Calibration finds better classification criteria with less inter-categories overlap, and LMs provide linearly separable intra-category clusters with the help of demonstrations, which supports Hidden Calibration and gives new insights into the conventional ICL.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 21 pages, 14 figures, 8 tables
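
Note: the nearest centroid rule described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation; the function names `fit_centroids` and `predict`, the use of Euclidean distance, and the toy 2-D vectors standing in for the LM's last hidden states are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def fit_centroids(hidden_states, labels):
    """Average the calibration vectors of each label into one centroid."""
    sums = {}
    counts = defaultdict(int)
    for vec, label in zip(hidden_states, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def predict(centroids, vec):
    """Return the label whose centroid is closest (Euclidean distance, an assumption)."""
    return min(centroids, key=lambda label: math.dist(centroids[label], vec))


# Hypothetical few-shot calibration set: toy vectors, not real hidden states.
calibration = [[0.0, 0.0], [0.2, 0.0], [1.0, 1.0], [1.0, 0.8]]
calibration_labels = ["negative", "negative", "positive", "positive"]

centroids = fit_centroids(calibration, calibration_labels)
prediction = predict(centroids, [0.9, 0.7])  # test sample near the "positive" cluster
```

In practice the vectors would be extracted from the language model for each calibration and test prompt; that extraction step is outside this sketch.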

arXiv:2305.01828 [pdf, other]

ns-3 Implementation of Sub-Terahertz and Millimeter Wave Drop-based NYU Channel Model (NYUSIM)

Authors: Hitesh Poddar, Tomoki Yoshimura, Matteo Pagin, Theodore S Rappaport, Art Ishii, Michele Zorzi

Abstract: The next generation of wireless networks will use sub-THz frequencies alongside mmWave frequencies to enable multi-Gbps and low-latency applications. To enable different verticals and use cases, engineers must take a holistic approach to build, analyze, and study different parts of the network and the interplay among the lower and higher layers of the protocol stack. It is of paramount importance to accurately characterize the radio propagation in diverse scenarios such as urban microcell (UMi), urban macrocell (UMa), rural macrocell (RMa), indoor hotspot (InH), and indoor factory (InF) for a wide range of frequencies. The 3GPP statistical channel model (SCM) is oversimplified and restricted to the frequency range of 0.5-100 GHz. Thus, to overcome these limitations, this paper presents a detailed implementation of the drop-based NYU channel model (NYUSIM) for the frequency range of 0.5-150 GHz for the UMi, UMa, RMa, InH, and InF scenarios. NYUSIM allows researchers to design and evaluate new algorithms and protocols for future sub-THz wireless networks in ns-3.

Submitted 2 May, 2023; originally announced May 2023.

arXiv:2302.12385 [pdf, ps, other]

Full-Stack End-To-End mmWave Simulations Using 3GPP and NYUSIM Channel Model in ns-3

Authors: H. Poddar, T. Yoshimura, M. Pagin, T. S. Rappaport, A. Ishii, M. Zorzi

Abstract: Accurate channel modeling and simulation tools are vital for studying sub-THz and millimeter (mmWave) wideband communication system performance. To accurately design future high data rate, low latency wireless modems, the entire protocol stack must be appropriately modeled to understand how the physical layer impacts the end-to-end performance experienced by the end user. This paper presents a full stack end-to-end performance analysis in ns-3 using drop-based NYU channel model (NYUSIM) and 3GPP statistical channel model (SCM) in scenarios, namely urban microcell (UMi), urban macrocell (UMa), rural macrocell (RMa), and indoor hotspot (InH) at 28 GHz with 100 MHz bandwidth. Video data is transmitted at 50 Mbps using User Datagram Protocol (UDP), and we observe that the RMa channel is benign in non-line of sight (NLOS) for NYUSIM and 3GPP SCM as it exhibits no packet drops and yields maximum throughput (48.1 Mbps) and latency of $\sim$ 20 ms. In NLOS, for NYUSIM, the UMa and RMa channels are similar in terms of throughput and packet drops, and the latency in UMi and InH scenarios is 10 times and 25 times higher respectively compared to UMa. Our results indicate that mmWave bands can support data rates of 50 Mbps with negligible packet drops and latency below 150 ms in all scenarios using NYUSIM.

Submitted 5 March, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

Comments: ICC 2023 - 2023 IEEE International Conference on Communications

arXiv:2009.14573 [pdf]

Rain-Code Fusion : Code-to-code ConvLSTM Forecasting Spatiotemporal Precipitation

Authors: Takato Yasuno, Akira Ishii, Masazumi Amakata

Abstract: Recently, flood damage has become a social problem owing to unexperienced weather conditions arising from climate change. An immediate response to heavy rain is important for the mitigation of economic losses and also for rapid recovery. Spatiotemporal precipitation forecasts may enhance the accuracy of dam inflow prediction, more than 6 hours forward for flood damage mitigation. However, the ordinary ConvLSTM has the limitation of predictable range more than 3-timesteps in real-world precipitation forecasting owing to the irreducible bias between target prediction and ground-truth value. This paper proposes a rain-code approach for spatiotemporal precipitation code-to-code forecasting. We propose a novel rainy feature that represents a temporal rainy process using multi-frame fusion for the timestep reduction. We perform rain-code studies with various term ranges based on the standard ConvLSTM. We applied to a dam region within the Japanese rainy term hourly precipitation data, under 2006 to 2019 approximately 127 thousands hours, every year from May to October. We apply the radar analysis hourly data on the central broader region with an area of 136 x 148 km². Finally we have provided sensitivity studies between the rain-code size and hourly accuracy within the several forecasting range.

Submitted 1 March, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

Comments: 15 pages, 12 figures

arXiv:2006.15257 [pdf]

Generative Damage Learning for Concrete Aging Detection using Auto-flight Images

Authors: Takato Yasuno, Akira Ishii, Junichiro Fujii, Masazumi Amakata, Yuta Takahashi

Abstract: In order to monitor the state of large-scale infrastructures, image acquisition by autonomous flight drones is efficient for stable angle and high-quality images. Supervised learning requires a large data set consisting of images and annotation labels. It takes a long time to accumulate images, including identifying the damaged regions of interest (ROIs). In recent years, unsupervised deep learning approaches such as generative adversarial networks (GANs) for anomaly detection algorithms have progressed. When a damaged image is a generator input, it tends to reverse from the damaged state to the healthy state generated image. Using the distance of distribution between the real damaged image and the generated reverse aging healthy state fake image, it is possible to detect the concrete damage automatically from unsupervised learning. This paper proposes an anomaly detection method using unpaired image-to-image translation mapping from damaged images to reverse aging fakes that approximates healthy conditions. We apply our method to field studies, and we examine the usefulness of our method for health monitoring of concrete damage.

Submitted 19 August, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

Comments: 8 pages, 15 figures

ACM Class: I.4.6; I.2.10; I.5.4

Journal ref: 37th International Symposium on Automation and Robotics in Construction (ISARC 2020)

arXiv:2003.01288 [pdf]

Trained Model Fusion for Object Detection using Gating Network

Authors: Tetsuo Inoshita, Yuichi Nakatani, Katsuhiko Takahashi, Asuka Ishii, Gaku Nakano

Abstract: The major approaches of transfer learning in computer vision have tried to adapt the source domain to the target domain one-to-one. However, this scenario is difficult to apply to real applications such as video surveillance systems. As those systems have many cameras installed at each location regarded as source domains, it is difficult to identify the proper source domain. In this paper, we introduce a new transfer learning scenario that has various source domains and one target domain, assuming video surveillance system integration. Also, we propose a novel method for automatically producing a high accuracy model by fusing models trained at various source domains. In particular, we show how to apply a gating network to fuse source domains for object detection tasks, which is a new approach. We demonstrate the effectiveness of our method through experiments on traffic surveillance datasets.

Submitted 2 March, 2020; originally announced March 2020.

Comments: Accepted to ACPR 2019

arXiv:1909.01078 [pdf]

A study of trends in the effects of TV ratings and social media (Twitter) -- Case study 1

Authors: Takuya Ueoka, Akira Ishii, Yasuko Kawahata

Abstract: The Japanese TV program 'Drama A' is a drama broadcast from October to December 2016. The audience rating was sluggish, but this drama marked a high audience rating in 2016. Since it was popular from the middle, and it was speculated that there was a part related to social media in the popularity, we considered existing research methods as a case study. In this paper, we used a mathematical model of the hit phenomenon to examine the impact of audience assessment from social media from a sociophysical perspective. We got the same consideration as the audience rating per minute of video research. This paper is IEEE BIGDATA2018's Revised paper (Consideration on TV audience rating and influence of social media).

Submitted 11 August, 2019; originally announced September 2019.

Comments: Audience Rating, Twitter, Social Media, TV

Journal ref: IEEE International Conference on Big Data(2018)

arXiv:1907.07946 [pdf]

Consensus formation Online using Sociophysics method

Authors: Yasuko Kawahata, Akira Ishii

Abstract: Consensus formation and difference of opinion have long been the subject of research. However, relevant laws and systems within society are being updated to reflect the changes in information networks. Online environment has come to fulfill a major role as a real and concrete place of opposing opinions and consensus formation. In the future, quantitative findings on consensus formation, and findings on relevant trends, must be summarized, and quantitative research related to trends likely to give rise to social and economic risk is required. Thus, the potential for comparing research related to consensus formation using actual data and an approach using a mathematical model was first investigated.

Submitted 18 July, 2019; originally announced July 2019.

Journal ref: GDN2019(19th International Conference on Group Decision and Negotiation in 2019 a Joint GDN-EWG/BOR meeting) Proceedings

arXiv:1901.00076 [pdf, ps, other]

The Influence of Social Media Writing on Online Search Behavior for Seasonal Events: The Sociophysics Approach

Authors: Nozomi Okano, Masaru Higashi, Akira Ishii

Abstract: Using seasonal topics as the study subject, in this study, we focus on the timing gap between social media writing and online search behavior. To conduct our analysis, we used the mathematical model of search behavior, comprising the sociophysics approach. The seasonal topics selected were St. Valentine's Day, Halloween and New Year countdown. We also picked up the event like Christmas and Halloween. We analyzed the influence of blogs and Twitter on search behavior and found a deviation of interest in terms of timing. We also analyzed Japanese seasonal event of eating Eho-maki in February 3 and eels at the day of the ox in midsummer.

Submitted 31 December, 2018; originally announced January 2019.

Comments: 10 pages, 10 figures

MSC Class: 91F99

Journal ref: Proceedings of The 22nd Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES2018) 45-49

arXiv:1812.11845 [pdf, ps, other]

Opinion Dynamics Theory for Analysis of Consensus Formation and Division of Opinion on the Internet

Authors: Akira Ishii, Yasuko Kawahata

Abstract: The massive amount of text data on the web has facilitated research on the quantitative analysis of public opinion, which could not be visualized earlier. In this paper, we propose a new opinion dynamics theory. This theory is intended to explain agreement formation and the division of opinion in opinion exchanges on social media such as Twitter. With the popularization of the public network, we have become able to communicate with instantaneity and interactivity beyond temporal and spatial constraints. Research that uses massive web text data to quantitatively analyze the distribution of public opinion, which has not been visualized so far, is progressing. Our model is based on the Bounded Confidence Model, which expresses opinions as continuous quantity values. However, the Bounded Confidence Model assumes that people with differing opinions ignore, rather than reject, each other's opinions. Furthermore, our model incorporates the influence of external pressure and phenomena that depend on the surrounding situation.

Submitted 31 December, 2018; originally announced December 2018.

Comments: 11 pages, 9 figures

MSC Class: 91F99

Journal ref: Proceedings of The 22nd Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES2018) 71-76

arXiv:1807.05320 [pdf]

Analysis of social media content and search behavior related to seasonal topics using the sociophysics approach

Authors: Akira Ishii, Toshimichi Wakabayashi, Nozomi Okano, Yasuko Kawahata

Abstract: We studied the time interval between posting social media content and search action related to seasonal topics. The analysis was performed using a mathematical model of the search behavior as in the theory of sociophysics. As seasonal topics, the word cherry blossom was considered for spring, bikini for summer, autumn leaves for fall, and skiing for winter. We examined the influence of blogs and Twitter posts given the search behavior and found a time deviation of interest on these topics.

Submitted 13 July, 2018; originally announced July 2018.

Comments: 4 pages, 12 figures, Proceedings of The 22nd World Multi-Conference on Systemics, Cybernetics and Informatics: WMSCI 2018 in Orlando, July 8 - 11, 2018

MSC Class: 91Cxx

arXiv:1706.04301 [pdf]

Measurement of human activity using velocity GPS data obtained from mobile phones

Authors: Yasuko Kawahata, Takayuki Mizuno, Akira Ishii

Abstract: Human movement is used as an indicator of human activity in modern society. The velocity of moving humans is calculated based on position information obtained from mobile phones. The level of human activity, as recorded by velocity, varies throughout the day. Therefore, velocity can be used to identify the intervals of highest and lowest activity. More specifically, we obtained mobile-phone GPS data from the people around Shibuya station in Tokyo, which has the highest population density in Japan. From these data, we observe that velocity tends to consistently increase with the changes in social activities. For example, during the earthquake in Kumamoto Prefecture in April 2016, the activity on that day was much lower than usual. In this research, we focus on natural disasters such as earthquakes owing to their significant effects on human activities in developed countries like Japan. In the event of a natural disaster in another developed country, considering the change in human behavior at the time of the disaster (e.g., the 2016 Kumamoto Great Earthquake) from the viewpoint of velocity allows us to improve our planning for mitigation measures. Thus, we analyze the changes in human activity through velocity calculations in Shibuya, Tokyo, and compare times of disasters with normal times.

Submitted 13 June, 2017; originally announced June 2017.

Comments: 9 pages, 8 figures, submitted to the 9th International Conference on Social Informatics (SocInfo 2017) to Oxford, UK, in September 2017

MSC Class: 91C99

ACM Class: J.4
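
Note: the abstract above states that velocity is calculated from mobile-phone position data but does not say how. One common way to do this, shown here only as a sketch under assumptions, is to divide the great-circle (haversine) distance between consecutive GPS fixes by the elapsed time. The function names, the `(unix_seconds, latitude, longitude)` record layout, and the choice of the haversine formula are hypothetical and are not taken from the paper.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def speeds_m_per_s(fixes):
    """Speed between consecutive fixes; fixes are (unix_seconds, lat, lon), sorted by time."""
    speeds = []
    for (t0, lat0, lon0), (t1, lat1, lon1) in zip(fixes, fixes[1:]):
        dt = t1 - t0
        if dt <= 0:
            continue  # skip duplicate or out-of-order timestamps
        speeds.append(haversine_m(lat0, lon0, lat1, lon1) / dt)
    return speeds


# Hypothetical track: one degree of latitude covered in 100 seconds.
speeds = speeds_m_per_s([(0, 0.0, 0.0), (100, 1.0, 0.0)])
```

Averaging such per-device speeds over time windows would give an activity curve of the kind the abstract describes; the windowing itself is left out of this sketch.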

arXiv:1706.00569 [pdf, ps, other]

Position-sensitive propagation of information on social media using social physics approach

Authors: Akira Ishii, Takayuki Mizuno, Yasuko Kawahata

Abstract: The excitement and convergence of tweets on specific topics are well studied. However, by utilizing the position information of Tweet, it is also possible to analyze the position-sensitive tweet. In this research, we focus on bomb terrorist attacks and propose a method for separately analyzing the number of tweets at the place where the incident occurred, nearby, and far. We made measurements of position-sensitive tweets and suggested a theory to explain it. This theory is an extension of the mathematical model of the hit phenomenon.

Submitted 2 June, 2017; originally announced June 2017.

Comments: 11 pages, 6 figures, submitted to 9th International Conference on Social Informatics 2017 at Oxford

MSC Class: 91F99

arXiv:1501.00758 [pdf, other]

Mathematical model for hit phenomena and its application to analyze popularity of weekly tv drama

Authors: Akira Ishii, Akiko Kitao, Tsukasa Usui, Koki Uchiyama

Abstract: Mathematical model for hit phenomena presented by A Ishii et al in 2012 has been extended to analyze and predict a lot of hit subject using social network system. The equation for each individual consumers is assumed and the equation of social response to each hit subject is derived as stochastic process of statistical physics. The advertisement effect is included as external force and the communication effects are included as two-body and three-body interaction. The applications of this model are demonstrated for analyzing population of weekly TV drama. Including both the realtime view data and the playback view data, we found that the indirect communication correlate strongly to the TV viewing rate data for recent Japanese 20 TV drama.

Submitted 4 January, 2015; originally announced January 2015.

Comments: 18 pages, 12 figures, submitted to International Journal of Modern Physics B: Special issue: Advances on Statistical Physics of Complex Systems as a conference paper of International Conference on Statisitical Physics, Rhodes, Greece, 7-11 July 2014

arXiv:1112.1143 [pdf]

Mathematical model for hit phenomena as stochastic process of interactions of human interactions

Authors: Akira Ishii, Hisashi Arakaki, Naoya Matsuda, Sanae Umemura, Tamiko Urushidani, Naoya Yamagata, Narihiko Yoshda

Abstract: Mathematical model for hit phenomena in entertainments in the society is presented as stochastic process of interactions of human dynamics. The model use only the time distribution of advertisement budget as input and the words of mouth (WOM) as posting in the social network system is used as the data to compare with the calculated results. The unit of time is daily. The WOM distribution in time is found to be very close to the residue distribution in time. The calculations for Japanese motion picture market due to the mathematical model agree very well with the actual residue distribution in time.

Submitted 5 December, 2011; originally announced December 2011.

Comments: 20 pages, 16 figures, submitted to New Journal of Physics

arXiv:1112.0767 [pdf, ps, other]

Revenue Prediction of Local Event using Mathematical Model of Hit Phenomena

Authors: Akira Ishii, Takehiro Matsumoto, Shinji Miki

Abstract: Theoretical approach to investigate human-human interaction in society performed using a many-body theory including human-human interaction. The advertisement is treated as an external force. The word of mouth (WOM) effect is included as a two-body interaction between humans. The rumor effect is included as a three-body interaction between humans. The parameters to define the strength of human interactions are assumed to be constant values. The calculated result explained well the two local events "Mizuki-Shigeru Road in Sakaiminato" and "the sculpture festival at Tottori" in Japan.

Submitted 4 December, 2011; originally announced December 2011.

Comments: 8 pages, 3 Figures. submitted to Progress of Theoretical Physics

Showing 1–16 of 16 results for author: Ishii, A