Skip to main content

Showing 1–50 of 102 results for author: Kaneko, M

.
  1. arXiv:2407.03129  [pdf, other

    cs.CL

    Social Bias Evaluation for Large Language Models Requires Prompt Variations

    Authors: Rem Hida, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Warning: This paper contains examples of stereotypes and biases. Large Language Models (LLMs) exhibit considerable social biases, and various studies have tried to evaluate and mitigate these biases accurately. Previous studies use downstream tasks as prompts to examine the degree of social biases for evaluation and mitigation. While LLMs' output highly depends on prompts, previous studies evaluat… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2404.11262  [pdf, other

    cs.CL

    Sampling-based Pseudo-Likelihood for Membership Inference Attacks

    Authors: Masahiro Kaneko, Youmi Ma, Yuki Wata, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) are trained on large-scale web data, which makes it difficult to grasp the contribution of each text. This poses the risk of leaking inappropriate data such as benchmarks, personal information, and copyrighted texts in the training data. Membership Inference Attacks (MIA), which determine whether a given text is included in the model's training data, have been attracti… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  3. Millimeter-wave CO and SiO Observations toward the Broad-velocity-width Molecular Feature CO 16.134-0.553: a Smith cloud scenario?

    Authors: Hiroki Yokozuka, Tomoharu Oka, Shiho Tsujimoto, Yuto Watanabe, Miyuki Kaneko

    Abstract: We report the results of the CO $\textit J$=1-0 and SiO $\textit J$=2-1 map** observations towards the broad-velocity-width molecular feature CO 16.134-0.553 with the Nobeyama Radio Observatory 45 m telescope. The high quality CO map shows that the 5-pc size broad-velocity-width feature bridges two separate velocity components at $\textit V_{\rm{LSR}}$$\quad$$\simeq$ 40 km s$^{-1}$ and 65 km s… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 7 pages, 6 figures, 2 table, accepted for publication in ApJ

    Journal ref: 2024ApJ...964...52Y

  4. arXiv:2403.16139  [pdf, other

    cs.CL

    A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish

    Authors: Masahiro Kaneko, Timothy Baldwin

    Abstract: Large Language Models (LLMs) are trained on massive web-crawled corpora. This poses risks of leakage, including personal information, copyrighted texts, and benchmark datasets. Such leakage leads to undermining human trust in AI due to potential unauthorized generation of content or overestimation of performance. We establish the following three criteria concerning the leakage issues: (1) leakage… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  5. arXiv:2403.15971  [pdf, other

    eess.IV

    PSHop: A Lightweight Feed-Forward Method for 3D Prostate Gland Segmentation

    Authors: Yi**g Yang, Vasileios Magoulianitis, Jiaxin Yang, **tang Xue, Masatomo Kaneko, Giovanni Cacciamani, Andre Abreu, Vinay Duddalwar, C. -C. Jay Kuo, Inderbir S. Gill, Chrysostomos Nikias

    Abstract: Automatic prostate segmentation is an important step in computer-aided diagnosis of prostate cancer and treatment planning. Existing methods of prostate segmentation are based on deep learning models which have a large size and lack of transparency which is essential for physicians. In this paper, a new data-driven 3D prostate segmentation method on MRI is proposed, named PSHop. Different from dee… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 11 pages, 5 figures, 5 tables

  6. arXiv:2403.15969  [pdf, other

    eess.IV

    PCa-RadHop: A Transparent and Lightweight Feed-forward Method for Clinically Significant Prostate Cancer Segmentation

    Authors: Vasileios Magoulianitis, Jiaxin Yang, Yi**g Yang, **tang Xue, Masatomo Kaneko, Giovanni Cacciamani, Andre Abreu, Vinay Duddalwar, C. -C. Jay Kuo, Inderbir S. Gill, Chrysostomos Nikias

    Abstract: Prostate Cancer is one of the most frequently occurring cancers in men, with a low survival rate if not early diagnosed. PI-RADS reading has a high false positive rate, thus increasing the diagnostic incurred costs and patient discomfort. Deep learning (DL) models achieve a high segmentation performance, although require a large model size and complexity. Also, DL models lack of feature interpreta… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 13 pages, 7 figures, 5 tables

  7. arXiv:2403.14159  [pdf, other

    cs.RO math.OC

    Robust Locomotion via Zero-order Stochastic Nonlinear Model Predictive Control with Guard Saltation Matrix

    Authors: Sotaro Katayama, Noriaki Takasugi, Mitsuhisa Kaneko, Norio Nagatsuka, and Masaya Kinoshita

    Abstract: This paper presents a stochastic/robust nonlinear model predictive control (NMPC) to enhance the robustness of legged locomotion against contact uncertainties. We integrate the contact uncertainties into the covariance propagation of stochastic/robust NMPC framework by leveraging the guard saltation matrix and an extended Kalman filter-like covariance update. We achieve fast stochastic/robust NMPC… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  8. arXiv:2402.15987  [pdf, other

    cs.CL cs.AI

    Likelihood-based Mitigation of Evaluation Bias in Large Language Models

    Authors: Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) are widely used to evaluate natural language generation tasks as automated metrics. However, the likelihood, a measure of LLM's plausibility for a sentence, can vary due to superficial differences in sentences, such as word order and sentence structure. It is therefore possible that there might be a likelihood bias if LLMs are used for evaluation: they might overrate s… ▽ More

    Submitted 1 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: 4 main pages

  9. arXiv:2402.14258  [pdf, other

    cs.CL

    Eagle: Ethical Dataset Given from Real Interactions

    Authors: Masahiro Kaneko, Danushka Bollegala, Timothy Baldwin

    Abstract: Recent studies have demonstrated that large language models (LLMs) have ethical-related problems such as social biases, lack of moral reasoning, and generation of offensive content. The existing evaluation metrics and methods to address these ethical challenges use datasets intentionally created by instructing humans to create instances including ethical problems. Therefore, the data does not refl… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  10. arXiv:2401.15585  [pdf, other

    cs.CL

    Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki, Timothy Baldwin

    Abstract: There exist both scalable tasks, like reading comprehension and fact-checking, where model performance improves with model size, and unscalable tasks, like arithmetic reasoning and symbolic reasoning, where model performance does not necessarily improve with model size. Large language models (LLMs) equipped with Chain-of-Thought (CoT) prompting are able to make accurate incremental predictions eve… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  11. arXiv:2401.09935  [pdf, ps, other

    math.NT

    On finite analogues of Euler's constant

    Authors: Masanobu Kaneko, Toshiki Matsusaka, Shin-ichiro Seki

    Abstract: We introduce and study finite analogues of Euler's constant in the same setting as finite multiple zeta values. We define a couple of candidate values from the perspectives of a ``regularized value of $ζ(1)$'' and of Mascheroni's and Kluyver--Nörlund's series expressions of Euler's constant using Gregory coefficients. Moreover, we reveal that the differences between them always lie in the… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 14 pages

    MSC Class: 11M32; 11B68; 11A07

  12. arXiv:2401.08511  [pdf, other

    cs.CL

    The Gaps between Pre-train and Downstream Settings in Bias Evaluation and Debiasing

    Authors: Masahiro Kaneko, Danushka Bollegala, Timothy Baldwin

    Abstract: The output tendencies of Pre-trained Language Models (PLM) vary markedly before and after Fine-Tuning (FT) due to the updates to the model parameters. These divergences in output tendencies result in a gap in the social biases of PLMs. For example, there exits a low correlation between intrinsic bias scores of a PLM and its extrinsic bias scores under FT-based debiasing methods. Additionally, appl… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  13. arXiv:2401.03213  [pdf, ps, other

    math.NT

    Two formulas for certain double and multiple polylogarithms in two variables

    Authors: Masanobu Kaneko, Hirofumi Tsumura

    Abstract: We give a weighted sum formula for the double polylogarithm in two variables, from which we can recover the classical weighted sum formulas for double zeta values, double $T$-values, and some double $L$-values. Also presented is a connection-type formula for a two-variable multiple polylogarithm, which specializes to previously known single-variable formulas. This identity can also be regarded as… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 11 pages

    MSC Class: 11M32; 11M99

  14. arXiv:2312.08668  [pdf, other

    cs.RO

    Versatile Telescopic-Wheeled-Legged Locomotion of Tachyon 3 via Full-Centroidal Nonlinear Model Predictive Control

    Authors: Sotaro Katayama, Noriaki Takasugi, Mitsuhisa Kaneko, Masaya Kinoshita

    Abstract: This paper presents a nonlinear model predictive control (NMPC) toward versatile motion generation for the telescopic-wheeled-legged robot Tachyon 3, the unique hardware structure of which poses challenges in control and motion planning. We apply the full-centroidal NMPC formulation with dedicated constraints that can capture the accurate kinematics and dynamics of Tachyon 3. We have developed a c… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 8 pages, 9 figures

  15. arXiv:2312.06678  [pdf, other

    nucl-ex nucl-th

    Constraining nucleon effective masses with flow and stop** observables from the S$π$RIT experiment

    Authors: C. Y. Tsang, M. Kurata-Nishimura, M. B. Tsang, W. G. Lynch, Y. X. Zhang, J. Barney, J. Estee, G. Jhang, R. Wang, M. Kaneko, J. W. Lee, T. Isobe, T. Murakami, D. S. Ahn, L. Atar, T. Aumann, H. Baba, K. Boretzky, J. Brzychczyk, G. Cerizza, N. Chiga, N. Fukuda, I. Gasparic, B. Hong, A. Horvat , et al. (30 additional authors not shown)

    Abstract: Properties of the nuclear equation of state (EoS) can be probed by measuring the dynamical properties of nucleus-nucleus collisions. In this study, we present the directed flow ($v_1$), elliptic flow ($v_2$) and stop** (VarXZ) measured in fixed target Sn + Sn collisions at 270 AMeV with the S$π$RIT Time Projection Chamber. We perform Bayesian analyses in which EoS parameters are varied simultane… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  16. arXiv:2311.08369  [pdf, other

    cs.CL

    How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection

    Authors: Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    Abstract: To combat the misuse of Large Language Models (LLMs), many recent studies have presented LLM-generated-text detectors with promising performance. When users instruct LLMs to generate texts, the instruction can include different constraints depending on the user's need. However, most recent studies do not cover such diverse instruction patterns when creating datasets for LLM detection. In this pape… ▽ More

    Submitted 12 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: under review

  17. arXiv:2311.08107  [pdf, other

    cs.CL

    SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks

    Authors: Mengsay Loem, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) can justify or critique their predictions through discussions with other models or humans, thereby enriching their intrinsic understanding of instances. While proactive discussions in the inference phase have been shown to boost performance, such interactions have not been extensively explored during the training phase. We hypothesize that incorporating interactive dis… ▽ More

    Submitted 29 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  18. arXiv:2309.11439  [pdf, other

    cs.CL

    Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction

    Authors: Masahiro Kaneko, Naoaki Okazaki

    Abstract: In Grammatical Error Correction (GEC), it is crucial to ensure the user's comprehension of a reason for correction. Existing studies present tokens, examples, and hints as to the basis for correction but do not directly explain the reasons for corrections. Although methods that use Large Language Models (LLMs) to provide direct explanations in natural language have been proposed for various tasks,… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Work in progress

  19. arXiv:2309.09697  [pdf, other

    cs.CL

    Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels

    Authors: Panatchakorn Anantaprayoon, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Discriminatory gender biases have been found in Pre-trained Language Models (PLMs) for multiple languages. In Natural Language Inference (NLI), existing bias evaluation methods have focused on the prediction results of one specific label out of three labels, such as neutral. However, such evaluation methods can be inaccurate since unique biased inferences are associated with unique prediction labe… ▽ More

    Submitted 18 May, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: LREC-COLING 2024

  20. arXiv:2309.09092  [pdf, other

    cs.CL

    The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: Pre-trained language models trained on large-scale data have learned serious levels of social biases. Consequently, various methods have been proposed to debias pre-trained models. Debiasing methods need to mitigate only discriminatory bias information from the pre-trained models, while retaining information that is useful for the downstream tasks. In previous research, whether useful information… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: IJCNLP-AACL 2023

  21. arXiv:2309.07251  [pdf, other

    cs.CL

    In-Contextual Gender Bias Suppression for Large Language Models

    Authors: Daisuke Oba, Masahiro Kaneko, Danushka Bollegala

    Abstract: Despite their impressive performance in a wide range of NLP tasks, Large Language Models (LLMs) have been reported to encode worrying-levels of gender biases. Prior work has proposed debiasing methods that require human labelled examples, data augmentation and fine-tuning of LLMs, which are computationally costly. Moreover, one might not even have access to the model parameters for performing debi… ▽ More

    Submitted 20 February, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: EACL 2024 Findings - Long Paper

  22. arXiv:2307.11729  [pdf, other

    cs.CL

    OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples

    Authors: Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) have achieved human-level fluency in text generation, making it difficult to distinguish between human-written and LLM-generated texts. This poses a growing risk of misuse of LLMs and demands the development of detectors to identify LLM-generated texts. However, existing detectors lack robustness against attacks: they degrade detection accuracy by simply paraphrasing L… ▽ More

    Submitted 18 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: AAAI 2024 camera ready. Code and dataset available at https://github.com/ryuryukke/OUTFOX

  23. arXiv:2305.18156  [pdf, other

    cs.CL cs.AI

    Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods

    Authors: Mengsay Loem, Masahiro Kaneko, Sho Takase, Naoaki Okazaki

    Abstract: Large-scale pre-trained language models such as GPT-3 have shown remarkable performance across various natural language processing tasks. However, applying prompt-based methods with GPT-3 for Grammatical Error Correction (GEC) tasks and their controllability remains underexplored. Controllability in GEC is crucial for real-world applications, particularly in educational settings, where the ability… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted in BEA 2023

  24. arXiv:2305.11862  [pdf, other

    cs.CL

    Reducing Sequence Length by Predicting Edit Operations with Large Language Models

    Authors: Masahiro Kaneko, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance in various tasks and gained significant attention. LLMs are also used for local sequence transduction tasks, including grammatical error correction (GEC) and formality style transfer, where most tokens in a source text are kept unchanged. However, the models that generate all target tokens in such tasks have a tendency to simply… ▽ More

    Submitted 20 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: EMNLP2023

  25. arXiv:2305.11789  [pdf, other

    cs.CL

    Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach

    Authors: Masahiro Kaneko, Graham Neubig, Naoaki Okazaki

    Abstract: Humans work together to solve common problems by having discussions, explaining, and agreeing or disagreeing with each other. Similarly, if a system can have discussions with humans when solving tasks, it can improve the system's performance and reliability. In previous research on explainability, it has only been possible for the system to make predictions and for humans to ask questions about th… ▽ More

    Submitted 30 January, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: EACL2024 Findings

  26. arXiv:2301.12074  [pdf, other

    cs.CL

    Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: Numerous types of social biases have been identified in pre-trained language models (PLMs), and various intrinsic bias evaluation measures have been proposed for quantifying those social biases. Prior works have relied on human annotated examples to compare existing intrinsic bias evaluation measures. However, this approach is not easily adaptable to different languages nor amenable to large scale… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: EACL 2023

  27. Discovery of the Tadpole Molecular Cloud near the Galactic Nucleus

    Authors: Miyuki Kaneko, Tomoharu Oka, Hiroki Yokozuka, Rei Enokiya, Shunya Takekawa, Yuhei Iwata, Shiho Tsujimoto

    Abstract: In this paper, we report the discovery of an isolated, peculiar compact cloud with a steep velocity gradient at $2\farcm 6$ northwest of Sgr A*. This ``Tadpole'' molecular cloud is unique owing to its characteristic head-tail structure in the position-velocity space. By tracing the CO {\it J}=3--2 intensity peak in each velocity channel, we noticed that the kinematics of the Tadpole can be well re… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: 10 pages, 9 figures, 2 tables, accepted for publication on the Astrophysical Journal

    Journal ref: The Astrophysical Journal, January 10, 2023, 942, 46

  28. arXiv:2212.12982  [pdf, ps, other

    physics.app-ph

    Design guidelines for the SPICE parameters of waveform-selective metasurfaces varying with the incident pulse width at a constant oscillation frequency

    Authors: Shiori Imai, Haruki Homma, Kairi Takimoto, Mizuki Tanikawa, ** Nakamura, Masaya Kaneko, Yuya Osaki, Kiichi Niitsu, Cheng Yongzhi, Ashif Aminulloh Fathnan, Hiroki Wakatsuchi

    Abstract: In this study, we numerically demonstrate how the response of recently reported circuit-based metasurfaces is characterized by their circuit parameters. These metasurfaces, which include a set of four diodes as a full wave rectifier, are capable of sensing different waves even at the same frequency in response to the incident waveform, or more specifically the pulse width. This study reveals the r… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Comments: 9 pages, 9 figures

  29. Isoscaling in central Sn+Sn collisions at 270 MeV/u

    Authors: J. W. Lee, M. B. Tsang, C. Y. Tsang, R. Wang, J. Barney, J. Estee, T. Isobe, M. Kaneko, M. Kurata-Nishimura, W. G. Lynch, T. Murakami, A. Ono, S. R. Souza, D. S. Ahn, L. Atar, T. Aumann, H. Baba, K. Boretzky, J. Brzychczyk, G. Cerizza, N. Chiga, N. Fukuda, I. Gasparic, B. Hong, A. Horvat , et al. (39 additional authors not shown)

    Abstract: Experimental information on fragment emissions is important in understanding the dynamics of nuclear collisions and in the development of transport model simulating heavy-ion collisions. The composition of complex fragments emitted in the heavy-ion collisions can be explained by statistical models, which assume that thermal equilibrium is achieved at collision energies below 100 MeV/u. Our new exp… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

    Journal ref: The European Physical Journal A volume 58, Article number: 201 (2022)

  30. arXiv:2210.02938  [pdf, other

    cs.CL

    Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: We study the relationship between task-agnostic intrinsic and task-specific extrinsic social bias evaluation measures for Masked Language Models (MLMs), and find that there exists only a weak correlation between these two types of evaluation measures. Moreover, we find that MLMs debiased using different methods still re-learn social biases during fine-tuning on downstream tasks. We identify the so… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: COLING 2022

  31. arXiv:2209.06770  [pdf, ps, other

    math.NT

    Parametric Apéry-type Series and Hurwitz-type Multiple Zeta Values

    Authors: Masanobu Kaneko, Wei** Wang, Ce Xu, Jianqiang Zhao

    Abstract: In this paper, we will establish many explicit relations between parametric Apéry-type series involving one or two parametric binomial coefficients and Hurwitz-type multiple zeta values (with $r$-variables) by using the method of iterated integral. Furthermore, we also establish some new identities of integrals involving multiple polylogarithm functions and Kaneko--Tsumura {\rm A}-functions in ter… ▽ More

    Submitted 18 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 33 pages

  32. arXiv:2208.05146  [pdf, ps, other

    math.NT

    Multiple $L$-values of level four, poly-Euler numbers, and related zeta functions

    Authors: Masanobu Kaneko, Hirofumi Tsumura

    Abstract: We present several formulas for some specific multiple $L$-values of conductor four. This grew out from the study of zeta functions of level four of Arakawa-Kaneko type. Closely related is a new version of multiple poly-Euler numbers and we briefly discuss this too.

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 28pages

    MSC Class: Primary 11M32; Secondary 11B68; 11M41

  33. arXiv:2207.13354  [pdf, other

    cs.CL

    Are Neighbors Enough? Multi-Head Neural n-gram can be Alternative to Self-attention

    Authors: Mengsay Loem, Sho Takase, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Impressive performance of Transformer has been attributed to self-attention, where dependencies between entire input in a sequence are considered at every position. In this work, we reform the neural $n$-gram model, which focuses on only several surrounding representations of each position, with the multi-head mechanism as in Vaswani et al.(2017). Through experiments on sequence-to-sequence tasks,… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  34. arXiv:2205.09867  [pdf, other

    cs.CL

    Gender Bias in Meta-Embeddings

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: Different methods have been proposed to develop meta-embeddings from a given set of source embeddings. However, the source embeddings can contain unfair gender-related biases, and how these influence the meta-embeddings has not been studied yet. We study the gender bias in meta-embeddings created under three different settings: (1) meta-embedding multiple sources without performing any debiasing (… ▽ More

    Submitted 6 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Findings of EMNLP 2022

  35. arXiv:2205.00551  [pdf, other

    cs.CL

    Gender Bias in Masked Language Models for Multiple Languages

    Authors: Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki

    Abstract: Masked Language Models (MLMs) pre-trained by predicting masked tokens on large corpora have been used successfully in natural language processing tasks for a variety of languages. Unfortunately, it was reported that MLMs also learn discriminative biases regarding attributes such as gender and race. Because most studies have focused on MLMs in English, the bias of MLMs in other languages has rarely… ▽ More

    Submitted 4 May, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  36. arXiv:2203.07523  [pdf, other

    cs.CL

    Sense Embeddings are also Biased--Evaluating Social Biases in Static and Contextualised Sense Embeddings

    Authors: Yi Zhou, Masahiro Kaneko, Danushka Bollegala

    Abstract: Sense embedding learning methods learn different embeddings for the different senses of an ambiguous word. One sense of an ambiguous word might be socially biased while its other senses remain unbiased. In comparison to the numerous prior work evaluating the social biases in pretrained word embeddings, the biases in sense embeddings have been relatively understudied. We create a benchmark dataset… ▽ More

    Submitted 16 March, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  37. arXiv:2203.07085  [pdf, other

    cs.CL

    Interpretability for Language Learners Using Example-Based Grammatical Error Correction

    Authors: Masahiro Kaneko, Sho Takase, Ayana Niwa, Naoaki Okazaki

    Abstract: Grammatical Error Correction (GEC) should not focus only on high accuracy of corrections but also on interpretability for language learning. However, existing neural-based GEC models mainly aim at improving accuracy, and their interpretability has not been explored. A promising approach for improving interpretability is an example-based method, which uses similar retrieved examples to generate cor… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  38. arXiv:2201.06199  [pdf, other

    cs.CL

    Proficiency Matters Quality Estimation in Grammatical Error Correction

    Authors: Yu** Takahashi, Masahiro Kaneko, Masato Mita, Mamoru Komachi

    Abstract: This study investigates how supervised quality estimation (QE) models of grammatical error correction (GEC) are affected by the learners' proficiency with the data. QE models for GEC evaluations in prior work have obtained a high correlation with manual evaluations. However, when functioning in a real-world context, the data used for the reported results have limitations because prior works were b… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

    Comments: 6 pages (4 pages + references)

  39. arXiv:2201.05313  [pdf, other

    cs.CL

    ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization

    Authors: Mengsay Loem, Sho Takase, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Neural models trained with large amount of parallel data have achieved impressive performance in abstractive summarization tasks. However, large-scale parallel corpora are expensive and challenging to construct. In this work, we introduce a low-cost and effective strategy, ExtraPhrase, to augment training data for abstractive summarization tasks. ExtraPhrase constructs pseudo training data in two… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  40. arXiv:2109.12501  [pdf, ps, other

    math.NT

    On finite multiple zeta values of level two

    Authors: Masanobu Kaneko, Takuya Murakami, Amane Yoshihara

    Abstract: We introduce and study a ``level two'' analogue of finite multiple zeta values. We give conjectural bases of the space of finite Euler sums as well as that of usual finite multiple zeta values in terms of these newly defined elements. A kind of ``parity result'' and certain sum formulas are also presented.

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 10 pages

    MSC Class: 11M32; 11A07

  41. arXiv:2108.05827  [pdf, ps, other

    q-bio.NC

    Fluctuation in background synaptic activity controls synaptic plasticity

    Authors: Yuto Takeda, Katsuhiko Hata, Tokio Yamasaki, Masaki Kaneko, Osamu Yokoi, Chengta Tsai, Kazuo Umemura, Tetsuro Nikuni

    Abstract: Synaptic plasticity is vital for learning and memory in the brain. It consists of long-term potentiation (LTP) and long-term depression (LTD). Spike frequency is one of the major components of synaptic plasticity in the brain, a noisy environment. Recently, we mathematically analysed the frequency-dependent synaptic plasticity (FDP) in vivo and found that LTP is more likely to occur with an increa… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 9 pages, 4 figures

  42. arXiv:2107.13985  [pdf, other

    physics.ins-det nucl-ex

    Applying machine learning to determine impact parameter in nuclear physics experiments

    Authors: C. Y. Tsang, Yongjia Wang, M. B. Tsang, J. Estee, T. Isobe, M. Kaneko, M. Kurata-Nishimura, J. W. Lee, Fupeng Li, Qingfeng Li, W. G. Lynch, T. Murakami, R. Wang, Dan Cozma, Rohit Kumar, Akira Ono, Ying-Xun Zhang

    Abstract: Machine Learning (ML) algorithms have been demonstrated to be capable of predicting impact parameter in heavy-ion collisions from transport model simulation events with perfect detector response. We extend the scope of ML application to experimental data by incorporating realistic detector response of the S$π$RIT Time Projection Chamber into the heavy-ion simulation events generated from the UrQMD… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  43. arXiv:2106.09902  [pdf, other

    physics.app-ph

    Complementary junction field-effect transistor logic gate operational at 300$^\circ$C with 1.4 V supply voltage

    Authors: Mitsuaki Kaneko, Masashi Nakajima, Qimin **, Tsunenobu Kimoto

    Abstract: Integrated circuits (ICs) that can operate at high temperature have a wide variety of applications in the fields of automotive, aerospace, space exploration, and deep-well drilling. Conventional silicon-based complementary metal-oxide-semiconductor (CMOS) circuits cannot work at higher than 200 $^\circ$C, leading to the use of wide bandgap semiconductor, especially silicon carbide (SiC). However,… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 30 pages, 4 figures, 2 supplementary figures

  44. arXiv:2104.08478  [pdf, other

    cs.CL

    Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation

    Authors: Seiichiro Kondo, Kengo Hotate, Masahiro Kaneko, Mamoru Komachi

    Abstract: Neural machine translation (NMT) has recently gained widespread attention because of its high translation accuracy. However, it shows poor performance in the translation of long sentences, which is a major issue in low-resource languages. It is assumed that this issue is caused by insufficient number of long sentences in the training data. Therefore, this study proposes a simple data augmentation… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 7 pages; camera-ready for NAACL Student Research Workshop 2021

  45. arXiv:2104.07848  [pdf, other

    cs.CL

    Comparison of Grammatical Error Correction Using Back-Translation Models

    Authors: Aomi Koyama, Kengo Hotate, Masahiro Kaneko, Mamoru Komachi

    Abstract: Grammatical error correction (GEC) suffers from a lack of sufficient parallel data. Therefore, GEC studies have developed various methods to generate pseudo data, which comprise pairs of grammatical and artificially produced ungrammatical sentences. Currently, a mainstream approach to generate pseudo data is back-translation (BT). Most previous GEC studies using BT have employed the same architect… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: 10 pages; camera-ready for NAACL Student Research Workshop 2021

  46. arXiv:2104.07496  [pdf, other

    cs.CL

    Unmasking the Mask -- Evaluating Social Biases in Masked Language Models

    Authors: Masahiro Kaneko, Danushka Bollegala

    Abstract: Masked Language Models (MLMs) have shown superior performances in numerous downstream NLP tasks when used as text encoders. Unfortunately, MLMs also demonstrate significantly worrying levels of social biases. We show that the previously proposed evaluation metrics for quantifying the social biases in MLMs are problematic due to following reasons: (1) prediction accuracy of the masked tokens itself… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  47. arXiv:2104.07410  [pdf, other

    cs.CL

    Simultaneous Multi-Pivot Neural Machine Translation

    Authors: Raj Dabre, Aizhan Imankulova, Masahiro Kaneko, Abhisek Chakrabarty

    Abstract: Parallel corpora are indispensable for training neural machine translation (NMT) models, and parallel corpora for most language pairs do not exist or are scarce. In such cases, pivot language NMT can be helpful where a pivot language is used such that there exist parallel corpora between the source and pivot and pivot and target languages. Naturally, the quality of pivot language translation is mo… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: preliminary work. pardon the messy writing and mistakes. will be submitted to emnlp after major overhaul

  48. New Look at the Molecular Superbubble Candidate in the Galactic Center

    Authors: Shiho Tsujimoto, Tomoharu Oka, Shunya Takekawa, Yuhei Iwata, Asaka Uruno, Hiroki Yokozuka, Ryosuke Nakagawara, Yuto Watanabe, Akira Kawakami, Sonomi Nishiyama, Miyuki Kaneko, Shoko Kanno, Takuma Ogawa

    Abstract: The $l\!=\!+1.\!\!^\circ3$ region in the Galactic center is characterized by multiple shell-like structures and their extremely broad velocity widths. We revisit the molecular superbubble hypothesis for this region, based on high resolution maps of CO {\it J}=1--0, $^{13}$CO {\it J}=1--0, H$^{13}$CN {\it J}=1--0, H$^{13}$CO$^{+}$ {\it J}=1--0, SiO {\it J}=2--1, and CS {\it J}=2--1 lines obtained f… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  49. Probing the Symmetry Energy with the Spectral Pion Ratio

    Authors: J. Estee, W. G. Lynch, C. Y. Tsang, J. Barney, G. Jhang, M. B. Tsang, R. Wang, M. Kaneko, J. W. Lee, T. Isobe, M. Kurata-Nishimura, T. Murakami, D. S. Ahn, L. Atar, T. Aumann, H. Baba, K. Boretzky, J. Brzychczyk, G. Cerizza, N. Chiga, N. Fukuda, I. Gasparic, B. Hong, A. Horvat, K. Ieki , et al. (38 additional authors not shown)

    Abstract: Many neutron star (NS) properties, such as the proton fraction within a NS, reflect the symmetry energy contributions to the Equation of State that dominate when neutron and proton densities differ strongly. To constrain these contributions at supra-saturation densities, we measure the spectra of charged pions produced by colliding rare isotope tin (Sn) beams with isotopically enriched Sn targets.… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Journal ref: Phys. Rev. Lett. 126, 162701 (2021)

  50. arXiv:2101.09525  [pdf, other

    cs.CL

    Dictionary-based Debiasing of Pre-trained Word Embeddings

    Authors: Masahiro Kaneko, Danushka Bollegala

    Abstract: Word embeddings trained on large corpora have shown to encode high levels of unfair discriminatory gender, racial, religious and ethnic biases. In contrast, human-written dictionaries describe the meanings of words in a concise, objective and an unbiased manner. We propose a method for debiasing pre-trained word embeddings using dictionaries, without requiring access to the original training r… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: EACL 2021