Showing 1–18 of 18 results for author: Takahashi, R

Searching in archive cs.
  1. arXiv:2406.06032  [pdf, other]

    cs.CL

    The Curse of Popularity: Popular Entities have Catastrophic Side Effects when Deleting Knowledge from Language Models

    Authors: Ryosuke Takahashi, Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, Kentaro Inui

    Abstract: Language models (LMs) encode world knowledge in their internal parameters through training. However, LMs may learn personal and confidential information from the training data, leading to privacy concerns such as data leakage. Therefore, research on knowledge deletion from LMs is essential. This study focuses on the knowledge stored in LMs and analyzes the relationship between the side effects of…

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2304.03177  [pdf, other]

    eess.SP cs.IT eess.SY math.ST

    Mutual Interference Mitigation for MIMO-FMCW Automotive Radar

    Authors: Sian Jin, Pu Perry Wang, Petros Boufounos, Philip V. Orlik, Ryuhei Takahashi, Sumit Roy

    Abstract: This paper considers mutual interference mitigation among automotive radars using frequency-modulated continuous wave (FMCW) signal and multiple-input multiple-output (MIMO) virtual arrays. For the first time, we derive a general interference signal model that fully accounts for not only the time-frequency incoherence, e.g., different FMCW configuration parameters and time offsets, but also the sl…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 15 pages, 10 figures

  3. arXiv:2303.13465  [pdf, other]

    cs.CL cs.AI

    Deep RL with Hierarchical Action Exploration for Dialogue Generation

    Authors: Itsugun Cho, Ryota Takahashi, Yusaku Yanase, Hiroaki Saito

    Abstract: Traditionally, approximate dynamic programming is employed in dialogue generation with greedy policy improvement through action sampling, as the natural language action space is vast. However, this practice is inefficient for reinforcement learning (RL) due to the sparsity of eligible responses with high action values, which leads to weak improvement sustained by random sampling. This paper presen…

    Submitted 15 May, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  4. arXiv:2205.09295  [pdf, other]

    cs.CL

    Are Prompt-based Models Clueless?

    Authors: Pride Kavumba, Ryo Takahashi, Yusuke Oda

    Abstract: Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement…

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

  5. arXiv:2204.07372  [pdf, other]

    cs.CL cs.LG

    A Personalized Dialogue Generator with Implicit User Persona Detection

    Authors: Itsugun Cho, Dongyang Wang, Ryota Takahashi, Hiroaki Saito

    Abstract: Current works in the generation of personalized dialogue primarily contribute to the agent presenting a consistent personality and driving a more informative response. However, we found that the generated responses from most previous models tend to be self-centered, with little care for the user in the dialogue. Moreover, we consider that human-like conversation is essentially built based on infer…

    Submitted 21 August, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: 9 pages, 7 figures, Accepted by Coling2022

  6. arXiv:2105.11919  [pdf, other]

    cs.DS cs.CC

    Minmax-optimal list searching with $O(\log_2\log_2 n)$ average cost

    Authors: I. F. D. Oliveira, R. H. C. Takahashi

    Abstract: We find a searching method on ordered lists that surprisingly outperforms binary searching with respect to average query complexity while retaining minmax optimality. The method is shown to require $O(\log_2\log_2 n)$ queries on average while never exceeding $\lceil \log_2 n \rceil$ queries in the worst case, i.e. the minmax bound of binary searching. Our average results assume a uniform distribut…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: under consideration by the Journal of Computer and System Sciences

    MSC Class: 68P10; 68W40; 68Q25

    ACM Class: F.2.2; F.2.3; H.3.3

  7. arXiv:2105.11845  [pdf, other]

    math.OC cs.CC

    An incremental descent method for multi-objective optimization

    Authors: I. F. D. Oliveira, R. H. C. Takahashi

    Abstract: Current state-of-the-art multi-objective optimization solvers, by computing gradients of all $m$ objective functions per iteration, produce after $k$ iterations a measure of proximity to critical conditions that is upper-bounded by $O(1/\sqrt{k})$ when the objective functions are assumed to have $L$-Lipschitz continuous gradients; i.e. they require $O(m/ε^2)$ gradient and function computations to…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: paper pre-submission

    MSC Class: 90C26 90C29

    ACM Class: G.1.6

  8. arXiv:2103.00535  [pdf, other]

    cs.SI

    A multi-objective time series analysis of community mobility reduction comparing first and second COVID-19 waves

    Authors: Gabriela Cavalcante da Silva, Fernanda Monteiro de Almeida, Sabrina Oliveira, Leonardo C. T. Bezerra, Elizabeth F. Wanner, Ricardo H. C. Takahashi

    Abstract: With the logistic challenges faced by most countries for the production, distribution, and application of vaccines for the novel coronavirus disease (COVID-19), social distancing (SD) remains the most tangible approach to mitigate the spread of the virus. To assist SD monitoring, several tech companies have made publicly available anonymized mobility data. In this work, we conduct a multi-objectiv…

    Submitted 28 February, 2021; originally announced March 2021.

  9. arXiv:2102.06540  [pdf, other]

    cs.CL cs.LG

    Two Training Strategies for Improving Relation Extraction over Universal Graph

    Authors: Qin Dai, Naoya Inoue, Ryo Takahashi, Kentaro Inui

    Abstract: This paper explores how the Distantly Supervised Relation Extraction (DS-RE) can benefit from the use of a Universal Graph (UG), the combination of a Knowledge Graph (KG) and a large-scale text collection. A straightforward extension of a current state-of-the-art neural model for DS-RE with a UG may lead to degradation in performance. We first report that this degradation is associated with the di…

    Submitted 6 May, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  10. arXiv:2101.00133  [pdf, other]

    cs.CL cs.AI

    NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

    Authors: Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, et al. (28 additional authors not shown)

    Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage conte…

    Submitted 19 September, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track

  11. arXiv:2011.01785  [pdf, other]

    cs.CL

    Modeling Event Salience in Narratives via Barthes' Cardinal Functions

    Authors: Takaki Otake, Sho Yokoi, Naoya Inoue, Ryo Takahashi, Tatsuki Kuribayashi, Kentaro Inui

    Abstract: Events in a narrative differ in salience: some are more important to the story than others. Estimating event salience is useful for tasks such as story generation, and as a tool for text analysis in narratology and folkloristics. To compute event salience without any annotations, we adopt Barthes' definition of event salience and propose several unsupervised methods that require only a pre-trained…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: accepted to COLING 2020

  12. arXiv:2011.00948  [pdf, other]

    cs.CL

    An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution

    Authors: Ryuto Konno, Yuichiroh Matsubayashi, Shun Kiyono, Hiroki Ouchi, Ryo Takahashi, Kentaro Inui

    Abstract: One critical issue of zero anaphora resolution (ZAR) is the scarcity of labeled data. This study explores how effectively this problem can be alleviated by data augmentation. We adopt a state-of-the-art data augmentation method, called the contextual data augmentation (CDA), that generates labeled training instances using a pretrained language model. The CDA has been reported to work well for seve…

    Submitted 4 November, 2020; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: 13 pages, accepted by COLING 2020

  13. arXiv:2004.15003  [pdf, other]

    cs.CL

    Word Rotator's Distance

    Authors: Sho Yokoi, Ryo Takahashi, Reina Akama, Jun Suzuki, Kentaro Inui

    Abstract: A key principle in assessing textual similarity is measuring the degree of semantic overlap between two texts by considering the word alignment. Such alignment-based approaches are intuitive and interpretable; however, they are empirically inferior to the simple cosine similarity between general-purpose sentence vectors. To address this issue, we focus on and demonstrate the fact that the norm of…

    Submitted 16 November, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: 17 pages, accepted at EMNLP 2020

    Journal ref: EMNLP 2020

  14. arXiv:2001.07895  [pdf, other]

    cs.CV

    Partially-Shared Variational Auto-encoders for Unsupervised Domain Adaptation with Target Shift

    Authors: Ryuhei Takahashi, Atsushi Hashimoto, Motoharu Sonogashira, Masaaki Iiyama

    Abstract: This paper proposes a novel approach for unsupervised domain adaptation (UDA) with target shift. Target shift is a problem of mismatch in label distribution between source and target domains. Typically it appears as class-imbalance in target domain. In practice, this is an important problem in UDA; as we do not know labels in target domain datasets, we do not know whether or not its distribution i…

    Submitted 25 January, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

  15. Data Augmentation using Random Image Cropping and Patching for Deep CNNs

    Authors: Ryo Takahashi, Takashi Matsubara, Kuniaki Uehara

    Abstract: Deep convolutional neural networks (CNNs) have achieved remarkable results in image processing tasks. However, their high expression ability risks overfitting. Consequently, data augmentation techniques have been proposed to prevent overfitting while enriching datasets. Recent CNN architectures with more parameters are rendering traditional data augmentation techniques insufficient. In this study,…

    Submitted 27 August, 2019; v1 submitted 22 November, 2018; originally announced November 2018.

    Comments: accepted version, 16 pages

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology, 2019

  16. arXiv:1805.09547  [pdf, other]

    cs.LG stat.ML

    Interpretable and Compositional Relation Learning by Joint Training with an Autoencoder

    Authors: Ryo Takahashi, Ran Tian, Kentaro Inui

    Abstract: Embedding models for entities and relations are extremely useful for recovering missing facts in a knowledge base. Intuitively, a relation can be modeled by a matrix mapping entity vectors. However, relations reside on low dimension sub-manifolds in the parameter space of arbitrary matrices; for one reason, composition of two relations $\boldsymbol{M}_1,\boldsymbol{M}_2$ may match a third…

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: Equal contribution from first two authors. Accepted for publication in the ACL 2018

  17. A Novel Weight-Shared Multi-Stage CNN for Scale Robustness

    Authors: Ryo Takahashi, Takashi Matsubara, Kuniaki Uehara

    Abstract: Convolutional neural networks (CNNs) have demonstrated remarkable results in image classification for benchmark tasks and practical applications. The CNNs with deeper architectures have achieved even higher performance recently thanks to their robustness to the parallel shift of objects in images as well as their numerous parameters and the resulting high expression ability. However, CNNs have a l…

    Submitted 11 April, 2019; v1 submitted 12 February, 2017; originally announced February 2017.

    Comments: accepted version, 13 pages

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 4, 2019, pp. 1090-1101

  18. arXiv:1503.02642  [pdf, ps, other]

    cs.IT cs.NI nlin.CD

    Security of Power Packet Dispatching Using Differential Chaos Shift Keying

    Authors: Yanzi Zhou, Ryo Takahashi, Takashi Hikihara

    Abstract: This paper investigates and confirms one advantageous function of a power packet dispatching system, which has been proposed by authors' group with being apart from the conventional power distribution system. Here is focused on the function to establish the security of power packet dispatching for prohibiting not only information but also power of power packet from being stolen by attackers. For t…

    Submitted 19 February, 2015; originally announced March 2015.

    Comments: 9 pages, 19 figures