Skip to main content

Showing 1–10 of 10 results for author: Sogawa, Y

.
  1. arXiv:2405.08037  [pdf, other

    cs.HC cs.AI

    Layout Generation Agents with Large Language Models

    Authors: Yuichi Sasazawa, Yasuhiro Sogawa

    Abstract: In recent years, there has been an increasing demand for customizable 3D virtual spaces. Due to the significant human effort required to create these virtual spaces, there is a need for efficiency in virtual space creation. While existing studies have proposed methods for automatically generating layouts such as floor plans and furniture arrangements, these methods only generate text indicating th… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2311.07994  [pdf, other

    cs.IR

    Text Retrieval with Multi-Stage Re-Ranking Models

    Authors: Yuichi Sasazawa, Kenichi Yokote, Osamu Imaichi, Yasuhiro Sogawa

    Abstract: The text retrieval is the task of retrieving similar documents to a search query, and it is important to improve retrieval accuracy while maintaining a certain level of retrieval speed. Existing studies have reported accuracy improvements using language models, but many of these do not take into account the reduction in search speed that comes with increased performance. In this study, we propose… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  3. arXiv:2308.07336  [pdf, other

    cs.AI cs.CL cs.LG cs.LO

    Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic

    Authors: Terufumi Morishita, Gaku Morio, Atsuki Yamaguchi, Yasuhiro Sogawa

    Abstract: We study a synthetic corpus based approach for language models (LMs) to acquire logical deductive reasoning ability. The previous studies generated deduction examples using specific sets of deduction rules. However, these rules were limited or otherwise arbitrary, limiting the generalizability of acquired reasoning ability. We rethink this and adopt a well-grounded set of deduction rules based on… ▽ More

    Submitted 13 November, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:25254-25274, 2023

  4. LARCH: Large Language Model-based Automatic Readme Creation with Heuristics

    Authors: Yuta Koreeda, Terufumi Morishita, Osamu Imaichi, Yasuhiro Sogawa

    Abstract: Writing a readme is a crucial aspect of software development as it plays a vital role in managing and reusing program code. Though it is a pain point for many developers, automatically creating one remains a challenge even with the recent advancements in large language models (LLMs), because it requires generating an abstract description from thousands of lines of code. In this demo paper, we show… ▽ More

    Submitted 22 August, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: This is a pre-print of a paper accepted at CIKM'23 Demo. Refer to the DOI URL for the original publication

    Journal ref: In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, October 21-25, 2023, Birmingham, United Kingdom. ACM, New York, NY, USA, 5 pages

  5. arXiv:2306.09572  [pdf, other

    cs.CL cs.AI

    How do different tokenizers perform on downstream tasks in scriptio continua languages?: A case study in Japanese

    Authors: Takuro Fujii, Koki Shibata, Atsuki Yamaguchi, Terufumi Morishita, Yasuhiro Sogawa

    Abstract: This paper investigates the effect of tokenizers on the downstream performance of pretrained language models (PLMs) in scriptio continua languages where no explicit spaces exist between words, using Japanese as a case study. The tokenizer for such languages often consists of a morphological analyzer and a subword tokenizer, requiring us to conduct a comprehensive study of all possible pairs. Howev… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL SRW 2023

  6. arXiv:2305.10992  [pdf, other

    cs.CL cs.AI

    How does the task complexity of masked pretraining objectives affect downstream performance?

    Authors: Atsuki Yamaguchi, Hiroaki Ozaki, Terufumi Morishita, Gaku Morio, Yasuhiro Sogawa

    Abstract: Masked language modeling (MLM) is a widely used self-supervised pretraining objective, where a model needs to predict an original token that is replaced with a mask given contexts. Although simpler and computationally efficient pretraining objectives, e.g., predicting the first character of a masked token, have recently shown comparable results to MLM, no objectives with a masking scheme actually… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 Findings

  7. arXiv:2304.09516  [pdf, other

    cs.CL

    Controlling keywords and their positions in text generation

    Authors: Yuichi Sasazawa, Terufumi Morishita, Hiroaki Ozaki, Osamu Imaichi, Yasuhiro Sogawa

    Abstract: One of the challenges in text generation is to control text generation as intended by the user. Previous studies proposed specifying the keywords that should be included in the generated text. However, this approach is insufficient to generate text that reflect the user's intent. For example, placing an important keyword at the beginning of the text would help attract the reader's attention; howev… ▽ More

    Submitted 31 October, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Journal ref: Proceedings of the 16th International Natural Language Generation Conference, 2023, pages 407 to 413

  8. arXiv:2303.01794  [pdf, other

    cs.CL cs.AI

    Hitachi at SemEval-2023 Task 3: Exploring Cross-lingual Multi-task Strategies for Genre and Framing Detection in Online News

    Authors: Yuta Koreeda, Ken-ichi Yokote, Hiroaki Ozaki, Atsuki Yamaguchi, Masaya Tsunokake, Yasuhiro Sogawa

    Abstract: This paper explains the participation of team Hitachi to SemEval-2023 Task 3 "Detecting the genre, the framing, and the persuasion techniques in online news in a multi-lingual setup.'' Based on the multilingual, multi-task nature of the task and the low-resource setting, we investigated different cross-lingual and multi-task strategies for training the pretrained language models. Through extensive… ▽ More

    Submitted 25 April, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at SemEval-2023 Task 3

  9. arXiv:1707.02963  [pdf, other

    stat.ML

    An Interactive Greedy Approach to Group Sparsity in High Dimensions

    Authors: Wei Qian, Wending Li, Yasuhiro Sogawa, Ryohei Fujimaki, Xitong Yang, Ji Liu

    Abstract: Sparsity learning with known grou** structure has received considerable attention due to wide modern applications in high-dimensional data analysis. Although advantages of using group information have been well-studied by shrinkage-based approaches, benefits of group sparsity have not been well-documented for greedy-type methods, which much limits our understanding and use of this important clas… ▽ More

    Submitted 26 September, 2018; v1 submitted 10 July, 2017; originally announced July 2017.

  10. arXiv:1101.2489  [pdf, ps, other

    stat.ML

    DirectLiNGAM: A direct method for learning a linear non-Gaussian structural equation model

    Authors: Shohei Shimizu, Takanori Inazumi, Yasuhiro Sogawa, Aapo Hyvarinen, Yoshinobu Kawahara, Takashi Washio, Patrik O. Hoyer, Kenneth Bollen

    Abstract: Structural equation models and Bayesian networks have been widely used to analyze causal relations between continuous variables. In such frameworks, linear acyclic models are typically used to model the data-generating process of variables. Recently, it was shown that use of non-Gaussianity identifies the full structure of a linear acyclic model, i.e., a causal ordering of variables and their conn… ▽ More

    Submitted 7 April, 2011; v1 submitted 12 January, 2011; originally announced January 2011.

    Comments: A revised version of this was accepted in Journal of Machine Learning Research