Skip to main content

Showing 1–29 of 29 results for author: Ueda, N

.
  1. arXiv:2407.03963  [pdf, other

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano , et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2403.19259  [pdf, other

    cs.CL

    J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution

    Authors: Nobuhiro Ueda, Hideko Habe, Yoko Matsui, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi, Koichiro Yoshino

    Abstract: Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the verbal information that appears in user interactions to the visual information observed in egocentric views. To this end, we propose a multimodal referen… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  3. Temporal Logic Formalisation of ISO 34502 Critical Scenarios: Modular Construction with the RSS Safety Distance

    Authors: Jesse Reimann, Nico Mansion, James Haydon, Benjamin Bray, Agnishom Chattopadhyay, Sota Sato, Masaki Waga, Étienne André, Ichiro Hasuo, Naoki Ueda, Yosuke Yokoyama

    Abstract: As the development of autonomous vehicles progresses, efficient safety assurance methods become increasingly necessary. Safety assurance methods such as monitoring and scenario-based testing call for formalisation of driving scenarios. In this paper, we develop a temporal-logic formalisation of an important class of critical scenarios in the ISO standard 34502. We use signal temporal logic (STL) a… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 12 pages, 4 figures, 5 tables. Accepted to SAC 2024

  4. arXiv:2403.03690  [pdf

    cs.CL cs.AI

    Rapidly Develo** High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese

    Authors: Yikun Sun, Zhen Wan, Nobuhiro Ueda, Sakiko Yahata, Fei Cheng, Chenhui Chu, Sadao Kurohashi

    Abstract: The creation of instruction data and evaluation benchmarks for serving Large language models often involves enormous human annotation. This issue becomes particularly pronounced when rapidly develo** such resources for a non-English language like Japanese. Instead of following the popular practice of directly translating existing English resources into Japanese (e.g., Japanese-Alpaca), we propos… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: COLING 2024. Our code are available here: \href{https://github.com/hitoshizuku7/awesome-Ja-self-instruct}{self-instruct data} and \href{https://github.com/ku-nlp/ja-vicuna-qa-benchmark}{evaluation benchmark}

  5. arXiv:2402.09018  [pdf, other

    stat.ML cs.LG

    Neural Operators Meet Energy-based Theory: Operator Learning for Hamiltonian and Dissipative PDEs

    Authors: Yusuke Tanaka, Takaharu Yaguchi, Tomoharu Iwata, Naonori Ueda

    Abstract: The operator learning has received significant attention in recent years, with the aim of learning a map** between function spaces. Prior works have proposed deep neural networks (DNNs) for learning such a map**, enabling the learning of solution operators of partial differential equations (PDEs). However, these works still struggle to learn dynamics that obeys the laws of physics. This paper… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  6. arXiv:2310.13270  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-learning of Physics-informed Neural Networks for Efficiently Solving Newly Given PDEs

    Authors: Tomoharu Iwata, Yusuke Tanaka, Naonori Ueda

    Abstract: We propose a neural network-based meta-learning method to efficiently solve partial differential equation (PDE) problems. The proposed method is designed to meta-learn how to solve a wide variety of PDE problems, and uses the knowledge for solving newly given PDE problems. We encode a PDE problem into a problem representation using neural networks, where governing equations are represented by coef… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  7. arXiv:2206.12478  [pdf, ps, other

    astro-ph.IM astro-ph.HE

    Deep-learning Real/Bogus classification for the Tomo-e Gozen transient survey

    Authors: Ichiro Takahashi, Ryo Hamasaki, Naonori Ueda, Masaomi Tanaka, Nozomu Tominaga, Shigeyuki Sako, Ryou Ohsawa, Naoki Yoshida

    Abstract: We present a deep neural network Real/Bogus classifier that improves classification performance in the Tomo-e Gozen transient survey by handling label errors in the training data. In the wide-field, high-frequency transient survey with Tomo-e Gozen, the performance of conventional convolutional neural network classifier is not sufficient as about $10^6$ bogus detections appear every night. In need… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 14 pages, 17 figures, 2 tables. Published in PASJ. The source code is available at https://github.com/ichiro-takahashi/tomoe-realbogus

  8. arXiv:2206.01606  [pdf, ps, other

    stat.ML cs.LG

    Excess risk analysis for epistemic uncertainty with application to variational inference

    Authors: Futoshi Futami, Tomoharu Iwata, Naonori Ueda, Issei Sato, Masashi Sugiyama

    Abstract: Bayesian deep learning plays an important role especially for its ability evaluating epistemic uncertainty (EU). Due to computational complexity issues, approximation methods such as variational inference (VI) have been used in practice to obtain posterior distributions and their generalization abilities have been analyzed extensively, for example, by PAC-Bayesian theory; however, little analysis… ▽ More

    Submitted 11 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  9. arXiv:2106.05010  [pdf, ps, other

    stat.ML cs.LG

    Loss function based second-order Jensen inequality and its application to particle variational inference

    Authors: Futoshi Futami, Tomoharu Iwata, Naonori Ueda, Issei Sato, Masashi Sugiyama

    Abstract: Bayesian model averaging, obtained as the expectation of a likelihood function by a posterior distribution, has been widely used for prediction, evaluation of uncertainty, and model selection. Various approaches have been developed to efficiently capture the information in the posterior distribution; one such approach is the optimization of a set of models simultaneously with interaction to ensure… ▽ More

    Submitted 9 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  10. arXiv:2008.06726  [pdf, ps, other

    astro-ph.IM astro-ph.HE

    Photometric classification of HSC transients using machine learning

    Authors: Ichiro Takahashi, Nao Suzuki, Naoki Yasuda, Akisato Kimura, Naonori Ueda, Masaomi Tanaka, Nozomu Tominaga, Naoki Yoshida

    Abstract: The advancement of technology has resulted in a rapid increase in supernova (SN) discoveries. The Subaru/Hyper Suprime-Cam (HSC) transient survey, conducted from fall 2016 through spring 2017, yielded 1824 SN candidates. This gave rise to the need for fast type classification for spectroscopic follow-up and prompted us to develop a machine learning algorithm using a deep neural network (DNN) with… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: 23 pages, 18 figures, accepted for publication in PASJ

  11. arXiv:2008.01523  [pdf, other

    cs.CL

    A System for Worldwide COVID-19 Information Aggregation

    Authors: Akiko Aizawa, Frederic Bergeron, Junjie Chen, Fei Cheng, Katsuhiko Hayashi, Kentaro Inui, Hiroyoshi Ito, Daisuke Kawahara, Masaru Kitsuregawa, Hirokazu Kiyomaru, Masaki Kobayashi, Takashi Kodama, Sadao Kurohashi, Qianying Liu, Masaki Matsubara, Yusuke Miyao, Atsuyuki Morishima, Yugo Murawaki, Kazumasa Omura, Haiyue Song, Eiichiro Sumita, Shinji Suzuki, Ribeka Tanaka, Yu Tanaka, Masashi Toyoda , et al. (4 additional authors not shown)

    Abstract: The global pandemic of COVID-19 has made the public pay close attention to related news, covering various domains, such as sanitation, treatment, and effects on education. Meanwhile, the COVID-19 condition is very different among the countries (e.g., policies and development of the epidemic), and thus citizens would be interested in news in foreign countries. We build a system for worldwide COVID-… ▽ More

    Submitted 11 October, 2020; v1 submitted 27 July, 2020; originally announced August 2020.

    Comments: Accepted to EMNLP 2020 Workshop NLP-COVID

  12. arXiv:2007.10394  [pdf, other

    cs.LG cs.AI stat.ML

    Translation Between Waves, wave2wave

    Authors: Tsuyoshi Okita, Hirotaka Hachiya, Sozo Inoue, Naonori Ueda

    Abstract: The understanding of sensor data has been greatly improved by advanced deep learning methods with big data. However, available sensor data in the real world are still limited, which is called the opportunistic sensor problem. This paper proposes a new variant of neural machine translation seq2seq to deal with continuous signal waves by introducing the window-based (inverse-) representation to adap… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  13. arXiv:1911.08105  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Three-dimensional Generative Adversarial Nets for Unsupervised Metal Artifact Reduction

    Authors: Megumi Nakao, Keiho Imanishi, Nobuhiro Ueda, Yuichiro Imai, Tadaaki Kirita, Tetsuya Matsuda

    Abstract: The reduction of metal artifacts in computed tomography (CT) images, specifically for strong artifacts generated from multiple metal objects, is a challenging issue in medical imaging research. Although there have been some studies on supervised metal artifact reduction through the learning of synthesized artifacts, it is difficult for simulated artifacts to cover the complexity of the real physic… ▽ More

    Submitted 21 August, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Journal ref: IEEE Access, 8, 109453-109465 (2020)

  14. arXiv:1909.04807  [pdf, other

    stat.ML cs.LG

    Anomaly Detection with Inexact Labels

    Authors: Tomoharu Iwata, Machiko Toyoda, Shotaro Tora, Naonori Ueda

    Abstract: We propose a supervised anomaly detection method for data with inexact anomaly labels, where each label, which is assigned to a set of instances, indicates that at least one instance in the set is anomalous. Although many anomaly detection methods have been proposed, they cannot handle inexact anomaly labels. To measure the performance with inexact anomaly labels, we define the inexact AUC, which… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

  15. Deep Mixture Point Processes: Spatio-temporal Event Prediction with Rich Contextual Information

    Authors: Maya Okawa, Tomoharu Iwata, Takeshi Kurashima, Yusuke Tanaka, Hiroyuki Toda, Naonori Ueda

    Abstract: Predicting when and where events will occur in cities, like taxi pick-ups, crimes, and vehicle collisions, is a challenging and important problem with many applications in fields such as urban planning, transportation optimization and location-based marketing. Though many point processes have been proposed to model events in a continuous spatio-temporal space, none of them allow for the considerat… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: KDD 19

  16. arXiv:1905.09690  [pdf, other

    cs.LG stat.ML

    Fully Neural Network based Model for General Temporal Point Processes

    Authors: Takahiro Omi, Naonori Ueda, Kazuyuki Aihara

    Abstract: A temporal point process is a mathematical model for a time series of discrete events, which covers various applications. Recently, recurrent neural network (RNN) based models have been developed for point processes and have been found effective. RNN based models usually assume a specific functional form for the time course of the intensity function of a point process (e.g., exponentially decreasi… ▽ More

    Submitted 10 January, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

    Journal ref: Neurips 2019

  17. arXiv:1904.09697  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    The Hyper Suprime-Cam SSP Transient Survey in COSMOS: Overview

    Authors: Naoki Yasuda, Masaomi Tanaka, Nozomu Tominaga, Ji-an Jiang, Takashi J. Moriya, Tomoki Morokuma, Nao Suzuki, Ichiro Takahashi, Masaki S. Yamaguchi, Keiichi Maeda, Masao Sako, Shiro Ikeda, Akisato Kimura, Mikio Morii, Naonori Ueda, Naoki Yoshida, Chien-Hsiu Lee, Sherry H. Suyu, Yutaka Komiyama, Nicolas Regnault, David Rubin

    Abstract: We present an overview of a deep transient survey of the COSMOS field with the Subaru Hyper Suprime-Cam (HSC). The survey was performed for the 1.77 deg$^2$ ultra-deep layer and 5.78 deg$^2$ deep layer in the Subaru Strategic Program over 6- and 4-month periods from 2016 to 2017, respectively. The ultra-deep layer shows a median depth per epoch of 26.4, 26.3, 26.0, 25.6, and 24.6 mag in $g$, $r$,… ▽ More

    Submitted 21 April, 2019; originally announced April 2019.

    Comments: 17 pages, 17 figures, accepted for publication in PASJ

  18. arXiv:1810.09712  [pdf, other

    physics.soc-ph cs.AI cs.LG stat.ML

    Finding Appropriate Traffic Regulations via Graph Convolutional Networks

    Authors: Tomoharu Iwata, Takuma Otsuka, Hitoshi Shimizu, Hiroshi Sawada, Futoshi Naya, Naonori Ueda

    Abstract: Appropriate traffic regulations, e.g. planned road closure, are important in congested events. Crowd simulators have been used to find appropriate regulations by simulating multiple scenarios with different regulations. However, this approach requires multiple simulation runs, which are time-consuming. In this paper, we propose a method to learn a function that outputs regulation effects given the… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

  19. arXiv:1810.03770  [pdf, ps, other

    stat.ML cs.LG

    Unsupervised Object Matching for Relational Data

    Authors: Tomoharu Iwata, Naonori Ueda

    Abstract: We propose an unsupervised object matching method for relational data, which finds matchings between objects in different relational datasets without correspondence information. For example, the proposed method matches documents in different languages in multi-lingual document-word networks without dictionaries nor alignment information. The proposed method assumes that each object has latent vect… ▽ More

    Submitted 26 December, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

  20. arXiv:1806.04838  [pdf, ps, other

    stat.ML cs.LG

    Partial AUC Maximization via Nonlinear Scoring Functions

    Authors: Naonori Ueda, Akinori Fu**o

    Abstract: We propose a method for maximizing a partial area under a receiver operating characteristic (ROC) curve (pAUC) for binary classification tasks. In binary classification tasks, accuracy is the most commonly used as a measure of classifier performance. In some applications such as anomaly detection and diagnostic testing, accuracy is not an appropriate measure since prior probabilties are often grea… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 9pages, 2 tables

  21. arXiv:1802.03039  [pdf, other

    stat.ML cs.LG cs.NE

    Few-shot learning of neural networks from scratch by pseudo example optimization

    Authors: Akisato Kimura, Zoubin Ghahramani, Koh Takeuchi, Tomoharu Iwata, Naonori Ueda

    Abstract: In this paper, we propose a simple but effective method for training neural networks with a limited amount of training data. Our approach inherits the idea of knowledge distillation that transfers knowledge from a deep or wide reference model to a shallow or narrow target model. The proposed method employs this idea to mimic predictions of reference estimators that are more robust against overfitt… ▽ More

    Submitted 5 July, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

    Comments: 14 pages, 2 figures, will be presented at BMVC2018

  22. arXiv:1711.11526  [pdf, other

    astro-ph.IM cs.CV

    Single-epoch supernova classification with deep convolutional neural networks

    Authors: Akisato Kimura, Ichiro Takahashi, Masaomi Tanaka, Naoki Yasuda, Naonori Ueda, Naoki Yoshida

    Abstract: Supernovae Type-Ia (SNeIa) play a significant role in exploring the history of the expansion of the Universe, since they are the best-known standard candles with which we can accurately measure the distance to the objects. Finding large samples of SNeIa and investigating their detailed characteristics have become an important issue in cosmology and astronomy. Existing methods relied on a photometr… ▽ More

    Submitted 30 November, 2017; originally announced November 2017.

    Comments: 7 pages, published as a workshop paper in ICDCS2017, in June 2017

    Journal ref: Published in: 2017 IEEE 37th International Conference on Distributed Computing Systems Workshops (ICDCSW)

  23. arXiv:1705.07603  [pdf, other

    stat.ML cs.LG

    Multi-output Polynomial Networks and Factorization Machines

    Authors: Mathieu Blondel, Vlad Niculae, Takuma Otsuka, Naonori Ueda

    Abstract: Factorization machines and polynomial networks are supervised polynomial models based on an efficient low-rank decomposition. We extend these models to the multi-output setting, i.e., for learning vector-valued functions, with application to multi-class or multi-task problems. We cast this as the problem of learning a 3-way tensor whose slices share a common basis and propose a convex formulation… ▽ More

    Submitted 4 November, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: Published at NIPS 2017. 17 pages, including appendix

  24. Machine-learning Selection of Optical Transients in Subaru/Hyper Suprime-Cam Survey

    Authors: Mikio Morii, Shiro Ikeda, Nozomu Tominaga, Masaomi Tanaka, Tomoki Morokuma, Katsuhiko Ishiguro, Junji Yamato, Naonori Ueda, Naotaka Suzuki, Naoki Yasuda, Naoki Yoshida

    Abstract: We present an application of machine-learning (ML) techniques to source selection in the optical transient survey data with Hyper Suprime-Cam (HSC) on the Subaru telescope. Our goal is to select real transient events accurately and in a timely manner out of a large number of false candidates, obtained with the standard difference-imaging method. We have developed the transient selector which is ba… ▽ More

    Submitted 11 September, 2016; originally announced September 2016.

    Comments: 9 pages, 6 figures. Accepted for publication in PASJ

  25. arXiv:1607.08810  [pdf, other

    stat.ML cs.LG

    Polynomial Networks and Factorization Machines: New Insights and Efficient Training Algorithms

    Authors: Mathieu Blondel, Masakazu Ishihata, Akinori Fu**o, Naonori Ueda

    Abstract: Polynomial networks and factorization machines are two recently-proposed models that can efficiently use feature interactions in classification and regression tasks. In this paper, we revisit both models from a unified perspective. Based on this new view, we study the properties of both models and propose new efficient training algorithms. Key to our approach is to cast parameter learning as a low… ▽ More

    Submitted 29 July, 2016; originally announced July 2016.

  26. arXiv:1607.07195  [pdf, other

    stat.ML cs.LG

    Higher-Order Factorization Machines

    Authors: Mathieu Blondel, Akinori Fu**o, Naonori Ueda, Masakazu Ishihata

    Abstract: Factorization machines (FMs) are a supervised learning approach that can use second-order feature combinations even when the data is very high-dimensional. Unfortunately, despite increasing interest in FMs, there exists to date no efficient training algorithm for higher-order FMs (HOFMs). In this paper, we present the first generic yet efficient algorithms for training arbitrary-order HOFMs. We al… ▽ More

    Submitted 14 October, 2016; v1 submitted 25 July, 2016; originally announced July 2016.

  27. arXiv:1409.4757  [pdf, other

    cs.LG stat.ML

    Collapsed Variational Bayes Inference of Infinite Relational Model

    Authors: Katsuhiko Ishiguro, Issei Sato, Naonori Ueda

    Abstract: The Infinite Relational Model (IRM) is a probabilistic model for relational data clustering that partitions objects into clusters based on observed relationships. This paper presents Averaged CVB (ACVB) solutions for IRM, convergence-guaranteed and practically useful fast Collapsed Variational Bayes (CVB) inferences. We first derive ordinary CVB and CVB0 for IRM based on the lower bound maximizati… ▽ More

    Submitted 16 September, 2014; originally announced September 2014.

  28. Flexible construction of hierarchical scale-free networks with general exponent

    Authors: J. C. Nacher, N. Ueda, M. Kanehisa, T. Akutsu

    Abstract: Extensive studies have been done to understand the principles behind architectures of real networks. Recently, evidences for hierarchical organization in many real networks have also been reported. Here, we present a new hierarchical model which reproduces the main experimental properties observed in real networks: scale-free of degree distribution $P(k)$ (frequency of the nodes that are connect… ▽ More

    Submitted 6 September, 2004; originally announced September 2004.

    Comments: RevTeX, 5 pages, 4 figures

    Journal ref: Phys. Rev. E 71 (2005) 036132

  29. Clustering under the line graph transformation: Application to reaction network

    Authors: J. C. Nacher, N. Ueda, T. Yamada, M. Kanehisa, T. Akutsu

    Abstract: Many real networks can be understood as two complementary networks with two kind of nodes. This is the case of metabolic networks where the first network has chemical compounds as nodes and the second one has nodes as reactions. The second network can be related to the first one by a technique called line graph transformation (i.e., edges in an initial network are transformed into nodes). Recent… ▽ More

    Submitted 18 August, 2004; v1 submitted 31 March, 2004; originally announced March 2004.

    Comments: 20 pages, 12 figures, REVTeX 4 style

    Journal ref: BMC Bioinformatics 5, 207 (2004)