Showing 1–33 of 33 results for author: Shinoda, K

  1. arXiv:2406.05312  [pdf, other]

    eess.IV

    Deep convolutional demosaicking network for multispectral polarization filter array

    Authors: Tomoharu Ishiuchi, Kazuma Shinoda

    Abstract: To address the demosaicking problem in multispectral polarization filter array (MSPFA) imaging, we propose a multispectral polarization demosaicking network (MSPDNet) that improves image reconstruction accuracy. Imaging with a multispectral polarization filter array acquires multispectral polarization information in a snapshot. The full-resolution multispectral polarization image must be reconstru…

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2212.13145  [pdf, other]

    econ.EM stat.ME stat.ML

    Orthogonal Series Estimation for the Ratio of Conditional Expectation Functions

    Authors: Kazuhiko Shinoda, Takahiro Hoshino

    Abstract: In various fields of data science, researchers are often interested in estimating the ratio of conditional expectation functions (CEFR). Specifically in causal inference problems, it is sometimes natural to consider ratio-based treatment effects, such as odds ratios and hazard ratios, and even difference-based treatment effects are identified as CEFR in some empirically relevant settings. This cha…

    Submitted 26 December, 2022; originally announced December 2022.

  3. arXiv:2211.16220  [pdf, other]

    cs.CL

    Which Shortcut Solution Do Question Answering Models Prefer to Learn?

    Authors: Kazutoshi Shinoda, Saku Sugawara, Akiko Aizawa

    Abstract: Question answering (QA) models for reading comprehension tend to learn shortcut solutions rather than the solutions intended by QA datasets. QA models that have learned shortcut solutions can achieve human-level performance in shortcut examples where shortcuts are valid, but these same behaviors degrade generalization potential on anti-shortcut examples where shortcuts are invalid. Various methods…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023

  4. arXiv:2211.16093  [pdf, ps, other]

    cs.CL

    Penalizing Confident Predictions on Largely Perturbed Inputs Does Not Improve Out-of-Distribution Generalization in Question Answering

    Authors: Kazutoshi Shinoda, Saku Sugawara, Akiko Aizawa

    Abstract: Question answering (QA) models are shown to be insensitive to large perturbations to inputs; that is, they make correct and confident predictions even when given largely perturbed inputs from which humans cannot correctly derive answers. In addition, QA models fail to generalize to other domains and adversarial test sets, while humans maintain high accuracy. Based on these observations, we assume…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to the KnowledgeNLP workshop at AAAI 2023

  5. arXiv:2210.14541  [pdf, other]

    cs.CL

    Look to the Right: Mitigating Relative Position Bias in Extractive Question Answering

    Authors: Kazutoshi Shinoda, Saku Sugawara, Akiko Aizawa

    Abstract: Extractive question answering (QA) models tend to exploit spurious correlations to make predictions when a training set has unintended biases. This tendency results in models not being generalizable to examples where the correlations do not hold. Determining the spurious correlations QA models can exploit is crucial in building generalizable QA models in real-world applications; moreover, a method…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to BlackboxNLP 2022

  6. arXiv:2207.13864  [pdf, ps, other]

    astro-ph.IM astro-ph.SR physics.space-ph

    Development of Fast and Precise Scan Mirror Mechanism for an Airborne Solar Telescope

    Authors: Takayoshi Oba, Toshifumi Shimizu, Yukio Katsukawa, Masahito Kubo, Yusuke Kawabata, Hirohisa Hara, Fumihiro Uraguchi, Toshihiro Tsuzuki, Tomonori Tamura, Kazuya Shinoda, Kazuhide Kodeki, Kazuhiko Fukushima, José Miguel Morales Fernández, Antonio Sánchez Gómez, María Balaguer Jimenéz, David Hernández Expósito, Achim Gandorfer

    Abstract: We developed a scan mirror mechanism (SMM) that enables a slit-based spectrometer or spectropolarimeter to precisely and quickly map an astronomical object. The SMM, designed to be installed in the optical path preceding the entrance slit, tilts a folding mirror and then moves the reflected image laterally on the slit plane, thereby feeding a different one-dimensional image to be dispersed by the s…

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 24 pages, 19 figures, accepted in Solar Physics

    Journal ref: Solar Physics 2022

  7. arXiv:2203.13694  [pdf, other]

    cs.CV

    Implicit Neural Representations for Variable Length Human Motion Generation

    Authors: Pablo Cervantes, Yusuke Sekikawa, Ikuro Sato, Koichi Shinoda

    Abstract: We propose an action-conditional human motion generation method using variational implicit neural representations (INR). The variational formalism enables action-conditional distributions of INRs, from which one can easily sample representations to generate novel human motion sequences. Our method offers variable-length sequence generation by construction because a part of INR is optimized for a w…

    Submitted 15 July, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted to ECCV 2022

  8. arXiv:2111.10202  [pdf, other]

    eess.AS cs.CL cs.SD

    Multimodal Emotion Recognition with High-level Speech and Text Features

    Authors: Mariana Rodrigues Makiuchi, Kuniaki Uto, Koichi Shinoda

    Abstract: Automatic emotion recognition is one of the central concerns of the Human-Computer Interaction field as it can bridge the gap between humans and machines. Current works train deep learning models on low-level data representations to solve the emotion recognition task. Since emotion datasets often have a limited amount of data, these approaches may suffer from overfitting, and they may learn based…

    Submitted 29 September, 2021; originally announced November 2021.

    Comments: Accepted at ASRU 2021. Code available at https://github.com/mmakiuchi/multimodal_emotion_recognition

  9. arXiv:2110.07031  [pdf, other]

    cs.AI cs.CL cs.HC

    Improving the Robustness to Variations of Objects and Instructions with a Neuro-Symbolic Approach for Interactive Instruction Following

    Authors: Kazutoshi Shinoda, Yuki Takezawa, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: An interactive instruction following task has been proposed as a benchmark for learning to map natural language instructions and first-person vision into sequences of actions to interact with objects in 3D environments. We found that an existing end-to-end neural model for this task tends to fail to interact with objects of unseen attributes and follow various instructions. We assume that this pro…

    Submitted 15 November, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted to the 29th International Conference on MultiMedia Modeling (MMM 2023)

  10. arXiv:2109.11256  [pdf, other]

    cs.CL cs.AI

    Can Question Generation Debias Question Answering Models? A Case Study on Question-Context Lexical Overlap

    Authors: Kazutoshi Shinoda, Saku Sugawara, Akiko Aizawa

    Abstract: Question answering (QA) models for reading comprehension have been demonstrated to exploit unintended dataset biases such as question-context lexical overlap. This hinders QA models from generalizing to under-represented samples such as questions with low lexical overlap. Question generation (QG), a method for augmenting QA datasets, can be a solution for such performance degradation if QG can pro…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: MRQA workshop at EMNLP 2021

  11. arXiv:2109.05175  [pdf, other]

    stat.ML cs.LG

    Estimation of Local Average Treatment Effect by Data Combination

    Authors: Kazuhiko Shinoda, Takahiro Hoshino

    Abstract: It is important to estimate the local average treatment effect (LATE) when compliance with a treatment assignment is incomplete. The previously proposed methods for LATE estimation required all relevant variables to be jointly observed in a single dataset; however, it is sometimes difficult or even impossible to collect such data in many real-world problems for technical or privacy reasons. We con…

    Submitted 21 March, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

  12. arXiv:2009.09209  [pdf, other]

    cs.CV cs.AI

    MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search

    Authors: Kengo Machida, Kuniaki Uto, Koichi Shinoda, Taiji Suzuki

    Abstract: In neural architecture search (NAS), differentiable architecture search (DARTS) has recently attracted much attention due to its high efficiency. It defines an over-parameterized network with mixed edges, each of which represents all operator candidates, and jointly optimizes the weights of the network and its architecture in an alternating manner. However, this method finds a model with the weigh…

    Submitted 15 March, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

  13. arXiv:2004.07992  [pdf, other]

    eess.AS cs.LG cs.SD q-bio.QM

    Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network

    Authors: Mariana Rodrigues Makiuchi, Tifani Warnita, Nakamasa Inoue, Koichi Shinoda, Michitaka Yoshimura, Momoko Kitazawa, Kei Funaki, Yoko Eguchi, Taishiro Kishimoto

    Abstract: We propose a non-invasive and cost-effective method to automatically detect dementia by utilizing solely speech audio data. We extract paralinguistic features for a short speech segment and use Gated Convolutional Neural Networks (GCNN) to classify it into dementia or healthy. We evaluate our method on the Pitt Corpus and on our own dataset, the PROMPT Database. Our method yields the accuracy of 7…

    Submitted 6 October, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

  14. arXiv:2004.03238  [pdf, other]

    cs.CL cs.AI cs.LG

    Improving the Robustness of QA Models to Challenge Sets with Variational Question-Answer Pair Generation

    Authors: Kazutoshi Shinoda, Saku Sugawara, Akiko Aizawa

    Abstract: Question answering (QA) models for reading comprehension have achieved human-level accuracy on in-distribution test sets. However, they have been demonstrated to lack robustness to challenge sets, whose distribution is different from that of training sets. Existing data augmentation methods mitigate this problem by simply augmenting training sets with synthetic examples sampled from the same distr…

    Submitted 3 June, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: ACL-IJCNLP 2021 SRW

  15. arXiv:2001.10642  [pdf, other]

    stat.ML cs.LG

    Binary Classification from Positive Data with Skewed Confidence

    Authors: Kazuhiko Shinoda, Hirotaka Kaji, Masashi Sugiyama

    Abstract: Positive-confidence (Pconf) classification [Ishida et al., 2018] is a promising weakly-supervised learning method which trains a binary classifier only from positive data equipped with confidence. However, in practice, the confidence may be skewed by bias arising in an annotation process. The Pconf classifier cannot be properly learned with skewed confidence, and consequently, the classification p…

    Submitted 28 January, 2020; originally announced January 2020.

  16. arXiv:1904.07386  [pdf, other]

    eess.AS cs.CL cs.SD

    I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

    Authors: Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, et al. (21 additional authors not shown)

    Abstract: The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE). The latest edition of such joint submission was in SRE 2018, in which the I4U submission was among the best-performing systems. SRE'18 also marks the 10-year anniversary of I4U consortium into NIST SRE series of evaluation. The primary objective of the current paper is to summarize the res…

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: 5 pages

  17. arXiv:1901.02262  [pdf, ps, other]

    cs.CL

    Multi-style Generative Reading Comprehension

    Authors: Kyosuke Nishida, Itsumi Saito, Kosuke Nishida, Kazutoshi Shinoda, Atsushi Otsuka, Hisako Asano, Junji Tomita

    Abstract: This study tackles generative reading comprehension (RC), which consists of answering questions based on textual evidence and natural language generation (NLG). We propose a multi-style abstractive summarization model for question answering, called Masque. The proposed model has two key characteristics. First, unlike most studies on RC that have focused on extracting an answer span from the provid…

    Submitted 27 May, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: Accepted as a long paper at ACL 2019

  18. arXiv:1811.04531  [pdf, other]

    cs.CL

    Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition

    Authors: Raden Mu'az Mun'im, Nakamasa Inoue, Koichi Shinoda

    Abstract: We investigate the feasibility of sequence-level knowledge distillation of Sequence-to-Sequence (Seq2Seq) models for Large Vocabulary Continuous Speech Recognition (LVCSR). We first use a pre-trained larger teacher model to generate multiple hypotheses per utterance with beam search. With the same input, we then train the student model using these hypotheses generated from the teacher as pseudo la…

    Submitted 11 November, 2018; originally announced November 2018.

  19. Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances

    Authors: Thao Minh Le, Nobuyuki Shimizu, Takashi Miyazaki, Koichi Shinoda

    Abstract: With the widespread use of intelligent systems, such as smart speakers, addressee recognition has become a concern in human-computer interaction, as more and more people expect such systems to understand complicated social scenes, including those outdoors, in cafeterias, and hospitals. Because previous studies typically focused only on pre-specified tasks with limited conversational situations suc…

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence Main track. Pages 1546-1553

  20. arXiv:1808.09106  [pdf]

    eess.IV

    Snapshot multispectral imaging using a filter array

    Authors: Kazuma Shinoda

    Abstract: A multispectral filter array (MSFA) is one solution for capturing a multispectral image (MSI) in a single shot at low cost. We introduce our optimization method of the spectral sensitivity of the MSFAs and demosaicking, and show a new prototype filter array for snapshot imaging based on a photonic crystal.

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: This paper has been submitted to International Workshop on Image Sensors and Imaging Systems (IWISS2018) (Invited talk)

    Journal ref: International Workshop on Image Sensors and Imaging Systems (IWISS2018)

  21. arXiv:1808.08021  [pdf, other]

    eess.IV

    Deep demosaicking for multispectral filter arrays

    Authors: Kazuma Shinoda, Shoichiro Yoshiba, Madoka Hasegawa

    Abstract: We propose a novel demosaicking method for multispectral filter arrays based on a deep convolutional neural network. The proposed method first interpolates mosaicked multispectral images utilizing a bilinear approach, then applies a residual network to initial demosaicked images. The residual network consists of various three-dimensional convolutional layers and a rectified linear unit for describ…

    Submitted 21 October, 2018; v1 submitted 24 August, 2018; originally announced August 2018.

  22. arXiv:1807.07203  [pdf, ps, other]

    cs.MM cs.CV

    Few-Shot Adaptation for Multimedia Semantic Indexing

    Authors: Nakamasa Inoue, Koichi Shinoda

    Abstract: We propose a few-shot adaptation framework, which bridges zero-shot learning and supervised many-shot learning, for semantic indexing of image and video data. Few-shot adaptation provides robust parameter estimation with few training examples, by optimizing the parameters of zero-shot learning and supervised many-shot learning simultaneously. In this method, first we build a zero-shot detector, an…

    Submitted 18 July, 2018; originally announced July 2018.

  23. arXiv:1807.01386  [pdf, other]

    eess.IV

    Optimal Spectral Sensitivity of Multispectral Filter Array for Pathological Images

    Authors: Kazuma Shinoda, Maru Kawase, Madoka Hasegawa, Masahiro Ishikawa, Hideki Komagata, Naoki Kobayashi

    Abstract: A capturing system with multispectral filter array (MSFA) technology has been researched to shorten the capturing time and reduce the cost. In this system, the mosaicked image captured by the MSFA is demosaicked to reconstruct multispectral images (MSIs). We focus on the spectral sensitivity design of an MSFA in this paper and propose a pathology-specific MSFA. The proposed method optimizes the MSF…

    Submitted 3 July, 2018; originally announced July 2018.

    Journal ref: Image Electronics and Visual Computing Workshop (IEVC), 1P-10, Mar. 2017

  24. arXiv:1807.01385  [pdf, other]

    eess.IV

    Joint optimization of multispectral filter arrays and demosaicking for pathological images

    Authors: Kazuma Shinoda, Maru Kawase, Madoka Hasegawa, Masahiro Ishikawa, Hideki Komagata, Naoki Kobayashi

    Abstract: A capturing system with multispectral filter array (MSFA) technology is proposed for shortening the capture time and reducing costs. Therein, a mosaicked image captured using an MSFA is demosaicked to reconstruct multispectral images (MSIs). Joint optimization of the spectral sensitivity of the MSFAs and demosaicking is considered, and pathology-specific multispectral imaging is proposed. This opt…

    Submitted 3 July, 2018; originally announced July 2018.

    Journal ref: IIEEJ Transactions on Image Electronics and Visual Computing, Vol. 6, No. 1, pp. 13-21, Jun. 2018

  25. arXiv:1805.11790  [pdf, other]

    cs.CV

    A Fine-to-Coarse Convolutional Neural Network for 3D Human Action Recognition

    Authors: Thao Minh Le, Nakamasa Inoue, Koichi Shinoda

    Abstract: This paper presents a new framework for human action recognition from a 3D skeleton sequence. Previous studies do not fully utilize the temporal relationships between video segments in a human action. Some studies successfully used very deep Convolutional Neural Network (CNN) models but often suffer from the data insufficiency problem. In this study, we first segment a skeleton sequence into disti…

    Submitted 18 August, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: Camera-ready manuscript for BMVC2018

  26. arXiv:1804.00290  [pdf, other]

    eess.AS cs.LG cs.SD

    I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification

    Authors: Jiacen Zhang, Nakamasa Inoue, Koichi Shinoda

    Abstract: I-vector based text-independent speaker verification (SV) systems often have poor performance with short utterances, as the biased phonetic distribution in a short utterance makes the extracted i-vector unreliable. This paper proposes an i-vector compensation method using a generative adversarial network (GAN), where its generator network is trained to generate a compensated i-vector from a short-…

    Submitted 1 April, 2018; originally announced April 2018.

  27. arXiv:1803.11344  [pdf, other]

    eess.AS cs.SD

    Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data

    Authors: Tifani Warnita, Nakamasa Inoue, Koichi Shinoda

    Abstract: We propose an automatic detection method of Alzheimer's disease using a gated convolutional neural network (GCNN) from speech data. This GCNN can be trained with a relatively small amount of data and can capture the temporal information in audio paralinguistic features. Since it does not utilize any linguistic features, it can be easily applied to any language. We evaluated our method using Pitt…

    Submitted 30 March, 2018; originally announced March 2018.

    Comments: 5 pages, 3 figures, submitted to INTERSPEECH 2018

  28. Attentive Statistics Pooling for Deep Speaker Embedding

    Authors: Koji Okabe, Takafumi Koshinaka, Koichi Shinoda

    Abstract: This paper proposes attentive statistics pooling for deep speaker embedding in text-independent speaker verification. In conventional speaker embedding, frame-level features are averaged over all the frames of a single utterance to form an utterance-level feature. Our method utilizes an attention mechanism to give different weights to different frames and generates not only weighted means but also…

    Submitted 24 February, 2019; v1 submitted 29 March, 2018; originally announced March 2018.

    Comments: Proc. Interspeech 2018, pp. 2252-2256. arXiv admin note: text overlap with arXiv:1809.09311

  29. arXiv:1801.03577  [pdf, ps, other]

    eess.IV

    Mosaicked multispectral image compression based on inter- and intra-band correlation

    Authors: Kazuma Shinoda, Madoka Hasegawa, Masahiro Yamaguchi, Antonio Ortega

    Abstract: Multispectral imaging has been utilized in many fields, but the cost of capturing and storing image data is still high. Single-sensor cameras with multispectral filter arrays can reduce the cost of capturing images at the expense of slightly lower image quality. When multispectral filter arrays are used, conventional multispectral image compression methods can be applied after interpolation, but t…

    Submitted 10 January, 2018; originally announced January 2018.

  30. Chaotic Griffiths Phase with Anomalous Lyapunov Spectra in Coupled Map Networks

    Authors: Kenji Shinoda, Kunihiko Kaneko

    Abstract: Dynamics of coupled chaotic oscillators on a network are studied using coupled maps. Within a broad range of parameter values representing the coupling strength or the degree of elements, the system repeats formation and split of coherent clusters. The distribution of the cluster size follows a power law with the exponent $α$, which changes with the parameter values. The number of positive Lyapuno…

    Submitted 14 May, 2016; v1 submitted 7 May, 2016; originally announced May 2016.

    Comments: 6 pages + 2 pages Supplement (6 figures + 3 Supplement Figures)

    Journal ref: Phys. Rev. Lett. 117, 254101 (2016)

  31. Classical and Quantum Cosmology of Multigravity

    Authors: Teruki Hanada, Koichiro Kobayashi, Kazuhiko Shinoda, Kiyoshi Shiraishi

    Abstract: Recently, a multigraviton theory on a simple closed circuit graph corresponding to the discretization of $S^1$ compactification of the Kaluza-Klein (KK) theory has been considered. In the present paper, we extend this theory to that on a general graph and study what modes of particles are included. Furthermore, we generalize it in a possible nonlinear theory based on the vierbein formalism and stu…

    Submitted 24 August, 2010; v1 submitted 29 April, 2010; originally announced April 2010.

    Comments: 17 pages, 15 figures, RevTeX4.1, revised version

    Journal ref: Class.Quant.Grav.27:225010,2010

  32. arXiv:0902.0103  [pdf, ps, other]

    gr-qc hep-th

    Cosmology of multigravity

    Authors: Teruki Hanada, Kazuhiko Shinoda, Kiyoshi Shiraishi

    Abstract: We have constructed a nonlinear multi-graviton theory. An application of this theory to cosmology is discussed. We found that scale factors in a solution for this theory repeat acceleration and deceleration.

    Submitted 5 February, 2009; v1 submitted 31 January, 2009; originally announced February 2009.

    Comments: 4 pages, 10 figures. Prepared for the proceedings of JGRG18 (Hiroshima, Japan, 17-21 November 2008). Corrected version, references added

  33. arXiv:0801.2641  [pdf, ps, other]

    gr-qc hep-th

    Multi-graviton theory in vierbein formalism

    Authors: Teruki Hanada, Kazuhiko Shinoda, Kiyoshi Shiraishi

    Abstract: Recently, multi-graviton theory on a simple closed circuit graph corresponding to the $S^1$ compactification of the Kaluza-Klein (KK) theory has been considered. In the present paper, we extend this theory to that on a general graph and study what modes of particles are included. Furthermore, we generalize it in a possible non-linear theory based on the vierbein formalism and study cosmological…

    Submitted 17 January, 2008; originally announced January 2008.

    Comments: 4 pages, no figure. A presentation given at JGRG17 (Nagoya, Japan), to appear in the proceedings