Search | arXiv e-print repository

doi 10.1109/ICIP46576.2022.9898014

Non-iterative optimization of pseudo-labeling thresholds for training object detection models from multiple datasets

Authors: Yuki Tanaka, Shuhei M. Yoshida, Makoto Terao

Abstract: We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain the best result, however, thresholds for prediction confidence must be adjusted. This process typically involves iterative search and repeated training of student models and is time-consuming. Therefore, we develop a method to optimize the thresholds without iterative optimization by maximizing the $F_β$-score on a validation dataset, which measures the quality of pseudo labels and can be measured without training a student model. We experimentally demonstrate that our proposed method achieves an mAP comparable to that of grid search on the COCO and VOC datasets.
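As a rough illustration of the idea in the abstract above (not the authors' implementation), the sketch below picks a confidence threshold by scoring each candidate threshold with the F_β-score on labeled validation detections, so no student model has to be trained. The function names, the per-detection `is_correct` flags, the choice of candidate thresholds (the observed scores), and the toy data are all assumptions made for illustration.

```python
def f_beta(tp, fp, fn, beta=1.0):
    """F-beta score from counts; returns 0.0 when the score is undefined."""
    b2 = beta * beta
    denom = (1 + b2) * tp + b2 * fn + fp
    return (1 + b2) * tp / denom if denom > 0 else 0.0


def best_threshold(scores, is_correct, num_gt, beta=1.0):
    """Return (threshold, F-beta) maximizing F-beta over the observed scores.

    scores:     teacher confidences for detections on a validation set
    is_correct: True where the detection matches a ground-truth box
    num_gt:     total number of ground-truth boxes in the validation set
    A detection is kept as a pseudo label when its score >= threshold.
    Returns (None, -1.0) if there are no detections.
    """
    ranked = sorted(zip(scores, is_correct), key=lambda p: -p[0])
    best_thr, best_f = None, -1.0
    tp = fp = 0
    for k, (score, correct) in enumerate(ranked):
        if correct:
            tp += 1
        else:
            fp += 1
        # Evaluate only once all detections tied at this score are counted.
        if k + 1 < len(ranked) and ranked[k + 1][0] == score:
            continue
        f = f_beta(tp, fp, num_gt - tp, beta)
        if f > best_f:
            best_thr, best_f = score, f
    return best_thr, best_f


# Toy validation data (hypothetical): five detections, four ground-truth boxes.
scores = [0.9, 0.8, 0.7, 0.6, 0.3]
is_correct = [True, True, False, True, False]
print(best_threshold(scores, is_correct, num_gt=4))  # -> (0.6, 0.75)
```

One sorted pass over the validation detections evaluates every candidate threshold. In a multi-class setting this would presumably be run once per class and teacher, with β controlling the trade-off between precision and recall of the pseudo labels.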

Submitted 18 October, 2022; originally announced October 2022.

Comments: ICIP2022

Journal ref: 2022 IEEE International Conference on Image Processing (ICIP), 2022, pp. 1676-1680

arXiv:2204.12089 [pdf, other]

Acquiring a Dynamic Light Field through a Single-Shot Coded Image

Authors: Ryoya Mizuno, Keita Takahashi, Michitaka Yoshida, Chihiro Tsutake, Toshiaki Fujii, Hajime Nagahara

Abstract: We propose a method for compressively acquiring a dynamic light field (a 5-D volume) through a single-shot coded image (a 2-D measurement). We designed an imaging model that synchronously applies aperture coding and pixel-wise exposure coding within a single exposure time. This coding scheme enables us to effectively embed the original information into a single observed image. The observed image is then fed to a convolutional neural network (CNN) for light-field reconstruction, which is jointly trained with the camera-side coding patterns. We also developed a hardware prototype to capture a real 3-D scene moving over time. We succeeded in acquiring a dynamic light field with 5x5 viewpoints over 4 temporal sub-frames (100 views in total) from a single observed image. Repeating capture and reconstruction processes over time, we can acquire a dynamic light field at 4x the frame rate of the camera. To our knowledge, our method is the first to achieve a finer temporal resolution than the camera itself in compressive light-field acquisition. Our software is available from our project webpage.

Submitted 26 April, 2022; originally announced April 2022.

arXiv:1804.05486 [pdf, other]

doi 10.1109/ICAICTA.2017.8090990

Computing Information Quantity as Similarity Measure for Music Classification Task

Authors: Ayaka Takamoto, Mitsuo Yoshida, Kyoji Umemura, Yuko Ichikawa

Abstract: This paper proposes a novel method that can replace compression-based dissimilarity measure (CDM) in composer estimation task. The main features of the proposed method are clarity and scalability. First, since the proposed method is formalized by the information quantity, reproduction of the result is easier compared with the CDM method, where the result depends on a particular compression program. Second, the proposed method has a lower computational complexity in terms of the number of learning data compared with the CDM method. The number of correct results was compared with that of the CDM for the composer estimation task of five composers of 75 piano musical scores. The proposed method performed better than the CDM method that uses the file size compressed by a particular program.

Submitted 15 April, 2018; originally announced April 2018.

Comments: The 2017 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2017)

arXiv:1710.01446 [pdf, other]

Improving Compression Based Dissimilarity Measure for Music Score Analysis

Authors: Ayaka Takamoto, Mayu Umemura, Mitsuo Yoshida, Kyoji Umemura

Abstract: In this paper, we propose a way to improve the compression based dissimilarity measure, CDM. We propose to use a modified value of the file size, where the original CDM uses an unmodified file size. Our application is a music score analysis. We have chosen piano pieces from five different composers. We have selected 75 famous pieces (15 pieces for each composer). We computed the distances among all pieces by using the modified CDM. We use the K-nearest neighbor method when we estimate the composer of each piece of music. The modified CDM shows improved accuracy. The difference is statistically significant.
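For orientation, here is a minimal sketch of the original, unmodified CDM that the abstract above sets out to improve, in its commonly cited form CDM(x, y) = C(xy) / (C(x) + C(y)), where C is a compressed size. The abstract does not say how the file size is modified, so that modification is not reproduced here. The use of `zlib` as the compressor, the 1-nearest-neighbor helper (K = 1), and the toy byte strings are assumptions made for illustration.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of the zlib-compressed data (compressor is an assumption)."""
    return len(zlib.compress(data, 9))


def cdm(x: bytes, y: bytes) -> float:
    """Unmodified CDM: C(xy) / (C(x) + C(y)). Lower values mean more similar."""
    return compressed_size(x + y) / (compressed_size(x) + compressed_size(y))


def nearest_label(query: bytes, labeled):
    """1-nearest-neighbor label under CDM; labeled is a list of (data, label)."""
    return min(labeled, key=lambda item: cdm(query, item[0]))[1]


# Toy "scores" (hypothetical): repeated note sequences encoded as bytes.
piece_a = b"C4 E4 G4 C5 " * 40
piece_b = b"D4 F#4 A4 D5 B3 " * 40
labeled = [(piece_a, "composer A"), (piece_b, "composer B")]
print(nearest_label(piece_a, labeled))
```

Because the value depends on the compressor and its settings, results from this sketch are only comparable when the same compression program is used throughout.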

Submitted 3 October, 2017; originally announced October 2017.

Comments: The 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA2016)

Showing 1–4 of 4 results for author: Yoshida, M