-
Non-iterative optimization of pseudo-labeling thresholds for training object detection models from multiple datasets
Authors:
Yuki Tanaka,
Shuhei M. Yoshida,
Makoto Terao
Abstract:
We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain th…
▽ More
We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain the best result, however, thresholds for prediction confidence must be adjusted. This process typically involves iterative search and repeated training of student models and is time-consuming. Therefore, we develop a method to optimize the thresholds without iterative optimization by maximizing the $F_β$-score on a validation dataset, which measures the quality of pseudo labels and can be measured without training a student model. We experimentally demonstrate that our proposed method achieves an mAP comparable to that of grid search on the COCO and VOC datasets.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
Acquiring a Dynamic Light Field through a Single-Shot Coded Image
Authors:
Ryoya Mizuno,
Keita Takahashi,
Michitaka Yoshida,
Chihiro Tsutake,
Toshiaki Fujii,
Hajime Nagahara
Abstract:
We propose a method for compressively acquiring a dynamic light field (a 5-D volume) through a single-shot coded image (a 2-D measurement). We designed an imaging model that synchronously applies aperture coding and pixel-wise exposure coding within a single exposure time. This coding scheme enables us to effectively embed the original information into a single observed image. The observed image i…
▽ More
We propose a method for compressively acquiring a dynamic light field (a 5-D volume) through a single-shot coded image (a 2-D measurement). We designed an imaging model that synchronously applies aperture coding and pixel-wise exposure coding within a single exposure time. This coding scheme enables us to effectively embed the original information into a single observed image. The observed image is then fed to a convolutional neural network (CNN) for light-field reconstruction, which is jointly trained with the camera-side coding patterns. We also developed a hardware prototype to capture a real 3-D scene moving over time. We succeeded in acquiring a dynamic light field with 5x5 viewpoints over 4 temporal sub-frames (100 views in total) from a single observed image. Repeating capture and reconstruction processes over time, we can acquire a dynamic light field at 4x the frame rate of the camera. To our knowledge, our method is the first to achieve a finer temporal resolution than the camera itself in compressive light-field acquisition. Our software is available from our project webpage
△ Less
Submitted 26 April, 2022;
originally announced April 2022.
-
Computing Information Quantity as Similarity Measure for Music Classification Task
Authors:
Ayaka Takamoto,
Mitsuo Yoshida,
Kyoji Umemura,
Yuko Ichikawa
Abstract:
This paper proposes a novel method that can replace compression-based dissimilarity measure (CDM) in composer estimation task. The main features of the proposed method are clarity and scalability. First, since the proposed method is formalized by the information quantity, reproduction of the result is easier compared with the CDM method, where the result depends on a particular compression program…
▽ More
This paper proposes a novel method that can replace compression-based dissimilarity measure (CDM) in composer estimation task. The main features of the proposed method are clarity and scalability. First, since the proposed method is formalized by the information quantity, reproduction of the result is easier compared with the CDM method, where the result depends on a particular compression program. Second, the proposed method has a lower computational complexity in terms of the number of learning data compared with the CDM method. The number of correct results was compared with that of the CDM for the composer estimation task of five composers of 75 piano musical scores. The proposed method performed better than the CDM method that uses the file size compressed by a particular program.
△ Less
Submitted 15 April, 2018;
originally announced April 2018.
-
Improving Compression Based Dissimilarity Measure for Music Score Analysis
Authors:
Ayaka Takamoto,
Mayu Umemura,
Mitsuo Yoshida,
Kyoji Umemura
Abstract:
In this paper, we propose a way to improve the compression based dissimilarity measure, CDM. We propose to use a modified value of the file size, where the original CDM uses an unmodified file size. Our application is a music score analysis. We have chosen piano pieces from five different composers. We have selected 75 famous pieces (15 pieces for each composer). We computed the distances among al…
▽ More
In this paper, we propose a way to improve the compression based dissimilarity measure, CDM. We propose to use a modified value of the file size, where the original CDM uses an unmodified file size. Our application is a music score analysis. We have chosen piano pieces from five different composers. We have selected 75 famous pieces (15 pieces for each composer). We computed the distances among all pieces by using the modified CDM. We use the K-nearest neighbor method when we estimate the composer of each piece of music. The modified CDM shows improved accuracy. The difference is statistically significant.
△ Less
Submitted 3 October, 2017;
originally announced October 2017.