Estimating Photometric Redshifts of Quasars via K-nearest Neighbor Approach Based on Large Survey Databases
Authors:
Zhang Yanxia,
Ma He,
Peng Nanbo,
Zhao Yongheng,
Wu Xue-bing
Abstract:
We apply a lazy learning method, the k-nearest neighbor algorithm (kNN), to estimate the photometric redshifts of quasars, based on various datasets from the Sloan Digital Sky Survey (SDSS), the UKIRT Infrared Deep Sky Survey (UKIDSS) and the Wide-field Infrared Survey Explorer (WISE): the SDSS sample, the SDSS-UKIDSS sample, the SDSS-WISE sample and the SDSS-UKIDSS-WISE sample. The influence of the k value and of different input patterns on the performance of kNN is discussed. The k value and input pattern at which kNN performs best differ from dataset to dataset. The best result is obtained with the SDSS-UKIDSS-WISE sample. The experimental results show that, in general, the more information available from more bands, the better the performance of photometric redshift estimation with kNN. The results also demonstrate that kNN using multiband data can effectively address the catastrophic failures of photometric redshift estimation, which are encountered by many machine learning methods. In a comparison of various methods for photometric redshift estimation of quasars, kNN based on a KD-tree achieves the best accuracy for our case.
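As a rough illustration of the idea in this abstract, kNN regression can be sketched as averaging the spectroscopic redshifts of the k training objects closest to a query object in feature space. The sketch below is not the authors' implementation: it uses a brute-force search instead of a KD-tree, the function name `knn_redshift` is hypothetical, and the two-colour training set is made up for the example.

```python
import math

def knn_redshift(train_features, train_z, query, k=3):
    """Estimate a redshift as the mean redshift of the k nearest training objects.

    Brute-force search over all training objects (an assumption made for
    brevity; the abstract describes a KD-tree based kNN).
    """
    # Pair each training redshift with its Euclidean distance to the query.
    by_distance = sorted(
        (math.dist(features, query), z)
        for features, z in zip(train_features, train_z)
    )
    nearest = by_distance[:k]
    return sum(z for _, z in nearest) / len(nearest)

# Made-up toy data: two colours per object and a known redshift for each.
train_features = [(0.1, 0.2), (0.2, 0.1), (0.9, 0.8), (1.0, 1.0)]
train_z = [0.5, 0.7, 2.0, 2.2]

# With k=2 the two nearest objects have z=0.5 and z=0.7, so the estimate is their mean.
z_est = knn_redshift(train_features, train_z, query=(0.15, 0.15), k=2)
print(z_est)
```

Both k and the choice of input features (the "input pattern") are free parameters here, which is why the abstract discusses their influence per dataset.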
Submitted 22 May, 2013;
originally announced May 2013.
Support Vector Machines and Kd-tree for Separating Quasars from Large Survey Databases
Authors:
Gao Dan,
Zhang Yanxia,
Zhao Yongheng
Abstract:
We compare the performance of two automated classification algorithms, k-dimensional tree (kd-tree) and support vector machines (SVMs), in separating quasars from stars in the catalogs of the Sloan Digital Sky Survey (SDSS) and the Two Micron All Sky Survey (2MASS). The two algorithms are trained on subsets of SDSS and 2MASS objects whose nature is known via spectroscopy. We choose different attribute combinations as input patterns to train the classifiers using photometric data only, and present the classification results obtained by the two methods. Performance metrics such as precision and recall, true positive rate and true negative rate, F-measure, G-mean and Weighted Accuracy are computed to evaluate the performance of the two algorithms. The study shows that both kd-tree and SVMs are effective automated algorithms for classifying point sources. SVMs show slightly higher accuracy, but kd-tree requires less computation time. Given different input patterns based on various parameters (e.g. magnitudes, color information), we conclude that both kd-tree and SVMs perform better with fewer features. Moreover, our results indicate that the highest accuracy is reached using the four colors (u-g, g-r, r-i, i-z) and the r magnitude based on SDSS model magnitudes. The classifiers trained by kd-tree and SVMs can be used to solve the automated classification problems faced by the virtual observatory (VO); both can also be applied to the photometric preselection of quasar candidates for large survey projects in order to optimize the efficiency of telescopes.
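The evaluation metrics named in this abstract can all be derived from a binary confusion matrix. The sketch below uses the common textbook definitions, which are an assumption here since the abstract does not spell them out: G-mean as the geometric mean of the true positive and true negative rates, and Weighted Accuracy as a weighted sum of those two rates with a hypothetical `weight` parameter. The function name `binary_metrics` and the labels are made up for illustration (1 = quasar, 0 = star).

```python
import math

def binary_metrics(y_true, y_pred, weight=0.5):
    """Compute common binary classification metrics from true and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)

    # Guard against empty denominators by falling back to 0.0.
    precision = tp / (tp + fp) if tp + fp else 0.0
    tpr = tp / (tp + fn) if tp + fn else 0.0  # recall / true positive rate
    tnr = tn / (tn + fp) if tn + fp else 0.0  # true negative rate
    f_measure = (
        2 * precision * tpr / (precision + tpr) if precision + tpr else 0.0
    )
    return {
        "precision": precision,
        "recall": tpr,
        "tnr": tnr,
        "f_measure": f_measure,
        "g_mean": math.sqrt(tpr * tnr),
        # Assumed definition: weighted sum of the two per-class rates.
        "weighted_accuracy": weight * tpr + (1 - weight) * tnr,
    }

# Made-up labels: 4 quasars and 6 stars, with one error in each class.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Metrics such as G-mean and Weighted Accuracy are useful when the classes are imbalanced, because plain accuracy can look high even when the minority class is poorly recovered.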
Submitted 4 February, 2008;
originally announced February 2008.