Showing 1–36 of 36 results for author: Davidson, I

  1. arXiv:2406.12150  [pdf, other]

    cs.LG cs.AI

    ChaosMining: A Benchmark to Evaluate Post-Hoc Local Attribution Methods in Low SNR Environments

    Authors: Ge Shi, Ziwen Kan, Jason Smucny, Ian Davidson

    Abstract: In this study, we examine the efficacy of post-hoc local attribution methods in identifying features with predictive power from irrelevant ones in domains characterized by a low signal-to-noise ratio (SNR), a common scenario in real-world machine learning applications. We developed synthetic datasets encompassing symbolic functional, image, and audio data, incorporating a benchmark on the {\it (Mo…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 19 pages, 10 figures, submission to NeurIPS 2024

  2. arXiv:2403.18278  [pdf, other]

    cs.AI

    Identification and Uses of Deep Learning Backbones via Pattern Mining

    Authors: Michael Livanos, Ian Davidson

    Abstract: Deep learning is extensively used in many areas of data mining as a black-box method with impressive results. However, understanding the core mechanism of how deep learning makes predictions is a relatively understudied problem. Here we explore the notion of identifying a backbone of deep learning for a given group of instances. A group here can be instances of the same class or even misclassified…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 9 pages, 6 figures, published at SIAM SDM24

  3. arXiv:2402.05942  [pdf, other]

    cs.LG cs.AI

    Cooperative Knowledge Distillation: A Learner Agnostic Approach

    Authors: Michael Livanos, Ian Davidson, Stephen Wong

    Abstract: Knowledge distillation is a simple but powerful way to transfer knowledge from a teacher model to a student model. Existing work suffers from at least one of the following key limitations in terms of direction and scope of transfer which restrict its use: all knowledge is transferred from teacher to student regardless of whether or not that knowledge is useful, the student is the only one learn…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures, AAAI24

  4. arXiv:2311.03214  [pdf]

    physics.optics physics.app-ph physics.ins-det

    Double Clad Antiresonant Hollow Core Fiber and Its Comparison with other Fibres for Multiphoton Micro-Endoscopy

    Authors: Marzanna Szwaj, Ian A Davidson, Peter B Johnson, Greg Jasion, Yongmin Jung, Seyed Reza Sandoghchi, Krzysztof P Herdzik, Konstantinos N Bourdakos, Natalie V Wheeler, Hans Christian Mulvad, David J Richardson, Francesco Poletti, Sumeet Mahajan

    Abstract: In this work, we study a new hollow-core (air-filled) double-clad anti-resonant fiber (DC-ARF) as a potent candidate for multiphoton micro-endoscopy. We compare the fiber characteristics with a single-clad anti-resonant fiber (SC-ARF) and a solid core fiber (SCF). While the DC-ARF and the SC-ARF enable low-loss (<0.2 dB/m), close to dispersion-free excitation pulse delivery (<10% pulse width incr…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 29 pages, 13 figures

  5. arXiv:2310.19687  [pdf]

    cs.CY cs.CL

    Sentiment Analysis in Digital Spaces: An Overview of Reviews

    Authors: Laura E. M. Ayravainen, Joanne Hinds, Brittany I. Davidson

    Abstract: Sentiment analysis (SA) is commonly applied to digital textual data, revealing insight into opinions and feelings. Many systematic reviews have summarized existing work, but often overlook discussions of validity and scientific practices. Here, we present an overview of reviews, synthesizing 38 systematic reviews, containing 2,275 primary studies. We devise a bespoke quality assessment framework d…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 44 pages, 4 figures, 6 tables, 3 appendices

    ACM Class: I.2.7

  6. arXiv:2310.13875  [pdf]

    physics.optics

    Mode attraction, rejection and control in nonlinear multimode optics

    Authors: Kunhao Ji, Ian Davidson, Jayantha Sahu, David J. Richardson, Stefan Wabnitz, Massimiliano Guasoni

    Abstract: Novel fundamental notions helping in the interpretation of the complex dynamics of nonlinear systems are essential to our understanding and ability to exploit them. In this work we predict and demonstrate experimentally a fundamental property of Kerr-nonlinear media, which we name mode rejection and which takes place when two intense counter-propagating beams interact in a multimode waveguide. In stark…

    Submitted 20 October, 2023; originally announced October 2023.

  7. arXiv:2305.16911  [pdf, other]

    physics.optics

    On-target delivery of intense ultrafast laser pulses through hollow-core anti-resonant fibers

    Authors: Athanasios Lekosiotis, Federico Belli, Christian Brahms, Mohammed Sabbah, Hesham Sakr, Ian A. Davidson, Francesco Poletti, John C. Travers

    Abstract: We report the flexible on-target delivery of 800 nm wavelength, 5 GW peak power, 40 fs duration laser pulses through an evacuated and tightly coiled 10 m long hollow-core nested anti-resonant fiber by positively chirping the input pulses to compensate for the anomalous dispersion of the fiber. Near-transform-limited output pulses with high beam quality and a guided peak intensity of 3 PW/cm² were…

    Submitted 7 September, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2305.02770  [pdf, other]

    cs.CY cs.CL stat.AP

    The Politics of Language Choice: How the Russian-Ukrainian War Influences Ukrainians' Language Use on Twitter

    Authors: Daniel Racek, Brittany I. Davidson, Paul W. Thurner, Xiao Xiang Zhu, Göran Kauermann

    Abstract: The use of language is innately political and often a vehicle of cultural identity as well as the basis for nation building. Here, we examine language choice and tweeting activity of Ukrainian citizens based on more than 4 million geo-tagged tweets from over 62,000 users before and during the Russian-Ukrainian War, from January 2020 to October 2022. Using statistical models, we disentangle sample…

    Submitted 6 June, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  9. arXiv:2210.17334  [pdf, other]

    astro-ph.GA astro-ph.IM

    COSMOS2020: Identification of High-z Protocluster Candidates in COSMOS

    Authors: Malte Brinch, Thomas R. Greve, John R. Weaver, Gabriel Brammer, Olivier Ilbert, Marko Shuntov, Shuowen Jin, Daizhong Liu, Clara Giménez-Arteaga, Caitlin M. Casey, Iary Davidson, Seiji Fujimoto, Anton M. Koekemoer, Vasily Kokorev, Georgios Magdis, H. J. McCracken, Conor J. R. McPartland, Bahram Mobasher, David B. Sanders, Sune Toft, Francesco Valentino, Giovanni Zamorani, Jorge Zavala

    Abstract: We conduct a systematic search for protocluster candidates at $z \geq 6$ in the COSMOS field using the recently released COSMOS2020 source catalog. We select galaxies using a number of selection criteria to obtain a sample of galaxies that have a high probability of being inside a given redshift bin. We then apply overdensity analysis to the bins using two density estimators, a Weighted Adaptive K…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 52 pages, 32 figures, 18 tables, main text is 30 pages, appendix is 22 pages, to be published in ApJ

  10. arXiv:2210.16435  [pdf, other]

    cs.LG stat.ML

    Scalable Spectral Clustering with Group Fairness Constraints

    Authors: Ji Wang, Ding Lu, Ian Davidson, Zhaojun Bai

    Abstract: There are synergies of research interests and industrial efforts in modeling fairness and correcting algorithmic bias in machine learning. In this paper, we present a scalable algorithm for spectral clustering (SC) with group fairness constraints. Group fairness is also known as statistical parity where in each cluster, each protected group is represented with the same proportion as in the entiret…

    Submitted 14 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR 206:6613-6629, 2023

  11. arXiv:2209.11762  [pdf, other]

    cs.AI cs.CY cs.LG

    Towards Auditing Unsupervised Learning Algorithms and Human Processes For Fairness

    Authors: Ian Davidson, S. S. Ravi

    Abstract: Existing work on fairness typically focuses on making known machine learning algorithms fairer. Fair variants of classification, clustering, outlier detection and other styles of algorithms exist. However, an understudied area is the topic of auditing an algorithm's output to determine fairness. Existing work has explored the two group classification problem for binary protected status variables u…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 22 pages, 3 figures

  12. arXiv:2209.09670  [pdf, other]

    cs.AI cs.LG

    Explainable Clustering via Exemplars: Complexity and Efficient Approximation Algorithms

    Authors: Ian Davidson, Michael Livanos, Antoine Gourru, Peter Walker, Julien Velcin, S. S. Ravi

    Abstract: Explainable AI (XAI) is an important developing area but remains relatively understudied for clustering. We propose an explainable-by-design clustering approach that not only finds clusters but also exemplars to explain each cluster. The use of exemplars for understanding is supported by the exemplar-based school of concept definition in psychology. We show that finding a small set of exemplars to…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 22 pages; 4 figures

  13. arXiv:2203.08557  [pdf, ps, other]

    cs.CR

    How darknet market users learned to worry more and love PGP: Analysis of security advice on darknet marketplaces

    Authors: Andrew C. Dwyer, Joseph Hallett, Claudia Peersman, Matthew Edwards, Brittany I. Davidson, Awais Rashid

    Abstract: Darknet marketplaces, accessible through Tor, are where users can buy illicit goods and learn to hide from law enforcement. We surveyed the advice on these markets and found valid security advice mixed up with paranoid threat models and a reliance on privacy tools dismissed as unusable by the mainstream.

    Submitted 16 March, 2022; originally announced March 2022.

  14. On the role of technology in human-dog relationships: a future of nightmares or dreams?

    Authors: Dirk van der Linden, Brittany I. Davidson, Orit Hirsch-Matsioulas, Anna Zamansky

    Abstract: Digital technologies that help people take care of their dogs are becoming more widespread. Yet, little research explores what the role of technology in the human-dog relationship should be. We conducted a qualitative study incorporating quantitative and thematic analysis of 155 UK dog owners reflecting on their daily routines and technology's role in it, disentangling the what-where-why of inters…

    Submitted 24 September, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Journal ref: IEEE Transactions on Technology and Society (2022)

  15. arXiv:2105.14146  [pdf, other]

    cs.LG cs.CY stat.ML

    Deep Fair Discriminative Clustering

    Authors: Hongjing Zhang, Ian Davidson

    Abstract: Deep clustering has the potential to learn a strong representation and hence better clustering performance compared to traditional clustering methods such as $k$-means and spectral clustering. However, this strong representation learning ability may make the clustering unfair by discovering surrogates for protected information which we empirically show in our experiments. In this work, we study a…

    Submitted 28 May, 2021; originally announced May 2021.

  16. arXiv:2105.11549  [pdf, other]

    cs.LG cs.AI

    Deep Descriptive Clustering

    Authors: Hongjing Zhang, Ian Davidson

    Abstract: Recent work on explainable clustering allows describing clusters when the features are interpretable. However, much modern machine learning focuses on complex data such as images, text, and graphs where deep learning is used but the raw features of data are not interpretable. This paper explores a novel setting for performing clustering on complex data while simultaneously generating explanations…

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: Paper accepted at IJCAI 2021

  17. arXiv:2101.02792  [pdf, other]

    cs.LG

    A Framework for Deep Constrained Clustering

    Authors: Hongjing Zhang, Tianyang Zhan, Sugato Basu, Ian Davidson

    Abstract: The area of constrained clustering has been extensively explored by researchers and used by practitioners. Constrained clustering formulations exist for popular algorithms such as k-means, mixture models, and spectral clustering but have several limitations. A fundamental strength of deep learning is its flexibility, and here we explore a deep learning framework for constrained clustering and in p…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: Data Mining and Knowledge Discovery, 2021. arXiv admin note: substantial text overlap with arXiv:1901.10061

  18. arXiv:2012.14961  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Fair Deep Anomaly Detection

    Authors: Hongjing Zhang, Ian Davidson

    Abstract: Anomaly detection aims to find instances that are considered unusual and is a fundamental problem of data science. Recently, deep anomaly detection methods were shown to achieve superior results particularly in complex data such as images. Our work focuses on deep one-class classification for anomaly detection which learns a mapping only from the normal samples. However, the non-linear transformat…

    Submitted 29 December, 2020; originally announced December 2020.

    Comments: Accepted for publication at the ACM Conference on Fairness, Accountability, and Transparency 2021 (ACM FAccT'21)

  19. Block Model Guided Unsupervised Feature Selection

    Authors: Zilong Bai, Hoa Nguyen, Ian Davidson

    Abstract: Feature selection is a core area of data mining with a recent innovation of graph-driven unsupervised feature selection for linked data. In this setting we have a dataset $\mathbf{Y}$ consisting of $n$ instances each with $m$ features and a corresponding $n$ node graph (whose adjacency matrix is $\mathbf{A}$) with an edge indicating that the two instances are similar. Existing efforts for unsuperv…

    Submitted 5 July, 2020; originally announced July 2020.

    Comments: Published at KDD2020

    Journal ref: Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD2020)

  20. arXiv:2002.02487  [pdf, other]

    cs.DS cs.AI cs.DM math.OC

    Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability

    Authors: Prathyush Sambaturu, Aparna Gupta, Ian Davidson, S. S. Ravi, Anil Vullikanti, Andrew Warren

    Abstract: Improving the explainability of the results from machine learning methods has become an important research goal. Here, we study the problem of making clusters more interpretable by extending a recent approach of [Davidson et al., NeurIPS 2018] for constructing succinct representations for clusters. Given a set of objects $S$, a partition $\pi$ of $S$ (into clusters), and a universe $T$ of tags such…

    Submitted 6 February, 2020; originally announced February 2020.

    MSC Class: 68W25; 68T01; 68R05 ACM Class: G.2; I.2; F.2

  21. arXiv:2001.11143  [pdf, other]

    cs.LG stat.ML

    A Graph-Based Approach for Active Learning in Regression

    Authors: Hongjing Zhang, S. S. Ravi, Ian Davidson

    Abstract: Active learning aims to reduce labeling efforts by selectively asking humans to annotate the most important data points from an unlabeled pool and is an example of human-machine interaction. Though active learning has been extensively researched for classification and ranking problems, it is relatively understudied for regression problems. Most existing active learning for regression methods use t…

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: SDM 2020 camera-ready. 9 pages, 4 figures, links to supplementary material available at https://sdm2020.s3-us-west-1.amazonaws.com/supplementary.pdf

  22. arXiv:1911.02617  [pdf, ps, other]

    cs.LG cs.AI

    Coverage-based Outlier Explanation

    Authors: Yue Wu, Leman Akoglu, Ian Davidson

    Abstract: Outlier detection is a core task in data mining with a plethora of algorithms that have enjoyed wide scale usage. Existing algorithms are primarily focused on detection, that is, the identification of outliers in a given dataset. In this paper we explore the relatively under-studied outlier explanation problem. Our goal is, given a dataset that is already divided into outliers and no…

    Submitted 6 November, 2019; originally announced November 2019.

  23. arXiv:1901.10061  [pdf, other]

    cs.LG stat.ML

    A Framework for Deep Constrained Clustering -- Algorithms and Advances

    Authors: Hongjing Zhang, Sugato Basu, Ian Davidson

    Abstract: The area of constrained clustering has been extensively explored by researchers and used by practitioners. Constrained clustering formulations exist for popular algorithms such as k-means, mixture models, and spectral clustering but have several limitations. A fundamental strength of deep learning is its flexibility, and here we explore a deep learning framework for constrained clustering and in p…

    Submitted 19 December, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Comments: Updated for ECML/PKDD 2019

  24. arXiv:1901.10053  [pdf, other]

    cs.LG stat.ML

    Towards Fair Deep Clustering With Multi-State Protected Variables

    Authors: Bokun Wang, Ian Davidson

    Abstract: Fair clustering under the disparate impact doctrine requires that the population of each protected group be approximately equal in every cluster. Previous work investigated a difficult-to-scale pre-processing step for $k$-center and $k$-median style algorithms for the special case of this problem when the number of protected groups is two. In this work, we consider a more general and practical…

    Submitted 28 January, 2019; originally announced January 2019.

    Comments: Under review as a conference paper at ICML 2019

  25. arXiv:1810.05357  [pdf, other]

    cs.DB cs.AI

    On The Equivalence of Tries and Dendrograms - Efficient Hierarchical Clustering of Traffic Data

    Authors: Chia-Tung Kuo, Ian Davidson

    Abstract: The widespread use of GPS-enabled devices generates voluminous and continuous amounts of traffic data, but analyzing such data for interpretable and actionable insights poses challenges. A hierarchical clustering of the trips has many uses such as discovering shortest paths, common routes and often traversed areas. However, hierarchical clustering typically has time complexity of $O(n^2 \log n)$ wh…

    Submitted 12 October, 2018; originally announced October 2018.

  26. arXiv:1804.01575  [pdf, ps, other]

    cs.LG stat.ML

    Probabilistic Formulations of Regression with Mixed Guidance

    Authors: Aubrey Gress, Ian Davidson

    Abstract: Regression problems assume every instance is annotated (labeled) with a real value, a form of annotation we call \emph{strong guidance}. In order for these annotations to be accurate, they must be the result of a precise experiment or measurement. However, in some cases additional \emph{weak guidance} might be given by imprecise measurements, a domain expert or even crowd sourcing. Current formula…

    Submitted 1 April, 2018; originally announced April 2018.

    Comments: Appeared in ICDM 2016

  27. arXiv:1712.08855  [pdf, other]

    cs.LG

    Transfer Regression via Pairwise Similarity Regularization

    Authors: Aubrey Gress, Ian Davidson

    Abstract: Transfer learning methods address the situation where little labeled training data from the "target" problem exists, but much training data from a related "source" domain is available. However, the overwhelming majority of transfer learning methods are designed for simple settings where the source and target predictive functions are almost identical, limiting the applicability of transfer learning…

    Submitted 23 December, 2017; originally announced December 2017.

  28. arXiv:1705.08881  [pdf, other]

    cs.CV cs.LG cs.NE stat.ML

    Dense Transformer Networks

    Authors: Jun Li, Yongjun Chen, Lei Cai, Ian Davidson, Shuiwang Ji

    Abstract: The key idea of current deep learning methods for dense prediction is to apply a model on a regular patch centered on each pixel to make pixel-wise predictions. These methods are limited in the sense that the patches are determined by network architecture instead of learned from data. In this work, we propose the dense transformer networks, which can learn the shapes and sizes of patches from data…

    Submitted 7 June, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

  29. The VIMOS Public Extragalactic Redshift Survey (VIPERS). The matter density and baryon fraction from the galaxy power spectrum at redshift $0.6<z<1.1$

    Authors: S. Rota, B. R. Granett, J. Bel, L. Guzzo, J. A. Peacock, M. J. Wilson, A. Pezzotta, S. de la Torre, B. Garilli, M. Bolzonella, M. Scodeggio, U. Abbas, C. Adami, D. Bottini, A. Cappi, O. Cucciati, I. Davidson, P. Franzetti, A. Fritz, A. Iovino, J. Krywult, V. Le Brun, O. Le Fèvre, D. Mascagni, K. Małek, et al. (15 additional authors not shown)

    Abstract: We use the final catalogue of the VIMOS Public Extragalactic Redshift Survey (VIPERS) to measure the power spectrum of the galaxy distribution at high redshift, presenting results that extend beyond $z=1$ for the first time. We apply an FFT technique to four independent sub-volumes comprising a total of $51,728$ galaxies at $0.6<z<1.1$ (out of the nearly $90,000$ included in the whole survey). We…

    Submitted 15 June, 2017; v1 submitted 21 November, 2016; originally announced November 2016.

    Comments: 16 pages, 10 figures

    Journal ref: A&A 601, A144 (2017)

  30. arXiv:1610.09993  [pdf, ps, other]

    math.OC

    Rank Restricted Semidefinite Matrices and Image Closedness

    Authors: Ian Davidson, Henry Wolkowicz

    Abstract: We study the closure of the projection of the (nonconvex) cone of rank restricted positive semidefinite matrices onto subsets of the matrix entries. This defines the feasible sets for semidefinite completion problems with restrictions on the ranks. Applications include conditions for low-rank completions using the nuclear norm heuristic.

    Submitted 31 October, 2016; originally announced October 2016.

    Comments: 13 pages

    MSC Class: 90C22; 90C46; 52A99

  31. arXiv:1609.02646  [pdf, other]

    cs.AI

    Some Advances in Role Discovery in Graphs

    Authors: Sean Gilpin, Chia-Tung Kuo, Tina Eliassi-Rad, Ian Davidson

    Abstract: Role discovery in graphs is an emerging area that allows analysis of complex graphs in an intuitive way. In contrast to other graph problems such as community discovery, which finds groups of highly connected nodes, the role discovery problem finds groups of nodes that share similar graph topological structure. However, existing work so far has two severe limitations that prevent its use in some…

    Submitted 8 September, 2016; originally announced September 2016.

  32. arXiv:1407.8147  [pdf, other]

    cs.LG cs.CE

    Stochastic Coordinate Coding and Its Application for Drosophila Gene Expression Pattern Annotation

    Authors: Binbin Lin, Qingyang Li, Qian Sun, Ming-Jun Lai, Ian Davidson, Wei Fan, Jieping Ye

    Abstract: \textit{Drosophila melanogaster} has been established as a model organism for investigating the fundamental principles of developmental gene interactions. The gene expression patterns of \textit{Drosophila melanogaster} can be documented as digital images, which are annotated with anatomical ontology terms to facilitate pattern discovery and comparison. The automated annotation of gene expression…

    Submitted 9 December, 2014; v1 submitted 30 July, 2014; originally announced July 2014.

  33. arXiv:1301.3851  [pdf]

    cs.LG stat.ML

    Minimum Message Length Clustering Using Gibbs Sampling

    Authors: Ian Davidson

    Abstract: The K-Means and EM algorithms are popular in clustering and mixture modeling, due to their simplicity and ease of implementation. However, they have several significant limitations. Both converge to a local optimum of their respective objective functions (ignoring the uncertainty in the model space), require the a priori specification of the number of classes/clusters, and are inconsistent. In this…

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-160-167

  34. arXiv:1202.0855  [pdf, ps, other]

    cs.LG stat.ML

    A Reconstruction Error Formulation for Semi-Supervised Multi-task and Multi-view Learning

    Authors: Buyue Qian, Xiang Wang, Ian Davidson

    Abstract: A significant challenge in making learning techniques more suitable for general purpose use is to move beyond i) complete supervision, ii) low dimensional data, iii) a single task and single view per instance. Solving these challenges allows working with "Big Data" problems that are typically high dimensional with multiple (but possibly incomplete) labelings and views. While other work has addressed…

    Submitted 3 February, 2012; originally announced February 2012.

  35. On Constrained Spectral Clustering and Its Applications

    Authors: Xiang Wang, Buyue Qian, Ian Davidson

    Abstract: Constrained clustering has been well-studied for algorithms such as $K$-means and hierarchical clustering. However, how to satisfy many constraints in these algorithmic settings has been shown to be intractable. One alternative to encode many constraints is to use spectral clustering, which remains a developing area. In this paper, we propose a flexible framework for constrained spectral clusterin…

    Submitted 21 September, 2012; v1 submitted 25 January, 2012; originally announced January 2012.

    Comments: Data Mining and Knowledge Discovery, 2012

    ACM Class: H.2.8

  36. The LSST Data Mining Research Agenda

    Authors: K. D. Borne, J. Becla, I. Davidson, A. Szalay, J. A. Tyson

    Abstract: We describe features of the LSST science database that are amenable to scientific data mining, object classification, outlier identification, anomaly detection, image quality assurance, and survey science validation. The data mining research agenda includes: scalability (at petabyte scales) of existing machine learning and data mining algorithms; development of grid-enabled parallel data mining…

    Submitted 2 November, 2008; originally announced November 2008.

    Comments: 5 pages, Presented at the "Classification and Discovery in Large Astronomical Surveys" meeting, Ringberg Castle, 14-17 October, 2008