Seminar on Computational Learning and
Adaptation
Fractal Dimensions in Data Mining
Krishna Kumaraswamy
Center for Advanced Research
Price Waterhouse Coopers, LLP
The "fractal" dimension of
a data set is a number that is related to the 'degrees of freedom' of the
data, and the distribution of the data. In this talk, I will introduce
the idea of the intrinsic "fractal" dimension of a data set and
show how this can be used to aid in different data mining tasks. The main
interest is in answering questions about the performance of a method and
in comparing the performance of different methods quickly. In particular,
I will talk about two specific problems - dimensionality reduction and
vector quantization. In each of these methods, we show that the performance
of a method is related to the fractal dimension of the data set. Using
real and synthetic data sets, we show how we can use this for faster evaluation
and comparison of different methods.
Date: Wednesday, June 2
|
Time: 4:15-5:30PM
|
Place: Cordura 100
|
Return to the seminar schedule