Seminar on Computational Learning and Adaptation


  Fractal Dimensions in Data Mining

Krishna Kumaraswamy
Center for Advanced Research
Price Waterhouse Coopers, LLP

The "fractal" dimension of a data set is a number related to the "degrees of freedom" of the data and to how the data are distributed. In this talk, I will introduce the idea of the intrinsic "fractal" dimension of a data set and show how it can aid different data mining tasks. The main interest is in answering questions about the performance of a method, and in comparing the performance of different methods, quickly. In particular, I will talk about two specific problems: dimensionality reduction and vector quantization. For each problem, we show that the performance of a method is related to the fractal dimension of the data set. Using real and synthetic data sets, we show how this relationship can be used for faster evaluation and comparison of different methods.
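
To make the idea of an intrinsic dimension concrete, here is a minimal sketch of one common estimator, the box-counting form of the correlation fractal dimension. It is an illustration only and not necessarily the estimator used in the talk; the function name correlation_dimension, the cell sizes, and the synthetic line and plane data are all assumptions made for this example. The estimate is the slope of log S(r) against log r, where S(r) is the sum of squared cell occupancy fractions on a grid of cell side r.

```python
import math
import random
from collections import Counter


def correlation_dimension(points, cell_sizes):
    """Estimate the correlation fractal dimension by box counting.

    Hypothetical helper for illustration: points is a list of equal-length
    tuples, cell_sizes is a list of grid cell side lengths r.
    """
    n = len(points)
    log_r, log_s = [], []
    for r in cell_sizes:
        # Assign each point to the grid cell of side r that contains it.
        cells = Counter(tuple(math.floor(x / r) for x in p) for p in points)
        # S(r): sum over occupied cells of the squared fraction of points.
        s = sum((count / n) ** 2 for count in cells.values())
        log_r.append(math.log(r))
        log_s.append(math.log(s))
    # Least-squares slope of log S(r) against log r.
    mean_r = sum(log_r) / len(log_r)
    mean_s = sum(log_s) / len(log_s)
    num = sum((a - mean_r) * (b - mean_s) for a, b in zip(log_r, log_s))
    den = sum((a - mean_r) ** 2 for a in log_r)
    return num / den


# Synthetic data (assumed for the example): both sets live in 3-D space,
# but the points lie on a line and on a plane respectively.
rng = random.Random(0)
cell_sizes = [1 / 2, 1 / 4, 1 / 8, 1 / 16]
line = [(t, t, t) for t in (rng.random() for _ in range(5000))]
plane = [(rng.random(), rng.random(), 0.0) for _ in range(5000)]

print("line :", correlation_dimension(line, cell_sizes))
print("plane:", correlation_dimension(plane, cell_sizes))
```

Both data sets have three attributes, yet the estimates should come out close to 1 and 2, which is the sense in which the fractal dimension reflects the degrees of freedom of the data rather than the number of attributes.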





Date: Wednesday, June 2

Time: 4:15-5:30PM

Place: Cordura 100

