Dimensionality Reduction in Data Mining: Copula Approach

Rima Houari 1 Ahcène Bounceur 2, 3 Tahar Kechadi 4 Abdelkamel Tari 5 Reinhardt Euler 6
2 Lab-STICC_UBO_CACS_MOCS
UBO - Université de Brest, Lab-STICC - Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance
6 Lab-STICC_UBS_CACS_MOCS
Lab-STICC - Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance
Abstract : Sampling-based dimensionality reduction technique•Eliminating linearly redundant combined dimensions•Providing a convenient way to generate correlated multivariate random variables•Maintaining the integrity of the original information•Reducing the dimension of data space without losing important information The recent trends in collecting huge and diverse datasets have created a great challenge in data analysis. One of the characteristics of these gigantic datasets is that they often have significant amounts of redundancies. The use of very large multi-dimensional data will result in more noise, redundant data, and the possibility of unconnected data entities. To efficiently manipulate data represented in a high-dimensional space and to address the impact of redundant dimensions on the final results, we propose a new technique for the dimensionality reduction using Copulas and the LU-decomposition (Forward Substitution) method. The proposed method is compared favorably with existing approaches on real-world datasets: Diabetes, Waveform, two versions of Human Activity Recognition based on Smartphone, and Thyroid Datasets taken from machine learning repository in terms of dimensionality reduction and efficiency of the method, which are performed on statistical and classification measures.
Type de document :
Article dans une revue
International Journal of Expert Systems With Applications, 2016, 〈10.1016/j.eswa.2016.07.041〉
Liste complète des métadonnées

http://hal.univ-brest.fr/hal-01350520
Contributeur : Ahcène Bounceur <>
Soumis le : samedi 30 juillet 2016 - 13:38:09
Dernière modification le : mardi 16 janvier 2018 - 15:54:24

Identifiants

Citation

Rima Houari, Ahcène Bounceur, Tahar Kechadi, Abdelkamel Tari, Reinhardt Euler. Dimensionality Reduction in Data Mining: Copula Approach. International Journal of Expert Systems With Applications, 2016, 〈10.1016/j.eswa.2016.07.041〉. 〈hal-01350520〉

Partager

Métriques

Consultations de la notice

190