Dimensionality Reduction in Data Mining: Copula Approach

Rima Houari; Ahcène Bounceur; Tahar Kechadi; Abdelkamel Tari; Reinhardt Euler

doi:10.1016/j.eswa.2016.07.041

Journal article, Expert Systems with Applications, Year: 2016

Rima Houari

Role: Author

Laboratoire des Réseaux et Systèmes Distribués

Ahcène Bounceur

Role: Author
PersonId : 14866
IdHAL : ahcene-bounceur
ORCID : 0000-0002-0043-7742
IdRef : 117817007

Lab-STICC_UBO_CACS_MOCS

Université de Brest

Tahar Kechadi

Role: Author
PersonId : 835828

School of Computer Science and Informatics [Dublin]

Abdelkamel Tari

Role: Author

Université Abderrahmane Mira [Université de Béjaïa] = University Abderrahmane Mira [University of Béjaïa]

Reinhardt Euler

Role: Author
PersonId : 745514
IdHAL : reinhardt-euler
ORCID : 0000-0002-4294-286X
IdRef : 22361663X

Lab-STICC_UBS_CACS_MOCS

Abstract

• Sampling-based dimensionality reduction technique
• Eliminating linearly redundant combined dimensions
• Providing a convenient way to generate correlated multivariate random variables
• Maintaining the integrity of the original information
• Reducing the dimension of data space without losing important information

The recent trend of collecting huge and diverse datasets has created a great challenge for data analysis. One characteristic of these very large datasets is that they often contain significant redundancy. Using very large multi-dimensional data introduces more noise, redundant data, and possibly unconnected data entities. To manipulate data represented in a high-dimensional space efficiently, and to address the impact of redundant dimensions on the final results, we propose a new dimensionality reduction technique that uses copulas and the LU-decomposition (forward substitution) method. The proposed method compares favorably with existing approaches on real-world datasets taken from a machine learning repository (Diabetes, Waveform, two versions of Human Activity Recognition based on Smartphone, and Thyroid), in terms of both dimensionality reduction and efficiency, as evaluated with statistical and classification measures.
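As a rough illustration of what "generating correlated multivariate random variables" with a copula can look like, the sketch below draws samples from a Gaussian copula. It is not the authors' algorithm: the choice of a Gaussian copula, the use of a Cholesky factor (a triangular factorization related to LU), and the names `gaussian_copula_sample`, `corr`, and `seed` are all assumptions made for this example.

```python
import math
import numpy as np


def gaussian_copula_sample(corr, n_samples, seed=0):
    """Draw points with Uniform(0, 1) marginals and Gaussian-copula dependence.

    Illustrative only: `corr` is an assumed, positive-definite correlation matrix.
    """
    rng = np.random.default_rng(seed)
    corr = np.asarray(corr, dtype=float)
    # Lower-triangular factor such that lower @ lower.T == corr.
    lower = np.linalg.cholesky(corr)
    # Independent standard normals, then impose the correlation structure.
    z = rng.standard_normal((n_samples, corr.shape[0]))
    correlated = z @ lower.T
    # The standard normal CDF maps each column to a uniform marginal.
    normal_cdf = np.vectorize(lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0))))
    return normal_cdf(correlated)


# Hypothetical two-dimensional example with strong positive dependence.
corr = np.array([[1.0, 0.8], [0.8, 1.0]])
u = gaussian_copula_sample(corr, 5000)
```

Each column of `u` can then be pushed through the inverse CDF of any target marginal distribution, which is what makes copulas convenient for building correlated variables with arbitrary marginals.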

Keywords

Data mining; Data pre-processing; Multi-dimensional sampling; Copulas; Dimensionality reduction

Domains

Computer Science [cs]

Contributor: Ahcène Bounceur

https://hal.univ-brest.fr/hal-01350520

Submitted on: Saturday, 30 July 2016, 13:38:09

Last modified on: Tuesday, 27 August 2024, 12:47:06

Dates and versions

hal-01350520 , version 1 (30-07-2016)

Identifiers

HAL Id : hal-01350520 , version 1
DOI : 10.1016/j.eswa.2016.07.041

Cite

Rima Houari, Ahcène Bounceur, Tahar Kechadi, Abdelkamel Tari, Reinhardt Euler. Dimensionality Reduction in Data Mining: Copula Approach. Expert Systems with Applications, 2016, ⟨10.1016/j.eswa.2016.07.041⟩. ⟨hal-01350520⟩

Collections

UNIV-BREST CNRS LAB-STICC_UBO UBS LAB-STICC_UBS ENIB LAB-STICC_ENIB LAB-STICC IBNM

216 views

0 downloads
