Ensemble feature selection for multi‐label text classification: An intelligent order statistics approach

Miri, M, Dowlatshahi, M B, Hashemi, A, Rafsanjani, M K, Gupta, B B and Alhalabi, W (2022) Ensemble feature selection for multi‐label text classification: An intelligent order statistics approach. International Journal of Intelligent Systems. ISSN 0884-8173

Full text not available from this repository. (Request a copy)

Abstract

Because of the overgrowth of data, especially in text format, the value and importance of multi-label text classification have increased. Aside from this, preprocessing and particularly intelligent feature selection (FS) are the most important step in classification. Each FS finds the best features based on its approach, but we try to use a multi-strategy approach to find more useful features. Evaluating and comparing features’ importance and relevance makes using multiple strategy and methods more suitable than conventional approaches because each feature is measured based on several perspectives. Nevertheless, the ensemble FS merges the final performance results of various methods to take advantage of different methods’ strengths and better classify. In this article, we have proposed an ensemble FS method for multi-label text data (MLTD) for the first time using the order statistics (EMFS) approach. We have utilized four multi-label FS (MLFS) algorithms with various particular performances to achieve a good result. In this method, as one of the most important statistics methods, Order Statistics was used to aggregate the ranks of different algorithms, which is robust against noise, redundant and inessential features. In the end, the performance of EMFS, executing six MLTDs, was evaluated according to six performance criteria (ranking-based and classification-based). Surprisingly, the proposed method was more accurate than others among all used MLTDs. The proposed method has improved by 1.5% compared to other methods. This value is based on the results obtained based on six evaluation criteria and all tested data sets.

Affiliation: Skyline University College
SUC Author(s): Gupta, B B
All Author(s): Miri, M, Dowlatshahi, M B, Hashemi, A, Rafsanjani, M K, Gupta, B B and Alhalabi, W
Item Type: Article
Uncontrolled Keywords: Ensemble feature, multi-label text classification, intelligent order statistics
Subjects: B Information Technology > BM Artificial Intelligence
Divisions: Skyline University College > School of IT
Depositing User: Mr Veeramani Rasu
Date Deposited: 08 Sep 2022 05:34
Last Modified: 08 Sep 2022 05:34
URI: https://research.skylineuniversity.ac.ae/id/eprint/570
Publisher URL: https://doi.org/10.1002/int.23044
Publisher OA policy: https://v2.sherpa.ac.uk/id/publication/14768
Related URLs:

Actions (login required)

View Item
View Item
Statistics for SkyRep ePrint 570 Statistics for this ePrint Item