تجاوز إلى المحتوى الرئيسي
User Image

نوره ابراهيم عمر الفنتوخ Norah Ibrahim Alfantoukh

Demonstrator

Teaching Assistant

علوم الحاسب والمعلومات
Building 6, 2nd floor, office# 13
المنشورات
مقال فى مجلة
2020

Automatic Determination of the Number of Clusters for Semi-Supervised Relational Fuzzy Clustering

Fantoukh, Norah Ibrahim . 2020

Semi-supervised clustering relies on both labeled and unlabeled data to steer the clustering process towards optimal categorization and escape from local minima. In this paper, we propose a novel fuzzy relational semi-supervised clustering algorithm based on an adaptive local distance measure (SSRF-CA). The proposed clustering algorithm utilizes side-information and formulates it as a set of constraints to supervise the learning task. These constraints are expressed using reward and penalty terms, which are integrated into a novel objective function. In particular, we formulate the clustering task as an optimization problem through the minimization of the proposed objective function. Solving this optimization problem provides the optimal values of different objective function parameters and yields the proposed semisupervised clustering algorithm. Along with its ability to perform data clustering and learn the underlying dissimilarity measure between the data instances, our algorithm determines the optimal number of clusters in an unsupervised manner. Moreover, the proposed SSRF-CA is designed to handle relational data. This makes it appropriate for applications where only pairwise similarity (or dissimilarity) information between data instances is available. In this paper, we proved the ability of the proposed algorithm to learn the appropriate local distance measures and the optimal number of clusters while partitioning the data using various synthetic and real-world benchmark datasets that contain varying numbers of clusters with diverse shapes. The experimental results revealed that the proposed SSRF-CA accomplished the best performance among other state-of-the-art algorithms and confirmed the outperformance of our clustering approach.

رقم المجلد
20
رقم الانشاء
2
مجلة/صحيفة
International Journal of Fuzzy Logic and Intelligent Systems
الصفحات
156 to 167
مزيد من المنشورات
publications

Semi-supervised clustering relies on both labeled and unlabeled data to steer the clustering process towards optimal categorization and escape from local minima. In this paper, we propose a novel…

بواسطة Norah Ibrahim Fantoukh, Mohamed Maher Ben Ismail, Ouiem Bchir
2020