The School of Computer Science Technical Workshop Series Presents: Data Clustering Workshop by: Ali Abbasi-Tadi

Wednesday, November 9, 2022 - 18:00 to 19:00

The School of Computer Science at the Univerity of Windsor is pleased to present...

Technical Workshops Presentation by: Ali Abbasi-Tadi

 

Data Clustering Workshop

Date: Wednesday, November 9th, 2022 

Time: 6:00 pm – 7:00 pm 

Location: 4th Floor (Workshop space) at 300 Ouellette Avenue (School of Computer Science Advanced Computing Hub)

 

Abstract: 

Clustering is a way of grouping data points into different clusters consisting of similar data points. The objects with possible similarities remain in a group that has less or no similarities with another group.  In this workshop, we explore various clustering approaches in centralized and decentralized environments and provide various metrics for finding the accuracy and performance of clustering. We also provide basic concepts on some well-known clustering approaches and their differences, followed by the performance discussion of each of the clustering methods. In an advanced clustering session, we explore various dimensionality reduction techniques and provide basic concepts and applications. All the clustering techniques will come with implementations in python using the scikit-learn library. This workshop will be in three parts: I- clustering in centralized environments, II- advanced topics in clustering, and III- parallel clustering in decentralized environments.

After this workshop, students will be able to: I- identify the difference between different machine learning tasks, II- work with some well-known clustering approaches and understand the concepts behind them, and III- identify the difference between centralized and parallel clustering.

 

Prerequisites:

Matrix algebra, statistics, basic python programming, basic C++ programming

 

Workshop Outline:

Introduction to clustering

K-means++ and concepts

Hierarchical clustering and concepts

Clustering metrics (Silhouette coefficient and Adjusted Rand Index)

Basic Implementation of K-means++ and hierarchical clustering in python

 

Biography: 

Ali is pursuing his Ph.D. in computer science at the University of Windsor. His main research interest is security/privacy in machine learning. He has some publications on private clustering and has participated in the iDash competition in 2022. He has got various scholarships from the University of Windsor, including RAs and GAs.