Fall 2025: Data Mining-1

From MKWiki
Revision as of 23:16, 21 September 2025 by Mkwiki (talk | contribs) (→‎Lab 2: ( week of 08th & 15th September 2025 ))

Lab 0: Getting Started (weeks of 4th, 11th & 18th August 2025)

Task No. | Task | Assessment Period | Submission Deadline
1 | https://www.cse.msu.edu/~ptan/dmbook/tutorials/tutorial1/tutorial1.html | -- | --
2 | https://www.cse.msu.edu/~ptan/dmbook/tutorials/tutorial2/tutorial2.html | -- | --
3 | https://www.cse.msu.edu/~ptan/dmbook/tutorials/tutorial3/tutorial3.html | -- | --

Lab 1: (weeks of 25th August & 1st September 2025)

Task No. | Task | Assessment Period | Submission Deadline
1 | Apply data cleaning techniques on any dataset (e.g., the Paper Reviews dataset in the UCI repository). Techniques may include handling missing values, outliers and inconsistent values. A set of validation rules can be prepared based on the dataset and validations can be performed. | 25/08/2025 - 01/09/2025 | 02/09/2025
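
As an illustration of the cleaning task above, here is a minimal sketch using only the Python standard library. The toy records, the field names (`score`, `language`), the 1 to 5 score range, and the alias table are all hypothetical and are not taken from any specific UCI dataset. A simple range rule stands in for outlier detection; a real submission might use a statistical test and a library such as pandas instead.

```python
from statistics import median

# Hypothetical toy records; None marks a missing value.
records = [
    {"id": 1, "score": 4.0, "language": "en"},
    {"id": 2, "score": None, "language": "EN "},
    {"id": 3, "score": 3.0, "language": "es"},
    {"id": 4, "score": 95.0, "language": "es"},      # out-of-range value
    {"id": 5, "score": 5.0, "language": "English"},  # inconsistent label
]

# Assumed validation rules: each field maps to a predicate it must satisfy.
RULES = {
    "score": lambda v: v is not None and 1.0 <= v <= 5.0,
    "language": lambda v: v in {"en", "es"},
}


def validate(rows):
    """Return (id, field) pairs for every rule violation."""
    return [(r["id"], f) for r in rows for f, ok in RULES.items() if not ok(r[f])]


def clean(rows):
    """Normalise labels, treat out-of-range scores as missing, impute the median."""
    aliases = {"english": "en", "spanish": "es"}  # hypothetical alias table
    cleaned = []
    for r in rows:
        r = dict(r)  # work on a copy so the input stays untouched
        lang = r["language"].strip().lower()
        r["language"] = aliases.get(lang, lang)
        if r["score"] is not None and not (1.0 <= r["score"] <= 5.0):
            r["score"] = None
        cleaned.append(r)
    # Median imputation over the remaining valid scores.
    fill = median(r["score"] for r in cleaned if r["score"] is not None)
    for r in cleaned:
        if r["score"] is None:
            r["score"] = fill
    return cleaned
```

Running `validate(records)` before and `validate(clean(records))` after cleaning gives a simple before/after report of which rules were violated and whether the cleaning resolved them.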

Lab 2: (weeks of 8th & 15th September 2025)

Task No. | Task | Assessment Period | Submission Deadline
2 | Apply data pre-processing techniques such as standardization/normalization, transformation, aggregation, discretization/binarization, sampling, etc., on any dataset. | 08/09/2025 - 15/09/2025 | 22/09/2025
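
Several of the pre-processing techniques named above can be sketched on a single numeric attribute as follows. This is a minimal standard-library illustration, not a prescribed solution: the sample `values`, the function names, and the choices of population standard deviation, `log1p` as the transformation, and equal-width binning are assumptions made for the example. Aggregation is omitted for brevity.

```python
import math
import random
from statistics import mean, pstdev

values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # hypothetical attribute


def standardize(xs):
    """Z-score: subtract the mean, divide by the population std. deviation."""
    mu, sigma = mean(xs), pstdev(xs)
    return [(x - mu) / sigma for x in xs]  # assumes xs is not constant


def min_max(xs):
    """Rescale linearly to the interval [0, 1]."""
    lo, hi = min(xs), max(xs)
    return [(x - lo) / (hi - lo) for x in xs]  # assumes hi > lo


def log_transform(xs):
    """log(1 + x) transformation, which compresses large values."""
    return [math.log1p(x) for x in xs]


def equal_width_bins(xs, k):
    """Discretize into k equal-width bins, returning bin indices 0..k-1."""
    lo, hi = min(xs), max(xs)
    width = (hi - lo) / k
    # The maximum value would land in bin k, so clamp it into the last bin.
    return [min(int((x - lo) / width), k - 1) for x in xs]


def binarize(xs, threshold):
    """Map each value to 1 if it exceeds the threshold, else 0."""
    return [1 if x > threshold else 0 for x in xs]


def sample(xs, n, seed=0):
    """Simple random sampling without replacement, reproducible via seed."""
    return random.Random(seed).sample(xs, n)
```

For the sample attribute, the mean is 5 and the population standard deviation is 2, so `standardize(values)` maps 2.0 to -1.5 and 9.0 to 2.0.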

Lab 3: (week of 22nd September 2025)

Task No. | Task | Assessment Period | Submission Deadline
5 | Apply simple K-means algorithm for clustering any dataset. Compare the performance of clusters by varying the algorithm parameters. For a given set of parameters, plot a line graph depicting MSE obtained after each iteration. | 22/09/2025 - 06/10/2025 | 06/10/2025
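
The K-means task above can be approached along the following lines. This is a minimal sketch of Lloyd's algorithm in plain Python that records the mean squared error (MSE) after each iteration. The function names, the random initialisation from the data points, the stopping tolerance, and the toy `points` are assumptions made for illustration. The plotting helper assumes matplotlib is installed and imports it only when called.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iter=20, seed=0):
    """Lloyd's algorithm; returns (centroids, labels, mse_history)."""
    rng = random.Random(seed)
    # Initialise centroids as k distinct data points chosen at random.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels, mse_history = [], []
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda j: sq_dist(p, centroids[j]))
                  for p in points]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(c) / len(members) for c in zip(*members)]
        # MSE of the current assignment against the updated centroids.
        mse = sum(sq_dist(p, centroids[lab])
                  for p, lab in zip(points, labels)) / len(points)
        mse_history.append(mse)
        # Stop once the MSE no longer improves.
        if len(mse_history) > 1 and mse_history[-2] - mse <= 1e-12:
            break
    return centroids, labels, mse_history


def plot_mse(mse_history):
    """Line graph of MSE per iteration (requires matplotlib)."""
    import matplotlib.pyplot as plt
    plt.plot(range(1, len(mse_history) + 1), mse_history, marker="o")
    plt.xlabel("Iteration")
    plt.ylabel("MSE")
    plt.title("K-means: MSE after each iteration")
    plt.show()


# Hypothetical toy data: two groups of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]
```

To compare parameter settings, one option is to call `kmeans(points, k, seed=s)` for several values of `k` and `s` and compare the final entries of `mse_history`. Because K-means can converge to different local optima depending on the initial centroids, varying the seed is itself a useful comparison.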