Difference between revisions of "Fall 2025: Data Mining-1"
Revision as of 21:24, 1 September 2025
Lab 0: Getting Started (weeks of 4th, 11th & 18th August 2025)
Task No. | Task | Assessment Period | Submission Deadline |
---|---|---|---|
1 | https://www.cse.msu.edu/~ptan/dmbook/tutorials/tutorial1/tutorial1.html | -- | -- |
2 | https://www.cse.msu.edu/~ptan/dmbook/tutorials/tutorial2/tutorial2.html | -- | -- |
3 | https://www.cse.msu.edu/~ptan/dmbook/tutorials/tutorial3/tutorial3.html | -- | -- |
Lab 1: (weeks of 25th August & 1st September 2025)
Task No. | Task | Assessment Period | Submission Deadline |
---|---|---|---|
1 | Apply data cleaning techniques to any dataset (e.g., the Paper Reviews dataset in the UCI repository). Techniques may include handling missing values, outliers, and inconsistent values. A set of validation rules can be prepared based on the dataset, and the data can then be validated against those rules. | 25/08/2025 - 01/09/2025 | 02/09/2025 |
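
The cleaning steps named in Lab 1 can be sketched as follows. This is a minimal illustration and not a prescribed solution: the toy records, the field names (`id`, `score`, `decision`), the 1 to 5 score range, the IQR outlier rule, and median imputation are all assumptions chosen for the example, and it uses only the Python standard library.

```python
import statistics

# Hypothetical toy records; field names and values are illustrative only.
RAW = [
    {"id": 1, "score": 4.0, "decision": "accept"},
    {"id": 2, "score": None, "decision": "Accept "},  # missing value, inconsistent label
    {"id": 3, "score": 3.5, "decision": "REJECT"},    # inconsistent label
    {"id": 4, "score": 95.0, "decision": "reject"},   # outlier
    {"id": 5, "score": 4.5, "decision": "accept"},
    {"id": 6, "score": 3.0, "decision": "maybe"},     # breaks a validation rule
]

ALLOWED_DECISIONS = {"accept", "reject"}

# Assumed validation rules: rule name -> predicate over one record.
RULES = {
    "score_in_range": lambda r: r["score"] is not None and 1.0 <= r["score"] <= 5.0,
    "decision_is_known": lambda r: r["decision"] in ALLOWED_DECISIONS,
}


def clean(rows):
    """Return cleaned copies of the records; the input is not modified."""
    rows = [dict(r) for r in rows]

    # Inconsistent values: normalise whitespace and letter case of labels.
    for r in rows:
        r["decision"] = r["decision"].strip().lower()

    # Outliers: flag scores outside the 1.5 * IQR fences and treat them as missing.
    observed = [r["score"] for r in rows if r["score"] is not None]
    q1, _, q3 = statistics.quantiles(observed, n=4, method="inclusive")
    low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
    for r in rows:
        if r["score"] is not None and not (low <= r["score"] <= high):
            r["score"] = None

    # Missing values: impute with the median of the remaining scores.
    fill = statistics.median(r["score"] for r in rows if r["score"] is not None)
    for r in rows:
        if r["score"] is None:
            r["score"] = fill
    return rows


def validate(rows):
    """Return (record id, rule name) for every rule a record violates."""
    return [(r["id"], name) for r in rows
            for name, rule in RULES.items() if not rule(r)]


cleaned = clean(RAW)
violations = validate(cleaned)
print(cleaned)
print(violations)
```

Running the validation after cleaning shows which problems the cleaning steps did not resolve; here the unrecognised label on record 6 is reported rather than silently altered.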
Lab 2: (week of 8th September 2025)
Task No. | Task | Assessment Period | Submission Deadline |
---|---|---|---|
2 | Apply data pre-processing techniques such as standardization/normalization, transformation, aggregation, discretization/binarization, and sampling to any dataset. | 08/09/2025 - 15/09/2025 | 16/09/2025 |
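
The pre-processing techniques listed for Lab 2 can be sketched in a similar way. The helper names, the toy values, the `group` and `x` fields, and the choices of population standard deviation, equal-width binning, and a log transform are assumptions made for illustration; the helpers also assume the input values are not all identical, since that would cause a division by zero.

```python
import math
import random
import statistics
from collections import defaultdict


def standardize(values):
    """Z-score standardization using the population standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return [(v - mean) / sd for v in values]


def min_max(values):
    """Min-max normalization to the range [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def log_transform(values):
    """Transformation: log(1 + v), defined for v > -1."""
    return [math.log1p(v) for v in values]


def equal_width_bins(values, k):
    """Discretization into k equal-width bins, numbered 0 .. k-1."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    # The maximum value would land in bin k, so clamp it into the last bin.
    return [min(int((v - lo) / width), k - 1) for v in values]


def binarize(values, threshold):
    """Binarization: 1 if the value exceeds the threshold, else 0."""
    return [1 if v > threshold else 0 for v in values]


def aggregate_mean(rows, key, field):
    """Aggregation: mean of `field` for each distinct value of `key`."""
    groups = defaultdict(list)
    for r in rows:
        groups[r[key]].append(r[field])
    return {k: statistics.fmean(v) for k, v in groups.items()}


# Hypothetical toy data.
values = [2.0, 4.0, 6.0, 8.0]
rows = [
    {"group": "a", "x": 2.0},
    {"group": "a", "x": 4.0},
    {"group": "b", "x": 6.0},
    {"group": "b", "x": 8.0},
]

z_scores = standardize(values)
scaled = min_max(values)
logged = log_transform(values)
bins = equal_width_bins(values, 3)
flags = binarize(values, 5.0)
means = aggregate_mean(rows, "group", "x")

# Sampling: simple random sample without replacement, seeded for repeatability.
sample = random.Random(0).sample(values, 2)
```

Standardization and min-max normalization are kept as separate helpers because they answer different needs: the first centres the values and gives them unit variance, while the second only rescales them to a fixed range.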