Skip to content

Commit bb698c7

Browse files
committed
Updated Readme
1 parent cb21352 commit bb698c7

File tree

5 files changed

+15
-0
lines changed

5 files changed

+15
-0
lines changed

‎AUC_diff_choc.png‎

66.7 KB
Loading

‎AUC_diff_mush.png‎

68.2 KB
Loading

‎README.md‎

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,15 @@
44
2. Predict whether a person is a consumer of chocolate and magic mushroom.
55
3. Choose the best and worst classifiers for each dataset.
66
4. Explain AI models in a scientific manner which should be convincable to non-technical people.
7+
5. Implement models with Semi-Supervised Learning.
8+
## Preview
9+
Comparing a Pipeline of 6 classifiers on 2 datasets
10+
![alt text](AUC_diff_choc.png)
11+
![alt text](AUC_diff_mush.png)
12+
Explainable AI
13+
![alt text](XAI.png)
14+
Semi-Supervised Learning
15+
![alt text](semi-supervised-learning-comparison.png)
716
# Dataset: Drug Consumption Analysis Dataset
817
The dataset can be found at this link: https://archive.ics.uci.edu/dataset/373/drug+consumption+quantified.
918
## Description of the Dataset
@@ -16,6 +25,7 @@ The dataset can be found at this link: https://archive.ics.uci.edu/dataset/373/d
1625
- Provide results from Evaluation with some useful plots and metrics.
1726
- Summarize the analysis in a report.
1827
- Explain whether certain classifiers make trustable predictions, with the calculation of SHAP values and some visualization plots.
28+
- Prepared labelled and unlabelled data, Implemented and Compared different semi-supervised learning algorithms based on the gradient boosting classifier from assignment 1.
1929
# Project Structure
2030
- You should expect a report in `.pdf` format at the root level.
2131
- Please expand the folder at the root level to view codes.
@@ -24,6 +34,11 @@ The dataset can be found at this link: https://archive.ics.uci.edu/dataset/373/d
2434
2. Evaluation - please check the file `CSI5155 Assignment 1 Evaluation Part - Kelvin Mock 300453668.ipynb`
2535
3. Calculation of SHAP Values - please check the file `CSI5155 Assignment 2 - Kelvin Mock 300453668.ipynb`
2636
4. Visualizing the SHAP Values - please check the file `CSI5155 Assignment 2 Plots - Kelvin Mock 300453668.ipynb`
37+
5. Baseline Model (Gradient Boosting classifier) - `CSI5155 Project - baseline.ipynb`
38+
6. Self Training method applied on baseline model - `CSI5155 Project - Self Training.ipynb`
39+
7. Co-Training method applied on baseline model - `CSI5155 Project - Co Training.ipynb`
40+
8. Semi-Boost method applied on baseline model - `CSI5155 Project - Semi Boost.ipynb`
41+
9. Label Spreading method applied on baseline model - `CSI5155 Project - Label Spreading.ipynb`
2742
- Models are data dumped into several `.pkl` files from time-to-time in different phases to maintain the code's maintainability.
2843
- The training sets and test sets are also data dumped into several `.pkl` files.
2944
- `choc` directory shows data dumped files related to the Chocolate dataset (which is split from the original dataset).

‎XAI.png‎

161 KB
Loading
87.6 KB
Loading

0 commit comments

Comments
 (0)