Classification on Imbalanced Data

This project addresses the problem of class imbalance in classification tasks using Python-based machine learning techniques. Real-world examples include fraud detection, rare disease prediction, and more.

📌 Features

Data visualization and analysis
Handling imbalance using:
- Oversampling (Random, SMOTE, ADASYN)
- Undersampling
Model training using:
- Random Forest
- XGBoost
Evaluation with metrics suited for imbalance:
- F1-score, ROC-AUC, Confusion Matrix

🧰 Tech Stack

Python 3.8+
pandas, numpy, matplotlib, seaborn
scikit-learn, imbalanced-learn
xgboost

📁 Project Structure

notebooks/: Jupyter Notebook implementation
src/: Python scripts for data handling and modeling
outputs/: Generated charts and results
report/: Final PDF project report

🚀 Getting Started

git clone https://github.com/your-username/classification-imbalanced-data-ml.git
cd classification-imbalanced-data-ml
pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
notebooks		notebooks
report		report
src		src
README.md		README.md
Requirements.txt		Requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Classification on Imbalanced Data

📌 Features

🧰 Tech Stack

📁 Project Structure

🚀 Getting Started

About

Uh oh!

Releases

Packages

Languages

sandeep1707-debug/imbalanced-data-ml

Folders and files

Latest commit

History

Repository files navigation

Classification on Imbalanced Data

📌 Features

🧰 Tech Stack

📁 Project Structure

🚀 Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages