Skip to content

sandeep1707-debug/imbalanced-data-ml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Classification on Imbalanced Data

This project addresses the problem of class imbalance in classification tasks using Python-based machine learning techniques. Real-world examples include fraud detection, rare disease prediction, and more.

πŸ“Œ Features

  • Data visualization and analysis
  • Handling imbalance using:
    • Oversampling (Random, SMOTE, ADASYN)
    • Undersampling
  • Model training using:
    • Random Forest
    • XGBoost
  • Evaluation with metrics suited for imbalance:
    • F1-score, ROC-AUC, Confusion Matrix

🧰 Tech Stack

  • Python 3.8+
  • pandas, numpy, matplotlib, seaborn
  • scikit-learn, imbalanced-learn
  • xgboost

πŸ“ Project Structure

  • notebooks/: Jupyter Notebook implementation
  • src/: Python scripts for data handling and modeling
  • outputs/: Generated charts and results
  • report/: Final PDF project report

πŸš€ Getting Started

git clone https://github.com/your-username/classification-imbalanced-data-ml.git
cd classification-imbalanced-data-ml
pip install -r requirements.txt

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published