๐ A Data Science Project by Vishnu Raj
Life expectancy is a key indicator of a nationโs overall health and development. This project aims to predict life expectancy using Multiple Linear Regression (MLR) based on socio-economic and health factors. Using the WHO Life Expectancy Dataset, we explore correlations, clean the data, visualize relationships, and build regression models for prediction.
โ Identify significant socio-economic and health predictors of life expectancy โ Build and evaluate a Multiple Linear Regression model โ Compare performance with Ridge and Lasso Regression โ Visualize patterns through EDA for clear interpretation
| Purpose | Libraries |
|---|---|
| Data Handling | pandas, numpy |
| Visualization | matplotlib, seaborn |
| Modeling | scikit-learn |
| Evaluation | r2_score, RMSE, MAE |
| Development | Jupyter Notebook |
๐ Dataset: Life Expectancy (WHO) โ Kaggle ๐ Records: 2,930 ๐งฉ Features: 22 (GDP, BMI, Schooling, Status, etc.) ๐ฏ Target Variable: Life Expectancy
โ Replaced missing values using median imputation
โ Encoded categorical variables (Status: Developed/Developing)
โ Handled outliers using IQR
โ Split data into 70% Train and 30% Test using train_test_split
Explored patterns between health and economic indicators:
- Correlation heatmaps ๐
- Pairplots for relationships ๐ฅ
- Schooling,Diphtheria Immunization as key drivers
| Model | Rยฒ | RMSE | MAE |
|---|---|---|---|
| Linear Regression | 0.8316 | 3.9104 | 2.8904 |
| Ridge Regression | 0.8316 | 3.9104 | 2.8902 |
| Lasso Regression | 0.8316 | 3.9104 | 2.8897 |
๐ก Higher education levels and income = higher life expectancy. ๐ก Schooling, Health Expenditure, and Diphtheria Immunization strongly correlate. ๐ก Regularization (Ridge/Lasso) provided stable, consistent results.
pandas
numpy
matplotlib
seaborn
scikit-learn
โ Achieved Rยฒ โ 0.83, showing strong predictive power โ Proved education, economy, and healthcare as vital for longer lives โ Demonstrated MLRโs simplicity and interpretability in social datasets
๐จโ๐ป Vishnu Raj ๐ Data Science Project ๐ผ GitHub | LinkedIn | ๐ง vishnuskillx@gmail.com
โญ If you found this project helpful, please give it a star! โญ


