Human Resources Data Dynamos Project

Overview

The Human Resources Data Dynamos project delivers an end-to-end analytics pipeline for HR data, transforming raw employee information into data-driven insights useful for decision-makers. This repository includes everything from SQL scripts and Python notebooks to BI dashboards and strategic reports.

Key Features

Modular Phases: Organized into 13 clear, sequential phases covering data collection through reporting.
Reproducibility: Environment defined in requirements.txt, with setup scripts and instructions.
Multi-Tool Stack: Utilizes SQL, Python (pandas, scikit-learn), Jupyter notebooks, Tableau, Power BI, and standard office docs.
Deliverables: Cleaned datasets, visualizations, predictive models, and formal presentations.

Repository Structure

├── assets                     # Branding: team logo, images
├── DataSet                    # Raw HR data (Excel, CSV, archives)
├── Instructions               # Guides, proposals, PDFs, member info
├── Project-Operations         # Core analysis pipeline
│   ├── 01.Data-Collection     # Raw CSV exports of source tables
│   ├── 02.Data-Wrangling      # Transformation specs and scripts
│   ├── 03.Data-Cleaning       # Cleaned data snapshots + SQL scripts
│   ├── 04.Data-Exploration&Transformation  # EDA outputs & business questions
│   ├── 05.Data-Modeling       # ER diagrams and logical models
│   ├── 06.Data-Analysis       # Jupyter notebooks and detailed PDF report
│   ├── 07.Data-Forecasting    # Forecasting notebooks and code
│   ├── 08.Data-Visualization  # Tableau (.twbx) & Power BI dashboards
│   ├── 09.Data-Mining         # Clustering, association, and mining outputs
│   ├── 10.Data-Driven-Decision-Making # Strategic frameworks (SWOT, PESTEL, etc.)
│   ├── 11.Reporting           # Annual and management report drafts & finals
│   ├── 12.Application         # Proposal templates and Excel macros
│   └── 13.Presentation        # Stakeholder slide decks
├── LICENSE
├── README.md                  # This document
└── requirements.txt           # Python package dependencies

Getting Started

Follow these steps to replicate our environment and explore the analysis:

Clone the repo

git clone https://github.com/0PeterAdel/Data-Dyanamos.git
cd Data-Dyanamos

Create a virtual environment

Windows:
```
python -m venv env
env\Scripts\activate
```

Linux/macOS:

python3 -m venv env
source env/bin/activate

Install dependencies
```
pip install -r requirements.txt
```
Set up database (optional)
- Load Project-Operations/01.Data-Collection/*.csv into your SQL engine.
- Execute SQL scripts in 03.Data-Cleaning to create cleaned tables.
Run Notebooks
- Launch Jupyter:
```
jupyter lab
```
- Navigate to Project-Operations/06.Data-Analysis and open analysis-part1.ipynb, analysis-part2.ipynb.
View Dashboards
- Tableau: Open Project-Operations/08.Data-Visualization/Tableau/Data Dynamos Project (Data Forecasting).twbx.
- Power BI: Open Project-Operations/08.Data-Visualization/Power-Bi/Data Dynamos Data Analysis.pbix.

Phase Details

Below are brief descriptions and key artifacts for each project phase:

01. Data Collection

Objective: Gather raw HR tables.
Files: Employee.csv, PerformanceRating.csv, etc.
Outcome: Baseline CSV exports for import.

02. Data Wrangling

Objective: Define transformations (e.g., normalizing codes).
Artifacts: PDF with mapping rules, Python scripts.

03. Data Cleaning

Objective: Remove duplicates, handle missing values, enforce types.
Scripts: Employee.sql, PerformanceRating.sql.
Snapshots: Cleaned CSVs and Excel files in Data-Cleaned/.

04. Exploration & Transformation

Objective: Perform EDA to surface patterns.
Deliverables: Business-Questions.pdf, KPI definitions.

05. Data Modeling

Objective: Design ER diagrams and logical data model.
Files: Data-Modeling.png variants, HR-Data.xlsx model sheet.

06. Data Analysis

Objective: Answer core HR questions via notebooks.
Notebooks:
- analysis-part1.ipynb: Demographics & satisfaction analysis.
- analysis-part2.ipynb: Turnover and performance insights.
Report: Analysis-Report.pdf summarizing major findings.

07. Data Forecasting

Objective: Build predictive models (e.g., satisfaction, turnover).
Code: main.ipynb, main.py using scikit-learn.
Report: Forecast-Report.pdf.

08. Data Visualization

Objective: Create interactive dashboards.
Tableau: .twbx and PDF export.
Power BI: .pbix, PDF, and PPTX templates.

09. Data Mining

Objective: Uncover latent clusters and associations.
Outputs: Excel dashboards, PDF & PPTX slides.

10. Data-Driven Decision Making

Objective: Apply strategy frameworks.
Frameworks: PESTEL, SWOT, SOAR, TOWS, VRIO.
Artifacts: Each has paired PDF and Excel workbook.

11. Reporting

Objective: Consolidate insights into reports.
Reports: Annual HR report, Management reports across functions.

12. Application

Objective: Provide templates for client proposals.
Files: Proposal docs (.docx/.pdf), Excel-based macros.

13. Presentation

Objective: Stakeholder slide decks summarizing project.
Formats: PPTX templates ready for customization.

Project Workflow

Collect & Clean: Import raw CSVs → run SQL cleaning → export cleaned tables.
Explore: Run EDA notebooks → define business questions.
Model & Forecast: Build data models → train predictive models.
Visualize: Develop dashboards in Tableau/Power BI.
Report & Present: Compile insights into formal reports and slide decks.

Contributing

Fork and create a branch: git checkout -b feature/XYZ
Commit: git commit -m "Add feature XYZ"
Push: git push origin feature/XYZ
Submit a Pull Request for review.

Please follow our coding standards and document any major changes in CHANGELOG.md (create one if needed).

License

This project is licensed under the Apache License 2.0. See LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Human Resources Data Dynamos Project

Table of Contents

Overview

Key Features

Repository Structure

Getting Started

Phase Details

01. Data Collection

02. Data Wrangling

03. Data Cleaning

04. Exploration & Transformation

05. Data Modeling

06. Data Analysis

07. Data Forecasting

08. Data Visualization

09. Data Mining

10. Data-Driven Decision Making

11. Reporting

12. Application

13. Presentation

Project Workflow

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 165 Commits
DataSet		DataSet
Instructions		Instructions
Project-Operations		Project-Operations
assets		assets
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

0PeterAdel/Data-Dyanamos

Folders and files

Latest commit

History

Repository files navigation

Human Resources Data Dynamos Project

Table of Contents

Overview

Key Features

Repository Structure

Getting Started

Phase Details

01. Data Collection

02. Data Wrangling

03. Data Cleaning

04. Exploration & Transformation

05. Data Modeling

06. Data Analysis

07. Data Forecasting

08. Data Visualization

09. Data Mining

10. Data-Driven Decision Making

11. Reporting

12. Application

13. Presentation

Project Workflow

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages