Master's student in Information Systems at Northeastern University, Boston, MA
Expected graduation: December 2025
Actively seeking full-time opportunities starting December 2025 in Data Engineering, Analytics, and AI-driven Systems.
I'm a data-driven engineer passionate about building scalable pipelines, intelligent analytics, and AI-powered solutions. My experience spans data engineering, cloud platforms, and product development, with a strong focus on supply chain, healthcare, and EdTech applications. I enjoy solving complex problems by combining pipeline design, machine learning, and visualization to deliver actionable insights.
Languages & Programming
Python (Pandas, NumPy, PySpark, Scikit-Learn), SQL (T-SQL, Oracle, PostgreSQL, MySQL), R, Java, Scala
Data Engineering & Pipelines
Ab Initio, Spark, Databricks, Airflow, dbt, Talend, Alteryx, Hive, REST APIs, Data Modeling (Star/Snowflake schemas, SCDs)
Databases & Cloud Warehouses
Snowflake, SQL Server, Oracle DB, PostgreSQL, BigQuery, MongoDB
Cloud Platforms
Azure (Synapse, Data Factory, Purview, Key Vault, ADLS), AWS (S3, EC2, Redshift), GCP (BigQuery, Looker Studio)
Analytics & Visualization
Power BI (DAX, drilldowns, dynamic filters), Tableau, Excel (PivotTables, Power Query, Macros), Snowsight
Tools & Frameworks
Git, Streamlit, FastAPI, Docker, Terraform, Jira
SkillPath AI | AI-Powered Personalized Learning Path Generator
- Designed ELT pipelines in Snowflake + dbt to process 100K+ course records
- Integrated RAG + NLP to map resumes to learning paths, cutting course search time by 50%
- Built a Streamlit UI and LLM chatbot, improving upskilling relevance by 35%
Reinforcement Learning for Multi-Warehouse Inventory Optimization
- Built an agent-based RL framework (PPO & DQN) for inventory replenishment
- Achieved 98.8% service levels while reducing costs by 48% vs. baseline
- Deployed simulation dashboards in Streamlit to visualize policies and warehouse dynamics
Urban Collision Analytics
- Processed 2.4M+ traffic collision records with Talend + Python
- Built a dimensional model in ER Studio and delivered geospatial dashboards in Tableau/Power BI to analyze patterns across major US cities
MediAid AI
- Developed a medical assistant using RAG with Pinecone + LlamaIndex
- Enabled real-time search across 27K+ WHO/CDC docs with 94% retrieval accuracy
- Integrated risk prediction models (heart disease, diabetes) with 87%+ accuracy
"Stock Market Prediction Using ML Techniques"
Published in IJSART, Volume 6, Issue 1, Jan 2020
I'm currently deepening my expertise in:
- Snowflake & dbt performance optimization
- Reinforcement Learning for intelligent systems
- Cloud-native data engineering (Azure + AWS)
Email: mohanan.a@northeastern.edu
LinkedIn
GitHub
