👋 Hi, I'm Soumitra Ghosh

🚀 Open to Software Engineering Internship Opportunities
📬 Connect with me on LinkedIn

🧠 Core Expertise

  • LLM Systems (RAG, Vector Databases, Prompt Engineering)
  • Backend Engineering (FastAPI, REST APIs, Microservices)
  • Machine Learning (NLP, Computer Vision)
  • System Design & Scalable Architectures
  • Cloud & DevOps (Docker, AWS, GCP)

👨‍💻 About Me

Building scalable backend systems and production-ready AI applications that deliver real-world impact.

I'm a Computer Science undergraduate (2026) focused on backend engineering, system design, and applied AI/LLM systems. My work combines software engineering with machine learning to build reliable, scalable, and production-ready applications.

During internships at NIT Rourkela and Infosys Springboard, I developed and deployed real-world AI systems including:

  • Face recognition pipeline with 96% accuracy using PyTorch and StyleGAN-based augmentation
  • Real-time sign language recognition system achieving 95% accuracy
  • Optimized inference and evaluation pipelines for scalable deployment

My work also includes research in edge AI and secure computer vision systems:

  • 📄 Published IEEE research on deepfake-aware edge AI authentication
  • Built a unified Raspberry Pi-based pipeline integrating YOLOv5 for face detection, FaceNet for recognition, and EfficientNet-B4 for deepfake detection
  • Evaluated on benchmarks including WIDER Face, LFW, CelebA, and FaceForensics++
  • Achieved real-time inference at ~15 FPS with 0.96 AUC for deepfake detection under challenging real-world conditions
  • Optimized for secure, low-latency identity verification in edge AI, IoT, and surveillance environments

I also build LLM-powered systems such as:

  • โš–๏ธ RAG-based legal document analysis platform processing 1K+ documents/day
  • Reduced manual review time by 60% using retrieval and transformer-based pipelines

Tech interests: Backend Systems • Distributed Systems • LLMs • RAG • System Design • Computer Vision • Cloud & DevOps

Currently seeking Software Engineering Internship opportunities (Backend & AI Focus) to build scalable systems and intelligent AI-powered products.


🌐 Social Media Handles:

LinkedIn • Email

💻 Tech Stack:

🤖 Machine Learning & AI

⚙️ Backend & Frameworks

🧑‍💻 Languages

🎨 Frontend & UI

🗄️ Databases

☁️ Cloud & Hosting

🚀 Deployment & DevOps

🛠️ Tools & Platforms

🎯 Featured Projects

🚀 DocuLix: AI Legal Document Analysis Platform

📊 Impact:

  • Processes 1K+ documents/day
  • Achieves 92% clause extraction accuracy
  • Reduces legal review time by 60%

🛠️ Tech Stack

Next.js • React • Python • Flask • Node.js • PyTorch • Docker

✨ Key Features

  • 📄 Multi-format uploads (PDF, Word, Images)
  • 🤖 AI-powered legal Q&A using NLP models
  • 🧠 Clause paraphrasing with T5-based models
  • ⚖️ Risk & sentiment analysis of contract clauses
  • 🔐 Secure OTP-based authentication
  • ⏳ Configurable session timer with automatic logout (10-minute default) for secure access control
  • 📜 Session history tracking to review previous queries and responses
  • 📥 One-click download of complete session reports as PDF
  • 🛡️ Privacy by design: no user data or uploaded documents are stored
  • ⚡ Modern dashboard with Next.js & React
  • 🐳 Dockerized deployment using Docker Compose
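
The session timer with automatic logout listed above can be sketched roughly as follows. This is a minimal illustration, not DocuLix's actual code: the class name `SessionTimer`, its methods, and the choice of a sliding timeout (one that restarts on activity) are assumptions, since the feature list only states that the timer is configurable with a 10-minute default.

```python
import time
from typing import Callable

# 10-minute default, as stated in the feature list.
DEFAULT_TIMEOUT_SECONDS = 10 * 60


class SessionTimer:
    """Hypothetical helper that reports when a session should be logged out."""

    def __init__(
        self,
        timeout_seconds: float = DEFAULT_TIMEOUT_SECONDS,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        if timeout_seconds <= 0:
            raise ValueError("timeout_seconds must be positive")
        self._timeout = timeout_seconds
        self._clock = clock  # injectable so the timer can be tested without sleeping
        self._last_activity = clock()

    def touch(self) -> None:
        """Record user activity and restart the countdown (assumed sliding timeout)."""
        self._last_activity = self._clock()

    def remaining(self) -> float:
        """Seconds left before automatic logout, never negative."""
        elapsed = self._clock() - self._last_activity
        return max(0.0, self._timeout - elapsed)

    def expired(self) -> bool:
        """True once the timeout has fully elapsed."""
        return self.remaining() == 0.0
```

A request handler could call `expired()` before serving each request and `touch()` afterwards. Using a monotonic clock avoids surprises when the system wall clock is adjusted.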

🔗 Repository: https://github.com/Soumitra1312/DocuLix

🚀 Face Recognition Based Attendance System

📊 Impact:

  • Achieves 92% recognition accuracy
  • Automates attendance, reducing manual effort and errors

🛠️ Tech Stack

Python • Flask • OpenCV • MongoDB • NumPy

✨ Key Features

  • 📸 Automated image capture: 10 images per user via webcam
  • 🧠 Face recognition system using OpenCV, dlib, and face_recognition
  • 🗂️ MongoDB-backed storage for users, attendance, and schedules
  • 👨‍🏫 Faculty dashboard with time slots, sections, and student lists
  • 📊 Attendance analytics including history and per-class statistics
  • ⚡ End-to-end automation eliminating manual attendance processes
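
As a rough illustration of the recognition step, the sketch below matches a probe face encoding against several stored encodings per user, mirroring the 10 captured images per user. It assumes encodings have already been computed; the function names and toy vectors are hypothetical and not taken from the repository. The 0.6 tolerance matches the default of `face_recognition.compare_faces`, but whether the project uses that value is an assumption.

```python
import math
from typing import Dict, List, Optional, Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two encodings of equal length."""
    if len(a) != len(b):
        raise ValueError("encodings must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def identify(
    probe: Sequence[float],
    known: Dict[str, List[Sequence[float]]],
    tolerance: float = 0.6,  # assumed threshold; see the note above
) -> Optional[str]:
    """Return the user whose closest stored encoding is within tolerance, else None."""
    best_name: Optional[str] = None
    best_dist = float("inf")
    for name, encodings in known.items():
        for enc in encodings:
            dist = euclidean(probe, enc)
            if dist < best_dist:
                best_name, best_dist = name, dist
    return best_name if best_dist <= tolerance else None
```

Keeping several encodings per user and taking the minimum distance makes matching more tolerant of pose and lighting changes than a single reference image, at the cost of a linear scan over all stored encodings.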

🔗 Repository: https://github.com/Soumitra1312/Face-Recognition

🎮 Shadow Fire: Third-Person Action Shooter

📊 Impact:

  • Delivers smooth real-time gameplay with optimized rendering and AI behavior

🛠️ Tech Stack

Unreal Engine • Blueprints • Game AI

✨ Key Features

  • ⚔️ Enemy AI with behavior trees for chasing, flanking, and attacking players
  • 🔫 Realistic shooting mechanics with aiming, firing, and hit feedback
  • 💥 Gun recoil & audio effects for immersive combat experience
  • 🏃 Fluid player movement including walking, sprinting, jumping, and dodging
  • 🧠 Responsive control system tuned for fast-paced third-person gameplay
  • 🎮 Action-focused combat loop rewarding movement and situational awareness

🔗 Repository: https://github.com/Soumitra1312/Shadow-Fire

📚 Research & Publications

| Publication | Venue | Research Area |
| --- | --- | --- |
| Deepfake-Aware Face Authentication for Edge Devices Using a Unified Raspberry Pi Pipeline | IEEE AIEI 2026 | Edge AI, Computer Vision, Deepfake Detection |

🧠 Deepfake-Aware Face Authentication for Edge Devices Using a Unified Raspberry Pi Pipeline

  • Published in IEEE AIEI 2026
  • Built a complete edge AI pipeline integrating YOLOv5, FaceNet, and EfficientNet-B4
  • Achieved real-time face detection at ~15 FPS on Raspberry Pi
  • Reached 0.96 AUC for deepfake detection using FaceForensics++ and real-world camera inputs
  • Optimized for secure and low-latency identity verification on resource-constrained edge devices
  • Evaluated under challenging real-world conditions including masks, sunglasses, occlusion, and varying lighting
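
For readers unfamiliar with the metric, the AUC reported above is the area under the ROC curve. It equals the probability that a randomly chosen fake sample receives a higher score than a randomly chosen real one, with ties counted as half. The sketch below computes it directly from that definition; the function name `roc_auc` and the toy data are illustrative and have no connection to the paper's evaluation code.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC via pairwise comparison of positive (1) and negative (0) scores."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Toy example: 1 = deepfake, 0 = real; scores are hypothetical detector outputs.
print(roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))  # 0.75
```

The pairwise loop is quadratic in the number of samples, which is fine for illustration. Evaluation on a full benchmark would normally use a sort-based implementation from an established library.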

📄 Read the full paper here: Link


✍️ Dev Quote

“Programs must be written for people to read,
and only incidentally for machines to execute.”

- Harold Abelson

Pinned

  1. DocuLix (Public)

    DocuLix is an AI-powered platform for uploading, analyzing, and simplifying legal documents. It features a modern web dashboard (Next.js/React), a Python Flask backend for document processing, and …

    TypeScript

  2. Face-Recognition (Public)

    Jupyter Notebook

  3. Shadow-Fire (Public)

    A shooting game built on top of Unreal Engine.

  4. PromptForge (Public)

    PromptForge is a distributed async prompt queue that processes LLM requests in parallel with MongoDB-backed durable execution. It enforces 300 req/min rate limits via a token bucket, uses semantic …

    Python
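
The PromptForge description mentions a token bucket enforcing 300 requests per minute. A minimal single-process sketch of that technique is shown below. It is not PromptForge's implementation: the class name `TokenBucket`, the burst capacity of 300, and the injectable clock are assumptions made for illustration.

```python
import time
from typing import Callable


class TokenBucket:
    """Hypothetical token bucket: refills continuously, spends one token per request."""

    def __init__(
        self,
        rate_per_minute: float = 300.0,  # limit quoted in the project description
        capacity: float = 300.0,         # assumed burst size
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._rate = rate_per_minute / 60.0  # tokens added per second
        self._capacity = capacity
        self._tokens = capacity              # start with a full bucket
        self._clock = clock
        self._last = clock()

    def _refill(self) -> None:
        """Add tokens for the time elapsed since the last call, capped at capacity."""
        now = self._clock()
        self._tokens = min(self._capacity, self._tokens + (now - self._last) * self._rate)
        self._last = now

    def try_acquire(self, cost: float = 1.0) -> bool:
        """Spend `cost` tokens if available; return False when the caller should wait."""
        self._refill()
        if self._tokens >= cost:
            self._tokens -= cost
            return True
        return False
```

With a capacity equal to the per-minute rate, a client can burst up to 300 requests at once and then continues at 5 requests per second. A smaller capacity would smooth traffic more strictly. A distributed queue would also need the bucket state shared across workers, which this sketch does not attempt.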