You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.
AI-powered disaster response platform with offline-first architecture using Gemma 3n. Provides computer vision hazard detection, voice analysis with emergency keywords, PDF report generation, and multi-user coordination - all working without internet access.
Circuitry.ai is an open-source tool that combines computer vision and large language models to detect, analyze, and explain electronic circuit diagrams. It leverages YOLOv8 for component detection and LLaMA 3 for generating intelligent textual explanations of how the circuit works.
Multi Modal Agent using Deepgram and Groq LPU's and Sentence Transformers for Vector Embeddings, ChromaDB for persistent vector db storage and BM25 with ColBERT reranking for hybrid search for RAG context management
Medibot is a voice-enabled medical AI assistant using RAG for accurate healthcare conversations. Evolved from my text-based chatbot, it now understands spoken questions and responds with voice answers, making medical guidance more accessible through intuitive multimodal interaction.
Framework for training and evaluating "Single-Branch Network Architectures to Close the Modality Gap in Multimodal Recommendation" (TORS Journal Paper)
A Flask-based web app for managing multimodal datasets text and images with CRUD operations via SQLite, and seamless export as a structured Parquet dataset to Hugging Face Hub.