Built an AI backend that thinks before it responds. Not just a wrapper around an LLM — it actually classifies your intent first, then decides what to do. Send it "2 + 2" → math engine handles it Send it "explain recursion" → explanation pipeline kicks in Send it "how's your day" → conversational flow takes over All routing is handled by a LangGraph state machine. No if-else spaghetti. Stack: → FastAPI for the API layer → LangGraph for intent routing → Groq (LLaMA 3.3-70b) for LLM inference → PostgreSQL for storing chat history per user Hardest part? Getting the intent classifier to not hallucinate random words and break the router . Next: JWT auth + persistent memory across sessions. GitHub: https://lnkd.in/dFq_nkVR #FastAPI #LangGraph #Python #BackendDevelopment #AI #OpenToWork
More Relevant Posts
-
🎓 I built 𝗔𝗜-𝗽𝗼𝘄𝗲𝗿𝗲𝗱 𝗔𝘁𝘁𝗲𝗻𝗱𝗮𝗻𝗰𝗲 𝗦𝘆𝘀𝘁𝗲𝗺 that takes attendance in seconds, no paperwork. 𝗧𝗛𝗘 𝗣𝗥𝗢𝗕𝗟𝗘𝗠: Teachers waste 5-10 minutes every class just calling names. I decided to fix that. 𝗔𝘁𝘁𝗲𝗻𝗱𝗮𝗻𝗰𝗲 𝗦𝘆𝘀𝘁𝗲𝗺 uses Face Recognition + Voice Recognition to mark attendance automatically: 📸 Teacher uploads a classroom photo → AI detects every face → marks who's Present or Absent 🎤 Or record classroom audio → AI identifies each student by their voice 🔐 Students log in just by looking at the camera — no passwords needed 🛠️ Tech Stack: • 𝗦𝘁𝗿𝗲𝗮𝗺𝗹𝗶𝘁 — Full web app UI • 𝗦𝘂𝗽𝗮𝗯𝗮𝘀𝗲 (PostgreSQL) — Database • 𝗗𝗹𝗶𝗯 + 𝗳𝗮𝗰𝗲_𝗿𝗲𝗰𝗼𝗴𝗻𝗶𝘁𝗶𝗼𝗻 — Face detection & recognition • 𝗥𝗲𝘀𝗲𝗺𝗯𝗹𝘆𝘇𝗲𝗿 + 𝗟𝗶𝗯𝗿𝗼𝘀𝗮 — Voice recognition • 𝗦𝗰𝗶𝗸𝗶𝘁-𝗹𝗲𝗮𝗿𝗻 𝗦𝗩𝗖 — Face classification • 𝗯𝗰𝗿𝘆𝗽𝘁 — Secure password hashing • 𝗣𝘆𝘁𝗵𝗼𝗻 — Everything glued together 🔗 GitHub: https://lnkd.in/diTdB-dU Would love your feedback! Drop a comment #ArtificialIntelligence #DeepLearning #ComputerVision #NLP #VoiceAI #FaceRecognition #Python #Streamlit #OpenSource #BuildInPublic #MachineLearning #DataScience #StudentProject #TechInnovation #SoftwareDevelopment #WebApp #FullStackDeveloper #AIProjects #Python #AI #MachineLearning #ComputerVision #Streamlit #FaceRecognition #BuildInPublic
To view or add a comment, sign in
-
🚀 Excited to share my first hands-on Generative AI project using Google Gemini API! As part of my self-learning journey in GenAI, I built a simple AI-powered application in Python that can generate responses dynamically using Gemini models. What I implemented: ✅ Integrated Google Gemini API ✅ Loaded API keys securely using .env ✅ Explored available Gemini models programmatically ✅ Used gemini-2.5-flash for content generation ✅ Built a reusable Python function for AI prompting ✅ Generated AI responses directly from custom prompts Tech Stack: Python Google Generative AI SDK Gemini 2.5 Flash dotenv Jupyter Notebook Sample Functionality: Prompt-based AI response generation Dynamic model calling Model capability exploration This project helped me understand: 🔹 How LLM APIs work 🔹 Model integration in Python 🔹 Prompt-response workflow 🔹 Secure API handling 🔹 Basics of GenAI application development I’m excited to combine Analytics + GenAI to build smarter AI-driven solutions. Learning. Building. Improving. 🚀 #GenerativeAI #GenAI #Python #GeminiAI #GoogleAI #MachineLearning #AI #LLM #DataAnalytics #ArtificialIntelligence #TechLearning #Innovation
To view or add a comment, sign in
-
🧠 Chunking is one of the most important parts of RAG — and most beginners ignore it. In RAG, we don’t send entire PDFs or documents to the LLM. We split them into smaller chunks before creating embeddings. Why? 👇 ✅ Better semantic search ✅ Faster retrieval ✅ More accurate context ✅ Lower hallucinations But here’s the catch 🚨 Bad chunking = bad RAG performance. Choosing the right: 📌 Chunk size 📌 Chunk overlap 📌 Semantic boundaries can massively improve answer quality. RAG is not just “upload PDF and chat with AI” 😅 #RAG #AI #LLM #GenerativeAI #LangChain #VectorDatabase #SemanticSearch #MachineLearning #OpenAI #Developers #SoftwareEngineering #Tech #NodeJS #Python
To view or add a comment, sign in
-
-
Understanding modern RAG architecture is becoming one of the most important skills in AI engineering. This diagram perfectly explains how advanced AI systems combine: • Vector Databases • Graph Databases • Query Routing • Re-ranking • Semantic Search • Indexing Strategies • Self-RAG & Active Retrieval Building powerful AI applications is no longer only about prompting LLMs — the real power comes from intelligent retrieval pipelines and high-quality data flow. A well-designed RAG system can dramatically improve accuracy, relevance, and real-time knowledge retrieval in AI products. #AI #RAG #GenerativeAI #LLM #ArtificialIntelligence #MachineLearning #VectorDatabase #GraphDB #MLOps #DataEngineering #AIEngineering #LangChain #Python #BackendDevelopment
To view or add a comment, sign in
-
-
Just pushed a RAG deep-dive repository to GitHub — and I want to break down exactly what's inside. Retrieval-Augmented Generation is one of the most practical skills in GenAI right now. Instead of hoping an LLM "knows" the answer, RAG makes it retrieve the right context first — then respond. Less hallucination. More accuracy. So I built a modular repo that walks through every core component using LangChain: Document Loaders — ingest data from PDFs, CSVs, web pages, text files & directories Text Splitters — chunk documents intelligently with overlap for better context continuity Vector Stores — embed chunks and store them in FAISS / Chroma for similarity search Retrievers — query smarter using MultiQuery, ContextualCompression, ParentDocument & SelfQuery retrievers Each module is a standalone Jupyter notebook — pick up any stage and run it independently. Stack: LangChain · FAISS · Chroma · HuggingFace Embeddings · OpenAI · Python This is part of my ongoing GenAI learning roadmap — covering Transformers → Prompt Engineering → RAG → Fine-tuning. Repo link : https://lnkd.in/gJ4YvTiu #RAG #LangChain #GenerativeAI #LLM #MachineLearning #NLP #Python #OpenToWork #DataScience #AIEngineering LangChain
To view or add a comment, sign in
-
-
Built an AI-powered RAG assistant called AURA. Over the last few months, I’ve been working on systems focused on: Retrieval-Augmented Generation (RAG) semantic search document intelligence FastAPI backends vector databases LLM-powered workflows AURA was designed to improve document-based question answering using: • FAISS vector search • semantic retrieval • chunking pipelines • contextual grounding • reranking workflows One thing I learned while building this project: Good AI systems are not just about calling an LLM API. The retrieval pipeline matters just as much as the model itself. Tech Stack: Python • FastAPI • LangChain • FAISS • Sentence Transformers GitHub: [https://lnkd.in/dFDZzwCa] I’m currently exploring more advanced RAG and agentic AI systems. #AI #RAG #LLM #FastAPI #LangChain #MachineLearning
To view or add a comment, sign in
-
-
A small FastAPI pattern that made my AI APIs cleaner 👇 ```python from fastapi import FastAPI from pydantic import BaseModel app = FastAPI() class QueryRequest(BaseModel): question: str @app.post("/ask") async def ask_ai(data: QueryRequest): # Example retrieval step context = "Retrieved relevant context" # Prompt creation prompt = f""" Context: {context} Question: {data.question} """ # Mock AI response response = { "answer": "Generated response from LLM" } return response ``` Simple thing, but separating: ✔ Request validation ✔ Context retrieval ✔ Prompt creation ✔ Response generation made debugging much easier later. One thing I’m learning with AI APIs: If the flow is not structured early, the complexity grows very quickly. #FastAPI #Python #AI #Backend #GenerativeAI #RAG #SoftwareEngineering
To view or add a comment, sign in
-
38/60 🚀ABTalksOnAI – AI Engineering Challenge Day 38 was focused on understanding how vector databases work using FAISS and how modern AI systems retrieve semantically relevant information in milliseconds. Today’s task connected embeddings with retrieval workflows and helped me understand what actually powers semantic search systems behind RAG applications and AI assistants. 🧠 What I built today: • Generated embeddings for multiple documents using OpenAI • Stored vectors efficiently using FAISS • Built a similarity search pipeline • Retrieved semantically related documents from vector search • Explored how retrieval systems work internally in AI applications ⚙️ Tools & Tech Used: • Python • OpenAI Embeddings API • FAISS • NumPy • Vector Similarity Search Key Learning: One thing that became very clear today is that embeddings alone are not enough — efficient retrieval infrastructure is equally important. FAISS made it easier to understand how large-scale AI systems can search through thousands or millions of vectors extremely fast while still retrieving contextually relevant information. This task helped me connect the dots between embeddings, vector databases, semantic search, and RAG pipelines much more practically. If you also want to start building AI systems publicly and learn hands-on AI engineering, you can join the challenge at abtalks.in 🚀 #AIEngineering #FAISS #VectorDatabase #SemanticSearch #RAG #OpenAI #Embeddings #Python #ABTalksOnAI #LearningInPublic
To view or add a comment, sign in
-
-
Backend Engineer Learns AI Infrastructure — Day 3 Today I explored one of the core concepts behind semantic search and vector databases: Cosine Similarity. At first, cosine similarity looked like just another mathematical formula. But while learning embeddings, vector retrieval, and Qdrant internals, I realized it solves a very important problem: Measuring meaning instead of exact values. This topic helped me understand: • Why cosine similarity compares direction instead of magnitude • How embeddings with similar meaning point in similar directions • Why cosine similarity works well for semantic search • How vector databases like Qdrant use cosine similarity internally • Why retrieval quality heavily depends on similarity calculations One thing that became very clear: In AI systems, retrieval quality matters more than most people think. Better similarity search leads to: better context, better retrieval, and ultimately better LLM responses. I also started understanding why vector databases optimize heavily around: ANN search, cosine similarity, indexing strategies, and retrieval latency. The deeper I go into AI infrastructure, the more I realize: modern AI systems are a combination of backend engineering + search systems + intelligent retrieval. Current stack I’m exploring: FastAPI + Qdrant + PostgreSQL + Redis + LLMs #AIInfrastructure #FastAPI #Qdrant #RAG #BackendEngineering #LLM #VectorSearch #Python
To view or add a comment, sign in
-
-
How can AI actually understand a massive PDF the way a human expert does? Most RAG systems split documents into random chunks based on token limits. The problem? Context gets lost, sections break apart, and answers become less accurate. PageIndex by VectifyAI solves this in a smarter way. Instead of treating the document as plain text, it converts it into a structured JSON tree that LLMs can understand much better. The system: detects the table of contents understands the document structure using LLMs splits content based on real sections instead of random chunks generates summaries and metadata for each section builds a hierarchical tree for smarter retrieval The result is better document understanding for: RAG systems AI agents research assistants enterprise knowledge systems This allows AI to navigate documents intelligently instead of searching through disconnected text chunks. 🔗 PageIndex GitHub Repository PageIndex #AI #LLM #RAG #OpenSource #MachineLearning #ArtificialIntelligence #Python #SoftwareEngineering
To view or add a comment, sign in
-
Explore related topics
- Building AI Applications with Open Source LLM Models
- LLM Applications for Intermediate Programming Tasks
- Understanding LLM Self-Routing in Inference
- How Llms Process Language
- Guide to Meta Llama Large Language Models
- How to Build APIs With Graphql
- How LLMs Improve Human Language Analysis
- Using LLMs to Translate Human Intent Into Code
- How LLMs Model Human Language Abilities
- How LLMs Understand Human Thought Patterns