“Mihir, whom I have worked with for more than 4 years, continues to amaze me. Through intelligence, technological prowess, business acumen, and a work-ethic that is second to none, Mihir has consistently delivered measurable value to his peers, employer, and clients. I am certain that Mihir, a valued friend and a respected colleague, would quickly earn a spot on anyone’s A-Team.”
Sign in to view Mihir’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Greater Seattle Area
Sign in to view Mihir’s full profile
Mihir can introduce you to 10+ people at Microsoft
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
3K followers
500+ connections
Sign in to view Mihir’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Mihir
Mihir can introduce you to 10+ people at Microsoft
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Mihir
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Mihir’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
About
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
Experience & Education
-
Microsoft
********* ******* ******** ********* ******
-
******
********* ******* ******** ****** ****** ******* ********
-
********
******* * ******** ********
-
*** ********** ** ***** ** ******
****** ** ******** ************** ***** undefined undefined
-
-
****** ********* ** *********** ******
******* undefined
View Mihir’s full experience
See their title, tenure and more.
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
or
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Licenses & Certifications
Volunteer Experience
-
Volunteer
Association for India's Development
- 18 years 10 months
Economic Empowerment
AID is a non-profit dedicated to holistic developmental initiatives in India. I have been a volunteer for 15+ years, working as an individual contributor, and as a team leader on several initiatives in fund raising, project evaluation, and community involvement.
Courses
-
Marketing Management
-
-
Strategic Management
-
Recommendations received
3 people have recommended Mihir
Join now to viewView Mihir’s full profile
-
See who you know in common
-
Get introduced
-
Contact Mihir directly
Other similar profiles
-
Larry Heathcote
Larry Heathcote
The Unforgettable Agent is a brand of Mind Shift 4 You, LLC
30K followersWake Forest, NC
Explore more posts
-
Fernando Migone, PhD
Welocalize • 6K followers
My team is on a roll!! In our latest research, we dug into one of the hardest challenges in AI today: causal reasoning — and exposed major blind spots in how state-of-the-art models handle complex, real-world causation. 💡 Most evaluations force models to choose one answer. But real-life causes aren’t that simple. So we tested 8 frontier LLMs using multi-select (polytomous) tasks across 576 scenarios. The results? Both surprising and sobering. 🔍 Key insights: - Models adopt a conservative strategy when identifying causes but over-select confounders—revealing asymmetries in reasoning across task types. - Chain-of-thought prompting? 🚫 Not helping here. - How you score these evaluations can change which model looks best: Our results show that rankings shift significantly depending on the scoring metric used. - We uncovered behavioral signatures by model family: some are cautious and precise, others more liberal and prone to over-selection. ⚠️ The implications for AI in healthcare, law, and policy are huge. Missing causal factors isn’t just suboptimal — it’s dangerous. 🧠 This research doesn't just offer a new benchmark. It offers a new lens for understanding model behavior under uncertainty and instruction complexity. 🔗 Full methodology, scoring analysis, and model comparisons https://lnkd.in/gvTEDhej Major kudos to the team: David Harper, Konstantinos Karageorgos & Abigail Thornton, PhD How are you measuring causal reasoning in your AI systems? #AI #LLMs #CausalReasoning #AIEvaluation #ResponsibleAI #WeloData #MachineLearning #AIRisk #ModelSelection #AIResearch
34
1 Comment -
Annie Pearl
Microsoft AI • 9K followers
Today we announced that Maia 200 is now online in Azure. Maia 200 reflects a year-long focus on making AI inference more efficient and easier to deploy at scale. It delivers 30% better performance per dollar than our latest-generation fleet, with strong FP4/FP8 throughput optimized for large-scale AI workloads. In addition to performance and deployability, it also helps us make production-ready AI at hyperscale more sustainable by lowering energy per workload. Pretty exciting.
21
-
Carmella (Surdyk) Weatherill
3K followers
We're excited to announce the new AI-first Colab Enterprise experience in Vertex AI and BigQuery. This powerful platform streamlines complex data science workflows, allowing you to simply prompt an agent with a request like "train a model to predict income." The agent then autonomously generates and executes a complete plan—from data loading and cleaning to model training and evaluation. It's a game-changer for accelerating your path from data to insights.
-
Yuqing Gao
Cisco • 2K followers
The AI Research Team at Cisco Central AI Org keeps innovating how LLMs can be applied to help customers in managing large complex operational problems. This blog that the team just published is a great example of addressing the 'Large Context Window' bottleneck, delivering a scalable framework that slashes LLM latency and operational costs without compromising precision. https://lnkd.in/giFG45Cx
61
-
Dongsoo Lee
Presidential Council on… • 6K followers
I am delighted to share that our paper, “CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs,” has been accepted to NeurIPS 2025! It has long been a prevailing assumption that while codebook-based quantization (often referred to as vector quantization) can achieve higher accuracy than other formats such as uniform quantization, thanks to its more flexible representation for a given number of bits, the lack of regular structure makes it extremely difficult to accelerate GEMM operations. In this work, we challenge and overturn that long standing belief. By precomputing and storing inner product results in what we call a Psumbook, we demonstrate that codebook based quantization can indeed be made efficient for GEMM. We firmly believe that the future of cost-efficient AI services lies in combining compression algorithms with bold, innovative implementation strategies, and this paper represents one exciting step in that direction. ----------------------- Title: CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs Authors: Gunho Park (NAVER Cloud), Jeongin Bae (NAVER Cloud), Byeongwook Kim (NAVER Cloud), Baeseong park (NAVER Cloud), Jiwon Ryu (NAVER Cloud), Hoseung Kim (NAVER Cloud), Se Jung Kwon (NAVER Cloud), Dongsoo Lee (NAVER Cloud) Abstract: Codebook-based quantization delivers superior performance in extreme low-bit environments but is hindered by latency overhead in existing dequantization-based kernels, which require loading the entire codebook into programmable cache memory. We propose CodeGEMM, a matrix multiplication kernel optimized for computing codebook-based quantized models by precomputing and storing inner product results in a Psumbook, thereby eliminating the need for traditional dequantization. The kernel supports a wide range of hyperparameter configurations, enabling exploration of trade-offs between latency, memory usage, and accuracy to achieve optimal performance. CodeGEMM obtains a 2.27× speed-up compared to the state-of-the-art codebook-based quantization method with comparable accuracy in the 2-bit configuration.
175
2 Comments -
Meeta Y Vouk
Teradata • 4K followers
People see the results. They don't see the labor behind them — the relentless strategy work, the market deep-dives, the innovation bets that had to be right. It took deep focus, genuine team spirit, and nights where sleep wasn't an option. But we made it. And it was worth every hour. To the team I've had the privilege of leading — you are exceptional. Deep knowledge, lead with kindness, obsessed with customer outcomes, and quietly passionate about what we're building at Teradata. You don't talk about leading. You just lead. This is what vision plus execution actually looks like. Go Teradata. 🚀 Jefferson Uy, Pablo Escobar de la Oliva, Michael Riordan, Timothy Miller, Josefina Tamia van Geloven, Jessica Velez Perez, Danny R., Karthik Logasundaram, Doug Frazier, Faraz Ahmad, Pankaj Purandare, Mahesh Kharwadkar, Sukumar Burra, Kumar Sri Chandra Bhaskar Adabala, Subhadip (Sub) Bandyopadhyay
18
-
Aishwarya Srinivasan
634K followers
Attending Snowflake BUILD 2025 was such an incredible experience. It’s amazing to see how Snowflake is re-imagining the future of AI and data. Here were my top 5 takeaways from the conference: 1️⃣Building the agentic future: AI agents moving from concept to production. 2️⃣Cortex AISQL: natural language queries, no prompt engineering needed. 3️⃣ Cortex Agents: from queries to multi-step reasoning and automation. 4️⃣ Managed MCP Server: bridging AI tools through standardized connectivity. 5️⃣ Gen2 Warehouses: a data foundation built for real-time, enterprise-scale AI. Watch the full breakdown in my video 🎥 And if you’d like to learn more about Snowflake Intelligence, you can check out the link here: https://lnkd.in/gnRkJPv2 #SnowflakePartner #SnowflakeBUILD
303
31 Comments -
Thomas Been
16K followers
Tuning Spark, Ray, or Dask for ML isn’t one-size-fits-all. In a new Domino Blueprint, Principal Solution Architect Sameer Wadkar shows how API-driven scaling complements autoscaling to right-size clusters on demand—cutting idle spend and speeding bursts for GPU/I/O-heavy work. Practical steps, a lightweight Python client, and Helm install included. If you care about FinOps and MLOps, this is for you. Read the blueprint: https://gag.gl/WwWso7
1
-
Alex Shih
Q-CTRL • 4K followers
There are a lot of cool things about our new Q-CTRL AI Assistant, but one of my favorites is just how much EASIER it empowers people to work with quantum computers. We're starting with a deployment in Q-CTRL's docs, which has a wealth of technical information, enriched by our team's expertise, real-world examples , and client use cases. Now, anyone (literally) can truly experience the benefits of quantum computing. Not only do we provide the software abstraction layer, but we're also providing an AI-powered, expertise-driven configuration and interpretation layer for users and customers to get started with our products our dive deeper with more use case examples. It can not only generate code samples, but also the quantum circuits and inputs for the use cases you're interested in. Check out the video, and try it out by clicking "Ask AI" here: https://docs.q-ctrl.com!
17
-
Jan Beitner, PhD
Inflexion • 3K followers
#AI coding #agents are changing how I evaluate vendors. A strong argument for products is if they can be configured purely through code and #API / #MCP. Not because they have their own AI features - but because they let your AI agent do the work. Lightdash is a great example. It is not the most mature BI tool, but probably one of the most AI-ready - because they made all data and configuration accessible via API and gave AI agents the context to act on it, so Claude Code can end-to-end create dashboards by modelling data in the data warehouse and then build the visualizations. If your product is not API-first, AI cannot bring in outside context or act autonomously within it. That is becoming a real competitive disadvantage. https://lnkd.in/e6Gp3MGj
53
14 Comments -
Shubham Vora
Nutsovertech • 24K followers
I met one AI development agency founder, and they said our 50% RAG apps failed in production last year. Because teams don’t know how to handle: chunking + embeddings + vector DB + reranking + hallucinations + evaluation + monitoring. So I'm sharing a RAG Playbook that shows exactly how to build production-ready RAG systems, with practical examples. ✅ What you’ll learn from this playbook This playbook helps you go from: “RAG demo is working” → “RAG system is reliable in production” You’ll learn how to: - reduce hallucinations using prompting techniques - use advanced chunking methods (not random splits) - select the right embedding model for your use case - choose the best vector database (based on real needs) - add reranking to improve relevance - follow clear steps to build an enterprise-grade RAG system - run evaluation before production using key scenarios - monitor and optimize performance after deployment - track RAG quality using 4 powerful metrics If you’re building a RAG-based feature for your product, this playbook will save you weeks of trial and error. ✅ Save this post (you’ll need it later) 🔁 Repost for your team/network ➕ Follow Shubham Vora to learn more on AI agents.
139
77 Comments -
Paul Son
Concentrix • 4K followers
The combination of Netflix's event-driven architecture, domain-oriented design, and real-time processing capabilities creates a possible foundation for building an agentic AI mesh. While there may need to be some modifications—particularly in terms of semantic understanding and goal-oriented orchestration—the core infrastructure patterns for a decoupled architecture support distributed agent coordination at scale. I'm still not sure how context-sharing would occur. Netflix proves that millions of independent transactions can create emergent intelligence without explicit coordination. If Netflix's architecture can predict what you want to watch without agents explicitly sharing your viewing history, are we solving the wrong problem by focusing on context transfer instead of context inference? https://lnkd.in/eqVsz4px
11
-
Colin Judd
Accion Group • 5K followers
If you're giving LLMs more and more context and not getting the results you expect, check out my colleague Norman's new research paper about the maximum effective context window in LLMs. It's awesome to be building tech for the electric utility industry alongside such a thoughtful engineering leader. You can read his full paper here: https://lnkd.in/d4pMhXd9
4
-
Dr Haluk D.
Amazon • 6K followers
A 12-month professional, voluntary collaboration has finally borne fruit. This morning, I was honored to see the University of Washington News (https://lnkd.in/gRRpDqjA) and the UW Milgard School of Business (https://lnkd.in/gKJ_pwbm) highlight our Responsible GenAI Framework. Why do we care about responsible generative AI? Because GenAI systems are no longer experimental curiosities. They shape decisions, influence behavior, generate content at scale, and increasingly affect trust, fairness, privacy, and accountability in organizations and society. Without intentional design and governance, these systems can amplify bias, create opaque risks, and erode confidence rather than enable innovation. This framework is about making GenAI useful, ethical, and sustainable at the same time. It emphasizes responsibility not as a constraint, but as an enabler of long-term value, credibility, and impact. I am grateful to the collaborators, reviewers, and institutions who supported this work, and excited to see responsible GenAI move from aspiration to practice. To learn more: - Read the RGAF Framework (PDF): RGAF Framework https://lnkd.in/gTaxnndV #GenAI #ResponsibleGenAI International Society of Service Innovation Professionals (ISSIP) Milgard CBA
23
1 Comment -
Paul Ramsey
Google • 2K followers
Check out my latest demo for Week 3 of my 4-part series sharing the latest AlloyDB innovations from Next '26! What if you could run semantic vector searches on live database records and instantly join those results with your data warehouse or cold archives—without moving a single byte of data? In this demo, I’ll show how Lakehouse Federation acts as a unified access layer, connecting your live transactions, warehouse analytics, and historical archives under a single database lens, with: 🌐 Zero-Data-Movement Joins: Instantly join vector search results stored in AlloyDB with structured reference data in BigQuery or archived Parquet data in Apache Iceberg. No complex ETL pipelines required. 🚀 Automatic Query Pushdown: Keep performance fast and accurate by automatically pushing filters and aggregations directly down to BigQuery. 🔄 One-Click Reverse ETL: For queries that demand the absolute lowest latency, easily sync data from BigQuery directly into a local AlloyDB table with zero performance trade-offs. 🎯 Unified Data Platform: Query live transactional data and historical cold archives through a single lens—without changing your database connection string. Check out the full demo below to see Lakehouse Federation in action, and reach out to your Google Cloud team to learn more! #AlloyDB #GoogleCloud #Lakehouse #BigQuery #DataArchitecture #Next26 #Database cc: Sujatha Mandava, Bala Narasimhan, Sean Zinsmeister https://lnkd.in/dS6MiBjC
37
-
Josh Klahr
10K followers
A quick update on #OSI (Open Semantic Interchange), a spec focused on making semantic definitions portable across data, AI and BI tools: we’ve published the first public Apache 2.0 version of the spec, an important step toward more open, shared semantics! We’ve also added a number of new partners to the OSI ecosystem: AtScale, Coalesce.io, Collate, Credible, Databricks, JetBrains, Lightdash, and Qlik. It's great to see continued momentum and collaboration. In our latest working group session we identified several key working groups that will focus on the continued evolution of the specification, including things like: Metrics Expression Language, Composability, Catalog Integration, Ontology Representation, Model Converters, and Developer Tools More context in the blog below, and a link to the repo is in the comments 👇 https://lnkd.in/gEaQXSME
461
22 Comments
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content