People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://lnkd.in/gMMSKAaF
Thinking Machines Lab
Technology, Information and Internet
San Francisco, California 134,193 followers
About us
- Website
-
https://thinkingmachines.ai
External link for Thinking Machines Lab
- Industry
- Technology, Information and Internet
- Company size
- 11-50 employees
- Headquarters
- San Francisco, California
- Type
- Privately Held
Locations
-
Get directions
2300 Harrison St
San Francisco, California 94110, US
Employees at Thinking Machines Lab
Updates
-
We are partnering with NVIDIA to power our frontier model training and platforms delivering customizable AI. https://lnkd.in/gKVcwGtT
-
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other approaches for a fraction of the cost. https://lnkd.in/gKt-QXJ4
-
-
Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models! https://lnkd.in/gvgbYu3i
-
LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA. https://lnkd.in/gh33SJBA
-
-
Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices. https://lnkd.in/gDyaXr-f We explore a fundamental understanding of the geometry of neural network optimization.
-
Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly. The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains. https://lnkd.in/gKHbbJ_y