Answer.AI - Practical AI R&D
Answer.AI is a new kind of AI R&D lab which creates practical end-user products based on foundational research breakthroughs. To learn more about the research agenda and background behind our work, see our launch post and an independent analysis of our lab. See below for a complete list of published articles.
/llms.txt—a proposal to provide information to help LLMs use websites
We propose that those interested in providing LLM-friendly content add a /llms.txt file to their site. This is a markdown file that provides brief background information and guidance, along with links to markdown files providing more detailed information.
Sep 3, 2024
Jeremy Howard
Small but Mighty: Introducing answerai-colbert-small
Say hello to answerai-colbert-small-v1, a tiny ColBERT model that punches well above its weight.
Aug 13, 2024
Benjamin Clavié
How HTMX is Revolutionizing Web Development — Interview with Carson Gross
We recently sat down with Carson Gross, the creator of HTMX, to discuss how it’s changing the web.
Aug 4, 2024
Elizabeth Creason
FastHTML: Modern web applications in pure Python
FastHTML is a new way to create modern interactive web apps. It scales down to a 6-line python file, and scales up to complex production apps.
Aug 3, 2024
Jeremy Howard
JaColBERTv2.5🇯🇵: Optimising Retrieval Training for Lower-Resources Languages
Introducing JaColBERTv2.5🇯🇵, the new best Japanese retrieval model. Through this release, we present a thorough analysis to better understand what helps in training a good multi-vector retrieval model.
Aug 2, 2024
Benjamin Clavié
Cold-Compress 1.0: A Hackable Toolkit for KV-Cache Compression
Blog
Answer.AI is proud to publicly release Cold Compress 1.0 , a toolkit for creating and benchmarking state-of-the-art KV cache compression methods. Built on top of GPT-Fast, it…** ****************__**
Aug 1, 2024
Griffin Adams, Faisal Ladhak
Faith and Fate: Transformers as fuzzy pattern matchers
Are GPT-like models thinking? Unclear. But the Faith and Fate paper (Dziri, 2023) points out they are often “just” pattern matching.
Jul 25, 2024
Alexis Gallagher
gpu.cpp: portable GPU compute for C++ with WebGPU
We’re thrilled to announce the release of gpu.cpp. gpu.cpp is a lightweight, open-source library that makes portable GPU compute with C++ simple.
Jul 13, 2024
Austin Huang
Politicians can’t change math
It is not possible to provide assurance that a released model can’t be used to cause harm, because a released model can be changed.
Jun 28, 2024
Jeremy Howard
A little pooling goes a long way for multi-vector representations
We’re releasing an early version of a simple token pooling trick for ColBERT. This allows for considerable memory&disk footprint reduction with very minimal retrieval…
Jun 27, 2024
Benjamin Clavié
A new kind of literate program – Claudette
Claudette is the first ever ‘literate nbdev’ project. Its source is a rendered Notebook with callout notes and tips, HTML tables and images, detailed explanations. It explains how and why the code is written the way it is, along with how to use the libraries it depends on.
Jun 23, 2024
Jeremy Howard
Introducing Claudette, a new friend that makes Claude 3.5 Sonnet even nicer
Today, Anthropic launched the most powerful language model available: Claude 3.5 Sonnet. And today, we are making it ever better, with the launch of Claudette.
Jun 21, 2024
Jeremy Howard
The definition of ‘Artificial Intelligence’ in SB 1047 was actually meant for systems, not models
SB 1047 defines “AI Model” using a definition designed for “AI Systems”, and as a result fails to cover anything.
Jun 19, 2024
Jeremy Howard
What policy makers need to know about AI (and what goes wrong if they don’t)
Policy makers need to understand how AI really works. Otherwise bills like SB 1047 totally fail to achieve their stated goals.
Jun 17, 2024
Jeremy Howard
More=Better?
In this post I want to share a few quick experiments to show something that is both obvious and also perhaps underappreciated: you can get a sizeable quality boost from a…
May 17, 2024
Jonathan Whitaker
SB-1047 will stifle open-source AI and decrease safety
By imposing the restrictions on open-source AI, SB-1047 could reduce AI safety, through reducing transparency, collaboration, diversity, and resilience.
Apr 29, 2024
Jeremy Howard
Efficient finetuning of Llama 3 with FSDP QDoRA
We’re releasing FSDP QDoRA, a scalable and memory-efficient method to close the gap between parameter efficient finetuning and full finetuning.
Apr 22, 2024
Kerem Turgutlu
A few tips for working on high-surface-area problems
What do you do when there are too many pieces to fit in your head?
A few tips for working on things that involve a lot of moving pieces, that have a high “surface area”.
Apr 12, 2024
Johno Whitaker
Enabling 70B Finetuning on Consumer GPUs
A Technical Deep Dive into FSDP+QLoRA
A detailed guide for adding FSDP and QLoRA support to quantization libraries and training frameworks.
Mar 14, 2024
Benjamin Warner, Johno Whitaker, Kerem Turgutlu
You can now train a 70b language model at home
We’re releasing an open source system, based on FSDP and QLoRA, that can train a 70b model on two 24GB GPUs.
Mar 6, 2024
Lessons from history’s greatest R&D labs
A historical analysis of what the earliest electrical and great applied R&D labs can teach Answer.AI, and potential pitfalls
Jan 26, 2024
Eric Gilliam
A new old kind of R&D lab
Answer.AI will figure out the fundamental research needed to tame AI, and the development path needed to make it useful in practice
Dec 12, 2023
Jeremy Howard
A new old kind of R&D lab
Answer.AI will figure out the fundamental research needed to tame AI, and the development path needed to make it useful in practice