The Memo - 1/Apr/2024
DBRX 132B, companies spend $18M avg on LLMs, OpenAI Voice Engine, and much more!
To: US Govt, major govts, Microsoft, Apple, NVIDIA, Alphabet, Amazon, Meta, Tesla, Citi, Tencent, IBM, & 10,000+ more recipients…
From: Dr Alan D. Thompson <LifeArchitect.ai>
Sent: 1/Apr/2024 (31/Mar/2024 in US; all analysis as serious as ever)
Subject: The Memo - AI that matters, as it happens, in plain English
AGI: 72%
I had a lot of fun at a datacenter next door to me (thanks to Stuart and Marie!) that features a world-leading patented innovation: tanks of non-conductive mineral oil with NVIDIA A100s (and other chips) completely submerged in the liquid for cooling purposes. This DC was designed and patented in Perth, Western Australia. It is really, really strange to see all electronics—including power connectors—completely submerged in liquid.
It was also just a little bit emotional to be in the same kind of ‘hospital’ as the one that gave birth to Leta AI and GPT-3 (V100s), ChatGPT, GPT-4, DALL-E, and much more. (Except this hospital specializes in water births!) You can read more about the patented design with specs here: https://dug.com/dug-cool/
The BIG Stuff
Andreessen Horowitz: 16 Changes to the Way Enterprises Are Building and Buying Generative AI (21/Mar/2024)
Andreessen Horowitz has interviewed many Fortune 500s in the ‘technology, telecom, CPG [Consumer Packaged Goods], banking, payments, healthcare, and energy’ fields about their use of large language models.
The findings are sensational. Here are my ‘top 3’ charts, beginning with the outrageous 2024 expected LLM spend of US$18,000,000 per company. It is interesting to see that 100% of these companies used OpenAI models (probably via Microsoft Azure OpenAI or Microsoft Copilot, rather than ChatGPT Enterprise).
Later in this edition we look at 200 use cases for post-2020 AI.
Read more: https://a16z.com/generative-ai-enterprise-2024/
Sidenote: Here is one of my newest (and simplest!) slides, added this month for a 2-hour workshop on Microsoft Copilot for Microsoft 365, delivered to a major utility provider:
Full subscribers can watch some of my keynote rehearsals and recorded presentations here:
Red Lines (10/Mar/2024)
On 10 March 2024, leading global AI scientists met in Beijing, China in collaboration with the Beijing Academy of AI (BAAI). They signed a ‘Red Lines’ document.
This definitely fits under The BIG Stuff heading, but it’s unusually long so I provide full commentary in the Policy section of this edition.
56 new models announced in Q1 2024 (Apr/2024)
That was a massive first quarter. I can’t believe it’s over and we’re in April already. That means we’re half way to my next ‘The sky is…’ AI report.
Claude 3 Opus helped format this list of 56 LLM highlights announced this quarter, using data from my Models Table (you can see my prompt and conversation here 26/Mar/2024):
January 2024
JPMorgan DocLLM 7B, SUTD/Independent TinyLlama 1.1B, Tencent LLaMA Pro 8.3B, DeepSeek-AI DeepSeek 67B, DeepSeek-AI DeepSeekMoE 16B, Zhipu AI (Tsinghua) GLM-4 200B, Adept Fuyu-Heavy 120B, Tencent FuseLLM 7B, DeepSeek-AI DeepSeek-Coder 33B, Cornell MambaByte 972M, LMU MaLA-500 10B, RWKV RWKV-v5 Eagle 7B, Meta AI CodeLlama-70B, Apple MGIE 7B, iFlyTek Xinghuo 3.5 (Spark) 200B, iFlyTek iFlytekSpark-13B, Mistral AI miqu 70b, AIWaves.cn Weaver 34B.
February 2024
Cerebras FLOR-6.3B, Allen AI OLMo 7B, Google TimesFM 200M, AI Singapore Sea-Lion 7.5B, ChatDB Natural-SQL-7B, BRAIN GOODY-2, Alibaba Qwen-1.5 72B, Google DeepMind Gemini 1.5 Pro MoE, Google DeepMind Gemma 7B, Reka AI Reka Flash 21B, Reka AI Reka Edge 7B, Apple Ask 20B, Reliance Hanooman 40B, Mistral AI Mistral Large 540B, Mistral AI Mistral Small 7B, ByteDance 175B, ByteDance 530B, HF/ServiceNow StarCoder 2 15B, HF Cosmo-1B, SambaNova Samba-1 1.4T CoE.
March 2024
Anthropic Claude 3 Opus 2T, SRIBD/CUHK Apollo 7B, Inflection AI Inflection-2.5 1.2T, Stability AI Stable Beluga 2.5 70B, Fudan University AnyGPT 7B, DeepSeek-AI DeepSeek-VL 7B, Cohere Command-R 35B, Covariant RFM-1 8B, Apple MM1, RWKV RWKV-v5 EagleX 7.52B, Independent Parakeet 378M, Rakuten Group RakutenAI-7B, Sakana AI EvoLLM-JP 10B, Stability AI Stable Code Instruct 3B, MosaicML DBRX 132B MoE, AI21 Jamba 52B MoE, xAI Grok-1.5 314B, Alibaba Qwen1.5-MoE-A2.7B 14.3B MoE.
The final five bolded models were all announced in about a 24-hour period just before the Easter weekend.
See them on the Models Table: https://lifearchitect.ai/models-table/
The Interesting Stuff
Paper: GPT-4 can help fly a plane (25/Mar/2024)
I don’t list a ‘paper of the week’ in these editions, but if I did, this would be my favorite paper this week. Absolutely outrageous, and an incredible case study by the research team.
[GPT-4V and GPT-4 was used] to interpret and generate human-like text from cockpit images and pilot inputs, thereby offering real-time support during flight operations. To the best of our knowledge, this is the first work to study the virtual co-pilot with pretrained LLMs for aviation...
The case study revealed that GPT-4, when provided with instrument images and pilot instructions, can effectively retrieve quick-access references for flight operations. The findings affirmed that the V-CoP can harness the capabilities of LLM to comprehend dynamic aviation scenarios and pilot instructions.
Remember that GPT-4 is coming up to its 2nd birthday (it was ready in the OpenAI lab back in August 2022), and we’re still discovering its capabilities…
Read the paper: https://arxiv.org/abs/2403.16645v1
See my list of GPT achievements.
GPT-6 (Mar/2024)
I like to keep on the ‘bleeding edge’ of AI, but this one came quicker than even I was prepared for. GPT-5 isn’t even ready yet, and here are updates about GPT-6’s setup.
This is another advisory-grade edition. Let’s look at a lot more AI, including major new datasets, my Red Lines analysis, new Sora details, the AI platform replacing TIkTok, 200 use cases for post-2020 AI, two companies commit a quarter trillion dollars to datacenters, and much more…