The Memo - 27/Dec/2024

NVIDIA GB300, OpenAI o3, Microsoft’s 485,000 Hopper chips, and much more!

Dec 26, 2024

∙ Paid

To:      US Govt, major govts, Microsoft, Apple, NVIDIA, Alphabet, Amazon, Meta, Tesla, Citi, Tencent, IBM, & 10,000+ more recipients…
From:    Dr Alan D. Thompson <LifeArchitect.ai>
Sent:    27/Dec/2024
Subject: The Memo - AI that matters, as it happens, in plain English
AGI:     84 ➜ 88%
ASI:     0/50 (no expected movement until post-AGI)

It was a nice surprise to see my GPT-5 dataset paper findings featured in the G7 AI document. G7 (wiki) is an intergovernmental organization of Canada, France, Germany, Italy, Japan, the UK, and the US. The document is available for download in the Policy section of this edition.

The Memo subscription fee for new subscribers will increase from 1/Jan/2025. If you’re a current subscriber, you’ll always be on your old/original rate while you’re subbed. Free readers can become a full subscriber at the old/current rate before New Year’s Day.

Contents

The BIG Stuff (AI report, o3 reasoning model, DeepSeek-V3, 485k Hopper chips…)
The Interesting Stuff (Veo 2 examples, Falcon 3, The Well, Harvard Books…)
Policy (G7 AI document, NIST Claude 3.5S new, NIST o1…)
Toys to Play With (new free OpenAI course…)
Flashback (My Aurora experiments from 2021…)
Next (Roundtable…)

The BIG Stuff

2024 AI retrospective: The sky is steadfast (21/Dec/2024)

Image generated in a few seconds, on 19 December 2024, text prompt by Alan D. Thompson, via Google Imagen 3-002: ‘Beautiful sky, Australian rural setting, in the style of Kazuo Oga and Atey Ghailan, rolling hills, sheep and wallabies and a small cottage, cinematic lighting, otherworldly colors.’

Full subscribers to The Memo received my end-of-year AI report last week. Headings include: The BIG Stuff, Large language models, Datasets, China, and Performance.

The final section, ‘How I'm preparing for AGI and ASI,’ lists six practical actions I’ve been personally taking for the last few years.

Read the report: https://lifearchitect.ai/the-sky-is-steadfast/

o3 reasoning model (19/Dec/2024)

Many people are freaking out about the superhuman performance of OpenAI’s latest reasoning model, o3. In my view, this is just a natural point on an exponential curve. This thing is smart, outperforming most people on the planet, and much closer to ASI (beyond the most gifted human) than just plain ol’ AGI (average human). This model increased my AGI countdown from 84% ➜ 88%.

My latest viz shows most of what you need to know:

Read more: https://lifearchitect.ai/o3/

DeepSeek-V3: Strong MoE language model (2024)

Hangzhou-based AI lab DeepSeek-AI (深度求索, Shēndù qiúsuǒ) released a Christmas gift with the open-source DeepSeek-V3 model. This thing is huge: 685B parameters MoE, of which 37B are activated per token, trained on 14.8T tokens (22:1).

GPQA=59.1, MMLU=87.1.

The model is available immediately on the free chat.deepseek.com playground, scoring 5/5 for the 2024H1 ALPrompt, but 0/5 for the 2024H2 ALPrompt.

Announce, paper, weights

See it on the Models Table: https://lifearchitect.ai/models-table/

NVIDIA GB300 'Blackwell Ultra' 288GB memory, 1,400W power (23/Dec/2024)

NVIDIA's upcoming GB300 AI server is set to redefine performance standards with its B300 GPU chip, boasting 288GB of HBM3E memory and a 1,400W TDP [Thermal Design Power].

SemiAnalysis commented (25/Dec/2024) on the move from H100 ➜ H200 ➜ B300:

Reasoning models [like o1 and o3] can be a poor user experience due to significant waiting time between requests and responses. If you can offer significantly faster reasoning time, this will increase the user’s propensity to use and pay for them.
A 3x difference in cost is massive. Hardware delivering 3x with a mid-generation memory upgrade is frankly insane, way faster than Moore’s law, Huang’s Law, or any other pace of hardware improvement we’ve seen.
We have observed that the most capable and differentiated models are able to charge a significant premium over even slightly less capable models. Gross margins on frontier models are north of 70%, but on trailing models with open source competition, margins are below 20%. Reasoning models don’t have to be 1 chain of thought. Search exists and can be scaled up to improve performance as it has in o1 Pro and o3. This enables smarter models that can solve more problems and generate significantly more revenue per GPU.

The Interesting Stuff

Happily ever after: A children’s story on post-ASI scenario (23/Dec/2024)

My latest piece is a children’s story about the upcoming post-scarcity world.

Some grown-ups were scared about their piggy banks and money. “If the humanoids do everything for free,” they said, “what will happen to all our savings? What about our jobs?”

Read it: https://lifearchitect.ai/happily-ever-after/

Watch it (link):

Nobel Minds 2024: Prof Geoffrey Hinton on post-AGI scenario (19/Dec/2024)

This was a beautiful setting for the 2024 Nobel prize winners to speak. AI godfather Geoffrey Hinton had an interesting and pessimistic point about a possible post-AGI scenario (to offset my ‘hyper-optimistic’ children’s story above!)…

Interviewer: Geoffrey Hinton, do you think that this increase in productivity, essentially, that will come with automation and so on and so forth, is a good thing for society?

Hinton: Well, it ought to be, right? It's crazy; we're talking about having a huge increase in productivity, so there's going to be more goods and services for everybody, so everybody ought to be better off…

But actually, it's going to be the other way around. And it's because we live in a capitalist society. And so what's going to happen is this huge increase in productivity is going to make much more money for the big companies and the rich, and it's going to increase the gap between the rich and the people who lose their jobs. And as soon as you increase that gap, you get fertile ground for fascism.

And so it's very scary that we may be at a point where we're just making things worse and worse. And it's crazy because we're doing something that should help everybody, and obviously will help in healthcare, and help in education. But if the profits just go to the rich, that's going to make society worse.

Watch (link):

South Korea considers creating 'KSMC' chipmaker to compete with TSMC, $13.9B investment (24/Dec/2024)

South Korea is considering establishing a government-funded contract chipmaker, the Korea Semiconductor Manufacturing Company (KSMC), to compete with TSMC. The initiative aims to address the country's semiconductor industry's structural weaknesses and reliance on Samsung, proposing a $13.9 billion investment that could yield $208.7 billion in economic benefits by 2045. The plan highlights the need for increased R&D, financial incentives, and reduced regulatory burdens to enhance Korea's competitive edge and support smaller semiconductor firms.

The Memo by LifeArchitect.ai

The Memo - 27/Dec/2024

NVIDIA GB300, OpenAI o3, Microsoft’s 485,000 Hopper chips, and much more!

The BIG Stuff

The Interesting Stuff

This post is for paid subscribers