The Memo - 2/Dec/2023
BASIS benchmark for ASI, $20B+ of H100s sold in 90 days, Amazon Q, and much more!
FOR IMMEDIATE RELEASE: 2/Dec/2023
Welcome back to The Memo.
December 2023 already?!
If we include my first few articles about post-2020 AI, then we’re about to enter our fifth year together talking about AI, language models, and this revolution of many names (AIGC or AI-generated content, LLMs, Generative AI, GenAI, Transformers, Frontier models, Foundation models, Transformative AI). What’s that saying about time flying and having fun?
2024 is going to be a huge year for AI, bringing both formidable and comforting advances…
The Policy section in this edition might be my favourite so far, including a spectacular analogy by a major government leader, comparing AI and handwriting. We also explore a new legal ruling for AI-generated images.
In the Toys to play with section, we look at the latest and greatest way to run LLMs locally on your own computer, live voice interpretation that is eye-poppingly amazing, a new LLM orchestration pipeline, how AI will influence your love life, and much more.
The BIG Stuff
Introducing BASIS (Nov/2023)
We are rapidly approaching a position of being unable to test AI because ‘no one is smart enough’ (watch my Sep/2023 keynote of the same title, delivered to Devoxx Ukraine).
Introducing BASIS, the Betts artificial superintelligence suite, aimed at measuring the smartest AI in the world against the smartest human-created test items in the world.
It was great to work on this project, a benchmarking suite for artificial superintelligence, with Dr Jason Betts: Mensan, founder of the World Genius Directory, and designer of ultra-high-ceiling tests. There were some really funky specs, like having all items designed in a ‘clean room’ and then sealed in a locked bag. We both felt a bit like spies!
The suite is designed for testing frontier models like Gemini and GPT-5, aiming for IQ=180.
The suite is being made available to AI research labs around the world, on request.
Read more: https://lifearchitect.ai/basis/
Watch the livestream: https://youtube.com/live/DGPMJN0sskQ
Two alternative benchmarks were also released in November 2023:
GAIA by Meta/HuggingFace, average human level for IQ=100. (paper, dataset)
GPQA by NYU/Cohere/Anthropic, expert level for IQ=125. (paper, dataset)
Note that both GAIA and GPQA:
are not for the higher ceiling of artificial superintelligence
were not air-gapped
are already compromised, with datasets released and available online
OpenAI: Compute is king (20/Nov/2023)
The numbers for compute and $ being thrown around are getting crazy.
We’ve gone from OpenAI having access to 25,000 GPUs (Morgan Stanley, Feb/2023), a mix of A100s and H100s, to tripling that number and focusing exclusively on H100s (according to NVIDIA, the H100 is up to nine times faster for AI training). We’re in for a wild ride.
OpenAI, with plans for >$50B annual datacenter spend to race to AGI… one of OpenAI’s next training supercomputers in Arizona was going to have more than 75,000 GPUs in a single site by the middle of next year.
Our data also shows us that Microsoft is directly buying more than 400,000 GPUs next year [2024] for both training and copilot/API inference. Furthermore, Microsoft also has tens of thousands of GPUs coming in via cloud deals with CoreWeave, Lambda, and Oracle.
NVIDIA sold half a million H100 AI GPUs in Q3 thanks to Meta, Microsoft — lead times stretch up to 52 weeks: Omdia (28/Nov/2023)
NVIDIA reportedly sold nearly half a million of its H100 GPUs for AI and high-performance computing in the third quarter of 2023, largely due to purchases by Meta and Microsoft. The demand is so high that the lead time for H100-based servers now ranges from 36 to 52 weeks.
Let’s not forget that TSMC are the ones doing all the work here (read more in The Memo edition 17/Aug/2023). And the entire planet Earth is the critical path: every resource on that path seems to be running at 100% capacity. If only we had some sort of AGI to help us out here… (Actually, NVIDIA have already implemented AI for their latest chip design, Oct/2023).
Alan’s calcs: 500,000x GPUs sold in 90 days
= 5,555x GPUs sold per day
and, related:
500,000x GPUs at US$40,000/each
= $20,000,000,000 ($20B)
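The calcs above can be reproduced in a few lines of Python. Note that the US$40,000 unit price is an assumed average; real H100 prices vary by configuration and volume:

```python
# Back-of-envelope check of the H100 sales figures above.
gpus_sold = 500_000   # H100 GPUs sold in Q3 2023 (Omdia estimate)
days = 90             # roughly one quarter
price_usd = 40_000    # assumed average price per GPU

gpus_per_day = gpus_sold // days        # 5,555 GPUs sold per day
revenue_usd = gpus_sold * price_usd     # $20,000,000,000 ($20B)

print(f"{gpus_per_day:,} GPUs/day")
print(f"${revenue_usd:,}")
```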
The Interesting Stuff
Exclusive: 17 new model highlights for November 2023 (Nov/2023)
While research continues at the rate of one new AI paper published every ~8 minutes (see The Memo edition 9/Jul/2023), we’re also seeing one new major model released every ~42 hours on average.
November 2023 saw 17 major model announcements; for comparison, October had 10 models and September had 18. Some parameter counts are estimated. Here they are:
01-ai Yi-34B, xAI Grok-0 (33B), xAI Grok-1 (33B), Samsung Gauss (7B), NTU OtterHD-8B, Google DeepMind Mirasol3B (3B), Microsoft Florence-2 (0.771B), Microsoft phi-2 (2.7B), Microsoft Orca 2 (13B), Allen AI TÜLU 2 (70B), Anthropic Claude 2.1 (130B), Inflection AI Inflection-2 (1.2T), Berkeley Starling-7B, Microsoft Transformers-Arithmetic (0.1B), EPFL MEDITRON (70B), IEIT Yuan 2.0 (102B), Google DeepMind Q-Transformer.
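The ~42-hour cadence quoted above falls out of the monthly counts directly; here is a quick sketch, using a simplified 30-day month:

```python
# Release cadence implied by the monthly major-model counts above.
models_per_month = {"Sep/2023": 18, "Oct/2023": 10, "Nov/2023": 17}
hours_in_month = 30 * 24  # simplified 30-day month

nov_cadence = hours_in_month / models_per_month["Nov/2023"]
print(f"One major model every ~{nov_cadence:.0f} hours in Nov/2023")  # ~42 hours
```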
I also have a new and publicly available GPT-4 bot helping out with the comma-separated formatting above: https://poe.com/TheMemoModelsBot
See more about each of these models on the Models Table: https://lifearchitect.ai/models-table/
ChatGPT's 1-year anniversary: how it changed the world (30/Nov/2023)
VentureBeat takes a retrospective look at how OpenAI's ChatGPT has influenced the world one year since its launch, noting its rapid adoption and the controversies it has sparked, leading to debates about the role of large language models in society.
That article is a very strange and limited look at ChatGPT’s accomplishments.
See ChatGPT’s achievements on my shared sheet.
Coke executive mentions GPT-5 (23/Nov/2023)
Manolo Arroyo (Global Chief Marketing Officer for The Coca‑Cola Company):
I can give you maybe an insight, some pieces of new news that no one has shared so far. We have a partnership with Bain and OpenAI...
We were actually the first company that was combining GPT, which is the engine that enables ChatGPT, and DALL-E. Back then, no one knew that because of the partnership with OpenAI, we were the first company using GPT-4 and DALL-E 2 into one integrated consumer digital experience. No one knows, because it hasn't been launched yet, that Coca Cola Diwali has been done with GPT-5 which is still not commercially available, and DALL-E 3 that has also not been launched...
And that's how in just six months this technology is progressing... launch it for Christmas [2023] globally...
(— Coca-Cola’s Mega Marketing Transformation by The Morning Brief (The Economic Times) at 17m31s)
It’s possible that Manolo is referring to GPT-4V(ision) or GPT-4.5. It is unlikely that he actually means GPT-5, which OpenAI’s CEO has promised (including under oath) would not start training until December 2023.
Read more: https://lifearchitect.ai/gpt-5/
2022: Anthropic pranked OpenAI with thousands of paper clips to warn about AI apocalypse (23/Nov/2023)
An employee from Anthropic, a rival of OpenAI, sent thousands of paper clips in the shape of OpenAI’s logo to OpenAI’s office. The prank alluded to the possibility that OpenAI’s approach to AI could lead to humanity’s extinction, referencing the ‘paper clip maximizer’ scenario (wiki) by philosopher Nick Bostrom.
The OpenAI Turkey-Shoot Clusterfuck (1/Dec/2023)
I was being polite calling it a circus in the last edition (The Memo 23/Nov/2023). Journalist Charles Duhigg was inside OpenAI offices during the ‘five-day crisis that some people at Microsoft began calling the Turkey-Shoot Clusterfuck.’
Some members of the OpenAI board had found Altman an unnervingly slippery operator. For example, earlier this fall he’d confronted one member, Helen Toner, a director at the Center for Security and Emerging Technology, at Georgetown University, for co-writing a paper that seemingly criticized OpenAI for “stoking the flames of AI hype.” Toner had defended herself (though she later apologized to the board for not anticipating how the paper might be perceived).
Altman began approaching other board members, individually, about replacing her. When these members compared notes about the conversations, some felt that Altman had misrepresented them as supporting Toner’s removal. “He’d play them off against each other by lying about what other people thought,” the person familiar with the board’s discussions told me.
“Things like that had been happening for years.” (A person familiar with Altman’s perspective said that he acknowledges having been “ham-fisted in the way he tried to get a board member removed,” but that he hadn’t attempted to manipulate the board.)
Read the source New Yorker article.
Read former board member Helen Toner’s statement via Twitter.
Sidenote: Given that Reuters blamed the board coup on Q* and other AGI ‘breakthroughs’ (23/Nov/2023), it just may be that Reuters (and journalists Anna Tong, Jeffrey Dastin, and Krystal Hu) is no longer a reliable source.
As a reminder, the size of the gap between an AI lab having new tech and the public having access to that tech will be due to ‘other human factors’, such as company leaders behaving in very human ways…
Read more: https://lifearchitect.ai/gap/
OpenAI blog: Sam Altman returns as CEO, OpenAI has a new initial board (29/Nov/2023)
OpenAI has finally released a statement about the board and company mission.
TL;DR: nothing has really changed (well, Ilya is out, I suppose).
Sam Altman has returned as CEO of OpenAI, with Mira Murati serving as CTO and Greg Brockman returning as President. The new initial board consists of Bret Taylor (Chair), Larry Summers, and Adam D’Angelo.
Microsoft will have a non-voting observer seat; the name of this member has not been revealed.
Read more: https://openai.com/blog/sam-altman-returns-as-ceo-openai-has-a-new-initial-board
Realtime generative AI art is here thanks to LCM-LoRA (16/Nov/2023)
A new machine learning technique, known as LCM-LoRA (Latent Consistency Model Low-Rank Adaptation), has been developed by researchers at Tsinghua University and HuggingFace. This technique enables the generation of AI art in real time, which is a significant leap forward from the previous wait times of a few seconds to minutes.
Try it: https://huggingface.co/spaces/ilumine-AI/LCM-Painter
Introducing SDXL Turbo: a real-time text-to-image generation model (28/Nov/2023)
If you feel that Stability AI only just released a new model, you’re right. And here’s another one!
Stability AI has released SDXL Turbo, a real-time text-to-image generation model that uses a novel distillation technique called Adversarial Diffusion Distillation. The model synthesises image outputs in a single step, while maintaining high sampling fidelity.
The live demo video is… unexpected.
Try it via Clipdrop (login): https://clipdrop.co/stable-diffusion-turbo
Watch the video (link):
Amazon Q: Generative AI powered assistant (28/Nov/2023)
Amazon has introduced Amazon Q, a generative AI–powered assistant designed for work that can streamline tasks, speed decision-making, and spark creativity and innovation. The AI can be personalized to your business: it understands your company’s information, code, and systems, personalizes interactions based on your role and permissions, and is built with security and privacy in mind.
Amazon Q offers 40+ built-in connectors to popular enterprise applications and document repositories, including S3, Salesforce, Google Drive, Microsoft 365, ServiceNow, Gmail, Slack, Atlassian, and Zendesk.
The S3 connector is a fairly serious differentiator, and may set Amazon apart in this space. A huge percentage of F500s use AWS and S3, and providing this functionality allows them to bring in their entire knowledge base with a single click (or maybe two!).
While Amazon has their Titan model (my link) available, they also offer access to many other models via Amazon Bedrock:
Amazon Bedrock offers easy access to a choice of high-performing foundation models from leading AI companies, including AI21 Labs [Jurassic-2], Anthropic [Claude], Cohere [Command], Meta [Llama 2], Stability AI [Stable LM], and Amazon [Titan, and the upcoming Olympus 2T]. (— via Amazon)
Read more via Amazon Web Services.
Pika 1.0 (29/Nov/2023)
Pika Labs have announced their text-to-video and image-to-video model, and it looks amazing. We are one step closer to generating entire feature-length films in time for dinner.
Read the announce: https://twitter.com/pika_labs/status/1729510078959497562
Short film by AI (29/Nov/2023)
The latest AI video outputs are so good that we are moments away from text-to-movie. Notice the little camera movements in this new AI film. I expect that this kind of result (not by Pika) could come from a model providing script and scaffolding (like GPT-4), with a text-to-video model then generating a full movie in a matter of seconds…