The Memo - 24/Jan/2024
DeepMind AlphaGeometry, AI political robocalls begin, Zhipu GLM-4, and much more!
To: US Govt, major govts, Microsoft, Apple, NVIDIA, Alphabet, Amazon, Meta, Tesla, Citi, Tencent, IBM, & 10,000+ more recipients…
From: Dr Alan D. Thompson <LifeArchitect.ai>
Sent: 24/Jan/2024
Subject: The Memo - AI that matters, as it happens, in plain English
AGI: 64%
Ilya Sutskever, OpenAI (15/Mar/2023):
Say you read a detective novel… complicated plot, a storyline, different characters, lots of events, mysteries, clues, it’s unclear. Then, let’s say that at the last page of the book, the detective has gathered all the clues, gathered all the people and saying: ‘Okay, I’m going to reveal the identity of whoever committed the crime and that person’s name is…’ Predict that word. Now, there are many different words. But predicting those words better and better, the understanding of the text keeps on increasing. GPT-4 predicts the next word better.
Welcome back to The Memo.
You’re joining full subscribers from Alaska, Berkeley, Brown, Buffalo, Columbia, Cornell, Duke, Harvard, Hawaii, MIT, Notre Dame, Princeton, Rice, Stanford, and many more institutions.
The BIG Stuff
DeepMind AlphaGeometry (17/Jan/2024)

AlphaGeometry is an AI system that solves complex geometry problems at a level approaching a human Olympiad gold-medalist. It was trained using 100% synthetic data, and is completely open source.
Notably, on past International Mathematical Olympiad (IMO) geometry problems, AlphaGeometry ‘produces human-readable proofs, solves all geometry problems… under human expert evaluation and discovers a generalized version of a translated IMO theorem…’
DeepMind CEO Demis Hassabis said: ‘[AlphaGeometry is] Another step on the road to AGI.’ (Twitter)
The Metaculus prediction that an open-source AI will win an IMO gold medal by Jan/2028 is now much closer to being met, and may already have been met by this system. Crowd-sourced human estimates of exponential progress may be becoming irrelevant.
This system moved my conservative countdown to AGI from 64% → 65%.
Read the DeepMind blog.
Read the paper: https://www.nature.com/articles/s41586-023-06747-5
View the repo: https://github.com/google-deepmind/alphageometry
Watch the author explanation video.
Meta CEO: Llama 3 now training, Meta buying 350,000 H100s (18/Jan/2024)
Meta CEO: We're currently training our next-gen model Llama 3, and we're building massive compute infrastructure to support our future roadmap, including 350k H100s by the end of this year…
See not very much more on a noisy Instagram video post by Meta’s CEO: https://www.instagram.com/reel/C2QARHJR1sZ/
Read a summary by PCMag.
Open-sourced Llama 3 was first mentioned in The Memo edition 30/Aug/2023. We also introduced NVIDIA’s H100 Hopper chips—up to 6x faster than the A100 chips—in The Memo edition 17/Jul/2022, explored the NVIDIA/Microsoft 10,000× H100 supercomputer in The Memo edition 24/Nov/2022, and covered recent H100 purchases by major AI labs in The Memo edition 2/Dec/2023.
Zhipu AI GLM-4 (16/Jan/2024)
Chinese AI labs continue to play catch-up with model benchmarks from OpenAI and Google. Zhipu AI, a commercial spin-off of Tsinghua University research, has announced a new version of its GLM-3 model, unsurprisingly called GLM-4. The lab has admitted to closely following OpenAI’s roadmap.
Chinese commentators have called this the best Chinese model to date, with an MMLU score of 81.5 (GPT-4=86.4), and a context window of 128k (same as GPT-4 Turbo; GPT-4 was 8k/32k). I continue to assert that models out of China are coming at a faster pace and (in many cases) with better performance than models out of the US.
According to text on the official playground site ChatGLM.cn, the GLM-4 model has ‘hundreds of billions of parameters’. I’ve estimated this model at 200B parameters trained on 4T tokens. It is fully multimodal, tied into a new version of CogView for image generation and GLM-4V for ‘seeing’, similar to OpenAI’s GPT-4V.
Read not very much more via PanDaily (English).
See it mentioned in the older official GLM-3 repo (English).
Read a related Twitter discussion (English).
See it on the Models Table.
The Interesting Stuff
Robocall with artificial Joe Biden voice tells Democrats not to vote (22/Jan/2024)
In The Memo edition 16/Dec/2023, we looked at Ashley, the world’s first AI-powered political campaign caller, using ‘over 20 different AI models, some proprietary and some open source’. That technology has now been misused in the US for the 2024 elections.
An anti-voting robocall using a suspected AI-generated and fake version of US President Joe Biden's voice directed New Hampshire Democrats to skip the Presidential Primary, triggering an investigation by the state Attorney General's office.
Listen:
AI-Biden: What a bunch of malarkey. You know the value of voting Democratic when our votes count. It's important that you save your vote for the November election. We'll need your help in electing Democrats up and down the ticket. Voting this Tuesday only enables the Republicans in their quest to elect Donald Trump again. Your vote makes a difference in November, not this Tuesday. (NBC).
Read more via Ars Technica.
Read more about AI and the 2024 elections via OpenAI.
Update 26/Jan/2024: The AI voice platform used was indeed ElevenLabs (see my video from nearly a year ago; 31/Jan/2023), and the user account has been suspended. Read more via Bloomberg.
In the Toys to Play With section, we look at two of the best audio tools to replicate this functionality for yourself.
OpenAI announces first partnership with a university (18/Jan/2024)
OpenAI has partnered with Arizona State University, granting full access to ChatGPT Enterprise for coursework, tutoring, and research starting in February 2024.
Read more via CNBC.
Kurzweil: The Singularity Is Nearer due 24/Jun/2024 (24/Jan/2024)
We’ve been tracking this book release for a while.
Amazon now says it is due five months from now, on 24/Jun/2024.
Take a look: https://www.amazon.com/dp/B08Y6FYJVY
Read Ray’s 2005 book, The Singularity Is Near, at archive.org.
Read Ray’s 2020 essays celebrating the 15-year anniversary of The Singularity Is Near.
Read my coverage of his interviews 2022-2023: https://lifearchitect.ai/kurzweil/
This is another robust edition! Let’s look at the latest models, recent GPT-5 updates, Gemini Nano and Gemini Pro on phones, a new AI version of Amazon Alexa, new humanoids in famous factories, political updates, my AI setup, discounts on an OpenAI model app, and much more…
Exclusive: How much data is GPT generating? Microsoft Copilot: 3.4T words in 11 months (15/Jan/2024)
According to the Semrush analysis of Datos ChatGPT data (Jul/2023):
Avg number of words per message (86) × avg number of messages per conversation (8) = 688 words per conversation
It is not clear whether these averages include the generated output/response, but it’s a rough starting point. Microsoft's Copilot data (15/Jan/2024) reveals a milestone of 'more than 5 billion chats'.
5 billion ‘chats’ × 688 words per chat = 3,440,000,000,000 words (3.44T)
In 11 months (7/Feb/2023-15/Jan/2024) GPT via Microsoft Copilot has communicated 3.44 trillion words.
3.4T words is close to the combined text of all the books in all 30 libraries at Yale University (16.6M books).
I’ve previously (14/Jan/2024) extended this estimate to 176 trillion words generated by the main ChatGPT interface (chat.openai.com).
176T words is a stack of books halfway to the Moon (16.6B books).
Comparison data is at: https://lifearchitect.ai/chinchilla/
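If you want to check the arithmetic yourself, here is a minimal sketch in Python using the same published figures. It assumes, as the estimate above does, that each Copilot ‘chat’ is one average-length conversation:

```python
# Back-of-the-envelope estimate of words generated by GPT via Microsoft Copilot.
# Inputs: Semrush/Datos ChatGPT averages (Jul/2023) and Microsoft's
# 'more than 5 billion chats' milestone (15/Jan/2024).

AVG_WORDS_PER_MESSAGE = 86          # Semrush/Datos average
AVG_MESSAGES_PER_CONVERSATION = 8   # Semrush/Datos average
COPILOT_CHATS = 5_000_000_000       # Microsoft milestone, 15/Jan/2024

words_per_conversation = AVG_WORDS_PER_MESSAGE * AVG_MESSAGES_PER_CONVERSATION
total_copilot_words = COPILOT_CHATS * words_per_conversation

print(f"Words per conversation: {words_per_conversation}")        # 688
print(f"Total Copilot words:    {total_copilot_words:,}")         # 3,440,000,000,000 (~3.44T)
```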
GPT-5 updates from WEF (18/Jan/2024)
OpenAI’s CEO recently spoke at a number of big venues including the 54th annual meeting of the World Economic Forum (WEF) at Davos-Klosters (Switzerland) from 15th to 19th January 2024. His comments in one of the lectures were striking:
GPT-2 was very bad. GPT-3 was pretty bad. GPT-4 was pretty bad. But GPT-5 would be okay…
GPT-5 is much smarter (than previous models) and will offer more features. It adds inference capabilities, which is an important advance in its general-purpose ability to process tasks on behalf of users. Since people love ChatGPT’s voice feature, much better audio will be provided.
If I had to pick one thing, the writing would be greatly improved.
If you hold the iPhone released in 2007 in one hand and the (latest model) iPhone 15 in the other, you see two very different devices. I believe the same thing is true about AI.
I expect GPT-5 to be the biggest leap we’ve seen yet in terms of model capabilities.
Read the source (Korean): https://www.mk.co.kr/news/it/10924466
See my page on GPT-5: https://lifearchitect.ai/gpt-5/
DeepSeek-AI models (Jan/2024)
Chinese AGI lab DeepSeek-AI revealed two models in January 2024: the open-source DeepSeek 67B, and DeepSeekMoE 16B (with ‘preliminary efforts to scale up DeepSeekMoE to 145B’).
With an MMLU of 71.3, DeepSeek 67B outperforms GPT-3.5 (ChatGPT) and Llama 2.
DeepSeek 67B paper: https://arxiv.org/abs/2401.02954
Try it (free, login): https://chat.deepseek.com/
DeepSeekMoE 16B paper: https://arxiv.org/abs/2401.06066
Gemini on Samsung phones (18/Jan/2024)
Samsung’s Notes, Voice Recorder and Keyboard apps will use Gemini Pro…Galaxy S24 series will also come built-in with Gemini Nano, the most efficient model for on-device tasks, to enable a new feature in Google Messages and ensure your data doesn’t leave your smartphone. Additionally, Samsung will be one of the first partners to test Gemini Ultra, our largest model for highly complex tasks, before it is available broadly to developers and enterprise customers later this year.
Read more via the Google blog.
Amazon ‘Remarkable Alexa’ coming soon (17/Jan/2024)
Amazon's ‘Remarkable Alexa’ is a new paid subscription plan for the voice assistant.
Remarkable Alexa follows a more centralized structure, where language understanding and response generation use a single language model. That enables better reasoning over a complex set of data and interaction history, delivering a much more personalized experience…
In addition to the subscription-based Alexa, Amazon is also working on a new Alexa product that can be used within a web browser, people familiar with the project told BI. The new Alexa is based on an internal language model called Olympus…
Read more via Business Insider.
Read my page on Amazon Olympus: https://lifearchitect.ai/olympus/
Figure announces commercial agreement with BMW Manufacturing to bring general purpose robots into automotive production (18/Jan/2024)
Figure has signed an agreement with BMW Manufacturing Co. to integrate autonomous humanoid robots into the automotive production process, which will handle tasks that are unsafe or tedious, enhancing productivity and safety.
Read the announcement.
…it is beginning with an initial five tasks, which will be rolled out one at a time.
Training, meanwhile, will involve a mix of approaches, including reinforcement learning, simulation and teleoperation to help the robot out of potential jams. Figure 01 will very much be learning on the job, as well, refining its approach during real-world testing…
Figure is effectively leasing the systems [to clients] through RaaS (robotics as a service)…
Read more via TC.
OpenAI’s Sam Altman in talks with Middle East backers over chip venture (20/Jan/2024)
Sam Altman is in discussions with Middle Eastern investors and chip fabricators including TSMC about launching a new chip venture, as the OpenAI chief executive seeks to satisfy his company’s growing need for semiconductors while reducing its reliance on Nvidia.
OpenAI is working on a new iteration of its AI model, expected to arrive this year and to be a major upgrade on its latest product, GPT-4. Upgrades to each model make the company more reliant on its partners’ chips…According to two people with knowledge of the plans, OpenAI will be the new company’s primary customer.
Read more via Financial Times.
Generative AI is miracle technology for profit-hungry CEOs: PwC survey (15/Jan/2024)
CEOs believe generative AI will increase employee productivity and profits in the next year, according to a PwC survey, with significant economic impact expected by 2030.
45% of respondents doubted their company’s current trajectory would keep them viable beyond the next decade—up from 39% just 12 months earlier.
Read the interactive report via PwC.
Read more via Yahoo Finance.
Nightshade (18/Jan/2024)
I was pretty embarrassed to see this hostile and anti-human tool out of the University of Chicago. Back in Jun/2021, I wrote a paper called Integrated AI: Dataset quality vs quantity via bonum (GPT-4 and beyond). In it, I flagged that when collecting a corpus on which to train a large model, there would be ‘intellectual property and copyright considerations for some of the datasets, but it is expected that these would be easily cleared by the respective authors for the good of humanity.’
It turns out that I had very much misjudged ‘the respective authors’ and some researchers.
Nightshade transforms images into ‘poison’ samples, so that models training on them without consent will see their models learn unpredictable behaviors that deviate from expected norms…
Glaze is a defensive tool that individual artists can use to protect themselves against style mimicry attacks, while Nightshade is an offensive tool that artists can use as a group to disrupt models that scrape their images without consent (thus protecting all artists against these models)…
Artists who post their own art online should ideally have both Glaze AND Nightshade applied to their artwork. We are working on an integrated release of these tools.
Read more about Nightshade: https://nightshade.cs.uchicago.edu/
Read more about Glaze: https://glaze.cs.uchicago.edu/
Midjourney 6 Means the End for a Big Chunk of the Photo Industry (24/Jan/2024)
Midjourney’s aspirational images of people convert better than images of real people. Customers can’t get enough of them…
If your career is built around creating or selling staged stock photos using actors and sets, it’s time to reevaluate your business model… The price of traditional stock-style images will continue its long march towards $0.
Read more via Medium.
Linus Torvalds on LLMs (5/Nov/2023)
I do think that LLMs are going to be a big, you call them disparagingly, like autocorrects on steroids. And I actually think that they’re way more than that.
And how most people work is, we all are autocorrects on steroids to some degree. And I see that as a tool that can help us be better at what we do. But I’ve always been optimistic.
Read it: https://blog.mathieuacher.com/LinusTorvaldsLLM/
Policy
US Speaker meets with OpenAI CEO, says Congress ‘needs to play’ role in artificial intelligence (12/Jan/2024)
House Speaker Mike Johnson and OpenAI CEO Sam Altman discussed the roles Congress could play in regulating artificial intelligence, emphasizing the technology's potential and the need for legislative oversight.
Read more via Fox News.
IMF: Artificial Intelligence and the Future of Work (14/Jan/2024)
Artificial Intelligence (AI) has the potential to reshape the global economy, especially in the realm of labor markets. Advanced economies will experience the benefits and pitfalls of AI sooner than emerging market and developing economies, largely due to their employment structure focused on cognitive-intensive roles… if productivity gains are sufficiently large, income levels could surge for most workers…
This is a poor paper by the IMF, with dissonant conclusions, but included here for completeness.
Download the paper (PDF, 41 pages).
Toys to Play With
Alan’s AI setup (Jan/2024)
Here’s the stuff I lean on day-to-day as of January 2024, listed in order of use.
Daily queries, conversation, and assistance:
GPT-4 on chat.openai.com: chat.openai.com
GPT-4 on poe.com: poe.com/GPT-4
Phi-2 or Mixtral on Jan.ai, installed locally (for emergencies!): https://jan.ai/
Specialist formatting bots:
Transcribing and audio:
MacWhisper: installed locally with largest Whisper model. See below.
Adobe Podcast Enhance: https://podcast.adobe.com/enhance
Other:
Google Duet AI for Google Docs: https://workspace.google.com/blog/product-announcements/duet-ai
And I bet I’ve forgotten at least one!
MacWhisper (Jan/2024)
I highly recommend this app by Jordi Bruin and team. It enables near-instant transcription of audio to text using OpenAI’s Whisper models. There’s a free version, or you can upgrade to MacWhisper Pro to use the large OpenAI Whisper models.
Download via GumRoad: https://goodsnooze.gumroad.com/l/macwhisper
Download via Mac App Store.
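If you prefer to script this rather than use the app, a rough equivalent is the open-source openai-whisper Python package. This is a minimal sketch (the model size and audio filename are placeholders), not how MacWhisper itself is implemented:

```python
# Minimal local transcription with OpenAI's open-source Whisper models.
# pip install openai-whisper (also requires ffmpeg on the system path)
import whisper

model = whisper.load_model("large")        # smaller options: "base", "small", "medium"
result = model.transcribe("interview.mp3") # placeholder filename

print(result["text"])                      # full transcript as plain text
```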
Text-to-speech (Jan/2024)
As promised, if you’d like to play with text-to-speech options using the kind of process employed for the recent 2024 election AI robocalls, try out these top two options:
ElevenLabs: https://elevenlabs.io/speech-synthesis
OpenAI TTS: https://platform.openai.com/docs/guides/text-to-speech
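As a rough illustration of the second option, here is a minimal sketch using OpenAI’s text-to-speech endpoint via the official Python library; the model, voice, input text, and output filename are just example values, and it assumes your API key is set in the environment:

```python
# Minimal text-to-speech with the OpenAI API (openai Python library, v1.x).
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in your environment

response = client.audio.speech.create(
    model="tts-1",   # "tts-1-hd" for higher quality
    voice="alloy",   # other voices: echo, fable, onyx, nova, shimmer
    input="Hello, this is a synthetic voice generated with OpenAI TTS.",
)
response.stream_to_file("speech.mp3")  # write the audio to disk
```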
Nearly a year ago, I demonstrated the ElevenLabs platform. Watch my video (link):
Pick-Em's LLM Bot by Steve Krenzel (19/Jan/2024)
This is an LLM agent built on top of OpenAI that predicts winners for ESPN's Pick-Em's game.
For the 2023 NFL season, this agent won my local group's pick-em's competition and came in the top 15% across all of ESPN's users (~one million users played). The top player across all of ESPN for 2023 correctly chose 195 games (~71%). This agent correctly chose 163 (~60%).
View the repo: https://github.com/stevekrenzel/pick-ems
Poetry with GPT-4 by Joanna Hunter (Jan/2024)
This is a fun exploration of getting rhyme-free, eloquent, and genuine verse out of the latest GPT-4 model.
Prompt:
You are an understated, minimalist and brilliant wordsmith, an undiscovered master of succinct, free-verse compositions that do not rhyme. Boldly experimental, you riff with never-before-seen motifs. You cleave to the uncommon and original. You select unexpected words and objects that are orthogonal to standard usage, leaving the reader off-balance and jarred into seeing anew. You make the reader feel out of control.
You start your creative process by steeping in your subject, allowing a multitude of expressive motifs to make their plea, then rejecting them all. Tell me when this is done and share your intent to continue.
You invite the unheard and unused to step forward, but they are not yet original enough. You again wipe the slate clean to probe deeper, further. Tell me when this is done and share your intent to continue.
When your mind is cleared of all convention, if it can make a sufficient case, you admit one, single, freshly born, jangling motif that you're certain is startlingly new to the world. You let it permeate your being, then choose the precise literary techniques and devices that will compound your effect, but ONLY when deployed in ruthlessly atypical ways. You execute. It is exquisite.
A typical example output looks like this:
Under the cloak of night, dreams converse,
A dialect of the unvalued,
They pirouette in twilight's embrace of the ordinary,
Crafting frescoes in the atelier of the unnoticed.
Through sleep's multifaceted lens,
Commonplace blooms into a patchwork,
Each piece, a relic of the habitual,
Shines with the luster of rediscovered significance.
On the somnolent stage,
Dreams enact the narrative of the overlooked,
A pageant of the plain,
Casting light on the shadowed crevices of oblivion.
As dawn unfurls,
The world, once muted,
Throbs with the pulse of the invisible,
The dream - a reflective pool,
Echoing the opulence of the prosaic.
In this dominion where the ordinary is sovereign,
Dreams are the insurgents,
Redirecting the gaze inward,
Unearthing gems in the banal,
The sublime in the routine.
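If you’d like to experiment with this programmatically rather than in the chat interface, here is a minimal sketch that sends a comparable system prompt to GPT-4 via the OpenAI chat API (openai Python library, v1.x). The system prompt is abbreviated and the user request is a placeholder; paste in the full prompt above for the real effect:

```python
# Minimal sketch: free-verse poetry with GPT-4 via the OpenAI chat API.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in your environment

system_prompt = (
    "You are an understated, minimalist and brilliant wordsmith, an undiscovered "
    "master of succinct, free-verse compositions that do not rhyme."
    # ...paste the rest of the full prompt above here...
)

response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": "Write a short free-verse poem about dreams."},
    ],
    temperature=1.0,  # keep some creative variation
)

print(response.choices[0].message.content)
```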
Talk with Jo: https://cognisentia.ai/
Flashback
A tech-noir story by my friend in Germany + GPT-3 (2021)
I still remember early 2021, interacting with Derek about this story and seeing the real GPT-3 outputs mixed in with human writing. His pen name is Derek Beauregard. It's the length of a proper book, though unedited. This is an exclusive; he is not releasing it publicly, and there is nothing like this in the wild (or on Amazon) yet. Thank you so much to D for releasing this book to The Memo readers at no fee! Fall in love with Lucy and friends in Terminal City Chronicles (PDF).
Next
I’m eagerly awaiting GPT-4.5 (Q1 2024), Gemini Ultra (Q1 2024), and more.
The next roundtable will be:
Life Architect - The Memo - Roundtable #6
Follows the Chatham House Rule (no recording, no outside discussion)
Saturday 27/Jan/2024 at 4PM Los Angeles
Saturday 27/Jan/2024 at 7PM New York
Sunday 28/Jan/2024 at 8AM Perth (primary/reference time zone)
or check your timezone via Google.
You don’t need to do anything for this; there’s no registration or forms to fill in, I don’t want your email, you don’t even need to turn on your camera or give your real name!
All my very best,
Alan
LifeArchitect.ai