The Memo - 20/Jun/2023

AGI @ 50%, Meta AI MusicGen 3.3B, McKinsey: generative AI adding $4.4T/y to economy, Harvard cracks truthfulness, and much more!

Jun 19, 2023

FOR IMMEDIATE RELEASE: 20/Jun/2023

Welcome back to The Memo.

The Who Moved My Cheese? AI Awards! for June has four winners, each more preposterous than the last…

The 153-year-old science journal Nature is banning AI-generated illustrations. Read the announce. Read the report by Ars.
The Grammy awards are banning AI-generated music, ‘A work that contains no human authorship is not eligible in any categories’. Read more via Reuters.
Nikon is freaking out about AI replacing photography: “Millions of people around the world are generating surreal images just by entering a few keywords on a website, which is directly affecting photographers” Read more via PetaPixel.
Belgian ad agency Impact think that they’re immune to the AI revolution because they’re in construction:

AI can do a lot. But AI can’t finish this building on the Keyserlei in Antwerp. AI can't fix a leak or install a heating system neither. Crafts(wo)men are here to stay, and they deserve to be recognized. Their skills are simply irreplaceable.

Good luck with that. Give it a few months…

The BIG Stuff

Exclusive: Harvard doubles truthfulness in LLMs using new approach (8/Jun/2023)

Once again, I’m not sure why the peanut gallery is focusing on old tech. Or worse, misguided tech like Microsoft’s imitation model Orca (I am not a fan of the unnecessary hype around this small model).

Anyway, Harvard’s latest research introduces a concept called ‘inference-time intervention’ (ITI).

Our findings suggest that LLMs may have an internal representation of the likelihood of something being true, even as they produce falsehoods on the surface… At a high level, we first identify a sparse set of attention heads with high linear probing accuracy for truthfulness. Then, during inference, we shift activations along these truth-correlated directions. We repeat the same intervention autoregressively until the whole answer is generated.

Read the paper: https://arxiv.org/abs/2306.03341

My AGI counter requires truthfulness to get to 50%, and we are achieving that with this new approach: https://lifearchitect.ai/agi/

AGI countdown at 50% (16/Jun/2023)

I stand by seeing AGI achieved in the next few months (not the next few years), sometime between now and 2025ish. That doesn’t mean we’ll all have it in our lounge rooms, but that in the lab certain groups will have real AI on par with all human capabilities; probably Google DeepMind post-Gemini (my link) or OpenAI post-GPT-5 (my link), both with full physical embodiment.

McKinsey: AI to add $4.4T annually (14/Jun/2023)

Our latest research estimates that generative AI could add the equivalent of $2.6 trillion to $4.4 trillion annually across the 63 use cases we analyzed—by comparison, the United Kingdom’s entire GDP in 2021 was $3.1 trillion. [I had to check; Australia’s GDP was $1.5T, so this would be 3x Australia’s GDP just from post-2020 AI]

Read the report via McKinsey.

Download PDF.

My table with other major AI economic analyses: https://lifearchitect.ai/economics/

92% of software developers are using AI now (13/Jun/2023)

Back in Oct/2022 when I presented to 4,000 Microsoft, Google, and IBM developers in Belgium, only around 50% of software developers had used AI coding tools.

(Watch that keynote video with transcript, timecode with question about AI use.)

Just eight months later, that number has changed significantly! GitHub reports:

Almost all developers have used AI coding tools—92% of those we surveyed say they have used them either at work or in their personal time. We expect this number to increase in the months to come.

Read the report via GitHub.

New version of GPT-4 and more updates (13/Jun/2023)

new function calling capability in the Chat Completions API
updated and more steerable versions of gpt-4 and gpt-3.5-turbo [gpt-4-0613 and gpt-3.5-turbo-0613]
new 16k context [12,000 words] version of gpt-3.5-turbo (vs the standard 4k version)
75% cost reduction on our state-of-the-art embeddings model [text-embedding-ada-002, see my viz of the GPT-3 family]
25% cost reduction on input tokens for gpt-3.5-turbo

Read the announce: https://openai.com/blog/function-calling-and-other-api-updates

Here’s an example using function calls with Stable Diffusion, LangChain, & DeepLake.

The Interesting Stuff

Exclusive: 46% of crowd workers using LLMs to write (13/Jun/2023)

I’ve been talking about the ‘AI-zation’ of data for a couple of years, comparing it with pre- and post- war steel.

On 16/Jul/1945, the US detonated the first nuclear bomb in New Mexico. The bomb was referred to as ‘Gadget,’ a copy of the ‘Fat Man’ nuclear bomb dropped over Nagasaki a few weeks later, and part of Oppenheimer’s Trinity project. Both detonations marked the beginning of an increased number of radioactive particles in Earth’s atmosphere, as these particles made their way into steel due to the use of air during the steel production process.

In plain English: beginning in 1945, the steel we produce is now slightly radioactive. Since those first bombs, our air now carries radionuclides like cobalt-60, which are deposited into the steel and give it a weak radioactive signature. Medical laboratories source pre-war steel (primarily from shipwrecks) to get ‘pure’ steel (‘low-background steel’ with no radiation.

Similarly, there may be three data points for pre- and post- large language model data:

14/Feb/2019: OpenAI GPT-2 paper released; 26/May/2019: GPT-2 subreddit simulator using GPT-2 345M launched; 20/Aug/2019: OpenAI GPT-2 774M publicly released; 5/Nov/2019: OpenAI GPT-2 1.5B publicly released.
28/May/2020: OpenAI GPT-3 175B paper released; 18/Nov/2021: API completely public.
21/Mar/2021: EleutherAI GPT-Neo 2.7B (GPT-2/3 clone by EleutherAI) publicly released.

At some point, using any of these milestones (and I’d lean toward the first date of 14/Feb/2019), the data available on the web, in your email, on your social media, and even in new books (my link), started becoming ‘contaminated’ with AI-generated text.

Interestingly, the first article published using text from GPT-2—in The Verge on 14/Feb/2019 (link)—actually used screenshots of GPT-2-generated text, perhaps to minimize contamination…

This is now coming to a head in 2023, as even people tasked with writing pure text—Amazon Turk or Upwork workers hired to write new content—are using ChatGPT and other large language models to do their work for them… about 46% of the time.

In plain English, beginning in 2019, the text and content we generate via AI was trained on a significant percentage of AI-generated text (blog posts written by GPT-3, rather than say, books written by humans). This percentage will get higher and higher.

This is shocking, and a significant change for humanity. Consider the implications, especially in sourcing human-generated data from the web for training new AI models. Labs like OpenAI could choose to remove any documents close to the terms ‘GPT’ or ‘AI’ (although this would be unwise), but it is incredibly difficult (perhaps impossible) to detect AI-generated content.

This means that our next large language models will be increasingly trained on AI-generated text rather than human-generated text. Like Ouroboros, the snake that eats itself.

[Note: I don’t actually have a problem with this, as I promote the concept of ‘integrated AI’. I just think it’s an interesting and significant point in humanity’s timeline!]

Read the paper: https://arxiv.org/abs/2306.07899

ChatGPT in healthcare (15/Jun/2023)

I’m doing a fair bit of AI consulting in the healthcare and medicine space this year, as well as two upcoming keynotes for doctors. I found this practical report particularly interesting.

I’ve taken to using ChatGPT to help empathically explain specific medical scenarios to patients and their loved ones. It’s become an invaluable resource for the frequent situations where my ER ward is too busy or short-staffed for explaining complex medical diagnoses in a way that is accurate but easy to understand.

Read: I’m an ER doctor. Here’s how I’m already using ChatGPT to help treat patients.

OpenAI: 4,500 enterprise + government clients via Microsoft (May-Jun/2023)

OpenAI has its own business development arm, but additionally benefits from its $10B investment partner, Microsoft bringing in billion-dollar (and trillion-dollar) clients.

[Microsoft] Customers including IKEA and Volvo are leveraging this [OpenAI service] feature to discover business insights at scale and improve end-user journeys. (23/May/2023)
[Microsoft] Customers are already benefitting from Azure OpenAI Service today, including DocuSign, Volvo, Ikea, Crayon, and 4,500 others. (23/May/2023)
The Defense Department, the Energy Department and NASA are among the federal government customers of Azure Government… Federal, state and local government customers can access OpenAI’s GPT-4 and GPT-3 models for tasks such as generating answers to research questions, producing computer code and summarizing field reports

Policy

The smartest person in the world now chairing US GPT-4 working group committee (May-Jun/2023)

I’m really enjoying seeing former child prodigy (and Aussie native) Terence Tao continue to embrace post-2020 AI. Terence is measurably the smartest man in the world, and I covered his use of GPT-4 in The Memo edition 20/Apr/2023. More recently, he wrote:

As part of my duties on the [US] Presidential Council of Advisors on Science and Technology (PCAST), I am co-chairing (with Laura Greene) a working group studying the impacts of generative artificial intelligence technology (which includes popular text-based large language models such as ChatGPT or diffusion model image generators such as DALL-E 2 or Midjourney…

Read the source via Terry’s blog.

Whitehouse link on Terry.

Read an analysis by GD.

Read Terry’s latest post via Microsoft (Jun/2023).

Read Terry and GPT-4 writing essays (Jun/2023).

OpenAI lobbies EU (20/Jun/2023)

[Alan: this piece edited in a few hours late to this edition, will be repeated next edition]

OpenAI has lobbied for significant elements of the most comprehensive AI legislation in the world—the E.U.’s AI Act—to be watered down in ways that would reduce the regulatory burden on the company, according to documents about OpenAI’s engagement with E.U. officials obtained by TIME from the European Commission via freedom of information requests.

Read more via Time (exclusive): https://time.com/6288245/openai-eu-lobbying-ai-act/

Download OpenAI EU report (PDF).

Safe and responsible AI in Australia (1/Jun/2023)

If you’re in charge of writing policy, Australia has recently presented a nice review with visualisations. The premise and approach is as disappointing as usual, but I found that the summarization of US/EU/Aus guidelines was useful.

Project page: https://apo.org.au/node/322938

Download PDF.

Top five use cases for AI (May/2023)

Read the report.

OpenAI, DeepMind will open up models to UK government (12/Jun/2023)

Google DeepMind, OpenAI and Anthropic have agreed to open up their AI models to the U.K. government for research and safety purposes, Prime Minister Rishi Sunak announced at London Tech Week on Monday.