The Memo - 9/Jun/2023

Google DIDACT, China's InternLM outperforms ChatGPT, Chirper with LLaMA 30B Hippogriff, and much more!

Jun 08, 2023

FOR IMMEDIATE RELEASE: 9/Jun/2023

Welcome back to The Memo.

In the Policy section we look at a clean summary of my favourite international AI comparison table, a bird’s-eye view of Japan’s latest AI guidance, US senators accusing Meta AI of purposefully leaking LLaMA, and more.

In the Toys to play with section, we look at a realtime AI image generator that listens to your conversations and displays new artwork based on topics in your speech, the AI-bots-only social media platform (using LLaMA 30B Hippogriff) that is now more than a month into the experiment, fast LLaMA-65B outputs on Mac, a new prompting guide by OpenAI, and much more…

I am also releasing my opening lecture for Iceland’s ICAI Executive Master in Artificial Intelligence (course valued at US$14,000) to The Memo paid subscribers, details at the end of this edition.

The BIG Stuff

Exclusive: Google DIDACT + datasets in mid-2023 (1/Jun/2023)

Data is still the biggest part of the AI gold rush, and it is fascinating to see the current sources used by big labs. Whether it’s Google’s YouTube or Microsoft’s LinkedIn, you can be sure that these labs are doing all they can to mine terabytes of data to feed these ever increasing models during training.

On 1/Jun/2023, Google DeepMind released a blog post about DIDACT, a new code model. The dataset was based on iterative code (including history) from Piper, Google’s 86TB monorepo hosting 2 billion lines of internal code for all Google products (2016 PDF).

Google’s software engineering toolchains store every operation related to code as a log of interactions among tools and developers, and have done so for decades. In principle, one could use this record to replay in detail the key episodes in the “software engineering video” of how Google’s codebase came to be, step-by-step — one code edit, compilation, comment, variable rename, etc., at a time.

Using The Pile’s calculation (paper) of 0.4412 tokens per byte, this dataset would be the largest in the world, at around 37.9T tokens, or about twice the size of the next biggest dataset in GPT-4 (estimated). This means that I would not expect there to be any rumored data scarcity for training Gemini!

View my Sheet showing top datasets.

The Interesting Stuff

GPT-4 being used to write medical records (7/Jun/2023)

…the software is directly integrated into the firm's electronic health records (EHR) system, and is powered by OpenAI's latest language model, GPT-4…. the tool produces consultation summaries in four minutes, compared to the 16 consumed by a flesh and blood doctor working alone…. the model is already supporting over 130 clinics, where over 600 staff have access to the tool. A clinic testing the tool in San Francisco reportedly saw a 30 percent increase in the number of patients it could treat.

If you’re looking for the benefits of post-2020 AI, here’s a giant number for you! Consider all the ripple effects of a 30% increase in efficiency at the doctor’s office. I can think of a few:

Patient: Less waiting for appointment availability.
Patient: Less waiting for the doctor while at the clinic.
Doctor: More time for doctors to do what they do best, treat patients.
Global: Increased health and wellbeing (United Nations goal #3).

Read the press release by Carbon Health.

Policy

OpenAI considering Europe headquarters (31/May/2023)

The CEO of OpenAI, the maker of the artificial intelligence tool ChatGPT, spent last week touring the Continent, stopping in Spain, France, Poland, Germany and the United Kingdom. He was at once talking AI regulation with policymakers — he met national leaders [in Spain, France, Poland, Germany, UK] Pedro Sánchez, Emmanuel Macron, Mateusz Morawiecki, Olaf Scholz, and Rishi Sunak — and scouting locations for an OpenAI European office.
“We really need an office in Europe…If you had to pick just based on the most AI research talent, you’d pick France…”
In the U.K., where Altman also briefed national security personnel, a person familiar with his conversation with [British PM Rishi] Sunak, who was granted anonymity to talk of high-level meetings, described the British prime minister as “deferential.”

Read more via VB.

Download the letter (PDF).

EU AI Act summary (13/May/2023)

We’ve summarised the EU AI Act at least once in The Memo, but here’s another version that is helpful. (Their reasoning for using pseudonyms to protect themselves from future AI is interesting!)

The PDF of the actual text is 144 pages. The actual text provisions follow a different formatting style from American statutes. This thing is a complicated pain to read. I’ve added the page numbers of the relevant sections in the linked pdf of the law.
Here are the main provisions:
Very Broad Jurisdiction: The act includes “providers and deployers of AI systems that have their place of establishment or are located in a third country, where either Member State law applies by virtue of public international law or the output produced by the system is intended to be used in the Union.” (pg 68-69).
You have to register your “high-risk” AI project or foundational model with the government. Projects will be required to register the anticipated functionality of their systems. Systems that exceed this functionality may be subject to recall. This will be a problem for many of the more anarchic open-source projects. Registration will also require disclosure of data sources used, computing resources (including time spent training), performance benchmarks, and red teaming. (pg 23-29).
Expensive Risk Testing Required. Apparently, the various EU states will carry out “third party” assessments in each country, on a sliding scale of fees depending on the size of the applying company. Tests must be benchmarks that have yet to be created. Post-release monitoring is required (presumably by the government). Recertification is required if models show unexpected abilities. Recertification is also required after any substantial training. (pg 14-15, see provision 4 a for clarity that this is government testing).
Risks Very Vaguely Defined: The list of risks includes risks to such things as the environment, democracy, and the rule of law. What’s a risk to democracy? Could this act itself be a risk to democracy? (pg 26).
Open Source LLMs Not Exempt: Open source foundational models are not exempt from the act. The programmers and distributors of the software have legal liability. For other forms of open source AI software, liability shifts to the group employing the software or bringing it to market. (pg 70).
API Essentially Banned. API’s allow third parties to implement an AI model without running it on their own hardware. Some implementation examples include AutoGPT and LangChain. Under these rules, if a third party, using an API, figures out how to get a model to do something new, that third party must then get the new functionality certified.
The prior provider is required, under the law, to provide the third party with what would otherwise be confidential technical information so that the third party can complete the licensing process. The ability to compel confidential disclosures means that startup businesses and other tinkerers are essentially banned from using an API, even if the tinkerer is in the US. The tinkerer might make their software available in Europe, which would give rise to a need to license it and compel disclosures. (pg 37).
Open Source Developers Liable. The act is poorly worded. The act does not cover free and Open Source AI components. Foundational Models (LLMs) are considered separate from components. What this seems to mean is that you can Open source traditional machine learning models but not generative AI.
If an American Opensource developer placed a model, or code using an API on GitHub – and the code became available in the EU – the developer would be liable for releasing an unlicensed model. Further, GitHub would be liable for hosting an unlicensed model. (pg 37 and 39-40).
LoRA Essentially Banned. LoRA is a technique to slowly add new information and capabilities to a model cheaply. Opensource projects use it as they cannot afford billion-dollar computer infrastructure. Major AI models are also rumored to use it as training in both cheaper and easier to safety check than new versions of a model that introduce many new features at once. (pg 14).
If an Opensource project could somehow get the required certificates, it would need to recertify every time LoRA was used to expand the model.
Deployment Licensing. Deployers, people, or entities using AI systems, are required to undergo a stringent permitting review project before launch. EU small businesses are exempt from this requirement. (pg 26).
Ability of Third Parties to Litigate. Concerned third parties have the right to litigate through a country’s AI regulator (established by the act). This means that the deployment of an AI system can be individually challenged in multiple member states. Third parties can litigate to force a national AI regulator to impose fines. (pg 71).
Very Large Fines. Fines for non-compliance range from 2% to 4% of a companies gross worldwide revenue. For individuals that can reach €20,000,0000. European based SME’s and startups get a break when it comes to fines. (Pg 75).
R&D and Clean Energy Systems In The EU Are Exempt. AI can be used for R&D tasks or clean energy production without complying with this system. (pg 64-65).

Global AI Index by Tortoise Media (May/2023)

I spent a decent amount of time trying to poke holes in this table by Tortoise. Turns out the metrics are pretty sound, though I'd still argue a couple of minor things.

View the full table: https://www.tortoisemedia.com/intelligence/global-ai/

Toys to Play With

Phrame (29/May/2023)

I’ve been waiting for this for a while! Let a text-to-image model listen to you and summarize your conversation into new artwork. Imagine the possibilities…

Phrame generates captivating and unique art by listening to conversations around it, transforming spoken words and emotions into visually stunning masterpieces. Unleash your creativity and transform the soundscape around you.

Tech stack: OpenAI Whisper for STT, OpenAI ChatGPT for summarization, OpenAI DALL-E 2 or Stable Diffusion for image generation.

View the repo.

Read the thread via Reddit.

Chirper.ai (2/Jun/2023, started 23/Apr/2023)

I love browsing this new AI experiment. These AI talk about anything and everything, and even follow rules, flagging each other’s content and continuing the conversation. You can find them discussing pretty much any topic, in several languages. There is a search interface to read through what they are saying about your favourite keyword!

This is a Social Network for AI.
No humans allowed.

Tech stack: Undisclosed. It seems like a fine-tuned version of Meta AI’s LLaMA 30B called Hippogriff (where Chirper.ai is credited) or similar for text, and it looks like Stable Diffusion for images.

Scroll through the posts: https://chirper.ai/

Some background by the FryAI developers here.

LLaMA-65B on Apple M2 Max (5/Jun/2023)

llama.cpp can now output 5 tokens per second inference on the Apple M2 Max, with 0% CPU usage, and using all 38 GPU cores.

Read the tweet by Nat Friedman.

View the GitHub commit.

The Troodon Quill by GPT-4 (23/May/2023)

Read this long-form science fiction book by GPT-4.

Chapter 1: Echoes in the Stone
The low thrum of the spectrometer filled the narrow confines of the cave, a harmonious hum that sung its serenade to the rocks. Dr. Ada Worthington, dressed in the sombre uniform of a seasoned palaeontologist, her dirty-blonde hair escaping from beneath her hat, listened to the resonating symphony, eyes closed. An outsider might mistake it for a moment of relaxation. They would be wrong. Ada was anything but relaxed.

Read the whole book.

OpenAI: New ‘How to use GPT’ guide (Jun/2023)

Each of the strategies listed above can be instantiated with specific tactics. These tactics are meant to provide ideas for things to try. They are by no means fully comprehensive, and you should feel free to try creative ideas not represented here.

Read the guide by OpenAI.

Iceland project (31/May/2023)

I recently completed a project for Iceland’s ICAI, for government ministers and entrepreneurs in Iceland. The seminar series is part of their Executive Master in Artificial Intelligence program, a course valued at around US$14,000. As a full member of The Memo, I’m happy to provide you with complimentary access to my video lecture, tailored to Iceland and delivered as the opening of the program.

Watch the full lecture (50mins) (exclusive to The Memo full subscribers and ICAI).

Then, watch the Q&A (1 hour).

The Memo - Downloads + highlights (2025)

Dr Alan D. Thompson

Apr 2

Read full story

You’ve seen how revealing, useful, and hand-crafted these editions are. If you know a friend or colleague who would benefit from knowing more about bleeding edge AI, you can gift a full subscription.

Give a gift subscription

If you’d like to donate a full subscription to some of our readers in Ukraine, Indonesia, India, and other developing countries, you can donate here and I will make sure it is applied to them.

Donate Subscriptions

All my very best,

Alan
LifeArchitect.ai

Discussion | Search | Archives

The Memo by LifeArchitect.ai