The Memo - 24/Nov/2022
Stable Diffusion 2.0, Meta Galactica 120B, Microsoft/NVIDIA H100 supercomputer, and much more!
FOR IMMEDIATE RELEASE: 24/Nov/2022
Welcome back to The Memo.
This edition features a bunch of exclusive content, including Chinese AI-generated songs, one of which has over 100M streams. We also hear some reggae music via Jukebox, and play with GPT-3 in Roblox (based on Google SayCan) and a “human or AI” GPT-3 game!
I’ve been experimenting with livestreams to allow more interaction and Q&A during videos. You’re welcome to join the next one. You can click the ‘notify’ button to be pinged when a new livestream begins.
The BIG Stuff
Stable Diffusion 2.0 released (24/Nov/2022)
The new Stable Diffusion 2.0 base model ("SD 2.0") was released two hours ago. It was trained from scratch using the OpenCLIP-ViT/H text encoder, generates images at 512×512 resolution, and improves on previous releases (better FID and CLIP scores).
It also features upscaling to 2048×2048 and beyond!
Read the release notes: https://github.com/Stability-AI/StableDiffusion
There is a demo at Hugging Face, or wait for the update to hit mage.space and the official DreamStudio.
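If you’d like to try SD 2.0 locally once the weights land on Hugging Face, below is a minimal sketch using the diffusers library. The repo id stabilityai/stable-diffusion-2 is an assumption based on the release naming; check the release notes above for the exact name.

```python
# Minimal sketch: text-to-image with SD 2.0 via Hugging Face diffusers.
# Assumes the weights are published as "stabilityai/stable-diffusion-2".
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2",  # assumed repo id; see release notes
    torch_dtype=torch.float16,
).to("cuda")

image = pipe("a photograph of an astronaut riding a horse").images[0]
image.save("astronaut.png")  # 512x512 output from the base model
```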
The Interesting Stuff
Meta Galactica 120B (16/Nov/2022)
Meta AI has released Galactica, a 120B-parameter model specializing in scientific data. Meta has hit on some very interesting innovations here; training on prompts and maintaining full reference data are both fascinating.
- “Chinchilla scaling laws”… did not take into account fresh versus repeated tokens. In this work, we show that we can improve upstream and downstream performance by training on repeated tokens.
- Our corpus consists of 106 billion tokens from papers, reference material, encyclopedias and other scientific sources.
- We train the models for 450 billion tokens.
- For inference Galactica 120B requires a single A100 node.
See my report card: https://lifearchitect.ai/report-card/
Read the paper: https://galactica.org/static/paper.pdf
Play with the demo: https://galactica.org/
Note: The slick demo site was swiftly pulled within 72 hours, seemingly for political/sensitivity reasons. Read the update by MIT Technology Review: https://www.technologyreview.com/2022/11/18/1063487/meta-large-language-model-ai-only-survived-three-days-gpt-3-science/
It was reinstated by a Hugging Face user, without the nice interface.
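If you’d rather skip the demo entirely, the open checkpoints can be queried directly. Below is a minimal sketch using Hugging Face transformers, assuming the weights remain available under the facebook/galactica-* namespace; it uses a small sibling checkpoint, since (per the paper) the full 120B model needs an A100 node.

```python
# Minimal sketch: prompting a small Galactica checkpoint locally.
# Assumes the weights are hosted as "facebook/galactica-1.3b" on HF.
from transformers import AutoTokenizer, OPTForCausalLM

checkpoint = "facebook/galactica-1.3b"  # smaller sibling of galactica-120b
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = OPTForCausalLM.from_pretrained(checkpoint)

# [START_REF] is one of Galactica's special tokens: it asks the model to
# predict a citation, drawing on the reference data kept during training.
prompt = "The Transformer architecture[START_REF]"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(inputs.input_ids, max_new_tokens=60)
print(tokenizer.decode(outputs[0]))
```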
Watch my 1-hour livestream of the model release a few hours before the model demo was suspended:
VectorFusion by UC Berkeley (21/Nov/2022)
Text-to-image for vectors (SVG exports).
Prompt: the Sydney Opera House. minimal flat 2d vector icon. lineal color. on a white background. trending on artstation
MagicVideo by ByteDance (22/Nov/2022)
Efficient text-to-video by Chinese company ByteDance.
Read the paper: https://arxiv.org/abs/2211.11018
View the gallery: https://magicvideo.github.io/
SceneComposer by Johns Hopkins & Adobe (22/Nov/2022)
Text-to-image generation with semantic layout control, by researchers at Johns Hopkins and Adobe.
Read the paper: https://arxiv.org/abs/2211.11742
View the gallery: https://zengyu.me/scenec/
Andromeda: Cerebras’ supercomputer (14/Nov/2022)
Andromeda delivers 13.5 million AI cores and near-perfect linear scaling across the largest language models. It is not really comparable to a standard GPU-based supercomputer. Andromeda is deployed in Santa Clara, California.
Read a related article by The Verge.
NVIDIA & Microsoft building a supercomputer based on the H100 (16/Nov/2022)
Back in the Jul/2022 edition of The Memo, we talked about NVIDIA’s newest H100 Hopper GPUs, the fastest AI-specific GPUs, designed for AI training, and even partly designed by AI!
…[H100] Hopper chips, which are up to 6x faster than the A100 chips used to train current AI models in 2021 and 2022:
“We demonstrate that not only can AI learn to design these circuits from scratch, but AI-designed circuits are also smaller and faster than those designed by [humans and even] state-of-the-art electronic design automation (EDA) tools. The latest NVIDIA Hopper GPU architecture has nearly 13,000 instances of AI-designed circuits.” — NVIDIA (8/Jul/2022)
Now, NVIDIA and Microsoft are putting 10,000+ H100s into a new supercomputer.
NVIDIA today announced a multi-year collaboration with Microsoft to build one of the most powerful AI supercomputers in the world, powered by Microsoft Azure’s advanced supercomputing infrastructure combined with NVIDIA GPUs, networking and full stack of AI software to help enterprises train, deploy and scale AI, including large, state-of-the-art models…
Future [Azure instances] will be integrated with NVIDIA Quantum-2 400Gb/s InfiniBand networking and NVIDIA H100 GPUs. Combined with Azure’s advanced compute cloud infrastructure, networking and storage, these AI-optimized offerings will provide scalable peak performance for AI training and deep learning inference workloads of any size.
Read the NVIDIA press release.
Read a related article by Ars.
The Beatles’ Revolver album de-mixed and re-mixed with AI (Nov/2022)
I first addressed this amazing AI milestone one year ago, in my Dec/2021 report The Sky is on Fire. Peter Jackson had hired a crack team of AI scientists to de-mix the single microphone recordings of The Beatles.
In the last few months, his technology has been used to de-mix those single-microphone recordings into separate tracks, allowing the Revolver album to be properly remixed. Producer Giles Martin, son of producer George Martin, says:
“There’s no one who’s getting audio even close as to what Peter Jackson’s guys can do. The funny thing, they won’t let anyone else use it — they may do eventually. But Peter’s such a big Beatles fan, he’s willing to help out. I quite like that in a way, that the Beatles are still using technologies that no one else is using. It’s really groundbreaking. The simplest way I can explain it: It’s like you giving me a cake, and then me going back to you about an hour later with flour, eggs, sugar, and all the ingredients to that cake, that all haven’t got any cake mix left on them.”
“Taxman,” for example, was famously recorded with the drums, bass, and rhythm guitar all on one track. The new tech allows separate tracks for Ringo’s kick drum, toms, hi-hats, etc. Nothing is being altered, obviously — but now we can hear more of what the lads played in the room that day. You can hear details buried way down in the mix, like the acoustic guitar in “For No One,” or the finger snaps in “Here, There, and Everywhere.” — Rolling Stone (Sep/2022) and Vulture (Nov/2022).
Listen to the new multi-track remix in the trailer video:
State of AI report by AI investors Nathan Benaich and Ian Hogarth (Oct/2022)
Key themes in the 2022 Report include:
New independent research labs are rapidly open sourcing the closed source output of major labs. Despite the dogma that AI research would be increasingly centralised among a few large players, the lowered cost of and access to compute has led to state-of-the-art research coming out of much smaller, previously unknown labs. Meanwhile, AI hardware remains strongly consolidated to NVIDIA.
Safety is gaining awareness among major AI research entities, with an estimated 300 safety researchers working at large AI labs, compared to under 100 in last year's report, and the increased recognition of major AI safety academics is a promising sign when it comes to AI safety becoming a mainstream discipline.
The China-US AI research gap has continued to widen, with Chinese institutions producing 4.5 times as many papers as American institutions since 2010, and significantly more than the US, India, UK, and Germany combined. Moreover, China is significantly leading in areas with implications for security and geopolitics, such as surveillance, autonomy, scene understanding, and object detection.
AI-driven scientific research continues to lead to breakthroughs, but major methodological errors like data leakage need to be interrogated further. Even though AI breakthroughs in science continue, researchers warn that methodological errors in AI can leak into these disciplines, leading to a growing reproducibility crisis in AI-based science, driven in part by data leakage.
Take a look: https://www.stateof.ai/ (114 slides)
Notion and BundleIQ writing tools integrate new language models (Nov/2022)
Notion: https://www.notion.so/ai
BundleIQ: https://bundleiq.medium.com/bundleiq-an-ai-powered-writing-assistant-6f823138e37e
Midjourney v4 used for commercial products (7/Nov/2022)
The latest Midjourney v4 (as reported in the previous edition of The Memo) was launched this month, and users are already applying it to business and real products…
Notice just how Midjourney v4 enables very simple prompting; the short prompt is evidently expanded behind the scenes during priming.
Prompts:
commercial shot of raspberry, teal background, splashes, juicy --v 4
commercial shot of passionfruit, yellow background, splashes, juicy --v 4
commercial shot of hops, green background, splashes, juicy --v 4
Read more in the Reddit thread.
AI at the FIFA World Cup (Nov/2022)
DeepMind’s animation compares the real movements of players during a football match (attackers, dark blue; defenders, dark red) with predictions from a model that forecasts the paths of off-camera players. The grey shaded area is the television camera’s field of view (FOV), which follows the ball (black line). For players outside the FOV, the model predicts the position of attackers (green) and defenders (orange; actual off-camera positions are coloured light blue and pink, respectively).
Through November and December I’ll be checking in on my ‘once every four years’ viewing of the FIFA World Cup. There is certainly some interesting tech being implemented here, related to AI and ML. Outside of the game (and not currently related to FIFA), DeepMind has been working with Liverpool FC.
Karl Tuyls, a computer scientist at DeepMind, says the off-camera modelling work is the first step towards creating a virtual, AI-driven assistant coach that uses real-time data to guide decision-making in football and other sports. “You can imagine the AI looking at the first-half performance and suggesting a change in formation that might do better,” he says.
Read more: Nature (15/Nov/2022).
Inside the FIFA event, ML tech is helping with air-conditioning, crowd movement tracking, security, and in-game tracking of players and the ball.
The new technology uses 12 dedicated tracking cameras mounted underneath the roof of the stadium to track the ball and up to 29 data points of each individual player, 50 times per second, calculating their exact position on the pitch. The 29 collected data points include all limbs and extremities that are relevant for making offside calls…
…adidas’ official match ball for Qatar 2022™, will provide a further vital element for the detection of tight offside incidents as an inertial measurement unit (IMU) sensor will be placed inside the ball. This sensor, positioned in the centre of the ball, sends ball data to the video operation room 500 times per second…
Read more: FIFA (Nov/2022).
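To make those data rates concrete, here is a hypothetical back-of-the-envelope sketch (my own illustration, not FIFA’s actual schema): 22 players × 29 keypoints at 50 Hz works out to roughly 32,000 tracked positions per second, before the 500 Hz ball IMU is even counted.

```python
# Hypothetical sketch (not FIFA's schema): the stated data rates of the
# semi-automated offside system, modelled as simple Python structures.
from dataclasses import dataclass

@dataclass
class PlayerFrame:
    player_id: int
    timestamp_ms: int
    keypoints: list[tuple[float, float, float]]  # 29 (x, y, z) limb positions

PLAYERS = 22       # players on the pitch
KEYPOINTS = 29     # tracked data points per player
CAMERA_HZ = 50     # optical tracking rate (12 roof-mounted cameras)
BALL_IMU_HZ = 500  # inertial sensor rate inside the match ball

points_per_second = PLAYERS * KEYPOINTS * CAMERA_HZ
print(f"Optical tracking: {points_per_second:,} keypoints/second")  # 31,900
print(f"Ball IMU: {BALL_IMU_HZ} samples/second")
```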
Bonus: The in-stadium AR (on a phone/device) is phenomenal!
Toys to Play With
Exclusive: Chinese songs generated by AI, one with 100M+ streams (Nov/2022)
…by the end of September, TME says it had created and released over 1,000 songs with human-style vocals manufactured by the [AI] Lingyin Engine.
One of those tracks has set the standard for popularity… a version of one song, which appears to be called Today (English translation), “has become the first song by an AI singer to be streamed over 100 million times across the internet”.
…TME’s Executive Chairman, explained to analysts earlier today (November 15) that TME used the Lingyin Engine to “pay tribute” to Anita Mui by “creating an AI code based on her [voice]” for a new track – May You Be Treated Kindly By This World [English translation].
Anita Mui passed away in Dec/2003; this song was generated by AI around Sep/2022:
GPT-3 in VR in Rotterdam (28/Nov/2022)
“Our GPT3 powered VR venue https://quantumbar.ai will be serving engaging AI conversations in SocialVR on its World Premiere at VRDays’ ChurchOfVR in Rotterdam next week - November 28th until December 2nd, 2022.”
Read more: https://quantumbar.ai/
Watch the demo video:
Connect GPT-3 with Google Sheets (21/Nov/2022)
Read my steps: https://lifearchitect.ai/sheets/
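My steps use a Sheets-side integration; as an illustrative alternative (not the method in the linked steps), here is a minimal Python sketch that fills a sheet with GPT-3 completions via the gspread and openai libraries. The sheet name, credentials file, and API key are placeholders.

```python
# Illustrative sketch: read prompts from column A of a Google Sheet and
# write GPT-3 completions to column B. Placeholder names throughout.
import gspread
import openai

openai.api_key = "sk-..."  # your OpenAI API key

gc = gspread.service_account(filename="service_account.json")
sheet = gc.open("GPT-3 playground").sheet1  # hypothetical sheet name

for row, prompt in enumerate(sheet.col_values(1), start=1):
    response = openai.Completion.create(
        model="text-davinci-002",  # a current GPT-3 model as of Nov/2022
        prompt=prompt,
        max_tokens=100,
    )
    sheet.update_cell(row, 2, response.choices[0].text.strip())
```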
Watch my livestream (1 hour):
Reggae music generated by AI (Nov/2022)
The Memo reader Efosa has been using OpenAI’s Jukebox—released around the same time as GPT-3, but largely ignored by the media—to generate new Reggae music. The results are astonishing. The track and album names were generated by GPT-2 (Replika) and GPT-3 (Emerson), with the music itself generated entirely by Jukebox, a Transformer-based music model. Take a listen to this track, ‘Epic’.
The ‘full length, 13 track reggae album completely improvised by AI on its own’ will be mastered and available soon.
GPT-3 in Roblox based on Google SayCan (23/Nov/2022)
Designed and developed by James Weaver from IBM.
Play it yourself: https://www.roblox.com/games/11462889413/GPT-3-SayCan-NPC
Watch the video:
Text written by human or GPT-3? (Oct/2020)
It’s an older game, but I haven’t mentioned it before. Real or fake text?
How good are you at knowing when text has been written by a computer? How many sentences can a computer write before it no longer has you fooled? Find out for yourself!
Play the game: http://www.roft.io/
Next
End of year report
My end of year report will be available soon, and the first draft will be sent to paid readers of The Memo.
“AI Panic”
I’ve been addressing the very real AI panic that shows up almost by default in members of the public. You may enjoy this older article by Diane Proudfoot, published in IEEE Spectrum five years before GPT-3: https://lifearchitect.ai/ai-panic/
All my very best,
Alan
LifeArchitect.ai
Housekeeping…
Unsubscribe:
For subscriptions started before 17/Jul/2022, please use the older interface, or just reply to this email and we’ll stop your payments and take you off the list!
For subscriptions started from 17/Jul/2022, please use Substack as usual.
Note that the subscription fee for new subscribers will increase from 1/Jan/2023. If you’re a current subscriber, you’ll always be on your old/original rate while you’re subbed.
Gift a subscription to a friend or colleague for the holiday season: