The Memo - 31/Oct/2023

Amazon using Digit humanoids already, RedPajama-Data-v2 with 30T tokens, 28 new AI models, and much more!

Oct 30, 2023

FOR IMMEDIATE RELEASE: 31/Oct/2023

OpenAI CEO (22/Oct/2023):
We define AGI as ‘the thing we don’t have quite yet.’ There were a lot of people who would have—ten years ago [2013 compared to 2023]—said alright, if you can make something like GPT-4, GPT-5 maybe, that would have been an AGI… I think we’re getting close enough to whatever that AGI threshold is going to be.

Welcome back to The Memo.

You’re joining full subscribers from Lockheed Martin, L'Oréal, the Linux Foundation, Lincoln University, Locus Robotics, Lycos (wow, that’s a blast from the past!), Landmark Education, and more…

This edition’s Policy section is extraordinarily long to address developments around the world in Oct-Nov/2023. In the Toys to play with section, we look at a new audio model with vocals, a famous book simulated by GPT-4, a new way of writing programs you can talk to, using seeds in DALL-E 3, and much more…

The next roundtable for full subscribers will be:

Life Architect - The Memo - Roundtable #4
Follows the Chatham House Rule (no recording, no outside discussion)
Saturday 4/Nov/2023 at 5PM Los Angeles
Saturday 4/Nov/2023 at 8PM New York
Sunday 5/Nov/2023 at 8AM Perth (primary/reference time zone)
or check your timezone via Google.

Details at the end of this edition.

Every now and again, an amazing output from GPT-4V (GPT-4 Vision) makes me laugh. This is one of those times (Reddit, 22/Oct/2023). This is a well-known publicity shot of the OpenAI team in 2023: Mira (CTO), Sam (CEO), Greg (President), Ilya (Chief Scientist).

Following a run-in with a misinformed ‘leader’ here in Australia, I was guided to issue another press release a few days ago on 25/Oct/2023.

Artificial general intelligence is here. Leaders not endorsing this revolution are guilty of negligence.
Having AI augment and amplify humans is an exciting event, but Dr Thompson warns that most world leaders were understating or ignoring the present level of AI capabilities. “Burying heads in the sand is irresponsible. Discussions about copyright and IP are a distraction. Comparisons with robots in Hollywood movies are a misuse of imagination. Commentary downplaying AI’s role in replacing outdated job roles is dangerous. Nearly all leaders are demonstrating negligence in their lack of preparation, understanding, and application of a growth mindset during humanity’s most rapid and important evolution.”

Read this media release from 25/Oct/2023: https://lifearchitect.ai/leaders-guilty-of-negligence/

Read ‘AI is outperforming humans in both IQ and creativity in 2021’ (19/Sep/2021): https://lifearchitect.ai/outperforming-humans/

Read ‘AI fire alarm’ (20/Jul/2021): https://lifearchitect.ai/fire-alarm/

We are entering the final few weeks of 2023. I will be running public livestreams every Tuesday night (US time) until the end of the year.

Notify/watch: https://www.youtube.com/@DrAlanDThompson

The BIG Stuff

RedPajama-Data-v2: an open dataset with 30 trillion tokens for training LLMs (30/Oct/2023)

Together AI has released a new version of the RedPajama dataset, with 30T filtered and deduplicated tokens from 84 Common Crawl (web crawl, the Google version is known as the Colossal Clean Crawled Corpus or C4) dumps covering 5 languages. This dataset is believed to be the largest public dataset ever released for large language model (LLM) training.

At an estimated 125TB of data for 30T tokens, RedPajama-Data-v2 is:

2.3× larger than the dataset used to train GPT-4 across 13T tokens (estimated).
4.7× larger than the next biggest public dataset (CulturaX by UOregon).
6× larger than TII’s RefinedWeb used for training Falcon 180B just a few months ago in Jun/2023.

The Interesting Stuff

New models (Oct/2023)

There were 18 models announced in September 2023 (highlights only):

SUTD/Independent - TinyLlama (1.1B), TII - Falcon 180B (180B), BAAI - FLM-101B (101B), Adept - Persimmon-8B (8B), Apple - UniLM (0.034B), Microsoft - phi-1.5 (1.3B), Singapore - NExT-GPT (7B), IBM - MoLM (8B), Deci - DeciLM (5.7B), ThirdAI - BOLT2.5B (2.5B), Baichuan - Baichuan 2 (13B), Microsoft - Kosmos-2.5 (1.3B), Mistral AI - Mistral 7B (7.3B), Hessian AI/LAION - LeoLM (13B), Meta AI - Llama 2 Long (70B), Alibaba - Qwen (14B), Wayve - GAIA-1 (9B), Waymo - MotionLM (0.09B).

I counted 10 models announced in October 2023 (highlights only):

Google DeepMind - RT-X (55B), Reka AI - Yasa-1, KAUST/Shenzhen - AceGPT (13B), XLANG Lab - Lemur (70B), NVIDIA - Retro 48B (48B), Google DeepMind - PaLI-3 (5B), Hugging Face H4 - Zephyr (7.3B), Baidu - ERNIE 4.0 (1T+), Adept - Fuyu (8B), Jina AI - jina-embeddings-v2 (0.435B).

See the Models Table with playground/paper links: https://lifearchitect.ai/models-table/

More OpenAI rumors from Jimmy Apples (Oct/2023)

‘Jimmy Apples’ is a pseudonym probably used by someone inside or with intimate knowledge of OpenAI. While the Twitter account was paused for a while, it is back with a vengeance…

OpenAI received the first of its 25,000 NVIDIA H100 GPUs in Oct/2023.
OpenAI CEO investing in a non-invasive brain-computer interface (BCI) in Oct/2023 (‘Hey @sama, would be rather interesting of you to be working on a BCI device, non-invasive, via a stealth startup.‘).
OpenAI potentially losing employees in Oct/2023 (‘There’s been a vibe change at openai and we risk losing some key ride or die openai employees.’).
Conversation about AGI and UBI (‘there are [GPT-4] agents who can learn and update knowledge, possess vision, and work on long-term goals (e.g., agents that can work on tasks for several months).’).

Exclusive: Agents that can work on tasks for several months (Oct/2023)

It’s exclusive in that I’m talking about it here while no-one else seems to be (they’re off having arguments about AI regulation, copyright, and god-knows-what-else), but ‘Jimmy’ noted it first above.

I think I have a pretty good imagination, but it’s a challenge to imagine an agent with the speed and depth of GPT-n working on any problem (even a super wicked problem) for months. Off the top of my head, these may be some good examples, but do they really take months of processing?

Economy: As we integrate AI + robots, and humans no longer need to work to produce goods or services (the ‘post-scarcity economy’, wiki), redesigning the entire world economy down to allocation of $ for each person.
Energy: redesigning our energy harnessing and storage using new methods, from sustainable extraction, transformation, storage, distribution, and utilization.
Environment: Planning out and deploying a full resolution to our current environmental challenges including climate change.
Happiness (or your choice of similar word): Ensuring human flourishing post-AI, in a world with no employment/work needed, and in a world where everyone is living inside ‘full dive virtual reality’, FDVR (wiki), in line with Prof Martin Seligman’s PERMA model: Positive Emotion, Engagement, Relationships, Meaning, and Accomplishments.

GPT-4 integrated tools (Oct/2023)

Tooling for the GPT-4 model within the chat.openai.com platform is being slowly rolled out. Paid users can now send ‘anything’ to GPT-4, and it will work out what to do with it.

Policy

UN: Stressing artificial intelligence could power extraordinary progress for humanity, UN Secretary-General says new high-level advisory body ‘is the starting point’ (26/Oct/2023)

Alan’s alternative title: UN pulls their finger out on AI three years too late

The Secretary-General of the United Nations, António Guterres, has announced the launch of a High-Level Multistakeholder Advisory Body on Artificial Intelligence. He acknowledged the transformative potential of AI in various fields such as crisis prediction, public health, and education services. He emphasized the need for responsible and inclusive use of AI technologies, particularly in developing countries.

One user noted ‘I'm excited to hear what 80 year old out of touch boomers that barely know what a computer is and still use fax machines have to say about AI.’

Guterres picked some 40 experts in technology, law and personal data protection—coming from academia, government and the private sector—to sit on the panel.
They include Amandeep Singh Gill, Guterres's special envoy for technology; James Manyika, vice president in charge of AI at Google and Alphabet; Mira Murati, technical director of ChatGPT developer OpenAI; and Omar al-Olama, minister of AI in the United Arab Emirates.

Read the UN media release: https://press.un.org/en/2023/sgsm22007.doc.htm

Biden AI executive order directs agencies to develop safety guidelines (30/Oct/2023)

President Joe Biden signed an executive order providing rules around generative AI, ahead of any legislation coming from lawmakers.

The order has eight goals: to create new standards for AI safety and security, protect privacy, advance equity and civil rights, stand up for consumers, patients, and students, support workers, promote innovation and competition, advance US leadership in AI technologies, and ensure the responsible and effective government use of the technology.

Read the order at WhiteHouse.gov.

Toys to Play With

Riffusion.com with vocals (Oct/2023)

Thanks to the team at Riffusion for sending through this sample:

https://www.riffusion.com/riffs/a1d1c92c-450f-4684-9b49-429981355e26

Now add a walrus: Prompt engineering in DALL-E 3 (26/Oct/2023)

Simon Willison provides a detailed account of his experiments with DALL-E 3, OpenAI’s image generation model, and how it generates different images based on various prompts via ChatGPT. He plays around with requests like 'a super posh pelican with a monocle watching the Monaco F1', 'add a walrus', and further manipulates the generated images by using image 'seeds'.

Just like movie frames, the use of seeds here makes me consider just how rapidly we will have instant generation of new and unique feature films tailored to our whims… It’s gonna be huge!

Flashback

I am completely fascinated by the rapid progress of text-to-image models. From the GAN stuff we were exploring just 2-3 years ago to today, it has been an exponential leap.

Here are two images I generated today in less than 10 seconds, using this big prompt in DALL-E 3:

Very crude childlike drawing: Use raw strokes on slightly crumpled white paper, often extending beyond intended areas. Depict naive shapes like a skewed house, a stick-like tree, and a rough circular sun. Stick figures should have mismatched dot eyes and scribbled features. Colors are applied haphazardly, sometimes bleeding out of the lines. The overall scene should have unintentional marks and smudges, capturing the essence of childlike creativity. If there's handwriting, it should echo a toddler's scribbled style. Make the overall look messy and have a very low artistic level. widescreen

Read more and see images from the early 2020s: https://lifearchitect.ai/art/

I have a significant announcement about ASI coming out shortly.
The next roundtable will be:

You don’t need to do anything for this; there’s no registration or forms to fill in, I don’t want your email, you don’t even need to turn on your camera or give your real name!

All my very best,

Alan
LifeArchitect.ai

Search | Archives

Mark Kratzer

Oct 31, 2023

Sorry, Allan, I failed see you raised Biden's executive order. The Administration has already received a one year "stay" suspension of the first amendment, Free Speech. I think Open AI poses a great threat the 2024 end of Democratic and Free Speech Theater in the West, and the Shadow Governments have become too used tech giants being compliant in turning objective reality into a virtual world. So, we are going to see a clamp down of AI developed by engineers and tinkerers.

Modern warfare has already shown us the dangers that electronic miniaturization and reduced cost can threaten Goliaths ... as we have seen with drones. The slow moving elites and militarist have yet to determine how this new technology can be centralized and controlled.

In the categories of interesting developments, I would consider the following. As I have said AI is not life, because it lacks a body (mean it's own body) and perception of time (memories and evolution of relationships and thoughts yield a perception of time).

https://www.youtube.com/watch?v=QQ2QOPWZKVc

Expand full comment

1 reply

There are no 80-year old boomers. The oldest boomers, born in 1945 are 78. Maybe we can get some quoted comments limited to those who aren't arithmetically-challenged.

2 replies

4 more comments...

The Memo by LifeArchitect.ai

Discussion about this post