Yesterday’s note discussed how making LLMs bigger, with more parameters and more training data, may be through. But that isn’t stopping Anthropic, AI21, Cohere, and Character.AI from trying. Even Elon has teased his own LLM called TruthGPT. And now Stability AI has joined the fun.
Stability AI is best known for Stable Diffusion, its open-source AI image generator. (Sidenote: their subreddit is really entertaining and one of the most inventive spaces online.)
Stability AI’s LLM, called StableLM, is completely open source under a permissive license, which means you don’t have to work through an API or purchase a license to build with it. That’s a major contrast with everything else out there and could give it a long-term leg up on GPT-4. It’s also much smaller, making it more efficient to run, and it could theoretically show how smaller models can outperform bigger ones.
Notably, the StableLM models Stability AI released don’t include RLHF (Reinforcement Learning from Human Feedback). This is the technique that largely made GPT-3.5 so much better than GPT-3. Stability AI plans to add it in future releases.
Ultimately, though, an LLM is only as good as its interface. The reason OpenAI is winning and surpassed 100M users with ease is ChatGPT: the application that gave everyone easy access to its GPT-3 (then 3.5 and 4) models.
I don’t think it’ll take two years for a widespread "StableChat" competitor to ChatGPT to appear. I think it’s just months away, considering StableLM is an open-source model and any builder can work with it. I’ll be keeping my eyes on Twitter to see what people create with StableLM in the coming days.
I didn’t have time to test it myself, but follow the Tweet below if you’re interested in trying the model out yourself:
Another option is to follow the steps below, which let you run the LLM from Terminal on macOS:
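If you’d rather script it than work in Terminal directly, here’s a minimal sketch of loading StableLM with the Hugging Face `transformers` library. The `stabilityai/stablelm-tuned-alpha-7b` model ID and the `<|SYSTEM|>`/`<|USER|>`/`<|ASSISTANT|>` prompt tokens are from Stability AI’s release notes for the tuned alpha models; treat the exact generation settings as assumptions you can tweak.

```python
# Sketch: running StableLM locally via Hugging Face transformers.
# Assumes `pip install transformers torch` and enough RAM/VRAM for
# the 7B checkpoint (a multi-GB download on first run).

RUN_MODEL = False  # set to True to actually download and run the model

# StableLM-Tuned expects a system prompt plus special turn tokens;
# this wording is an illustrative placeholder, not the official one.
SYSTEM_PROMPT = "<|SYSTEM|>You are StableLM, a helpful and harmless assistant."

def build_prompt(user_message: str) -> str:
    """Wrap a user message in StableLM-Tuned's special-token chat format."""
    return f"{SYSTEM_PROMPT}<|USER|>{user_message}<|ASSISTANT|>"

if RUN_MODEL:
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "stabilityai/stablelm-tuned-alpha-7b"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.float16, device_map="auto"
    )

    inputs = tokenizer(
        build_prompt("Write a haiku about open-source AI."),
        return_tensors="pt",
    ).to(model.device)
    output = model.generate(
        **inputs, max_new_tokens=64, do_sample=True, temperature=0.7
    )
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```

This is the same pattern any builder would use to wire StableLM into a chat UI: build the prompt, tokenize, generate, decode.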