Hi everyone, I work at Together. Today we released OpenChatKit: an open-source base for building chatbots for a range of applications. This is more than a model release; it is the beginning of an open-source project, and we are releasing a set of tools and processes for ongoing improvement through community contributions.
You can try it now on Hugging Face! https://huggingface.co/spaces/togethercomputer/OpenChatKit
OpenChatKit includes 4 key components:
1. An instruction-tuned large language model, fine-tuned for chat from EleutherAI’s GPT-NeoX-20B on over 43 million instructions, trained on 100% carbon-negative compute, and available under the Apache-2.0 license on Hugging Face (a minimal loading sketch follows this list).
2. A set of customization recipes for fine-tuning the model to achieve high accuracy on your tasks, documented and available as open source under the Apache-2.0 license on GitHub, along with code to recreate our model results.
3. An extensible retrieval system that lets you augment bot responses with information from a document repository, API, or other live-updating source at inference time, with open-source examples using Wikipedia or a web search API (a standalone sketch of this pattern appears after the list).
4. A moderation model, fine-tuned from GPT-JT-6B and designed to filter which questions the bot responds to, also available under the Apache-2.0 license on Hugging Face.
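To make items 1 and 4 concrete, here is a minimal sketch of a moderation-then-chat flow using Hugging Face `transformers`. The checkpoint ids (`togethercomputer/GPT-NeoXT-Chat-Base-20B`, `togethercomputer/GPT-JT-Moderation-6B`), the `<human>:` / `<bot>:` prompt format, and the moderation prompt and labels are assumptions; check the model cards for the exact ids and formats.

```python
# Sketch: run the moderation model as a pre-filter, then the chat model.
# Model ids, prompt formats, and moderation labels are assumptions; see the
# Hugging Face model cards for the real ones.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

CHAT_ID = "togethercomputer/GPT-NeoXT-Chat-Base-20B"  # assumed checkpoint id
MOD_ID = "togethercomputer/GPT-JT-Moderation-6B"      # assumed checkpoint id

chat_tok = AutoTokenizer.from_pretrained(CHAT_ID)
chat_model = AutoModelForCausalLM.from_pretrained(
    CHAT_ID, torch_dtype=torch.float16, device_map="auto"
)
mod_tok = AutoTokenizer.from_pretrained(MOD_ID)
mod_model = AutoModelForCausalLM.from_pretrained(
    MOD_ID, torch_dtype=torch.float16, device_map="auto"
)

def generate(model, tok, prompt, max_new_tokens=64):
    inputs = tok(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=max_new_tokens,
                         do_sample=True, temperature=0.7)
    # Decode only the newly generated tokens, not the prompt.
    return tok.decode(out[0][inputs["input_ids"].shape[1]:],
                      skip_special_tokens=True)

def answer(question):
    # Hypothetical moderation prompt; the real label set is in the model card.
    verdict = generate(mod_model, mod_tok,
                       f"Input: {question}\nLabel:", max_new_tokens=8)
    if "casual" not in verdict.lower():
        return "Sorry, I can't help with that."
    return generate(chat_model, chat_tok, f"<human>: {question}\n<bot>:")

print(answer("What is a good way to learn Python?"))
```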
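And here is a sketch of the retrieval idea from item 3, independent of OpenChatKit's own retrieval system: fetch a short extract from Wikipedia's public summary endpoint and prepend it to the prompt so the bot can ground its answer in live information. The endpoint is real, but the prompt framing is illustrative rather than the kit's actual retrieval API.

```python
# Sketch of inference-time retrieval augmentation using Wikipedia's public
# REST summary endpoint; OpenChatKit ships its own retrieval system, so
# treat this only as an illustration of the pattern.
import requests

def wikipedia_extract(title: str) -> str:
    # Returns the JSON summary (including an "extract" field) for a page.
    url = f"https://en.wikipedia.org/api/rest_v1/page/summary/{title}"
    resp = requests.get(url, timeout=10)
    resp.raise_for_status()
    return resp.json().get("extract", "")

def retrieval_augmented_prompt(question: str, title: str) -> str:
    context = wikipedia_extract(title)
    # Prepend the retrieved text so the model can answer from it.
    return f"Context: {context}\n\n<human>: {question}\n<bot>:"

print(retrieval_augmented_prompt(
    "Who created Python?", "Python_(programming_language)"))
```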
We collaborated with the tremendous communities at @laion_ai and Ontocord to create the training dataset used for these models, which is also released as open source. Read the full details in LAION's blog post!
Can't wait to hear your feedback!
Thanks, -Together