Stability AI

Stability AI Introduces StableVicuna: The First Open-Source RLHF LLM Chatbot

April 28, 2023 • 2 min read

Stability AI, known for its groundbreaking contributions to the AI and machine learning landscape, has unveiled their latest project, StableVicuna. As the first open-source chatbot to utilize Reinforcement Learning from Human Feedback (RLHF) in conjunction with Large Language Models (LLMs), StableVicuna aims to revolutionize AI-driven conversational systems.

Stability AI's recent announcement of StableVicuna follows a series of innovative releases, including the widely-acclaimed StableLM open-source language models and DeepFloyd IF, a state-of-the-art text-to-image model. StableVicuna is a further instruction fine tuned and RLHF trained version of Vicuna v0 13b, which is an instruction fine tuned LLaMA 13b model.

By combining publicly available chat RLHF datasets from reputable organizations such as Open Assistant, Anthropic, and Stanford, with the advanced RLHF training capabilities of trlX, Stability AI has positioned StableVicuna as a powerful open-source chatbot capable of tasks like basic math, code writing, and grammar assistance.

The company, which has been focusing on transparent, accessible, and supportive AI technology, believes that the release of StableVicuna showcases the advantages of integrating instruction fine-tuning with RLHF in LLMs. In a statement from Stability AI's press release, the company shared its enthusiasm for the new chatbot: "We are proud to present StableVicuna, the first large-scale open-source chatbot trained via reinforced learning from human feedback (RHLF). We believe this technology has the potential to revolutionize the user experience in AI chatbot systems."

StableVicuna can be accessed on the HuggingFace Hub, where users can download the weight delta and combine it with the original LLaMA model. As an open-source project, StableVicuna is expected to foster innovation and collaboration within the AI community.

Alongside StableVicuna, Stability AI has been working on a range of cutting-edge tools and services, such as their Image Upscaling API, which allows for increasing image size without loss of sharpness or detail. The company's commitment to democratizing AI technology and promoting transparency, accessibility, and user support is evident in its growing roster of offerings.

Preview of Stability AI's upcoming Upcoming Chatbot Interface

In addition to the chatbot, Stability AI is also developing a user-friendly chat interface for StableVicuna, which is currently in its final stages. The company plans to deploy a Discord bot to the Stable Foundation server, inviting users to provide valuable feedback and contribute to the improvement of the chatbot's user experience.

While the creators of StableVicuna consider it an important advancement in AI chatbot technology, the true impact of this large-scale open-source RLHF LLM chatbot will become clearer as organizations and developers explore and build upon this new offering. As Stability AI continues its pursuit of a more inclusive and ethical AI landscape, the future of AI-driven conversation systems is poised to evolve significantly.