StableVicuna
free
AI Model Training

StableVicuna

The first large-scale open-source chatbot trained by RLHF

Tags:

StableVicuna is the first large-scale open-source chatbot trained with reinforcement learning from human feedback (RLHF) from StabilityAI, the company behind Stable Diffusion. StableVicuna is a further instruction fine-tuning and RLHF training version of Vicuna v0 13b, which is an instruction-fine-tuned LLaMA 13 billion model.

Relevant Navigation

No comments

No comments...