AI Model Training
StableVicuna
StableVicuna is the first large-scale open-source chatbot trained with reinforcement learning from human feedback (RLHF) from StabilityAI, the company behind Stable Diffusion. StableVicuna is a further instruction fine-tuning and RLHF training version of Vicuna v0 13b, which is an instruction-fine-tuned LLaMA 13 billion model.
Relevant Navigation
No comments...