使用FastChat快速部署开源大模型

Jul 23, 2024
llm

FastChat是一个用于训练、服务和评估基于大型语言模型的聊天机器人的开放平台，提供了丰富的UI界面选项。可以使用FastChat来部署HuggingFace的模型。它提供了一套核心功能，包括：

FastChat框架主要分为三个部分：Controller、Server、Worker。这三者的关系如下图所示：

除此之外，FastChat还提供了命令行接口和网页界面，以及OpenAI兼容API接口。

建议首先使用Conda来创建虚拟环境：

conda create -n fastchat python=3.11
conda activate fastchat

安装相关常用的库：

pip install fschat torch transformers accelerate sentencepiece gradio==4.9.1

此时，就可以分别启动前面提到的Fastchat的组件：

python -m fastchat.serve.controller --host 0.0.0.0

python -m fastchat.serve.model_worker --host 0.0.0.0 --model-path /your_model_path

python -m fastchat.serve.openai_api_server --host 0.0.0.0

python -m fastchat.serve.gradio_web_server --host 0.0.0.0 --share

通过上面的步骤，就可以在短时间内部署一个开源大模型相关的服务了。