File tree Expand file tree Collapse file tree 1 file changed +3
-3
lines changed Expand file tree Collapse file tree 1 file changed +3
-3
lines changed Original file line number Diff line number Diff line change @@ -57,15 +57,15 @@ pip install dist/fastdeploy-2.0.0a0-py3-none-any.whl
57
57
58
58
在安装后,执行如下命令快速部署Qwen2模型, 更多参数的配置与含义参考[ 参数说明] ( docs/serving.md ) .
59
59
60
- ```
60
+ ``` shell
61
61
# 下载与解压Qwen模型
62
62
wget https://fastdeploy.bj.bcebos.com/llm/models/Qwen2-7B-Instruct.tar.gz && tar xvf Qwen2-7B-Instruct.tar.gz
63
63
# 指定单卡部署
64
64
python -m fastdeploy.entrypoints.openai.api_server --model ./Qwen2-7B-Instruct --port 8188 --tensor-parallel-size 1
65
65
```
66
66
67
67
使用如下命令请求模型服务
68
- ```
68
+ ``` shell
69
69
curl -X POST " http://0.0.0.0:8188/v1/chat/completions" \
70
70
-H " Content-Type: application/json" \
71
71
-d ' {
@@ -75,7 +75,7 @@ curl -X POST "http://0.0.0.0:8188/v1/chat/completions" \
75
75
}'
76
76
```
77
77
响应结果如下所示
78
- ```
78
+ ``` json
79
79
{
80
80
"id" : " chatcmpl-db662f47-7c8c-4945-9a7a-db563b2ddd8d" ,
81
81
"object" : " chat.completion" ,
You can’t perform that action at this time.
0 commit comments