qidao123.com技术社区-IT企服评测·应用市场

标题: 华为云kubernetes摆设deepseek r1、ollama和open-webui（已踩过坑） [打印本页]

作者: 西河刘卡车医 时间: 2025-2-12 18:42
标题: 华为云kubernetes摆设deepseek r1、ollama和open-webui（已踩过坑）
1 概述

ollama是一个管理大模型的一个中心层，通过它你可以下载并管理deepseek R1、llama3等大模型。
open-webui是一个web界面（界面计划受到chatgpt启发），可以集成ollama API、 OpenAI的 API。
用常见的web应用架构来类比，open-webui是前端，ollama是后端，大模型是数据库。

文本介绍华为云kubernetes摆设open-webui最新版、ollama最新版、DeepSeek-R1-Distill-Qwen-1.5B（因为小模型可以只使用CPU，节省本文测试的成本）。

2 云资源情况准备

2.1 购买文件存储SFS Turbo

2.2 购买kubernetes集群

2.3 在k8s中创建storageclass对象

参数everest.io/share-access-to是VPC的ID。
参数everest.io/share-export-location是sfs turbo实例的共享路径:自界说子目次，sfs turbo实例的共享路径是在sfs实例的详细页查询，自界说子目次可以是任意路径。
参数everest.io/volume-id是sfs turbo实例的ID。
只需要修改以上三个参数。
在本文，storageclass的名称叫做sfsturbo-subpath-sc。

apiVersion: storage.k8s.io/v1
allowVolumeExpansion: true
kind: StorageClass
metadata:
name: sfsturbo-subpath-sc
mountOptions:
- lock
parameters:
csi.storage.k8s.io/csi-driver-name: sfsturbo.csi.everest.io
csi.storage.k8s.io/fstype: nfs
everest.io/archive-on-delete: "true"
everest.io/share-access-to: xxxxxxxxxxxxxxxxxx # VPC ID
everest.io/share-expand-type: bandwidth
everest.io/share-export-location: xxxxx.sfsturbo.internal:/mydir # sfs turbo实例的共享路径:自定义子目录
everest.io/share-source: sfs-turbo
everest.io/share-volume-type: STANDARD
everest.io/volume-as: subpath
everest.io/volume-id: xxxxxxxxxxxxx # sfs turbo实例的ID
provisioner: everest-csi-provisioner
reclaimPolicy: Delete
volumeBindingMode: Immediate

复制代码

2.4 购买用于暴露容器的负载均衡器ELB

3 摆设

3.1 创建namespace

ollama和open webui都摆设在此namespace。

kubectl create ns ollama

复制代码

3.1 摆设ollama

statefulset使用刚刚创建的存储类sfsturbo-subpath-sc。
确保PVC的磁盘容量能存储下全部待下载的大模型。

复制代码

3.1 摆设open webui（重点）

deployment挂载一个固定的PVC，PVC使用刚刚创建的存储类sfsturbo-subpath-sc。
OLLAMA_BASE_URL情况变量是ollama的地址。
无法毗连huggingface.co：
由于在国内情况是无法毗连huggingface.co，最终导致open webui的界面是一片空缺（应用日志报错：MaxRetryError("HTTPSConnectionPool(host=‘huggingface.co’, port=443)），因此需要增长情况变量HF_ENDPOINT=https://hf-mirror.com。
无法毗连openai：
由于不使用openai，因此将情况变量OPENAI_API_BASE_URL和OPENAI_API_KEY都设置成None，否则open webui在国内情况是无法毗连openai，最终导致open webui的界面是一片空缺（应用日志报错：Connection error: Cannot connect to host api.openai.com:443）。

复制代码

接着为open webui容器添加ingress路由以在公网暴露：

4 下载模型

进入ollama容器：

kubectl exec -it ollama-0 -n ollama bash

复制代码

在容器内执行ollama pull下令下载大模型DeepSeek-R1-Distill-Qwen-1.5B。

nohup ollama pull deepseek-r1:1.5b &
tail -f nohup.out

复制代码

有哪些deepseek模型可以下载，请去https://ollama.com/library/deepseek-r1地址里搜刮。

5 与大模型对话

在浏览器地址输入负载均衡器ELB的公网IP，打开网页后需要先设置open webui的管理员账号密码，登录乐成后即可选择刚刚下载的deepseek模型来谈天。

6 小结

文本介绍使用华为云kubernetes摆设open-webui最新版、ollama最新版、DeepSeek-R1-Distill-Qwen-1.5B。在现实过程中，花费时间最多的是open-webui，因为它默认去访问在国内无法访问的两个外国地址：huggingface.co和api.openai.com，而访问这些地址最终又导致界面变成空缺。

免责声明：如果侵犯了您的权益，请联系站长，我们会及时删除侵权内容，谢谢合作！更多信息从访问主页：qidao123.com:ToB企服之家，中国第一个企服评测及商务社交产业平台。

欢迎光临 qidao123.com技术社区-IT企服评测·应用市场 (https://dis.qidao123.com/)

Powered by Discuz! X3.4