经常项目上会出现不能连外网的情况,要怎么使用ragflow镜像部署呢,这里提供详细的步骤。
1、下载基础镜像
根据docker-compose-base.yml及docker-compose.yml中的image配置信息,下载所需要的基础服务镜像。
# 创建资源目录
mkdir -p /data/ragflow-offline/{docker-images,pip-packages}
cd /data/ragflow-offline
# 拉取并保存所有所需镜像
docker pull elasticsearch:8.11.3
#infinity和elasticsearch可以二选一下载
docker pull infiniflow/infinity:v0.6.0-dev3
#8+版本的mysql才支持向量数据
docker pull mysql:8.0.39
docker pull quay.io/minio/minio:RELEASE.2023-12-20T01-00-02Z
docker pull valkey/valkey:8
当使用sudo docker save时,虽然docker命令以root权限执行,但输出重定向操作(>)仍受当前用户目录权限限制。需要确保操作路径有足够权限,所以要先通过以下命令给足够的权限。
sudo chmod 777 /data/ragflow-offline/docker-images # 临时放宽权限
再执行下面的保存命令
sudo docker save elasticsearch:8.11.3 > docker-images/elasticsearch-8.11.3.tar
sudo docker save infiniflow/infinity:v0.6.0-dev3 > docker-images/infinity-v0.6.0-dev3.tar
sudo docker save mysql:8.0.39 > docker-images/mysql-8.0.39.tar
sudo docker save quay.io/minio/minio:RELEASE.2023-12-20T01-00-02Z > docker-images/minio-2023-12-20.tar
sudo docker save valkey/valkey:8 > docker-images/valkey-8.tar
二、下载Ragflow
1、下载RagFlow,github上可能很慢,找一个gitee上的地址
cd /data
sudo git clone https://gitee.com/da_xu/ragflow.git
cd ragflow
2、配置python虚拟环境,如果已有请跳过
#安装venv
sudo apt install python3.12-venv
# 创建虚拟环境
sudo python3 -m venv .venv
#激活虚拟环境
source .venv/bin/activate
#安装所需要的依赖,可能会遇到很多版本问题,比如我是python3.12,要提示修改一些依赖版本
pip install -r requirements.txt
需要修改的依赖版本如下:
Pillow=10.3.0修改为11.1.0
torch=2.3.0修改为2.3.1
torch直接安装大概有800多M,看网络情况可能会超时,所以我直接选择先下载,下载地址:
https://download.pytorch.org/whl,找到你合适的版本
或者从阿里巴巴的镜像下载:
https://mirrors.aliyun.com/pypi/simple
然后使用以下命令安装
#下载
sudo wget https://download.pytorch.org/whl/cpu/torch-2.3.1%2Bcpu-cp312-cp312-linu
x_x86_64.whl#sha256=2141a6cb7021adf2f92a0fd372cfeac524ba460bd39ce3a641d30a561e41f69a
#安装torch
pip install torch-2.3.1+cpu-cp312-cp312-linux_x86_64.whl
以上可能会出现没有权限的问题,如下报错:
Installing collected packages: mpmath, typing-extensions, sympy, networkx, MarkupSafe, fsspec, filelock, jinja2, torch ERROR: Could not install packages due to an OSError: [Errno 13] Permission denied: '/data/ragflowd/.venv/lib/python3.12/site-packages/mpmath' Check the permissions.
解决方式如下:
# 进入虚拟环境上级目录
cd /data/ragflow
# 递归修改虚拟环境目录所有权(替换 为实际用户名)
sudo chown -R : .venv
# 赋予读写执行权限
sudo chmod -R 755 .venv
#或者如下操作
# 确保 /data/ragflow 目录允许当前用户操作
sudo chown -R $USER:$USER /data/ragflow
sudo chmod -R 755 /data/ragflow
#然后重新安装torch
pip install torch-2.3.1+cpu-cp312-cp312-linux_x86_64.whl
如果再遇到安装超时的问题,就采用以上方式先下载whl文件,然后单独安装,然后再安装其它的依赖。
比如又碰到160M的xgboost无法安装,如图操作:
其它比较大的whl下载:
sudo wget https://mirrors.aliyun.com/pypi/packages/67/ca/f42388aed0fddd64ade7493dbba36e1f534d4e6fdbdd355c6a90030ae028/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl#sha256=694cf3879a206553cc9d7dbda76b13efaf610fdb70a50cba303de1b0d1530ac6
sudo wget https://mirrors.aliyun.com/pypi/packages/92/0f/a6eeabc9d4f3dffa52d629ba2aedce3c3953a49340a65ef49be85eb26cc1/Aspose.Slides-25.2.0-py3-none-manylinux1_x86_64.whl#sha256=d60bacea1fd43d09ae1351e7a1a1c0790ff80ca7955ceb95823e47f6b39f6b91
三、打包.venv中已下载的包到pip-packages文件夹中
# 在本地开发环境执行
cd /data/ragflow
sudo tar czvf /data/ragflow-offline/pip-packages/venv.tar.gz .venv/
四、离线环境部署步骤
1. 传输离线资源到目标服务器
将以下目录复制到目标服务器/data/ragflow_inner:
sudo mkdir /data/ragflow_inner
cd /data/ragflow_inner
sudo cp -r /data/ragflow-offline/docker-images/ .
sudo cp -r /data/ragflow-offline/pip-packages/ .
2. 加载Docker镜像
cd /data/ragflow_inner
for img in docker-images/*.tar; do
docker load -i $img
done
3. 恢复Python虚拟环境
# 解压已有虚拟环境
sudo tar xzvf pip-packages/venv.tar.gz -C /data/ragflow_inner/
4、优化配置
新的docker-compose.yml
services:
elasticsearch:
image: elasticsearch:8.11.3
pull_policy: never
environment:
- discovery.type=single-node
- ES_JAVA_OPTS=-Xms4g -Xmx4g
volumes:
- /data/ragflow_inner/data/elasticsearch:/usr/share/elasticsearch/data
ports:
- "9200:9200"
healthcheck:
test: ["CMD-SHELL", "curl --silent --fail localhost:9200/_cluster/health || exit 1"]
infinity:
image: infiniflow/infinity:v0.6.0-dev3
pull_policy: never
ports:
- "8088:8088"
depends_on:
elasticsearch:
condition: service_healthy
mysql:
image: mysql:8.0.39
pull_policy: never
environment:
MYSQL_ROOT_PASSWORD: ragflow@2024
MYSQL_DATABASE: ragflow
volumes:
- /data/ragflow_inner/data/mysql:/var/lib/mysql
ports:
- "3306:3306"
healthcheck:
test: ["CMD", "mysqladmin", "ping", "-h", "localhost"]
minio:
image: quay.io/minio/minio:RELEASE.2023-12-20T01-00-02Z
pull_policy: never
command: server /data --console-address ":9001"
volumes:
- /data/ragflow_inner/data/minio:/data
ports:
- "9000:9000"
- "9001:9001"
environment:
MINIO_ROOT_USER: ragflowadmin
MINIO_ROOT_PASSWORD: ragflow@minio
valkey:
image: valkey/valkey:8
pull_policy: never
ports:
- "6379:6379"
volumes:
- /data/ragflow_inner/data/valkey:/data
server:
image: infiniflow/ragflow:0.17.2
pull_policy: never
environment:
DB_HOST: mysql
DB_PORT: 3306
DB_NAME: ragflow
DB_USER: root
DB_PASSWORD: ragflow@2024
MINIO_ENDPOINT: minio:9000
MINIO_ACCESS_KEY: ragflowadmin
MINIO_SECRET_KEY: ragflow@minio
HF_ENDPOINT: "" # 显式声明空变量避免警告
MACOS: "" # 同上
volumes:
- /data/ragflow_inner/logs:/app/logs
- /data/ragflow_inner/models:/app/models
- /data/ragflow_inner/.venv:/opt/venv # 挂载已有虚拟环境
ports:
- "8000:8000"
depends_on:
mysql:
condition: service_healthy
minio:
condition: service_started
valkey:
condition: service_started
5. 启动服务
# 创建数据目录
sudo mkdir -p /data/ragflow_inner/data/{elasticsearch,mysql,minio,valkey}
# 设置文件权限
sudo chmod -R 777 /data/ragflow_inner/data
# 启动所有服务
cd /data/ragflow_inner/docker
sudo docker compose -f docker-compose.yml up -d
#验证服务,输出如下图
sudo docker compose ps
sudo docker exec -it ragflow-server bash
source .venv/bin/activate