docker coentos7 tensorflow
来源:互联网 发布:标书制作软件 编辑:程序博客网 时间:2024/05/19 22:06
1 cpu的设备识别检测
yum install pciutils lspci | grep NVIDIA00:03.0 3D controller: NVIDIA Corporation Device 1b38 (rev a1)
2 安装显卡驱动
wget http://developer.download.nvidia.com/compute/cuda/repos/rhel7/x86_64/cuda-repo-rhel7-8.0.61-1.x86_64.rpmrpm -Uvh cuda-repo-rhel7-8.0.61-1.x86_64.rpm依赖安装wget http://vault.centos.org/7.0.1406/updates/x86_64/Packages/kernel-devel-3.10.0-123.4.4.el7.x86_64.rpmwget http://vault.centos.org/7.0.1406/updates/x86_64/Packages/kernel-headers-3.10.0-123.4.4.el7.x86_64.rpmrpm -Uvh kernel-devel-3.10.0-123.4.4.el7.x86_64.rpmrpm -Uvh kernel-headers-3.10.0-123.4.4.el7.x86_64.rpmyum install cuda-8-0
3 查看gpu使用情况
nvidia-smiMon Nov 27 11:01:51 2017 +-----------------------------------------------------------------------------+| NVIDIA-SMI 384.81 Driver Version: 384.81 ||-------------------------------+----------------------+----------------------+| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC || Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. ||===============================+======================+======================|| 0 Tesla P40 Off | 00000000:00:03.0 Off | 0 || N/A 23C P0 45W / 250W | 0MiB / 22912MiB | 0% Default |+-------------------------------+----------------------+----------------------++-----------------------------------------------------------------------------+| Processes: GPU Memory || GPU PID Type Process name Usage ||=============================================================================|| No running processes found |+-----------------------------------------------------------------------------+
4.测试GPU是否安装成功
cd /usr/local/cuda/binsh cuda-install-samples-8.0.sh ~/cuda-test/cd ~/cuda-test/NVIDIA_CUDA-8.0_Samplesmake./bin/x86_64/linux/release/deviceQuery 获取设备状态 ./bin/x86_64/linux/release/bandwidthTest 测试设备带宽
5.配置环境变量
PATH=$PATH:$HOME/bin:/usr/local/cuda/binLD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda/lib64/CUDA_HOME=/usr/local/cudaexport PATHexport LD_LIBRARY_PATHexport CUDA_HOME
6.安装cudnn
下载对应的:http://www.nvidia.cn/Download/index.aspx?lang=cn
7.安装docker
yum update curl -sSL https://get.docker.com/ | sh 报错: Delta RPMs disabled because /usr/bin/applydeltarpm not installed.查找包含的包:yum provides '*/applydeltarpm'安装:yum install deltarpmyum install -y nvidia-dockerdocker tensorflow image安装nvidia-docker run -it -p 8888:8888 tensorflow/tensorflow:latest-gpu
修改默认的存储位置
1.修改配置文件
/usr/lib/systemd/system/docker.service
/usr/lib/systemd/system/nvidia-docker.service
root@10-23-14-125 data]#cat /usr/lib/systemd/system/docker.service[Unit]Description=Docker Application Container EngineDocumentation=https://docs.docker.comAfter=network-online.target firewalld.serviceWants=network-online.target[Service]Type=notify# the default is not to use systemd for cgroups because the delegate issues still# exists and systemd currently does not support the cgroup feature set required# for containers run by docker#ExecStart=/usr/bin/dockerd#graph 指定存储位置,registry-mirror 加速镜像ExecStart=/usr/bin/dockerd --graph /data/docker-data/docker --registry-mirror=https://u1qbyfsc.mirror.aliyuncs.comExecReload=/bin/kill -s HUP $MAINPID# Having non-zero Limit*s causes performance problems due to accounting overhead# in the kernel. We recommend using cgroups to do container-local accounting.LimitNOFILE=infinityLimitNPROC=infinityLimitCORE=infinity# Uncomment TasksMax if your systemd version supports it.# Only systemd 226 and above support this version.#TasksMax=infinityTimeoutStartSec=0# set delegate yes so that systemd does not reset the cgroups of docker containersDelegate=yes# kill only the docker process, not all processes in the cgroupKillMode=process# restart the docker process if it exits prematurelyRestart=on-failureStartLimitBurst=3StartLimitInterval=60s[Install]WantedBy=multi-user.target[root@10-23-14-125 data]# cat /usr/lib/systemd/system/nvidia-docker.service[Unit]Description=NVIDIA Docker pluginDocumentation=https://github.com/NVIDIA/nvidia-docker/wikiAfter=local-fs.target network.targetWants=docker.service[Service]Environment="SOCK_DIR=/data/docker-data/nvidia-docker"Environment="SPEC_FILE=/etc/docker/plugins/nvidia-docker.spec"#User=nvidia-dockerUser=rootPermissionsStartOnly=trueRestart=on-failureRestartSec=1TimeoutStartSec=0TimeoutStopSec=20ExecStart=/usr/bin/nvidia-docker-plugin -s $SOCK_DIRExecStartPost=/bin/sh -c '/bin/mkdir -p $( dirname $SPEC_FILE )'ExecStartPost=/bin/sh -c '/bin/echo unix://$SOCK_DIR/nvidia-docker.sock > $SPEC_FILE'ExecStopPost=/bin/rm -f $SPEC_FILE[Install]WantedBy=multi-user.target
2.数据盘建立软件
ln -s /data/docker /var/lib/docker
3 注意启动需要加 –privileged=true
docker run -it -d --privileged=true xcartensorflow/xcartensorflow:new-gpu /bin/bash-privileged=true 加这个参数,否则nvida设备不会挂载上查看设备是否正确挂载ll /dev/nv*crw-rw-rw- 1 root root 195, 0 12月 18 17:59 /dev/nvidia0crw-rw-rw- 1 root root 195, 255 12月 18 17:59 /dev/nvidiactlcrw-rw-rw- 1 root root 247, 0 12月 18 17:59 /dev/nvidia-uvmcrw-rw-rw- 1 root root 247, 1 12月 18 17:59 /dev/nvidia-uvm-toolscrw------- 1 root root 10, 144 12月 18 17:59 /dev/nvram
参考:
https://hub.docker.com/r/tensorflow/tensorflow/
https://aur.archlinux.org/packages/nvidia-docker/
阅读全文
1 0
- docker coentos7 tensorflow
- Docker TensorFlow
- TensorFlow-docker
- Docker部署Tensorflow
- windows docker安装 tensorflow
- Docker windows安装tensorflow
- Docker-tensorflow跑VGG16
- TensorFlow Docker一览
- TensorFlow + Docker + PyCharm
- vmware + ubuntu + docker+ tensorflow
- 使用Docker运行TensorFlow
- Docker使用TensorFlow Serving
- docker 安装tensorflow
- Docker 安装 TensorFlow GPU 实战
- TensorFlow(1):使用docker镜像搭建TensorFlow环境
- docker︱在nvidia-docker中使用tensorflow-gpu/jupyter
- Windows下通过Docker安装Tensorflow环境
- TensorFlow安装:win7安装(非docker)
- Spring Resttemplate post方法踩坑记录
- Camera Filter 美颜相机的实现
- 17、TensorFLow GPU 的使用
- iOS 给UIImage添加边框(直接加在UIImage上)
- python里使用difflib库的比较文本时丢弃不要的字符
- docker coentos7 tensorflow
- 习题6(6.13)
- [通过scikit-learn掌握机器学习] 02 线性回归
- C 自定义字符串输出
- Spiral Matrix II
- 18、使用 tf.app.flags 接口定义命令行参数
- C语言面向对象编程之一:封装与继承
- [Yii2 Widget]FancytreeWidget树状结构
- 19、TensorFlow 实现最近邻分类器(K=1)