HPL-AMD Configuration
来源:互联网 发布:卖家怎么查淘宝客的pid 编辑:程序博客网 时间:2024/05/17 06:12
1. Basic
wget http://code.compeng.uni-frankfurt.de/attachments/download/55/hpl-gpu-1.1.0.tar.bz2//HPL
wget http://code.compeng.uni-frankfurt.de/attachments/download/54/caldgemm-1.1.0.tar.bz2 //CALDGEMM
bzip2 -d *.tar.bz2
tar -xvf *.tar
http://code.compeng.uni-frankfurt.de/projects/hpl/wiki
make NO_MEMPOLICY=1 -j incur errors
gotoblas error undefined symbol prefetchsize ' in operation
gmake clean
gmake TARGET=NEHALEM NO_MEMPOLICY=1 -j
http://code.compeng.uni-frankfurt.de/projects/caldgemm/files
tar xzf *.tgz
-lpthread -ldl -L/usr/X11R6/lib -laticalrt -laticalcl -lgfortran ../GotoBLAS2/
/home/shir/download/AMD-APP-SDK-v2.7-RC-lnx64/lib/x86_64
grep HPL_CUSTOM_CALDGEMM_OPTIONS ./ -r
MV2_ENABLE_AFFINITY=0 MV2_RNDV_PROTOCOL=R3 MV2_USE_RDMA_ONE_SIDED=0
http://code.compeng.uni-frankfurt.de/projects/caldgemm/repository/entry/README
-------------------------------------------
-z (default: disabled)
Enable Multithreading. You definitely want to activate this
-c (default: disabled)
Use CPU for DGEMM. You can supply -g as well to use both CPU and GPU. Supplying neither of them will use GPU only.
-j <dbl> (default: -1)
Ratio of GPU performance to total CPU+GPU performance. Set to -1 for autodetection.
-s (default: disabled)
Dynamic CPU GPU scheduling. Do not use only the fixed ratio specified by -j but use a dynamic CPU/GPU workload scheduling. This includes work-stealing, etc. The value provided by -j is the basis for the scheduling.
-p (default: disabled)
Interleaving Memory Policy. Gotoblas usually activates memory interleaving. This leads to a problem with the CAL library. Interleaving should be activated after memory for the CAL library is allocated. Thus it is recommended to disable interleaving in GotoBLAS (apply the patch provided with caldgemm and set NO_MEMINTERLEAVE in GotoBLAS Make.rule) and use -p.
-A (default: disabled)
Do the DMA transfer to GPU asynchronously. If you are not debugging, always enable this.
-O (default: enabled)
Define backend to use. Available options are:
disabled: CAL
enabled: OpenCL
tried: ./dgemm_bench -c -z -p -m 40960 -n 40960
Error 'Unable to load aticaldd' while initializing CAL
No GPU available, falling back to CPU
Initializing Matrix C
Running Benchmark
Starting DGEMM Run m=40960 k=1024 n=40960 Alpha=-1.000000 Beta=1.000000 LDA=0x408 LDB=0xa008 LDC=0xa008 At=0 Bt=0 ColMajor=0 (A=0x7f5a7553f010, B=0x7f5a6152e010, C=0x7f57412ad010, (C-A=2305843007493692416, (C-B)/w=2251799812046527))
Program: caldgemm Sizes - A: 40960x1024 B: 1024x40960 C:40960x40960 (Host: head.cluster)
System Time 44.493 System Gflops 77.300
HPL.make
-DHPL_NO_MPI_THREAD_CHECK
-DHPL_NO_HACKED_LIB
-DHPL_MPI_FUNNELED_THREADING
root / testing / util / UTIL_cal.cpp
#ifndef HPL_CALDGEMM_BACKEND
------------------------------------------------------------------
cpu1*1 16384 1024 1 1 41.31 314.18 7.099e+01
Avg. matri size per node: 2.00 GiB
Q: http://superuser.com/questions/425084/error-while-compiling-cuda-accelerated-linpack-hpl-2-0-fermi
mpirun_rsh -np 2 -hostfile hosts MV2_CPU_BINDING_LEVEL=socket ./run_linpack
~/cmvapich2/bin/mpirun_rsh -np 2 -hostfile hosts MV2_ENABLE_AFFINITY=0 ./run_linpack
MV2_RNDV_PROTOCOL=R3 MV2_USE_RDMA_ONE_SIDED=0
continued ...
- HPL-AMD Configuration
- CBLAS/GotoBLAS2/MKL/CUDA HPL Configuration
- AMD
- AMD
- AMD
- AMD
- AMD
- HPL测试程序安装
- HPL TEST Q&A
- HPL test Compiling History
- 安装HPL、GotoBLAS2、openmpi
- 编译HPL (hpl-2.0_FERMI_v08.tar) 过程简记
- HPL on Intel MPI+MKL
- mpich+blas+hpl 集群配置
- configuration
- configuration
- configuration
- Configuration
- DEDE织梦标签dede:arclist 详解
- apache无限二级域名配置方案
- 操作系统
- C++ 万年历
- 《影响力》读书笔记,第二章 互惠
- HPL-AMD Configuration
- git使用详细介绍
- MSC VLR HLR 简介
- 技术成长路线
- 系统抖动
- Dolphin.Integration.Smash.v5.19.1-Lz0
- Address already in use JVM_Bind8080类似问题解决方法
- 《影响力》读书笔记,第三章 承诺和一致
- Qt4设计自定义LED灯控件(提升法)