clpeak

A synthetic benchmarking tool that measures the peak capabilities of OpenCL devices. It reports only the peak metrics achievable with vector operations and does not represent a real-world use case.
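To put a "peak" figure in context, it can help to compare it against a back-of-the-envelope theoretical estimate. The sketch below is not part of clpeak: the function name `theoretical_peak_gflops` is hypothetical, and it assumes that every lane can retire one fused multiply-add (2 FLOPs) per cycle. The value of 64 single-precision lanes per compute unit for the Tesla V100 in the sample output below is an assumption taken from outside this README.

```python
def theoretical_peak_gflops(compute_units, lanes_per_unit, clock_mhz,
                            flops_per_lane_per_cycle=2):
    """Estimate peak throughput in GFLOPS.

    Assumption: each lane retires `flops_per_lane_per_cycle` floating-point
    operations every clock cycle (2 corresponds to one fused multiply-add).
    """
    flops_per_second = (compute_units * lanes_per_unit
                        * flops_per_lane_per_cycle * clock_mhz * 1e6)
    return flops_per_second / 1e9


# 80 compute units at 1530 MHz come from the sample output;
# 64 lanes per compute unit is an assumed hardware detail.
estimate = theoretical_peak_gflops(compute_units=80, lanes_per_unit=64,
                                   clock_mhz=1530)
print(f"{estimate:.1f} GFLOPS")
```

Under these assumptions the estimate is roughly 15667 GFLOPS, which is within about 1% of the single-precision figures in the sample output.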

Building

git submodule update --init --recursive --remote
mkdir build
cd build
cmake ..
cmake --build .

Sample

Platform: NVIDIA CUDA
  Device: Tesla V100-SXM2-16GB
    Driver version  : 390.77 (Linux x64)
    Compute units   : 80
    Clock frequency : 1530 MHz

    Global memory bandwidth (GBPS)
      float   : 767.48
      float2  : 810.81
      float4  : 843.06
      float8  : 726.12
      float16 : 735.98

    Single-precision compute (GFLOPS)
      float   : 15680.96
      float2  : 15674.50
      float4  : 15645.58
      float8  : 15583.27
      float16 : 15466.50

    No half precision support! Skipped

    Double-precision compute (GFLOPS)
      double   : 7859.49
      double2  : 7849.96
      double4  : 7832.96
      double8  : 7799.82
      double16 : 7740.88

    Integer compute (GIOPS)
      int   : 15653.47
      int2  : 15654.40
      int4  : 15655.21
      int8  : 15659.04
      int16 : 15608.65

    Transfer bandwidth (GBPS)
      enqueueWriteBuffer         : 10.64
      enqueueReadBuffer          : 11.92
      enqueueMapBuffer(for read) : 9.97
        memcpy from mapped ptr   : 8.62
      enqueueUnmap(after write)  : 11.04
        memcpy to mapped ptr     : 9.16

    Kernel launch latency : 7.22 us
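
For comparing runs across devices, the text report can be turned into a dictionary. The following is a minimal sketch, assuming the output keeps the layout shown above: section headers that end in a unit in parentheses, followed by `name : value` lines. The function name `parse_clpeak_report` is hypothetical, and the layout is inferred from this sample only, so other clpeak versions may need adjustments.

```python
import re

# "name : 123.45" lines, optionally followed by one unit token such as "us"
VALUE_LINE = re.compile(
    r"^(?P<name>.+?)\s*:\s*(?P<value>-?\d+(?:\.\d+)?)(?:\s+\S+)?$")
# Section headers such as "Global memory bandwidth (GBPS)"
SECTION_LINE = re.compile(r"^(?P<title>.+?)\s*\((?P<unit>[A-Za-z]+)\)$")


def parse_clpeak_report(text):
    """Collect numeric results; non-numeric lines are ignored."""
    results = {}
    section = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            section = None  # a blank line ends the current section
            continue
        header = SECTION_LINE.match(line)
        if header and ":" not in line:
            section = header.group("title")
            results[section] = {}
            continue
        match = VALUE_LINE.match(line)
        if match:
            target = results[section] if section else results
            target[match.group("name")] = float(match.group("value"))
    return results


# A shortened excerpt of the sample output above
sample = """\
Platform: NVIDIA CUDA
  Device: Tesla V100-SXM2-16GB
    Compute units   : 80
    Clock frequency : 1530 MHz

    Global memory bandwidth (GBPS)
      float   : 767.48
      float2  : 810.81

    Kernel launch latency : 7.22 us
"""
report = parse_clpeak_report(sample)
print(report["Global memory bandwidth"]["float2"])
```

Lines without a numeric value, such as the platform and device names or the "No half precision support! Skipped" notice, are dropped by this sketch; keeping them would need extra handling.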