Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
4517 commits
Select commit Hold shift + click to select a range
587092c
Merge remote-tracking branch 'refs/remotes/origin/master'
zjin-lcf Dec 30, 2024
d301559
Update Licence
zjin-lcf Jan 6, 2025
45da25a
[attention] evaluate block, warp, and mixed reductions; increase the …
zjin-lcf Jan 8, 2025
d2a7682
[addBiasResidualLayerNorm-sycl] replace custom reduction with sycl::r…
zjin-lcf Jan 8, 2025
6161c50
[attentionMultiHead-hip] replace custom reductions with hipcub reduct…
zjin-lcf Jan 8, 2025
1faa508
[attentionMultiHead-cuda] replace custom reductions with cub reductions
zjin-lcf Jan 8, 2025
f88db18
[attentionMultiHead-sycl] replace custom reductions with sycl reductions
zjin-lcf Jan 8, 2025
b880738
[attentionMultiHead] set a random seed for data initialization
zjin-lcf Jan 8, 2025
6b9d4df
[atomicAggregate-hip] fix verification error with wave-specific kernels
zjin-lcf Jan 8, 2025
41af9bf
[bscan-hip] add a program for a wavefront of 32 to fix verification e…
Jan 8, 2025
3ed8f8e
Merge remote-tracking branch 'refs/remotes/origin/master'
zjin-lcf Jan 8, 2025
1f0f2fb
[addBiasResidualLayerNorm-hip] update the block and warp reductions
zjin-lcf Jan 8, 2025
98a72a4
[addBiasResidualLayerNorm] remove a dimension of 768 in the problem s…
zjin-lcf Jan 8, 2025
52b2de5
[bsw-sycl] fix compile errors; TODO support a wavefront of 64
zjin-lcf Jan 8, 2025
96a74e3
[cc-hip] use the HIP warpSize
zjin-lcf Jan 8, 2025
a2849a4
[cc-sycl] make the program portable
zjin-lcf Jan 8, 2025
eb5ccfd
[fpdc-sycl] comment out the reqd_sub_group_size
zjin-lcf Jan 8, 2025
ea789af
[fpdc-cuda] fix the deprecated warning
zjin-lcf Jan 8, 2025
c317c07
[vote-hip] make the program portable
zjin-lcf Jan 8, 2025
8729e64
[vote-cuda] specify the warp_size in the program instead of on the co…
zjin-lcf Jan 8, 2025
bb77db6
[vote-sycl] make the program portable
zjin-lcf Jan 8, 2025
27760fa
[gemv-hip] address compiler warnings; replace WARP_SIZE with warpSize
zjin-lcf Jan 9, 2025
ab2b9c3
[gemv-sycl] comment out reqd_sub_group_size; TODO support a wavefront…
zjin-lcf Jan 9, 2025
cad30d7
[wedford-sycl] comment out reqd_sub_group_size; fix compile errors
zjin-lcf Jan 9, 2025
8314845
[softmax-sycl] comment out reqd_sub_group_size and remove warp_size c…
zjin-lcf Jan 9, 2025
dfd5051
[simpleSpmv-cuda] remove the unreachable function call
zjin-lcf Jan 9, 2025
30e46c5
[simpleSpmv-sycl] comment out reqd_sub_group_size
zjin-lcf Jan 9, 2025
168b0a9
[warpsort-sycl] comment out reqd_sub_group_size
zjin-lcf Jan 9, 2025
579053c
[softmax-online-sycl] comment out reqd_sub_group_size
zjin-lcf Jan 9, 2025
d19f7d6
[softmax-fused-sycl] remove reqd_sub_group_size
zjin-lcf Jan 9, 2025
01a4344
[gc-hip] a wavefront of 64 is not supported
zjin-lcf Jan 9, 2025
ecf2efc
[softmax-online] add an offline kernel as a baseline
zjin-lcf Jan 10, 2025
7e0a89c
[score-cuda] add a reference function for verification; note the atom…
zjin-lcf Jan 11, 2025
8570ffd
[score-hip] make the program portable and include the reference for v…
zjin-lcf Jan 11, 2025
9b50eb8
[quant] rename the benchmark names
zjin-lcf Jan 13, 2025
585e9b4
[score-sycl] comment out reqd_sub_group_size and add a reference for …
zjin-lcf Jan 13, 2025
24fd823
[quantVLLM] add the examples in CUDA, HIP and SYCL
zjin-lcf Jan 14, 2025
774f0ac
[quantVLLM] add the category and reference for the examples
zjin-lcf Jan 14, 2025
949ef08
[quantAQLM] add the examples in CUDA, HIP and SYCL
zjin-lcf Jan 18, 2025
431a70a
[quantAQLM] add the category and reference for the examples
zjin-lcf Jan 18, 2025
f043826
[scatter] add the examples in CUDA, HIP and SYCL
zjin-lcf Jan 20, 2025
cf581dd
[scatter] add the category and reference for the examples
zjin-lcf Jan 20, 2025
c4420d7
[scatter] update the license
zjin-lcf Jan 23, 2025
fec8cff
[quantBnB-cuda] update the kernel
zjin-lcf Jan 24, 2025
0526d86
[quantBnB-hip] update the kernel and Makefile
zjin-lcf Jan 24, 2025
5b223bc
[quantBnB-sycl] update the kernel and Makefile
zjin-lcf Jan 24, 2025
c1a12f6
[blockAccess] update the kernels slightly
zjin-lcf Jan 24, 2025
508deb2
[kiss] add the CUDA, HIP, and SYCL examples
zjin-lcf Jan 26, 2025
253ced4
[kiss] add the category and reference for the examples
zjin-lcf Jan 26, 2025
2276cec
[blockAccess-sycl] main2.cpp contains a kernel that implements block …
zjin-lcf Jan 27, 2025
d624551
[blockAccess-sycl] add more properties to the group load/store functions
zjin-lcf Feb 1, 2025
3963453
[readme] add a missing comma
zjin-lcf Feb 3, 2025
b34f24d
[attention] replace expf() with __expf()
zjin-lcf Feb 6, 2025
0887671
[maxpool3d-sycl] factor out the kernel function
zjin-lcf Feb 6, 2025
7df6782
[perplexity-cuda] comment out the fast math option in Makefile
zjin-lcf Feb 6, 2025
e9702cd
[perplexity-sycl] factor out the kernel function
zjin-lcf Feb 6, 2025
f445186
[heat2d] remove the return statement in the CUDA/HIP kernels and fact…
zjin-lcf Feb 7, 2025
e459608
[logprob-sycl] fix compile errors and warnings
zjin-lcf Feb 12, 2025
f3ea54b
fix hybridsort segfault
thomasRoglin Feb 13, 2025
3d0189b
replace non standard calls to sycl::pow<>
thomasRoglin Feb 13, 2025
75a5679
Remove fsycl flag when vendor is AdaptiveCpp
thomasRoglin Feb 13, 2025
4f702bb
replace sycl::abs() by sycl::fabs()
thomasRoglin Feb 13, 2025
9a505ca
[laplace3d] slightly update the kernels for consistence
zjin-lcf Feb 15, 2025
8d2e18b
gitignore main
pvelesko Feb 12, 2025
73649a8
rename 2 targets to main
pvelesko Feb 12, 2025
02ae378
rename ans- exec to main
pvelesko Feb 17, 2025
0587267
ans- rename exec to main
pvelesko Feb 17, 2025
ec8e7c8
Relax precision on f16max
pvelesko Feb 17, 2025
ec654d1
Relax precision haccmk
pvelesko Feb 17, 2025
d0e4dcc
head2d use python instead of python2
pvelesko Feb 17, 2025
8e70d0b
Relax precision on hellinger-cuda
pvelesko Feb 17, 2025
96010de
is-cuda default to S class (C class causes OOM on some GPUs)
pvelesko Feb 17, 2025
c60eec0
keogh - relax precision limits
pvelesko Feb 17, 2025
f4de831
relax precision lombscargle
pvelesko Feb 17, 2025
fd937d8
wordcount-hip default to CPP headers
pvelesko Feb 17, 2025
79e7ed6
reduce wordcount problem size to better fit igpu
pvelesko Feb 17, 2025
e2c7272
Update Makefile
phani544 Feb 20, 2025
34f711e
Fix HIP tests: Correct include paths, linking, and compilation issues
dkrottap Feb 20, 2025
6da3853
Merge pull request #95 from CHIP-SPV/chipstar-bench-changes
zjin-lcf Feb 20, 2025
14e4a67
Merge pull request #96 from phani544/patch-1
zjin-lcf Feb 20, 2025
9325646
Merge pull request #97 from SiPearl/tr/improve-AdaptiveCpp-support
zjin-lcf Feb 20, 2025
dabe74b
Merge pull request #98 from SiPearl/tr/replace-abs-by-fabs
zjin-lcf Feb 20, 2025
f74f1e2
Merge pull request #99 from SiPearl/tr/replace-templated-pow
zjin-lcf Feb 20, 2025
ad37216
Merge pull request #100 from dkrottap/fixing_hip_tests
zjin-lcf Feb 20, 2025
1024d10
[degrid] update the math function calls to address the ambiguous call…
zjin-lcf Feb 21, 2025
90165c5
[degrid-sycl] upgrade the sg.shuffle_down() function (https://github.…
zjin-lcf Feb 21, 2025
8347add
Add include paths to hipBlas and hipSparse libraries (https://github.…
zjin-lcf Feb 21, 2025
81cb2b2
[bicgstab-hip] hipsparseSpSV_solve() expects 9 arguments now
zjin-lcf Feb 21, 2025
68d5768
[mixbench-hip] upgrade the kernel launch call (https://github.com/zji…
zjin-lcf Feb 21, 2025
e7f43de
add wait to prevent a segfault on hmm
thomasRoglin Feb 13, 2025
6cef8a9
Merge pull request #101 from SiPearl/tr/fix-segfauls
zjin-lcf Feb 21, 2025
f50e340
[jacobi-sycl][nbnxm-sycl] upgrade sg.shuffle_down/up() functions
thomasRoglin Feb 13, 2025
29418e6
Merge pull request #103 from SiPearl/tr/replace-shuffle-by-shift-group
zjin-lcf Feb 21, 2025
2afd0e4
Merge remote-tracking branch 'refs/remotes/origin/master'
zjin-lcf Feb 21, 2025
caa2dea
[bm3d-sycl, lda-sycl, marchingCubes-sycl, metropolis-sycl] upgrade no…
zjin-lcf Feb 21, 2025
0bf37af
[word2vec] update the link to the repository for training data files …
zjin-lcf Feb 21, 2025
2ce4818
Add an include path to hipSparse library (https://github.com/zjin-lcf…
zjin-lcf Feb 21, 2025
37ac567
[bm3d-sycl] fix SYCL deprecated warnings
zjin-lcf Feb 21, 2025
cab03e5
[blockAccess-sycl] call the member functions of the group class in th…
zjin-lcf Feb 22, 2025
40ea22c
[blockAccess-sycl] reorder the function arguments
zjin-lcf Feb 22, 2025
488c06f
[voxelization] upload the test data (https://github.com/zjin-lcf/HeCB…
zjin-lcf Feb 22, 2025
9b3b107
[voxelization] slightly optimize the voxelization kernel
zjin-lcf Feb 22, 2025
3306542
[clenergy] __[u]mul24 are legacy intrinsic functions that no longer h…
zjin-lcf Feb 24, 2025
c3a76ed
[dxtc1] delete the dxtc1 files and please checkout the dxtc2 (https:/…
zjin-lcf Feb 24, 2025
d91dddc
[opticalFlow] add the input files from CUDA samples (https://github.c…
zjin-lcf Feb 24, 2025
090196f
[tsne] add the text file from Velocity-Bench (https://github.com/zjin…
zjin-lcf Feb 24, 2025
87258c1
[tsne-sycl] add the C++ complex header
zjin-lcf Feb 24, 2025
c3cd835
[tsne-sycl] set the GCC_TOOLCHAIN option properly if needed
zjin-lcf Feb 24, 2025
d53db99
[tsne-sycl] add the category and reference for the examples
zjin-lcf Feb 24, 2025
1b2c01d
[clink-cuda] replace tanh() with tanhf()
zjin-lcf Feb 24, 2025
3859bdf
[channelSum] simplify the CUB block reduce calls
zjin-lcf Feb 25, 2025
51e47eb
[geodesic] change the time unit to `us`
zjin-lcf Feb 25, 2025
1f38128
[scatter] improve readability of the `update` function
zjin-lcf Feb 26, 2025
c1c9af2
[p4, clink] rename sigmoid() as sigmoidf()
zjin-lcf Feb 26, 2025
a16e742
[vanGenuchten] rename abs() as fabs() in the kernels; call delete[] t…
zjin-lcf Feb 27, 2025
068223a
[lif] call the fmin/fmax functions for the min/max comparisons in the…
zjin-lcf Feb 28, 2025
7f62982
[accuracy] factor out the block reduce function
zjin-lcf Mar 2, 2025
173d050
[channelSum] factor out the block reduce function
zjin-lcf Mar 2, 2025
3dcf525
[fdtd3d] reorder the indexing sequence
zjin-lcf Mar 2, 2025
2be139d
[lif] fix the issue that the min/max functions were not called in the…
zjin-lcf Mar 3, 2025
08b126d
[quantVLLM-sycl] add a compiler option in the Makefile (https://githu…
zjin-lcf Mar 4, 2025
7e80639
[softmax-online] add the implementations using shared local memory; c…
zjin-lcf Mar 4, 2025
94c39d3
[softmax-online] add the implementations using OMP target offload
zjin-lcf Mar 5, 2025
d3f1422
use wall clock time instead of CPU time in sycl benchmarks
thomasRoglin Mar 5, 2025
daeb325
Merge pull request #107 from SiPearl/tr/replace-cpu-time-by-wall-time
zjin-lcf Mar 5, 2025
fdf3eca
[attentionMultiHead] verify the kernel results with a reference imple…
zjin-lcf Mar 6, 2025
b4cde37
Merge remote-tracking branch 'refs/remotes/origin/master'
zjin-lcf Mar 7, 2025
e23048a
[aobench] replace clock() by std::chrono::steady_clock::now() (https:…
zjin-lcf Mar 7, 2025
c7381a8
[ccsd-trpdrv] replace clock() by std::chrono::steady_clock::now() (ht…
zjin-lcf Mar 7, 2025
a407a4f
[cm] replace clock() by std::chrono::steady_clock::now() (https://git…
zjin-lcf Mar 7, 2025
b8b9de6
[d3q19-bgk] replace clock() by std::chrono::steady_clock::now() (http…
zjin-lcf Mar 7, 2025
ce2573a
[hybridsort] replace clock() by std::chrono::steady_clock::now() (htt…
zjin-lcf Mar 7, 2025
d469ae4
[jacobi] replace clock() by std::chrono::steady_clock::now() (https:/…
zjin-lcf Mar 7, 2025
32356bc
[lsqt] replace clock() by std::chrono::steady_clock::now() (https://g…
zjin-lcf Mar 7, 2025
e163673
[miniWeather] replace clock() by std::chrono::steady_clock::now() (ht…
zjin-lcf Mar 7, 2025
5bd1529
[simplemoc] replace clock() by std::chrono::steady_clock::now() (http…
zjin-lcf Mar 7, 2025
00385fa
[sss] replace clock() by std::chrono::steady_clock::now() (https://gi…
zjin-lcf Mar 7, 2025
8a4e032
[word2vec] replace clock() by std::chrono::steady_clock::now() (https…
zjin-lcf Mar 7, 2025
b20ff2a
[word2vec-hip] use hipHostMalloc instead of hipMallocHost
zjin-lcf Mar 7, 2025
dd92e86
[sss] replace clock() by std::chrono::steady_clock::now() (https://gi…
zjin-lcf Mar 7, 2025
c9325ca
[lr] remove the unused timing function calls
zjin-lcf Mar 7, 2025
c033c54
[wedford] verify the kernel results with a reference implementation; …
zjin-lcf Mar 8, 2025
c94730a
[wedford] found the original license for the CUDA kernel
zjin-lcf Mar 8, 2025
6810a23
[wedford-omp] add the OpenMP version
zjin-lcf Mar 9, 2025
ddd8b72
[zerocopy-cuda] add a README file
zjin-lcf Mar 10, 2025
d74684d
[geodesic] add the math header
zjin-lcf Mar 17, 2025
202deb3
[heat2d] simplify the host programs without file I/O and Python scripts
zjin-lcf Mar 18, 2025
ef6fe24
[laplace3d] __[u]mul24 are legacy intrinsic functions that no longer …
zjin-lcf Mar 18, 2025
7b0a690
[knn] update the header list
zjin-lcf Mar 18, 2025
3f2dd7c
[knn-omp] update the Makefile for the icpx compiler
zjin-lcf Mar 18, 2025
c365d20
[backprop-sycl] delete the cuda directory
zjin-lcf Mar 18, 2025
ced508e
[moe] add the examples in CUDA, HIP and SYCL
zjin-lcf Mar 22, 2025
7cefaf6
[moe] add the category and reference for the examples
zjin-lcf Mar 22, 2025
10b3bae
[p4] add the math header and inline the sigmoid function
zjin-lcf Mar 23, 2025
92669b5
[permute] update the array names
zjin-lcf Mar 23, 2025
d9a5dcc
[vol2col] verify the benchmark results with reference implementations
zjin-lcf Mar 24, 2025
e72f5cb
[burger] update the array indexing macro; change the x and y dimensio…
zjin-lcf Mar 24, 2025
e78e60d
[tqs] use a `next` reference in the kernel
zjin-lcf Mar 25, 2025
c7f112d
[tqs] avoid memory leaks on verification failure
zjin-lcf Mar 25, 2025
87a9bb9
[convolution3D] verify the result of each kernel execution; disable t…
zjin-lcf Mar 26, 2025
6f7292f
[background-subtract] verify the device results with a reference impl…
zjin-lcf Mar 26, 2025
b2fb123
[clink] inline the sigmoid function
zjin-lcf Mar 26, 2025
a0d0522
[vol2col] use auto variables for pointers
zjin-lcf Mar 27, 2025
0778147
[mcpr] verify the device results with reference implementations; remo…
zjin-lcf Mar 27, 2025
a2d7ef6
[qkv-sycl] update common.hpp so it can be included in other benchmark…
zjin-lcf Mar 27, 2025
c4173e4
[pathfinder] add a space in the timing message
zjin-lcf Mar 28, 2025
7826537
[haccmk] rename the executable to main
zjin-lcf Mar 28, 2025
f8937bf
[aidw] relax the accuracy from 0.1 to 1 when the fast math option is …
zjin-lcf Mar 30, 2025
883989e
[meanshift-cuda] use a short name for threadIdx.x
zjin-lcf Mar 31, 2025
27ecbb7
[heat] rename the executable
zjin-lcf Apr 1, 2025
cd2bc43
[hwt1d] remove a character unsupported by utf-8 encoding
zjin-lcf Apr 1, 2025
7da23ee
[chemv] move the ppcg macros out of the kernel functions
zjin-lcf Apr 2, 2025
fcdf154
[axhelm] change the name of the kernel argument
zjin-lcf Apr 3, 2025
427ce8b
[md-cuda] update the indexing slightly
zjin-lcf Apr 3, 2025
5a2e794
[maxpool3d-cuda] update the indexing slightly
zjin-lcf Apr 3, 2025
d1f1335
[heat] inline the PI macro in the kernels and replace PI with M_PI in…
zjin-lcf Apr 4, 2025
bd2c779
[permute-sycl] name the SYCL queue consistently
zjin-lcf Apr 6, 2025
163c5e2
[permute-sycl] the global SYCL queue in the validation function could…
zjin-lcf Apr 6, 2025
78a1b46
[maxpool3d] remove the constraint of the image dimensions; change the…
zjin-lcf Apr 9, 2025
c19d025
[adamw] verify the device results using reference implementations; re…
zjin-lcf Apr 10, 2025
2291750
[gelu] add the baseline kernels for performance comparison
zjin-lcf Apr 11, 2025
ee1ba33
[pointwise] inline the sigmoid function
zjin-lcf Apr 15, 2025
57fcb21
[gd-omp] update the time unit
zjin-lcf Apr 16, 2025
6a79914
[adamw] add a constant template parameter in the CUDA/HIP kernels
zjin-lcf Apr 17, 2025
a325209
[adamw-omp] add an OpenMP version
zjin-lcf Apr 17, 2025
117d7a3
[pointwise] rename the kernel
zjin-lcf Apr 23, 2025
3609739
Merge branch 'master' of https://github.com/zjin-lcf/HeCBench
zjin-lcf Apr 23, 2025
6d4d2a1
[chi2-omp] rename the executable in Makefiles
zjin-lcf Apr 23, 2025
0627f6d
[assert] add the OpenMP version
zjin-lcf Apr 23, 2025
508701d
[softmax-cuda/hip] update the Makefiles
zjin-lcf Apr 23, 2025
506f728
[softmax-omp] adjust the execution time unit
zjin-lcf Apr 23, 2025
1a9ddb3
[tissue-omp] relax the error bound; disable -O3 in nvc++ due to incor…
zjin-lcf Apr 24, 2025
9674f48
[scripts] upgrade to C++17 in Makefile.aomp
zjin-lcf Apr 24, 2025
0396e5e
[pointwise-omp] fix memory map error
zjin-lcf Apr 24, 2025
93aeaac
[p4-omp] fix a memory map issue
zjin-lcf Apr 25, 2025
7ef9082
[md-sycl] define a macro for zero
zjin-lcf Apr 28, 2025
6a853ad
Add missing cuda.h includes
illuhad Apr 28, 2025
673e2cd
[geodesic] move the constants into the global space and update the ke…
zjin-lcf Apr 29, 2025
83e2b4c
Merge branch 'master' of https://github.com/zjin-lcf/HeCBench
zjin-lcf Apr 29, 2025
cf0e879
Merge pull request #110 from illuhad/pcuda-support
zjin-lcf Apr 29, 2025
268eed8
[fft] add a header for memcpy (https://github.com/zjin-lcf/HeCBench/p…
zjin-lcf Apr 29, 2025
129b88d
[ising-omp] add the OpenMP header (https://github.com/zjin-lcf/HeCBen…
zjin-lcf Apr 29, 2025
00977d0
[geodesic] update the CUDA and HIP kernels
zjin-lcf Apr 29, 2025
9232a31
[knn] display precision-based verification result
zjin-lcf Apr 29, 2025
799a6df
[scripts] add basic support of running OpenMP programs; capture error…
zjin-lcf May 2, 2025
87f3ba1
[scripts] add more benchmarks to subset.json
zjin-lcf May 2, 2025
543a859
[glu] move the definitions of variables after the boundary check
zjin-lcf May 8, 2025
f22d0a8
[chi2] move the definitions and declarations of variables after the b…
zjin-lcf May 8, 2025
f1316e5
[goulash] explicitly specify the read-only array sizes in the kernels
zjin-lcf May 8, 2025
52494dc
[iso2dfd-omp] add the OpenMP header
zjin-lcf May 8, 2025
31d514a
[laplace] add a boundary check for the kernels
zjin-lcf May 8, 2025
27bd4f3
[tissue] add a boundary check for the kernels
zjin-lcf May 9, 2025
12a756e
[aidw] add an alternative boundary check for the baseline kernels
zjin-lcf May 9, 2025
96d2fad
[clink] add a boundary check for the kernels
zjin-lcf May 9, 2025
095b4a5
[p4] remove the integer type cast in the CUDA and HIP kernels
zjin-lcf May 9, 2025
7060f44
[ace] fix a segfault error when the cube dimensions are not the same;…
zjin-lcf May 9, 2025
d16ea96
[threadfence] reduce the memory copies between host and device; incre…
zjin-lcf May 12, 2025
8dd3161
[jacobi] fix the compile error; replace the swap kernel with std::swap()
zjin-lcf May 12, 2025
1d123ae
[jacobi-sycl] factor out the SYCL kernel
zjin-lcf May 13, 2025
2b0e4cb
[all-pairs-distance] optimize the k2 kernel with a tree reduction
zjin-lcf May 13, 2025
6642eba
[bsearch] add boundary checks in the kernels
zjin-lcf May 13, 2025
1d0a745
[chacha20] performance may be improved with loop unrolling in the ker…
zjin-lcf May 13, 2025
8d1186e
[all-pairs-distance] reduce memory transfers between host and device
zjin-lcf May 13, 2025
a0ab4f4
[ace] update the reference code for result verification
zjin-lcf May 13, 2025
efb6136
[randomAccess] remove the device memory writes with per-thread variables
zjin-lcf May 13, 2025
7e6f4b5
[entropy] add boundary checks and call intrinsic/native math function…
zjin-lcf May 13, 2025
1b938c4
[extrema] slightly update the kernels; reuse the reference codes
zjin-lcf May 13, 2025
5e19aab
[jaccard] add optimized jaccard_is kernels
zjin-lcf May 13, 2025
70617ad
[readme] acknowledge Leveraging LLMs to Automate Energy-Aware Refacto…
zjin-lcf May 14, 2025
c662f70
[memcpy-omp] fix compile errors
zjin-lcf May 15, 2025
690c7eb
[cm-omp] fix the compile errors
zjin-lcf May 15, 2025
9863904
[slit] add the reference codes to fix compile errors
zjin-lcf May 15, 2025
685bbb7
[lzss] update the sample data name
zjin-lcf May 15, 2025
6911d12
Merge branch 'master' of https://github.com/zjin-lcf/HeCBench
zjin-lcf May 15, 2025
e712ed9
Merge branch 'master' of https://github.com/zjin-lcf/HeCBench
zjin-lcf May 15, 2025
b9ee0bd
[graphExecution] add the C++ chrono header
zjin-lcf May 15, 2025
1bf3b2c
[grep] update the timer header
zjin-lcf May 15, 2025
46e9019
[softmax-online-cuda] add the C++ chrono header
zjin-lcf May 15, 2025
8d88f22
[readme] acknowledge Can Large Language Models Predict Parallel Code …
zjin-lcf May 15, 2025
dbe2271
[debayer-omp] fix memory map error
zjin-lcf May 15, 2025
8e28892
Revise the copyright statement
zjin-lcf May 29, 2025
e32aba5
Add a License section
zjin-lcf May 29, 2025
32b7ac2
[bwt-cuda] add the CUDA header
zjin-lcf Jun 1, 2025
8c81664
[bwt-omp] add the OpenMP header
zjin-lcf Jun 1, 2025
8a7cf3c
[adjacent] add the CUDA and HIP examples
zjin-lcf Jun 8, 2025
64c40f7
[geglu] add the GeGLU examples
zjin-lcf Jun 9, 2025
ffd7f62
[geglu] add the category and reference for the examples
zjin-lcf Jun 9, 2025
10e1303
Add a verification system to autohecbench script
thomasRoglin Apr 24, 2025
ed66181
Use enums for benchmark status instead of string
thomasRoglin Jul 2, 2025
2344bf5
Add verification info to subset.json
thomasRoglin Jul 2, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -12,3 +12,6 @@
*.log
*.yaml
dpct_output
*/main
main
*.tmp
3 changes: 2 additions & 1 deletion LICENSE
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
BSD 3-Clause License

Copyright (c) 2020, Zheming Jin
Copyright (c) 2020 , Argonne National Laboratory (Zheming Jin)
Copyright (c) 2020-, Oak Ridge National Laboratory (Zheming Jin)
All rights reserved.

Redistribution and use in source and binary forms, with or without
Expand Down
1,855 changes: 1,515 additions & 340 deletions README.md

Large diffs are not rendered by default.

57 changes: 0 additions & 57 deletions amgmk-cuda/Makefile

This file was deleted.

Loading