Skip to content

Commit b80f0cb

Browse files
author
TinySemVer
committed
Release: v0.11.0 [skip ci]
### Minor - Add: `dp4a` & `umul24` instructions (ce1e3b7) - Add: DPX instructions on Hopper (1ab4f41) - Add: In-register FMA benchmarks for GPUs (97991fd) ### Patch - Docs: FMA CUDA throughput (c00e421) - Fix: Initialize FMA inputs (22f52c4) - Improve: Naming variables (80e1d83) - Fix: `bf16` requires Ampere (306ee3f)
1 parent 4445bf3 commit b80f0cb

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

CMakeLists.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ cmake_minimum_required(VERSION 3.25.2 FATAL_ERROR)
77
# ------------------------------------------------------------------------------
88
project(
99
less_slow
10-
VERSION 0.10.12
10+
VERSION 0.11.0
1111
LANGUAGES C CXX ASM
1212
DESCRIPTION
1313
"Learning how to write Less Slow code, from numerical micro-kernels and SIMD to coroutines, ranges, and polymorphic state machines"

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.10.12
1+
0.11.0

0 commit comments

Comments
 (0)