========================= (Pending) Release Notes v0.X.Y =========================
API:
Performance optimizations:
Internal features:
Build system:
Bug fixes:
Retired features:
========================= Release Notes v0.3.0 =========================
API:
- Added a thin GPU portability layer that supports the CUDA and ROCm
(HIP) programming environments.
- Added a spdlog-based logging capability.
- Added initial support for (sequential) tensors.
Performance optimizations:
Internal features:
- MIOpen is supported as a backend for "DNN compute kernels" in
DistConv.
- DistConv is supported on ROCm systems.
- Added support for GitLab CI.
- Added support for DaCe convolution kernels in DistConv.
Build system:
- Several updates to support ROCm 5.*.
- Requires CMake >= 3.21.
- Requires ROCm 5 when targeting ROCm-based platforms; the latest
available minor release is STRONGLY recommended. The ROCm packages
currently required by this stack are:
-- HIP (host)
-- hipCUB
-- ROCm-SMI
-- MIOpen
-- ROC-tracer
Bug fixes:
- Fixed a bug in kernel selection for batchnorm gradients in the case of
overlap/strides.
Retired features:
========================= Release Notes v0.2.1 =========================
API:
Performance optimizations:
Internal features:
- Updated to support the Aluminum v0.7 API
- Aluminum is now only required when DistConv Legacy is enabled
Build system:
- Require Aluminum v0.7.0
Bug fixes:
Retired features:
========================= Release Notes v0.2.0 =========================
API:
Performance optimizations:
Internal features:
- Updated to support the Aluminum v0.5 API
Build system:
- Require Aluminum v0.5.0
Bug fixes:
Retired features:
========================= Release Notes v0.1 =========================
Initial release of the DiHydrogen Distributed Tensor Library.
DiHydrogen is the second version of the Hydrogen fork of the
well-known distributed linear algebra library, Elemental. DiHydrogen
aims to be a basic distributed multilinear algebra interface with a
particular emphasis on the needs of the distributed machine learning
effort, LBANN.
API:
Performance optimizations:
Internal features:
- Support for the DistConv legacy library
- Added multi-dimensional tensor class
- Added support for older CUDA architectures (compute capability 3.0+)
- Use CUB for memory management
- Support for point-to-point (P2P) library
- Support for NVSHMEM halo exchange
DistConv Legacy:
- batchnorm
- channel-wise softmax
- concat
- convolution
- deconv
- leaky-relu
- pooling
- relu
- slice
- softmax
Build system:
- Require C++14
- Added support for cuDNN v8
Bug fixes:
- Fixed build issues with GCC 8
Retired features: