Commit

update kernel parameters for nvidia
xfong committed May 9, 2022
1 parent 10f8892 commit 38e62c0
Showing 2 changed files with 2 additions and 0 deletions.
1 change: 1 addition & 0 deletions opencl/init.go
@@ -226,6 +226,7 @@ func Init(gpu int) {
 	ClMaxWGNum = ClCUnits
 	ClTotalPE = ClWGSize[2] * ClCUnits
 	if GPUVend == 1 { // Nvidia
+		ClTotalPE *= 2
 		if ClMaxWGSize > ClTotalPE {
 			ClMaxWGNum = 1
 			ClMaxWGSize = ClTotalPE
1 change: 1 addition & 0 deletions opencl64/init.go
@@ -229,6 +229,7 @@ func Init(gpu int) {
 	ClMaxWGNum = ClCUnits
 	ClTotalPE = ClWGSize[2] * ClCUnits
 	if GPUVend == 1 { // Nvidia
+		ClTotalPE *= 2
 		if ClMaxWGSize > ClTotalPE {
 			ClMaxWGNum = 1
 			ClMaxWGSize = ClTotalPE
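
The effect of the added line can be sketched in isolation. The standalone Go program below is an illustration only, not the repository's code: `Tuning` and `tune` are hypothetical names, the package-level variables from the hunk are passed in as plain parameters, and only the lines visible in the hunk are mirrored (whatever follows in `Init` is not shown in this diff and is not modelled).

```go
package main

import "fmt"

// Tuning groups the three values the hunk assigns (hypothetical type).
type Tuning struct {
	TotalPE   int // stands in for ClTotalPE
	MaxWGNum  int // stands in for ClMaxWGNum
	MaxWGSize int // stands in for ClMaxWGSize
}

// tune mirrors only the lines visible in the hunk.
// wgSize2 stands in for ClWGSize[2], cUnits for ClCUnits, and
// maxWGSize for the value ClMaxWGSize holds before the hunk runs.
func tune(wgSize2, cUnits, maxWGSize int, nvidia bool) Tuning {
	t := Tuning{
		TotalPE:   wgSize2 * cUnits,
		MaxWGNum:  cUnits,
		MaxWGSize: maxWGSize,
	}
	if nvidia {
		t.TotalPE *= 2 // the line this commit adds
		if t.MaxWGSize > t.TotalPE {
			// Clamp to a single work-group no larger than TotalPE.
			t.MaxWGNum = 1
			t.MaxWGSize = t.TotalPE
		}
	}
	return t
}

func main() {
	// Illustrative inputs, not measured device values.
	fmt.Printf("%+v\n", tune(32, 4, 1024, true))
	fmt.Printf("%+v\n", tune(32, 4, 1024, false))
}
```

With these made-up inputs, the Nvidia branch doubles the product 32 * 4 = 128 to 256 before the comparison, so the clamp caps the work-group size at 256 where it would previously have capped it at 128.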