Fixes:
- Fix max|min_index to behave like numo with CUMO_COMPATIBLE_MODE=ON
Fixes:
- cond_unary.c: add cudaDeviceSynchronize to avoid bus error
- index.c: add cudaDeviceSynchronize to avoid bus error
- cum.c: add cudaDeviceSynchronize to avoid bus error
Fixes:
- Fix
fixed_batch_norm
Released (same with 0.3.5)
Fixes:
- Fix
each_with_index
to synchronize on each element
Enhancements:
- Support cuDNN fixed_batch_norm (cudnnBatchNormalizationForwardInference)
Fixes:
- Fix each to synchronize on each element
Fixes:
- Fix max and max_index for sfloat and dfloat
Fixes:
- Fix batch_norm_backward
- Fix scalar.dot(scalar)
- Fix clip
Enhancements:
- Support cuDNN
- conv (cudnnConvolution)
- conv_transpose (cudnnConvolutionBackwardData)
- conv_grad_w (cudnnConvolutionBackwardFilter)
- batch_norm (cudnnBatchNormalizationForwardTraining)
- batch_norm_backward (cudnnBatchNormalizationBackward)
- avg_pool and max_pool (cudnnPoolingForward)
- avg_pool_backward and max_pool_backward (cudnnPoolingBackward)
Enhancements:
- Support arithmetic sequence, which is available in ruby >= 2.6.0 (thanks to naitoh)
Changes:
- Turn off
CUMO_SHOW_WARNING
as default
Enhancements:
- Add some missing
synchronize
workarounds
Enhancements:
- CUDA kernelize na_index_aref_naview
- CUDA kernelize na_index_aref_nadata
- CUDA kernelize diagonal
- CUDA kernelize copy
Enhancements:
- Add
CUMO_SHOW_WARNING
andCUMO_SHOW_WARNING_ONCE
environment variables to suppress cumo warnings (They are only for debug purpose, would be removed in future).
All tests in red-chainer passed.
Fixes:
- Fix advanced indexing
- Fix accum_index reduction (max_index, and min_index)