Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

why cuNDT cannot work with x86 version ubuntu20.04 #50

Open
pauls0323 opened this issue Sep 6, 2023 · 1 comment
Open

why cuNDT cannot work with x86 version ubuntu20.04 #50

pauls0323 opened this issue Sep 6, 2023 · 1 comment

Comments

@pauls0323
Copy link

:~/cuPCL/cuNDT$ ./demo

GPU has cuda devices: 1
----device id: 0 info----
GPU : NVIDIA A800 80GB PCIe
Capbility: 8.0
Global memory: 81085MB
Const memory: 64KB
SM in a block: 48KB
warp size: 32
threads in a block: 1024
block dim: (1024,1024,64)
grid dim: (2147483647,65535,65535)

Loaded 7000 data points for P with the following fields: x y z
Loaded 7000 data points for Q with the following fields: x y z
Target rigid transformation : cloud_P -> cloud_Q
Rotation matrix :
| 0.923880 -0.382683 0.000000 |
R = | 0.382683 0.923880 0.000000 |
| 0.000000 0.000000 1.000000 |
Translation vector :
t = < 0.000000, 0.000000, 0.200000 >

------------checking PCL NDT(CPU)----------------
PCL align Time: 27.1937 ms.
Normal Distributions Transform has converged: 1 score: 0.648334
Rotation matrix :
| 0.999894 0.004857 0.013688 |
R = | -0.004680 0.999905 -0.012931 |
| -0.013750 0.012865 0.999823 |
Translation vector :
t = < 0.015418, 0.056840, 0.078443 >

------------checking CUDA NDT(GPU)----------------
CUDA NDT by Time: 0.777725 ms.
CUDA NDT fitness_score: 0.349491
Rotation matrix :
| 0.000000 0.000000 0.000000 |
R = | 0.000000 0.000000 0.000000 |
| 0.000000 0.000000 0.000000 |
Translation vector :
t = < 0.000000, 0.000000, 0.000000 >

@QYChan
Copy link

QYChan commented Sep 8, 2023

x86 Ubuntu18.04 can not work either

GPU has cuda devices: 1
----device id: 0 info----
  GPU : NVIDIA GeForce RTX 3060 
  Capbility: 8.6
  Global memory: 12036MB
  Const memory: 64KB
  SM in a block: 48KB
  warp size: 32
  threads in a block: 1024
  block dim: (1024,1024,64)
  grid dim: (2147483647,65535,65535)

Loaded 7000 data points for P with the following fields: x y z
Loaded 7000 data points for Q with the following fields: x y z
Target rigid transformation : cloud_P -> cloud_Q
Rotation matrix :
    | 0.923880 -0.382683 0.000000 | 
R = | 0.382683 0.923880 0.000000 | 
    | 0.000000 0.000000 1.000000 | 
Translation vector :
t = < 0.000000, 0.000000, 0.200000 >

------------checking PCL NDT(CPU)---------------- 
PCL align Time: 48.3015 ms.
Normal Distributions Transform has converged: 1 score: 0.540197
Rotation matrix :
    | 0.999237 0.009737 0.037825 | 
R = | -0.008490 0.999419 -0.033002 | 
    | -0.038124 0.032655 0.998739 | 
Translation vector :
t = < 0.038175, 0.110290, 0.185038 >

------------checking CUDA NDT(GPU)---------------- 
CUDA NDT by Time: 0.596691 ms.
CUDA NDT fitness_score: 0.349491
Rotation matrix :
    | 0.000000 0.000000 0.000000 | 
R = | 0.000000 0.000000 0.000000 | 
    | 0.000000 0.000000 0.000000 | 
Translation vector :
t = < 0.000000, 0.000000, 0.000000 >

@pauls0323 pauls0323 reopened this Sep 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants