X not started #58

Open
RailWar opened this issue Jul 7, 2021 · 3 comments

RailWar commented Jul 7, 2021

I have an RTX 2080 (connected to a monitor) and an Intel UHD 630 (primary, also connected to a monitor). The host runs AlmaLinux 8.4 (a RHEL 8 clone); the guest OS is Ubuntu 20.04.
I used NVIDIA-GRID-Linux-KVM-460.32.04-460.32.03-461.33.zip and installed NVIDIA-Linux-x86_64-460.32.04-vgpu-kvm.run.
After installation, the host reports the following:

mdevctl types
[user@localhost work]$ mdevctl types
0000:01:00.0
nvidia-256
  Available instances: 0
  Device API: vfio-pci
  Name: GRID RTX6000-1Q
  Description: num_heads=4, frl_config=60, framebuffer=1024M, max_resolution=5120x2880, max_instance=24
nvidia-257
  Available instances: 0
  Device API: vfio-pci
  Name: GRID RTX6000-2Q
  Description: num_heads=4, frl_config=60, framebuffer=2048M, max_resolution=7680x4320, max_instance=12
nvidia-258
  Available instances: 0
  Device API: vfio-pci
  Name: GRID RTX6000-3Q
  Description: num_heads=4, frl_config=60, framebuffer=3072M, max_resolution=7680x4320, max_instance=8
nvidia-259
  Available instances: 0
  Device API: vfio-pci
  Name: GRID RTX6000-4Q
  Description: num_heads=4, frl_config=60, framebuffer=4096M, max_resolution=7680x4320, max_instance=6
nvidia-260
  Available instances: 3
  Device API: vfio-pci
  Name: GRID RTX6000-6Q
  Description: num_heads=4, frl_config=60, framebuffer=6144M, max_resolution=7680x4320, max_instance=4
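
In the listing above only nvidia-260 reports free instances. A minimal sketch for filtering such a listing down to the usable types is shown here; it assumes the field layout printed above, and the function name available_vgpu_types is made up for illustration.

```shell
# Print "<type> <free instances> <name>" for every vGPU type that still
# has at least one free instance. Reads `mdevctl types` output on stdin.
# Assumption: each type lists "Available instances" before "Name",
# as in the output shown in this issue.
available_vgpu_types() {
    awk '
        /^[ \t]*nvidia-[0-9]+[ \t]*$/ { type = $1 }
        /Available instances:/        { free = $3 }
        /^[ \t]*Name:/ {
            sub(/^[ \t]*Name:[ \t]*/, "")
            if (free > 0) print type, free, $0
        }
    '
}

# Usage on the host (not run here):
#   mdevctl types | available_vgpu_types
```
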
First question: why does the card show up as an RTX6000 (TU102GL) and not as a TU104GL device? My card has a TU104 GPU.

Next, I created a new VM with Ubuntu 20.04 in KVM, added the vGPU as a new PCI device via XML, and installed NVIDIA-Linux-x86_64-460.32.03-grid.run in the guest.
After a reboot there is no graphics output, only a black screen.
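
The XML used for the PCI device is not shown in the report. As a rough sketch, and assuming the vGPU was created as a mediated device (mdev) of one of the types listed by mdevctl, a libvirt hostdev element for it typically has the shape printed by the helper below. The function name mdev_hostdev_xml and any UUID passed to it are placeholders for illustration, not values from this setup.

```shell
# Print a libvirt <hostdev> element for a mediated device with the given UUID.
# Assumption: the mdev already exists on the host and uses the vfio-pci
# device API, as reported by `mdevctl types` above.
mdev_hostdev_xml() {
    uuid=$1
    # Reject anything that is not a canonical 8-4-4-4-12 hex UUID.
    if ! printf '%s\n' "$uuid" | grep -Eq '^[0-9a-fA-F]{8}(-[0-9a-fA-F]{4}){3}-[0-9a-fA-F]{12}$'; then
        echo "invalid UUID: $uuid" >&2
        return 1
    fi
    cat <<EOF
<hostdev mode='subsystem' type='mdev' model='vfio-pci'>
  <source>
    <address uuid='$uuid'/>
  </source>
</hostdev>
EOF
}

# Usage on the host (not run here):
#   mdev_hostdev_xml "$(uuidgen)"
```

The printed element would then go inside the devices section of the domain definition, for example through virsh edit.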

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.32.03    Driver Version: 460.32.03    CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  GRID RTX6000-6Q     On   | 00000000:04:00.0 Off |                  N/A |
| N/A   N/A    P8    N/A /  N/A |    432MiB /  6144MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                             
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
lsmod | grep nvidia
nvidia_drm             57344  0
nvidia_modeset       1228800  1 nvidia_drm
nvidia              34050048  1 nvidia_modeset
drm_kms_helper        217088  4 qxl,nvidia_drm
drm                   552960  6 drm_kms_helper,qxl,drm_ttm_helper,nvidia_drm,ttm
Xorg.0.log
[     3.818] (--) Log file renamed from "/var/log/Xorg.pid-895.log" to "/var/log/Xorg.0.log"
[     3.818] 
X.Org X Server 1.20.9
X Protocol Version 11, Revision 0
[     3.836] Build Operating System: Linux 4.15.0-130-generic x86_64 Ubuntu
[     3.836] Current Operating System: Linux user-KVM 5.8.0-59-generic #66~20.04.1-Ubuntu SMP Thu Jun 17 11:14:10 UTC 2021 x86_64
[     3.836] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-5.8.0-59-generic root=UUID=7cbeea00-f21b-4a76-ba88-0649e9195180 ro quiet splash vt.handoff=7
[     3.836] Build Date: 17 January 2021  09:13:31AM
[     3.837] xorg-server 2:1.20.9-2ubuntu1.2~20.04.1 (For technical support please see http://www.ubuntu.com/support) 
[     3.837] Current version of pixman: 0.38.4
[     3.837] 	Before reporting problems, check http://wiki.x.org
  to make sure that you have the latest version.
[     3.837] Markers: (--) probed, (**) from config file, (==) default setting,
  (++) from command line, (!!) notice, (II) informational,
  (WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[     3.839] (==) Log file: "/var/log/Xorg.0.log", Time: Wed Jul  7 16:54:24 2021
[     3.839] (==) Using config file: "/etc/X11/xorg.conf"
[     3.839] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[     3.839] (==) ServerLayout "Layout0"
[     3.839] (**) |-->Screen "Screen0" (0)
[     3.840] (**) |   |-->Monitor "Monitor0"
[     3.842] (**) |   |-->Device "Device0"
[     3.842] (**) |-->Input Device "Keyboard0"
[     3.842] (**) |-->Input Device "Mouse0"
[     3.842] (==) Automatically adding devices
[     3.842] (==) Automatically enabling devices
[     3.842] (==) Automatically adding GPU devices
[     3.843] (==) Automatically binding GPU devices
[     3.847] (==) Max clients allowed: 256, resource mask: 0x1fffff
[     3.847] (WW) The directory "/usr/share/fonts/X11/cyrillic" does not exist.
[     3.847] 	Entry deleted from font path.
[     3.847] (WW) The directory "/usr/share/fonts/X11/100dpi/" does not exist.
[     3.847] 	Entry deleted from font path.
[     3.847] (WW) The directory "/usr/share/fonts/X11/75dpi/" does not exist.
[     3.847] 	Entry deleted from font path.
[     3.847] (WW) The directory "/usr/share/fonts/X11/100dpi" does not exist.
[     3.847] 	Entry deleted from font path.
[     3.847] (WW) The directory "/usr/share/fonts/X11/75dpi" does not exist.
[     3.848] 	Entry deleted from font path.
[     3.848] (==) FontPath set to:
  /usr/share/fonts/X11/misc,
  /usr/share/fonts/X11/Type1,
  built-ins
[     3.848] (==) ModulePath set to "/usr/lib/xorg/modules"
[     3.848] (WW) Hotplugging is on, devices using drivers 'kbd', 'mouse' or 'vmmouse' will be disabled.
[     3.848] (WW) Disabling Keyboard0
[     3.848] (WW) Disabling Mouse0
[     3.848] (II) Loader magic: 0x556d1102f020
[     3.848] (II) Module ABI versions:
[     3.856] 	X.Org ANSI C Emulation: 0.4
[     3.860] 	X.Org Video Driver: 24.1
[     3.860] 	X.Org XInput driver : 24.1
[     3.860] 	X.Org Server Extension : 10.0
[     3.861] (++) using VT number 1

[     3.863] (II) systemd-logind: took control of session /org/freedesktop/login1/session/c1
[     3.864] (II) xfree86: Adding drm device (/dev/dri/card0)
[     3.868] (II) systemd-logind: got fd for /dev/dri/card0 226:0 fd 12 paused 0
[     3.877] (II) xfree86: Adding drm device (/dev/dri/card1)
[     3.878] (II) systemd-logind: got fd for /dev/dri/card1 226:1 fd 13 paused 0
[     3.909] (--) PCI:*(0@0:1:0) 1b36:0100:1af4:1100 rev 4, Mem @ 0xf0000000/67108864, 0xf4000000/67108864, 0xfcc14000/8192, I/O @ 0x0000c040/32, BIOS @ 0x????????/131072
[     3.909] (--) PCI: (4@0:0:0) 10de:1e30:10de:1329 rev 161, Mem @ 0xfa000000/16777216, 0xd0000000/268435456, 0xf8000000/33554432
[     3.910] (II) LoadModule: "glx"
[     3.910] (II) Loading /usr/lib/xorg/modules/extensions/libglx.so
[     3.910] (II) Module glx: vendor="X.Org Foundation"
[     3.910] 	compiled for 1.20.9, module version = 1.0.0
[     3.911] 	ABI class: X.Org Server Extension, version 10.0
[     3.911] (II) LoadModule: "nvidia"
[     3.911] (II) Loading /usr/lib/xorg/modules/drivers/nvidia_drv.so
[     3.911] (II) Module nvidia: vendor="NVIDIA Corporation"
[     3.911] 	compiled for 1.6.99.901, module version = 1.0.0
[     3.911] 	Module class: X.Org Video Driver
[     3.911] (II) NVIDIA dlloader X Driver  460.32.03  Sun Dec 27 18:56:00 UTC 2020
[     3.912] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[     3.912] (II) systemd-logind: releasing fd for 226:1
[     3.912] (II) Loading sub module "fb"
[     3.912] (II) LoadModule: "fb"
[     3.913] (II) Loading /usr/lib/xorg/modules/libfb.so
[     3.913] (II) Module fb: vendor="X.Org Foundation"
[     3.913] 	compiled for 1.20.9, module version = 1.0.0
[     3.913] 	ABI class: X.Org ANSI C Emulation, version 0.4
[     3.913] (II) Loading sub module "wfb"
[     3.913] (II) LoadModule: "wfb"
[     3.913] (II) Loading /usr/lib/xorg/modules/libwfb.so
[     3.914] (II) Module wfb: vendor="X.Org Foundation"
[     3.914] 	compiled for 1.20.9, module version = 1.0.0
[     3.914] 	ABI class: X.Org ANSI C Emulation, version 0.4
[     3.914] (II) Loading sub module "ramdac"
[     3.914] (II) LoadModule: "ramdac"
[     3.914] (II) Module "ramdac" already built-in
[     3.915] (EE) No devices detected.
[     3.915] (II) Applying OutputClass "nvidia" to /dev/dri/card1
[     3.915] 	loading driver: nvidia
[     4.058] (==) Matched qxl as autoconfigured driver 0
[     4.058] (==) Matched nvidia as autoconfigured driver 1
[     4.058] (==) Matched nouveau as autoconfigured driver 2
[     4.058] (==) Matched modesetting as autoconfigured driver 3
[     4.058] (==) Matched fbdev as autoconfigured driver 4
[     4.058] (==) Matched vesa as autoconfigured driver 5
[     4.058] (==) Assigned the driver to the xf86ConfigLayout
[     4.059] (II) LoadModule: "qxl"
[     4.059] (II) Loading /usr/lib/xorg/modules/drivers/qxl_drv.so
[     4.059] (II) Module qxl: vendor="X.Org Foundation"
[     4.059] 	compiled for 1.20.7, module version = 0.1.5
[     4.059] 	Module class: X.Org Video Driver
[     4.059] 	ABI class: X.Org Video Driver, version 24.1
[     4.059] (II) LoadModule: "nvidia"
[     4.059] (II) Loading /usr/lib/xorg/modules/drivers/nvidia_drv.so
[     4.059] (II) Module nvidia: vendor="NVIDIA Corporation"
[     4.059] 	compiled for 1.6.99.901, module version = 1.0.0
[     4.059] 	Module class: X.Org Video Driver
[     4.059] (II) UnloadModule: "nvidia"
[     4.059] (II) Unloading nvidia
[     4.059] (II) Failed to load module "nvidia" (already loaded, 0)
[     4.059] (II) LoadModule: "nouveau"
[     4.059] (II) Loading /usr/lib/xorg/modules/drivers/nouveau_drv.so
[     4.059] (II) Module nouveau: vendor="X.Org Foundation"
[     4.059] 	compiled for 1.20.3, module version = 1.0.16
[     4.059] 	Module class: X.Org Video Driver
[     4.059] 	ABI class: X.Org Video Driver, version 24.0
[     4.059] (II) LoadModule: "modesetting"
[     4.059] (II) Loading /usr/lib/xorg/modules/drivers/modesetting_drv.so
[     4.059] (II) Module modesetting: vendor="X.Org Foundation"
[     4.059] 	compiled for 1.20.9, module version = 1.20.9
[     4.059] 	Module class: X.Org Video Driver
[     4.059] 	ABI class: X.Org Video Driver, version 24.1
[     4.059] (II) LoadModule: "fbdev"
[     4.059] (II) Loading /usr/lib/xorg/modules/drivers/fbdev_drv.so
[     4.059] (II) Module fbdev: vendor="X.Org Foundation"
[     4.059] 	compiled for 1.20.1, module version = 0.5.0
[     4.059] 	Module class: X.Org Video Driver
[     4.059] 	ABI class: X.Org Video Driver, version 24.0
[     4.059] (II) LoadModule: "vesa"
[     4.059] (II) Loading /usr/lib/xorg/modules/drivers/vesa_drv.so
[     4.059] (II) Module vesa: vendor="X.Org Foundation"
[     4.059] 	compiled for 1.20.4, module version = 2.4.0
[     4.059] 	Module class: X.Org Video Driver
[     4.059] 	ABI class: X.Org Video Driver, version 24.0
[     4.059] (II) NVIDIA dlloader X Driver  460.32.03  Sun Dec 27 18:56:00 UTC 2020
[     4.059] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[     4.059] (II) qxl: Driver for QXL virtual graphics: QXL 1
[     4.059] (II) NOUVEAU driver Date:   Mon Jan 28 23:25:58 2019 -0500
[     4.059] (II) NOUVEAU driver for NVIDIA chipset families :
[     4.059] 	RIVA TNT            (NV04)
[     4.059] 	RIVA TNT2           (NV05)
[     4.059] 	GeForce 256         (NV10)
[     4.059] 	GeForce 2           (NV11, NV15)
[     4.059] 	GeForce 4MX         (NV17, NV18)
[     4.060] 	GeForce 3           (NV20)
[     4.060] 	GeForce 4Ti         (NV25, NV28)
[     4.060] 	GeForce FX          (NV3x)
[     4.060] 	GeForce 6           (NV4x)
[     4.060] 	GeForce 7           (G7x)
[     4.060] 	GeForce 8           (G8x)
[     4.060] 	GeForce 9           (G9x)
[     4.060] 	GeForce GTX 2xx/3xx (GT2xx)
[     4.060] 	GeForce GTX 4xx/5xx (GFxxx)
[     4.060] 	GeForce GTX 6xx/7xx (GKxxx)
[     4.060] 	GeForce GTX 9xx     (GMxxx)
[     4.060] 	GeForce GTX 10xx    (GPxxx)
[     4.060] (II) modesetting: Driver for Modesetting Kernel Drivers: kms
[     4.060] (II) FBDEV: driver for framebuffer: fbdev
[     4.060] (II) VESA: driver for VESA chipsets: vesa
[     4.060] (WW) Falling back to old probe method for modesetting
[     4.060] (WW) Falling back to old probe method for fbdev
[     4.060] (WW) Falling back to old probe method for modesetting
[     4.060] (WW) Falling back to old probe method for fbdev
[     4.060] (II) [KMS] Kernel modesetting enabled.
[     4.060] (EE) No devices detected.
[     4.060] (EE) 
Fatal server error:
[     4.060] (EE) no screens found(EE) 
[     4.060] (EE) 
Please consult the The X.Org Foundation support 
   at http://wiki.x.org
for help. 
[     4.060] (EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.
[     4.060] (EE) 
[     4.066] (EE) Server terminated with error (1). Closing log file.
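
The log above is long, and the relevant part is the handful of (EE) lines: the server probes both the QXL adapter (0@0:1:0, marked primary with *) and the NVIDIA device (4@0:0:0), yet ends with "No devices detected" and "no screens found". A small sketch for pulling only the error lines out of such a log follows; the function name xorg_errors is made up for illustration.

```shell
# Print only the timestamped (EE) lines that carry a message.
# Reads an Xorg log on stdin. The marker legend near the top of the log
# also contains "(EE)", so the pattern is anchored to the "[  time]" prefix,
# and empty "(EE)" separator lines are skipped.
xorg_errors() {
    grep -E '^\[ *[0-9]+\.[0-9]+\] \(EE\) .'
}

# Usage in the guest (not run here):
#   xorg_errors < /var/log/Xorg.0.log
```
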
@RailWar RailWar closed this as completed Jul 7, 2021
@RailWar RailWar reopened this Jul 7, 2021

RailWar commented Jul 11, 2021

I managed to start Xorg by adding BusID "PCI:4:0:0" to xorg.conf. But now I see only a black screen with the Ubuntu logo. What should I do next?
How can I check that the nvidia driver loaded successfully, and how can I use VNC with the NVIDIA vGPU?
ps -A | grep gnome-session shows two gnome-session-* processes.
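
The BusID value corresponds to the guest PCI address 00000000:04:00.0 shown by nvidia-smi in the first comment. Tools such as nvidia-smi and lspci print the address in hexadecimal, while xorg.conf expects decimal numbers, so the two only look identical for small values. A minimal conversion sketch, with the hypothetical helper name pci_to_xorg_busid:

```shell
# Convert a hexadecimal PCI address ("00000000:04:00.0" or "04:00.0")
# into the decimal "PCI:bus:device:function" form used by xorg.conf.
pci_to_xorg_busid() {
    if ! printf '%s\n' "$1" | grep -Eq '^([0-9a-fA-F]+:)?[0-9a-fA-F]{2}:[0-9a-fA-F]{2}\.[0-7]$'; then
        echo "unrecognised PCI address: $1" >&2
        return 1
    fi
    rest=${1%.*}      # drop ".function"      -> "00000000:04:00"
    func=${1##*.}     # function number       -> "0"
    dev=${rest##*:}   # device number         -> "00"
    rest=${rest%:*}   # drop ":device"        -> "00000000:04"
    bus=${rest##*:}   # bus number, no domain -> "04"
    printf 'PCI:%d:%d:%d\n' "$((0x$bus))" "$((0x$dev))" "$((0x$func))"
}
```

The result goes into the Device section of xorg.conf as the BusID option, next to the Driver "nvidia" line.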

DualCoder (Owner) commented

> First, why RTX6000 with TU102GL, not TU104GL? My card has a TU104 GPU.

The RTX6000 (TU102) offers more profiles than the T4 (TU104), so by using it you get access to 3 and 6 GB profiles.

The full lists are here:
https://docs.nvidia.com/grid/latest/grid-vgpu-user-guide/index.html#vgpu-types-tesla-t4
https://docs.nvidia.com/grid/latest/grid-vgpu-user-guide/index.html#vgpu-types-quadro-rtx6000

> I managed to start Xorg. I changed the mdev to nvidia-259 (GRID RTX6000-4Q Description: num_heads=4, frl_config=60, framebuffer=4096M, max_resolution=7680x4320). But now I see only a black screen with the Ubuntu logo. What should I do next?
> How can I check that the nvidia driver loaded successfully?

Running nvidia-smi on the guest should show that the vGPU (RTX6000-4Q) is present and that Xorg is running (listed under Processes:). For example, on my P40-based setup:

$ nvidia-smi
Sun Jul 11 14:53:41 2021       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.73.01    Driver Version: 460.73.01    CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  GRID P40-8Q         Off  | 00000000:07:01.0  On |                  N/A |
| N/A   N/A    P8    N/A /  N/A |    666MiB /  8192MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A       762      G   /usr/lib/xorg/Xorg                 94MiB |
+-----------------------------------------------------------------------------+
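
As a small sketch of that check, the helper below scans nvidia-smi output for a graphics (type G) process whose name ends in Xorg. It assumes the column layout of the 460-series table shown above, and the function name xorg_uses_gpu is made up for illustration.

```shell
# Succeed (exit 0) if the nvidia-smi process table on stdin lists Xorg
# as a graphics process, fail (exit 1) otherwise.
# A process row splits into these whitespace-separated fields:
#   | GPU GI CI PID Type Name Memory |
xorg_uses_gpu() {
    awk '$1 == "|" && $6 == "G" && $7 ~ /\/Xorg$/ { found = 1 }
         END { exit !found }'
}

# Usage in the guest (not run here):
#   nvidia-smi | xorg_uses_gpu && echo "Xorg is rendering on the vGPU"
```
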

> and how can I use VNC with nvidia?

To use VNC you can install x11vnc on the guest. It serves the existing X display, so rendering still takes advantage of the vGPU. Note that I had trouble getting this to work with the GNOME display manager and ended up installing LightDM instead.

sudo x11vnc -display :0 -auth /var/run/lightdm/root/:0 -forever -shared

> ps -A | grep gnome-session shows two gnome-session-*

I see that you still have the QXL graphics adapter attached to the VM; maybe that is the issue? Either way, I would remove that display adapter, since I don't see why you would need it.

Also, if it helps, here is my xorg.conf

# nvidia-xconfig: X configuration file generated by nvidia-xconfig
# nvidia-xconfig:  version 460.73.01


Section "ServerLayout"
    Identifier     "Layout0"
    Screen      0  "Screen0" 0 0
    InputDevice    "Keyboard0" "CoreKeyboard"
    InputDevice    "Mouse0" "CorePointer"
EndSection

Section "Files"
EndSection

Section "InputDevice"

    # generated from default
    Identifier     "Mouse0"
    Driver         "mouse"
    Option         "Protocol" "auto"
    Option         "Device" "/dev/psaux"
    Option         "Emulate3Buttons" "no"
    Option         "ZAxisMapping" "4 5"
EndSection

Section "InputDevice"

    # generated from default
    Identifier     "Keyboard0"
    Driver         "kbd"
EndSection

Section "Monitor"
    Identifier     "Monitor0"
    VendorName     "Unknown"
    ModelName      "Unknown"
    Option         "DPMS"
EndSection

Section "Device"
    Identifier     "Device0"
    Driver         "nvidia"
    VendorName     "NVIDIA Corporation"
EndSection

Section "Screen"
    Identifier     "Screen0"
    Device         "Device0"
    Monitor        "Monitor0"
    DefaultDepth    24
    SubSection     "Display"
        Depth       24
    EndSubSection
EndSection


RailWar commented Jul 13, 2021

Thank you, it works now. But the frame rate is very low (a few frames per second) and GPU usage goes up to 98%.
