cuda-packages

Out of tree (Nixpkgs) experiments with packaging CUDA in an extensible way.

Most code lives in Nixpkgs and is copied/modified here for ease of development.

Top-level:

cudaConfig: evaluated configuration for CUDA package sets
- includes hostNixSystem, hostRedistArch, and cudaCapabilities (among others), which are helpful when writing modules for cudaModules with the goal of conditionally changing the build based on what's being targeted through mkMerge and mkIf.
cudaPackagesExtensions: extensions (overlays) applied to each CUDA package set
- an easy way to add packages to all CUDA package sets
cudaModules: modules which change the creation of CUDA package sets
- an easy way to add new CUDA package sets, change the defaults, or add packages to a specific CUDA package set

Notes

Python wrappers which invoke CMake do not always pass their environment to the CMake process. That means a number of the environment variables we set so CMake's auto-detection functionality just works is broken.
autoAddDriverRunpath for CMake projects is a crutch -- the correct fix is to have the CMake project link against CUDA::cudart.
cudaStdenv sets strictDeps=true and __structuredAttrs=true by default. Packages must have a good reason to opt out (e.g., Python packaging has not been updated yet to support structured attributes: NixOS/nixpkgs#347194).
cudaStdenv uses a name prefix for more descriptive store path names.
- Prefix is available as cudaStdenv.cudaNamePrefix.
Manifests and overrides are versioned
- This prevents conflicts when downstream consumers add their own manifests and overrides.
noBrokenSymlinksHook checks for broken or reflexive symlinks in your outputs, which are usually a sign of packaging gone wrong
12.2.2 is kept around because it is the last version of CUDA 12 supported by Xavier through cuda_compat
- DO NOT rely on it being around forever -- try to upgrade to newer hardware!
cuda_compat can be disabled by setting the package to null. This is useful in cases where the host OS has a recent enough CUDA driver that the compatibility library isn't needed.

cuda_compat should only be used when the host driver is not equal to the version of the CUDA package set currently being used.
- Would detecting that would be impure?
When requested capability is newer than what is supported by version of CUDA, emit PTX for forward compat?
Update modules/gpus.nix for Jetson devices (i.e., Xavier and Orin on JetPack 5 max out at 12.2 with cuda_compat)
- This would require knowing what the host driver version is, which is impure
Allow devices to be in pkgsCuda if at least one CUDA package set version supports them?
- Hide the other releases?
A bunch of stuff (including docs)
Throw if trying to build for a capability newer than the CUDA package set can support.
Figure out why separable compilation isn't working.
Packages:

Name		Name	Last commit message	Last commit date
Latest commit History 163 Commits
cuda-packages		cuda-packages
lib		lib
modules		modules
nvidia-redist-json		nvidia-redist-json
scripts		scripts
upstreamable-packages		upstreamable-packages
.gitignore		.gitignore
README.md		README.md
flake.lock		flake.lock
flake.nix		flake.nix
overlay.nix		overlay.nix