This repository contains the mods2docs tool for parsing module files and generating documentation (including handling dependencies for various architectures). It also contains supporting scripts for syncing the generated documentation with a repository. Together they automate the documentation process for an "All Package Index" section of an HPC cluster's documentation website.
See Contributing if you would like to contribute to this project.
- Support any number of architectures
- Add more example workflows
The repository includes several top-level scripts plus modules organised under mods2docs, each with a specific role in the pipeline:
|____ README
|____ config.env          configuration file
|____ run-hpc-rocket.sh   initiates a Slurm job on the cluster
|____ setup_local.sh      activates the required conda environment, creating it if it doesn't exist
|____ slurm.sh            job script that runs collect_data.py on the HPC cluster
|____ sync_stacks.sh      syncs *.rst files into the HPC docs repo
|____ mods2docs
| |____ config.py         configuration file, updated from config.env
| |____ config.yml        template configuration for HPC-Rocket
| |____ collect_data.py   parses module files on the modulepaths for each architecture
| |____ utils.py          commonly used functions
| |____ start_pipeline.py produces *.rst files, running collect_data.py if it has not already been run today
| |____ writer
| | |____ common.py       commonly used writer functions
| | |____ obsidian.py     creates markdown files for a force-directed graph in Obsidian
| | |____ rest.py         produces *.rst files for Sphinx documentation
| |____ parser
| | |____ common.py       commonly used parser functions
| | |____ lmod.py         parses Lua module files on the modulepath
# clone repository
git clone https://github.com/Carldkennedy/all_package_index.git
cd all_package_index
vim config.env # See Configuration below
Configuration is handled through config.env, which populates config.py in mods2docs, is used to create a config for HPC-Rocket (which sends a batch job to the cluster) using config.yml as a template, and is sourced by the bash scripts. It defines output directories for generated documentation and other key paths.
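For orientation, an HPC-Rocket config generally takes a shape like the sketch below; the values are placeholders (hostname, user, keyfile), and the exact schema should be checked against the HPC-Rocket documentation rather than taken from here:

```yaml
# Illustrative HPC-Rocket config (placeholder values, not this repository's actual template)
host: cluster.example.ac.uk
user: te1st
private_keyfile: ~/.ssh/id_rsa

copy:
  - from: slurm.sh
    to: slurm.sh
    overwrite: true

sbatch: slurm.sh
```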
Output directories: the directories synced with the documentation repo:
# These are paths within the documentation repository which we will sync generated docs into
STACKS_DIR = "stanage/software/stacks/"
IMPORTS_DIR = "referenceinfo/imports/stanage/packages/"
CUSTOM_DIR = "referenceinfo/imports/stanage/packages/custom/"
Module Paths and Titles: Paths for module files to parse, titles for stacks, and output directories:
# Modulepaths for each architecture on each stack (this example has only one stack)
modulepaths = {
'icelake': "/opt/apps/tuos/el9/modules/live/all:/opt/apps/tuos/common/modules/easybuild-only/all:/opt/apps/tuos/common/modules/live/all",
'znver3': "/opt/apps/tuos/el9-znver3/modules/live/all:/opt/apps/tuos/common/modules/easybuild-only/all:/opt/apps/tuos/common/modules/live/all"
}
# Titles for pages related to each stack
titles = [
"Icelake and Znver (OS: Rocky 9) Packages"
]
# This will construct the path (appending to STACKS_DIR) for the main package files for each stack
output_dirs = [
"el9-icelake-znver-stanage"
]
In this example, the category directories and package files will be stored under:
/stanage/software/stacks/el9-icelake-znver-stanage/{All,Bio,Cae,Chem,etc}
The title of the stack is 'Icelake and Znver (OS: Rocky 9) Packages'.
The module files located on the modulepaths are parsed; in this case there is one set for each architecture.
Set up the local environment:
./setup_local.sh
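For orientation, the script's shape is roughly as follows; this is a hedged sketch, and the environment name, environment.yml, and exact commands are assumptions rather than the script's actual contents:

```bash
#!/bin/bash
# Illustrative sketch of setup_local.sh (env name and environment.yml are hypothetical)
source config.env
source "$(conda info --base)/etc/profile.d/conda.sh"  # make `conda activate` available in a script
if ! conda env list | grep -q '^mods2docs '; then
    conda env create -n mods2docs -f environment.yml  # generate the env on first use
fi
conda activate mods2docs
```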
There are multiple scripts which can be used to initiate different workflows:
sync_stacks.sh completes the whole process from a local device: it sends a Slurm job to the cluster to collect the data, then syncs the results into a local docs repo.
Please ensure you have set up SSH keys for access to the cluster, and GitHub tokens or SSH keys for the remote repo.
./sync_stacks.sh --rocket --push
Note: The code at the end of the script that pushes changes to the new branch on the remote documentation repository has been commented out. This is useful if you want to build and review the output locally, especially on the first run, after which you will need to add the stacks directory to index.rst. Once you are confident that everything is correct and have merged a pull request into the main or master branch of the remote docs repository, you can uncomment this code to automate the process in future runs.
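If you do want to automate the push, the commented-out tail of the script amounts to something like the sketch below; the variable names, branch name, and exact git invocations here are hypothetical, so check the script itself:

```bash
# Hypothetical sketch of the commented-out push step in sync_stacks.sh
# git -C "$DOCS_REPO" checkout -b all-package-index-update
# git -C "$DOCS_REPO" add "$STACKS_DIR" "$IMPORTS_DIR"
# git -C "$DOCS_REPO" commit -m "Update All Package Index"
# git -C "$DOCS_REPO" push origin all-package-index-update
```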
More coming soon!
mods2docs is a tool designed to automate the generation of structured documentation from modular data or configuration files. It collects and processes information from modules, configuration files, or source code, and outputs it in a standardised documentation format (such as Markdown, HTML, or other formats). This is especially useful in environments where documentation needs to keep pace with frequent configuration or code changes, as mods2docs extracts information directly from the source and formats it into easy-to-read documents.
The primary script, start_pipeline.py, orchestrates the data parsing and documentation generation pipeline. This example uses the lmod parser and ReST writer to generate documentation:
python -m mods2docs.start_pipeline --parser lmod --writer rest
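The dispatch presumably resolves the --parser and --writer names to modules under mods2docs.parser and mods2docs.writer. A minimal sketch, assuming dynamic imports and hypothetical parse()/write() entry points (the real script may differ):

```python
# Minimal sketch of start_pipeline's dispatch (entry-point names are hypothetical)
import argparse
import importlib

ap = argparse.ArgumentParser()
ap.add_argument("--parser", default="lmod")
ap.add_argument("--writer", default="rest")
args = ap.parse_args()

# Resolve e.g. --parser lmod to mods2docs.parser.lmod
parser_mod = importlib.import_module(f"mods2docs.parser.{args.parser}")
writer_mod = importlib.import_module(f"mods2docs.writer.{args.writer}")

data = parser_mod.parse()   # hypothetical entry point
writer_mod.write(data)      # hypothetical entry point
```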
Two writer modules are currently available: mods2docs.writer.rest and mods2docs.writer.obsidian.
Below are some of the functions in mods2docs.writer.rest which we may wish to customise:
# Processes data parsed from modulepaths
process_modulepath(modulepaths, title, output_dir)
# which is then passed to write_package_file
write_package_file(category_dir, category, package, output_dir)
# all the following functions write files which are imported into the package file
write_sidebar_file(package, category, latest_version_info, output_dir)
write_description_file(package, latest_info, output_dir)
write_installation_file(package, latest_info, output_dir)
write_custom_file(package, output_dir)
write_dependencies(dependencies, output_dir, category, package, package_ref)
write_ml_file(package, package_infos, output_dir)
We recommend copying mods2docs/writer/rest.py to, for example, mods2docs/writer/rest-shef.py. Edit it as required; then you can use this module for writing your documentation:
python -m mods2docs.start_pipeline --parser lmod --writer rest-shef
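As an example of the kind of customisation this enables, here is a hedged sketch of overriding one function in rest-shef.py; only the signature comes from the list above, the body is illustrative, and the filename pattern is guessed from the CUDA example further down:

```python
# Illustrative override in mods2docs/writer/rest-shef.py (not the original implementation)
import os

def write_description_file(package, latest_info, output_dir):
    # Filename pattern guessed from the CUDA example (e.g. cuda-dscr-<stack>.rst)
    path = os.path.join(output_dir, f"{package.lower()}-dscr.rst")
    with open(path, "w") as fh:
        fh.write(f"{latest_info.get('description', 'No description available.')}\n")
```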
The generated files for each package found on the given module paths include:
- Description
- Sidebar - Latest version available on each architecture, date module file was last modified, and URL.
- Versions available - as module load commands in grouped tabs (for each architecture).
- Notes - detailing how to view build logs, etc.
- Dependencies - Shows the dependencies for the latest version (across architectures); each is a link to its respective page.
Each of the above is imported into a package's page when it is built, which allows these imports to be re-used elsewhere in the documentation.
# A list of files generated in imports_dir for CUDA
$ ls referenceinfo/imports/stanage/packages/cuda-*
cuda-dpnd-el9-icelake-znver-stanage.rst
cuda-dscr-el9-icelake-znver-stanage.rst
cuda-inst-el9-icelake-znver-stanage.rst
cuda-ml-el9-icelake-znver-stanage.rst
cuda-sdbr-el9-icelake-znver-stanage.rst
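To make "imported into a package's page" concrete: in Sphinx, a package page can pull each fragment in with an include directive. The page below is an illustrative sketch (the real generated layout may differ), reusing the CUDA filenames listed above:

```rst
.. Illustrative package page; the real generated layout may differ

CUDA
====

.. include:: /referenceinfo/imports/stanage/packages/cuda-sdbr-el9-icelake-znver-stanage.rst
.. include:: /referenceinfo/imports/stanage/packages/cuda-dscr-el9-icelake-znver-stanage.rst
.. include:: /referenceinfo/imports/stanage/packages/cuda-ml-el9-icelake-znver-stanage.rst
.. include:: /referenceinfo/imports/stanage/packages/cuda-dpnd-el9-icelake-znver-stanage.rst
```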
An example of built output is available here
The mods2docs.writer.obsidian module outputs markdown files into a directory which we can open with Obsidian, leveraging its built-in force-directed-graph functionality to visualise the relationships between the latest versions of all packages.
We can use the global graph to see all connections between packages or a local graph to focus on specific packages and their immediate connections.
python -m mods2docs.start_pipeline --parser lmod --writer obsidian
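Obsidian draws graph edges from wiki-links between notes, so each generated note presumably links to its dependencies. A hypothetical note might look like this (the layout, version, and dependency names are illustrative, not the writer's exact output):

```markdown
<!-- Hypothetical note CUDA.md; the [[wiki-links]] become edges in the graph -->
# CUDA

Latest version: 12.1.1

## Dependencies

- [[GCCcore]]
- [[binutils]]
```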
Local graph example for CUDA:
There is currently one parser module, mods2docs.parser.lmod, which utilises LuaRuntime to extract all information from module files and stores the data in a pickle.
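A minimal sketch of the LuaRuntime approach, assuming lupa as the Lua binding: stub out the Lmod API so that executing a modulefile simply records its calls, then pickle the result. The stubbed subset, the modulefile path, and the entry point are illustrative, not the actual lmod.py implementation:

```python
# Sketch of parsing an Lmod modulefile with lupa's LuaRuntime.
# Only a subset of the Lmod API is stubbed here; real modulefiles also call
# setenv, prepend_path, etc., which would need stubs too.
import pickle
from lupa import LuaRuntime

def parse_modulefile(path):
    lua = LuaRuntime(unpack_returned_tuples=True)
    record = {"whatis": [], "depends_on": []}

    # Replace the Lmod functions with Python callables that record their arguments
    g = lua.globals()
    g.whatis = lambda text: record["whatis"].append(text)
    g.depends_on = lambda *mods: record["depends_on"].extend(mods)
    g.help = lambda text: record.setdefault("help", text)

    with open(path) as fh:
        lua.execute(fh.read())
    return record

if __name__ == "__main__":
    # Hypothetical modulefile path on one of the modulepaths above
    data = parse_modulefile("/opt/apps/tuos/el9/modules/live/all/CUDA/12.1.1.lua")
    with open("modules.pkl", "wb") as fh:
        pickle.dump(data, fh)
```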
We welcome contributions to the All Package Index project! Whether you’d like to report a bug, suggest new features, or improve the documentation, your help is invaluable to the project’s success. Please follow the guidelines below to ensure a smooth collaboration process.
- Start by forking this repository to create a personal copy under your GitHub account.
- Clone the forked repository to your local machine for development.
git clone https://github.com/your-username/all_package_index.git
- Create a new branch to isolate your changes.
git checkout -b feature/your-feature-name
- Develop and test your changes locally.
- Commit your work with clear, descriptive commit messages.
git commit -m "Add feature or fix: brief description"
- Push to Your Fork: Push your branch to your fork on GitHub.
git push origin feature/your-feature-name
- Go to the original repository on GitHub, select "Pull Requests," and open a new pull request. Describe your changes and why they should be merged.
- Keep It Focused: Ensure each pull request addresses a single issue or feature.
- Stay in Sync: Regularly sync your fork with the original repository to prevent merge conflicts.
- Document Changes: Update the README or relevant documentation sections as necessary.
Thank you for your contribution! We appreciate your efforts to help improve the project.