Merge pull request #129 from PennLINC/docs/how_to_test

[DOCS] add guide on how to test BABS
PennLINC · Aug 2, 2023 · 84ce490
2 parents b924b86 + 6ebacf0
commit 84ce490

Showing 5 changed files with 184 additions and 3 deletions.
diff --git a/docs/source/developer.rst b/docs/source/developer.rst
@@ -21,6 +21,7 @@ Below is a detailed description of implementations.
 .. toctree::
    :maxdepth: 1
 
+   developer_how_to_test
    developer_config_yaml_file
    developer_babs_status
 

diff --git a/docs/source/developer_babs_status.rst b/docs/source/developer_babs_status.rst
@@ -133,6 +133,8 @@ when ``print(df)`` by python::
 
 Note: ``0`` at the beginning: index of pd.DataFrame
 
+.. _how_to_test_out_babs_status:
+
 ====================================
 How to test out ``babs-status``
 ====================================
@@ -144,8 +146,12 @@ Create pending or failed jobs
 Change/Add these in ``participant_job.sh``:
 
 - failed: see next section
-- pending: on SGE clusters: increase ``-l h_vmem`` and ``-l s_vmem``; increase ``-pe threaded N``
-- stalled (``eqw``): skip this for now. See Bergman email 12/20/22
+- pending: Please increase the cluster resources you request,
+  e.g., memory, number of CPUs, temporary disk space, etc.
+
+    - on Slurm clusters: increase ``#SBATCH --mem``, ``#SBATCH --tmp``, etc.
+    - on SGE clusters: increase ``-l h_vmem`` and ``-l s_vmem``; increase ``-pe threaded N``
+- stalled (``eqw`` on SGE clusters): skip this for now. See Bergman email 12/20/22
 
 After these changes, ``datalad save -m "message"`` and ``datalad push --to input``
 

diff --git a/docs/source/developer_how_to_test.rst b/docs/source/developer_how_to_test.rst
@@ -0,0 +1,170 @@
+*****************************
+How to test BABS
+*****************************
+
+After changing the source code, it's important to test BABS and confirm that it still behaves as expected.
+There are two general steps to test BABS:
+
+1. Run pytest, which CircleCI can also run automatically
+2. Manual tests on an HPC cluster (SGE or Slurm)
+
+BABS requires some manual tests because it is challenging to mimic an HPC job scheduler's behavior
+in the container required by CircleCI. However, we are working on building a container to do so,
+and we welcome help from other researchers - please
+see `issue #113 <https://github.com/PennLINC/babs/issues/113>`_ for more.
+
+================
+Step 1. pytest
+================
+
+The pytest suite of BABS can be run manually or automatically on CircleCI; neither requires an HPC cluster.
+
+------------------------------------
+Manually run pytest
+------------------------------------
+
+Currently, pytest can be run on Linux systems (preferred) as well as on Macs with an M1 chip.
+
+* If you run pytest on a local computer, please make sure Docker has been launched and is running;
+* If you run pytest on an HPC cluster, please make sure Singularity has been loaded.
+
+Before you run pytest, please install BABS in the following way so that necessary packages
+for our testing infrastructure will also be installed::
+    
+    cd <path/to/babs>    # change dir to the root of the cloned `babs` GitHub repository
+    pip install -e ".[tests]"    # quotes keep shells such as zsh from globbing the brackets
+
+At present, the pytest suite covers only ``babs-init`` and ``babs-check-setup``,
+because it is challenging to run tests interactively with a job scheduler.
+
+Please run the full test suite. It should finish without any error messages;
+warning messages are fine.
+
+To run all tests in pytest::
+
+    cd <path/to/babs>    # change dir to the root of the cloned `babs` GitHub repository
+    pytest -sv    # `-sv` is optional: `-s` shows printed messages, `-v` is verbose
+
+If you have access to multiple CPUs, you can speed up testing by running tests in parallel (``-n`` is provided by the ``pytest-xdist`` plugin)::
+
+    pytest -sv -n 2   # using 2 CPUs
+
+All tests are defined in ``tests/test_*.py``. To run the tests defined in a specific file::
+
+    pytest -sv tests/<test_*.py>   # replace `<test_*.py>` with the actual file name
+
+To run tests for a specific case (defined in ``@pytest.mark.parametrize()``)::
+
+    pytest -sv tests/<test_*.py>::<test_function_name>[toybidsapp-BIDS-single-ses-False-False]
+    # please replace `<*>` with actual file or function names;
+    # `[*]` is the combination of the parameters; above is just an example.
+
+    # e.g.: pytest -sv tests/test_babs_init.py::test_babs_init[toybidsapp-BIDS-single-ses-False-False]
+
+All command-line flags of ``pytest`` can be found `here <https://docs.pytest.org/en/stable/reference/reference.html#command-line-flags>`_.
+
+--------------------------------
+Automatic pytest via CircleCI
+--------------------------------
+
+Whenever a commit is pushed to GitHub, CircleCI is triggered and automatically runs pytest.
+
+=======================================================
+Step 2. Manual tests on an HPC cluster (SGE or Slurm)
+=======================================================
+
+Currently pytest does not cover ``babs-submit``, ``babs-status`` and ``babs-merge``.
+Therefore, we need to manually test them on an HPC cluster that uses the SGE or Slurm job scheduler.
+
+There are two general steps in manual testing:
+
+* Step 2.1 Tests using a toy BIDS dataset and the toy BIDS App
+* Step 2.2 Real application using a large-scale dataset and a real BIDS App
+
+Note that we provide a comprehensive list of tests here, which is important to go through before a new release
+(if there are major changes in job submission or status checking).
+However, for minor changes in the source code, comprehensive testing may not be necessary, and more focused tests may be sufficient.
+If you are not sure which tests are sufficient, we are happy to discuss it.
+
+--------------------------------------------------------------------
+General guidelines for testing ``babs-submit`` and ``babs-status``
+--------------------------------------------------------------------
+
+In theory, it's best to test on both SGE and Slurm systems. However, researchers may not have access
+to both systems. Therefore, if you make a pull request, please let us know which HPC job scheduler system
+you've used to test.
+
+For Step 2.1 (tests using a toy BIDS dataset and the toy BIDS App),
+if the looping over jobs (subjects in single-session data, or subject/session pairs in multi-session data) was changed,
+you should run two rounds of testing: one using a single-session dataset, the other using a multi-session dataset.
+Toy datasets can be found :ref:`here <example_input_BIDS_datasets_for_BABS>`.
+
+You may use the toy BIDS App for testing. See :doc:`here <preparation_container>` for more.
+
+After running each ``babs-submit`` or ``babs-status`` below,
+please check the printed messages and the updated ``job_status.csv``.
+This CSV file can be found at ``analysis/code/job_status.csv`` in a BABS project.
+An explanation of this CSV file can be found :ref:`here <detailed_description_of_job_status_csv>`.
+
+-----------------------------------------
+Step 2.1.1: Testing ``babs-check-setup``
+-----------------------------------------
+
+Comprehensive test checklist (please add ``--project-root``):
+
+- [ ] ``babs-check-setup --job-test`` --> see if the information summarized by BABS is correct
+  (e.g., information of designated environment and temporary workspace)
+
+------------------------------------
+Step 2.1.2: Testing ``babs-submit``
+------------------------------------
+
+Comprehensive test checklist (please add ``--project-root``):
+
+- [ ] ``babs-submit`` (to submit one job)
+- [ ] ``babs-submit --job``
+- [ ] ``babs-submit --count``
+- [ ] ``babs-submit --all``
+
+------------------------------------
+Step 2.1.3: Testing ``babs-status``
+------------------------------------
+
+Comprehensive test checklist (please add ``--project-root``):
+
+- [ ] ``babs-status``
+- [ ] ``babs-status --resubmit failed``
+- [ ] ``babs-status --resubmit pending``
+- [ ] ``babs-status --resubmit-job <sub_id/ses_id of a failed job>``
+- [ ] ``babs-status --resubmit-job <sub_id/ses_id of a pending job>``
+- [ ] ``babs-status --resubmit-job <sub_id/ses_id of a running job>`` --> expect BABS to say not to submit a running job
+- [ ] ``babs-status --container-config-yaml-file path/to/config.yaml`` for failed job auditing
+- [ ] ``babs-status --container-config-yaml-file path/to/config.yaml --job-account`` for failed job auditing
+
+
+Please check out :ref:`this page <how_to_test_out_babs_status>`
+for how to create failed and pending jobs.
+
+------------------------------------
+Step 2.1.4: Testing ``babs-merge``
+------------------------------------
+
+Comprehensive test checklist (please add ``--project-root``):
+
+- [ ] ``babs-merge``
+
+----------------------------------------------------------------------
+Step 2.2: Testing using a large-scale dataset + a real BIDS App
+----------------------------------------------------------------------
+This is to make sure that the updated code also works on a large-scale dataset
+and when using a real BIDS App (e.g., fMRIPrep, QSIPrep).
+This is especially important when you have updated the workflow of status updates,
+i.e., how ``job_status.csv`` is updated, or when you have revised the source code for generating BABS scripts
+and the changes are related to a real BIDS App.
+
+For example, you may use a dataset with hundreds (or more) of subjects or subject/session pairs.
+Run BABS commands, and check whether the content of the generated scripts is as expected.
+Then submit a few jobs.
+While the jobs are running, use ``babs-status`` to check their statuses and see
+how long this command takes. It should not take a long time (see :doc:`jobs` for example run times).
+Finally, check that you can successfully merge the results, get the zip file content, and unzip it.
diff --git a/docs/source/jobs.rst b/docs/source/jobs.rst
@@ -240,6 +240,8 @@ Below is an example python script of reading ``job_status.csv``::
 You can also slice ``df`` and extract only failed jobs, only jobs whose ``alert_message``
 matches with a specific string, etc.
 
+.. _detailed_description_of_job_status_csv:
+
 ==================================================
 Detailed description of ``job_status.csv``
 ==================================================

diff --git a/docs/source/preparation_input_dataset.rst b/docs/source/preparation_input_dataset.rst
@@ -104,8 +104,10 @@ as input dataset for your current BABS project ("BABS project B"), you may:
 .. Developer's Notes: In theory the user could directly provide ``ria+file://xxx/output_ria#~data`` as the path to the input dataset in ``babs-init``,
 ..      but we hope they could test if this string is correct by letting them clone once.
 
+.. _example_input_BIDS_datasets_for_BABS:
+
 ================================================================
-Examples input BIDS datasets for BABS
+Example input BIDS datasets for BABS
 ================================================================
 .. list-table:: Example input datasets available on OSF
    :widths: 25 25 25
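
Note on the parametrized test IDs in `developer_how_to_test.rst`: the guide describes the `[*]` suffix as "the combination of the parameters". Below is a minimal sketch of how such a suffix is assembled, assuming pytest's default behavior of joining string, number, and boolean parameter values with `-`. The helper names and the split of the guide's example ID into five values are hypothetical and are not taken from the BABS test code.

```python
def parametrize_id(values):
    """Join one parameter set into a pytest-style ID suffix.

    Covers only strings, numbers, and booleans, which pytest renders
    with their plain text form by default.
    """
    return "-".join(str(v) for v in values)


def node_id(test_file, test_function, values):
    """Build the `<file>::<function>[<id>]` selector passed to pytest."""
    return f"{test_file}::{test_function}[{parametrize_id(values)}]"


# Hypothetical split of the example ID from the guide into five values.
case = ("toybidsapp", "BIDS", "single-ses", False, False)
selected = node_id("tests/test_babs_init.py", "test_babs_init", case)

# Single quotes keep the shell from treating the brackets as a glob pattern.
print(f"pytest -sv '{selected}'")
```

To see the IDs pytest actually generates rather than reconstructing them, `pytest --collect-only -q` lists every collected test with its full node ID.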