325 mixed model in limma #339

KamilMaliszArdigen · 2024-11-05T15:03:18Z

This PR updates the limma module to add new logic for mixed models.

PR checklist

github-actions · 2024-11-05T15:05:59Z

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 7065efa

+| ✅ 300 tests passed       |+
#| ❔   6 tests were ignored |#
!| ❗   4 tests had warnings |!

❗ Test warnings:

pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
pipeline_todos - TODO string in base.config: Check the defaults for all processes

❔ Tests ignored:

files_exist - File is ignored: assets/multiqc_config.yml
nextflow_config - Config default ignored: params.report_file
nextflow_config - Config default ignored: params.logo_file
nextflow_config - Config default ignored: params.css_file
nextflow_config - Config default ignored: params.citations_file
multiqc_config - multiqc_config

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-differentialabundance_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-differentialabundance_logo_light.png
files_exist - File found: docs/images/nf-core-differentialabundance_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: conf/igenomes_ignored.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-differentialabundance_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/NfcoreTemplate.groovy
files_exist - File not found check: lib/Utils.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/WorkflowMain.groovy
files_exist - File not found check: lib/WorkflowDifferentialabundance.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Found nf-schema plugin
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: validation.help.enabled
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable found: validation.help.beforeText
nextflow_config - Config variable found: validation.help.afterText
nextflow_config - Config variable found: validation.help.command
nextflow_config - Config variable found: validation.summary.beforeText
nextflow_config - Config variable found: validation.summary.afterText
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config variable (correctly) not found: params.max_cpus
nextflow_config - Config variable (correctly) not found: params.max_memory
nextflow_config - Config variable (correctly) not found: params.max_time
nextflow_config - Config variable (correctly) not found: params.validationFailUnrecognisedParams
nextflow_config - Config variable (correctly) not found: params.validationLenientMode
nextflow_config - Config variable (correctly) not found: params.validationSchemaIgnoreParams
nextflow_config - Config variable (correctly) not found: params.validationShowHiddenParams
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 1.6.0dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.study_name= study
nextflow_config - Config default value correct: params.study_type= rnaseq
nextflow_config - Config default value correct: params.study_abundance_type= counts
nextflow_config - Config default value correct: params.affy_cel_files_archive= null
nextflow_config - Config default value correct: params.querygse= null
nextflow_config - Config default value correct: params.observations_id_col= sample
nextflow_config - Config default value correct: params.observations_type= sample
nextflow_config - Config default value correct: params.features_id_col= gene_id
nextflow_config - Config default value correct: params.features_name_col= gene_name
nextflow_config - Config default value correct: params.features_type= gene
nextflow_config - Config default value correct: params.features_metadata_cols= gene_id,gene_name,gene_biotype
nextflow_config - Config default value correct: params.features_gtf_feature_type= transcript
nextflow_config - Config default value correct: params.features_gtf_table_first_field= gene_id
nextflow_config - Config default value correct: params.affy_file_name_col= file
nextflow_config - Config default value correct: params.affy_background= true
nextflow_config - Config default value correct: params.affy_bgversion= 2
nextflow_config - Config default value correct: params.affy_cdfname= null
nextflow_config - Config default value correct: params.affy_build_annotation= true
nextflow_config - Config default value correct: params.proteus_measurecol_prefix= LFQ intensity
nextflow_config - Config default value correct: params.proteus_norm_function= normalizeMedian
nextflow_config - Config default value correct: params.proteus_plotsd_method= violin
nextflow_config - Config default value correct: params.proteus_plotmv_loess= true
nextflow_config - Config default value correct: params.proteus_palette_name= Set1
nextflow_config - Config default value correct: params.filtering_min_abundance= 1.0
nextflow_config - Config default value correct: params.filtering_min_samples= 1.0
nextflow_config - Config default value correct: params.filtering_min_proportion_not_na= 0.5
nextflow_config - Config default value correct: params.exploratory_clustering_method= ward.D2
nextflow_config - Config default value correct: params.exploratory_cor_method= spearman
nextflow_config - Config default value correct: params.exploratory_n_features= 500
nextflow_config - Config default value correct: params.exploratory_whisker_distance= 1.5
nextflow_config - Config default value correct: params.exploratory_mad_threshold= -5
nextflow_config - Config default value correct: params.exploratory_main_variable= auto_pca
nextflow_config - Config default value correct: params.exploratory_assay_names= raw,normalised,variance_stabilised
nextflow_config - Config default value correct: params.exploratory_final_assay= variance_stabilised
nextflow_config - Config default value correct: params.exploratory_palette_name= Set1
nextflow_config - Config default value correct: params.differential_feature_id_column= gene_id
nextflow_config - Config default value correct: params.differential_fc_column= log2FoldChange
nextflow_config - Config default value correct: params.differential_pval_column= pvalue
nextflow_config - Config default value correct: params.differential_qval_column= padj
nextflow_config - Config default value correct: params.differential_min_fold_change= 2.0
nextflow_config - Config default value correct: params.differential_max_pval= 1.0
nextflow_config - Config default value correct: params.differential_max_qval= 0.05
nextflow_config - Config default value correct: params.differential_feature_name_column= gene_name
nextflow_config - Config default value correct: params.differential_foldchanges_logged= true
nextflow_config - Config default value correct: params.differential_palette_name= Set1
nextflow_config - Config default value correct: params.deseq2_test= Wald
nextflow_config - Config default value correct: params.deseq2_fit_type= parametric
nextflow_config - Config default value correct: params.deseq2_sf_type= ratio
nextflow_config - Config default value correct: params.deseq2_min_replicates_for_replace= 7
nextflow_config - Config default value correct: params.deseq2_independent_filtering= true
nextflow_config - Config default value correct: params.deseq2_lfc_threshold= 0
nextflow_config - Config default value correct: params.deseq2_alt_hypothesis= greaterAbs
nextflow_config - Config default value correct: params.deseq2_p_adjust_method= BH
nextflow_config - Config default value correct: params.deseq2_alpha= 0.1
nextflow_config - Config default value correct: params.deseq2_minmu= 0.5
nextflow_config - Config default value correct: params.deseq2_vs_method= vst
nextflow_config - Config default value correct: params.deseq2_shrink_lfc= true
nextflow_config - Config default value correct: params.deseq2_cores= 1
nextflow_config - Config default value correct: params.deseq2_vs_blind= true
nextflow_config - Config default value correct: params.deseq2_vst_nsub= 1000
nextflow_config - Config default value correct: params.limma_spacing= null
nextflow_config - Config default value correct: params.limma_block= null
nextflow_config - Config default value correct: params.limma_correlation= null
nextflow_config - Config default value correct: params.limma_method= ls
nextflow_config - Config default value correct: params.limma_proportion= 0.01
nextflow_config - Config default value correct: params.limma_stdev_coef_lim= 0.1,4
nextflow_config - Config default value correct: params.limma_winsor_tail_p= 0.05,0.1
nextflow_config - Config default value correct: params.limma_lfc= 0
nextflow_config - Config default value correct: params.limma_adjust_method= BH
nextflow_config - Config default value correct: params.limma_p_value= 1.0
nextflow_config - Config default value correct: params.limma_use_voom= false
nextflow_config - Config default value correct: params.gsea_permute= phenotype
nextflow_config - Config default value correct: params.gsea_nperm= 1000
nextflow_config - Config default value correct: params.gsea_scoring_scheme= weighted
nextflow_config - Config default value correct: params.gsea_metric= Signal2Noise
nextflow_config - Config default value correct: params.gsea_sort= real
nextflow_config - Config default value correct: params.gsea_order= descending
nextflow_config - Config default value correct: params.gsea_set_max= 500
nextflow_config - Config default value correct: params.gsea_set_min= 15
nextflow_config - Config default value correct: params.gsea_norm= meandiv
nextflow_config - Config default value correct: params.gsea_rnd_type= no_balance
nextflow_config - Config default value correct: params.gsea_make_sets= true
nextflow_config - Config default value correct: params.gsea_num= 100
nextflow_config - Config default value correct: params.gsea_plot_top_x= 20
nextflow_config - Config default value correct: params.gsea_rnd_seed= timestamp
nextflow_config - Config default value correct: params.gprofiler2_significant= true
nextflow_config - Config default value correct: params.gprofiler2_measure_underrepresentation= false
nextflow_config - Config default value correct: params.gprofiler2_evcodes= false
nextflow_config - Config default value correct: params.gprofiler2_max_qval= 0.05
nextflow_config - Config default value correct: params.gprofiler2_domain_scope= annotated
nextflow_config - Config default value correct: params.gprofiler2_min_diff= 1
nextflow_config - Config default value correct: params.gprofiler2_palette_name= Blues
nextflow_config - Config default value correct: params.shinyngs_build_app= true
nextflow_config - Config default value correct: params.shinyngs_shinyapps_account= null
nextflow_config - Config default value correct: params.shinyngs_shinyapps_app_name= null
nextflow_config - Config default value correct: params.gene_sets_files= null
nextflow_config - Config default value correct: params.report_title= null
nextflow_config - Config default value correct: params.report_author= null
nextflow_config - Config default value correct: params.report_description= null
nextflow_config - Config default value correct: params.report_scree= true
nextflow_config - Config default value correct: params.report_round_digits= 4
nextflow_config - Config default value correct: params.igenomes_base= s3://ngi-igenomes/igenomes/
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.validate_params= true
nextflow_config - Config default value correct: params.pipelines_testdata_base_path= https://raw.githubusercontent.com/nf-core/test-datasets/
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-differentialabundance_logo_light.png matches the template
files_unchanged - docs/images/nf-core-differentialabundance_logo_light.png matches the template
files_unchanged - docs/images/nf-core-differentialabundance_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_ci - '.github/workflows/ci.yml' is triggered on expected events
actions_ci - '.github/workflows/ci.yml' checks minimum NF version
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 24.04.2, Config: 24.04.2
readme - README Zenodo placeholder was replaced with DOI.
plugin_includes - No wrong validation plugin imports have been found
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (0 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
actions_schema_validation - Workflow validation passed: template_version_comment.yml
actions_schema_validation - Workflow validation passed: download_pipeline.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: clean-up.yml
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
modules_config - conf/modules.config found and not ignored.
modules_config - GUNZIP_GTF found in conf/modules.config and Nextflow scripts.
modules_config - GTF_TO_TABLE found in conf/modules.config and Nextflow scripts.
modules_config - VALIDATOR found in conf/modules.config and Nextflow scripts.
modules_config - AFFY_JUSTRMA_RAW found in conf/modules.config and Nextflow scripts.
modules_config - AFFY_JUSTRMA_NORM found in conf/modules.config and Nextflow scripts.
modules_config - PROTEUS found in conf/modules.config and Nextflow scripts.
modules_config - GEOQUERY_GETGEO found in conf/modules.config and Nextflow scripts.
modules_config - DESEQ2_NORM found in conf/modules.config and Nextflow scripts.
modules_config - DESEQ2_DIFFERENTIAL found in conf/modules.config and Nextflow scripts.
modules_config - LIMMA_DIFFERENTIAL found in conf/modules.config and Nextflow scripts.
modules_config - FILTER_DIFFTABLE found in conf/modules.config and Nextflow scripts.
modules_config - GSEA_GSEA found in conf/modules.config and Nextflow scripts.
modules_config - GPROFILER2_GOST found in conf/modules.config and Nextflow scripts.
modules_config - PLOT_EXPLORATORY found in conf/modules.config and Nextflow scripts.
modules_config - PLOT_DIFFERENTIAL found in conf/modules.config and Nextflow scripts.
modules_config - SHINYNGS_APP found in conf/modules.config and Nextflow scripts.
modules_config - RMARKDOWNNOTEBOOK found in conf/modules.config and Nextflow scripts.
modules_config - MAKE_REPORT_BUNDLE found in conf/modules.config and Nextflow scripts.
modules_config - CUSTOM_MATRIXFILTER found in conf/modules.config and Nextflow scripts.
modules_config - CUSTOM_TABULARTOGSEACLS found in conf/modules.config and Nextflow scripts.
modules_config - CUSTOM_TABULARTOGSEAGCT found in conf/modules.config and Nextflow scripts.
modules_config - TABULAR_TO_GSEA_CHIP found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline
nfcore_yml - nf-core version in .nf-core.yml is set to the latest version: 3.0.2

Run details

nf-core/tools version 3.0.2
Run at 2024-12-03 13:55:22

KamilMaliszArdigen · 2024-11-06T08:35:14Z

Hi @pinin4fjords, this PR updates the limma module to its latest version. I will be more than happy to provide any additional information if needed.

KamilMaliszArdigen · 2024-11-12T12:17:11Z

@pinin4fjords I would really appreciate your input on this. Thank you in advance.

DSchreyer · 2024-11-15T07:13:52Z

I am getting this error when running a simple comparison. It seems that in FILTER_DIFFTABLE it expects a log2FoldChange column which does not exist.

This is the command:

        nextflow run output/differentialabundance/main.nf \
          --input ${ diffabundance_samplesheet } \
          --matrix ${ diffabundance_counts } \
          --contrasts ${ diffabundance_contrasts } \
          --outdir output \
          --observations_id_col sample \
          --features_id_col geneID \
          -profile rnaseq \
          --differential_use_limma

Workflow execution completed unsuccessfully
The exit status of the task that caused the workflow execution to fail was: 1

Error executing process > 'NFCORE_DIFFERENTIALABUNDANCE:DIFFERENTIALABUNDANCE:FILTER_DIFFTABLE (1)'

Caused by:
  Process `NFCORE_DIFFERENTIALABUNDANCE:DIFFERENTIALABUNDANCE:FILTER_DIFFTABLE (1)` terminated with an error exit status (1)


Command executed:

  #!/usr/bin/env python
  
  from math import log2
  from os import path
  import pandas as pd
  import platform
  from sys import exit
  
  # 1. Check that the current logFC/padj is not NA
  # 2. Check that the current logFC is >= threshold (abs does not work, so use a workaround)
  # 3. Check that the current padj is <= threshold
  # If this is true, the row is written to the new file, otherwise not
  if not any("VISIT_V05_vs_V02.limma.results.tsv".endswith(ext) for ext in [".csv", ".tsv", ".txt"]):
      exit("Please provide a .csv, .tsv or .txt file!")
  
  table = pd.read_csv("VISIT_V05_vs_V02.limma.results.tsv", sep=("," if "VISIT_V05_vs_V02.limma.results.tsv".endswith(".csv") else "	"), header=0)
  logFC_threshold = log2(float("2.0"))
  table = table[~table["log2FoldChange"].isna() &
              ~table["padj"].isna() &
              (pd.to_numeric(table["log2FoldChange"], errors='coerce').abs() >= float(logFC_threshold)) &
              (pd.to_numeric(table["padj"], errors='coerce') <= float("0.05"))]
  
  table.to_csv(path.splitext(path.basename("VISIT_V05_vs_V02.limma.results.tsv"))[0]+"_filtered.tsv", sep="	", index=False)
  
  with open('versions.yml', 'a') as version_file:
      version_file.write('"NFCORE_DIFFERENTIALABUNDANCE:DIFFERENTIALABUNDANCE:FILTER_DIFFTABLE":' + "\n")
      version_file.write("    pandas: " + str(pd.__version__) + "\n")

Command exit status:
  1

Command output:
  (empty)

Command error:
  Traceback (most recent call last):
    File "/usr/local/lib/python3.11/site-packages/pandas/core/indexes/base.py", line 3803, in get_loc
      return self._engine.get_loc(casted_key)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "pandas/_libs/index.pyx", line 138, in pandas._libs.index.IndexEngine.get_loc
    File "pandas/_libs/index.pyx", line 165, in pandas._libs.index.IndexEngine.get_loc
    File "pandas/_libs/hashtable_class_helper.pxi", line 5745, in pandas._libs.hashtable.PyObjectHashTable.get_item
    File "pandas/_libs/hashtable_class_helper.pxi", line 5753, in pandas._libs.hashtable.PyObjectHashTable.get_item
  KeyError: 'log2FoldChange'
  
  The above exception was the direct cause of the following exception:
  
  Traceback (most recent call last):
    File ".command.sh", line 18, in <module>
      table = table[~table["log2FoldChange"].isna() &
                     ~~~~~^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/pandas/core/frame.py", line 3805, in __getitem__
      indexer = self.columns.get_loc(key)
                ^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/pandas/core/indexes/base.py", line 3805, in get_loc
      raise KeyError(key) from err
  KeyError: 'log2FoldChange'

KamilMaliszArdigen · 2024-11-15T13:51:01Z

@DSchreyer The deseq2 and limma outputs use different column names, so we created a dedicated profile, rnaseq_limma, that adjusts these settings for the downstream steps. You can see exactly which options are set here:

differentialabundance/conf/rnaseq_limma.config

Line 26 in 81cc521

// Differential options

Please try launching the pipeline with the following command:

        nextflow run output/differentialabundance/main.nf \
          --input ${ diffabundance_samplesheet } \
          --matrix ${ diffabundance_counts } \
          --contrasts ${ diffabundance_contrasts } \
          --outdir output \
          --observations_id_col sample \
          --features_id_col geneID \
          -profile rnaseq_limma
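
The column-name mismatch behind the KeyError can be illustrated with a minimal sketch. It assumes the limma results table uses the column names produced by limma's topTable() (logFC, adj.P.Val), while the filtering step defaults to the DESeq2-style names (log2FoldChange, padj). The helper `filter_difftable` and the example data are hypothetical and are not the pipeline's actual code.

```python
import io
from math import log2

import pandas as pd

# Fold-change and adjusted p-value column names. The limma names are those
# produced by limma's topTable(); the DESeq2-style names are the pipeline defaults.
DESEQ2_COLUMNS = {"fc": "log2FoldChange", "qval": "padj"}
LIMMA_COLUMNS = {"fc": "logFC", "qval": "adj.P.Val"}


def filter_difftable(table, columns, min_fold_change=2.0, max_qval=0.05):
    """Keep rows passing the fold-change and q-value thresholds (hypothetical helper)."""
    # Fail early with a readable message instead of a bare KeyError.
    missing = [name for name in columns.values() if name not in table.columns]
    if missing:
        raise ValueError(f"Missing column(s) {missing}; found {list(table.columns)}")
    fc = pd.to_numeric(table[columns["fc"]], errors="coerce")
    qval = pd.to_numeric(table[columns["qval"]], errors="coerce")
    keep = fc.notna() & qval.notna() & (fc.abs() >= log2(min_fold_change)) & (qval <= max_qval)
    return table[keep]


# Made-up limma-style results, for illustration only.
limma_tsv = "gene_id\tlogFC\tadj.P.Val\ng1\t1.5\t0.01\ng2\t-2.0\t0.04\ng3\t0.3\t0.001\ng4\tNA\t0.01\n"
limma_results = pd.read_csv(io.StringIO(limma_tsv), sep="\t")

filtered = filter_difftable(limma_results, LIMMA_COLUMNS)
print(list(filtered["gene_id"]))
```

Passing `DESEQ2_COLUMNS` with the same table raises the `ValueError`, which mirrors what happens when limma output is filtered with the default rnaseq profile settings.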

DSchreyer · 2024-11-18T12:11:34Z

@KamilMaliszArdigen Thanks, yes this helped to solve some of the issues. However, I am still getting the following error in the PLOT_EXPLORATORY process with my own dataset.

  In cond_log2_transform_matrix(matrix_data = assay_data[[index]],  :
    NaNs produced
  Fontconfig error: No writable cache directories
  Error in density.default(cond_log2_transform_matrix(plotmatrices[[pm]][,  : 
    'x' contains missing values
  Calls: ggplot_densityplot ... lapply -> FUN -> density -> density -> density.default
  Execution halted

Could this be caused by negative values in the normalised count matrix? I also tested the test_rnaseq_limma profile, which works as expected. However, I checked the normalised count tables from the test data and they contain no negative values. Could this be the issue?

KamilMaliszArdigen · 2024-11-18T18:22:18Z

@DSchreyer The main difference is that the rnaseq profile sets exploratory_log2_assays = 'raw,normalised', whereas rnaseq_limma leaves it empty. I'm not sure what is happening with your data; please take a look at the work directory and the LIMMA count table. I expect that negative values may be present. If so, exploratory_log2_assays needs to stay empty, at least as I understand the plotting logic.
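
As a rough illustration of why negative values break a log2 step, here is a minimal sketch. The matrix and the helper `has_negative_values` are made up for illustration and are not part of the pipeline.

```python
import numpy as np
import pandas as pd


def has_negative_values(matrix: pd.DataFrame) -> bool:
    """Return True if any numeric entry of the matrix is below zero."""
    return bool((matrix.select_dtypes(include="number") < 0).any().any())


# Made-up normalised abundances; g2 is negative in sample s1.
normalised = pd.DataFrame({"s1": [5.2, -0.7], "s2": [3.1, 1.4]}, index=["g1", "g2"])

# log2 of a negative number is undefined, so numpy returns NaN for that entry.
with np.errstate(invalid="ignore"):
    logged = np.log2(normalised)

print(has_negative_values(normalised))  # True
print(int(logged.isna().sum().sum()))   # 1
```

A check like this on the normalised table in the work directory would show whether a further log2 transformation is safe for a given dataset.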

KamilMaliszArdigen · 2024-11-19T12:35:04Z

@nf-core-bot fix linting

DSchreyer · 2024-11-20T13:12:41Z

Hi @KamilMaliszArdigen, yes, I checked and there are negative values. Using --limma_use_voom false skips the plotting, which resolves the issue. Do you think that is a good idea, or could it interfere with the previous steps?

KamilMaliszArdigen · 2024-11-22T10:16:43Z

This will not work as expected: voom is responsible for normalising the RNA-seq data, so disabling it means the data are processed without normalisation. I recommend setting --exploratory_log2_assays = '' instead; the log2 transformation will then simply be skipped during plot generation.
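
For context on where the negative values come from, here is a minimal sketch of the log2 counts-per-million transformation that voom applies, with offsets of 0.5 on the counts and 1 on the library size. It covers only the transformation, not voom's precision weights, and assumes library sizes are plain column sums with no normalisation factors. The counts are made up.

```python
import numpy as np


def voom_like_logcpm(counts):
    """log2 counts-per-million with voom's offsets of 0.5 (counts) and 1 (library size)."""
    counts = np.asarray(counts, dtype=float)
    lib_size = counts.sum(axis=0)  # one library size per sample (column)
    return np.log2((counts + 0.5) / (lib_size + 1.0) * 1e6)


# Made-up counts: rows are genes, columns are samples.
counts = [[0, 3], [1_500_000, 1_200_000], [500_000, 800_000]]
logcpm = voom_like_logcpm(counts)
print(logcpm.round(2))
```

Any feature below one count per million ends up with a negative log-CPM, so values that are already on a log scale cannot be log2-transformed a second time for plotting.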

pinin4fjords

Could this go into the usage docs please?

DSchreyer · 2024-11-28T14:11:55Z

Using --exploratory_log2_assays "" \ results in the following error:

The following invalid input values have been detected:

* --exploratory_log2_assays (true): Value is [boolean] but should be [string]

DSchreyer · 2024-11-28T14:23:06Z

I experience another issue when trying to run limma with the blocking column:

contrasts.csv

id,variable,reference,target,blocking
TRT_VISIT_TRT_Dose1_V05_vs_V02,TRT_VISIT,TRT_Dose1_V02,TRT_Dose1_V05,SUBJID

error:

Coefficients not estimable: TRT_VISIT.TRT_Dose1_V05 TRT_VISIT.TRT_Dose2_V05
Warning message:
Partial NA coefficients for 1150 probe(s)
Coefficients not estimable: TRT_VISIT.TRT_Dose1_V05 TRT_VISIT.TRT_Dose2_V05
Warning message:
Partial NA coefficients for 1150 probe(s)
Error in contrasts.fit(fit, contrast.matrix) :
  trying to take contrast of non-estimable coefficient
Execution halted

Without the blocking column it works normally. Is this an issue on my side, or something that needs to be fixed at the pipeline level? This relates to the mixed-model analysis.

pinin4fjords · 2024-11-28T15:25:57Z

@DSchreyer could you maybe make future support queries in Slack (differentialabundance channel) please? This PR is not really related.

But it sounds like you're specifying a blocking variable that's non-independent from your variable of interest.
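To illustrate what a blocking variable that is not independent of the variable of interest can look like, here is a minimal sketch. It assumes the blocking variable enters the design matrix as fixed-effect columns, which is an assumption and not necessarily how the module builds its model. The subjects, doses and visits are made up: each subject receives only one dose, so the subject columns partly determine the TRT_VISIT columns and the design loses rank.

```python
from fractions import Fraction


def matrix_rank(rows):
    """Rank of a small matrix via exact Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    rank = 0
    for col in range(len(m[0])):
        # Find a row at or below the current rank with a non-zero entry.
        pivot = next((r for r in range(rank, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            if factor != 0:
                m[r] = [a - factor * b for a, b in zip(m[r], m[rank])]
        rank += 1
    return rank


# Hypothetical samples: (subject, TRT_VISIT level); each subject has one dose.
samples = [
    ("S1", "Dose1_V02"), ("S1", "Dose1_V05"),
    ("S2", "Dose1_V02"), ("S2", "Dose1_V05"),
    ("S3", "Dose2_V02"), ("S3", "Dose2_V05"),
    ("S4", "Dose2_V02"), ("S4", "Dose2_V05"),
]
group_cols = ["Dose1_V05", "Dose2_V02", "Dose2_V05"]  # Dose1_V02 is the reference
subject_cols = ["S2", "S3", "S4"]                     # S1 is the reference


def design_matrix(samples, group_cols, subject_cols):
    """Intercept, then group indicators, then subject indicators."""
    return [
        [1] + [int(g == c) for c in group_cols] + [int(s == c) for c in subject_cols]
        for s, g in samples
    ]


with_block = design_matrix(samples, group_cols, subject_cols)
without_block = design_matrix(samples, group_cols, [])

print(len(with_block[0]), matrix_rank(with_block))
print(len(without_block[0]), matrix_rank(without_block))
```

With the subject columns included, the design has seven columns but rank six, so not every coefficient can be estimated. Without them, the four columns have rank four.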

KamilMaliszArdigen · 2024-11-29T15:13:26Z

@nf-core-bot fix linting

KamilMaliszArdigen · 2024-11-29T15:22:24Z

Did you pass --exploratory_log2_assays="" or --exploratory_log2_assays "" on the command line when launching Nextflow? Nextflow evaluates the two forms differently.
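One part of the difference is visible before Nextflow is even involved: a POSIX shell hands the two spellings to the program as different argument lists. The sketch below uses Python's `shlex.split` to mimic shell tokenisation. It only shows the tokens; how Nextflow then interprets each form is not shown here.

```python
import shlex

# Space-separated form: the flag and an empty string arrive as two arguments.
spaced = shlex.split('--exploratory_log2_assays ""')

# Equals form: a single argument ending in "=" arrives.
joined = shlex.split('--exploratory_log2_assays=""')

print(spaced)
print(joined)
```

In the first case the value is a separate, empty argument; in the second the name and the empty value travel together in one argument.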

grst · 2024-12-20T10:15:55Z

Hi @KamilMaliszArdigen,

Thanks for your work on this! We aligned within our team at BI and with @pinin4fjords, and we decided to close this PR and move forward with implementing DREAM (#363) instead. This is a generalization of limma that works with an arbitrary number of random effects, doesn't require manually adding columns to the metadata table, and allows random effects to be specified conveniently using a Wilkinson formula, e.g. ~ timepoint + (1 | subject_id).

I know it sucks throwing this away, but sometimes one needs to get started on something to find out that it's not the best way forward.

limma module update to propagate mixed models logic

6a1a02b

KamilMaliszArdigen linked an issue Nov 5, 2024 that may be closed by this pull request

Mixed-model in Limma #325

Open

KamilMaliszArdigen changed the base branch from master to dev November 5, 2024 15:04

nf-core deleted a comment from github-actions bot Nov 5, 2024

KamilMaliszArdigen added 2 commits November 5, 2024 16:06

Changelog update

174bae5

Readme update

81cc521

KamilMaliszArdigen requested a review from pinin4fjords November 5, 2024 15:57

pinin4fjords reviewed Nov 12, 2024

README.md (outdated, resolved)

conf/modules.config (outdated, resolved)

Update README.md with mixed model instruction

088f788

[automated] Fix code linting

ff75bb1

KamilMaliszArdigen requested a review from pinin4fjords November 22, 2024 10:16

Merge branch 'dev' into 325-mixed-model-in-limma

b2a122b

pinin4fjords requested changes Nov 22, 2024

README.md (outdated, resolved)

conf/modules.config (outdated, resolved)

After review updates

dc88a9b

nf-test update

e24a3af

nf-test update

9e6c918

[automated] Fix code linting

6090667

grst mentioned this pull request Dec 2, 2024

Support for linear mixed effects models via DREAM #363

Open

KamilMaliszArdigen added 2 commits December 2, 2024 10:48

nf-test update

8da7e01

Merge branch '325-mixed-model-in-limma' of github.com:nf-core/differentialabundance into 325-mixed-model-in-limma

7065efa

grst closed this Dec 20, 2024
