diff --git a/docs/alert/vgsc.ipynb b/docs/alert/SA-1-Vgsc.ipynb similarity index 99% rename from docs/alert/vgsc.ipynb rename to docs/alert/SA-1-Vgsc.ipynb index b09135f..fb7f531 100644 --- a/docs/alert/vgsc.ipynb +++ b/docs/alert/SA-1-Vgsc.ipynb @@ -62,22 +62,7 @@ "id": "1f2288aa-fe3d-44c2-88be-aa897d9b4b49", "metadata": {}, "source": [ - "# *Vgsc*\n", - "\n", - "AGAP004707 (*Vgsc*)\n", - "\n", - "Mutations in this gene cause resistance to pyrethroid insecticides and DDT. The most common of these mutations is L995F, which is found across many insect taxa. In *Anopheles gambiae s.l*, we observe many distinct variants under selection, for example, L995F, L995S, and a haplotype containing both V402L and I1527T [(Clarkson *et al.,* 2021)](https://doi.org/10.1111/mec.15845). \n", - "\n", - "This form of resistance was first reported in *Anopheles gambiae* in 1998 [(Martinez-Torres *et al.,* 1998)](https://pubmed.ncbi.nlm.nih.gov/9535162/). Mutants at this locus are now found throughout sub-Saharan Africa. " - ] - }, - { - "cell_type": "markdown", - "id": "97da6cd4-0dc2-4ba4-8cac-4f93edc4badc", - "metadata": {}, - "source": [ - "## Cohorts affected\n", - "Overlapping signals of selection are found in the following cohorts. " + "# SA-1 (*Vgsc*)" ] }, { @@ -117,30 +102,6 @@ "region_signals = region_signals.merge(cohorts)" ] }, - { - "cell_type": "code", - "execution_count": null, - "id": "9222a438-686f-410f-8fd9-02428c2d3095", - "metadata": { - "tags": [ - "remove-input" - ] - }, - "outputs": [], - "source": [ - "cohort_links = ['' + row[\"cohort_label\"] + \"\" for i, row in region_signals.iterrows()]\n", - "html_message = '
  • ' + '
  • '.join(cohort_links) + '
  • '\n", - "HTML(html_message)" - ] - }, - { - "cell_type": "markdown", - "id": "b29f7a5a-c33d-4f5a-b6d3-9023ab7ebe55", - "metadata": {}, - "source": [ - "## Signals and candidate genes" - ] - }, { "cell_type": "code", "execution_count": null, @@ -315,6 +276,45 @@ "plot_region_summary(df=region_signals.reset_index(), region_name=region_name)" ] }, + { + "cell_type": "markdown", + "id": "97da6cd4-0dc2-4ba4-8cac-4f93edc4badc", + "metadata": {}, + "source": [ + "## Cohorts affected\n", + "Overlapping signals of selection are found in the following cohorts. " + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "9222a438-686f-410f-8fd9-02428c2d3095", + "metadata": { + "tags": [ + "remove-input" + ] + }, + "outputs": [], + "source": [ + "cohort_links = ['' + row[\"cohort_label\"] + \"\" for i, row in region_signals.iterrows()]\n", + "html_message = '
  • ' + '
  • '.join(cohort_links) + '
  • '\n", + "HTML(html_message)" + ] + }, + { + "cell_type": "markdown", + "id": "b29f7a5a-c33d-4f5a-b6d3-9023ab7ebe55", + "metadata": {}, + "source": [ + "## Candidate genes\n", + "\n", + "AGAP004707 (*Vgsc*)\n", + "\n", + "Mutations in this gene cause resistance to pyrethroid insecticides and DDT. The most common of these mutations is L995F, which is found across many insect taxa. In *Anopheles gambiae s.l*, we observe many distinct variants under selection, for example, L995F, L995S, and a haplotype containing both V402L and I1527T [(Clarkson *et al.,* 2021)](https://doi.org/10.1111/mec.15845). \n", + "\n", + "This form of resistance was first reported in *Anopheles gambiae* in 1998 [(Martinez-Torres *et al.,* 1998)](https://pubmed.ncbi.nlm.nih.gov/9535162/). Mutants at this locus are now found throughout sub-Saharan Africa. " + ] + }, { "cell_type": "markdown", "id": "fbf2eacc-ee18-425e-a3ee-ad8e53bda5f9", diff --git a/docs/alert/coeaexf.ipynb b/docs/alert/SA-2-Coeaexf.ipynb similarity index 98% rename from docs/alert/coeaexf.ipynb rename to docs/alert/SA-2-Coeaexf.ipynb index e7562dc..d738c47 100644 --- a/docs/alert/coeaexf.ipynb +++ b/docs/alert/SA-2-Coeaexf.ipynb @@ -31,21 +31,18 @@ }, "outputs": [], "source": [ - "from IPython.display import Markdown, HTML\n", + "from IPython.display import HTML\n", "import malariagen_data\n", "import numpy as np\n", "import pandas as pd\n", "from pyprojroot import here\n", - "import yaml\n", "import dask\n", "dask.config.set(scheduler=dask_scheduler);\n", - "from textwrap import dedent\n", "\n", "import geopandas as gpd\n", "import bokeh.layouts as bklay\n", "import bokeh.plotting as bkplt\n", "import bokeh.models as bkmod\n", - "import plotly.express as px\n", "\n", "from bokeh.io import output_notebook # enables plot interface in J notebook\n", "\n", @@ -62,23 +59,15 @@ "id": "1f2288aa-fe3d-44c2-88be-aa897d9b4b49", "metadata": {}, "source": [ - "# *Coeaexf*\n", - "\n", - "AGAP006227 (*Coeae1f*) \n", - "AGAP006228 (*Coeae2f*)\n", - "\n", - "Gene amplifications covering these alpha-esterases have been recently associated with resistance to the organophosphate, pirimiphos-methyl [(Nagi *et al.,* 2024)](https://www.biorxiv.org/content/10.1101/2024.02.01.578361v1). Pirimiphos-methyl is widely used in indoor-residual spraying campaigns as the formulation Actellic CS300. \n", - "\n", - "The genes are orthologous to the Est2 and Est3 genes of *Culex pipiens*, which also confer resistance to organophosphates, and were studied intensely in the late 20th century as an example of evolution due to anthropogenic pressures [(Raymond *et al.,* 1998)](https://doi.org/10.1098/rstb.1998.0322). " + "# SA-2 (*Coeaexf*)" ] }, { "cell_type": "markdown", - "id": "97da6cd4-0dc2-4ba4-8cac-4f93edc4badc", + "id": "b29f7a5a-c33d-4f5a-b6d3-9023ab7ebe55", "metadata": {}, "source": [ - "## Cohorts affected\n", - "Overlapping signals of selection are found in the following cohorts. " + "## Signals" ] }, { @@ -118,30 +107,6 @@ "region_signals = region_signals.merge(cohorts)" ] }, - { - "cell_type": "code", - "execution_count": null, - "id": "5b39047f-6b12-4402-a973-7dc1cd4f39b1", - "metadata": { - "tags": [ - "remove-input" - ] - }, - "outputs": [], - "source": [ - "cohort_links = ['' + row[\"cohort_label\"] + \"\" for i, row in region_signals.iterrows()]\n", - "html_message = '
  • ' + '
  • '.join(cohort_links) + '
  • '\n", - "HTML(html_message)" - ] - }, - { - "cell_type": "markdown", - "id": "b29f7a5a-c33d-4f5a-b6d3-9023ab7ebe55", - "metadata": {}, - "source": [ - "## Signals and candidate genes" - ] - }, { "cell_type": "code", "execution_count": null, @@ -316,6 +281,46 @@ "plot_region_summary(df=region_signals.reset_index(), region_name=region_name)" ] }, + { + "cell_type": "markdown", + "id": "97da6cd4-0dc2-4ba4-8cac-4f93edc4badc", + "metadata": {}, + "source": [ + "## Cohorts affected\n", + "Overlapping signals of selection are found in the following cohorts. " + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "5b39047f-6b12-4402-a973-7dc1cd4f39b1", + "metadata": { + "tags": [ + "remove-input" + ] + }, + "outputs": [], + "source": [ + "cohort_links = ['' + row[\"cohort_label\"] + \"\" for i, row in region_signals.iterrows()]\n", + "html_message = '
  • ' + '
  • '.join(cohort_links) + '
  • '\n", + "HTML(html_message)" + ] + }, + { + "cell_type": "markdown", + "id": "b29c3508", + "metadata": {}, + "source": [ + "## Candidate genes \n", + "\n", + "AGAP006227 (*Coeae1f*) \n", + "AGAP006228 (*Coeae2f*)\n", + "\n", + "Gene amplifications covering these alpha-esterases have been recently associated with resistance to the organophosphate, pirimiphos-methyl [(Nagi *et al.,* 2024)](https://www.biorxiv.org/content/10.1101/2024.02.01.578361v1). Pirimiphos-methyl is widely used in indoor-residual spraying campaigns as the formulation Actellic CS300. \n", + "\n", + "The genes are orthologous to the Est2 and Est3 genes of *Culex pipiens*, which also confer resistance to organophosphates, and were studied intensely in the late 20th century as an example of evolution due to anthropogenic pressures [(Raymond *et al.,* 1998)](https://doi.org/10.1098/rstb.1998.0322). " + ] + }, { "cell_type": "markdown", "id": "fbf2eacc-ee18-425e-a3ee-ad8e53bda5f9", diff --git a/workflow/notebooks/selection-alert.ipynb b/workflow/notebooks/selection-alert.ipynb index edeedcd..a9d0a8e 100644 --- a/workflow/notebooks/selection-alert.ipynb +++ b/workflow/notebooks/selection-alert.ipynb @@ -62,20 +62,41 @@ "id": "1f2288aa-fe3d-44c2-88be-aa897d9b4b49", "metadata": {}, "source": [ - "# example_locus_name\n", - "\n", - "AGAP004707 (*Vgsc*)\n", - "\n", - "Mutations in this gene cause resistance to pyrethroid insecticides and DDT." + "# SA-1" ] }, { "cell_type": "markdown", - "id": "97da6cd4-0dc2-4ba4-8cac-4f93edc4badc", + "id": "b29f7a5a-c33d-4f5a-b6d3-9023ab7ebe55", "metadata": {}, "source": [ - "## Cohorts affected\n", - "Overlapping signals of selection are found in the following cohorts. " + "## Signals" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "9a2a86dd-58c7-4037-b99c-53ede4c54f87", + "metadata": { + "tags": [ + "remove-input", + "remove-output" + ] + }, + "outputs": [], + "source": [ + "extra_params = dict()\n", + "if use_gcs_cache:\n", + " extra_params[\"url\"] = \"simplecache::gs://vo_agam_release\"\n", + " extra_params[\"simplecache\"] = dict(cache_storage=(here() / \"gcs_cache\").as_posix())\n", + "\n", + "ag3 = malariagen_data.Ag3(\n", + " # pin the version of the cohorts analysis for reproducibility\n", + " cohorts_analysis=cohorts_analysis,\n", + " results_cache=(here() / \"malariagen_data_cache\").as_posix(),\n", + " **extra_params,\n", + ")\n", + "ag3" ] }, { @@ -115,56 +136,6 @@ "region_signals = region_signals.merge(cohorts)" ] }, - { - "cell_type": "code", - "execution_count": null, - "id": "346bbbd3-c0f8-4f44-a3ce-e417080d2245", - "metadata": { - "tags": [ - "remove-input" - ] - }, - "outputs": [], - "source": [ - "cohort_links = ['' + row[\"cohort_label\"] + \"\" for i, row in region_signals.iterrows()]\n", - "html_message = '
  • ' + '
  • '.join(cohort_links) + '
  • '\n", - "HTML(html_message)" - ] - }, - { - "cell_type": "markdown", - "id": "b29f7a5a-c33d-4f5a-b6d3-9023ab7ebe55", - "metadata": {}, - "source": [ - "## Signals and candidate genes" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "id": "9a2a86dd-58c7-4037-b99c-53ede4c54f87", - "metadata": { - "tags": [ - "remove-input", - "remove-output" - ] - }, - "outputs": [], - "source": [ - "extra_params = dict()\n", - "if use_gcs_cache:\n", - " extra_params[\"url\"] = \"simplecache::gs://vo_agam_release\"\n", - " extra_params[\"simplecache\"] = dict(cache_storage=(here() / \"gcs_cache\").as_posix())\n", - "\n", - "ag3 = malariagen_data.Ag3(\n", - " # pin the version of the cohorts analysis for reproducibility\n", - " cohorts_analysis=cohorts_analysis,\n", - " results_cache=(here() / \"malariagen_data_cache\").as_posix(),\n", - " **extra_params,\n", - ")\n", - "ag3" - ] - }, { "cell_type": "code", "execution_count": null, @@ -313,6 +284,41 @@ "plot_region_summary(df=region_signals.reset_index(), region_name=region_name)" ] }, + { + "cell_type": "markdown", + "id": "97da6cd4-0dc2-4ba4-8cac-4f93edc4badc", + "metadata": {}, + "source": [ + "## Cohorts affected\n", + "Overlapping signals of selection are found in the following cohorts. " + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "346bbbd3-c0f8-4f44-a3ce-e417080d2245", + "metadata": { + "tags": [ + "remove-input" + ] + }, + "outputs": [], + "source": [ + "cohort_links = ['' + row[\"cohort_label\"] + \"\" for i, row in region_signals.iterrows()]\n", + "html_message = '
  • ' + '
  • '.join(cohort_links) + '
  • '\n", + "HTML(html_message)" + ] + }, + { + "cell_type": "markdown", + "id": "db71a1cd", + "metadata": {}, + "source": [ + "## Candidate genes\n", + "\n", + "AGAP004707 (*Vgsc*) - The Voltage-gated sodium channel (VGSC) is the target of pyrethroids and DDT. Mutations in this gene are associated with knockdown resistance (kdr) to pyrethroids and DDT..." + ] + }, { "cell_type": "markdown", "id": "fbf2eacc-ee18-425e-a3ee-ad8e53bda5f9",