Merge pull request #1150 from JohnSnowLabs/chore/final_website_updates

Chore/final website updates for 2.5.0
JohnSnowLabs · Dec 17, 2024 · 497b86b · 497b86b
2 parents 2e45753 + f6a7ead
commit 497b86b
Show file tree

Hide file tree

Showing 11 changed files with 4,260 additions and 252 deletions.
diff --git a/.github/workflows/build_and_test.yml b/.github/workflows/build_and_test.yml
@@ -17,7 +17,7 @@ jobs:
     strategy:
       fail-fast: false
       matrix:
-        python-version: [ "3.8", "3.9","3.10" ]
+        python-version: [ "3.9","3.10", "3.11" ]
 
     steps:
       - name: Free up disk space at start

diff --git a/demo/tutorials/misc/Degradation_Analysis_Test.ipynb b/demo/tutorials/misc/Degradation_Analysis_Test.ipynb
diff --git a/docs/_data/navigation.yml b/docs/_data/navigation.yml
@@ -31,6 +31,8 @@ docs-menu:
           url: /docs/pages/docs/install
         - title:  One Liners
           url:    /docs/pages/docs/one_liner
+        - title:  Integrations
+          url:    /docs/pages/docs/integrations
 
   - title: General Concepts
     url: /docs/pages/docs/harness

diff --git a/docs/_includes/docs-langtest-pagination.html b/docs/_includes/docs-langtest-pagination.html
@@ -1,15 +1,20 @@
 <ul class="pagination owl-carousel pagination_big">
+  <!-- <li><a href="release_notes_2_5_0">2.5.0</a></li> -->
+  <li><a href="release_notes_2_4_0">2.4.0</a></li>
+  <li><a href="release_notes_2_3_1">2.3.1</a></li>
+  <li><a href="release_notes_2_3_0">2.3.0</a></li>
+  <li><a href="release_notes_2_2_0">2.2.0</a></li>
   <li><a href="release_notes_2_1_0">2.1.0</a></li>
   <li><a href="release_notes_2_0_0">2.0.0</a></li>
   <li><a href="release_notes_1_10_0">1.10.0</a></li>
   <li><a href="release_notes_1_9_0">1.9.0</a></li>
   <li><a href="release_notes_1_8_0">1.8.0</a></li>
   <li><a href="release_notes_1_7_0">1.7.0</a></li>
   <li><a href="release_notes_1_6_0">1.6.0</a></li>
-  <li><a href="release_notes_1_5_0">1.5.0</a></li>
+  <!-- <li><a href="release_notes_1_5_0">1.5.0</a></li>
   <li><a href="release_notes_1_4_0">1.4.0</a></li>
   <li><a href="release_notes_1_3_0">1.3.0</a></li>
   <li><a href="release_notes_1_2_0">1.2.0</a></li>
-  <!-- <li><a href="release_notes_1_1_0">1.1.0</a></li>
+  <li><a href="release_notes_1_1_0">1.1.0</a></li>
   <li><a href="release_notes_1_0_0">1.0.0</a></li> -->
 </ul>
diff --git a/docs/api/quick_start.html b/docs/api/quick_start.html
@@ -368,7 +368,7 @@ <h1>Quick Start<a class="headerlink" href="#quick-start" title="Permalink to thi
 <h2>LangTest Quick Start<a class="headerlink" href="#langtest-quick-start" title="Permalink to this heading">#</a></h2>
 <p>The following can be used as a quick reference on how to get up and running with <code class="docutils literal notranslate"><span class="pre">langtest</span></code>:</p>
 <div class="highlight-bash notranslate"><div class="highlight"><pre><span></span><span class="c1"># Install langtest from PyPI</span>
-pip<span class="w"> </span>install<span class="w"> </span><span class="nv">langtest</span><span class="o">==</span><span class="m">1</span>.1.0
+pip<span class="w"> </span>install<span class="w"> </span><span class="nv">langtest</span><span class="o">==</span><span class="m">2</span>.3.1
 </pre></div>
 </div>
 <div class="highlight-python notranslate"><div class="highlight"><pre><span></span><span class="kn">from</span> <span class="nn">langtest</span> <span class="kn">import</span> <span class="n">Harness</span>
@@ -386,20 +386,13 @@ <h2>Alternative Installation Options<a class="headerlink" href="#alternative-ins
 <p>We can create a Python <a class="reference external" href="https://virtualenv.pypa.io/en/latest/">Virtualenv</a>:</p>
 <div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>virtualenv<span class="w"> </span>langtest<span class="w"> </span>--python<span class="o">=</span>python3.8
 <span class="nb">source</span><span class="w"> </span>langtest/bin/activate
-pip<span class="w"> </span>install<span class="w"> </span><span class="nv">langtest</span><span class="o">==</span><span class="m">1</span>.1.0<span class="w"> </span>jupyter
+pip<span class="w"> </span>install<span class="w"> </span><span class="nv">langtest</span><span class="o">==</span><span class="m">2</span>.3.1<span class="w"> </span>jupyter
 </pre></div>
 </div>
 <p>Now you should be ready to create a jupyter notebook with LangTest running:</p>
 <div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>jupyter<span class="w"> </span>notebook
 </pre></div>
 </div>
-<p>We can also use conda and create a new <a class="reference external" href="https://docs.conda.io/projects/conda/en/latest/index.html">conda</a> environment to manage all the dependencies there.</p>
-<p>Then we can create a new environment <code class="docutils literal notranslate"><span class="pre">langtest</span></code> and install the <code class="docutils literal notranslate"><span class="pre">langtest</span></code> package with pip:</p>
-<div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>conda<span class="w"> </span>create<span class="w"> </span>-n<span class="w"> </span>langtest<span class="w"> </span><span class="nv">python</span><span class="o">=</span><span class="m">3</span>.8<span class="w"> </span>-y
-conda<span class="w"> </span>activate<span class="w"> </span>langtest
-conda<span class="w"> </span>install<span class="w"> </span>-c<span class="w"> </span><span class="nv">langtest</span><span class="o">==</span><span class="m">1</span>.1.0<span class="w"> </span>jupyter
-</pre></div>
-</div>
 <p>Now you should be ready to create a jupyter notebook with LangTest running:</p>
 <div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>jupyter<span class="w"> </span>notebook
 </pre></div>

diff --git a/docs/pages/docs/integrations.md b/docs/pages/docs/integrations.md
@@ -0,0 +1,167 @@
+---
+layout: docs
+seotitle: Integrations | LangTest | John Snow Labs
+title: Integrations
+permalink: /docs/pages/docs/integrations
+key: docs-integrations
+modify_date: "2023-03-28"
+header: true
+---
+
+<div class="main-docs" markdown="1">
+<div class="h3-box" markdown="1">
+
+
+**LangTest** is an open-source Python library that empowers developers to build safe and reliable Natural Language Processing (NLP) models. It seamlessly integrates with popular platforms and tools, including **Databricks**, enabling scalable testing and evaluation. Install LangTest easily using pip to enhance your NLP workflows.
+
+</div>
+<div class="h3-box" markdown="1">
+
+## Databricks
+
+**Introduction**  
+LangTest is a powerful tool for testing and evaluating NLP models, and integrating it with Databricks allows users to scale their testing with large datasets and leverage real-time analytics. This integration streamlines the process of assessing model performance, ensuring high-quality results while maintaining scalability and efficiency. With Databricks, LangTest becomes an even more versatile solution for NLP practitioners working with substantial data pipelines and diverse datasets.  
+
+**Prerequisites**  
+Before starting, ensure you meet the following requirements. You need access to a Databricks Workspace and an installed version of the `LangTest` package (version `2.5.0` or `later`). Additionally, make sure you have your Databricks API keys or credentials ready and have Python (version 3.9 or later) installed on your system. Optionally, access to sample datasets is helpful for testing and exploring features during your initial setup.  
+
+#### **Step-by-Step Setup**
+
+Getting started with LangTest and Databricks is straightforward and involves a few simple steps. Follow the instructions below to set up and run your first NLP model test.
+
+1. **Install LangTest and Dependencies**  
+   Begin by installing LangTest using pip:
+   ```bash
+   pip install langtest==2.5.0
+   ```
+   Ensure all required dependencies are installed and your environment is ready.
+
+2. **Load Datasets from Databricks**  
+   Use the Databricks connector to load data directly into your LangTest pipeline:
+   ```python
+   from pyspark.sql import DataFrame
+
+    # Load the dataset into a Spark DataFrame
+    df: DataFrame = spark.read.json("<FILE_PATH>")
+
+   ```
+   print the dataframe schema
+   ```python
+   df.printSchema()
+    ```
+
+3. **Configuration**  
+   In this section, we will configure the tests, datasets, and model settings required to effectively use LangTest. This includes setting up the test parameters, loading datasets, and defining the model configuration to ensure seamless integration and accurate evaluation.
+
+   - **Tests Config:**
+
+    ```python
+    test_config = {
+        "tests": {
+            "defaults": {"min_pass_rate": 1.0},
+            "robustness": {
+                "add_typo": {"min_pass_rate": 0.7},
+                "lowercase": {"min_pass_rate": 0.7},
+            },
+        },
+    }
+    ```
+
+   - **Dataset Config:**
+
+    ```python
+    input_data = {
+        "data_source": df,
+        "source": "spark",
+        "spark_session": spark # make sure that spark session is started or not
+    }
+    ```
+
+   - **Model Config:**
+
+    ```python
+    model_config = {
+        "model": {
+            "endpoint": "databricks-meta-llama-3-1-70b-instruct",
+        },
+        "hub": "databricks",
+        "type": "chat"
+    }
+    ```
+
+
+4. **Set Up and Run Tests with Harness**  
+   Use the `Harness` class to configure, generate, and execute tests. Define your task, model, data, and configuration:
+
+   ```python
+    harness = Harness(
+        task="question-answering",
+        model=model_config,
+        data=input_data,
+        config=test_config
+    )
+   ```
+
+   Generate and Execute the testcases on model to evaluate with langtest:
+   ```python
+   harness.generate().run().report()
+   ```
+
+   To Review the Testcases:
+   ```python
+   testcases_df = harness.testcases()
+   testcases_df
+   ```
+
+   To save testcases in delta live tables
+   ```python
+   import os
+   from deltalake import DeltaTable
+   from deltalake.writer import write_deltalake
+
+    write_deltalake("tmp/langtest_testcases", testcases_df) # for existed tables, pass mode="append"
+
+   ```
+
+   To Review the Generated Results 
+   ```python
+   results_df = harness.generated_results()
+   results_df
+   ```
+
+   Similary, for results_df in delta live tables.
+   ```python
+   import os
+   from deltalake import DeltaTable
+   from deltalake.writer import write_deltalake
+
+    write_deltalake("tmp/langtest_generated_results", results_df) # for existed tables, pass mode="append"
+
+   ```
+
+   This process evaluates your model's performance on the loaded data and provides a comprehensive report of the results.
+
+By following these steps, you can easily integrate Databricks with LangTest to perform NLP or LLM model testing. If you encounter issues during setup or execution, refer to the troubleshooting section for solutions.
+
+**Troubleshooting & Support**  
+While setting up, you may encounter common issues like authentication errors with Databricks, incorrect dataset paths, or model compatibility problems. To resolve these, verify your API keys and workspace URL, ensure the specified dataset exists in Databricks, and confirm that your LangTest version is compatible with your project. If further help is needed, explore the FAQ section, access detailed documentation, or reach out through the support channels or community forum for assistance.
+
+### FAQ
+
+**Q: How do I resolve authentication errors with Databricks?**  
+A: Ensure that your API keys and workspace URL are correct. Double-check that your credentials have the necessary permissions to access the Databricks workspace.
+
+**Q: What should I do if the dataset path is incorrect?**  
+A: Verify that the specified dataset exists in Databricks and that the path is correctly formatted. You can use the Databricks UI to navigate and confirm the dataset location.
+
+**Q: How can I check if my LangTest version is compatible with my project?**  
+A: Refer to the LangTest documentation for version compatibility information. Ensure that you are using a version of LangTest that supports the features and integrations required for your project.
+
+**Q: Where can I find more detailed documentation?**  
+A: Access the detailed documentation on the LangTest official website or the Databricks documentation portal for comprehensive guides and examples.
+
+**Q: How can I get additional support?**  
+A: Reach out through the support channels provided by LangTest or Databricks. You can also join the community forum to ask questions and share experiences with other users.
+
+
+</div></div>