Skip to content

Latest commit

 

History

History
138 lines (99 loc) · 6.5 KB

CONTRIBUTING.md

File metadata and controls

138 lines (99 loc) · 6.5 KB

Contribution Guidelines for LLMeBench

Introduction

Welcome to the LLMeBench project! We are excited to have you here. This document outlines the process and guidelines for contributing to this project. We welcome all forms of contribution: code, documentation, bug reports, feature requests, and more.

Prerequisites

Before you begin, please ensure you have the following installed:

  • Git
  • Python (>=3.8)

How to Contribute

  1. Fork the repository: Fork the repository to your own GitHub account.
  2. Clone the fork: Clone your fork locally on your computer.
git clone <url-of-LLMeBench-fork>
cd LLMeBench
  1. **Set up environment: Create and activate virtual environment:
python -m venv .envs/llmebench
source .envs/llmebench/bin/activate

Install the dependencies and benchmarking package:

pip install -e '.[dev,fewshot]'
  1. Create a new branch: Create a new branch for your contribution.
  2. Make changes: Implement your changes, additions, or fixes.
  3. Test: Run tests to ensure your changes do not break existing functionality.
  4. Submit a pull request: Push your changes to your fork on GitHub and then create a pull request for review.

Adding New Dataset, Task, Models or Features

If you are adding new dataset, task, models or features, please ensure:

  • The code is well-documented.
  • You have added corresponding tests.
  • Update the README to reflect any new changes or requirements.

Reporting Issues

Please use the GitHub issue tracker to report bugs or request features. Be as descriptive as possible and include any relevant code snippets, error messages, or screenshots.

Testing

Before submitting your changes, please make sure to run all the relevant scripts. This often involves running:

  • Format the code
bash scripts/format_code.sh
  • Run the tests
bash scripts/run_tests.sh

Documentation

Documentation is crucial for understanding the project's code and for users to effectively use the package. If you make changes that require updates to the documentation, please also include those changes in your pull request.

Branch Naming

To contribute to a specific version, please name your branch according to the version you are targeting. For example, if you are contributing to version 1.0, name your branch feature/1.0/your-feature or fix/1.0/your-fix.

For example, to add ArSAS sentiment dataset,

git checkout -b feature/1.0/ArSAS_sentiment

Example Contributions

We welcome contributions of all types, however, we are particularly interested in those that align with the current needs and goals of the LLMeBench framework. Below are some example contributions that would be especially valuable:

Adding new datasets/tasks/assets

  • Datasets: Contribute by adding new datasets in the field of large language models.
    • Create a new dataset loading scripts to include your new dataset. Before doing so, please check if it has already been implemented.
    • Provide proper citations and documentation for the dataset.
      Please check examples in the tutorial -- adding dataset
  • Tasks: Contribute on tasks that can help to the community, for exampels tasks reported in:
  • Assets: Contribute to the assets (prompts, post-processing etc.) associated with the tasks and datasets
    Please find the examples in the tutorial -- adding asset

Adding new languages

  • Under-represented languages: Add support for languages that are not well-represented in current large language models.
  • Language variants: Add support for regional variants or dialects.

Adding new model providers

State-of-the-Art Models: Contribute implementations or interfaces for state-of-the-art language models (e.g., LLaMA, Vecuna).

Please ensure that your contributions align with the existing code style guidelines and that you update any relevant documentation and tests.

Security and Sensitive Information

It is crucial to ensure that no sensitive information, especially API keys, is included in any of your contributions. Inadvertently committing API keys or other sensitive information to the repository poses a significant risk.

If you are working with API keys, tokens, or other sensitive information during development, make sure to:

  • Use environment variables: Store your API keys or other sensitive information in environment variables or use a secrets management service.
  • Git ignore: Double-check that files containing sensitive information are listed in your .gitignore before submitting a pull request.
  • Review before commit: Always review your changes before committing and again before submitting a pull request to ensure no sensitive information is included.

If you do accidentally commit API keys or any sensitive information, please contact the maintainers immediately to address the issue.

Roadmap

Short-term goals

  • Expand language support:
    • Introduce core tasks and datasets for additional languages to diversify the framework's language capabilities.
  • Enrich dataset collection:
    • Add a variety of datasets to enhance the framework's benchmarking scope.
  • Broaden model support:
    • Develop a generic interface for integration with Hugging Face models, enabling more versatile benchmarking options.
  • Incorporate additional few-shot methods:
    • Add support for various few-shot learning techniques to expand the evaluation metrics available.
  • Enable dual modes of interaction:
    • Facilitate interaction with both online and offline models, offering users greater flexibility.
  • Multi-dataset evaluation:
    • Allow for running a single asset across multiple datasets to provide a more comprehensive performance analysis.

Long-term goals

  • Implement results summarization:
    • Create a feature for automatically summarizing and comparing results from multiple assets, possibly including graphical representations.
  • More coming ...

License

By contributing, you agree that your contributions will be licensed under the same license used by the original project MIT-LICENSE.