ppl

History

Name		Name	Last commit message	Last commit date
parent directory ..
data		data
utils		utils
README.md		README.md
demo.ipynb		demo.ipynb
requirements.txt		requirements.txt
single_ckpt_ppl_eval.py		single_ckpt_ppl_eval.py

README.md

Perplexity

This folder contains implementations to measure model's per-token perplexity.

Overview

The folder contains code for model perplexity measurement. Amber and Crystal models are currently supported.

Directory Structure

single_ckpt_ppl_eval.py is the main entrypoint for calculating perplexity on a single model. It uses python modules in utils/ folder.

The utils/ folder contains helper functions for model/dataset IO:

data_utils.py: Dataset IO utils
model_utils.py: Model loader

We provide a sample dataset at ./data/wikitext.txt, which contains a 1,000-line random sample from the wikitext-2-v1 train split. By default, the perplexity results are saved in ./results.josn.

Installation

Clone and enter the folder:

git clone https://github.com/LLM360/Analysis360.git
cd Analysis360/analysis/metrics/ppl

Install dependencies:
```
pip install -r requirements.txt
```

Quick Start

Perplexity evaluation

An example usage is provided in the demo.ipynb, which can be executed with a single A100 80G GPU.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

ppl

ppl

README.md

Perplexity

Table of Contents

Overview

Directory Structure

Installation

Quick Start

Perplexity evaluation

Files

ppl

Directory actions

More options

Directory actions

More options

Latest commit

History

ppl

Folders and files

parent directory

README.md

Perplexity

Table of Contents

Overview

Directory Structure

Installation

Quick Start

Perplexity evaluation