Note: this module is under development; use it at your own risk!
Evaluate the performance of your prediction models using strictly proper scoring functions.
All useful models make predictions. Assessing the quality of these predictions is essential to understanding a model's limits, how well it performs compared to other models, and how you can use it responsibly.
A forecasting score puts a number to how accurate a series of predictions is. For binary events, a forecasting score measures how consistent the outcome of each event is with the model's prediction of how likely that event was to happen. The strictly proper Brier and logarithmic scoring functions give their optimum expected outputs when the prediction equals the event's true probability.
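As a standalone sketch of what "strictly proper" means in practice (not part of this package), the expected Brier score of a prediction `p` for a binary event with true probability `q` is minimised exactly when `p` equals `q`:

```julia
# Standalone sketch, not part of ForecastingScores: for a binary event with
# true probability q, the expected Brier score of a prediction p is
#   E[(p - outcome)^2] = q * (1 - p)^2 + (1 - q) * p^2,
# which is minimised exactly when p == q (the strictly proper property).
expected_brier(p, q) = q * (1 - p)^2 + (1 - q) * p^2

q = 0.7                          # assumed true probability of the event
candidates = 0.0:0.05:1.0        # grid of candidate predictions
best = argmin(p -> expected_brier(p, q), candidates)
println("Expected Brier score is lowest at p = $best")  # lands at (or nearest to) q
```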
This package provides three scoring algorithms, `brier_score`, `logarithmic_score` and `naive_score`, to rate your predictions with.
- Clone this repository to your local drive.
- Enter the `julia` REPL in the repository's parent directory.
- Run `Pkg.add(path="ForecastingScores")` to add the module to your environment (sketched below).

NOTE: this package has not been developed to the exact Pkg guidelines and should not be installed globally.
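Assuming the cloned folder is named `ForecastingScores` and sits in the current directory, the steps above look roughly like this in the REPL:

```julia
# Run from the repository's parent directory
# (folder name "ForecastingScores" is assumed).
using Pkg
Pkg.add(path="ForecastingScores")   # add the local clone to the active environment
using ForecastingScores             # load the module
```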
To create a set of predictions about some events and whether or not they occurred, create an `EventRecord`.
```julia
> eventrecord = Dict(
      0.5 => true,
      0.1 => false,
      0.99 => true
  ) |> EventRecord;

> brier_score(eventrecord)
0.08669999241828918
```
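The other two scorers are assumed here to take the same `EventRecord` argument as `brier_score`; a hypothetical follow-up to the session above:

```julia
# Assumed to share brier_score's signature; outputs not reproduced here.
log_score = logarithmic_score(eventrecord)
naive     = naive_score(eventrecord)
```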
- Brier score (`brier_score`) - The mean quadratic distance between the truth value of an event (1 if it occurred, 0 if it did not) and the predicted probability. It is bounded between 0 and 1, and lower scores indicate better predictions.
- Logarithmic score (`logarithmic_score`) - The expectation value of the logarithmic score is largest when the predicted probability is the same as the true probability of the event occurring. It handles extreme predictions far better than the Brier score, but is unbounded and harder to interpret without comparing it to other logarithmic scores.
- Naive score (`naive_score`) - A simple scoring algorithm that calculates the difference between prediction and outcome. This is not a strictly proper score as it has a bias towards extreme predictions; it is included to compare against the other methods (see the sketch below).
Ben Butterworth, 2024