Visual Benchmarks for Testing an LLM's Level of Common Sense

This repository holds a number of benchmark test cases for testing the level of common sense that a given multimodal LLM (Large Language Model) shows in various domains.