A full frontend (in react) and backend (using FastAPI) to try out LLaMA 7B on a similiar interface to GPT-Playground.
- Clone the repository
git clone https://github.com/MohamedAliRashad/llama-playground.git
- Download the LLaMA weights (There is different methods to do that, the easiest i found is this)
pip install pyllama -U && cd llama-playground && python3 -m llama.download --model_size 7B --folder backend/app/weights
- Add
.env
file to the root of the project with the ngrok auth tokenNGROK_AUTH= # Add your ngrok auth token here
- Run the docker-compose file
docker-compose up --build
And that's it, you can now access the playground on http://localhost:4551
where you will find the ngrok links to try the playground and share it with your friends 👏