Skip to content

Commit

Permalink
Fix the clone url of README
Browse files Browse the repository at this point in the history
  • Loading branch information
byshiue authored Apr 12, 2021
1 parent 850a57d commit e3d56f8
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@ We provide a docker file, which bases on Triton image `nvcr.io/nvidia/tritonserv

```bash
mkdir workspace && cd workspace
git clone https://gitlab-master.nvidia.com/liweim/transformer_backend.git
git clone https://github.com/triton-inference-server/fastertransformer_backend.git
nvidia-docker build --tag ft_backend --file transformer_backend/Dockerfile .
nvidia-docker run --gpus=all -it --rm --volume $HOME:$HOME --volume $PWD:$PWD -w $PWD --name ft-work ft_backend
cd workspace
Expand Down Expand Up @@ -120,4 +120,4 @@ The model configuration for Triton server is put in `all_models/transformer/conf
- vocab_size: size of vocabulary
- decoder_layers: number of transformer layers
- batch_size: max supported batch size
- is_fuse_QKV: fusing QKV in one matrix multiplication or not. It also depends on the weights of QKV.
- is_fuse_QKV: fusing QKV in one matrix multiplication or not. It also depends on the weights of QKV.

0 comments on commit e3d56f8

Please sign in to comment.