Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add JittorLLMs examples #173

Merged
merged 5 commits into from
Oct 7, 2023
Merged

add JittorLLMs examples #173

merged 5 commits into from
Oct 7, 2023

Conversation

MAiTlx
Copy link
Contributor

@MAiTlx MAiTlx commented Sep 6, 2023

No description provided.

README.md Outdated
@@ -483,6 +483,10 @@ print(text)

详细的推理文档见:[inference-speed/GPU/vllm_example](https://github.com/FlagAlpha/Llama2-Chinese/blob/main/inference-speed/GPU/vllm_example/README.md)

### JittorLLMs
[JittorLLMs](https://github.com/Jittor/JittorLLMs)由非十科技领衔,与清华大学可视媒体研究中心合作研发,通过动态swap机制大幅降低硬件配置要求(减少80%),并且Jittor框架通过零拷贝技术,大模型加载开销降低40%,同时,通过元算子自动编译优化,计算性能提升20%以上。
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里的计算开销40%,性能提升20%可以写一下是和哪个框架对比不


## 配置要求

* 内存要求:至少2G,推荐32G
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里配置要求是针对所有的模型吗,还是什么样子的模型呢


部署只需一行命令即可:

```
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里可以以本地的Atom-7B模型为例子吗 https://huggingface.co/FlagAlpha/Atom-7B

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

JittorLLM通过gradio库,允许用户在浏览器之中和大模型直接进行对话。

~~~bash
python web_demo.py llama2
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同样这里可以以本地的Atom-7B模型为例子吗 https://huggingface.co/FlagAlpha/Atom-7B

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

JittorLLM在api.py文件之中,提供了一个架设后端服务的示例。

~~~bash
python api.py llama2
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同样这里可以以本地的Atom-7B模型为例子吗 https://huggingface.co/FlagAlpha/Atom-7B
Atom-7Bd的prompt格式和了lama2不太一样
https://github.com/FlagAlpha/Llama2-Chinese/blob/main/examples/chat_gradio.py#L33

@Rayrtfr Rayrtfr merged commit 2e60e51 into LlamaFamily:main Oct 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants