Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Android App to run Llama-v2-7B-Chat Quantized INT4 on my Android Device #1

Open
taeyeonlee opened this issue Jul 25, 2024 · 5 comments
Labels
Feature Request New feature or request

Comments

@taeyeonlee
Copy link

Hi,
Could you share the sample Android App to run Llama-v2-7B-Chat Quantized INT4 on my Android Device ?

@mestrona-3 mestrona-3 added the Feature Request New feature or request label Aug 1, 2024
@mestrona-3
Copy link

Hi @taeyeonlee, thank you for the feature request. We have heard this request from many users and have it in our backlog. In the meantime, please reference the demo.py and export.py for Llamav2. Thank you for your patience!

@taeyeonlee
Copy link
Author

Hi, @mestrona-3
Could you please share the plan to release the Android sample app with C++ APIs ?

@bhushan23
Copy link

Hi @taeyeonlee
We are validating Android App internally and will be releasing it during first half of November.

@YangWang92
Copy link

YangWang92 commented Nov 19, 2024

Hi @bhushan23 , thanks for the issue. Do you have any further updates? Thanks.

@mestrona-3
Copy link

Hi All, we are still testing the app and are aiming for it to be released in one of upcoming releases. Please join our Slack community for continued updates.

We use Github for bugs and attaching detailed logs. Slack is the go to place for any new feature announcements and release notes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature Request New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants