Time to consider on-device GPU accelerate? #24

Open
sorasoras opened this issue Jan 3, 2025 · 2 comments

@sorasoras

Option 1: the OpenCL backend by Qualcomm
https://www.qualcomm.com/developer/blog/2024/11/introducing-new-opn-cl-gpu-backend-llama-cpp-for-qualcomm-adreno-gpu
https://github.com/CodeLinaro/llama.cpp
The backend has been tested with many premium devices powered by Snapdragon SoCs:

Laptops running Windows 11 with Snapdragon X Elite and Snapdragon X Plus chips
Android smartphones powered by Snapdragon 8 Gen 1, 2, 3, and the latest Snapdragon 8 Elite

It should run faster on Qualcomm devices, since Qualcomm is the most widely used brand these days.

Option 2: MLC LLM

https://github.com/mlc-ai/mlc-llm

This should work with most SoCs.

Either option would help a lot with running larger models, such as 7B or even 14B, on Android devices.
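
For a rough sense of scale, here is a back-of-the-envelope sketch in Python of how much memory the weights alone would need. The function name `weight_gb` and the 4-bit quantization figure are assumptions made for illustration; they do not come from llama.cpp or MLC LLM. The estimate also ignores the KV cache and runtime overhead, and real quantization formats typically use somewhat more than 4 bits per weight.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in gigabytes (1 GB = 1e9 bytes).

    Hypothetical helper for illustration only: it counts weight storage and
    ignores the KV cache, activations, and any runtime overhead.
    """
    # (params_billion * 1e9 params) * (bits_per_weight / 8 bytes) / 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


if __name__ == "__main__":
    for size in (7, 14):
        # Assumed 4-bit quantization; 16-bit shown for comparison.
        print(f"{size}B: ~{weight_gb(size, 4):.1f} GB at 4-bit, "
              f"~{weight_gb(size, 16):.1f} GB at 16-bit")
```

Under these assumptions a 7B model needs about 3.5 GB for weights and a 14B model about 7 GB, so memory capacity matters on a phone as well as GPU speed.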

@shubham0204
Owner

shubham0204 commented Jan 4, 2025

Adding OpenCL support for devices with Adreno GPUs looks doable. Would you be interested in testing an unreleased APK that contains OpenCL acceleration? I do not have a physical device that has any one of the required CPUs from the Snapdragon 8 family. I can attach the APK in this thread once I have an APK ready for testing.

@sorasoras
Author

> Adding OpenCL support for devices with Adreno GPUs looks doable. Would you be interested in testing an unreleased APK that contains OpenCL acceleration? I do not have a physical device that has any one of the required CPUs from the Snapdragon 8 family. I can attach the APK in this thread once I have an APK ready for testing.

Sure. I have a Snapdragon 8 Gen 2 (8G2) device, so it should work.
