-
Notifications
You must be signed in to change notification settings - Fork 223
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Readme modification #1700
base: main
Are you sure you want to change the base?
Readme modification #1700
Conversation
examples/image-to-text/README.md
Outdated
@@ -17,55 +17,11 @@ limitations under the License. | |||
# Image to Text Examples | |||
This directory contains a script that showcases how to perform image to text generation on Intel® Gaudi® AI Accelerators. | |||
|
|||
## Single-HPU inference | |||
Habana FusedSDPA is a fused and optimized implementation of torch.nn.functional.scaled_dot_product_attention() for Gaudi. For more details, refer to [Gaudi online documentation](https://docs.habana.ai/en/latest/PyTorch/Model_Optimization_PyTorch/Optimization_in_PyTorch_Models.html?highlight=fusedsdpa#using-fused-scaled-dot-product-attention-fusedsdpa). We optimized many models with FusedSDPA optimization as in optimum/habana/transformers/models. If models are not optimzied with FusedSDPA, it uses SDPA implementation. |
This comment was marked as resolved.
This comment was marked as resolved.
Sorry, something went wrong.
Finetune model can be done as single device, multi-devices with mpi or deepspeed. | ||
|
||
For multi-devices training with mpi, add the following before run_glue.py where X is device number | ||
> python ../gaudi_spawn.py \ | ||
> --world_size X --use_mpi run_glue.py \ | ||
|
||
For multi-devices training with deepspeed, add the following before run_glue.py and deepspeed config with --deepspeed after where X is device number | ||
> python ../gaudi_spawn.py \ | ||
> --world_size X --use_deepspeed run_glue.py --deepspeed \ | ||
|
This comment was marked as resolved.
This comment was marked as resolved.
Sorry, something went wrong.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some typo fixes suggested- pls see
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
couple links need fixing
Co-authored-by: Vidya Galli <[email protected]>
Co-authored-by: Vidya Galli <[email protected]>
examples/image-to-text/README.md
Outdated
--bf16 \ | ||
--sdp_on_bf16 | ||
``` | ||
Inference with FP8 precision are enabled using [Intel Neural Compressor (INC)](https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_FP8.html), which provides model measurement and quantization capabilities in PyTorch. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_FP8.html returns "page not found"
``` | ||
Inference with FP8 precision are enabled using [Intel Neural Compressor (INC)](https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_FP8.html), which provides model measurement and quantization capabilities in PyTorch. | ||
More information on enabling FP8 in SynapseAI is available here: | ||
https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_FP8.html |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: regisss <[email protected]>
Simplify ReadMe for text-classification, text-feature-extraction, and image-to-text.
What does this PR do?
Fixes # (issue)
Before submitting