docs: Benchmarks #92
Conversation
docs/docs/benchmarks/memory-usage.md
Outdated
| Model | Android (XNNPack) [MB] | iOS (CoreML) [MB] |
| ----------------------------------------------------------------------------------------------- | ---------------------- | ----------------- |
| STYLE_TRANSFER_CANDY, STYLE_TRANSFER_MOSAIC, STYLE_TRANSFER_UDNIE, STYLE_TRANSFER_RAIN_PRINCESS | 950 | 350 |
Let's split it, one model per line. This looks a bit off to me.
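A possible layout after the split (assuming the per-model figures match the combined row above; the actual per-model values may differ):

```
| Model                       | Android (XNNPack) [MB] | iOS (CoreML) [MB] |
| --------------------------- | ---------------------- | ----------------- |
| STYLE_TRANSFER_CANDY        | 950                    | 350               |
| STYLE_TRANSFER_MOSAIC       | 950                    | 350               |
| STYLE_TRANSFER_UDNIE        | 950                    | 350               |
| STYLE_TRANSFER_RAIN_PRINCESS| 950                    | 350               |
```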
@@ -0,0 +1,39 @@
---
title: Inference Time
I think it would be better to only list the 'consecutive' value. Firstly, it might be a bit confusing what 'first' actually means, and also who knows when ExecuTorch and the system might decide to reload the model, causing a new 'first' run. Better to put a warning banner at the top of the page with a notice that initial runs can be significantly (even 2x) slower due to loading the model into memory.
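If the docs site supports Docusaurus-style admonitions (an assumption based on the `docs/docs/` layout), the banner could be a sketch like:

```
:::warning
The initial run of a model can be significantly (even 2x) slower than subsequent runs, because the model is being loaded into memory. The times reported below are for consecutive runs.
:::
```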
| LLAMA3_2_1B_QLORA | 31.8 | 11.4 | 11.2 | 37.3 | 44.4 |
| LLAMA3_2_3B | ❌ | ❌ | ❌ | ❌ | 7.1 |
| LLAMA3_2_3B_SPINQUANT | 17.2 | 8.2 | ❌ | 16.2 | 19.4 |
| LLAMA3_2_3B_QLORA | 14.5 | ❌ | ❌ | 14.8 | 18.1 |
Here, below this table, add a description of why we have ❌'s in some places (not enough memory).
Description
Add model benchmarks (memory usage, inference time, model size).
Type of change
Checklist