use maxcut for total ru #1324
Conversation
Compare: e718734 to 274f77d
@@ -67,13 +67,19 @@ def __init__(self,
        self.compute_metric_fn = self.get_sensitivity_metric()
        self._cuts = None

        self.ru_metrics = target_resource_utilization.get_restricted_metrics()
        # To define RU Total constraints we need to compute weights and activations even if they have no constraints
I couldn't just replace the Total target computation without ugly hacks, so I changed the way it's handled altogether. I'm not thrilled about this implementation, but I think it's the lesser evil: we can now reuse the weights and activation utilization for Total without recomputing it, it simplifies the code around the utilization matrix, and the old hack is no longer needed.
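A minimal sketch of the idea, with hypothetical names and shapes (this is not the actual MCT API, and it glosses over maxcut's per-cut activation accounting):

```python
import numpy as np

def metrics_to_compute(restricted: set) -> set:
    """If Total is restricted, weights and activation utilization must be
    computed too, even when neither carries a constraint of its own,
    because Total is derived from them rather than computed separately."""
    if 'total' in restricted:
        return restricted | {'weights', 'activation'}
    return restricted

def build_utilization_matrices(weights_u: np.ndarray,
                               activation_u: np.ndarray) -> dict:
    """Reuse the already-computed 2D matrices (rows: configurable nodes,
    columns: bit-width candidates) instead of recomputing Total."""
    return {'weights': weights_u,
            'activation': activation_u,
            'total': weights_u + activation_u}

w = np.array([[4.0, 8.0], [2.0, 4.0]])   # example weights utilization
a = np.array([[1.0, 2.0], [3.0, 6.0]])   # example activation utilization
print(metrics_to_compute({'total'}))
print(build_utilization_matrices(w, a)['total'])
```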
@@ -510,21 +508,22 @@ def compare(self, quantized_model, float_model, input_x=None, quantization_info:
        activation_bits = [layer.activation_holder_quantizer.get_config()['num_bits'] for layer in holder_layers]
        # TODO maxcut: restore activation_bits == [4, 4] and unique_tensor_values=16 when maxcut calculates tensor sizes
        # of fused nodes correctly.
        self.unit_test.assertTrue((activation_bits == [4, 8]))
        # TODO: maxcut Test updated but lowered activation ru (how can 4000 enforce 4,4??). Not sure what the fused nodes
@elad please comment on my comment to your comment :)
remove comments
@@ -174,13 +174,12 @@ def compare(self, quantized_model, float_model, input_x=None, quantization_info=
        # test with its current setup (therefore, we don't check the input layer's bitwidth)
        self.unit_test.assertTrue((activation_bits == [4, 8]))

        # TODO maxcut: restore this test after total_memory is fixed to be the sum of weight & activation metrics.
I restored some of the maxcut TODOs, but there are more remaining that I wasn't sure about.
Pull Request Description:
Use the maxcut activation method for the Total resource utilization target.
Compute the Total target from the weights and activation utilization instead of as a separate metric.
Simplify the mixed precision search manager and linear programming formulation to support only 2D utilization matrices (instead of 2D or higher); see the sketch below.
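As a rough, assumption-laden sketch of what the 2D-only formulation buys (illustrative names and shapes, not the library's actual LP): each target keeps a single (nodes × candidates) matrix, and a configuration's utilization reduces to a sum over selected entries.

```python
import numpy as np

num_nodes, num_candidates = 4, 3
rng = np.random.default_rng(0)

# One 2D utilization matrix per target: rows are configurable nodes,
# columns are bit-width candidates (no tensors of higher rank needed).
weights_u = rng.uniform(0.0, 10.0, (num_nodes, num_candidates))
activation_u = rng.uniform(0.0, 10.0, (num_nodes, num_candidates))
total_u = weights_u + activation_u  # Total reuses the other two targets

# A mixed-precision configuration picks one candidate per node; the LP's
# Total constraint reduces to a sum over the selected matrix entries.
config = np.array([0, 2, 1, 0])
selected_total = total_u[np.arange(num_nodes), config].sum()
print(selected_total <= 30.0)  # feasibility against an example Total target
```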
Checklist before requesting a review: