Add MultiCategoricalProjectionNetwork #705
Conversation
@sidney-tio Thank you for this PR. It could also solve #702 for me. Hope that this will be merged soon.
Hi team! Could I request a review for this, please?
    return distribs

  def _mode(self):
    return self._flatten_and_concat_event(
Why does `mode` need `flatten_and_concat_event` but `sample` doesn't?
`Blockwise` from tf-probability doesn't implement `mode`, but implements `flatten_and_concat_event` for `sample` and `mean`.
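A minimal sketch of the pattern being described, assuming tfp's `Blockwise` exposes `_flatten_and_concat_event` and the `distributions` property as in the version this PR targets (the subclass name here is hypothetical, not the PR's class):

```python
import tensorflow_probability as tfp

class ModeBlockwise(tfp.distributions.Blockwise):
  """Hypothetical Blockwise subclass that adds a mode."""

  def _mode(self, **kwargs):
    # Blockwise has no _mode of its own, so compute the mode of each
    # component distribution and pack the results into one flat vector with
    # the same helper Blockwise uses internally for sample() and mean().
    modes = [d.mode() for d in self.distributions]
    return self._flatten_and_concat_event(modes)
```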
  def __init__(self, logits, categories_shape):
    self.categories_shape = categories_shape
    distribs = self._create_distrib(logits)
Can you validate that the dimensions of logits align with categories_shape?
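Something along these lines could work; this is only a sketch under the assumption that `categories_shape` holds the number of categories per sub-action, so the last logits dimension should equal their sum (the helper name is hypothetical):

```python
def _validate_logits(logits, categories_shape):
  # Hypothetical check: the flat logits vector should have one entry per
  # category across all sub-actions.
  expected = sum(categories_shape)
  if logits.shape[-1] is not None and logits.shape[-1] != expected:
    raise ValueError(
        'Last dimension of logits (%s) does not match the total number of '
        'categories implied by categories_shape (%d).'
        % (logits.shape[-1], expected))
```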
  Args:
    sample_spec: A collection of `tensor_spec.BoundedTensorSpec` detailing
      the shape and dtypes of samples pulled from the output distribution.
    logits_init_output_factor: Output factor for initializing kernal
It's not clear what the output factor means here.
kernal -> kernel
Fixed the kernel spelling error. The description for `logits_init_output_factor` was inherited from `CategoricalProjectionNetwork`, but I do agree it is not exactly clear what output factor means here.
    self._projection_layer = tf.keras.layers.Dense(
        self.n_unique_categories,
        kernel_initializer=tf.compat.v1.keras.initializers.VarianceScaling(
Why use VarianceScaling as the initializer? Can you use just tf.keras.initializers?
Similar to the output factor, the initializer was carried over from `CategoricalProjectionNetwork`; I'm not sure whether there's any particular reason VarianceScaling is used there:

agents/tf_agents/networks/categorical_projection_network.py, lines 81 to 82 in 5360685:

    kernel_initializer=tf.compat.v1.keras.initializers.VarianceScaling(
        scale=logits_init_output_factor),
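For what it's worth, a sketch of the reviewer's suggestion (an assumption, not the merged code): the same layer written with the non-compat `tf.keras.initializers.VarianceScaling`, keeping the `scale=logits_init_output_factor` argument. Defaults may differ slightly between the v1 and v2 initializers, so this would need checking.

```python
import tensorflow as tf

# Placeholders for values the class already holds (hypothetical here).
n_unique_categories = 7
logits_init_output_factor = 0.1

# Hypothetical rewrite of the projection layer using the v2 initializer.
projection_layer = tf.keras.layers.Dense(
    n_unique_categories,
    kernel_initializer=tf.keras.initializers.VarianceScaling(
        scale=logits_init_output_factor),
    name='logits')
```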
  def _categories_shape(self, sample_spec):
    def _get_n_categories(array_spec):
      if not tensor_spec.is_bounded(array_spec):
Simplify to:

    if tensor_spec.is_bounded(array_spec) and tensor_spec.is_discrete(array_spec):
      n_categories = array_spec.maximum - array_spec.minimum + 1
      return n_categories
    else:
      raise ValueError(
          'sample_spec must be discrete and bounded. Got: %s.' % array_spec)
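As a quick usage check of the suggested logic (my own example, not from the PR): a discrete spec bounded in [0, 4] yields 4 - 0 + 1 = 5 categories.

```python
import tensorflow as tf
from tf_agents.specs import tensor_spec

spec = tensor_spec.BoundedTensorSpec([], tf.int32, minimum=0, maximum=4)
if tensor_spec.is_bounded(spec) and tensor_spec.is_discrete(spec):
  n_categories = spec.maximum - spec.minimum + 1
  print(int(n_categories))  # 5
```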
    logits = tf.reshape(logits, [-1] + [self.n_unique_categories])
    logits = batch_squash.unflatten(logits)
    if mask is not None:
      # assume mask is a flattened array for now
I suppose mask should have the same shape as actions.
I'm not sure about the most appropriate approach for this, so any advice here would be good. At this stage the outputs of the network exist as logits, i.e. the vector is flattened, so in this case `mask` needs to be flattened as well. Alternatively, if we want the `mask` to have the same shape as the actions, we could do the masking during init of `MultiCategoricalDistributionBlock`.
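For illustration only, a minimal sketch of the alternative mentioned above, assuming the caller provides one mask per sub-action in the same order as the logits blocks (the helper name is hypothetical):

```python
import tensorflow as tf

def flatten_mask(masks):
  # `masks` is a list of [batch, n_categories_i] tensors, one per sub-action.
  # Concatenating along the last axis lines them up with the flattened
  # logits vector that the network produces.
  return tf.concat(masks, axis=-1)
```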
      # Overwrite the logits for invalid actions to a very large negative
      # number. We do not use -inf because it produces NaNs in many tfp
      # functions.
      almost_neg_inf = tf.constant(logits.dtype.min, dtype = logits.dtype)
Make sure lines are <80 characters and parameters don't have extra spaces:

    almost_neg_inf = tf.constant(logits.dtype.min,
                                 dtype=logits.dtype)
      logits = tf.compat.v2.where(
          tf.cast(mask, tf.bool), logits, almost_neg_inf)

    return self.output_spec.build_distribution(logits= logits), ()
(logits= logits) -> (logits=logits)
    output_spec = [
        tensor_spec.BoundedTensorSpec([], tf.int32, 0, 1),
        tensor_spec.BoundedTensorSpec([], tf.int32, 0, 4)]
    network = multi_categorical_projection_network.MultiCategoricalProjectionNetwork(
The name is too long; can you make it shorter, e.g. multi_categorical_network.MultiCategoricalNetwork?
    self.assertEqual(tfp.distributions.Categorical, type(distribution.distributions[0]))
    self.assertEqual(2, len(distribution.distributions))
    self.assertEqual((3, 7), distribution._parameters['logits'].shape)
    self.assertEqual((3, 2), sample.shape)
Can you also test that the samples respect the bounds?
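A sketch of what that check could look like (an assumption about the test, reusing the `sample` tensor and the two bounded specs defined above):

```python
import numpy as np

sample_np = np.asarray(sample)  # `sample` comes from the test above
# First sub-action is bounded in [0, 1], the second in [0, 4].
assert np.all((sample_np[:, 0] >= 0) & (sample_np[:, 0] <= 1))
assert np.all((sample_np[:, 1] >= 0) & (sample_np[:, 1] <= 4))
```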
Thanks for the review! I've made the changes as advised. Some of the changes requested were for code inherited from CategoricalProjectionNetwork, so I'm not sure whether it warrants a separate PR to reconcile both?
@sguada Is there a reason this was never merged? I think this one would make such a difference.
Closing; after this much time I do not think anyone will get to it. I am merging others that I can review myself. This one is too big for me. Sorry this one got so far and didn't land. :-(
@sguada just for visibility.
Closes #694