You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
thanks so much for releasing all the t51.1 and mt5 weights!
I'm currently working on porting all these models to huggingface's transformers.
Is there anyway to run mesh tensorflow in eager mode by any chance?
is there any way that I can run the prediction in eager mode so that I can print out the actual values in of the tensors? E.g. the tensor values of the input to the cross attention layer:
Hey Patrick, unfortunately I believe that because the mesh tf transformer uses tf.Estimator it is not eager-friendly. In the past when we've needed to do similar things, I'm sad to say that we just used Print ops.
FWIW we will soon be releasing a JAX implementation of T5(.1.1) which should make this kind of debugging and inspection a lot easier.
Hey guys,
thanks so much for releasing all the t51.1 and mt5 weights!
I'm currently working on porting all these models to huggingface's transformers.
Is there anyway to run mesh tensorflow in eager mode by any chance?
E.g. if I run the following predict command:
is there any way that I can run the prediction in eager mode so that I can print out the actual values in of the tensors? E.g. the tensor values of the input to the cross attention layer:
mesh/mesh_tensorflow/transformer/transformer_layers.py
Line 729 in 165d3dc
I had a hard time finding tests in the repo that run a small transformer network.
I'd be super happy for some pointers :-)
Also pinging @craffel in case you have any good pointers for good debugging tools :-)
The text was updated successfully, but these errors were encountered: