
is it possible to convert tf-agents to tf-lite and run on android device #280

Open
ssujr opened this issue Jan 2, 2020 · 26 comments

@ssujr

ssujr commented Jan 2, 2020

We want to implement RL on an Android device. Just wondering if it is possible to run TF-Agents on Android or to convert a TF-Agents policy to TFLite. It would be great if someone could share some experience. Thank you!

@ebrevdo
Contributor

ebrevdo commented Jan 22, 2020

Yes; you should be able to do this. I'm guessing you care about inference (running a policy) more than training (since tflite doesn't support that anyway).

See the PolicySaver class. You can use it to export a SavedModel. You can then use the TFLite converter to convert that SavedModel to a TFLite model.

Please report back and let us know if this works for you!
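A minimal sketch of that two-step path (export a SavedModel, then convert it). The `TinyPolicy` module below is only a hypothetical stand-in for a PolicySaver export, which similarly exposes an `action` signature; with a real agent you would call `PolicySaver(agent.policy).save(...)` instead:

```python
import tensorflow as tf

class TinyPolicy(tf.Module):
    """Stand-in for a PolicySaver export; exposes an 'action' signature."""

    @tf.function(input_signature=[tf.TensorSpec([1, 4], tf.float32)])
    def action(self, observation):
        # Pick the index of the largest observation entry as the action.
        return tf.argmax(observation, axis=-1)

# Step 1: export a SavedModel with an 'action' signature.
policy = TinyPolicy()
tf.saved_model.save(policy, 'exported_policy',
                    signatures={'action': policy.action})

# Step 2: convert that signature to a TFLite flatbuffer.
converter = tf.lite.TFLiteConverter.from_saved_model(
    'exported_policy', signature_keys=['action'])
tflite_model = converter.convert()
with open('policy.tflite', 'wb') as f:
    f.write(tflite_model)
```

The resulting `policy.tflite` can then be bundled into an Android app and run with the TFLite interpreter.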

@ebrevdo ebrevdo self-assigned this Jan 22, 2020
@ssujr
Author

ssujr commented Feb 3, 2020

Actually, we plan to do both training and inference on device. Do you have plans to support on-device training in the near future? Thank you for the response.

@dvdhfnr

dvdhfnr commented May 4, 2020

Hi!

Yes; you should be able to do this. I'm guessing you care about inference (running a policy) more than training (since tflite doesn't support that anyway).

See the PolicySaver class. You can use it to export a SavedModel. You can then use the TFLite converter to convert that SavedModel to a TFLite model.

Please report back and let us know if this works for you!

We tried to do this (using the DqnAgent). However, we receive the following error when trying to convert the saved model (policy):
"ValueError: This converter can only convert a single ConcreteFunction. Converting multiple functions is under development."

@ebrevdo Any suggestions?
(If required, further details can be provided.)

Thanks!

@ebrevdo
Contributor

ebrevdo commented May 4, 2020

For "only convert a single ConcreteFunction" this is cause it's trying to use the new MLIR converter. I suggest filing a repro separately with the TensorFlow Issues so they can see this feature is required. @aselle @jdduke fyi.

Separately: for now you should be able to use the "old-style" converter (it should work fine). Try passing --enable_v1_converter when you call tflite_convert and report back :)

@ebrevdo
Contributor

ebrevdo commented May 4, 2020

You cannot do on-device training with TFLite. You must either use the standard TF runtime, or try the less well supported path of the new saved_model_cli aot_compile_cpu approach, which does not support dynamic shapes and is a lot more manual, but would allow you to train on device. Unfortunately there's no tutorial on how to do this (yet). If you're interested in that, we can ask the TF team to write something about this approach.
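For reference, the CLI shape of that path looks roughly like this; the paths, signature key, and C++ class name below are illustrative, and it assumes a PolicySaver export on disk and TF >= 2.2:

```shell
# Inspect the export first to confirm its signatures
saved_model_cli show --dir exported_policy --all

# AOT-compile the 'action' signature to a CPU object file plus a C++ header
saved_model_cli aot_compile_cpu \
  --dir exported_policy \
  --tag_set serve \
  --signature_def_key action \
  --output_prefix policy \
  --cpp_class PolicyAction
```

The generated object file and header are then linked into the app directly, with no TF runtime dependency.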

@ebrevdo
Contributor

ebrevdo commented May 4, 2020

(For aot_compile_cpu you will need the most recent TF 2.2 RC; it's not in TF 2.1.)

@dvdhfnr

dvdhfnr commented May 4, 2020


Thanks for the fast response!

--enable_v1_converter works "better", but leads to a different error:
ValueError: No 'serving_default' in the SavedModel's SignatureDefs. Possible values are 'get_initial_state,__saved_model_init_op,get_train_step,action'.

(We do not require training on the device.)
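One way to list the signature keys an export actually contains (assuming the SavedModel lives in saveDir, as in the error above) is saved_model_cli:

```shell
# List the available signatures; 'action' should appear where
# 'serving_default' does not
saved_model_cli show --dir saveDir --tag_set serve

# Inspect the inputs/outputs of the 'action' signature directly
saved_model_cli show --dir saveDir --tag_set serve --signature_def action
```

The listed key can then be passed to the converter instead of the default 'serving_default'.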

@ebrevdo
Contributor

ebrevdo commented May 4, 2020 via email

@ebrevdo
Contributor

ebrevdo commented May 4, 2020 via email

@dvdhfnr

dvdhfnr commented May 4, 2020

Great. Thanks.

tflite_convert --saved_model_dir saveDir --enable_v1_converter --saved_model_signature_key action --output_file out.tflite --allow_custom_ops
seems to work for the conversion.

(Still need to investigate if this tflite model runs as expected on the Android device. I will try to report back.)

Thanks.

@maslovay

maslovay commented May 10, 2020

@dvdhfnr how are things going with running your TF-Agents-trained network on Android? I'm getting this error:

"RuntimeError: Encountered unresolved custom op: BroadcastArgs.Node number 0 (BroadcastArgs) failed to prepare."

The case is described here: https://stackoverflow.com/questions/61715154/tflite-model-load-error-runtimeerror-encountered-unresolved-custom-op-broadca

@ebrevdo

@ebrevdo
Contributor

ebrevdo commented May 10, 2020

@jdduke @raziel any suggestions?

@dvdhfnr

dvdhfnr commented May 11, 2020

When converting with the "--allow_custom_ops" flag, you need to implement the ops that are not supported by TFLite yourself: see e.g. https://www.tensorflow.org/lite/guide/ops_custom

Try converting without "--allow_custom_ops". You will then see a list of the unsupported ops. Unfortunately, it seems we will have to implement those ourselves.

@maslovay

@dvdhfnr you are right; the problem is these ops:

Exception: <unknown>:0: error: loc(fused["Deterministic_1/sample/BroadcastArgs@__inference_action_11129549", "StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/Deterministic_1/sample/BroadcastArgs"]): 'tf.BroadcastArgs' op is neither a custom op nor a flex op
<unknown>:0: error: loc(fused["ActorDistributionNetwork/TanhNormalProjectionNetwork/MultivariateNormalDiag/shapes_from_loc_and_scale/prefer_static_broadcast_shape/BroadcastArgs@__inference_action_11129549", "StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/ActorDistributionNetwork/TanhNormalProjectionNetwork/MultivariateNormalDiag/shapes_from_loc_and_scale/prefer_static_broadcast_shape/BroadcastArgs"]): 'tf.BroadcastArgs' op is neither a custom op nor a flex op
<unknown>:0: error: loc(fused["Deterministic_1/sample/BroadcastArgs_1@__inference_action_11129549", "StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/Deterministic_1/sample/BroadcastArgs_1"]): 'tf.BroadcastArgs' op is neither a custom op nor a flex op
<unknown>:0: error: loc(fused["Deterministic_1/sample/BroadcastTo@__inference_action_11129549", "StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/StatefulPartitionedCall/Deterministic_1/sample/BroadcastTo"]): 'tf.BroadcastTo' op is neither a custom op nor a flex op
<unknown>:0: error: failed while converting: 'main': Ops that can be supported by the flex runtime (enabled via setting the -emit-select-tf-ops flag): BroadcastArgs,BroadcastArgs,BroadcastArgs,BroadcastTo.
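As the error message hints, the converter can be told to fall back to the TF kernels for ops it lacks (the "flex" path, i.e. -emit-select-tf-ops), at the cost of bundling the TF Select delegate into the Android app. A sketch, with a hypothetical tf.Module standing in for a real PolicySaver export:

```python
import tensorflow as tf

# Stand-in SavedModel; a real PolicySaver export would be used instead.
class Stub(tf.Module):
    @tf.function(input_signature=[tf.TensorSpec([1, 4], tf.float32)])
    def action(self, obs):
        return tf.argmax(obs, axis=-1)

stub = Stub()
tf.saved_model.save(stub, 'flex_policy', signatures={'action': stub.action})

converter = tf.lite.TFLiteConverter.from_saved_model(
    'flex_policy', signature_keys=['action'])
converter.target_spec.supported_ops = [
    tf.lite.OpsSet.TFLITE_BUILTINS,  # use built-in TFLite kernels where possible
    tf.lite.OpsSet.SELECT_TF_OPS,    # fall back to the TF runtime (flex ops)
]
tflite_model = converter.convert()
```

On the app side, this model additionally needs the `tensorflow-lite-select-tf-ops` dependency alongside the standard TFLite runtime.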

@dvdhfnr

dvdhfnr commented May 12, 2020

Currently, I am using the following pipeline:

import tensorflow as tf
from tf_agents.policies.policy_saver import PolicySaver

policy_saver = PolicySaver(policy)
policy_saver.save('tmp')
converter = tf.lite.TFLiteConverter.from_saved_model('tmp', signature_keys=["action"])
tflite_policy = converter.convert()

Since I am actually not interested in saving the policy to a file, I tried to replace the save and from_saved_model calls with

converter = tf.lite.TFLiteConverter.from_concrete_functions([policy_saver._signatures['action'].get_concrete_function()])

I noticed that this changes the order of the input tensors. Do I need to take care of other side effects, or is this method safe to use? Moreover, do I need to use PolicySaver at all, or can I just directly create a concrete function ('action') and convert from that?
(The PolicySaver code looks quite sophisticated, so I cannot fully get an overview of what is done and why.)

Thanks for your comments!

@ebrevdo
Contributor

ebrevdo commented Apr 17, 2021

There is now a unit test showing how to use PolicySaver with the TFLite converter in policy_saver_test.py. Does it help?

@soldierofhell

Hi @ebrevdo,
There's a short note in the code:

# TODO(b/111309333): Remove this when `has_input_fn_and_spec`
# is `False` once TFLite has native support for RNG ops, atan, etc.

I guess this "native support for RNG ops, atan, etc." relates to unsupported BroadcastArgs and BroadcastTo ops.
Could you please provide more details what is the root cause of the problem (e.g. where are those broadcast coming from)? Maybe it's possible to change something in tf_agents code? Or maybe we can somehow contribute to improve something on TFLite side?
Thanks in advance,
Regards,

@ebrevdo
Contributor

ebrevdo commented Jun 17, 2021

This has nothing to do with TF-Agents; it depends on the TFLite team. @jdduke FYI. Is there a relevant issue open on TF's side?

@ebrevdo
Contributor

ebrevdo commented Jun 17, 2021

I'm not sure where the BroadcastArgs are coming from; possibly from TF Probability? Here's where we use broadcast_to, but I don't think these are the real places it's coming from. Probably it comes from a library we're using, as I mentioned.

@jdduke
Member

jdduke commented Jun 17, 2021

@thaink is actively working to support this. I'm not sure if there's a corresponding TF issue, but we do have an internal issue tracking this.

@thaink
Member

thaink commented Jun 18, 2021

@ebrevdo I think the BroadcastArgs may come from using broadcast_to on a dynamic tensor.
I am working on supporting BroadcastArgs now.
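A small illustration of where such ops arise: broadcasting between shapes that are only known at run time goes through BroadcastArgs/BroadcastTo nodes in the graph. The function below is purely illustrative, not taken from TF-Agents:

```python
import tensorflow as tf

@tf.function(input_signature=[tf.TensorSpec([None, 1], tf.float32),
                              tf.TensorSpec([None, 4], tf.float32)])
def broadcast_add(x, y):
    # With dynamic (None) dimensions, the broadcast shape must be computed at
    # run time: tf.broadcast_dynamic_shape lowers to the BroadcastArgs op, and
    # tf.broadcast_to to BroadcastTo -- the two ops the converter rejected.
    shape = tf.broadcast_dynamic_shape(tf.shape(x), tf.shape(y))
    return tf.broadcast_to(x, shape) + y
```

With fully static shapes the same arithmetic would be folded at conversion time and no BroadcastArgs node would appear.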

@soldierofhell

Thanks guys, please leave a comment here when BroadcastArgs becomes available.

@soldierofhell

@thaink any ETA for this BroadcastArgs issue? :)

@thaink
Member

thaink commented Jun 28, 2021

Unfortunately, it is still under review.

@thaink
Member

thaink commented Jul 8, 2021

@soldierofhell BroadcastArgs has been added to the master branch.
You can try it using the nightly build now.

@windmaple

I can convert the model now. Thanks to @thaink for the work.

@jdduke jdduke removed their assignment Aug 30, 2021
8 participants