Allow dynamic allocation of GPU memory #5

somerandomguyontheweb · 2019-07-04T14:18:24Z

Hi again,

I thought this might be worth a separate ticket: when running on GPU, all available memory is allocated up front, but the TensorFlow model of BERT may not actually need it. This should be simple enough to configure. For example, in the Java API, the following code did the trick for me (replacing this line):

        // Let TensorFlow grow its GPU allocation on demand instead of reserving all memory up front.
        ConfigProto configProto = ConfigProto.newBuilder()
                .setAllowSoftPlacement(true)
                .setGpuOptions(GPUOptions.newBuilder()
                                .setAllowGrowth(true)
                                .build())
                .build();
        // Load the SavedModel with the custom session config, passed as a serialized proto.
        SavedModelBundle bundle = SavedModelBundle.loader(path.toString())
                .withTags("serve")
                .withConfigProto(configProto.toByteArray())
                .load();

        return new Bert(bundle, model, path.resolve("assets").resolve(VOCAB_FILE));

Similarly, in the Python API, it should be possible to start the TF session with an appropriately configured ConfigProto.
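For the Python side, something along these lines might work. This is only a rough sketch, not the project's actual code: `build_session_config` is a hypothetical helper name, and I'm assuming a TensorFlow version that exposes the 1.x-style API under `tf.compat.v1` (1.13 or later). Where exactly the session gets created in the Python package is also an assumption.

```python
def build_session_config():
    """Return a ConfigProto that lets TensorFlow grow GPU memory on demand.

    Hypothetical helper, mirroring the Java settings: soft placement on,
    GPU allow_growth on.
    """
    # Imported inside the function so that importing this helper does not
    # itself require TensorFlow to be installed.
    import tensorflow as tf

    tf1 = tf.compat.v1  # assumption: TensorFlow >= 1.13
    gpu_options = tf1.GPUOptions(allow_growth=True)
    return tf1.ConfigProto(allow_soft_placement=True, gpu_options=gpu_options)


if __name__ == "__main__":
    import tensorflow as tf

    config = build_session_config()
    # Pass the config when the session is created; the model would then be
    # loaded and run inside this session.
    with tf.compat.v1.Session(config=config) as session:
        print(session.list_devices())
```

With `allow_growth` set, TensorFlow should start with a small GPU allocation and extend it as the model needs more, rather than claiming the whole device at startup.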

Thanks

robrua · 2019-07-11T03:11:27Z

This sounds good to me. I'll add this for both Python and Java next time I do some work on this project, or feel free to send a PR.
