Enable TensorFloat32 with XLA #44887

n2cholas · 2020-11-15T22:39:23Z

System information

TensorFlow version (you are using): 2.5.0-dev20201115
Are you willing to contribute it (Yes/No): No (don't know how to)

Describe the feature and the current behavior/state.
Currently, TF32 works in the normal run time but not with XLA.

Will this change the current api? How?
No.

Who will benefit with this feature?
Anyone using Ampere GPUs and XLA compilation.

Any Other info.
I checked the TF32 was not being used with XLA using the following script:

import tensorflow as tf
tf.debugging.set_log_device_placement(True)
tf.config.experimental.enable_tensor_float_32_execution(True)

@tf.function(experimental_compile=True)
def f(x):
    return x @ x
with tf.device("/GPU:0"):
    x = tf.cast(tf.random.uniform(shape=(16,16)), tf.float32)
f(x)

I ran this on an A100 with dlprof. With experimental_compile=False, the following kernel ran:

With experimental_compile=False and tf.config.experimental.enable_tensor_float_32_execution(False), the following kernel ran:

With experimental_compile=True and tf.config.experimental.enable_tensor_float_32_execution(True), the same kernel ran:

This indicates to me that the TF32 is not enabled with XLA. Please let me know if I'm doing something wrong. Thanks!

The text was updated successfully, but these errors were encountered:

n2cholas added the type:feature Feature requests label Nov 15, 2020

google-ml-butler bot assigned ravikyram Nov 15, 2020

n2cholas mentioned this issue Nov 15, 2020

Automatically doing TensorFloat32 Math on Ampere GPUs google/jax#4873

Closed

ravikyram added the comp:xla XLA label Nov 17, 2020

ravikyram assigned ymodak and unassigned ravikyram Nov 17, 2020

ymodak assigned r4nt and unassigned ymodak Nov 17, 2020

ymodak added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Nov 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable TensorFloat32 with XLA #44887

Enable TensorFloat32 with XLA #44887

Enable TensorFloat32 with XLA #44887

Enable TensorFloat32 with XLA #44887

Comments