Enable TensorFloat32 with XLA #44887
Labels
comp:xla
XLA
stat:awaiting tensorflower
Status - Awaiting response from tensorflower
type:feature
Feature requests
System information
2.5.0-dev20201115
Describe the feature and the current behavior/state.
Currently, TF32 works in the normal run time but not with XLA.
Will this change the current api? How?
No.
Who will benefit with this feature?
Anyone using Ampere GPUs and XLA compilation.
Any Other info.
I checked the TF32 was not being used with XLA using the following script:
I ran this on an A100 with dlprof. With
experimental_compile=False
, the following kernel ran:With
experimental_compile=False
andtf.config.experimental.enable_tensor_float_32_execution(False)
, the following kernel ran:With
experimental_compile=True
andtf.config.experimental.enable_tensor_float_32_execution(True)
, the same kernel ran:This indicates to me that the TF32 is not enabled with XLA. Please let me know if I'm doing something wrong. Thanks!
The text was updated successfully, but these errors were encountered: