Custom Gradients in TensorFlow

TensorFlow defines deep learning models as computational graphs, where the nodes are called ops (short for operations) and the data flowing between ops are called tensors. Given a graph of ops, TensorFlow uses automatic differentiation to compute gradients. Automatic differentiation rests on the observation that every numeric computation is composed of a finite set of elementary operations whose gradients are well defined. In TensorFlow, each op must therefore have a well-defined gradient for automatic differentiation to work properly.
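For concreteness, here is a minimal sketch of automatic differentiation over a graph, assuming the TensorFlow 1.x graph API (tf.constant, tf.gradients, tf.Session); the names x and y are illustrative:

```python
import tensorflow as tf

# Build a tiny graph: y = x^2, composed from the elementary op Square.
x = tf.constant(3.0)
y = tf.square(x)

# tf.gradients walks the graph backwards, chaining each op's
# registered gradient function (here, Square's built-in gradient).
dy_dx = tf.gradients(y, [x])

with tf.Session() as sess:
    print(sess.run(dy_dx))  # [6.0], i.e. 2 * x at x = 3
```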

When adding new ops in TensorFlow, you must use tf.RegisterGradient to register a gradient function, which computes gradients with respect to the op's input tensors given gradients with respect to the op's output tensors. For example, suppose we have an operation Square that computes the square of its input. Its forward pass and backward pass are defined as follows:

Forward: $y = x^2$
Backward: $\frac{\partial y}{\partial x} = 2x$
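A minimal sketch of how this might be wired up, assuming the TF 1.x API. Since defining a brand-new op in C++ is out of scope here, the sketch exercises the same registration mechanism by overriding the gradient of the built-in Square op: tf.RegisterGradient registers the backward function under a name of our choosing (the name "CustomSquareGrad" below is arbitrary), and gradient_override_map routes Square through it:

```python
import tensorflow as tf

@tf.RegisterGradient("CustomSquareGrad")
def _custom_square_grad(op, grad):
    """Gradient for Square: given dL/dy, return dL/dx = dL/dy * 2x."""
    x = op.inputs[0]
    return grad * 2.0 * x

g = tf.Graph()
with g.as_default():
    x = tf.constant(3.0)
    # Route the built-in Square op through our registered gradient.
    with g.gradient_override_map({"Square": "CustomSquareGrad"}):
        y = tf.square(x)
    dy_dx = tf.gradients(y, x)[0]

with tf.Session(graph=g) as sess:
    print(sess.run(dy_dx))  # 6.0 == 2 * 3.0
```

Note the signature: the registered function receives the op (from which it can read op.inputs) and the incoming gradient with respect to the op's output, and it returns one gradient per input, applying the chain rule.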

……

Source: https://uoguelph-mlrg.github.io/tensorflow_gradients/
