code-points/tensorflow tips at master · AlbertFutureLab/code-points · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
1. 关于使用clip来计算梯度：
  1.1 compute_gradients()
  1.2 clip_by_global_norm()
  1.3 apply_gradients()
2. clip用来控制梯度爆炸和梯度消失问题，dropout用来控制遗忘率
3. 在save和restore的时候变量的shape和type必须一样
4. 使用interactiveSession类，可以更加灵活地构建代码。可以不需要再运行图之前创建好图。
5. elements in tensor:tf.Variable   tf.constant   tf.placeholder  tf.SparseTensor
6. tf.rank返回的是tensor的维度，常量的话维度为0
7. 在reshape中-1代表剩下的；cast函数可以改变tensor的datatype
8. tensor.eval()only当session被激活的时候才起作用。Tensor.eval returns a numpy array with the same contents as the tensor.
9. 关于tf的variables：
  9.1 my_variable = tf.get_variable("my_variable", [1, 2, 3]) 创建变量，default: dtype=tf.float32, initial: tf.glorot_uniform_initializer.
  9.2 tf.getvariable(变量名, 变量类型，初始化函数, collections函数)
  9.3 不使一个变量trainable的办法：加入到tf.GraphKeys.LOCAL_VARIABLES中，或者使用trainable=False
  9.4 加入和读取collection中的变量：tf.add_to_collection() tf.get_collection()
10. 关于设备布置：
  10.1 with tf.device("/device:GPU:1"):
11. 关于变量初始化：
  11.1 Most high-level frameworks tf.contrib.slim, tf.estimator.Estimator and Keras automatically initialize variables for
  you before training a model.
  11.2 tf.global_variables_initializer() would initialize all variables in tf.GraphKeys.GLOBAL_VARIABLES collection.
       此函数中并不指定初始化的顺序，所以如果变量之间有依赖关系，可能会产生错误。
  11.3 initialize variable by yourself: session.run(my_variable.initializer)
  11.4 report unitialized_variables: print(session.run(tf.report_uninitialized_variables()))
  11.5 To assign a value to a variable, use the methods assign, assign_add, and friends in the tf.Variable class
12. 关于变量分享：
  TensorFlow supports two ways of sharing variables:
          Explicitly passing tf.Variable objects around.
          Implicitly wrapping tf.Variable objects within tf.variable_scope objects.
  12.1 trigger the variables sharing:
    reuse=True
    scope.resuce_variables()
13. For convenience, these functions will accept a tensor-like object in place of a tf.Tensor,
    and implicitly convert it to a tf.Tensor using the tf.convert_to_tensor method.
    tf.Tensor
    tf.Variable
    numpy.ndarray
    list (and lists of tensor-like objects)
    Scalar Python types: bool, float, int, str
14. about saving and restoring.
  Note that Estimators automatically saves and restores variables (in the model_dir).
15. restore variables:
    传入变量list或者ditc
16. chkp.print_tensors_in_checkpoint_file("/tmp/model.ckpt", tensor_name='', all_tensors=True) 打印
    保存的变量值
17. check thee devices:
  sess = tf.Session(config=tf.ConfigProto(log_device_placement=True))
18. two ways to limit gpu memory:
  18.1 config = tf.ConfigProto()
       config.gpu_options.allow_growth = True
       session = tf.Session(config=config, ...)
  18.2 config = tf.ConfigProto()
        config.gpu_options.per_process_gpu_memory_fraction = 0.4
        session = tf.Session(config=config, ..)
19. embeddings:
    word_embeddings = tf.get_variable(“word_embeddings”, [vocabulary_size, embedding_size])
    embedded_word_ids = tf.nn.embedding_lookup(word_embeddings, word_ids)
20. 双向rnn：
  inputs: The RNN inputs. If time_major == False (default), this must be a tensor of shape: [batch_size, max_time, ...], or a nested tuple of such elements. I
  if time_major == True, this must be a tensor of shape: [max_time, batch_size, ...], or a nested tuple of such elements.
  inputs: The RNN inputs. If time_major == False (default), this must be a tensor of shape: [batch_size, max_time, ...], or a nested tuple of such elements. If time_major == True, this must be a tensor of shape: [max_time, batch_size, ...], or a nested tuple of such elements.
21. tf.shape(tf.expand_dims(t2, 2))  # [2, 3, 1, 5] 在第2个地方插入一个维度
22. cnn中的一些tips:
  The inputs are 4 dimensional and are of form: [batch_size, image_rows, image_cols, number_of_colors]
  strieds=[1,x,x,1] 第一个是不跳过任何batch,最后一个是depth=1
23. 'VALID' means without padding, 'SAME' means with zeros padding.
24. tf.nn.conv2d(input, filter, strides, padding,
  tf.nn.conv2d(input, filter, strides, padding, use_cudnn_on_gpu=None, data_format=None, name=None)
  See the guide: Neural Network > Convolution
  Computes a 2-D convolution given 4-D input and filter tensors.
  Given an input tensor of shape [batch, in_height, in_width, in_channels] and a filter / kernel tensor of shape
  [filter_height, filter_width, in_channels, out_channels]
25. using tf.summary.scalar to check the output lr or lese.
    using tf.summary.histogram to the gradients output and to the variables that holds your weights
    using tf.summary.merge_all to combine them into a single op that generates all the summary data
26. in NLP: the pooling always doing in the length of the sequence.
27. 如果padding = ‘VALID’, new_height = new_width = (W – F + 1) / S （结果向上取整）
    如果padding = ‘SAME’, new_height = new_width = W / S （结果向上取整）
    pooling和这个的效果相同。
28. 关于tf.estimator.Estimator:
  28.1 input_function: features: {key: value} key is feature name, value is the value of the feature.
  28.2