cs20_3-3

1. tf.data

相比 feed_in和placeholder的优势：数据的一些操作(比如shuffle/batch/repeat/map)集成在tf中，所以效率高速度快，而且属于high-level api，使用方便
tf.data.Dataset

输出一些dataset的types/shape以做sanity check

print(xxxdataset.output_types)			# >> (tf.float32, tf.float32)
print(xxxdataset.output_shapes)		       # >> (TensorShape([]), TensorShape([]))

有很多格式，就我做过大型视频和图像的经验，我推荐tf.data.TFRecordDataset(filenames)

一些基本的数据操作

# 准备数据
dataset = tf.data.TFRecordDataset([file1, file2, file3, ...])
# 数据操作
dataset = dataset.shuffle(1000)
dataset = dataset.repeat(100)
dataset = dataset.batch(128)
dataset = dataset.map(lambda x: tf.one_hot(x, 10)) #转化为 one-hot encoding
# 取数据
iterator = dataset.make_one_shot_iterator() # 一种获取iterator的方式，后面还有更通用的
X, Y = iterator.get_next() # 如果上面batch过，一次就是取一个batch,否则就是一个sample(x,y)

据Notes所说，tf.data比fedd_in和placehodler效率要高

一个非常的编程实践

iterator = tf.data.Iterator.from_structure(train_data.output_types,
                                           train_data.output_shapes)
img, label = iterator.get_next()

train_init = iterator.make_initializer(train_data)  # initializer for train_data
test_init = iterator.make_initializer(test_data)  # initializer for train_data

# ...
sess.run(train_init) # 系统会自动加载training set的img,label
# ...
sess.run(test_init) # 加载的是 testing set的 img,labels

# 最上面的 img, label = iterator.get_next() 完全不存在同名的冲突，因为为init的控制隔离

2. optimizer速记

对梯度做些特殊修改

# create an optimizer.
optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.1)

# compute the gradients for a list of variables.
grads_and_vars = optimizer.compute_gradients(loss, <list of variables>)

# grads_and_vars is a list of tuples (gradient, variable).  Do whatever you
# need to the 'gradient' part, for example, subtract each of them by 1.
subtracted_grads_and_vars = [(gv[0] - 1.0, gv[1]) for gv in grads_and_vars]

# ask the optimizer to apply the subtracted gradients.
optimizer.apply_gradients(subtracted_grads_and_vars)

让某些变量不参与计算梯度的过程
```
stop_gradient( input, name=None )
```
- 应用场景举例：
  - When you train a GAN (Generative Adversarial Network) where no backprop should happen through the adversarial example generation process.
  - The EM algorithm where the M-step should not involve backpropagation through the output of the E-step
手动对某些y=f(x)求偏导：
```
tf.gradients(
    ys,
    xs,
    grad_ys=None,
    name='gradients',
    colocate_gradients_with_ops=False,
    gate_gradients=False,
    aggregation_method=None,
    stop_gradients=None
)
```
- 应用场景举例
  
  Technical detail: This is especially useful when training only parts of a model. For example, we can use tf.gradients() to take the derivative G of the loss w.r.t. to the middle layer. Then we use an optimizer to minimize the difference between the middle layer output M and M + G. This only updates the lower half of the network.(冻结某些层，只训练一些层，比如说：fine-tune过程)

ZhiHu ：HaoZhang的知乎

GitHub：HaoZhang的GitHub

Gmail ：njuhaozhang@gmail.com

相关阅读:
Atitit orm 之道艾龙著 1. 一、ORM的由来 1 2. ORM的组成： 2 3. 常见的ORM框架： 3 4. 、ORM与数据持久化的关系 3 5. Atitit
Atitit 移动互联网产业维度 1. 移动互联网带来的模式变革 1 2. 从视窗到“苹果与机器人”，软件发展模式的颠覆 2 3. 第3章从X86到ARM，蚂蚁绊倒了大象 2 4. 第5
Atitit 装备工具分类 attilax总结艾龙著工具链体系武器与软件行业工具也是很近似的的。 1. 分类思维 1 1.1. 总分类：冷、热 1 1.2. 轻、重、大规模杀伤性 1
Atitit 区块链之道 attilax著艾龙著 1. 金融＝制度＋技术＋信息 1 2. 第一章可信的协议 1 3. 第二章引导未来：区块链经济七大设计原则 1 4. 第五章新商业
 Atitit 几大研发体系对比 StageGate体系 PACE与IPD体系敏捷开发体系 CMMI体系艾龙著 1. 3. 1.5：业界领先的研发管理体系简介 2 1 2. 《产品及生命周期
 Atitit 传感器之道 1. 视觉传感器摄像头 1 1.1. 一、光线传感器： 1 1.2. 二、距离传感器： 1 1.3. 第一种是震动传感器。 4 1.4. 第二种是声响传感
 Atitit 架构之道之可读性可维护性架构之道提升效率架构之道 attilax著艾龙著 1.1. Hybrid架构 1 1.2. 分层架构是使用最多的架构模式 Layers模式也称Tie
Atitit cko之道首席知识官之道 attilax著艾龙著 1. 2 2. 第 1 章知识管理到底是什么，有什么用／1 2 3. 1.1 知识管理全景／1 1.2 波士顿矩阵／3 1.2.
Atitit 提升效率降低技术难度与提升技术矛盾的解决方案 1. 问题 2 1.1. 高手喜欢技术挑战怎么办，但会提升技术难度导致新手不会用怎么办 2 2. 解决方案 2 2.1. 通过开会统
 Atitit 依赖管理之道 1. 概念依赖管理，是指在什么地方以什么形式引入外部代码。 1 1.1.1. 理解模块化和依赖管理： 1 1.2. 依赖管理，有三个层面。单一职责原则，协议对象引用，
原文地址：https://www.cnblogs.com/LS1314/p/10371038.html