Theano学习笔记（二）——逻辑回归函数解析

Theano学习笔记（二）——逻辑回归函数解析
有了前面的准备，可以用Theano实现一个逻辑回归程序，逻辑回归是典型的有监督学习。

为了形象，这里我们假设分类任务是区分人与狗的照片。

首先是生成随机数对象
[python] view plain copy
1. importnumpy
2. importtheano
3. importtheano.tensor as T
4. rng= numpy.random
数据初始化

有400张照片，这些照片不是人的就是狗的。

每张照片是28*28=784的维度。

D[0]是训练集，是个400*784的矩阵，每一行都是一张照片。

D[1]是每张照片对应的标签，用来记录这张照片是人还是狗。

training_steps是迭代上限。
[python] view plain copy
1. N= 400
2. feats= 784
3. D= (rng.randn(N, feats), rng.randint(size=N, low=0, high=2))
4. training_steps= 10000
[python] view plain copy
1. #Declare Theano symbolic variables
2. x= T.matrix("x")
3. y= T.vector("y")
4. w= theano.shared(rng.randn(feats), name="w")
5. b= theano.shared(0., name="b")
6. print"Initial model:"
7. printw.get_value(), b.get_value()
x是输入的训练集，是个矩阵，把D[0]赋值给它。

y是标签，是个列向量，400个样本所以有400维。把D[1]赋给它。

w是权重列向量，维数为图像的尺寸784维。

b是偏倚项向量，初始值都是0，这里没写成向量是因为之后要广播形式。
[python] view plain copy
1. #Construct Theano expression graph
2. p_1= 1 / (1 + T.exp(-T.dot(x, w) - b)) #Probability that target = 1
3. prediction= p_1 > 0.5 # Theprediction thresholded
4. xent= -y * T.log(p_1) - (1-y) * T.log(1-p_1) # Cross-entropy loss function
5. cost= xent.mean() + 0.01 * (w ** 2).sum()# The cost to minimize
6. gw,gb = T.grad(cost, [w, b]) #Compute the gradient of the cost
7. # (we shall return to this in a
8. #following section of this tutorial)
这里是函数的主干部分，涉及到3个公式

1.判定函数

${h_ heta }left( x ight) = frac{1}{{I + {e^{ - { heta ^T}X}}}}$

2.代价函数

3.总目标函数

$Jleft( heta ight) = - frac{1}{m}left[ {sumlimits_{i = 1}^m {left( {{y^{left( i ight)}}log {h_ heta }left( {{x^{left( i ight)}}} ight) + left( {1 - {y^{left( i ight)}}} ight)log left( {1 - {h_ heta }left( {{x^{left( i ight)}}} ight)} ight)} ight)} } ight] + frac{lambda }{2}sumlimits_{i = 1}^m {W_i^2}$

第二项是权重衰减项，减小权重的幅度，用来防止过拟合的。
[python] view plain copy
1. #Compile
2. train= theano.function(
3. inputs=[x,y],
4. outputs=[prediction, xent],
5. updates=((w, w - 0.1 * gw), (b, b -0.1 * gb)))
6. predict= theano.function(inputs=[x], outputs=prediction)
构造预测和训练函数。
[python] view plain copy
1. #Train
2. fori in range(training_steps):
3. pred,err = train(D[0], D[1])
4. print"Final model:"
5. printw.get_value(), b.get_value()
6. print"target values for D:", D[1]
7. print"prediction on D:", predict(D[0])
这里算过之后发现，经过10000次训练，预测结果与标签已经完全相同了。

Theano学习笔记（二）——逻辑回归函数解析
相关阅读:
一步步学习SPD2010--附录C--使用SP2010管理任务（5）--配置外部内容类型的权限
 一步步学习SPD2010--附录C--使用SP2010管理任务（4）--在Web应用程序层次限制SPD2010使用
 一步步学习SPD2010--附录C--使用SP2010管理任务（3）--在SP Server网站激活所有网站模板
 一步步学习SPD2010--附录C--使用SP2010管理任务（2）--创建网站集
 一步步学习SPD2010--附录C--使用SP2010管理任务（1）--安装SP Foundation2010
一步步学习SPD2010--附录C--使用SP2010管理任务
 一步步学习SPD2010--附录B--创建新的批准流程
 指数分布与泊松分布
 宋浩《概率论与数理统计》笔记---2.2.3、指数分布
 宋浩《概率论与数理统计》笔记---2.2.3、均匀分布
原文地址：https://www.cnblogs.com/anyview/p/5014642.html