matlab(3) Logistic Regression: 求cost 和gradient 求sigmoid的值

sigmoid.m文件

function g = sigmoid(z)
%SIGMOID Compute sigmoid functoon
% J = SIGMOID(z) computes the sigmoid of z.

g = zeros(size(z)); 初始化g ,z可以是一个数，一个向量或者一个矩阵

% ====================== YOUR CODE HERE ======================
% Instructions: Compute the sigmoid of each value of z (z can be a matrix, vector or scalar)
Ones = ones(size(z));
g = Ones./(Ones + exp((-1).*z)); 计算,g(z)的值域在[0,1]之间，符合概率的分布.

当z=0时，g=0.5; 当z<0时,g<0.5;当z>0时，g>0.5;

当z->-∞时，g->0; 当z->+∞时，g->1

z可以是一个数，一个向量或者是一个矩阵

% =============================================================

end

costFunction.m

function [J, grad] = costFunction(theta, X, y)
%COSTFUNCTION Compute cost and gradient for logistic regression
% J = COSTFUNCTION(theta, X, y) computes the cost of using theta as the
% parameter for logistic regression and the gradient of the cost
% w.r.t. to the parameters.

% Initialize some useful values
m = length(y); % number of training examples

% You need to return the following variables correctly
J = 0;
grad = zeros(size(theta)); %grad的维数与theta的一至

% ====================== YOUR CODE HERE ======================
% Instructions: Compute the cost of a particular choice of theta.
% You should set J to the cost.
% Compute the partial derivatives and set grad to the partial
% derivatives of the cost w.r.t. each parameter in theta
%
% Note: grad should have the same dimensions as theta
%

J(θ)的表达式

grad的表达式

J = 1/m*(-1*y'*log(sigmoid(X*theta)) - (ones(1,m)-y')*log(ones(m,1)-sigmoid(X*theta))); %logM是对矩阵的每个元素都是求log, exp(M)同样是表示对矩阵的每个元素求e的底

调用的函数参见上述函数sigmoid.m

grad = 1/m * (X' * (sigmoid(X*theta) - y));,

% =============================================================

end

%% ============ Part 2: Compute Cost and Gradient ============
% In this part of the exercise, you will implement the cost and gradient
% for logistic regression. You neeed to complete the code in
% costFunction.m

% Setup the data matrix appropriately, and add ones for the intercept term
[m, n] = size(X); %求x矩阵的维数

% Add intercept term to x and X_test
X = [ones(m, 1) X]; %X矩阵左侧加一列1,用来匹配常数量

% Initialize fitting parameters
initial_theta = zeros(n + 1, 1);

% Compute and display initial cost and gradient
[cost, grad] = costFunction(initial_theta, X, y); %参见上述文件costFunction.m

fprintf('Cost at initial theta (zeros): %f ', cost);
fprintf('Gradient at initial theta (zeros): ');
fprintf(' %f ', grad);

fprintf(' Program paused. Press enter to continue. ');
pause;

相关阅读:
VUE课程参考---2、VUE基本使用
VUE课程---1、VUE课程介绍
JS数组常用方法---3、pop方法使用及原理
JavaScript中数组元素删除的七大方法汇总
Stack的三种含义
JS数组常用方法---6、reverse方法
数据库水平切分的实现原理解析－－－分库，分表，主从，集群，负载均衡器
服务框架HSF分析之一容器启动
淘宝HSF服务的原理以及简单的实现
DAS 原文出自【比特网】

原文地址：https://www.cnblogs.com/yan2015/p/4826635.html