Coursera Machine Learning Notes (Weeks 1 and 2)
I. Types of Machine Learning
Machine learning algorithms are broadly divided into supervised learning, where the training data comes with known answers (e.g. regression and classification), and unsupervised learning, where it does not (e.g. clustering).
II. The Linear Regression Model
Hypothesis: $h_\theta(x) = \theta_0 + \theta_1 x$
Parameters: $\theta_0, \theta_1$
Cost function: $J(\theta_0, \theta_1) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$
Gradient descent: repeat until convergence { $\theta_j := \theta_j - \alpha \frac{\partial}{\partial \theta_j} J(\theta_0, \theta_1)$ } for $j = 0$ and $j = 1$
where $\alpha$ is the learning rate and $m$ is the number of training examples. Expanding the partial derivatives gives the concrete updates
$\theta_0 := \theta_0 - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)$ and
$\theta_1 := \theta_1 - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x^{(i)}$.
Notes:
1. Every parameter must be updated simultaneously: compute all the new $\theta_j$ values from the old ones before overwriting any of them. This is why the assignment code below copies theta into tempTheta first (a one-step sketch follows this list).
2. On the choice of the learning rate $\alpha$:
① If $\alpha$ is too small, gradient descent converges very slowly.
② If $\alpha$ is too large, gradient descent may fail to converge, and can even diverge.
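To make the simultaneous update concrete, here is a minimal Octave sketch of a single gradient-descent step for the single-variable model. The variable names match the assignment code below; the tiny data set is invented purely for illustration.

% One gradient-descent step for h_theta(x) = theta0 + theta1 * x.
% X is m x 2 with a leading column of ones; y is m x 1.
X = [1 1; 1 2; 1 3];            % toy data: intercept column plus one feature
y = [2; 3; 4];
theta = [0; 0];
alpha = 0.1;
m = length(y);

errors = X * theta - y;         % h_theta(x^(i)) - y^(i), an m x 1 vector
grad = (1 / m) * (X' * errors); % partial derivatives w.r.t. theta0 and theta1
theta = theta - alpha * grad;   % both parameters updated at once

Because grad is computed in full before theta is touched, both components are updated from the same old value of theta, which is exactly what note 1 requires.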
III. Programming Assignment
The assignment consists of the following 8 files:
warmUpExercise.m
plotData.m
gradientDescent.m
computeCost.m
gradientDescentMulti.m
computeCostMulti.m
featureNormalize.m
normalEqn.m
1.warmUpExercise.m
function A = warmUpExercise()
%WARMUPEXERCISE Example function in octave
%   A = WARMUPEXERCISE() is an example function that returns the 5x5 identity matrix

A = [];
% ============= YOUR CODE HERE ==============
% Instructions: Return the 5x5 identity matrix
%               In octave, we return values by defining which variables
%               represent the return values (at the top of the file)
%               and then set them accordingly.

A = eye(5);

% ===========================================
end
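Once the file is on the Octave path, the function can be exercised directly from the prompt:

% Quick check: A should be the 5x5 identity matrix.
A = warmUpExercise();
isequal(A, eye(5))   % prints ans = 1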
2.plotData.m
function plotData(x, y)
%PLOTDATA Plots the data points x and y into a new figure
%   PLOTDATA(x,y) plots the data points and gives the figure axes labels of
%   population and profit.

figure; % open a new figure window

% ====================== YOUR CODE HERE ======================
% Instructions: Plot the training data into a figure using the
%               "figure" and "plot" commands. Set the axes labels using
%               the "xlabel" and "ylabel" commands. Assume the
%               population and revenue data have been passed in
%               as the x and y arguments of this function.
%
% Hint: You can use the 'rx' option with plot to have the markers
%       appear as red crosses. Furthermore, you can make the
%       markers larger by using plot(..., 'rx', 'MarkerSize', 10);

plot(x, y, 'rx', 'MarkerSize', 10);
ylabel('Profit in $10,000s');
xlabel('Population of City in 10,000s');

% ============================================================
end
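A typical call, assuming the assignment's ex1data1.txt (comma-separated, first column population, second column profit) is in the working directory:

% Load the single-variable training data and plot it.
data = load('ex1data1.txt');   % assumed file name from the ex1 handout
x = data(:, 1);                % population of a city, in 10,000s
y = data(:, 2);                % profit of a food truck, in $10,000s
plotData(x, y);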
3.gradientDescent.m
function [theta, J_history] = gradientDescent(X, y, theta, alpha, num_iters)
%GRADIENTDESCENT Performs gradient descent to learn theta
%   theta = GRADIENTDESCENT(X, y, theta, alpha, num_iters) updates theta by
%   taking num_iters gradient steps with learning rate alpha

% Initialize some useful values
m = length(y); % number of training examples
J_history = zeros(num_iters, 1);

for iter = 1:num_iters

    % ====================== YOUR CODE HERE ======================
    % Instructions: Perform a single gradient step on the parameter vector
    %               theta.
    %
    % Hint: While debugging, it can be useful to print out the values
    %       of the cost function (computeCost) and gradient here.
    %

    tempTheta = theta;
    theta(1) = tempTheta(1) - alpha / m * sum(X * tempTheta - y);
    theta(2) = tempTheta(2) - alpha / m * sum((X * tempTheta - y) .* X(:,2));

    % ============================================================

    % Save the cost J in every iteration
    J_history(iter) = computeCost(X, y, theta);

end

end
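A full run on the single-variable data might look like the sketch below. The values alpha = 0.01 and 1500 iterations are the defaults the assignment suggests; treat them as assumptions if your handout differs.

% Run batch gradient descent on the single-variable data.
data = load('ex1data1.txt');   % assumed file name, as above
y = data(:, 2);
m = length(y);
X = [ones(m, 1), data(:, 1)];  % prepend the intercept column of ones
theta = zeros(2, 1);           % initialize theta0 and theta1 to zero
alpha = 0.01;                  % learning rate (assignment default)
num_iters = 1500;              % iteration count (assignment default)
[theta, J_history] = gradientDescent(X, y, theta, alpha, num_iters);

Plotting J_history against the iteration number is a quick convergence check: with a well-chosen alpha the cost should decrease on every iteration.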
4.computeCost.m
function J = computeCost(X, y, theta)
%COMPUTECOST Compute cost for linear regression
%   J = COMPUTECOST(X, y, theta) computes the cost of using theta as the
%   parameter for linear regression to fit the data points in X and y

% Initialize some useful values
m = length(y); % number of training examples

% You need to return the following variables correctly
J = 0;

% ====================== YOUR CODE HERE ======================
% Instructions: Compute the cost of a particular choice of theta
%               You should set J to the cost.

temp = sum((X * theta - y).^2);
J = 1 / (2*m) * temp;

% ============================================================
end
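A small hand-checkable test (toy numbers, not from the assignment data): with a perfect fit the cost is 0, and with theta = [0; 0] it is the sum of squared targets divided by 2m.

% Toy sanity check for computeCost.
X = [1 1; 1 2; 1 3];          % intercept column plus one feature
y = [1; 2; 3];
computeCost(X, y, [0; 1])     % perfect fit h(x) = x, so J = 0
computeCost(X, y, [0; 0])     % J = (1 + 4 + 9) / (2*3) = 7/3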
5.gradientDescentMulti.m
function [theta, J_history] = gradientDescentMulti(X, y, theta, alpha, num_iters)
%GRADIENTDESCENTMULTI Performs gradient descent to learn theta
%   theta = GRADIENTDESCENTMULTI(x, y, theta, alpha, num_iters) updates theta by
%   taking num_iters gradient steps with learning rate alpha

% Initialize some useful values
m = length(y); % number of training examples
J_history = zeros(num_iters, 1);

for iter = 1:num_iters

    % ====================== YOUR CODE HERE ======================
    % Instructions: Perform a single gradient step on the parameter vector
    %               theta.
    %
    % Hint: While debugging, it can be useful to print out the values
    %       of the cost function (computeCostMulti) and gradient here.
    %

    tempTheta = theta;
    for i = 1:size(X, 2)
        theta(i) = tempTheta(i) - alpha / m * sum((X * tempTheta - y) .* X(:,i));
    end

    % ============================================================

    % Save the cost J in every iteration
    J_history(iter) = computeCostMulti(X, y, theta);

end

end
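Because the update loops over every column of X, this version subsumes the two-parameter gradientDescent above. A sketch of a multi-variable run, assuming the assignment's ex1data2.txt (house size, number of bedrooms, price) and illustrative values for alpha and the iteration count:

% Multi-variable run: normalize the features first, then descend.
data = load('ex1data2.txt');   % assumed file name from the ex1 handout
X = data(:, 1:2);
y = data(:, 3);
m = length(y);
[X_norm, mu, sigma] = featureNormalize(X);
X = [ones(m, 1), X_norm];      % intercept column added after normalization
theta = zeros(3, 1);
[theta, J_history] = gradientDescentMulti(X, y, theta, 0.01, 400);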
6.computeCostMulti.m
function J = computeCostMulti(X, y, theta)
%COMPUTECOSTMULTI Compute cost for linear regression with multiple variables
%   J = COMPUTECOSTMULTI(X, y, theta) computes the cost of using theta as the
%   parameter for linear regression to fit the data points in X and y

% Initialize some useful values
m = length(y); % number of training examples

% You need to return the following variables correctly
J = 0;

% ====================== YOUR CODE HERE ======================
% Instructions: Compute the cost of a particular choice of theta
%               You should set J to the cost.

J = 1 / (2*m) * sum((X * theta - y).^2);

% =========================================================================
end
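This is the same vectorized expression used in computeCost; nothing in it depends on the number of features, so the single-variable version could simply delegate to it. An equivalent fully matrix form, shown here as an alternative sketch, replaces the elementwise square and sum with an inner product:

J = (X * theta - y)' * (X * theta - y) / (2 * m);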
7.featureNormalize.m
function [X_norm, mu, sigma] = featureNormalize(X)
%FEATURENORMALIZE Normalizes the features in X
%   FEATURENORMALIZE(X) returns a normalized version of X where
%   the mean value of each feature is 0 and the standard deviation
%   is 1. This is often a good preprocessing step to do when
%   working with learning algorithms.

% You need to set these values correctly
X_norm = X;
mu = zeros(1, size(X, 2));
sigma = zeros(1, size(X, 2));

% ====================== YOUR CODE HERE ======================
% Instructions: First, for each feature dimension, compute the mean
%               of the feature and subtract it from the dataset,
%               storing the mean value in mu. Next, compute the
%               standard deviation of each feature and divide
%               each feature by its standard deviation, storing
%               the standard deviation in sigma.
%
%               Note that X is a matrix where each column is a
%               feature and each row is an example. You need
%               to perform the normalization separately for
%               each feature.
%
% Hint: You might find the 'mean' and 'std' functions useful.
%

for i = 1:size(X, 2)
    mu(i) = mean(X(:, i));
    sigma(i) = std(X(:, i));
    X_norm(:, i) = (X(:, i) - mu(i)) / sigma(i);
end

% ============================================================
end
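One detail worth remembering: the same mu and sigma must be reused when normalizing a new example at prediction time, otherwise the learned theta no longer applies. A sketch, using the house features assumed above:

% Normalize the training features, then apply the SAME mu and sigma
% to any new example before predicting.
[X_norm, mu, sigma] = featureNormalize(X);
x_new = ([1650, 3] - mu) ./ sigma;   % e.g. a 1650 sq-ft, 3-bedroom house
price = [1, x_new] * theta;          % theta from the gradientDescentMulti run above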
8.normalEqn.m
function [theta] = normalEqn(X, y)
%NORMALEQN Computes the closed-form solution to linear regression
%   NORMALEQN(X,y) computes the closed-form solution to linear
%   regression using the normal equations.

theta = zeros(size(X, 2), 1);

% ====================== YOUR CODE HERE ======================
% Instructions: Complete the code to compute the closed form solution
%               to linear regression and put the result in theta.
%

% ---------------------- Sample Solution ----------------------

theta = (X' * X) \ (X' * y);

% -------------------------------------------------------------

% ============================================================
end
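Unlike gradient descent, the normal equation needs no learning rate, no iteration, and no feature scaling, so it can be run on the raw data. A sketch, again assuming ex1data2.txt:

% Closed-form solution on the raw (unnormalized) multi-variable data.
data = load('ex1data2.txt');               % same assumed file as above
X = [ones(size(data, 1), 1), data(:, 1:2)];
y = data(:, 3);
theta = normalEqn(X, y);
price = [1, 1650, 3] * theta;              % predict directly, no scaling needed

The backslash solve used here is generally preferred over explicitly inverting X' * X (e.g. pinv(X' * X) * X' * y, the form shown in the course videos) because it is more numerically stable. The trade-off against gradient descent is cost: solving the normal equations is roughly cubic in the number of features, so gradient descent wins when the feature count is very large.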