线性回归梯度下降法python实现网站首页 其他

线性回归梯度下降法python实现

coderzjy 2025-04-18 00:01:02

简介线性回归梯度下降法python实现

线性回归的公式：
$h_ heta(x) = heta_0 + heta_1 x_1 + heta_2 x_2 + dots + heta_n x_n$
其中， $h_ heta(x)$ 是预测值， $θ$ 是模型参数， $x_1, x_2, dots, x_n$ 是输入特征。

梯度下降的更新公式：
$heta_j = heta_j - alpha cdot frac{1}{m} sum_{i=1}^m left(h_ heta(x^{(i)}) - y^{(i)} ight) x_j^{(i)}$
其中， $heta_j$ 是第 $j$ 个参数， $α$ 是学习率， $m$ 是样本的数量， $h_ heta(x^{(i)})$ 是预测值， $y^{(i)}$ 是实际值， $x_j^{(i)}$ 是样本的第 $j$ 个特征。
线性回归的损失函数（均方误差）：

$sum_{i=1}^{m} left( h_ heta(x^{(i)}) - y^{(i)} ight)^2$

其中：

$J (θ)$ 是损失函数（成本函数），
$m$ 是样本的数量，
$h_ heta(x^{(i)})$ 是模型对第 $i$ 个样本的预测值，
$y^{(i)}$ 是第 $i$ 个样本的实际值。

通常，在梯度下降中，我们通过最小化损失函数来优化模型参数 $θ$ 。

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
def computeCost(X,y,theta):
    m = X.shape[0]
    part = np.power(((X@theta.T)-y),2)
    return np.sum(part)/(m*2)
def gradientDescent(X,y,theta,alpha,iterations):
    #副本
    temp = theta
    #样本数量
    m = X.shape[0]
    #特征数
    params = theta.shape[1]
    #记录每一次的成本变化
    costs = np.zeros(iterations)
    #迭代iterations次
    for i in range(iterations):
        error = X@theta.T-y
        for j in range(params):
            temp[0,j] = temp[0,j] - (alpha/m)*np.sum(np.multiply(error,X[:,j:j+1]))
        theta = temp
        costs[i] = computeCost(X,y,theta)
    return theta, costs
# 读取数据，dataframe
data = pd.read_csv('ex1data1.txt', header=None,names = ["population","profits"])
#插入一行1，作为线性回归函数的常数项
data.insert(0,"Ones",1)
print(data.head())
cols = data.shape[1]
#切片第二项也要用:，保持是dataframe
X = data.iloc[:,0:cols-1]
y = data.iloc[:,cols-1:cols]
#theta都初始化为0，np.array默认是行向量
theta = np.zeros((1,cols-1))
#将dataframe转化为np.array
X = X.to_numpy()
y = y.to_numpy()
print(X.shape)
print(y.shape)
print(theta.shape)
alpha = 0.01
iterations = 1000
theta, costs =gradientDescent(X,y,theta,alpha,iterations)
print(costs[iterations-1])
x = np.linspace(data.population.min(), data.population.max(),100)
f = theta[0,0] + theta[0,1] * x
fig = plt.figure()
#fig 为整个画图区域，ax为子图的集合
#当nrows=1或ncols=1时为一维数组，此时下标索引只有一个维度，否则是两个维度
fig,ax = plt.subplots(figsize=(12,8),nrows=1,ncols=2)
ax[0].plot(x, f, 'r', label='Prediction')
ax[0].set_title('Predicted Profit vs. Population Size')
ax[0].set_xlabel('population')
ax[0].set_ylabel('profit')
ax[0].scatter(data.population, data.profits, label='Traning Data')
ax[0].legend()
#np.arange(iterations) 的意思是 生成从 0 到 iterations-1 的整数数组。
#np.arange(start, stop, step, dtype) 用于生成等间隔的数组
ax[1].plot(np.arange(iterations),costs,label='Cost Function')
ax[1].set_title('Error vs. Training Epoch')
ax[1].set_xlabel('iterations')
ax[1].set_ylabel('cost')
ax[1].legend()
plt.show()

在这里插入图片描述

风语者！平时喜欢研究各种技术，目前在从事后端开发工作，热爱生活、热爱工作。

上一篇
使用Cline+deepseek实现VsCode自动化编程...

下一篇
图像质量评估：SSIM算法详解与Python实现...

站长推荐

U8W/U8W-Mini使用与常见问题解决
U8W/U8W-Mini使用与常见问题解决
分享几个国内免费的ChatGPT镜像网址(亲测有效)
分享几个国内免费的ChatGPT镜像网址(亲测有效)
stm32使用HAL库配置串口中断收发数据（保姆级教程）
stm32使用HAL库配置串口中断收发数据（保姆级教程）
QT多线程的5种用法，通过使用线程解决UI主界面的耗时操作代码，防止界面卡死。
QT多线程的5种用法，通过使用线程解决UI主界面的耗时操作代码，防止界面卡死。...
SpringSecurity实现前后端分离认证授权
SpringSecurity实现前后端分离认证授权

您现在的位置是：首页 >其他 >线性回归梯度下降法python实现网站首页其他

线性回归梯度下降法python实现

上一篇 使用Cline+deepseek实现VsCode自动化编程...

下一篇 图像质量评估：SSIM算法详解与Python实现...

站长推荐

上一篇
使用Cline+deepseek实现VsCode自动化编程...

下一篇
图像质量评估：SSIM算法详解与Python实现...