BP算法和梯度下降法的关系，什么是误差反向传播_随笔

BP算法和梯度下降法的关系，什么是误差反向传播

BP算法就是指如何更好地求出神经网络中每个参数关于损失函数的梯度，这个计算梯度的算法就是bp算法
梯度下降是指让损失函数每个参数的取值沿着梯度的负方向进行变化，从而使损失函数取最小值。

简单的就是，BP是求梯度，梯度下降是根据梯度改变参数。

误差反向传播就是为了更好地计算每个参数的梯度。

至于这个反向的意思，举有两个神经元的例子：

另一种观察角度：

当我们使用python实现反向传播时，就需要在前向传递时把a和z都存起来
代码

```python
def backprop(self, x, y):
        """Return a tuple ``(nabla_b, nabla_w)`` representing the
        gradient for the cost function C_x.  ``nabla_b`` and
        ``nabla_w`` are layer-by-layer lists of numpy arrays, similar
        to ``self.biases`` and ``self.weights``."""
        nabla_b = [np.zeros(b.shape) for b in self.biases]
        nabla_w = [np.zeros(w.shape) for w in self.weights]
        # feedforward
        activation = x
        activations = [x] # list to store all the activations, layer by layer
        zs = [] # list to store all the z vectors, layer by layer
        for b, w in zip(self.biases, self.weights):
            z = np.dot(w, activation)+b
            zs.append(z)
            activation = sigmoid(z)
            activations.append(activation)
        # backward pass
        delta = self.cost_derivative(activations[-1], y) * 
            sigmoid_prime(zs[-1])
        nabla_b[-1] = delta
        nabla_w[-1] = np.dot(delta, activations[-2].transpose())
        # Note that the variable l in the loop below is used a little
        # differently to the notation in Chapter 2 of the book.  Here,
        # l = 1 means the last layer of neurons, l = 2 is the
        # second-last layer, and so on.  It's a renumbering of the
        # scheme in the book, used here to take advantage of the fact
        # that Python can use negative indices in lists.
        for l in xrange(2, self.num_layers):
            z = zs[-l]
            sp = sigmoid_prime(z)
            delta = np.dot(self.weights[-l+1].transpose(), delta) * sp
            nabla_b[-l] = delta
            nabla_w[-l] = np.dot(delta, activations[-l-1].transpose())
        return (nabla_b, nabla_w)

欢迎分享，转载请注明来源：内存溢出

原文地址: http://outofmemory.cn/zaji/5680562.html

BP算法和梯度下降法的关系，什么是误差反向传播

发表评论

评论列表（0条）