deep learning notes: loss function