Logistic Loss And Convexity

Chris Tralie

The purpose of this notebook is to explore a very simple pathological case where logistic regression can get stuck in a local min if we don't use a convex loss function

Showing an example with only 4 data points that's nonconvex with squared loss

Do an animation of gradient descent, changing the initial point and the loss function used