Created by: kavyasrinet
Adding proximal gradient descent and test: prox_param = param - learning_rate * grad param = sign(prox_param) / (1 + learning_rate * l2) * max { |prox_param| - learning_rate * l1 , 0 }