I have some questions about Matlab ANN algorithm functions.
1) I have an example about ANN which is used trainscg as the train function .What does train function means ? Because we know that we can add so many layers and every layer has own activation function. I think trainscg function is just used help to minimize error in loss function (like cross-entropy) , right ? So it is not activation function.I think it is just an optimizer . But why matlab called it train function ?
I want to replicate this function in Python but I have no idea which function I should use instead of trainscg. How about ADAM or SGD ?
2) Is there any difference between tangent sigmoid activation function and sigmoid function activation function?