How to select the number of samples to train a Machine Learning algorithm?

Question

Jose Marques on 31 Jan 2019

0
Link

Direct link to this question

https://nl.mathworks.com/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm

Commented: Greg Heath on 4 Feb 2019

I working in a dataset of 12000 samples concerning about 5 years of an industrial process.

It is likely that during this time the plant has undergone changes (equipments, the performance drop itself, chemical products).

Is there a tool for identifying the best subset of this data? In my view, a temporal cut in the data could increase the quality of the models created.

3 Comments
Show 1 older commentHide 1 older comment

Jose Marques on 31 Jan 2019

Thanks for the comment!

The dataset has 426 inputs (I am using techniques for feature selection too).

I am using four algorithms to create the models: Regression Tree, Bagged Trees, SVM and Neural Networks.

Greg Heath on 4 Feb 2019

As a common sense rule of thumb I try to use at least 10 to 30 times as many training points as unknown parameters that have to be estimated.

In addition I use 10 to 20 sets of random initial weights.

I assume , of course, that you ave examined plots of the data to initialize your common sense.

Hope this Helps

Greg

Sign in to comment.

Sign in to answer this question.

Answer 1

BERGHOUT Tarek on 3 Feb 2019

0
Link

Direct link to this answer

https://nl.mathworks.com/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm#answer_359276

u can use deep belif networks ; they are the best for feature sellection and mapping; and train you network by driven chunks of data "by randomly chosing a pairs of (inputs,targets)" and in the same time pire attention to your approximation function you must keep your error function in its local minimam. deep belif nets depands on a set of stacked auto_encoders that allows to tune all the parameters of the networks with small amount of training data

https://www.youtube.com/watch?v=E2Mt_7qked0

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

How to select the number of samples to train a Machine Learning algorithm?

3 Comments
Show 1 older commentHide 1 older comment

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

How to select the number of samples to train a Machine Learning algorithm?

3 Comments Show 1 older commentHide 1 older comment

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

3 Comments
Show 1 older commentHide 1 older comment

0 Comments
Show -2 older commentsHide -2 older comments