GPU vs TPU training speed for timeseries data
Show older comments
Currently a NVIDIA P100 is being used for timeseries data training, however, as it goes, LTSM with a batch size of 1 just goes slow. Some can be relatively large datasets. I am looking at other options, either something newer than the P100, or using Google TPU's. Does anyone have experience with this, and if so, can you please let me know what type of performance and speed increases could be expected, or that you experienced, of your LSTM time series data training.
4 Comments
Joss Knight
on 5 Nov 2022
Why are you using a batch size of 1?
Walter Roberson
on 5 Nov 2022
"relatively large datasets"... probably running out of memory with larger batch size.
Robert
on 9 Nov 2022
Joss Knight
on 9 Nov 2022
For stateful sequence networks you can split sequences and MATLAB will manage the state propagation between minibatches. See the documentation This would be the best way to get good parallelism with long sequences. With only one sequence per mini-batch there's almost nothing useful the GPU can do to parallelize what is a fundamentally serial algorithm.
Answers (0)
Categories
Find more on Parallel and Cloud in Help Center and File Exchange
Products
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!