How to restart a CommunicatingJob using only the MATLAB workspaces?
5 views (last 30 days)
Show older comments
Alberto Brandl
on 11 Feb 2021
Commented: Alberto Brandl
on 12 Feb 2021
My code runs several jobs on an HPC using createCommunicatingJob and then assigning a task in it. After submission, the main code exit. It works nicely, however sometimes I ask for the wrong amount of time or RAM and some jobs do not save any results. The question is: I see that MATLAB creates a folder and a workspace with the same name of the Job, is it possible to load those workspaces and requeue the job with a different SubmitArguments, without doing it manually? I'd do it from the HPC manager but it is not possible to requeue a TIMEOUT job in my Slurm configuration.
0 Comments
Accepted Answer
Raymond Norris
on 11 Feb 2021
There's no automated process for reading the Job files and recreating the job. It'd be much easier to recreate the steps you've already run. For that reason, I'd suggest keeping all of this in a script and simply rerunning the script.
More Answers (0)
See Also
Categories
Find more on Cluster Configuration in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!