Failed reproducibility in parallel cluster validation
13 views (last 30 days)
Show older comments
Hello,
I'm writing a series of tests to validate our computing environment, and I'm following some instructions from here to write my cluster validation. I'm running into a very weird error when running the following code:
cluster = parcluster('Processes');
job1 = createCommunicatingJob(cluster, 'Type', 'spmd') ;
createTask(job1, @spmdIndex, 1, {});
submit(job1);
wait(job1);
out = fetchOutputs(job1)
When I run the script (by hitting F5), when it gets to this point it raises the following error:
Error using parallel.Job/fetchOutputs (line 1431)
Task with ID 8 returned 0 outputs but 1 were expected.
Error in N03_parallel_computing (line 74)
out = fetchOutputs(job1)
^^^^^^^^^^^^^^^^^^
However, if right after that, I select the code and run it with F9, then it runs without any error.
Additional information, before that I run this code:
job = createJob (cluster);
createTask(job, @sum, 1, {[1 1]});
submit(job);
wait(job);
out = fetchOutputs(job)
which also runs with no error.
I reproduced the same error in a different instance of Matlab installed in a different workstation. This is for R2024b.
Thank you,
Paulo
0 Comments
Answers (0)
See Also
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!