Failed reproducibility in parallel cluster validation

Hello,
I'm writing a series of tests to validate our computing environment, following some instructions from here for the cluster validation. I get a strange error when running the following code:
cluster = parcluster('Processes');
job1 = createCommunicatingJob(cluster, 'Type', 'spmd');
createTask(job1, @spmdIndex, 1, {});
submit(job1);
wait(job1);
out = fetchOutputs(job1)
When I run the whole script with F5, execution reaches this point and raises the following error:
Error using parallel.Job/fetchOutputs (line 1431)
Task with ID 8 returned 0 outputs but 1 were expected.
Error in N03_parallel_computing (line 74)
out = fetchOutputs(job1)
^^^^^^^^^^^^^^^^^^
However, if I then select the same code and run it with F9, it runs without any error.
Additional information: earlier in the same script I run this code:
job = createJob(cluster);
createTask(job, @sum, 1, {[1 1]});
submit(job);
wait(job);
out = fetchOutputs(job)
which also runs with no error.
I reproduced the same error with a separate MATLAB installation on a different workstation. This is on R2024b.
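For anyone trying to reproduce this, here is a minimal diagnostic sketch, not a fix. It assumes job1 is the communicating job created and waited on above, and that it is run just before the fetchOutputs call. It only reads standard task properties (ID, State, OutputArguments, ErrorMessage) to show which task came back without an output and whether it recorded an error:

```matlab
% Diagnostic sketch (assumption: job1 is the finished communicating job from above).
% Prints the state and number of returned outputs for every task in the job.
tasks = job1.Tasks;
for k = 1:numel(tasks)
    t = tasks(k);
    fprintf('Task %d: state=%s, outputs=%d\n', ...
        t.ID, t.State, numel(t.OutputArguments));
    % A task that failed on the worker stores its error message here.
    if ~isempty(t.ErrorMessage)
        fprintf('  error: %s\n', t.ErrorMessage);
    end
end
```

Comparing this output between the F5 run and the F9 run might show whether the task itself fails or simply returns nothing.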
Thank you,
Paulo
