Answered
I would like to train a smallish network using cpu cores in parallel rather than gpu as they are slower.
setenv CUDA_VISIBLE_DEVICES -1 when you first start MATLAB, assuming you are running everything locally. However, as a general...

10 months ago | 0

Answered
Faster three dimensional higher order interpolation?
Thanks for the request, it will help us prioritise future work. In the meantime, it is possible to write your own interpolation ...

10 months ago | 0

Answered
How to solve "Unexpected error calling cuDNN: CUDNN_STATUS_EXECUTION_FAILED." error?
The Ada GPU architecture is not supported in R2018b. You need to upgrade MATLAB. You should have received a warning about this w...

11 months ago | 0

| accepted

Answered
Will self-written exe application run on GPU on other PC?
Yes, if your other PC has a supported GPU your application will run on it. Isn't MATLAB great?!

11 months ago | 0

| accepted

Answered
Error at linking stage using mexcuda
Relocatable device code needs to be linked by nvcc using -dlink before it can be linked to host code using the C linker, so you'...

11 months ago | 0

| accepted

Answered
OCR returns slightly different results on different machines
This is expected for any highly optimized code like this. Even for two Intel machines, the core count will affect how operations...

12 months ago | 1

Answered
Will the MATLAB Answers community diminish/obsolete with the rise of AI-based chatbots?
You look like you are asking a question about how AI-assisted automation will change MATLAB Answers in the coming years and pote...

12 months ago | 3

Answered
The matlab mexw64 file generated by mexcuda cannot be executed in the standalone app generated by matlab('parallel.gpu.GPUDeviceManager.selected' cannot be detected))
MATLAB Compiler's dependency analyzer cannot detect your dependency on PCT. Either add the product manually or call something ex...

12 months ago | 0

Answered
Unable to perform assignment because the indices on the left side are not compatible with the size of the right side.
If you click on the line number in the editor next to where you create your function layer, you can put a breakpoint at the entr...

1 year ago | 0

| accepted

Answered
Using a "CUDAKernel" type object within a parfor loop
A CUDAKernel object cannot be serialized, as you've found, so you will need to construct it separately on each worker. However, ...

1 year ago | 2

| accepted

Answered
Performance drop on mobile RTX4080
The 4080 is a good 10x slower than the V100 in double precision so this doesn't surprise me - it is designed for workstation gra...

1 year ago | 0

Answered
Using experiment manager on single GPU
It does depend on the balance of CPU and GPU work in your experiment, but as a general rule parallel execution will gain you not...

1 year ago | 0

Answered
Memory issue with texture in mexCUDA compiled code
It looks like the syntax for your function |mxArrayToTexture_3D_float4| is incorrect. You are passing the pointer |cuArray| by v...

1 year ago | 0

Answered
host compiler failed but others all passed with 'coder.checkGpuInstall('full')'
MSVC 2022 was not supported by the NVIDIA CUDA compiler in R2022b. You either need to install MSVC 2019 (or 2017) or upgrade MAT...

1 year ago | 0

Answered
cannot set gpu option in MBPO for catpole example
I do not see any |UseDevice| property in the <https://uk.mathworks.com/help/reinforcement-learning/ref/rl.option.rloptimizeropti...

1 year ago | 1

Answered
How to perform Eigenvalue Decomposition e.g, eig() on multiple GPUs´╝č
eigendecomposition is a highly serial algorithm so that's why simple multi-process solutions aren't easy to find and why the GPU...

1 year ago | 0

Answered
Problem: Image segmentation of forest area using CNN and MATLAB's BLOCKPROC function.
In the documentation for semanticseg it says that the output is a categorical array. Converting to uint8 would normally work, b...

1 year ago | 0

Answered
Summing array elements seems to be slow on GPU
These are my results that I got on my (somewhat old) GeForce GTX 1080 Ti: CPU time: 16.1288 GPU time: 0.96266 If I change the...

1 year ago | 0

| accepted

Answered
Summing array elements seems to be slow on GPU
Why are you recomputing H and HU inside the loop? They do not change. If you remove the sum, because the results are never used ...

1 year ago | 1

Answered
Examples of GPU do not work
This isn't a demo it's a blog from 11 years ago, and unfortunately it's using syntax that was removed from MATLAB 9 years ago. I...

1 year ago | 1

Answered
Half precision using GPU
As pointed out, gpuArray does not support half. The main reason is that half is an emulated type only meaningful for deployment ...

1 year ago | 1

| accepted

Answered
How to uncompress data faster than using for loop
No, |arrayfun| and |cellfun| are just convenient ways of writing loops, they don't have any special magic and often cause certai...

1 year ago | 1

Answered
how can i use fitrgp() function for gpuarray?
|fitrgp| does not support gpuArray inputs, sorry.

1 year ago | 0

Answered
Trouble using one of my GPUs
Hi Rohan. You may have to consult the documentation for your toolkit. The Titan RTX is a Turing card and may post-date the most ...

1 year ago | 0

Answered
Error using gpuArray/norm Failed to initialize GPU cuSOLVER library.
|cuSolver| is an NVIDIA library that is included with your MATLAB installation. It makes no difference what versions of the CUDA...

1 year ago | 0

Answered
mexcuda how to declare a matrix with size N given in input
Hi Charlotte. If you just want to pass an ordinary scalar value to a mex function then use the ordinary Mex API rather than the ...

1 year ago | 0

Answered
How can we use Nadam optimizer in place of sgdm in training deep learning networks
You cannot do this using trainNetwork. You need to use a dlnetwork with a <https://uk.mathworks.com/help/deeplearning/ug/train-n...

1 year ago | 0

Answered
brain tumor code errors
Looks like your version of MATLAB predates the |OutputNetwork| training option, which was introduced in R2021b.

1 year ago | 0

| accepted

Answered
Deep learning with partitionable datastores on a cluster
This error message is incorrect. It should say that your datastore is not PartionableByIndex. This was fixed in R2022a. As long...

1 year ago | 0

| accepted

Answered
i need to utilize fully of my GPUs during network training!
It's hard to be sure from the info you provide but it looks like the filesystem is your bottleneck. If you cannot load the next ...

1 year ago | 0

Load more