Joss Knight

MathWorks

Last seen: 1 day ago | Active since 2013

Followers: 0   Following: 0

Although I cannot be contacted directly, if you would like to ask me a question all you have to do is mention "GPU" somewhere in your MATLAB Answers question.

Statistics

  • Guiding Light
  • 36 Month Streak
  • Knowledgeable Level 5
  • Revival Level 2
  • First Answer

Feeds

Answered
GPU memory usage for Hadamard product
I can't reproduce this. You say you have 48 GB of GPU memory available... have you checked this? Try running gpuDevice and lookin...

4 days ago | 0

Answered
Initializing LSTM which is imported using ONNX
This code is suspect: % initialize input arrays obs = dlarray(rand(obs_size),"BS"); state = dlarray(rand(state_size),"SBS"); ...

4 days ago | 0

Answered
trainnet gives training loss is NaN
Do your network weights contain NaNs? Try this: nansInMyNetwork = ~(all(cellfun(@allfinite, net.Learnables.Value)) && all(cellfu...

11 days ago | 0

Answered
how to use printf inside a CUDA kernel?
Just use it, and launch MATLAB from a terminal. On Linux, the output will appear in the terminal window. On Windows you will nee...

16 days ago | 0 | accepted

Answered
mexcuda compiler error: "__global__" does not apply here
You cannot call a function declared __global__ from another function declared __global__. Declare the second function as __devic...

29 days ago | 0 | accepted

Answered
CUSOLVER_STATUS_INTERNAL_ERROR with RTX 4080 Super in MATLAB 2022a C MEX Function
This is a known issue with CUDA 11.2, Ada cards, and cusolver. The solution is to upgrade MATLAB to R2023a or above, because it ...

29 days ago | 1 | accepted

Answered
CUSOLVER_STATUS_INTERNAL_ERROR with RTX 4080 Super in MATLAB 2022a C MEX Function
It's probably necessary for you to provide your code, since any number of things could have gone wrong prior to initializing cuso...

29 days ago | 0

Answered
why do I got such a low score when i use bench function in matlab2024a
Do check your computer's power settings, to make sure they are set to "best performance".

29 days ago | 0

Answered
Matab is very slow in processing figures: might it be caused by NVidia Optimus?
Check your laptop's power settings and make sure they are in 'best performance' mode.

1 month ago | 0 | accepted

Answered
Which is the difference between 'multi-gpu' and 'parallel-gpu' in 'trainingOption' function of the DeepLearning Toolbox?
The purpose of 'multi-gpu' is effectively to try to ensure you are using a local pool with numGpus workers, rather than needing ...

1 month ago | 1

Answered
MATLAB does not detect NVIDIA T400 GPU despite updated drivers
The 516 driver is too old for R2024a, which is on CUDA 12.2. Download the recommended 552 driver for your T400 from https://www....

1 month ago | 0

Answered
MATLAB GPU UBUNTU 20.04 compatibility CUDA driver
Try the 555 and 550 drivers recommended for your device available from https://www.nvidia.com/en-gb/drivers/, rather than relyin...

1 month ago | 0

Answered
Memory leak occurring when repeatedly performing matrix left division on GPU
Thank you very much for reporting this. This appears to be a bug in MATLAB's gpuArray support for overdetermined solves. For now...

2 months ago | 0 | accepted

Answered
How do I use multiple inputs in mexcuda?
You have passed in two inputs but you have a check if (nrhs!=1), so you are erroring because you don't have one input. You need ...

3 months ago | 1

Answered
Why training stops showing error (out of memory) during implementing transfer learning with pre-trained network despite having a laptop of well configuration?
You have a very small GPU I'm afraid. You need to lower your |MiniBatchSize| until it works. Also, set your |ExecutionEnvironmen...

3 months ago | 1 | accepted

Answered
Parallel Processing is better than buying another MATLAB licence?
There may be a misunderstanding here. Your typical license is to a user or a computer. That user, or that computer, can open as ...

3 months ago | 0

Answered
I can`t solve out this problem, there is always Output argument "varargout{2}" (and possibly others) not assigned a value in the execution with "dlarray/dlgradient" function.
Your mistake is |dlfeval(@dlgradient,...)|. You need to put your code that computes the loss and the loss gradients into a funct...

3 months ago | 0 | accepted

Answered
AlexNet (filter groups) cannot be well resolved inside dlfeval (dimension mismatch error)
Sorry about this. This is because you have an old version of alexnet (although yes, it shouldn't error). For now, please get you...

3 months ago | 0 | accepted

Answered
arrayfun vs loops again
I wish it were a cleverer answer, but I'm afraid that it's simply that MATLAB has been heavily optimized for |for| loops over th...

3 months ago | 3

Answered
Extreme memory usage when using gather
As others have worked out, it looks like the issue is the indexed assignment into u: u(TERRAIN:end,m) = abs(gather(head(ffield,...

4 months ago | 0 | accepted

Answered
How can i fix the error mismatch format when convert multiplication layer from LayerGraph to dlnetwork in prune and quantize network example
For one solution, replace the fully connected layers with convolution layers with filters the size of the input and num filters ...

4 months ago | 0

Answered
How can i fix the error mismatch format when convert multiplication layer from LayerGraph to dlnetwork in prune and quantize network example
That's an annoying limitation which is hopefully fixed in current releases. Is there any particular reason why you can't use |mu...

4 months ago | 0

Answered
Arrayfun/gpuArray CUDA kernel need to be able to remember previous steps
This is a bit too vague to answer. Without indexing, how can each subproblem retrieve its subset of the data? If you just mean i...

4 months ago | 0

Answered
Loop through list of vectors (ie, the rows of a matrix), applying same "simple" function to each one, on GPU? Should I use arrayfun somehow?
If your calculation is truly sequential then by definition you cannot parallelize along the sequence. But it sounds like you def...

4 months ago | 0

Answered
Parallel Computing in C when using Matlab Coder (parpool and gpuArray)
GPU Coder will generate CUDA code for you. It can even automatically vectorize |for| loops. For a multithreaded parallel for loo...

6 months ago | 1

Answered
Can parfor run a series of GPU programs simultaneously?
It looks like you just have a bug in your CUDAKernel implementation, probably accessing unallocated memory. This is putting the ...

7 months ago | 0

Answered
error of GPU, net = trainNetwork(datastore, lgraph, options);
You are using your display GPU for computation and it does not have enough capacity. Try disabling all hardware acceleration for...

7 months ago | 1 | accepted

Answered
How to train a sequence to classification network on GPU
This performance discrepancy is normal. Small sequence networks often cannot benefit from GPU parallelism, especially if they us...

7 months ago | 0

Answered
gpuArray large sparse arrays. Error codes: "CUSPARSE_INTERNAL_ERROR" / "UNKNOWN_ERROR"
Hi Joseph. It's hard to be definitive. There were some problems with cusparse and also Windows drivers when supporting the newes...

7 months ago | 0

Answered
NVIDIA A2 performance in Matlab R2023b is lousy
The A2's spec says its double precision performance is 140 GFLOPS vs the V100's 7 TFLOPS, so this is pretty much expected. Even ...

9 months ago | 3 | accepted
