How to represent gray scale images as affine subspaces?

4 views (last 30 days)

Show older comments

M on 23 Oct 2023

0
Link

Direct link to this question

https://ch.mathworks.com/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces

Edited: M on 11 Dec 2023

How to represent gray scale images as affine subspaces?

4 Comments
Show 2 older commentsHide 2 older comments

M on 24 Oct 2023

Hi @Walter Roberson do you have any idea please?

Walter Roberson on 27 Oct 2023

This is not a topic I know anything about.

Answers (4)

Image Analyst on 23 Oct 2023

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1339101

I don't know what you mean. What's the context? What do you mean by "model"? What do you mean by "affine subspaces"? Do you just want to warp or spatially transform the image?

imwarp

Spatial transformations Defining and applying custom transforms Steve on Image Processing

If you have any more questions, then attach your data and code to read it in with the paperclip icon after you read this:

TUTORIAL: How to ask a question (on Answers) and get a fast answer

1 Comment
Show -1 older commentsHide -1 older comments

M on 23 Oct 2023

Edited: M on 23 Oct 2023

@Image Analyst @Matt J In the attached paper they represented the images in Affine subspaces, I am asking generally if there is a popular method/code of representing the image in affine space.

My data is huge attached is a sample.

Matt J on 23 Oct 2023

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1339211

Edited: Matt J on 23 Oct 2023

Open in MATLAB Online

One way, I suppose would be to train an affine neural network with the Deep Learning Toolbox, e.g.,

layers=[imageinputLayer(120,160,1);
        convolution2dLayer([120,160],N) )
        regressionLayer];
XTrain=images;
YTrain=zeros(1,1,N,size(XTrain,4));
        
net=trainNetwork(XTrain, YTrain,  layers,.... )

but you would need a truly huge number of images and good regularization for the training to be well-posed. You should probably look at augmentedImageDatastore.

34 Comments
Show 32 older commentsHide 32 older comments

M on 24 Oct 2023

Edited: M on 24 Oct 2023

Open in MATLAB Online

@Matt J The weights of the CNN will provide equations for the best fitting N-dimensional affine subspace to your image data set.

How did you reach to this? what is the mathematics behind it?

net=trainNetwork(XTrain, YTrain, layers,.... )

What is the options that fit this NN?

@Matt J I tried to do the following, but I think what I have done is wrong

But I keep getting the following error: Number of observations in X and Y disagree.

Also, I still didnt get what is the relation between this and affine and grassmann space!!

layers=[    imageInputLayer([120 160 1], 'Normalization', 'rescale-zero-one')
convolution2dLayer([120,160],7200)
regressionLayer];
options = trainingOptions('adam', ...
    'MiniBatchSize',200, ...
    'MaxEpochs',10, ...
    'InitialLearnRate',1e-3, ...
    'LearnRateSchedule','piecewise', ...
    'LearnRateDropFactor',0.1, ...
    'LearnRateDropPeriod',20, ...
    'ValidationData', {images_Test, labels_Test}, ...
    'ValidationFrequency',200, ...
    'Shuffle','every-epoch', ...
    'Plots','training-progress');
Net = trainNetwork(XTrain, YTrain,  layers, options);

M on 24 Oct 2023

Edited: Matt J on 24 Oct 2023

@Matt J, Why did you replace the convolution2dLayer([120,160],N) by fullyConnectedLayer(N)

They do the same thing. You can use analyzeNetwork to verify that the activations are 1x1xN in both cases.

N is the number of image right?

N is the number of rows in the matrix equations A*x=b. So, if you are considering a Grassmanian manifold Gr(n,k) of k-dimensional affine subspaces of

then N=n-k.

Keep in mind that the total number of layer weights will be 120*160*N+N, so you mustn't be too liberal with your choice of N.

Do I have to train 3 neural nets, one for each class and pass the test image through these NN?

Yes. You must train 3 neural nets and each of these neural nets must be trained only with images in the corresponding class.

Also how can I test this neural net? Pass an image?

Yes. If the image is a member of the class that the NN is supposed to detect, the outputs should all be zero, or close to zero, depending on how well the test image agrees with the equations for the affine subspace, A*x(:)+b=0.

Matt J on 25 Oct 2023

Edited: Matt J on 25 Oct 2023

Open in MATLAB Online

Here's a working example of the plane fitting, but with a neural network. I had to turn of L2-regularization to get it to work. Not sure what that will mean for your real use case.

images=randn(3,2)*randn(2,1000) + randn(3,1); %3x1 images

images=reshape(images,3,1,1,[]);

N=1;

layers=[imageInputLayer([3,1,1],'Normalization','none')

fullyConnectedLayer(N,'WeightL2Factor',0,'BiasL2Factor',0 )

regressionLayer];

XTrain=images;

YTrain=zeros(1,1,N,size(XTrain,4));

options = trainingOptions('adam', ...

'MiniBatchSize',100, ...

'MaxEpochs',50, ...

'InitialLearnRate',1, ...

'LearnRateSchedule','piecewise', ...

'LearnRateDropFactor',0.1, ...

'LearnRateDropPeriod',20, ...

'ValidationFrequency',200, ...

'Shuffle','every-epoch');

close all force

net=trainNetwork(XTrain, YTrain, layers,options);

Training on single CPU. |========================================================================================| | Epoch | Iteration | Time Elapsed | Mini-batch | Mini-batch | Base Learning | | | | (hh:mm:ss) | RMSE | Loss | Rate | |========================================================================================| | 1 | 1 | 00:00:00 | 1.78 | 1.6 | 1.0000 | | 5 | 50 | 00:00:00 | 0.06 | 2.0e-03 | 1.0000 | | 10 | 100 | 00:00:00 | 8.13e-03 | 3.3e-05 | 1.0000 | | 15 | 150 | 00:00:00 | 1.83e-03 | 1.7e-06 | 1.0000 | | 20 | 200 | 00:00:00 | 9.71e-05 | 4.7e-09 | 1.0000 | | 25 | 250 | 00:00:01 | 1.20e-05 | 7.2e-11 | 0.1000 | | 30 | 300 | 00:00:01 | 1.44e-06 | 1.0e-12 | 0.1000 | | 35 | 350 | 00:00:01 | 3.81e-08 | 7.3e-16 | 0.1000 | | 40 | 400 | 00:00:01 | 6.86e-09 | 2.4e-17 | 0.1000 | | 45 | 450 | 00:00:01 | 6.34e-09 | 2.0e-17 | 0.0100 | | 50 | 500 | 00:00:01 | 5.71e-09 | 1.6e-17 | 0.0100 | |========================================================================================| Training finished: Max epochs completed.

A=net.Layers(2).Weights; s=norm(A);

A=A/s;

b=net.Layers(2).Bias/s;

norm(A*images(:,:)+b)

ans = single 2.7123e-06

p=planarFit.groundtruth(images(:,:), A,-b);

plot(p)

M on 26 Oct 2023

Edited: M on 26 Oct 2023

Open in MATLAB Online

@Matt J

because the norm of my real data is 1.3900e+05

And

| 200 | 2600 | 00:01:20 | 9.26 | 42.9 | 1.0000e-09 |

My real image which belong to the same class 2608 images are attached in the link without augmentation https://we.tl/t-hDRA25aaJ1

N=15;
 layers=[imageInputLayer([120 120 1],'Normalization', 'rescale-zero-one')
        fullyConnectedLayer(N,'WeightL2Factor',0,'BiasL2Factor',0 ) 
        regressionLayer];
YTrain=zeros(1,1,N,size(XTrain,4));
options = trainingOptions('adam', ...
    'MiniBatchSize',200, ...
    'MaxEpochs',200, ...
    'InitialLearnRate',1, ...
    'LearnRateSchedule','piecewise', ...
    'LearnRateDropFactor',0.1, ...
    'LearnRateDropPeriod',20, ...
    'ValidationFrequency',200, ...
    'Shuffle','every-epoch', ...
    'Plots','training-progress'); 
net=trainNetwork(XTrain, YTrain,  layers,options);
A=net.Layers(2).Weights; 
%s=norm(A);
%A=A/s;
b=net.Layers(2).Bias;

M on 23 Nov 2023

Hi @Matt J, I have a question please, Why did you decide her to use regressionLayer in your Network? And what is the advantage of using this layer? thanks

Matt J on 23 Nov 2023

As opposed to what? What else might we have used?

Matt J on 27 Oct 2023

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1342291

Edited: Matt J on 27 Oct 2023

Open in MATLAB Online

Well, in general, we can write the estimation of A,b as the norm minimization problem,

If X can be fit in RAM, you could just use svd() to solve it

N=14;
X=images(:,:)';
vn=vecnorm(X,inf,1);
[~,~,V]=svd([X./vn,  ones(height(X),1)] , 0);
Abt=V(:,end+1-N:end)./vn';
A=Abt(1:end-1,:)';
b=Abt(end,:)';
s=vecnorm(A,2,2);
[A,b]=deal(A./s, b./s);

43 Comments
Show 41 older commentsHide 41 older comments

Torsten on 5 Dec 2023

Edited: Torsten on 5 Dec 2023

So you say you have 3 classes for your images that are known right from the beginning.

Say you determine the affine subspace for each 49000 images that best represents your image and you compute the mutual distance by SVD between these 49000 affine subspaces (which would give a 49000x49000 matrix). Now say you cluster your images according to this distance matrix into 3 (similar) clusters derived from the distance matrix.

The question you should ask yourself is: would these three clusters resemble the 3 classes that you think the images belong to right at the beginning ?

If the answer is no and if you consider the 3 classes from the beginning as fixed, then the distance measure via SVD is not adequate for your application.

M on 6 Dec 2023

Edited: M on 6 Dec 2023

@Matt J unfortunately the pre-normalization, and increasing the number of N didnt improve the performance.

I sure that the problem in the indicitor (norm), other indicators may provide better results as Grassman Kernel and distance. because that's proved in the literature. but still I am looking how can I apply them

Matt J on 5 Dec 2023

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1366084

Edited: Matt J on 5 Dec 2023

Open in MATLAB Online

So how Can I get the direction and origins so I can compute the grassman distance?

Here is another variant that gives a Basis/Origin description of the subspace.

N=100; %estimated upper bound on subspace dimension
X=reshape(XTrain,[], size(XTrain,4)); %<----no tranpose
mu=mean(X,2); 
X=X-mu; 
[Q,~,~]=qr(X , 0);
Basis=Q(:,1:N); %Basis direction vectors
Origin=mu-Basis*(Basis.'*mu);  %Orthogonalize

18 Comments
Show 16 older commentsHide 16 older comments

Matt J on 6 Dec 2023

Edited: Matt J on 6 Dec 2023

Open in MATLAB Online

why did you decide to take into consideration only the first N columns of Q

The final columns of a QRP decomposition contribute the least to the decomposition, and might be attributable to noise. For example, here is a wide matrix X of rank N=2 as well as a noise-corrupted version, Xnoisy:

X=repmat(  rand(4,2)   ,1,4)
X = 4×8
    0.8430    0.7861    0.8430    0.7861    0.8430    0.7861    0.8430    0.7861
    0.7153    0.5335    0.7153    0.5335    0.7153    0.5335    0.7153    0.5335
    0.4414    0.4160    0.4414    0.4160    0.4414    0.4160    0.4414    0.4160
    0.0416    0.8772    0.0416    0.8772    0.0416    0.8772    0.0416    0.8772
Xnoisy=X + 1e-4*randn(size(X))
Xnoisy = 4×8
    0.8429    0.7862    0.8430    0.7861    0.8431    0.7862    0.8430    0.7861
    0.7155    0.5336    0.7153    0.5334    0.7151    0.5336    0.7152    0.5335
    0.4414    0.4159    0.4413    0.4160    0.4414    0.4160    0.4414    0.4159
    0.0418    0.8770    0.0415    0.8772    0.0417    0.8771    0.0416    0.8771

When we take the QRP decomposition of X, we see that the last two rows of R are essentially 0, which means that only linear combinations of the first 2 columns of Q are used in the decomposition to regenerate the columns of X. The final two columns of Q can be discarded. This makes sense, because we know that X is rank-2.

[Q,R,~]=qr(X,"econ"); R
R = 4×8
   -1.3583   -0.9308   -1.3583   -0.9308   -0.9308   -1.3583   -0.9308   -1.3583
         0   -0.7432   -0.0000   -0.7432   -0.7432   -0.0000   -0.7432   -0.0000
         0         0    0.0000    0.0000    0.0000    0.0000    0.0000    0.0000
         0         0         0   -0.0000   -0.0000    0.0000   -0.0000    0.0000

When we do this with Xnoisy, the final two rows are still quite small, but the effect of the noise is that they are not as close to zero as they should be. Therefore, the final two columns of Q need to be discarded based on some other criterion besides R(i,:)=0.

[Q,R]=qr(Xnoisy,  "econ"); R
R = 4×8
   -1.1912   -1.0617   -1.1911   -1.0615   -1.1911   -1.0617   -1.1911   -1.0616
         0    0.8472   -0.0003    0.8474   -0.0001    0.8473   -0.0002    0.8473
         0         0    0.0003    0.0000    0.0004   -0.0000    0.0003   -0.0000
         0         0         0   -0.0002   -0.0002   -0.0001   -0.0002   -0.0000

You don't necessarily have to choose N manually, though. You could have used some threshold on the diagonal values of R, e.g.,

N=find(abs(diag(abs(R))> 0.01*max(abs(R(:)))) ,1,'last')
N = 2

Matt J on 7 Dec 2023

Edited: Matt J on 7 Dec 2023

is there here a criteria for selcting the N final V columns of its SVD as you suggested for N initial columns of Q?

The dimension of the subspace that you fit to X will be NumPixels-N, where NumPixels is the total number of pixels in one image.

Also, Regarding Basis/Origin description, can you give me an idea how do we usually use this information for classification?

No, pursuing it was your idea. You said it would help you compute the Grassman distance, whatever that is.

M on 11 Dec 2023

Edited: M on 11 Dec 2023

Dear @Matt J thank you for your suggestions and clarifications.

I reached to a conclusion that representing the images as it is as vectors in a subspace is not a good idea!

Especially if the test images are not typical for the training set.(stacking the images as vectors causes problems!)

I think I have to do some feature extractions of region of interest first then representing the features as subspaces.(whether as matrices or vectors) , Still I am thinking how to do that.

Products

Image Processing Toolbox

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

How to represent gray scale images as affine subspaces?

4 Comments
Show 2 older commentsHide 2 older comments

Answers (4)

1 Comment
Show -1 older commentsHide -1 older comments

34 Comments
Show 32 older commentsHide 32 older comments

43 Comments
Show 41 older commentsHide 41 older comments

18 Comments
Show 16 older commentsHide 16 older comments

See Also

Categories

Tags

Products

Community Treasure Hunt

How to represent gray scale images as affine subspaces?

4 Comments Show 2 older commentsHide 2 older comments

Answers (4)

1 Comment Show -1 older commentsHide -1 older comments

34 Comments Show 32 older commentsHide 32 older comments

43 Comments Show 41 older commentsHide 41 older comments

4 Comments
Show 2 older commentsHide 2 older comments

1 Comment
Show -1 older commentsHide -1 older comments

34 Comments
Show 32 older commentsHide 32 older comments

43 Comments
Show 41 older commentsHide 41 older comments