
How can I make my neural network support any size of image input?

There are three levels of code writing for a vision-related deep learning task.
Highest level: build a complete layerGraph and train with the trainNetwork function.
Middle level: build a layerGraph without a loss layer; instead, calculate the loss and gradient in an eval function. You can also specify a custom learning-rate schedule. This level allows some customization while still exploiting the easy-to-use highest-level features.
Lowest level: this level has no concept of a layer. Coders have to manage the parameters themselves. Building and training a network this way is really messy and time-consuming.
My question is: the highest and middle levels both require a fixed input size, i.e., an imageInputLayer, but imageInputLayer only supports a fixed image size. I do not want to trouble myself with lowest-level coding, so how can I make my NN take inputs of any size?
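For reference, the "highest level" workflow described above might look like the following sketch (layer sizes, the datastore, and the training options are illustrative, not from this thread); note how imageInputLayer pins the spatial size:

```matlab
% Hypothetical example: imds and options would be defined elsewhere.
layers = [
    imageInputLayer([224 224 3])                  % input size is fixed here
    convolution2dLayer(3, 16, 'Padding', 'same')
    reluLayer
    fullyConnectedLayer(10)
    softmaxLayer
    classificationLayer];
% net = trainNetwork(imds, layers, options);
```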

Accepted Answer

Ryan Comeau
Ryan Comeau on 10 May 2020
I wish it were possible to just dump in images of multiple sizes as well. Unfortunately, each image size would yield convolution maps of a different size and number. How would it then make sense to pass these into a fully connected layer and fit those convolution maps? It would be like sorting oranges by size when half of your input oranges are apples; it would be a strange task.
There is, however, a solution to this problem: rescale your input images to your network's input size. This is one of the important preprocessing steps. Here is some code that could resize one of your images:
number_rows=200; % depends on the input size of your network
number_cols=300; % depends on the input size of your network
rescaled_image=imresize(img,[number_rows number_cols]); % img is your input image; avoid the name "image", which shadows a MATLAB built-in
It may seem unintuitive, but computers don't see the same way we do and the scale of things doesn't always matter.
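To apply this resizing across a whole training set without resizing files on disk, one option (a sketch assuming the Deep Learning Toolbox; the folder path is hypothetical) is an augmentedImageDatastore, which resizes each image on the fly as it is read:

```matlab
% 'pathToImages' is a placeholder for your image folder.
imds = imageDatastore('pathToImages', 'IncludeSubfolders', true, ...
    'LabelSource', 'foldernames');
inputSize = [200 300];  % match your network's imageInputLayer size
augimds = augmentedImageDatastore(inputSize, imds);
% net = trainNetwork(augimds, layers, options);  % layers/options defined elsewhere
```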
Hope this helps,
  1 Comment
Jacques Boutet de Monvel
Jacques Boutet de Monvel on 31 May 2022
If it is true that there is no way to feed an image of unprescribed size to a fully convolutional network, this is too bad! It misses one of the most attractive and elegant features of FCNs: the ability to process an image of any size in a seamless, translation-invariant way at prediction time, even when the network has been trained on much smaller image patches. This is very useful, and even crucial, for segmentation applications.
Why not implement this feature, at least to give users the choice? This is one thing that could (still) make MatConvNet more attractive than the MATLAB DL toolbox, despite all the latter's impressive features.
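As this comment notes, convolution itself has no fixed-size requirement. At the toolbox's lower dlarray level, the same convolution runs on any spatial size (a sketch; all sizes here are illustrative):

```matlab
% 3x3 filters, 1 input channel, 8 filters; bias has one entry per filter.
w = dlarray(randn(3,3,1,8));
b = dlarray(zeros(8,1));
imgSmall = dlarray(randn(64,64,1), 'SSC');   % spatial, spatial, channel
imgLarge = dlarray(randn(250,317,1), 'SSC');
ySmall = dlconv(imgSmall, w, b, 'Padding', 'same');
yLarge = dlconv(imgLarge, w, b, 'Padding', 'same');
% Both calls succeed; the output spatial size follows the input size.
```

This is essentially the "lowest level" the question hoped to avoid, which is why the fixed-size imageInputLayer constraint bites at the higher levels.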


More Answers (0)
