Video to Image Regression

Question

0 votes

Hello!

I have 32x32x256 (HeightxWidthXFrames) greyscale video data that I need to regress to a 32x32 image.

What is the ideal format for me to save my input data so it can be read into a NN? Is there an appropriate image format? (I have not been succesfull using .mat files in an image datastore)
Should I use a 2d or 3d ImageInputLayer? I intend to use a Unet architecture.

Thank you!

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Shashank Gupta on 6 Jul 2020

1 vote

Hi Michael,

Since your input to the model is a video data, it is appropriate to use 3D image datastore. Also Unet archtecture you intent to design will be a 3D architecture and in that case going for 3d imageInputLayer is prompt.

When we deal with high dimension data, It is always good choice to go with ".mat" data storage. In particularly your case, you can write a custom function in @ReadFcn property of datastore to read the ".mat" file.

I hope this helps you,

1 Comment
Show -1 older comments Hide -1 older comments

Michael Keeling on 9 Jul 2020

Thank you!

Sign in to comment.

Video to Image Regression

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

1 Comment
Show -1 older comments Hide -1 older comments

More Answers (0)

Categories

Products

Release

Tags

Community Treasure Hunt

Video to Image Regression

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

1 Comment Show -1 older comments Hide -1 older comments

More Answers (0)

Categories

Products

Release

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

1 Comment
Show -1 older comments Hide -1 older comments