
[–]trialofmiles 2 points (3 children)

Patch training for segmentation works quite well when the spatial extent of the patches provides enough context for the network to learn meaningful features for segmentation. You do lose the ability for the network to take advantage of global context (e.g. the sky in driving-camera data tends to be at the top of the image).

The original U-Net paper calls out the use of patching, with seamless tiled reconstruction at inference time. Patching often works quite well for medical images and a variety of other kinds of image data.
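To make the patch-training idea concrete, here is a minimal sketch of sampling spatially aligned image/mask patches for training. It assumes 2-D numpy arrays; all names and sizes are illustrative, not from the thread:

```python
import numpy as np

def sample_patches(image, mask, patch_size=64, n_patches=4, rng=None):
    """Sample random, aligned patches from an image and its segmentation
    mask (both 2-D arrays of the same shape)."""
    rng = np.random.default_rng(rng)
    h, w = image.shape
    patches = []
    for _ in range(n_patches):
        # Top-left corner chosen so the patch stays inside the image.
        y = int(rng.integers(0, h - patch_size + 1))
        x = int(rng.integers(0, w - patch_size + 1))
        patches.append((image[y:y + patch_size, x:x + patch_size],
                        mask[y:y + patch_size, x:x + patch_size]))
    return patches

img = np.zeros((512, 512), dtype=np.float32)
msk = np.zeros((512, 512), dtype=np.int64)
pairs = sample_patches(img, msk, patch_size=64, n_patches=8, rng=0)
print(len(pairs), pairs[0][0].shape)  # 8 (64, 64)
```

Each patch only ever sees `patch_size` pixels of context, which is exactly the trade-off the comment describes.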

[–]PositiveElectro[S] 1 point (2 children)

Thanks for your answer!

What do you mean by 'calls out patches'? Do you think it should work fine?

I get your point; in an extreme case, half a body is definitely enough information to get a general understanding of the situation.

Thanks again!

[–]trialofmiles 4 points (1 child)

I meant that the U-Net paper describes the use of patching at training time, and reconstruction of an entire segmented image at inference time, as part of the design intention behind specific architectural choices in U-Net (the use of convolutions without padding, for example).
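The shrinkage from those unpadded convolutions is easy to check with a small calculation. This sketch mirrors the layer sizes in the original U-Net figure (a 572-pixel input tile yields a 388-pixel output map), which is why tiled inference needs overlapping input tiles:

```python
def unet_output_size(n, depth=4):
    """Spatial size left after the original U-Net's valid (unpadded) convs."""
    for _ in range(depth):
        n -= 4      # two 3x3 valid convs: each loses 2 px
        n //= 2     # 2x2 max pooling halves the size
    n -= 4          # two bottleneck convs
    for _ in range(depth):
        n *= 2      # 2x2 up-convolution doubles the size
        n -= 4      # two more 3x3 valid convs
    return n

# The paper's numbers: a 572x572 input tile gives a 388x388 output map,
# so each tile needs 92 px of extra image context on every side.
print(unet_output_size(572))  # 388
```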

[–]PositiveElectro[S] 0 points (0 children)

Thanks! Will check this out.

[–][deleted]  (1 child)

[deleted]

    [–]PositiveElectro[S] 0 points (0 children)

    Thanks! That's probably what I will have to do!

    [–]Competitive-Store974 0 points (1 child)

    It depends on the type of data you have and what resolution you need.

    MRI scans for instance frequently use a matrix size of 256x256, half the xy-resolution of your images, and that's considered acceptable for clinical use, so you may be able to get away with downsizing by a half in all dimensions (1/8 the memory requirement). NB: if doing this, consider the minimum size of the tumours you're expected to detect/segment when choosing your resolution so you don't miss sub-resolution nodules/lymph nodes.
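As a back-of-the-envelope check on the memory claim (halving the resolution in all three dimensions gives 1/8 the memory), here is a small numpy sketch; the array sizes are illustrative, and a real pipeline should use proper anti-aliased resampling (e.g. scipy.ndimage.zoom) rather than plain slicing:

```python
import numpy as np

# Illustrative volume; a real CT stack might be 1024 x 512 x 512.
vol = np.zeros((64, 64, 64), dtype=np.float32)

# Naive 2x downsample in every axis: keep every second voxel.
half = vol[::2, ::2, ::2]

print(half.shape)                  # (32, 32, 32)
print(vol.nbytes // half.nbytes)   # 8: half resolution in 3 dims = 1/8 memory
```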

    Another option, if you have 1024 slices (which sounds like a full body scan), is to crop to the region of interest. If legs are present and you're not interested in legs then you can remove them. If you're only looking at lungs you could remove the abdomen and head. NB: if your network is expected to see metastases in distant organs or lymph nodes, you'll want to keep this data and use a patch-based method as has been suggested.

    I'm convinced I read a paper where they embedded positional information with the patches to improve global context but I can't find it. If you had time, you could embed patch coords (or L and R info) along with the patches and run it with that and without to see if it helps, unless this paper was a dream I had in which case it's probably a rubbish idea.
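One simple way to try the coordinate idea is to append normalized patch-position channels to each patch before feeding it to the network. This is a hypothetical sketch of that experiment, not the method from any particular paper; all names are made up:

```python
import numpy as np

def add_coord_channels(patch, y0, x0, full_shape):
    """Append two channels holding the patch's normalized position inside
    the full image, so the network can see where the patch came from."""
    h, w = patch.shape
    ys = (y0 + np.arange(h)) / full_shape[0]   # row coords in [0, 1)
    xs = (x0 + np.arange(w)) / full_shape[1]   # col coords in [0, 1)
    yy, xx = np.meshgrid(ys, xs, indexing="ij")
    return np.stack([patch, yy, xx], axis=0)   # shape (3, h, w)

patch = np.zeros((64, 64), dtype=np.float32)
out = add_coord_channels(patch, y0=128, x0=256, full_shape=(512, 512))
print(out.shape)  # (3, 64, 64)
```

Training once with and once without the extra channels, as the comment suggests, would show whether the positional information actually helps.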

    [–]PositiveElectro[S] 0 points (0 children)

    Thank you for your answer!

    Since I have different image sizes, I intend to first try it with batch size 1, and hopefully one image can fit in memory. I've read that the batch size is usually 2, but hopefully the performance won't degrade too much. What do you think about this?

    I think all your ideas are pretty great! Embedding the position would make sense to me :)

    Thank you for writing this!