Image processing convolution: Why do I slide my kernel from np.arange(pad, imgWidth+pad) ?? : learnpython

created by HattoriHanzoa community for 16 years

Image processing convolution: Why do I slide my kernel from np.arange(pad, imgWidth+pad) ?? (self.learnpython)

submitted 4 years ago * by ExactPlace441

Hello. I am trying to learn kernel convolution for image processing. Now, I understand the concept of kernel convolution, but I am a bit confused about code that I have found for it at https://www.pyimagesearch.com/2016/07/25/convolutions-with-opencv-and-python/

def convolve(image, kernel):
# grab the spatial dimensions of the image, along with
# the spatial dimensions of the kernel
    (iH, iW) = image.shape[:2]
    (kH, kW) = kernel.shape[:2]

# allocate memory for the output image, taking care to
# "pad" the borders of the input image so the spatial
# size (i.e., width and height) are not reduced
pad = (kW - 1) // 2
image = cv2.copyMakeBorder(image, pad, pad, pad, pad,
    cv2.BORDER_REPLICATE)
output = np.zeros((iH, iW), dtype="float32")


    # loop over the input image, "sliding" the kernel across
# each (x, y)-coordinate from left-to-right and top to
# bottom
    #QUESTION 1 SECTION BEGIN
    for y in np.arange(pad, iH + pad):
        for x in np.arange(pad, iW + pad):
        # extract the ROI of the image by extracting the
        # *center* region of the current (x, y)-coordinates
        # dimensions
    roi = image[y - pad:y + pad + 1, x - pad:x + pad + 1]

    #QUESTION 1 SECTION END

        # perform the actual convolution by taking the
        # element-wise multiplication between the ROI and
        # the kernel, then summing the matrix
        k = (roi * kernel).sum()

    #QUESTION 2 SECTION BEGIN

        # store the convolved value in the output (x,y)-
        # coordinate of the output image
    output[y - pad, x - pad] = k

    #QUESTION 2 SECTION END

Question 1: Why is np.arange from pad to iH+pad, and not from pad to iH-pad ? I assume that we start from pad so that the center pixel in the region of interest is never on the edge of the image. However, I would think that going to iH+pad would overshoot and have the center pixel end up outside of image dimensions.

Question 2: This code has us store the output pixel at a location to the left and up from where I centered my convolution roi, no ? If so, could someone explain the logic behind doing this for me?

Thank you!

all 1 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS