Seite - 138 - in Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics

Bild der Seite - 138 -

Text der Seite - 138 -

error terms for the equations above, Sg(v) := n−1 ∑ i=0 m−1 ∑ j=0 ( gi,j−αβv2i,2j−αβ¯v2i,2j+1 −α¯βv2i+1,2j−α¯β¯v2i+1,2j+1 )2 , (14) Su(v) := n−1 ∑ i=0 m−1 ∑ j=0 ( ui,j−α¯β¯v2i+1,2j+1−α¯βv2i+1,2j+2 −αβ¯v2i+2,2j+1−αβv2i+2,2j+2 )2 . (15) Up to constant factors, Sg and Su are just the MSE(g,vg) and MSE(u,vu) from the alignment-MSE definition. Let us therefore now discuss possible constraints for this minimisation problem. These constraints will constitute the class X of images to minimise over that appeared in the definition of the alignment-MSE. Note first that in the equations (11), (12) for subsequent indices i or j the two input images u and g alternate. This suggests that for images u and g that do not perfectly match, solutions of (11), (12) are likely to develop oscillating patterns like stripes of alternating intensity or checkerboard structures, so the discrepancy between u and g can be translated to the image boundary where the first and last row and column of v are linked only to one of the input images and therefore provide degrees of freedom that can absorb the discrepancy. In extreme, this could mean that even for completely mismatching u and g highly oscillatory images v might exist that fulfil (11), (12) without any error. Such solutions should be rejected by a suitable class X. In order to prevent v from developing strong high- frequency structures, a natural requirement could be that v should be essentially interpolating; thus each pixel intensity vi,j should be in the interval bounded by the intensities gbi/2c,bj/2c, ub(i−1)/2c,b(j−1)/2c of the two input pixels it is linked to by (11), (12). Whilst conceptually elegant and free of additional parameters, this constraint turns the minimi- sation of (13) into a quadratic minimisation problem on a highly nonconvex domain. We aim therefore at relaxing this constraint to a convex regularisation that warrants a unique solution as well as a practical minimisation procedure. We extend therefore the energy function (13) to E(v)=Sg(v)+Su(v)+γT(∇v) (16) where T is a regulariser that depends on the derivatives∇v= (vx,vy) of v approximated by finite differences, and γ>0 is a regularisation weight. With regard to the quadratic nature of the mean square error to be measured, a Whittaker-Tikhonov regularisation T(∇v) :=∑ i,j |∇v|2 (17) lends itself as a natural candidate, which yields a convex quadratic minimisation problem, also removing completely the non-uniqueness of the original equations. Minimisers can efficiently be computed using standard iterative solution methods for the linear system of minimality conditions. a b Fig. 3. (a) Superresolution image created in aligning the images from Fig. 2(a) and (d) with Whittaker-Tikhonov regularisation, γ = 0.003. Alignment-MSE measurement with this superresolution image yields a PSNR of 46.05dB. – (b) Same with TV regularisation, γ=0.03, yielding a PSNR of 29.92dB. A further candidate is total variation T(∇v) :=∑ i,j |∇v| . (18) To find minimisers with this regularisation, one can use, e.g., a gradient descent approach where the regularisation is realised via a locally analytic scheme related to single- scale Haar wavelet shrinkage; we use here a variant of the scheme from [19] adapted to the unequal pixel sizes of v. As a general rule, in order to just remove the underdeter- minedness of the equation system (11), (12), it is desirable to keep the regularisation weight γ rather small. V. EXPERIMENTS Weevaluate the regularisedsuperresolutionalignmentpro- cedure from the preceding two sections by the test case from Section II. Starting with Whittaker-Tikhonov regularisation, we observe that for large regularisation weight such as γ=0.3 fairly precise estimates for the displacement can be obtained. However, the superresolution images in this case are severely blurred, leading to overestimation of alignment- MSE and low PSNR. For example, the resulting PSNR for the images from Fig. 2(a) and (d) is 28.61dB. On the other hand, reducing the regularisation parameter to γ= 0.003 yields extremely high PSNR estimates, e.g. 46.05dB for the same two images. The reason is that the superresolution images are far away from interpolating between u and g, showing unnatural oscillations, see Fig. 3(a). In contrast, TV regularisation yields plausible results over a wide range of regularisation parameters, see the exemplary superresolution image in Fig. 3(b). For a more detailed evaluation we focus therefore on TV regularisation. We measure first reconstruction errors for the known exact displacements, see column (a) of Table III. Next we estimate the displacements using the TV-regularised error measure itself, see column (b). Once more the minimisation is done by a grid search with x and y displacements varying from−1 to+1 in 0.01 steps. The TV regularisation weight γ is set to 0.03. As the application of the superresolution alignment in this brute-force minimisation is computationally expensive, we add a third scenario, column (c), in which a faster 138

zurück zum Buch Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics"

Proceedings of the OAGM&ARW Joint Workshop Vision, Automation and Robotics

Titel: Proceedings of the OAGM&ARW Joint Workshop
Untertitel: Vision, Automation and Robotics
Autoren: Peter M. Roth; Markus Vincze; Wilfried Kubinger; Andreas Müller; Bernhard Blaschitz; Svorad Stolc
Verlag: Verlag der Technischen Universität Graz
Ort: Wien
Datum: 2017
Sprache: englisch
Lizenz: CC BY 4.0
ISBN: 978-3-85125-524-9
Abmessungen: 21.0 x 29.7 cm
Seiten: 188
Schlagwörter: Tagungsband
Kategorien: International; Tagungsbände

Seite - 138 - in Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics

Bild der Seite - 138 -

Text der Seite - 138 -

Inhaltsverzeichnis