Seite - 137 - in Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics

Bild der Seite - 137 -

Text der Seite - 137 -

displacement values, minimisation of the MSE measure is an obvious way to estimate also unknown displacements. For the following, let us consider two images u and g, which are sampled representations of continuous-scale images. To specify the sampling process more precise, we assume that each pixel of g is the integral of the underlying continuous-scale image G over a rectangled region such that all pixels together tesselate (a rectangle of) the image plane: gi,j= ∫ i+1 i ∫ j+1 j G(x,y)dydx , (6) and similarly for u whose grid is of equal resolution but shifted by d=(α,β)∈R2, ui,j= ∫ i+1+α i+α ∫ j+1+β j+β U(x,y)dydx . (7) Without loss of generality, we assume 0≤α,β<1. Whereas in the special case of band-limited images sam- pled with at least their double limiting frequency, Shannon’s sampling theorem guarantees that u and g contain full informationon their continuouscounterparts, this canusually not be expected to hold true for natural images; thus the continuous images U and G are in fact unknown. A good measure for the discrepancy between u and g should essentially measure the discrepancy between their continuous versions U and G. In other words, we do not want to punish reconstructions for badly aligned grids, and formulate therefore an “innocence assumption” (in dubio pro reo – in case of doubt for the defendant): Whatever discrepancy between two images can plausibly be attributed to different sampling, shall not enter the discrepancy mea- sure. In particular, if a sufficiently plausible continuous- scale image V≡U≡G exists from which both u and g can be obtained by sampling, their discrepancy should be measured as zero. Notice that the exact meaning of the word “plausible” remains to be specified later. Our considerations can be boiled down to a discrete image v of size (2n+1)×(2m+1) whose pixels are the intersections of pixels of u and g: vi,j= ∫ ξi+1 ξi ∫ ηj+1 ηj V(x,y)dydx , (8) i=0,...,2n, j=0,...,2m, where ξi = i/2 for even i and ξi= i/2+α for odd i,ηj= j/2 for even j andηj= j/2+β for odd j. Note that pixel (i, j) of g covers the four pixels (2i,2j), (2i,2j+1), (2i+1,2j) and (2i+1,2j+1) of v whereaspixel (i, j)ofucovers the fourpixels (2i+1,2j+1), (2i+1,2j+2), (2i+2,2j+1) and (2i+2,2j+2) of v. The image v is therefore a superresolution image [15] of g and u, albeit with pixels of different sizes. In x direction grid cells of size α alternate with such of size 1−α, whereas in y direction the same is true with β and 1−β. In the general situation when U and G cannot be chosen as equal, we want to retain this idea and construct a super- resolution image v that tries to reconciliate the information of u and g as good as possible. The discrepancy of u and g will then be measured by combining discrepancies between u and v, and between v and g. In the perfectly aligned case,α=β=0, the MSE (2) of images g and u can be combined from the MSEs between each of g and u and their average v := 12(g+u) via MSE(u,g)=2(MSE(u,v)+MSE(v,g)) . (9) Moreover, using the parallelogram identity (or by an easy combination of Cauchy-Schwarz’ inequality with the arithmetic-geometric mean inequality) we see that for any other image v the right-hand side of (9) will be greater than MSE(u,g). This motivates the following definition. Definition. Let images u and g of size n×m sampled as in (6), (7) be given. Let a class X of (2n+1)×(2m+1)- images v sampled as in (8) be given. For each image v∈X, define vu, vg as the downsamplings of v to the grids of u and g, respectively. The alignment-MSE MSEX between u and g with respect to X is defined as MSEX(u,g)=min v∈X 2(MSE(u,vu)+MSE(vg,g)) . (10) Application of this definition requires, first, the specifi- cation of the class X for given images u, g. The class X essentiallydefineswhat areplausible superresolution images. Second, a minimisation method will be needed to find the minimiser. We will turn to these issues in the next section. IV. SPECIFYING CONSTRAINTS Given u and g, a superresolution image v as specified in the previous section must satisfy the equations αβv2i,2j+αβ¯v2i,2j+1 +α¯βv2i+1,2j+α¯β¯v2i+1,2j+1=gi,j , (11) α¯β¯v2i+1,2j+1+α¯βv2i+1,2j+2 +αβ¯v2i+2,2j+1+αβv2i+2,2j+2=ui,j (12) for i = 1,...,n, j = 1,...,m, where we have used the abbreviations α¯ :=1−α, β¯ :=1−β. On one hand, these are just 2nm equations for 4nm+ 2n+2m+1 pixels of v (from which two corner pixels could be eliminated as they are neither covered by g nor by u); additional conditions will therefore be necessary to remove this underdetermination. On the other hand, for images u and g that do not match perfectly, we expect that the equations should be satisfied only approximately. which favours smoothness. Thus, we are led to reformulate our equation system into the minimisation of an energy function E(v)=Sg(v)+Su(v) (13) under suitable constraints, where Sg and Su are quadratic 137

zurück zum Buch Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics"

Proceedings of the OAGM&ARW Joint Workshop Vision, Automation and Robotics

Titel: Proceedings of the OAGM&ARW Joint Workshop
Untertitel: Vision, Automation and Robotics
Autoren: Peter M. Roth; Markus Vincze; Wilfried Kubinger; Andreas Müller; Bernhard Blaschitz; Svorad Stolc
Verlag: Verlag der Technischen Universität Graz
Ort: Wien
Datum: 2017
Sprache: englisch
Lizenz: CC BY 4.0
ISBN: 978-3-85125-524-9
Abmessungen: 21.0 x 29.7 cm
Seiten: 188
Schlagwörter: Tagungsband
Kategorien: International; Tagungsbände

Seite - 137 - in Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics

Bild der Seite - 137 -

Text der Seite - 137 -

Inhaltsverzeichnis