Page - 117 - in Proceedings of the OAGM&ARW Joint Workshop - Vision, Automation and Robotics
Image of the Page - 117 -
Text of the Page - 117 -
Eight models are assigned to each parameter-typeα∈S.
A model is reconstructed with the full range of parameter
values, which are 8 values for ColourCorrection, PoseEs-
timation, MaxIterations, and 5 values for SmoothnessTerm.
The parameter space is discrete and the values form the
individual parameter value setsPα.
In a questionnaire 32 people (16 male, 16 female) between
19 and 55 years old, were asked to choose the most aesthetic
reconstruction for each model. Since every reconstruction
is mapped to a parameter value, they implicitly chose the
parameter value which led to the best reconstruction quality.
The best parameter choices according to the human votes
havebeencounter-checked toproduce reasonable reconstruc-
tions. During this result preparation, one model which was
assigned to SmoothnessTerm, had to be omitted because of
inconsistencies in the data. In detail the parameter value
with the most human votes for this model leads to a failed
reconstruction similar to the bottom right picture of Figure 4.
Therefore the final dataset consists of 31 human judged
model reconstructions.1
B. Evaluation Criteria
Figure 2(a) and Figure 2(b) show example distributions
of human decisions for specific models. One can see the
variances in the votes. To cover these variances we define
the following correctness criterion:
Definition 1. Aparameter value estimation is correct if it is
inside µ±σ of the human decisions.
To test this criterion, human judgements have been simu-
lated with random values. In detail for each decision distri-
bution (e.g. Subfigure 2(b)) a uniformly distributed random
value in the same discrete parameter range was generated.
If the random value fulfilled the correctness criterion for the
decision distribution, it was counted as correct, otherwise as
incorrect. With 1000 trials this lead to a mean accuracy of
0.5095 and σ=0.0841 which can be seen as baseline for
the following tests.
C. Results
In an evaluation which is run on each decision distribution
in the dataset, the best parameter value for the reconstruction
of a model is estimated using Equation 5 with PSNR as
similarity measure. After that the parameter value is checked
against the decision distribution with Definition 1. Therefore
if theparametervalue is insideµ±σ of thehumandecisions,
the parameter value estimation is counted as correct and false
otherwise. This procedure lead to an estimation accuracy of
1 on the dataset of 31 judged reconstructions.
The evaluation has also been run with two other (dis-)
similarity measures: Root-Mean-Squared Error (RMSE) and
Structural Similarity index (SSIM), see Table I.
The first is a standard measure for deviations. Applying
it the accuracy drops to 0.9032. This is further interesting
since the RMSE is also the denominator in Equation 2. One
1The full dataset can be requested by emailing the main author. 0 1 2 3 4 5 6 7
ColorCorrection Parameter Value
0
5
10
15
20
(a) ParametertypeColourCorrection on Model 0001.
0 1 2 3 4
SmoothnessTerm Parameter Value
0
5
10
15
20
(b) Parametertype SmoothnessTerm on Model 0093.
Fig. 2. Examples of human decision distributions for parameter-types
ColourCorrection on Model 0001 in Subfigure (a) and SmoothnessTerm on
Model 0093 in Subfigure (b). One can see the variance in the data.
(Dis-)similarity Accuracy
PSNR 1
RMSE 0.9032
SSIM 0.9032
TABLE I
ACCURACY ON THE DATASET EVALUATED WITH DIFFERENT IMAGE
(DIS-)SIMILARITIES FOR FORMULA 4. EVALUATED MEASURES ARE
PEAK SIGNAL TO NOISE RATIO (PSNR), STRUCTURAL SIMILARITY
INDEX (SSIM) AND ROOT-MEAN-SQUARED ERROR (RMSE). PSNR
PERFORMS BEST.
can see that the logarithm in the equation is important in this
context.
When applying SSIM, which should reflect human per-
ception, the accuracy drops to 0.9032. A detailed look on
the results reveals that RMSE as well as SSIM fail on the
models assigned toColourCorrection.
A similar comparison has been performed with different
colour spaces, see Table II. Beside the HSV colour space
Red, Green, Blue (RGB), YCbCr, Grayscale and CIE-Lab
colour spaces have been evaluated. RGB is a standard in
image representation. When using it the accuracy drops
to 0.7742. Recent publications indicate that YCbCr colour
space shows advantages in skin detection, see Shaik et
117
Proceedings of the OAGM&ARW Joint Workshop
Vision, Automation and Robotics
- Title
- Proceedings of the OAGM&ARW Joint Workshop
- Subtitle
- Vision, Automation and Robotics
- Authors
- Peter M. Roth
- Markus Vincze
- Wilfried Kubinger
- Andreas Müller
- Bernhard Blaschitz
- Svorad Stolc
- Publisher
- Verlag der Technischen Universität Graz
- Location
- Wien
- Date
- 2017
- Language
- English
- License
- CC BY 4.0
- ISBN
- 978-3-85125-524-9
- Size
- 21.0 x 29.7 cm
- Pages
- 188
- Keywords
- Tagungsband
- Categories
- International
- Tagungsbände