Seite - 372 - in Intelligent Environments 2019 - Workshop Proceedings of the 15th International Conference on Intelligent Environments
Bild der Seite - 372 -
Text der Seite - 372 -
Wehave also developed a driver to import the datasets acquired bymeans of the
UniMiBAAL suite [13]. UniMiBAAL includes twoAndorid apps that ease both the
acquisition of signals from sensors in a controlled environment and the labeling tasks
requiredwhenbuildingadataset.
All thedrivershavebeenimplementedinthePythonlanguage,whilethewebservice
has been developed relying on Laravel2 and exposing the services through RESTFul
APIs.
4. DataManagement
Once the signals have been standardized in terms of structure, they can be included in
theUniïŹedDataset tobedistributed.For this reason, theresponsibilityof theDataMan-
agement component is twofold: it integrates the new labelled signals into theUniïŹed
Dataset (Data Storage) andmakes available sets of labelled signals to thosewho need
them(DataAccess).
Before being inserted into theUniïŹedDataset, signals require another elaboration
tomake themhomogeneousboth in termsof representationand label.
Sensors record data at a given sampling frequency,with a given range of intensity
values, and soon.Eachmanufacturer design their own sensorwith operating speciïŹca-
tions that may be different from the typical ones. For instance, wemay deal with ac-
celerometer sensors thatworkat verydifferent sampling frequency ranging fromfew to
hundredsofHertz.Machine learningmethods require inputdata in agiven format (e.g.,
number of samples per second and intensity range) that is consistent over time [4]. For
this reason, raw data acquired by sensors need to be pre-processed before being pro-
cessedbymachine learningmethods. Inevitably thisproduces anoverheadofdata tobe
handled and stored. Storage spacemanagement is carried out using cloud storage tech-
niques that reducespaceconsumptionbyusingcapacityoptimization,datadeduplication
anddatacompression tools [14].
Signals fromdifferent datasetsmay have assigned different (but semantical equal)
labels for the sameADL(e.g., âwalkâ vs âwalkingâ), same label for differentADL(e.g.,
âsittingâmayrefer to thestateofbeingseated inachairor the transitionfromstanding to
sitting), anddifferent labels for the sameactivity (e.g., ârunningâvs âjoggingâ).
Thus, the aimof theDataManagement component is to harmonize signals and to
make themavailable for exploitation.Theorganizationof the component is sketched in
Figure3.
The Data Aligner module is in charge of pre-processing the data from the Data
Collection component in order tomake themusable by anymachine learningmethod.
For example, an activity that is in charge of theDataAlignermodule is the conversion
toa samemeasurementunit.
TheLabelConsolidatormodule is in chargeofuniforming the labels of thedataset
to include to a commonuniïŹed set. For example, if a dataset uses the label âsittingâ to
label signals related to the transition (fromstanding to sitting down) and in theUniïŹed
Dataset isusedâsitdownâtolabelsignalsrelatedtothetransition(fromstandingtositting
down), then the labelwill bechanged tobeconsistentwith theUniïŹedDataset. Inview
2https://laravel.com/ A.Ferrari etal. /AFramework
forLong-TermDataCollection372
Intelligent Environments 2019
Workshop Proceedings of the 15th International Conference on Intelligent Environments
- Titel
- Intelligent Environments 2019
- Untertitel
- Workshop Proceedings of the 15th International Conference on Intelligent Environments
- Autoren
- Andrés Muñoz
- Sofia Ouhbi
- Wolfgang Minker
- Loubna Echabbi
- Miguel Navarro-CĂa
- Verlag
- IOS Press BV
- Datum
- 2019
- Sprache
- deutsch
- Lizenz
- CC BY-NC 4.0
- ISBN
- 978-1-61499-983-6
- Abmessungen
- 16.0 x 24.0 cm
- Seiten
- 416
- Kategorie
- TagungsbÀnde