1. CAMPsyN, SUPAA, Hôpital de Cery, Route de Cery, Lausanne University Hospital, CH-1008 Prilly, Lausanne, Switzerland
2. SUPAA, Hôpital de Cery, Route de Cery, Lausanne University Hospital, CH-1008 Prilly, Lausanne, Switzerland
petr.kloucek@chuv.ch
armin-von-gunten@chuv.ch
Show less
History+
Received
Accepted
Published
2016-02-16
2016-07-13
2016-12-01
Issue Date
Revised Date
2016-10-19
PDF
(997KB)
Abstract
Background: Identification of human subjects using a geometric approach to complexity analysis of behavioural data is designed to provide a basis for a more precise diagnosis leading towards personalised medicine.
Methods: The approach is based on capturing behavioural time-series that can be characterized by a fractional dimension using non-invasive longer-time acquisitions of heart rate, perfusion, blood oxygenation, skin temperature, relative movement and steps frequency. The geometry based approach consists in the analysis of the area and centroid of convex hulls encapsulating the behavioural data represented in Euclidian index spaces based on the scaling properties of the self-similar normally distributed behavioural time-series of the above mentioned quantities.
Results: An example demonstrating the presented approach of behavioural fingerprinting is provided using sensory data of eight healthy human subjects based on approximately fifteen hours of data acquisition. Our results show that healthy subjects can be factorized to different similarity groups based on a particular choice of a convex hull in the corresponding Euclidian space. One of the results indicates that healthy subjects share only a small part of the convex hull pertaining to a highly trained individual from the geometric comparison point of view. Similarly, the presented pair-wise individual geometric similarity measure indicates large differences among the subjects suggesting the possibility of neuro-fingerprinting.
Conclusions: Recently introduced multi-channel body-attached sensors provide a possibility to acquire behavioural time-series that can be mathematically analysed to obtain various objective measures of behavioural patterns yielding behavioural diagnoses favouring personalised treatments of, e.g., neuropathologies or aging.
Petr Kloucek, Armin von Gunten.
On the possibility of identifying human subjects using behavioural complexity analyses.
Quant. Biol., 2016, 4(4): 261-269 DOI:10.1007/s40484-016-0088-8
We address the possibility to use complexity indices of behavioural variables to identify human subjects. The identification is made by projection of self-similar and normally distributed behavioural data onto Euclidian product spaces based on non-disruptive sensing. This results in a platform that combines data acquisitions provided by body attached sensing devices, mathematical analysis of complexity of the acquired signals, and subsequent projections onto Euclidian spaces representing behavioural states.
Externally applied wearable sensors allow for collecting self-similar time-series of behavioural or physiological parameters such as heart rate, blood oxygenation, skin temperature, steps frequency, and others. The measurements of the behavioural parameters represent surrogate data that characterize patterns through segmentation and a subsequent complexity analysis using data pertaining to expended time spans.
Behavioural surrogate data are segmented and analysed to extract meaningful patterns hidden in the measured time series of behavioural data. The word segmentation is used to indicate a connection between behavioural temporal segmentation and image spatial segmentation [ 1, 2]. This approach is similar to the concepts of the Gestalt school [ 1] and psycho-physiologists analysing, e.g., perceptions of images in the retina in the first instants of their arrival [ 3, 4]. Segmentation of captured images is aimed at and allows for recognition of well-defined objects from discrete patterns. Similarly, we strive to use complexity indexing to recognize well-defined behaviours that can differentiate various subjects using time-discrete behavioural time-series.
We build geometrical images of behavioural states evolving over time that can be measured and analysed using mathematical tools based on complexity indexing, i.e., scaling properties of self-similar sensory data expressed through the Hurst exponent [ 5– 9].
Our approach shares some fundamental underpinnings with image segmentations, in particular the use of a measure generating dimension [ 10] and a principle similar to Mumford-Shah functional to verify self-similarity. The Mumford-Shah functional is routinely used in image segmentation as it simultaneously accounts for both high and low dimensions of structural measurements [ 2].
We first address behavioural indexing, the notion of behavioural indexing and behavioural similarity measures (cf. Section of the Methods).We present application of the proposed approach pertaining to identification of human subjects based on surrogate data using complexity data segmentation including the use of centroids and convex hulls (cf. Section of the Results).
RESULTS
Human behavioural convex hulls
The proposed approach to “neuro-fingerprinting” is illustrated using behavioural data of eight healthy subjects selected of a larger pool of healthy humans during their routine daily activities covering roughly up to fifteen hours of sensory data. This time span is divided into roughly two-hours time spans from which the complexity indices are computed and projected onto the behavioural two dimensional product spaces generated by the quantities that are recorded using the VSM sensor. Different colours shown Figure 1 through Figure 6 represent each of the individual’s convex hulls encapsulating the surrogate behavioural data. The following numbers are assigned to different colours, i.e., subjects, to make the referencing easier: red/1, green/2, blue/3, violet/4, black/5, grey/6, azure/7, yellow/8.
The results shown at the following visualizations indicate that there is different clustering of subjects with respect to different behavioural product spaces.
The presented results show that each of the subjects posses a different convex hull and centroid as well as different volatility characteristics.
Centroids and volatility indexing
The four tables included in this section present compounded analysis of human subjects’ behavioural similarities or dissimilarities in terms of centroids and volatility of the data to refine the analysis shown at Section of Coarse-grained behavioural patterns in Appendix and to provide a compounded algebraic overview of the subjects’ behavioural complexity. The centroid rows correspond to the complexity indices stipulated in the Tables 1–4 heads.
Consider subjects 4, 5 and 3 to highlight a possible use of the geometric behavioural characterization using the skin temperature/movement data in Table 2. The first subject is a highly trained, the second is a healthy subject and the third is a trained subject. The data for the first subject include running in addition to a daily routine. The data pertaining to the second subject are acquired exclusively during sleep. The data representing the third subject correspond to a non-interrupted high physical activity.
The interpretation of the first subject’s data is that it has almost constant complexity of the skin temperature with negative correlation and unbiased complexity of movement. The large volatility (radius) implicates disparity among office work and running behavioural complexity. These numbers may be typical for trained athletes. The second subject’s data indicate behavioural conformity with small radius, typical for a rest. The third subject’s data show virtually no complexity of the skin temperature and high complexity of movements with almost no volatility. This observation would indicate non-interrupted higher mono-physical activity during a longer period of time. Similar conclusions can be made using the Tables 1-4.
Higher dimensional indexing
We present an example of a three-dimensional representation of human surrogate data indexing using heart rate; blood oxygenation and steps index space. Figure 7 visualizes an example of such indexing of different human subjects.
Technically, the representation of the surrogate data in higher dimensional spaces is identical to the presentation of the data shown at Section of Coarse-grained behavioural patterns in Appendix.
There is an additional interpretation of the analysis beyond “behavioural fingerprinting”. The additional description is associated with the position and volume represented by the convex hulls encapsulating the surrogate data.
The Brownian point gives the centrality in the three-dimensional phase space. Any indices below are pronunciations of negative correlation as mentioned above. This is supposed to indicate more agile responses to stimuli. The indices above the Brownian threshold indicate opposite behavioural patterns. Consequently, we may conclude that it is favourable to observe symmetric and rather low volume convex hulls as a characterization of “healthy” human states.
The example shown in Figure 7 seems to support this conclusion. The green convex hull is that of a highly trained individual while the blue and red ones represent healthy, but less trained, subjects. The subjects were observed during fifteen hours period during regular working day and sleep.
CONCLUSIONS
The presented results indicate that some of the indices of heart, steps frequency, blood oxygenation, and/or skin perfusion, seem to be unique for each individual. This means that a collection of complexity indices can provide aggregated characterization of factorized groups of patients. This leads to “behavioural fingerprinting”. Consequently, the correlation between factorized groups and indexing of their responses to treatment can provide a basis for personalized medicine.
METHODS
Subjects
The proposed approach is illustrated using behavioural data collected from eight subjects selected from a larger pool of healthy humans. The data were collected at a hospital ward from volunteering medical practitioners.
Behavioural complexity indices
Behavioural data are obtained using external sensors attached to a subject’s body. We use two connected technologies, i.e., (i) micro-electronics based wearable sensors and (ii) mathematics based data processing providing data complexity characterizations. The first item refers to the wearable non-disruptive Vital Signs Monitor (VSM) capable of motion-tolerant monitoring of human behavioural signs without interfering with a person’s activities. The second item refers to detection of self-similarity and the underlying scaling properties of some behavioural data using our Cassiopee computational environment.
The index spaces can be viewed as spaces endowed with metric of an ambient Euclidian space. The introduction of the index spaces helps to illustrate the possible deviation of a complexity index of behavioural quantity points from “normal” to a changed functionality of a particular subsystem [ 11]. Hence, the Euclidian distance from whatever we considered “normal” may indicate presence, severity, or progression of behavioural deviation resulting in behavioural diagnostics of symptoms and signs both in the context of normality and disease.
Self-similar normally distributed surrogate data are analysed by constructing a map from the space of behaviours onto the space of complexity indices that correspond to an approximate non-integer Hausdorff-Besicovitch dimensions of finite time series of a subject’s behavioural patterns.
The mathematical foundations of the proposed indexing framework rely on two fundamental assumptions.
The first assumption is that stochastic normally distributed self-similar processes characterise at least some behavioural patterns. Thus, each time-series must be analysed to verify its self-similarity and normal distribution. Consequently, if these properties are confirmed we apply various ways to find the non-integer Hausdorff-Besicovitch dimensions of these patterns and the corresponding scaling laws.
From the point of view of clinical implementations, the unavoidable limitation of the finiteness of the sensory readings must be taken into account.
The second assumption is that of ergodicity. It is required to compare surrogate data obtained from many different individuals both with respect to individual time averaging procedures as well as subjects sampling to obtain the power scaling laws.
Surrogate data segmentation
It is important to identify time and/or space points at which self-similarity is changing, i.e., when it is lost or its scaling properties change. These behavioural discontinuities indicate behavioural pattern changes that may reflect changes of feedback control of the central nervous system in response to internal and/or to external stimuli. The result presented in Figure 8 is based on the heart rate that appears to have required self-similarity and normal distribution.
Behavioural index spaces
An example of an index is temperature. Increasing temperature indicates corrective physiological actions. Increasing value of an index indicates transition from anomalous to super diffusion. Lower values of an index indicate just the opposite that we relate to altered behavioural patterns. This is how we can infer from macroscopic observables the nature of the microscopic structure of stochastic processes associated with certain physiological states.
Consider the following example shown in Figure 9 that is based on sensory data of a healthy subject during typical professional activities during a working day. We partition the heart rate indices/blood oxygenation indexes space that is contained in into four quadrants.
The quadrant partitioning is cantered at what we call the Brownian point. The arguments behind this approach are that the Brownian point corresponds to standard Brownian functions given by Wiener processes. The lower left quadrant corresponds to on both axes that represent negatively correlated surrogate data, i.e., data with lots of irregularity. We interpret this quadrant clinically as anxiety/high activity region. The upper right quadrant represents Hurst indices yielding positive correlations, i.e., more data regularity. We interpret this quadrant as the relaxed region. The remaining two quadrants contain indices representing mixture of positively and negatively correlated processes. Since the central nervous system can balance such inputs we denote these regions as neutral regions.
The indexing shown in Figure 9 holds the key to the presented concept. We construct convex hull of the computed complexity indices of the time-segmented data including its centroid and radius to obtain dissimilarity measures providing behavioural identification of human subjects.
Behavioural identification
We construct behavioural complexity convex hulls as the main tool to identify human subjects using behavioural time series. The behavioural complexity convex hulls combined with dissimilarity indicatrices seem to be strong enough to distinguish among human subjects.
The application of complexity measures to indexing behavioural patterns on which the presented approach to neural-fingerprinting is based follows upon [ 12] and in some of the citations therein.
Consider human subjects. Assume we compute complexity indices either by surrogate data segmentation or by repeated measurements. We denote by a computed set of complexity indices generated by surrogate data corresponding to the j-th subject.
Thus, let
and, let
The convex hull is the smallest convex manifold encapsulating a given set of computed indices.
)
Behavioural similarity indicatrix
Let us consider a pairwise similarity indicatrix given by
It follows from (3) that if then If then Let us consider the opposite case when
Then
The non-trivial intersection, i.e., when
yields
Behavioural geometric similarity
In addition to the similarity indicatrices we also compute the centroid and radius of the super-scribed circles to a given convex hull of behavioural complexity indices.
We assign a centroid to each by computing
Simultaneously, we can determine the radius of the smallest disk covering with the centre at the centroid. While the centroid represents position of the convex hull in a selected Euclidian space representing behavioural indices, the radius represents the volatility of the data.
The centrality and the data spread, i.e., volatility, are meant to provide a more detailed characterization of a nontrivial subsection of convex hulls.
Assume Equation (4) holds. We propose that two subjects can be “likely” behaviourally similar if
and
The argument n corresponds to a number of data points.
The requirement Equation (7) is related to the consistency of the temporal behavioural partial overlap of two subjects. The consistency requires that data contained in a non-trivial intersection given by Equation (4) must be stable in the sense
as , increase with the number of the behavioural data points with respect to either increased time span of measurements of repeated sensing. Condition Equation (8) requires that a certain amount of data be shared among different subjects.
The combination of the values of behavioural indicatrices, centroids, and areas of behavioural convex hulls, i.e., data volatility, yielding similarity predictions is summarised by Table 5.
Koenderink, J. J. (1984) The structure of images. Biol. Cybern., 50, 363–370.
[2]
Morel, J. M. and Solimini, S. (1995) Variational Methods in Image Segmentation. USA: Birkhauser Boston Inc
[3]
Julesz, B. (1980) Spatial nonlinearities in the instantaneous perception of textures with identical power spactra. Philos. Trans. R. Soc. Lond., 290, 83–94.
[4]
Julesz, B. (1981) Textons, the elements of texture perception, and their interactions. Nature, 290, 91–97.
[5]
Peitgen, H.-O., Jrgens, H. and Saupe, D. (1992) Chaos and Fractals. New York: Springer-Verlag.
[6]
Ness, M. V. (1968) Fractional Brownian motions, fractional noise and application. SIAM Rev., 10, 422–437.DOI:10.1137/1010093
[7]
Mandelbrot, B. B. (1997) Fractals, Form, Chance and Dimension. San Francisco: W. H. Freeman and Company
[8]
Bassingthwaighte, J. B., Liebovitch, L. S. and West, B. J. (1994) Fractal Physiology. New York: Oxford University Press
[9]
West, B. J. (2010) Fractal physiology and the fractional calculus: a perspective. Front. Physiol., 1, 12
[10]
Preiss, D. (1987) Geometry of measures in Rn: distribution, rectifiliability and densities. Ann. Math., 125, 537–643.
[11]
Hassabis, D.and E.A.Maguire, (2009) The construction system of the brain. Phil. Trans. R. Soc. B. Biol. Sci., 364, 1263–1271.doi: 10.1098/rstb.2008.0296
[12]
Kloucek, P., P.Zakharov, and A.von Gunten, , Indexing of Behavioural Complexity Using Self-similar Surrogate Data. Preprint, 2016.
[13]
Morters, P. and Peres, Y. (2010) Brownian motion. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge: Cambridge University Press
RIGHTS & PERMISSIONS
Higher Education Press and Springer-Verlag Berlin Heidelberg
AI Summary 中Eng×
Note: Please be aware that the following content is generated by artificial intelligence. This website is not responsible for any consequences arising from the use of this content.