Robot visual guide with Fourier-Mellin based visual tracking

Chao PENG; Danhua CAO; Yubin WU; Qun YANG

doi:10.1007/s12200-019-0862-0

PDF(3230 KB)

Front. Optoelectron. ›› 2019, Vol. 12 ›› Issue (4) : 413-421. DOI: 10.1007/s12200-019-0862-0

RESEARCH ARTICLE

Robot visual guide with Fourier-Mellin based visual tracking

Author information +

History +

Abstract

Robot vision guide is an important research area in industrial automation, and image-based target pose estimation is one of the most challenging problems. We focus on target pose estimation and present a solution based on the binocular stereo vision in this paper. To improve the robustness and speed of pose estimation, we propose a novel visual tracking algorithm based on Fourier-Mellin transform to extract the target region. We evaluate the proposed tracking algorithm on online tracking benchmark-50 (OTB-50) and the results show that it outperforms other lightweight trackers, especially when the target is rotated or scaled. The final experiment proves that the improved pose estimation approach can achieve a position accuracy of 1.84 mm and a speed of 7 FPS (frames per second). Besides, this approach is robust to the variances of illumination and can work well in the range of 250-700 lux.

Keywords

robot visual guide / target pose estimation / stereo vision / visual tracking / Fourier-Mellin transform (FMT)

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Chao PENG, Danhua CAO, Yubin WU, Qun YANG. Robot visual guide with Fourier-Mellin based visual tracking. Front. Optoelectron., 2019, 12(4): 413‒421 https://doi.org/10.1007/s12200-019-0862-0

1 1 Introduction

Laser speckle contrast imaging (LSCI) is a powerful and easy-to-use method for blood flow mapping, especially in semi-transparent tissues. For this reason, this technique was first used to visualize retinal blood flow [1]. Although LSCI can be used for tissues possessing high scattering properties such as skin, the most impressive results are achieved with tissues having weak scattering properties. To date, LSCI has been actively used in cerebral blood flow (CBF) mapping [2–4] and intraoperative control in neurosurgery [5]. Laser Doppler flowmetry (LDF) is physically similar to LSCI [6] and essentially used for spatially-integrated blood flow measurements. Although modifications of scanning LDF are known, their temporal and spatial resolutions are limited by the scanning system capability [7]. Compared with LDF, LSCI can provide non-scanning full-field imaging of a relatively wide area of an object with a passable resolution.

Monitoring of skin perfusion is important for diagnosis and therapy of some diseases. For instance, Huang et al. [8] proposed LSCI as a tool to estimate efficiency of laser treatment for nevus flammeus. Moreover, LSCI is also applied to visualize retinal blood flow [1,9]. In these applications, Tamaki et al. [9] developed the algorithm they called “normalized blur”, in this method, ratio of the mean speckle intensity to difference between the mean speckle intensity and the instant speckle intensity is calculated. Another application of LSCI in the retina is for the study of retinal blood flow changes under functional activation using pharmacological agents [10].

LSCI was introduced initially as noninvasive method for CBF imaging; however, this technique requires optical clear [11] of a skin tissue and a part of the skull is removed to attain a direct access to the cortex [2,12]. LSCI is often combined with other methods such as multispectral reflectance imaging [13], fluorescence imaging of nicotinamide adenine dinucleotide in its reduced form (NADH) and flavoproteins [7], or used independently [14].

Another possible implementation of LSCI is to study depolarization wave propagation in the cortex [15], which is the main mechanism of migraine headache. In these studies, LDF was adopted for quantitative measurements of blood flow in a specific location of the cortex and LSCI was used for spatial mapping of the changes of CBF induced by the propagation of depolarization wave [7].

LDF has been the most common tool for CBF measurements with essentially point-wise access. Although magnetic resonance imaging and positron emission tomography can be also used to perform such monitoring, disadvantages of these techniques include low spatial resolution, contrast agents usage, ionizing radiation, and high cost of service [16]. By comparison, LSCI is a low-cost and high-performance imaging technique [17]. However, quantitative measurements using LSCI is very complicated and still need improvements, particularly with regard to multiple scattering [4,18,19], sensitivity to light polarization [7,20], and velocity distribution of the particles [21,22]. To solve these problems partially, calibration on tissue phantom with known parameters [23] or multi-exposure LSCI [24] can be used. In spite of these efforts, LSCI is most widely used for relative measurements [2–4,7], rather than quantitative studies.

The estimated optical resolution of our LSCI system is approximately 120 µm, and typical size of cerebral blood vessels varies in the range from 5 to 500 µm. Obviously, optical resolution of this imaging system is not high enough to distinguish most of the vessels in microcirculatory bed. Although a single capillary, arteriole, or venule cannot be resolved in a speckle image, extracting information on microcirculatory bed is possible through analysis of contrast distribution within the portion of the speckle image that corresponds to unresolved images of these small vessels [4,25].

The common method for retrieving information from a speckle contrast image is based on region of interest (ROI) analysis. A few ROIs are usually selected in a speckle contrast image. Typically, one ROI is set to represent the vein of macrocirculation and another ROI represents the microcirculatory network [4,25]. Further processing is performed usually to calculate the average contrast value over the entire ROI. Much of the information on blood flow within the analyzed region is unused because of such averaging. Instead of adopting the ROI method, we proposed a simple technique to retain this additional information using histogram analysis of speckle contrast distribution. We attempted to enhance the capabilities of LSCI to perform qualitative measurements by describing a simple algorithm based on histogram analysis of an image obtain by LSCI to provide rapid differentiation of macro- and microcirculation.

2 2 Speckle image analysis

Typical speckle contrast image of a rat cortex is shown in Fig. 1. This image is obtained with the following setup: He–Ne laser working at 632.8 nm was used to illuminate the rat cortex; complementary metal-oxide-semiconductor (CMOS)-camera Basler acA2500-14 gm with 2.2 µm × 2.2 µm pixel size was adopted for image acquisition; imaging lens Computar M1614-MP2 is at f/6; speckle/pixel size ratio is approximately 9; spatial speckle contrast distribution was calculated using 7×7 pixels sliding window; 50 consecutive frames of raw speckle image were averaged into the single speckle contrast image; and frame rate of initial raw data is equal to 40 frames per second.

Fig.1 Typical speckle contrast image

Full size|PPT slide

In this work, spatial speckle contrast distribution was calculated using the standard equation:

(1)

K = \frac{σ}{〈 I 〉},

where

〈 I 〉

is the mean value and

σ

is the standard deviation of intensity in the analyzed region of the raw speckle image. To reduce processing time, parallel computing and digital image processing were implemented [26]. Using these approaches, spatial speckle contrast analysis is described. Spatial distribution of the mean intensity

〈 I 〉

is given as convolution of the raw speckle image

I R a w

with the

N \times N

kernel matrix

K

of ones [27].

(2)

〈 I 〉 = \frac{1}{N^{2}} (I R a w * K) .

Array of the standard deviation

σ

can also be expressed as [27]

(3)

σ = \frac{\sqrt{\frac{1}{N^{2} - 1} [I R a w^{2} * K - \frac{{(I R a w * K)}^{2}}{N^{2}}]}}{\frac{I R a w * K}{N^{2}}} .

In that case, the resulting speckle contrast map can be effectively calculated by vectorization of Eq. (1).

Further processing is devoted to differentiation of blood flow between macro- and microlevels. In this work, we employed histogram analysis of the speckle contrast image [26] to achieve rapid integral multiscale analysis of blood flow. We selected three ROIs in the speckle contrast image, as depicted in Fig. 2. Each ROI represents a vascular bed with different scale. The first ROI (ROI1) contains both large and small vessels. As shown in the speckle contrast map, the distributions of blood flow in these both vessels are different. Results from the histogram analysis of laser speckle contrast image of this ROI showed two patterns of blood flow distribution, which were attributed to large (left side mode) and small (right side mode) vessels, respectively (see the purple curve in Fig. 2(b)). To validate this assumption, we constructed two additional small-sized ROIs: the second one (ROI2) corresponds to small optically unresolved vessels and the third ROI (ROI3) was a region with large vessels (as shown in Fig. 2(a)). The respective histograms of ROI2 and ROI3 coincide well with that of the ROI1 (Fig. 2(b)). A combination of histograms of the small-sized ROIs gives an expected bimodal distribution. In addition, the histogram of ROI1 contains some additional side modes corresponding to relatively large resolvable vessels on each side of the main trunk.

Thus, histogram analysis can provide integral rating of the speckle contrast image as a whole. However, information on some vessels can be lost and implicated because of unreasonably large ROIs.

Fig.2 Histogram analysis of speckle image. (a) Multiple ROIs in speckle contrast image; (b) normalized histograms of these ROIs

Full size|PPT slide

During the experiment, changes of these modes with time can be tracked, including their relative and absolute positions, shape, and magnitude. According to the proposed concept, relative changes of these modes provide information on rearrangement of blood flow between macro- and microcirculatory network.

The main steps of the algorithm are as follows:

1) Constructing ROI that covers large vessels and adjacent microvascular bed;

2) Calculating histogram of this ROI;

3) Retrieving the modes of distribution and their parameters.

In the case of time lapse measurements of laser speckle contrast image, some artifacts can be caused by the motion of the sample. As a result of this sample motion, the pre-selected ROI (green block in Fig. 3) is not always in a desirable position. To resolve this issue, we improved the algorithm using correlation-based motion tracker. We selected a ROI to track and calculate cross-correlation of each speckle contrast image in time series with this region, as depicted in Fig. 4. Then, a peak value in each spatial cross-correlation function was found, and the position of this peak defines new coordinates of the ROI according to current image. Finally, a new ROI within each frame of the time series was constructed with respect to their own new coordinates (purple block in Fig. 3). Moreover, the selected region must have a well-recognizable structure (large vessels or network of small vessels) for cross-correlation procedure.

Fig.3 Two consecutive frames of the time lapse measurements. Green block denotes statically overlaid ROI, whereas purple block is defined by correlation-based motion tracker

Full size|PPT slide

Fig.4 Cross-correlations between selection and the next frame in series. Purple block shows new ROI coordinates with respect to maximum of cross-correlation function; green block is a statically overlaid ROI

Full size|PPT slide

Results of time lapse measurements performed with statically overlaid ROI and with correlation-based motion tracker are presented in Fig. 5. The motion tracker can compensate the most part of artificial movements, and by using this tracker, the problem of exiting the vessels out of ROI can be avoided.

Fig.5 Results of time lapse measurements performed with statically overlaid ROI (a) and with correlation based motion tracker (b)

Full size|PPT slide

3 3 Experiment with animals

We mapped changes of CBF in hypertensive rats for differentiation of macro- and microcirculation. Hypertension is a key factor that leads to ischemic stroke development [28,29]. In 1959, Lassen proposed a concept for the retention of CBF even if arterial blood pressure dramatically changes from 60 to 150 mmHg [30]. However, changes in arterial pressure by even 10 mmHg are accompanied by the changes in CBF of 2%–7% [31]. Until now, research on the effects of changes of arterial pressure on CBF is insufficient. Hence, we performed a series of experiments with normotensive rats (n =10) and hypertensive (n =10) rats by phenylephrine. Phenylephrine is known to be unavailable over the blood-brain barrier [32], and thus we can measure the changes in CBF induced by the changes in blood pressure only. Arterial hypertension was provoked using the method described in Ref. [33]. According to this method, rats were kept in isolation or overpopulation for four months. Then, all rats were anesthetized using isoflurane, and craniotomy was performed using dental drill (Mikroton, Aesculap) with constant saline irrigation to prevent tissue overheating. CBF in rat cortex was measured at 30 min after surgery. Blood pressure monitoring was applied using pressure catheters (PE-50 with a PE-10 tip, Scientific Commodities Inc., Lake Havasu City, Arizona, USA) introduced into the femoral artery and the PowerLab data acquisition device.

Phenylephrine was injected at three time points to raise the peripheral blood pressure (PBP). Results of this study showed that phenylephrine injection leads to dose-dependent increase of blood pressure in normotensive rats from (105±5) mmHg to (171±7) mmHg (from (142±9) mmHg to (195±11) mmHg in hypertensive rats). The rate of change of blood pressure in normotensive rats (64%) was greater than that (37%) in hypertensive rats (Fig. 6(a)). This result can be explained by higher basal blood pressure level in hypertensive individuals.

Changes in CBF under rapidly increasing PBP were detected in hypertensive rats only. The typical dynamics of CBF in normotensive and hypertensive rats are shown in Figs. 6(b) and 6(c).

Fig.6 Typical example of changes of mean arterial pressure (a) and CBF after bolus injection of phenylephrine in three doses (0.25, 0.5, and 1 µg/kg) in normotensive (b) and hypertensive (c) rats

Full size|PPT slide

As shown in Fig. 6, an increase in CBF at microcirculatory level in response to sharp rise in peripheral arterial pressure is typical for hypertensive rats, rather than normotensive rats. CBF remains constant in normotensive rats, independent of significant hypertensive response to phenylephrine. In contrast to normotensive rats, in hypertensive individuals, peripheral pressure response to the third phenylephrine injection is accompanied by weak increase in microcirculatory and appreciable decrease in CBF in sagittal sinus. The possible mechanism underlying these processes is the decrease in venous blood flow, i.e., reallocation of CBF to the area where the blood–tissue oxygen exchange occurs, apparently, to prevent hypoxia. The method proposed for differentiation of blood flow at macro- and microlevels allows multi-scaled analysis of CBF at spasmodic changes in arterial pressure on the basis of chronic hypertension. Such kind of analysis is important particularly for further studies on hemorrhagic and ischemic stroke development provoked by hypertension.

4 4 Discussion and conclusion

The practical application of the proposed algorithm to study CBF in living animals shows that histogram analysis of an image obtained by LSCI may potentially improve the qualitative measurements of conventional LSCI. However, the advantages of this method over the other techniques should be verified to improve method efficiency. A possible approach to implement is a detailed analysis of the contrast distribution involving calculations of skewness, kurtosis, and higher moments, which characterize the shape of the contrast distribution. The contrast distribution can be considered as a rough approximation to velocity distribution in the ROI. Such approximation seems to be inappropriate for quantitative estimation, but is still acceptable for comparative qualitative analysis. As such, kurtosis and dispersion properties of the velocity distribution can be used as a diagnostic variable as well as the most probable value used in this study. Asymmetry of the distribution (skewness) requires more complicated analysis. This property can be referred to sensitivity of experimental setup with specified settings (e.g., exposure time, f-number, and uniformity of illumination). In this case, averaging the contrast value over the entire ROI can provide inaccurate result that is biased to the stretched, asymmetric side tail of the contrast distribution. The possible resolution to this problem is analyzing the skewness of the contrast distribution for a sample with known parameters and considering the results of analysis in the subsequent measurements. We believe that the analysis of these properties may provide some additional information on micro- and macrocirculations and their temporal dynamics.

Animal study demonstrated that acute hypertensive responses induced by bolus injection of phenylephrine in three different doses have no effect on CBF in normotensive individuals. In hypertensive rats, the third injection of phenylephrine led to an increase in macrocirculation and a decrease in microcirculation of CBF as response to dramatically increased PBP. Thus, the differentiation between macro- and microcirculations under functional activation allows for physiological analysis of the hemodynamic mechanisms underlying pathological changes in CBF, which in turn can help in developing new methods for pharmacological correction of macro- and microcirculation in the brain under different physiological conditions.

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Agrawal A, Sun Y, Barnwell J, Raskar R. Vision-guided robot system for picking objects by casting shadows. International Journal of Robotics Research, 2010, 29(2-3): 155–173

[2]	Taryudi,Wang M S. 3D object pose estimation using stereo vision for object manipulation system. In: Proceedings of International Conference on Applied System Innovation. Sapporo: IEEE, 2017, 1532–1535

[3]	Dinham M, Fang G, Zou J J. Experiments on automatic seam detection for a MIG welding robot. In: Proceedings of International Conference on Artificial Intelligence and Computational Intelligence. Berlin: Springer, 2011, 390–397

[4]	Ryberg A, Ericsson M, Christiansson A K, Eriksson K, Nilsson J, Larsson M. Stereo vision for path correction in off-line programmed robot welding. In: Proceedings of IEEE International Conference on Industrial Technology. Vina del Mar: IEEE, 2010, 1700–1705

[5]	Dinham M, Fang G. Autonomous weld seam identification and localisation using eye-in-hand stereo vision for robotic arc welding. Robotics and Computer-integrated Manufacturing, 2013, 29(5): 288–301

[6]	Yun W H, Lee J, Lee J H, Kim J. Object recognition and pose estimation for modular manipulation system: overview and initial results. In: Proceedings of International Conference on Ubiquitous Robots and Ambient Intelligence. Jeju: IEEE, 2017, 198–201

[7]	Dong L, Yu X, Li L, Hoe J K E. HOG based multi-stage object detection and pose recognition for service robot. In: Proceedings of 11th International Conference on Control Automation Robotics & Vision, Singapore: IEEE, 2010, 2495–2500

[8]	Nam H, Han B. Learning multi-domain convolutional neural networks for visual tracking. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016, 4293–4302

[9]	Held D, Thrun S, Savarese S. Learning to track at 100 FPS with deep regression networks. In: Proceedings of European Conference on Computer Vision. Berlin: Springer, 2016, 749–765

[10]	Henriques J F, Rui C, Martins P, Batista J. High-speed tracking with kernelized correlation filters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3): 583–596

[11]	Stricker D. Tracking with reference images:a real-time and markerless tracking solution for out-door augmented reality applications. In: Proceedings of Conference on Virtual Reality, Archeology, and Cultural Heritage. Glyfada: DBLP, 2001, 77–82

[12]	Wu Y, Lim J, Yang M H. Object tracking benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1834–1848

[13]	Yang B, Yan J, Lei Z, Li S. Aggregate channel features for multi-view face detection. In: Proceedings of IEEE International Joint Conference on Biometrics. Clearwater: IEEE, 2014, 1–8

[14]	Hirschmuller H. Accurate and efficient stereo processing by semi-global matching and mutual information. In: Proceedings of IEEE Computer Society Conference on Computer Vision & Pattern Recognition. San Diego: IEEE, 2005, 807–814

[15]	Zhang Z. A flexible new technique for camera calibration. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(11): 1330–1334

[16]	Li J, Zhao H, Jiang T, Zhou X. Development of a 3D high-precise positioning system based on a planar target and two CCD cameras. In: Proceedings of International Conference on Intelligent Robotics and Applications. Berlin: Springer, 2008, 475–484

[17]	Magnusson M, Andreasson H, Nüchter A, Lilienthal A J. Appearance-based loop detection from 3D laser data using the normal distributions transform. Journal of Field Robotics, 2010, 26(11–12):892–914

[18]	Li Y, Liu G. Learning a scale-and-rotation correlation filter for robust visual tracking. In: Proceedings of IEEE International Conference on Image Processing. Phoenix: IEEE, 2016, 454–458

[19]	Zhang K, Zhang L, Yang M H. Fast Compressive Tracking. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2014, 36(10):2002-2015.

[20]	Lu H, Jia X, Yang M H. Visual tracking via adaptive structural local sparse appearance model. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Providence: IEEE, 2012, 1822–1829

[21]	Dinh T B, Vo N, Medioni G. Context tracker: exploring supporters and distracters in unconstrained environments. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Colorado Springs: IEEE, 2011, 1177–1184

RIGHTS & PERMISSIONS

2019 Higher Education Press and Springer-Verlag GmbH Germany, part of Springer Nature

AI Summary AI Mindmap

PDF(3230 KB)

Accesses

Citations

Detail

Sections

Recommended

Abstract
Keywords
Cite this article
1 1 Introduction
2 2 Speckle image analysis
Fig.1 Typical speckle contrast image
Fig.2 Histogram analysis of speckle image. (a) Multiple ROIs in speckle contrast image; (b) normalized histograms of these ROIs
Fig.3 Two consecutive frames of the time lapse measurements. Green block denotes statically overlaid ROI, whereas purple block is defined by correlation-based motion tracker
Fig.4 Cross-correlations between selection and the next frame in series. Purple block shows new ROI coordinates with respect to maximum of cross-correlation function; green block is a statically overlaid ROI
Fig.5 Results of time lapse measurements performed with statically overlaid ROI (a) and with correlation based motion tracker (b)
3 3 Experiment with animals
Fig.6 Typical example of changes of mean arterial pressure (a) and CBF after bolus injection of phenylephrine in three doses (0.25, 0.5, and 1 µg/kg) in normotensive (b) and hypertensive (c) rats
4 4 Discussion and conclusion
References
RIGHTS & PERMISSIONS

Received	Accepted	Published
20 Jun 2018	29 Oct 2018	15 Dec 2019
Online First Date	Issue Date
11 Jun 2019	30 Dec 2019

About the journal

Browse

Authors & reviewers

Abstract

Keywords

Cite this article

1 1 Introduction

2 2 Speckle image analysis

Fig.1 Typical speckle contrast image

Fig.2 Histogram analysis of speckle image. (a) Multiple ROIs in speckle contrast image; (b) normalized histograms of these ROIs

Fig.3 Two consecutive frames of the time lapse measurements. Green block denotes statically overlaid ROI, whereas purple block is defined by correlation-based motion tracker

Fig.4 Cross-correlations between selection and the next frame in series. Purple block shows new ROI coordinates with respect to maximum of cross-correlation function; green block is a statically overlaid ROI

Fig.5 Results of time lapse measurements performed with statically overlaid ROI (a) and with correlation based motion tracker (b)

3 3 Experiment with animals

Fig.6 Typical example of changes of mean arterial pressure (a) and CBF after bolus injection of phenylephrine in three doses (0.25, 0.5, and 1 µg/kg) in normotensive (b) and hypertensive (c) rats

4 4 Discussion and conclusion

{{custom_sec.title}}

{{custom_sec.title}}

References

RIGHTS & PERMISSIONS