Introduction
Accurate diagnosis of different bronchopulmonary diseases is important in clinical practice, particularly when conventional examination methods are inapplicable for certain patients, such as children, pregnant women, and extremely severe patients [
1].
Many studies explored the relationship between the composition of exhaled air and various diseases. The exhaled air of patients with chronic obstructive pulmonary disease (COPD) differs from that of healthy individuals in terms of the composition of volatile organic compounds (VOCs) [
2,
3]. Van Berkel et al. [
3] selected six types of VOCs that exhibit 92% sensitivity toward COPD diagnostics. Phillips et al. [
4] created a diagnostic model based on VOC profile analysis to distinguish COPD patients from healthy individuals and achieved 64% diagnostic accuracy. The diagnostic accuracy of COPD patients improved up to 74% after smokers were excluded from the patients with COPD. Moreover, the amounts of hydrogen cyanide and isoprene in exhaled air were analyzed to identify patients with community-acquired pneumonia [
5]. Patients with pulmonary tuberculosis were also determined on the basis of the absorption spectral profiles of exhaled air [
6].
VOCs are not strictly specific for any disease, thereby complicating the practical application of VOC measurement for diagnostics. Bronchial asthma significantly increases the amount of exhaled NO and moderately augments the amount of CO, whereas COPD slightly increases NO and markedly increases CO [
7]. Thus, using the set of VOCs or the direct absorption spectral profile of breath samples as a fingerprint of bronchopulmonary diseases is convenient. In the latter case, various distant metrics should be used to estimate the “difference” between spectra. Distance metrics with good quality can identify important features and discriminate relevant and irrelevant features. Specifying which pairs of data points are similar or dissimilar is important in biomedical analysis. The Mahalanobis distance is a measure between two data points in the space defined by relevant features by assigning corresponding weights to the features of data points.
This study aims to compare the absorption spectral profiles of breath air from patients with various bronchopulmonary diseases and from healthy volunteers on the basis of Mahalanobis distance.
The absorption spectra of breath air from patients with various bronchopulmonary diseases and from healthy volunteers are visually similar. We used the Mahalanobis distance to quantify the similarity or difference in the absorption spectra of breath air from patients with various bronchopulmonary diseases and from healthy volunteers. Previous similar studies [
6] used this method to separate patients with pulmonary tuberculosis from healthy volunteers and patients with other diseases. In the present study, the same equipment was used, but breath air samples were obtained from a different group of patients.
Methods
The study involved 20 healthy volunteers and 77 patients with bronchopulmonary diseases, including COPD, bronchial asthma, pulmonary tuberculosis, and community-acquired pneumonia. Details regarding participants are presented in Table 1.
Samples of exhaled air were collected in the morning (08:00 am to 09:00 am) on an empty stomach before taking inhaled drugs and after three to five times of mouth rinsing with boiled water. The smokers refrained from smoking for a minimum of 6 h before sampling. Exhaled air was collected into a preliminary sterilized glass tube with dense cotton-gauze tubes. Each participant exhaled in a relaxed manner for one to two times into the tube with lips tightly clasped. Then, the tube was tightly closed with sterile cotton-gauze plugs.
The absorption spectrum of exhaled air samples was recorded on an intra-cavity photo-acoustic gas analyzer (ILPA-1, Special Technologies, Ltd., Russia) with a photo-acoustic detector and CO
2 laser with a tuning range from 9.2 to 10.8 µm [
8]. Three samples of exhaled air were obtained from every participant. The absorption spectrum of each sample was recorded for five times to reduce random error.
The measured spectra for participants between the test group (patients with bronchopulmonary diseases) and the reference group (healthy volunteers) were compared on the basis of the Mahalanobis distance. The feature vectors of the participants from groups and were and , correspondingly. and are the total quantities of feature vectors that correspond to all participants in the group. Thus, the specific average square of Mahalanobis distance can be defined as
where
is the Mahalanobis distance,
is the covariance matrix of the features of participants from group
[
9], and
is the dimension of the feature space.
The sets of absorption coefficients of exhaled air from patients with bronchopulmonary diseases and from healthy volunteers were used as feature vectors and , correspondently. We marked the specific average square of the Mahalanobis distances of exhaled air absorption spectra in the 10P and 10R spectral bands of CO2 laser generation for participants as , correspondingly. We selected the 10P and 10R spectral bands because the measurement error was small in these bands.
Results
We used healthy volunteers as the reference group. The values of were calculated for every patient with bronchopulmonary disease. On the basis of the calculation results, the values of were not subjected to the law of normal distribution. Thus, we used the median and quartile values (25% and 75%, respectively) for analysis (Table 2). To compare the values of for various groups, pairwise statistical analysis in terms of Mann–Whitney coefficients was carried out. Statistical significance was considered at p <0.05. Considering that pneumonia and tuberculosis are urgently arising lung diseases, we combined the values of for these diseases in the joint group of “urgent lung diseases” (ULD).
The analysis of exhaled air absorption spectrum in the 10P spectral band revealed a significant difference (p≤0.01) in between the patients with pulmonary diseases and the healthy volunteers. A significant difference (p = 0.097) in was also observed between the COPD patients and the healthy volunteers in the 10R spectral band (Table 2).
As shown in Table 2, significantly differed between the patients with bronchial asthma and COPD (p<0.001), as well as between the patients with ULD and COPD (p = 0.006). Similarly, significantly differed between the healthy and asthmatic patients, healthy participants and ULD patients, ULD and COPD patients, and asthmatic and COPD patients (all at p<0.001). The values of did not allow distinguishing patients with bronchial asthma from those with ULD.
Furthermore, we calculated the threshold values of and estimated the sensitivity and specificity of the differential diagnostics within the group of bronchopulmonary diseases using receiver operating characteristic analysis (Table 3).
The results shown in Table 3 can be presented in the form of diagnostic rules:
healthy 1.28 COPD 2.29 bronchial asthma
Se= 70%, Sp= 70% Se= 75%, Sp= 74%
Therefore, COPD can be expected in 70% of cases when is between 1.28 and 2.29, whereas bronchial asthma can be expected in 75% of cases when value exceeds 2.29. ULD was not included in the current scheme because no results were obtained for ULDs in comparison with bronchial asthma in the 10P and 10R bands.
Conclusions
Analysis of the Mahalanobis distance-based absorption spectral profiles of breath air from bronchopulmonary patients and healthy volunteers allows the formulation of a preliminary diagnosis.
Higher Education Press and Springer-Verlag Berlin Heidelberg