Introduction
Coronary heart disease with angina pectoris became a significant factor affecting health and quality of life, and its mortality rate increases annually. Thus, a better plan must be urgently developed to improve efficacy of treatment and quality of life. Traditional Chinese medicine (TCM) possesses significant advantages in treatment of coronary heart disease with angina pectoris [
1–
12]. In TCM clinical practice, syndrome differentiation serves as the core of treatment for coronary heart disease with angina pectoris. Correct syndrome recognition will improve efficacy of treatment for this disease. Therefore, determining syndrome features of coronary heart disease with angina pectoris from effective clinical cases bears significance. Related studies focused on syndrome classification based on data mining methods. Various classification techniques, e.g., naïve Bayes, support vector machine, k nearest neighbor, and latent structure models, were applied in quantitative syndrome analysis [
13–
20]. Better results were obtained for several important diseases, e.g., hepatitis, viral hepatitis, and diabetes. By introducing these data mining models, researchers can mine from some effective cases objective syndrome feature distribution for diseases. Such feature distribution can be used to characterize the underlying relationship between a syndrome and its specific symptoms. However, few studies explored the underlying syndrome features for coronary heart disease with angina pectoris. Li
et al. focused on inquiry diagnosis model to determine the effective inquiry style for coronary heart disease [
4]. Chen
et al. focused on phenotypic networks to explore biological basis for coronary heart disease with angina pectoris [
7]. Zhou
et al. analyzed syndrome features of coronary heart disease with angina pectoris based on the cluster method [
1–
3]. However, their research data originated from collection of clinical records. That is, pre-designed clinical manifestations were used to collect symptom information. Compared with real-world clinical data for coronary heart disease with angina pectoris, collection of clinical records requires additional cost for researchers and hence further burdens clinical doctors. Therefore, efficiency of research can be improved by finding consistent evidence between real-world and theoretical data for coronary heart disease with angina pectoris. In this study, we selected 860 cases of coronary heart disease with angina pectoris from TCM Clinical Research Information Sharing System (hereinafter “Sharing System”) for TCM clinics and research [
21]. Sharing System was structured for clinical practice and research and is extensively applied in domestic hospitals because of its high efficiency. In this study, objective syndrome classifications included those first automatically extracted from 860 cases of coronary heart disease with angina pectoris using the cluster method. Cluster algorithm is the most robust method and is available for many data styles in data mining methods. Second, clinical experts on coronary heart disease with angina pectoris discussed and determined the final syndrome classification and optimized syndrome features. Final syndrome results thoroughly reflect syndrome differentiation experience of patients with coronary heart disease with angina pectoris. Simultaneously, better syndrome features provide objective diagnostic evidence for clinical doctors for improvement of accuracy of syndrome diagnosis and efficacy of clinical treatment for coronary heart disease with angina pectoris.
This paper consists of the following sections. Materials and Methods introduce case selection, data pre-processing, and analysis method. Results discuss evaluation of performance of syndrome classification on selected cases. Conclusions and recommendations for future work are also discussed.
Materials and methods
Case selection
A total of 860 selected patients of coronary heart disease with angina pectoris were scheduled to receive treatment in the Guanganmen Hospital of Traditional Chinese Medicine from January 2012 to December 2013. These patients were effectively treated to reduce frequency of chest pain. Clinical information of these patients were collected by the Sharing System for TCM clinics and research. Different from solely observed symptoms and treatment plan in randomized controlled trials (RCT), treatment of these patients is based on syndrome differentiation of clinical doctors. Symptoms were collected and recorded based on clinical manifestation of patients. A total of 860 patients provided their clinical information, e.g., population, symptoms, syndromes, and treatment. Each patient possesses records for at least two treatments.
Data pre-processing
Extraction, transformation, and loading (ETL) software was used to pre-process clinical manifestations, e.g., population, symptom, tongue, pulse and treatment, of 860 patients with coronary heart disease with angina pectoris. Fig. 1 shows the interface of ETL.
ETL software includes some functional interfaces, e.g., database restore, operation database, detailed database, data transformation, data cleaning, data load, and semantic mergence. In this study, data pre-processing steps for coronary heart disease with angina pectoris are as follows: combining similar clinical information; deleting errors or completed clinical information; unifying different types of clinical information; and quantifying clinical information by 0 and 1. Noisy records were removed by filtering.
General materials
Among the 860 selected patients, 389 were males, and 471 were females, with an average age of 60.86±8.16 years; all were diagnosed with coronary heart disease with angina pectoris with an average disease course of 16.44±5.46 months. Finally, 115 clinical manifestations, i.e., 86 symptoms, 18 tongues, and 11 pulses, were extracted and generated to explore underlying syndromes of coronary heart disease with angina pectoris after data pre-processing.
Analysis method
All data were analyzed using SAS (Version 9.3, SAS Institute, Chicago, IL, USA) software. PROC VARCLUS in SAS was used to obtain clusters of symptoms, tongues, and pulses. Cluster results were used to analyze and explain syndrome features of coronary heart disease with angina pectoris. Variables of clinical manifestations were selected and contained in one cluster at P<0.05, which will be considered significant for all analyses.
Results
Symptom indicators
Table 1 presents the top 20 clinical symptoms of coronary heart disease with angina pectoris. “Lack of strength” was the most dominant symptom in 860 included patients.
Clustering results
Symptoms were clustered to six groups to characterize syndrome features. Table 2 shows relevance of six clusters, and Table 3 shows clustering results on 115 symptoms.
Syndrome distribution
Table 4 shows clustering results on symptoms and accuracy of syndrome differentiation.
Discussion
Cluster analysis can be used to determine underlying syndrome differentiation without prior knowledge of syndrome classification. Results objectively present the underlying relationship among TCM clinical symptoms. In this study, clustering results were obtained from 860 patients of coronary heart disease with angina pectoris; these patients were treated effectively by syndrome differentiation.
First, we observed that each cluster features high inter-heterogeneity with each other, as indicated in Table 2. The best heterogeneity was observed in cluster 5 (liver depression and spleen deficiency; and Qi stagnation and blood stasis). Average relevance coefficient of clusters 5 and 2 reached 0.0483 and 0.0944, respectively, which were smaller than those of other clusters. However, the highest relevance coefficient was noted in cluster 6, i.e., Yin deficiency and depleted fluid, blood stasis, and obstruction collaterals. Inter-relevance coefficients of all clusters were small, that is, all clusters featured good inter-heterogeneity.
Second, the best intra-consistency was detected in clusters 4 (Qi and blood deficiency, blood stasis, and obstruction collaterals) and 5 (liver depression and spleen deficiency; and Qi stagnation and blood stasis) (Tables 2 and 3). The numbers of symptoms in 246 and 227 patients of coronary heart disease with angina pectoris reached 17 and 10, respectively. For clusters 4 and 5, patient-to-symptom ratios totaled 14.47% and 22.70%, respectively. All clusters showed high patient-to-symptom ratios, that is, all clusters presented good intra-consistency.
According to Table 4, Qi and blood deficiency, blood stasis, and obstruction collaterals are the main syndrome types for patients of coronary heart disease with angina pectoris. This result is consistent with diagnosis information of “chest pain” or “pectoral pain with stuffiness” category in TCM.
In summary, syndrome features were similar to results in references [
1–
3], which were based on collection of clinical records. Among 860 included patients, Qi deficiency, blood deficiency, blood stasis, and Qi stagnation are the main syndrome elements for coronary heart disease with angina pectoris. Calculated results agree with those in actual clinical practice. Angina pectoris belongs to “chest pain” or “pectoral pain with stuffiness” categories in TCM. Syndromes also include stagnation of the heart blood, phlegm, and Yin-cold and deficiencies of the heart-Yin and kidney-Yin, Yang and Yin, and Yang-Qi in pectoral pain with stuffiness. Some of the most important pathogeneses include deficiency and stagnation of Qi and blood. Results presented the main therapy rule of activating blood and promoting Qi. This finding conforms to the rule of “relieving the secondary in an urgent case and removing the primary in a chronic case.”
Conclusions
We conclude that similar syndrome features can be captured from real-world clinical data of coronary heart disease with angina pectoris. Preciseness of results were verified according to TCM theory, and they provide objective evidence for TCM clinical practice and studies.
Higher Education Press and Springer-Verlag GmbH Germany