Decision Trees For Differential Diagnosis Pdf To Word
BackgroundNew technologies like echocardiography, color Doppler, CT, and MRI provide more direct and accurate evidence of heart disease than heart auscultation. However, these modalities are costly, large in size and operationally complex and therefore are not suitable for use in rural areas, in homecare and generally in primary healthcare set-ups. Furthermore the majority of internal medicine and cardiology training programs underestimate the value of cardiac auscultation and junior clinicians are not adequately trained in this field. Therefore efficient decision support systems would be very useful for supporting clinicians to make better heart sound diagnosis. In this study a rule-based method, based on decision trees, has been developed for differential diagnosis between 'clear' Aortic Stenosis (AS) and 'clear' Mitral Regurgitation (MR) using heart sounds. MethodsFor the purposes of our experiment we used a collection of 84 heart sound signals including 41 heart sound signals with 'clear' AS systolic murmur and 43 with 'clear' MR systolic murmur. Signals were initially preprocessed to detect 1st and 2nd heart sounds.
Next a total of 100 features were determined for every heart sound signal and relevance to the differentiation between AS and MR was estimated. The performance of fully expanded decision tree classifiers and Pruned decision tree classifiers were studied based on various training and test datasets. Similarly, pruned decision tree classifiers were used to examine their differentiation capabilities. In order to build a generalized decision support system for heart sound diagnosis, we have divided the problem into sub problems, dealing with either one morphological characteristic of the heart-sound waveform or with difficult to distinguish cases. ResultsRelevance analysis on the different heart sound features demonstrated that the most relevant features are the frequency features and the morphological features that describe S1, S2 and the systolic murmur. The results are compatible with the physical understanding of the problem since AS and MR systolic murmurs have different frequency contents and different waveform shapes.
On the contrary, in the diastolic phase there is no murmur in both diseases which results in the fact that the diastolic phase signals cannot contribute to the differentiation between AS and MR.We used a fully expanded decision tree classifier with a training set of 34 records and a test set of 50 records which resulted in a classification accuracy (total corrects/total tested) of 90% (45 correct/50 total records). Furthermore, the method proved to correctly classify both AS and MR cases since the partial AS and MR accuracies were 91.6% and 88.5% respectively.
Similar accuracy was achieved using decision trees with a fraction of the 100 features (the most relevant). Pruned Differentiation decision trees did not significantly change the classification accuracy of the decision trees both in terms of partial classification and overall classification as well. DiscussionPresent work has indicated that decision tree algorithms decision tree algorithms can be successfully used as a basis for a decision support system to assist young and inexperienced clinicians to make better heart sound diagnosis.
Differential Diagnosis Website
Furthermore, Relevance Analysis can be used to determine a small critical subset, from the initial set of features, which contains most of the information required for the differentiation. Decision tree structures, if properly trained can increase their classification accuracy in new test data sets. The classification accuracy and the generalization capabilities of the Fully Expanded decision tree structures and the Pruned decision tree structures have not significant difference for this examined sub-problem.
However, the generalization capabilities of the decision tree based methods were found to be satisfactory. Decision tree structures were tested on various training and test data set and the classification accuracy was found to be consistently high. BackgroundNew technologies like Echocardiography, Color Doppler, CT, and MRI provide more direct and accurate evidence of heart disease than heart auscultation. However, these modalities are costly, large in size and operationally complex. Therefore these technologies are not suitable for use in rural areas, in homecare and generally in primary healthcare set-ups.
Although heart sounds can provide low cost screening for pathologic conditions, internal medicine and cardiology training programs underestimate the value of cardiac auscultation and junior clinicians are not adequately trained in this field. The pool of skilled clinicians trained in the era before echocardiography continues to age, and the skills for cardiac auscultation is in danger to disappear. Therefore efficient decision support systems would be very useful for supporting clinicians to make better heart sound diagnosis, especially in rural areas, in homecare and in primary healthcare. Recent advances in Information Technology systems, in digital electronic stethoscopes, in acoustic signal processing and in pattern recognition methods have inspired the design of systems based on electronic stethoscopes and computers ,. In the last decade, many research activities were conducted concerning automated and semi-automated heart sound diagnosis, regarding it as a challenging and promising subject. Many researchers have conducted research on the segmentation of the heart sound into heart cycles -, the discrimination of the first from the second heart sound , the analysis of the first, the second heart sound and the heart murmurs -, and also on features extraction and classification of heart sounds and murmurs ,. These activities mainly focused on the morphological characteristics of the heart sound waveforms.
On the contrary very few activities focused on the exploitation of heart sound patterns for the direct diagnosis of cardiac diseases. The research of Dr. Akay and co-workers in Coronary Artery Disease is regarded as very important in this area. Another important research activity is by Hedben and Torry in the area of identification of Aortic Stenosis and Mitral Regurgitation.
The reason for this focus on the morphological characteristics is the following: Cardiac auscultation and diagnosis is quite complicated depending not only on the heart sound, but also on the phase of the respiration cycle, the patient's position, the physical examination variables (such as sex, age, body weight, smoking condition, diastolic and systolic pressure), the patient's history, medication etc. The heart sound information alone is not adequate in most cases for heart disease diagnosis, so researchers generally focused on the identification and extraction of the morphological characteristics of the heart sound.The algorithms which have been utilized for this purpose were based on: i) Auto Regressive and Auto Regressive Moving Average Spectral Methods , ii) Power Spectral Density , iii) Trimmed Mean Spectrograms , iv) Sort Time Fourier Transform , v) Wavelet Transform , vi) Wigner-Ville distribution, and generally the ambiguous function. The classification algorithms were mainly based on: i) Discriminant analysis , ii) Nearest Neighbour , iii) Bayesian networks , iv) Neural Networks , (backpropagation, radial basis function, multiplayer perceptron, self organizing map, probabilistic neural networks etc) and v) rule-based methods.In this paper a rule-based method, based on decision trees, has been developed for differential diagnosis between the Aortic Stenosis (AS) and the Mitral Regurgitation (MR) using heart sounds.
This is a very significant problem in cardiology since there is very often confusion between these two diseases. The correct discrimination between them is of critical importance for the determination of the appropriate treatment to be recommended. Previous research activities concerning these diseases were mainly focused on their clinical aspects, the assessment of their severity by spectral analysis of cardiac murmurs and the time frequency representation of the systolic murmur they produce ,. The problem of differentiation between AS and MR has been investigated in and. The method proposed in was based on the different statistic values in the spectrogram of the systolic murmurs that these two diseases produce. The method proposed in was based on frequency spectrum analysis and a filter bank envelope analysis of the first and the second heart sound. Our work aims to investigate whether decision tree-based classifier algorithms can be a trustworthy alternative for such heart sound diagnosis problems.
For this purpose a number of different decision tree structures were implemented for the classification and differentiation of heart sound patterns produced by patients either with AS or with MR. We chose the decision trees as classification algorithm because the knowledge representation model that they produce is very similar to the differential diagnosis that the clinicians use. In other words this method does not work as a black box for the clinicians (i.e. In medical terms). On the contrary neural networks, genetic algorithms or generally algorithms that need a lot of iteration in order to converge to a solution are working as a black box for the clinicians. Using decision trees clinicians can trace back the model and either accept or reject the proposed suggestion. This capability increases the clinician confidence about the final diagnosis.In particular, the first goal of this work was to evaluate the suitability of various decision tree structures for this important diagnostic problem.
The evaluated structures included both Fully Expanded decision trees Structures and Pruned decision trees Structures. The second goal was to evaluate the diagnostic abilities of the investigated heart sound features for decision tree – based diagnosis. In both the above evaluations the generalization capabilities of the implemented decision tree structures were also examined. Generalization was a very important issue due to the difficulty and the tedious work of having adequate data for all possible data acquisition methods within the training data set. The third goal of this work was to suggest a way of selecting the most appropriate decision tree structures and heart sound features in order to provide the basis of an effective semi-automated diagnostic system. Problem definition: differentiation between AS and MR murmursA typical normal heart sound signal that corresponds to a heart cycle consists of four structural components:– The first heart sound (S1, corresponding to the closure of the mitral and the tricuspid valve).– The systolic phase.– The second heart sound (S2, corresponding to the closure of the aortic and pulmonary valve).– The diastolic phase.Heart sound signals with various additional sounds are observed in patients with heart diseases. The tone of these sounds can be either like murmur or click-like.
The murmurs are generated from the turbulent blood flow and are named after the phase of the heart cycle where they are best heard, e.g. Systolic murmur (SM), diastolic murmur (DM), pro-systolic murmur (PSM) etc. The heart sound diagnosis problem consists in the diagnosis from heart sound signals of a) whether the heart is healthy, or not and b) if it is not healthy, which is the exact heart disease. In AS the aortic valve is thickened and narrowed.
As a result, it does not fully open during cardiac contraction in systolic phase, leading to abnormally high pressure in the left ventricle and producing a systolic murmur that has relatively uniform frequency and rhomboid shape in magnitude (Figure ). In MR the mitral valve does not close completely during systole (due to tissue lesion) and there is blood leakage back from the left ventricle to the left atrium. MR is producing a systolic murmur that has relatively uniform frequency and magnitude slope (Figure ).
The spectral content of the MR systolic murmur has faintly higher frequencies than the spectral content of the AS systolic murmur. The closure of aortic valve affects the second heart sound and the closure of the mitral valve affects the first heart sound. Two heart cycles of AS and MR heart sounds.It can be concluded by the visual insplection of Figure that the AS systolic murmur has very similar characteristics with the MR systolic murmur and therefore the differentiation between these two diseases is a difficult problem in heart sound diagnosis, especially for young inexperienced clinicians. All these facts define the problem of differentiation between the AS and the MR murmurs. In the following sections we propose a method based on time-frequency features and decision tree classifiers for solving this problem. Preprocessing of heart sound dataCardiac auscultation and diagnosis, as already mentioned, are quite complicated, depending not only on the heart sound but also on other factors.
There is also a great variability at the quality of the heart sound affected by factors related to the acquisition method. Some important factors are: the type of stethoscope used, the sensor that the stethoscope has (i.e. Microphone, piezoelectric film), the mode that the stethoscope was used (i.e.
Bell, diaphragm, extended), the way the stethoscope was pressed on the patients skin (firmly or loosely), the medication the patient used during the auscultation (i.e. Vasodilators), the patient's position (i.e. Supine position, standing, squatting), the auscultation areas (i.e. Apex, lower left sternal border, pulmonic area, aortic area), the phase of patients' respiration cycle (inspiration, expiration), the filters used while acquiring the heart sound (i.e. Anti-tremor filter, respiratory sound reduction, noise reduction).
The variation of all these parameters leads to a large number of different heart sound acquisition methods. A heart sound diagnosis algorithm should take into account the variability of the acquisition method, and also be tested in heart sound signals from different sources and recorded with different acquisition methods. For this purpose we collected heart sound signals from different heart sound sources - and created a 'global' heart sound database. The heart sound signals were collected from educational audiocassettes, audio CDs and CD ROMs and all cases were already diagnosed and related to a specific heart disease.
For the purposes of the present experiment 41 heart sound signals with 'clear' AS systolic murmur and 43 with ' 'clear' MR systolic murmur were used.This total set of 41 + 43 = 84 heart sound signals were initially pre-processed in order to detect the cardiac cycles in every signal, i.e. Detect S1 and S2, using a method based on the following steps:a) Wavelet decomposition as described in (with the only difference being that the 4th and 5th level detail was kept, i.e. Relevance analysisBefore constructing and using the decision tree Classifiers a Relevance Analysis of the features was performed.
Relevance Analysis aims to improve the classification efficiency by eliminating the less useful (for the classification) features and reducing the amount of input data to the classification stage. For example from the previous description of the database table attributes it is obvious that the pattern identification code (primary key) is irrelevant to the classification, therefore it should not be used by the classification algorithm.In this work we used the value of the Uncertainty Coefficient - of each of the above 100 features to rank these features according to their relevance to the classifying (dependent variable) attribute which in our case is the hdisease attribute. In order to compute the Uncertainty Coefficients, the 100 numeric attributes were transformed into corresponding categorical ones.
The algorithm that has been used for optimizing this transformation for the specific classification decision is described in. Then for each of these 100 categorical attributes, its Uncertainty Coefficient was calculated, as described in the following paragraphs.Initially we have a set P of p data records (84 data records in our case).
The classifying attribute (dependent variable) for this differential diagnosis problem has 2 possible discrete values: AS or MR, which define 2 corresponding subsets discrete classes Pi (i = 1, 2) of the above set P. If P contains pi records for each Pi (i = 1, 2), (p = p1 + p2) then the Information Entropy (IE) of the initial set P, which is a measure of its homogeneity concerning the classifying attribute-dependent variable (higher homogeneity corresponding to lower values of IE) is given by:Any categorical attribute CA (from the 100 ones created via the above transformation) with possible values ca 1, ca 2. Ca k can be used to partition P into subsets Ca 1, Ca 2. Ca k, where subset Ca j contains all those records from P that have CA = ca j. Let Ca j contain p1 j records of class P1 and p2 j from class P2.
The Information Entropy of the set P, after this partition, is equal to the weighted average of the information Entropies of the subsets Ca 1, Ca 2. Construction of decision tree classifier structuresA decision tree is a class discrimination tree structure consisting of non-leaf nodes (internal nodes) and leaf nodes (final-without child nodes), ,. For constructing a decision tree we use a training data set, which is a set of records, for which we know all feature-attributes (independent variables) and the classifying attribute (dependent variable). Starting from the root node, we determine the best test (= attribute + condition) for splitting the training data set, which created the most homogeneous subsets concerning the classifying attribute and therefore gives the highest classification accuracy. Each of these subsets can be further split in the same way etc.
Each non-leaf node of the tree constitutes a split point, based on a test on one of the attributes, which determines how the data is partitioned (split). Such a test at a decision tree non-leaf node usually has exactly two possible outcomes (binary decision tree), which lead to two corresponding child nodes. The left child node inherits from the parent node the data that satisfies the splitting test of the parent node, while the right child node inherits the data that does not satisfy the splitting test. As the depth of the tree increases, the size of the data in the nodes is decreasing and the probability that this data belongs to only one class increases (leading to nodes of lower impurity concerning the classifying attribute-dependent variable). If the initial data that is inherited to the root node are governed by strong classification rules, then the decision tree is going to be built in a few steps, and the depth of the tree will be small.
On the other hand, if the classification rules in the initial data of the root node are weak, then the number of steps to build the classifier will be significant and the depth of the tree will be higher.During the construction of the tree, the goal at each node is to determine the best splitting test (= attribute + condition), which best divides the training records belonging to that leaf into most homogeneous subsets concerning the classifying attribute. The value of a splitting test depends upon how well it separates the classes. For the evaluation of alternative splitting tests in a node various splitting indexes can be used. In this work the splitting index that has been used is the Entropy Index. Assuming that we have a data set P which contains p records from the 2 classes and where p j/ p is the relative frequency of class j in data set P then the Entropy Index Ent( P) is given by:If a split divides P into two subsets P 1 and P 2 where P 1 contains p 1 examples, P 2 contains p 2 examples and p 1 + p 2 = p, then the values of Entropy Index of the divided data is given by:In order to find the best splitting test (= attribute + condition) for a node we examine for each attribute data set all possible splitting tests based on these attributes.
We finally select the splitting test with the lowest value for the above Ent split to split the node.The expansion of a decision tree can continue, dividing the training set in to subsets (corresponding to new child nodes), until we have subsets-nodes with 'homogeneous' records having all the same value of the classification attribute. Although theoretically this is a possible scenario in practical situations usually leads to a decision tree structure that is over-fitted to the training data set and to the noise this data set contains. Consequently such a decision tree structure has not good generalization capabilities (= high classification accuracy for other test data sets). Stopping rules have been therefore adopted in order to prevent such an over-fitting; this approach is usually referred to as decision tree Pruning. A simple stopping rule that has been examined in this work was a restriction concerning the minimum node size. The minimum node size sets a minimum number of records required per node; in our work for the pruned decision tree structure described in the Results section a node was not split if it had fewer records than a percentage of the initial training data set records (percentages 5%, 10%, 15%, and 20% were tried).Finally the constructed Decision tree structure is used as a classifier in order to classify new data sets. In our work we used this classifier to classify both the training data set and the test data set.
Selection of training and test pattern setsIn practical situations, where the acquisition of heart sound signals for all probable cases is time consuming and almost impossible, we are very much interested in the capability of the constructed-trained decision tree structure to generalize successfully. In order to examine the generalization capabilities of the constructed decision tree structures, the complete pattern set was divided in two subsets. The first subset included 34 patterns, 17 with AS systolic murmur and 17 with MR systolic murmur (40% of each category of the heart sound patterns set) randomly selected out of each pattern category (AS and MR) of the patterns set. This subset was used as the training set. The other subset, that included the remaining 50 patterns (24 belonging to the AS class and 26 belonging to the MR class), was used as the test set. In this way the first training-test sets scheme was developed. The division of the pattern set was repeated, keeping the same proportions (40% training set – 60% test set), but using different patterns giving a second scheme.
In the same way we created some more schemes with different proportions in order to examine the impact of the training and test data set size on the performance of the decision tree classifier. These schemes are presented in Table. Relevance analysisIn order to investigate the relevance and the contribution to the differentiation between AS, and MR for each of the above mentioned 100 heart sound features, the Uncertainty Coefficients were calculated for each one of them considering the hdisease field as the classifying attribute. The calculation was made separately for the training data set of each of the 10 schemes outlined in Table and for each heart sound feature. Then the average value and the standard deviation of the Uncertainty Coefficient were calculated taking into account the 10 values that were calculated from these 10 schemes. The average values and the standard deviations of the Uncertainty Coefficient for all the 100 features are presented in Figure. Note that the most relevant features are the frequency features (i.e.
Ediashf = High Frequency Energy in diastolic phase, Ediasmf = Medium Frequency Energy in diastolic phase, Esyshf = High Frequency Energy in systolic phase, Ediasmh = Medium High Frequency Energy in diastolic phase, etc) and the morphological features that describe the S1 (i.e. S11.s18), the S2 (i.e. S21.s28) and the systolic murmur (sys1. These results are compatible with our physical understanding of the problem; the AS and MR systolic murmurs have different frequency content and different envelope shape.
On the contrary, in the diastolic phase there is no murmur in both diseases, therefore the diastolic phase of heart sound signals cannot contribute to the differentiation between AS and MR. Additionally the closure of Mitral valve affects the S1 and the closure of Aortic valve affects the S2. Fully expanded decision treeAccording to the methodology described in the Methods section, we initially construct the decision tree structure with no restriction to the nodes (without pruning). The training data-set used was from the scheme 40%a (Table ) and had 34 records (heart sound patterns). The rest 50 patterns were used as a test set.
The Decision tree was constructed based on the training data set and afterwards using this decision tree the patterns of the test data set were classified in order to investigate the generalization capabilities of the decision tree. Then the percentage of the correctly classified patterns of the test data set (according to the hdisease attribute) was calculated. The classification performance for the 40%a scheme was: 45 patterns were classified correctly (22 AS and 23 MR) while for 5 patterns the classification was wrong (2 AS and 3 MR); therefore the percentage of classification accuracy, is defined asis 45/50.100 = 90%.In order to exploit the results of the Relevance Analysis described earlier, we constructed five more decision trees using only the heart sound features that have Uncertainty Coefficient above a) 50%, b) 60%, c) 70%, d) 80% and e) 90%. For example in the case e) of 90%, the features that used were the following sixteen: 1) Ediaslf, 2) Ediasmf, 3) Ediasmhf, 4) Esyshf, 5) Esyslf, 6) Esysmf, 7) Esysmhf, 8) s12, 9) s14, 10) s15, 11) s16, 12) s22, 13) s24, 14) s25, 15) sys23, 16) sys24.
These five decision trees were used for classifying the patterns of the test data set, with results identical to the ones described above. Therefore using only 16 out of the 100 features we can get identical levels of classification accuracy with much less computational effort.
For each of the data schemes of Table, we repeated the above calculation and finally we calculated the average classification accuracy and its standard deviation for the above five cases a) to e) (= using different number of features, according to their Uncertainty Coefficients). The results are shown in Figure.
We notice from Figure that the classification accuracy results for cases a) to e) are identical; therefore the standard deviation is very low (zero in most of the cases) confirming the above-mentioned relevant conclusions. Also we can see that for the schemes with the largest training data sets (= 80%a and 80%b schemes, with the training data set of 67 record-patterns) we have a consistent (= the same for both schemes) level of classification accuracy about 88.3%. For other schemes of the classification accuracy e.g. For the 50%a scheme the classification accuracy is 75% while for the 50%b scheme it is 95%. This is probably due to the small size of the corresponding training sets.
Therefore it is concluded that decision trees if properly trained can give a high and consistent level of classification accuracy concerning the differentiation between AS and MR, using auscultation findings. Also it is worth mentioning that the Classification Accuracy for the training data set was 100% for all the examined cases. Average classification total accuracy, ASaccuracy, and MRaccuracy for the fully expanded decision tree classifier.The standard deviation of all these average values was very low, proving once more that the 16 most relevant features (out of the initial 100 features) contain all the information we need for the differentiation between AS and MR. The conclusions derived from Figure concerning the partial Classification Accuracies are similar to the ones derived above concerning the total Classification Accuracy; with the exception of schemes 80%a and 80%b, in which the test data set for each disease is very small (8–9 patterns), therefore the corresponding partial Classification Accuracies are less reliable. Also we can see that for 6 out of 10 schemes (40%a, 50%a, 50%b, 70%a, 70%b, 80%b) there are no significant difference between AS Classification Accuracy and MR Classification Accuracy, while for the remaining 4 schemes (40%b, 60%a, 60%b, 80%a) the MR Classification Accuracy is significantly higher than the AS Classification Accuracy; therefore in general the classification performance of the decision trees for the MR heart sounds is higher than it is for AS heart sounds. In order to confirm statistically this first conclusion, a t-test was performed with the null hypothesis statement H0:Mean (SchemeAverageASAccuracy) = Mean (SchemeAverageMRAccuracy),and the alternative hypothesis statement H1:Mean (SchemeAverageASAccuracy). Pruned decision treeIn this part of our work we tried to investigate if we could improve the generalization capabilities of the decision trees by placing a restriction during the training phase concerning the creation of new nodes of the decision tree and not allowing it to be fully expanded as in 5.2.
The restriction we placed was that the number of records-patterns in a node as a percentage of the total number of records of the initial training data set should be at least a predefined percentage, usually referred to as the support of the node. The predefined percentages (minimum support levels) we examined were 5%, 10%, 15% and 20%. In particular, in the construction of a decision tree structure during the training phase for each parent node, the size (number of records-patterns) of its child nodes was examined. If the size of one of these child nodes was less than, for example, 5% of the total number of records-patterns of the initial training set i.e. The node had support less than 5%, then the parent node was not further split and was converted to a leaf node and the decision tree was not further expanded from that path.
With this exception the same calculations described in paragraph 5.2 were repeated (namely with the only difference that the leaf nodes of the decision tree had at least the predefined support). Figure shows an example of a resulting decision tree following the procedure described. Example of a resulting pruned decision tree.Figure shows the Total Classification Accuracies achieved with Pruned decision trees having minimum leaf node support at least 5%, 10%, 15% and 20%, for all the data schemes described in Table, compared with the previous results from the Fully expanded decision tree (0% minimum support at leaf nodes). From Figure we can see that in 5 out of the 10 schemes (40%a, 40%b, 50%a, 70%a, 80%a) Pruned decision trees give exactly the same Total Classification Accuracy results with the corresponding Fully-Expanded decision trees, in 2 schemes (60%b, 80%b) pruning slightly increases the Total Classification Accuracy, while in the remaining 3 schemes (50%b, 60%a, 70%b) pruning slightly decreases the Total Classification Accuracy.
Therefore it is concluded that for our specific differentiation problem pruning does not significantly change the Classification Accuracy of the decision trees. We performed four individual t-tests, to confirm statistically this conclusion. The null hypothesis statement H0X% was defined as. Total classification accuracy results for all data schemes for pruned decision trees.Mean (SchemeAverageAccuracysupport0%) = Mean (SchemeAverageAccuracysupportX%),and the alternative hypothesis statement H1X% was defined as:Mean (SchemeAverageAccuracysupport0%) Mean (SchemeAverageAccuracysupportX%).The X% stands for the minimum support levels (i.e. 5% or 10% or 15% or 20%) used individually in each one of the four t-tests.
Decision Tree Pdf
Assuming that the error probability of rejecting H0X% while H0X% is truth is a = 1%, we found that there was a 99% confidence that the null hypothesis should be accepted. This statistical result confirms once more the above conclusion for the decision tree pruning.This conclusion is also confirmed by the corresponding partial classification Accuracies for AS and MR shown in Figure and Figure respectively. Finally it is worth mentioning that using Pruned decision trees the Classification Accuracy for the training data sets was not 100% for all schemes / cases (as it was when we used Fully Expanded decision trees); for many schemes / cases it was much lower reaching the level of 88% for some of them. ConclusionsWe have investigated the applicability and the suitability of a number of decision tree structures for the AS MR differentiation problem.
Criteria for this evaluation ware considered the classification Accuracy (both the total one and the partial ones) for the training set and the test set. The main conclusions from this work are the following:– The decision trees can be used with high levels of success for the differentiation between AS and MR. The decision tree model is simple and clinicians are familiar with it, because they use a similar way when they make differential diagnosis. Therefore the decision trees can be used as a basis for decision support systems, giving to the clinicians dealing with the heart sound diagnosis, especially to the ones in rural areas, in homecare, the primary healthcare etc, almost immediately an advice that helps them to make better heart sound diagnosis. This capability can reduce the costs and improve the quality of the healthcare for the cardiological problems.– There is a small subset of the initial features that contain most of the information required for the differentiation. Using Relevance Analysis we can determine this critical subset of features, which then can be used for constructing the decision tree.
Also the diagnosis for any new data set will be based only on this subset of features. In this way the required computational effort for training and using these decision trees significantly decrease.– In the specific discrimination problem the Fully Expanded decision tree structures have similar levels of Generalization and Classification Accuracy for new data in comparison with the Pruned decision tree structures.– Increasing the size of the training data sets (more patterns) improves the Classification Accuracy and the general reliability of the system. This is reasonable because the decision tree is trained with more samples that cover more cases.– The generalization capabilities of the decision tree based methods for problems similar to the one examined in this paper were found to be satisfactory. This is very important, due to the difficulty and the high cost of having enough training data for every possible case.
The decision tree structures were tested on various training and test data set and the Classification Accuracy was found to be consistently high. DeGroff CG, Bhatikar S, Hertzberg J, Shandas R, Valdes-Cruz L, Mahajan RL. Artificial neural network-based method of screening heart murmurs in children. 2001; 103:2711–2716. Criley SR, Criley DG, Criley JM, Beyond Heart Sound An Interactive Teaching and Skills Testing Program for Cardiac Examination. Blaufuss Medical Multimedia, San Francisco, CA, USA, Computers in Cardiology. 2000; 27:591–594.
Myint WW, Dillard B. An Electronic Stethoscope with Diagnosis Capability. In Proceedings of the 33rd IEEE Southeastern Symposium on System Theory. 18–20 March 2001. Lukkarinen S, Noponen A-L, Sikio K, Angerla A.
A New Phonocardiographic Recording System. Computers in Cardiology.
1997; 24:117–120. Haghighi-Mood A, Torry JN. A Sub-Band Energy Tracking Algorithm for Heart Sound Segmentation. Computers in Cardiology. 10–13 September 1995. Liang H, Lukkarinen S, Hartimo I. Heart Sound Segmentation Algorithm Based on Heart Sound Envelogram.
Computers in Cardiology. 7–10 Sept 1997. Liang H, Lukkarinen S, Hartimo I. A heart sound segmentation algorithm using wavelet decomposition and reconstruction. Proceedings of the 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 1997; 4:1630–1633. Hebden JE, Torry JN.
Neural network and conventional classifiers to distinguish between first and second heart sounds. IEE Colloquium (Digest) 1996. 3/1–3/6. Tovar-Corona B, Torry JN.
Time-frequency representation of systolic murmurs using wavelets. Computers in Cardiology. 13–16 Sep 1998. White PR, Collis WB, Salmon AP.
Analysing heart murmurs using time-frequency methods, Time-Frequency and Time-Scale Analysis. Proceedings of the IEEE-SP International Symposium. 18–21 June 1996. Wu Yanjun, Xu Jingping, Zhao Yan, Wang Jing, Wang Bo, Cheng Jingzhi. Time-frequency analysis of the second heart sound signals. In Proceedings of IEEE 17th Annual Conference Engineering in Medicine and Biology Society.
1995; 1:131–132. Wang W, Guo Z, Yang J, Zhang Y, Durand LG, Loew M.
Analysis of the first heart sound using the matching pursuit method. Med Biol Eng Comput. 2001; 39:644–648.
Nygaard H, Thuesen L, Terp K, Hasenkam JM, Paulsens PK. Assessing the severity of aortic valve stenosis by spectral analysis of cardiac murmurs (spectral vibrocardiography). Part II: Clinical aspects. The Journal of Heart Valve Disease. 1993; 2:468–475.
Liang H, Hartimo I. A heart sound feature extraction algorithm based on wavelet decomposition and reconstruction. In Proceedings of the 20th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 1998; 3:1539–1542. Doi: 10.1109/IEMBS.1998.747181. Sharif Z, Zainal MS, Sha'ameri AZ, Salleh SHS.
Analysis and classification of heart sounds and murmurs based on the instantaneous energy and frequency estimations. TENCON 2000 Proceedings IEEE. 130–134. Akay Y, Akay M, Welkowitz W, Kostis J. Noninvasive detection of coronary artery disease.
IEEE Engineering in Medicine and Biology. 1994; 13:761–764. Doi: 10.1139. Hebden JE, Torry JN. Identification of Aortic Stenosis and Mitral Regurgitation by Heart Sound Analysis. Computers in Cardiology.
1997; 24:109–112. Leung T, White P, Collis W, Brown E, Salmon A. Classification of heart sounds using time-frequency method and artificial neural networks. In Proceedings of the 22nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.
23–28 July 2000. Leung T, White P, Collis W, Brown E, Salmon A. Analysing paediatric heart murmurs with discriminant analysis.
In Proceedings of the 19th Annual conference of the IEEE Engineering in Medicine and Biology Society, Hong Kong. 1628–1631. Durand L, Guo Z, Sabbah H, Stein P. Comparison of spectral techniques for computer-assisted classification of spectra of heart sounds in patients with porcine bioprosthetic valves. Med Biol Eng Comput.
1993; 31:229–36. Wu CH. On the analysis and classification of heart sounds based on segmental Bayesian networks and time analysis. Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E. 1997; 4:343–350. Haghighi-Mood A, Torry JN.
Time-frequency analysis of systolic murmurs. IEE Colloquium (Digest) 1997. 7/1–7/2. Koundourakis G, En Visioner.
Doctoral Thesis. University of Manchester Institute of Science and Technology; 2001. A Data Mining Framework Based On Decision Trees. Kamber M, Winstone L, Gong W, Cheng S, Han J. In Proc of 1997 Int'l Workshop on Research Issues on Data Engineering (RIDE'97) Birmingham, England; 1997. Generalisation and Decision Tree Induction: Efficient Classification in Data Mining; pp.
Stasis A. Athens, National Technical University of Athens; 2003. Decision Support System for Heart Sound Diagnosis, using digital signal processing algorithms and data mining techniques. Liu B, Xia Y, Yu P. In Proceedings of the ACM International Conference on Information and Knowledge Management. Washington, DC; 2000. Clustering through decision tree construction.
Weiss SM, Indurkhya N. Small sample decision tree pruning.
Proceedings of the 11th International Conference on Machine Learning, Morgan Kaufmann. 335–342. Han J, Kamber M. Data Mining: Concepts and Techniques. Morgan Kaufman Publisher; 2001.
Stasis A, Loukis E, Pavlopoulos S, Koutsouris D. Using decision tree algorithms as a basis for a heart sound diagnosis decision support system. Information Technology Application in Biomedicine Conference Birmingham UK, IEEE-EMBS. 2003.
Stasis A, Skarpalezos D, Pavlopoulos S, Koutsouris D. Differentiation of opening snap, second heart sound split and third heart sound, using a multiple Decision Tree Architecture. Computational Management Science Conference Crete, Journal of the CMS. 2003. Karatzas N, Papadogianni D, Spanidou E, Klouva F, Sirigou A, Papavasiliou S. Twelve Recorded Lessons with heart sound simulator, Merk Sharp & Dohme Hellas.
Medical Publications Litsas, Athens; 1974. Criley JM, Criley DG, Zalace C. The physiological Origins of Heart Sounds and Murmurs. Harbor UCLA Medical Center, Blaufuss Medical Multimedia; 1995. Littman, 20 Examples of Cardiac & Pulmonary Auscultation. Cable C. The Auscultation Assistant.
1997. Glass L, Pennycook B. Virtual Stethoscope. McGill University, Molson Medical Informatics Project; 1997. VMRCVM Class of 2002, Virginia Maryland regional College of veterinary medicine. Kocabasoglu YE, Henning RH. Human Heart Sounds.
Frontiers in Bioscience, Normal and Abnormal EKGs and Heart Sounds. Student Internal Medicine Society, Class 2000 Cardiology Heart Sounds Wave Files, American College of Physicians, Baylor College of Medicine.
The introduction of polythetic diagnostic classification (DSM-III and ICD-10) in psychiatry was anticipated to improve the reliability of psychiatric diagnoses, facilitate research, and eventually allow the then future DSM-IV to become anchored in objective, etiological criteria. However, the preparations and release of DSM-IV and DSM-5 highlighted the fact that the etiological promise has not materialized. Psychiatric classifications will continue in the foreseeable future to be based mainly on clinical descriptions.This has stimulated a broad range of reflections and critiques of psychiatric nosology (e.g., (,). Yet, the criticism is typically confined to concrete, technical issues, e.g. Discussing a necessity for novel categories, modification of existing criteria, correcting definitions and misunderstandings, etc. The polythetic-operational foundation of current classification remains largely unchallenged.
Thus, despite a nearly universal consensus about the etiological stalemate, psychiatrists continue to believe that the polythetic system is epistemologically adequate and that it has indeed broadly improved clinical diagnostic reliability.I wish to question the alleged improvement of reliability and to challenge the epistemological adequacy of the polythetic approach. The issue of differential diagnosis will serve as a concrete clinical embodiment of this critique.
Needless to say, a full discussion of the theoretical and clinical ramifications of these topics (e.g., the issue of “comorbidity”) is beyond the scope of this article. IMPROVED RELIABILITYDiagnostic reliability is typically reported as interrater agreement for selected disorders in the so-called “field trials”, accompanying the construction of diagnostic criteria, or in research studies.
The data behind such reports stem from somewhat artificially constructed measurement contexts that formalize what actually happens in the ordinary, everyday clinical practice. Such reports also tend to embellish or even inflate the presented reliability levels.We have no data on the general quality (reliability and concurrent validity) of contemporary clinical diagnostic practices, or data comparing the general utility of successive diagnostic systems. We lack anthropologically oriented research, emphasizing ecological aspects of reliability, i.e. Examining the actual reliability of working psychiatrists as they assign a diagnosis, situated in their daily environment. Instead, we may use other sources of information that, at least indirectly, may shed some light upon these issues.
Several, disparate signs jointly indicate a still existing and serious problem with diagnostic reliability.First, we witness the epidemic-like explosion of certain diagnoses (e.g., autistic spectrum, attention-deficit/hyperactivity disorder). Such dramatic increases may be due to an actual and true rise in the incidence, changes in treatment seeking behavior, or availability of novel and more efficacious treatments. THE POLYTHETIC-OPERATIONAL DIAGNOSTIC SYSTEMA polythetic diagnostic category of current DSM/ICD is based on a list of symptoms and signs believed to be characteristic for the diagnosis in question. Typically, a certain number of diagnostically equivalent symptoms or signs from a given list is sufficient to arrive at a diagnosis. These “diagnostic criteria” are, contrary to a widespread belief, not “operational” in any epistemological or scientific sense. They are just briefly described in an ordinary non-technical lay language at “the lowest order of inference”.Two issues deserve attention here.
First, the symptoms/signs shared by two or several disorders tend to be omitted from the diagnostic lists in order to strengthen the clinical distinctiveness of the categories (e.g., depressed mood and anxiety are exclusively listed in the context of mood and anxiety disorders). Second, the simplification of the psychopathological descriptions to brief, lay language statements converts the symptoms and signs into phenomenological primitives or homogeneous elementals.
There is only one kind of delusion (i.e., it is assumed that all delusions share the same phenomenological structure), one kind of anxiety, one kind of auditory verbal hallucination, etc. Consequently the syndromes, solely constituted by aggregates of such elementals, lose their characteristic salience, and their boundaries become blurred. A recent study using a network model of DSM-IV symptoms demonstrated that half of the symptoms are connected with short paths. The individual disorders are therefore also mutually proximate, accounting for the high levels of empirically observed comorbidity.The narrative, conceptual and phenomenological descriptions of pre-DSM-III psychopathology were eliminated from the contemporary diagnostic manuals. Those descriptions contained a discussion of the characteristic prototypes of mental disorders, their phenomenological structures and the interdependency of their constituent features (e.g., in the manic syndrome, the potential relations between the global “volatility” of the manic gestalt, increased mood, vitality, psychomotor speed, and grandiosity). They also contained a consideration of the phenomenological structure of the individual symptoms and signs, their relations of implication or entailment, and their context dependence. Such information no longer exists in the diagnostic manuals and is largely gone into oblivion.For example, a reader of DSM-IV or DSM-5 is told that schizophrenia is a mixture of positive and negative symptoms that happens to satisfy certain inclusion and exclusion criteria.
This definition says more about what schizophrenia is not (e.g., non-organic, non-affective) than what it is , i.e. What kind of validity is behind this category , what is its characteristic gestalt that constitutes its difference from other potentially similar mixtures of positive and negative symptoms , what justifies schizophrenia’s dominating diagnostic rank in the taxonomic hierarchy, or why it is risky to expose a patient with schizophrenia to an orthodox psychoanalysis. MAKING DIAGNOSIS AND DEFINING CONCEPTS: PROTOTYPES AND GESTALTSThe process of differential diagnosis in the pre-DSM-III era was framed by prototypical considerations. Although such considerations still take place or, more exactly, cannot avoid to take place in any diagnostic situation (including somatic medicine), in psychiatry they only operate on an implicit, un-reflected level, because they are un-anchored and incompatible with the philosophy underlying the polythetic-operational classification.A prototype is a central example of a given category (a sparrow is more typical of the category “bird” than is a penguin or an ostrich), with a graded dilution of typicality towards its borders, where it eventually overlaps with neighboring prototypes.
Thus, the prototypical categories exhibit an intrinsic dimensionality. However, a prototype is not just an example (exemplar), but contains condensed information on its internal configuration of properties and its relations to neighboring prototypes. The concept of prototype/gestalt is fit for description of single symptoms and signs as well as larger entities such as diagnostic categories. One can use the concept of prototype-gestalt in a narrow or a wide sense, neither one limited to perception but also involving complex cognitive-affective operations.In a narrow sense, a gestalt is a unity or organization of phenomenal aspects, that emerges from the interactions among its component features (part-whole relations). The whole is irreducible to a mere aggregate, because it is more than a sum of its parts. In a diagnostic process there are reciprocal dependencies between the whole and its single features.
The clinical whole confers on its constitutive features their characteristic diagnostic significance. Conversely, the single clinical features, by instantiating the gestalt, imbue it with clinical concreteness and rootedness.In a wider sense, the notion of gestalt entails an interplay of factors that extend beyond the subject to include not only a mental state, but also the patient’s engagements with the environment and others. For instance, detecting a delusion involves taking into account not only the patient’s verbal contents but also his experiences, way of arguing, relational style and relevant historical information. To use the concept of delusion competently, a psychiatrist must master plenty of other prototypes and concepts (e.g., psychosis, rationality, reality, hallucination, etc.).The argument for a prototype-based diagnosis is fundamentally linked to the fact that perception is always apperceptively (conceptually) informed: perceiving something is to perceive it as a something, as a token of a certain type. A perceptual or cognitive object is always given as a certain gestalt.
The unfamiliar is perceived in terms of the familiar, i.e. In terms of the general type or gestalt that is “activated in the particular perception”. This process is called typification and is intrinsic to all human cognition and hence to the diagnostic process as well.The natural unfolding of a comprehensive semi-structured prototype-based diagnostic assessment involves reflective and critical questioning of typifications, which become supported, weakened or discarded by explicitly elicited diagnostic information on symptoms, their evolution, social history etc., progressively limiting the number of diagnostic options. Typification as such can never be eliminated because it is an automatic aspect of cognition. A recent review of mechanisms involved in concept formation, use and understanding suggests that concepts (e.g., psychiatric categories) are not constituted by a list of criteria, but are organized around prototypes/gestalts : “Theory of concepts must be primarily prototype-based, within a broader knowledge representation scheme in which the concept is positioned both within a hierarchy and within a theoretical framework(s) appropriate to that domain” , p.
It follows that the more knowledgeable and experienced is the psychiatrist, the more refined is the diagnostic repertoire.Finally, it needs to be emphasized that prototypically defined and described nosological categories may be enriched and supplemented by lists of criteria. This was, in fact, the original but, unfortunately, unrealized intention behind the DSM-III. CLINICAL REALITY OF DIFFERENTIAL DIAGNOSIS IN A POLYTHETIC SYSTEMAll diagnosis is an instance of differential diagnosis: the task is to pick up, from a larger catalogue of potential options, the one that most adequately fits the patient.Let us then imagine a young clinician in an open outpatient facility, trained with the DSM/ICD manuals as her exclusive source of psychopathological knowledge. She encounters a self-referred male in his early 20-ies, sitting on the floor of the waiting room in a lotus position, mumbling, and occasionally laughing to himself in a silly manner.
How should she proceed after the initial greetings?Since her diagnostic-cognitive field lacks a prototypical-conceptual grid, she is exposed to what in cognitive science is known as a “frame problem”, i.e. The issue of how to decide what is relevant, indeed what is even the relevant overall context within which to approach a given problem. Theoretically, she would therefore need to explore the inclusion and exclusion criteria for nearly all disorders (the number will vary with the degree of diagnostic hierarchy).
Alva noto ryuichi sakamoto rarlab. Since 2002 the two musicians have fused their disciplines near-faultlessly, with Sakamoto’s evocative piano-based compositions melting effortlessly into Nicolai’s skillful digital rhythms and cautious manipulations. Each album has brought the duo slightly closer together, and ‘summvs’ (the title an amalgamation of the words summa and versus) sees their work reach critical mass.
That is, obviously, not feasible in practice. Instead, she may imitate a digital computer and use a structured interview. Such interview is essentially constructed as a binary decision tree with mandatory probing questions and suggested cut-off points. The epistemological problems and the quite meager pragmatic utility of structured interviews have been amply addressed (,).
Here, it is important to note that the very nature of structured questioning confers a limited diagnostic utility on the interview, because of the low sensitivity and specificity of the responses. Responding with a “no” or “yes” to the question of “feeling down” neither excludes nor strongly supports any specific diagnosis.Most likely, our clinician will conduct a so-called “clinical interview”, a conversation starting with the patient’s complaints and reasons for seeking help, and assisted by various symptom checklists locally in use. In this process, the patient may be diagnosed with major depression if he answers affirmatively to five or six criteria of this diagnosis.
In other words, for a psychiatrist untrained to impose a conceptual-psychopathological grid on the diagnostic information, the patient’s initial behavior (suggestive of schizophrenia) may easily fail to display a relevant clinical salience and hence fail to enter into the diagnostic considerations.Thus, a young psychiatrist, unfamiliar with the prototypical structure of psychopathology, finds herself exposed to a myriad of chaotic, unconnected data, where each individual feature is equally worthy of attention and may therefore become a pivot of a potential diagnostic class. With growing experience, this clinician will invariably acquire her own private prototypes, shaped by the local ideologies and habits and by personal inclinations, i.e., in an implicit way that is not exposed to an academic, rigorous, and peer-shared reflection. “Private” prototypes become easily activated by single, popping up clinical features that happen to evoke a single aspect of a contingent diagnostic category. Here, a decisive role is often played by the very first verbalizations of the complaint. If a patient mentions a habit of cutting herself, a “borderline” diagnosis will be likely considered. CONCLUSIONSThe distinctions and concepts in the realm of experience and behavior play now, and will continue to play, a decisive role in psychiatric classifications. These distinctions do not function with the simplicity of causal referents, as it is often the case with signs and symptoms of somatic medicine (e.g.
Jaundice → bilirubine cycle). Rather, they exhibit a phenomenological-empirical and theoretical complexity, which cannot be adequately represented through the simplifying, reductive approach of the operational-polythetic system.The differential diagnostic process is not (only) a matter of a digitalized decision tree, but involves context dependencies and complex pattern recognitions. These empirical, phenomenological and theoretical issues constitute the domain of the science of psychopathology.
In recent decades, research, study and training in psychopathology have been seen as largely redundant, because the polythetic manuals seemed to offer all that was needed for research and practice. These assumptions have proven to be false.