Our intuition is that coupled classifiers of highly correlated pairs of diseases should have better performance than independent classifiers for the two diseases. To test this, we focused on textual judgments and only on the classes 'Y' and 'U' ('Q' and 'N' were converted to 'U'). We coded 'Y' as 1 and 'U' as 0, and we computed the correlation between every pair of diseases. Figure 3 show different pairwise distances between the diseases.
![]() ![]() ![]() |
We also computed the matrix of conditional probabilities for
the 'Y' class of couples of diseases. A detail analysis of this matrix showed us interesting cases,
for example, Obesity is very likely to happen if the patient has OSA
(
), and pretty unlikely to happen if the patient has PVD (
).
With this example in mind, we thought that a good test case could be to include in the text of the records the new tokens HAS_DISEASEX for every DISEASEX that the patient has, except for Obesity, and see the results of training the classifier for the Obesity class on this new record set. So, for example, if a patient has OSA, CHF and Obesity, the tokens: HAS_OSA and HAS_CHF are added to the text of the record. And with these enriched records we estimate whether the patient has Obesity or not.
However, the outcome of the classifier for Obesity does not show any improvement (exactly the same accuracy) when using the enriched records. Features like HAS_OSA, which we thought should clearly improve the classification for the 'Y' class for Obesity are not pretty much taken into account by the classifier.
We were concerned about these results. And to test whether the whole
idea makes sense, we did another run also including the token
HAS_OBESITY in the records of the patients who actually have obesity
(i.e. we included the correct answer as part of the training data of
the classifier). Not surprisingly, in this case the classifier
achieves a accuracy (not reaching the
because there are probably
errors for the classes 'Q' and 'N'), and the feature HAS_OBESITY
obviously appears as the most informative for the classifier.
After looking at the numbers with more detail we decided to include also the answers (Y, N, Q, U) as new tokens: Y_DISEASE, U_DISEASE, Q_DISEASE, N_DISEASE for all the records and for all the diseases except for one, and see the results of the classifier for the excluded disease.
After carrying out that experiments we concluded that adding the answers for other diseases doesn't help the classification of a cercain disease. For example, in the textual case, it only helps for CAD and Hypercholesterolemia, but the improvement is very low: under 0.02 in averaged accuracy. Similar results are obtained for intuitive judgments. All the results are shown in tables 3 and 4.
|
Carlos 2008-10-16