I have an algorithmic problem which can be reduced to this task:
Suppose we have a list of n diseases and m symptoms.
For each disease d and symptom s, we have one of three options:
- the symptom is positively correlated with the disease: s => d
- the symptom is negatively correlated with the disease: s => ~d
- the symptom is uncorrelated with the disease
The goal of the algorithm is to produce a list of yes/no questions about symptoms (or, even better, a binary tree of questions) from which the exact disease can be deduced from the answers.
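To make the task concrete, here is a rough sketch of how I picture the input and a naive greedy way to build such a tree. Everything in it is my own assumption rather than a known method: the `+1`/`-1`/`0` encoding, the names `split` and `build_tree`, the toy table, and in particular the reading that a "yes" answer rules out negatively correlated diseases while a "no" answer rules out positively correlated ones (i.e. the correlations are treated as deterministic). At each node it simply asks about the symptom whose larger branch is smallest.

```python
from typing import Dict, List, Tuple, Union

# Hypothetical encoding (my assumption): table[disease][symptom] is
#   +1 = positively correlated, -1 = negatively correlated, 0 = uncorrelated.
Table = Dict[str, Dict[str, int]]
Tree = Union[List[str], Dict[str, object]]  # leaf = list of remaining diseases


def split(table: Table, candidates: List[str], symptom: str) -> Tuple[List[str], List[str]]:
    """Return the diseases still possible after a 'yes' and after a 'no' answer.

    Assumption: 'yes' excludes diseases marked -1, 'no' excludes diseases
    marked +1, and uncorrelated diseases survive both answers.
    """
    yes = [d for d in candidates if table[d][symptom] != -1]
    no = [d for d in candidates if table[d][symptom] != +1]
    return yes, no


def build_tree(table: Table, candidates: List[str], symptoms: List[str]) -> Tree:
    """Greedily build a question tree; not guaranteed to be minimal."""
    if len(candidates) <= 1 or not symptoms:
        return sorted(candidates)

    def branch_sizes(s: str) -> List[int]:
        yes, no = split(table, candidates, s)
        return sorted((len(yes), len(no)), reverse=True)

    # Ask about the symptom with the smallest worst-case branch
    # (ties broken by the size of the other branch).
    best = min(symptoms, key=branch_sizes)
    yes, no = split(table, candidates, best)

    # If even the best question excludes nothing, no remaining question helps.
    if len(yes) == len(candidates) and len(no) == len(candidates):
        return sorted(candidates)

    remaining = [s for s in symptoms if s != best]
    return {
        "ask": best,
        "yes": build_tree(table, yes, remaining),
        "no": build_tree(table, no, remaining),
    }


# Toy data, invented purely for illustration.
example: Table = {
    "flu": {"fever": +1, "rash": -1},
    "measles": {"fever": +1, "rash": +1},
    "allergy": {"fever": -1, "rash": 0},
}

tree = build_tree(example, list(example), ["fever", "rash"])
print(tree)
```

A leaf with more than one disease means the available symptoms cannot tell those diseases apart under these assumptions; an empty leaf means that sequence of answers contradicts the table.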
Any references to specific algorithms, relevant software tools, or even domain-specific jargon would be much appreciated.