I have an algorithmic problem which can be reduced to this task:
Suppose we have a list of n
diseases and m
symptoms.
For each disease d
and symptom s
, we have one of three options:
- the symptom is positively correlated with the disease:
s => d
- the symptom is negatively correlated with the disease:
s => ~d
- the symptom is uncorrelated with the disease
The goal of the algorithm is to create a list of yes/no questions regarding symptoms (or even better - a binary tree of questions), which can deduce the exact disease according to the symptoms.
Any references to specific algorithms, relevant software tools and even domain-specific jargon would be very appreciated.