SLEEP-EVAL© RESEARCH

Sleep Epidemiology Research & Sleep-EVALTM Diagnosis Expert System


Stanford Sleep Epidemiology Journal

Stanford Sleep Epidemiology Research Center (SSERC)

Psy-EVAL Research

 

"Not everything that can be counted counts,
and not everything that counts can be counted."
Albert
Einstein

 

Fuzzy Logic In the Sleep-Eval Diagnosis System

First created | 01/12/2006

Last edited   | 05/11/2012

Written by Maurice M. Ohayon, MD, DSc, PhD

Reference to cite: Ohayon MM. Improving decisionmaking processes with the fuzzy logic approach in the epidemiology of sleep disorders. J Psychosom Res 1999 Oct;47(4):297-311

 

Uncertainty is inherent in fields such as sleep medicine and psychiatry and becomes evident in clinical practice at the stages of data collection and diagnostic formulation, when the clinician must determine which symptoms are present and which diagnosis is the good one

 

The process involves a considerable degree of subjectivity on the part of the patient in trying to describe his or her symptoms, and of the clinician whose final diagnosis will depend on his or her clinical experience and interpretation of what is normal and what is pathological.

 

Epidemiological studies can provide information not only on specific diagnostic entities but also on their underlying symptomatic constellations.

 

For this purpose, Sleep-EVAL was developed for the assessment of sleep and psychiatric disorders, endowed with the fuzzy logic capabilities necessary to determine the degree to which a given symptom corresponds to a specific diagnosis.

 

Inferential models of the probabilistic or fuzzy-logic type take into account such uncertainty.

 

Therefore, two incidences of uncertainty are very important:

  • one in data collection and

  • the other in diagnostic decision-making.

The manner in which these problems are tackled has a direct influence on what can be recognized and on how data analyses can be accomplished.

 

Inference models such as probabilistic and fuzzy systems can be used to integrate uncertainty in both symptomatic assessment and diagnostic attribution.

It therefore becomes possible to extend boundaries and attribute a degree of certainty to a diagnosis.

A probabilistic model can be easily computed from an existent binary data set.

A fuzzy model can also be calculated from an existent data set, but the model gains in precision if the data are expressed in categorical terms.

It should be noted that this model is more difficult to compute than the probabilistic model and requires the creation of a computer algorithm to calculate the degree of membership of each symptom involved in a diagnosis.

Nevertheless, this model is of greater interest than the probabilistic and binary models because it allows for a complete integration of the element of uncertainty in the process.

 

The inclusion of uncertainty in data should permit an improvement in classificatory systems such as DSM-IV and ICSD-90.

Indeed, weight and strength of the relationship between criteria within the same diagnosis can be improved, and at least the place of criteria within a certain category can be verified and eventually discarded.

For example, in a syndrome Y defined by the presence of criteria A, B, C, and D, a fuzzy model could show that a low weight on criterion A, accompanied by a strong weight on criteria B and C, and a moderate weight on D are enough to give a certainty of about 80% in the diagnosis.

This provides clues to the clinician that B and C are more relevant in the diagnosis and their presence should make suspect whether the diagnosis is present even if the other criteria have an uncertain presence.

 

This kind of result could validate classification and impart more legitimacy in their use in clinical practice and in pharmaceutical trials.

 

 

Differences between binary, probabilistic and fuzzy models

To reduce the uncertainty inherent in symptomatology assessment, a binary form is usually used to determine the presence or absence of a symptom.

However, this introduces another form of uncertainty especially in borderline cases where we must decide somehow whether a situation is normal or pathological.

 

An alternative is to use probabilistic modeling in order to change these boundaries.

This allows the creation of a natural framework that retains the characteristics of the diagnostic classification and can still be interpreted in the usual way.

Another possibility is to use fuzzy logic, which will afford a degree of certainty to the outcome.

 

The results of an epidemiological survey conducted in Italy will be used to illustrate the application of these methods.

This survey involved 3,970 subjects drawn from the non-institutionalized general population aged 15 to 99 years.

 

An extensive description of the epidemiological methodology can be found elsewhere (10).

 

First example: Insomnia complaint

According to the DSM-IV criteria for insomnia used for the purpose of illustration, an insomnia complaint is defined as difficulty initiating or maintaining sleep or non-restorative sleep, lasting at least one month and accompanied by significant daytime repercussions in important areas of functioning. Description: C:\Copy of myWeb12(Sept2011)\fuzzy_logic_files\image002.gif

 

 

 

In the probabilistic model, the diagnostic outcome (i.e., presence or absence of insomnia), is a function of symptom probabilities, as per Bayes theorem, the sum of which represents the prevalence estimate.

In the fuzzy model, the diagnostic outcome is expressed as a fuzzy set  distributed over seven categories ranging from completely certain of presence to completely certain of absence.

 

 

 

 

The Figure shows the results for binary, probabilistic and fuzzy outcomes.

 

The overall prevalence of insomnia with the binary system was 10.8%, compared with 10.1% with the probabilistic model.

 

With the fuzzy model, 7.8% of the subjects were 100% certain of having insomnia and 1.8% were about an 80% certain, resulting in a prevalence of about 10.6%.

 

It can be seen that the probabilistic model had a leveling effect across age groups.

This could be the result of an unwanted reduction in the variation estimates, most notable when cells are based on few cases. The same occurred under the fuzzy model.

 

Second Example: Obstructive Sleep Apnea Syndrome

The second illustration involves Obstructive Sleep Apnea Syndrome (ICSD-90 classification).

Sleep-EVAL concluded with a complete certainty the presence of Obstructive Sleep Apnea Syndrome in 1.1% of the Italian sample and an almost certain presence in 1.9% (see figure 4 for the distribution by age groups).

 

Description: C:\Copy of myWeb12(Sept2011)\fuzzy_logic_files\image004.gif

Figure: Prevalence of Obstructive Sleep Apnea Syndrome by age groups using a fuzzy reasoning

 

In a study involving 105 patients from two sleep disorders centers (unpublished data), the same decisional tree yielded an almost perfect agreement (96.7% of agreement; kappa of 0.94) between Sleep-EVAL diagnosis (including the case when the system is not totally certain) and the diagnosis of sleep specialists confirmed with polysomnography.

 

Consequently, fuzzy logic reasoning can also offer the possibility of exploring prevalence of disorders in the general population using several levels of confidence.

 

Probabilistic models also offer different levels of confidence.

The main limitation relies on the calculation required to determine the different levels.

This first requires the determination of several levels of probability to have the disease X according different predetermined sets of criteria.

Prevalence estimates are then derived from these different probabilities.

Therefore, one can see that probabilistic models can only be done a posteriori, once all the data is collected.

The application of a fuzzy model does not require knowledge of probabilities.

Therefore, it can be applied a priori.

However, it requires great attention in the creation and application of fuzzy sets since these determine the correspondence between the data and the underlying concept.

 

 

Conclusions

The aim of a classification is to ensure a common language between the clinicians that use it, so that an  entity such as “Psychophysiological Insomnia” refers to a symptomatology understood by any clinician familiar with the classification.

 

Matters, however, are complicated by the existence of multiple classifications, and clinicians can hardly be expected to be familiar with all of them.

The use of structured diagnostic tools is therefore necessary.

The clinician’s clinical experience and theoretical background are major factors impacting on the final diagnosis.

 

Several studies have shown that clinicians do not make optimal use of classifications and often fail to properly document the underlying symptomatology (11,12).

This is further illustrated by the Buysse et al. study in five sleep disorder clinics (13) where no structured interview was used.

As a consequence, kappa coefficients were quite low between the sleep specialists (.30 for all listed diagnoses; .42 for cases of Insomnia Related to Another Mental Disorder) and there was a significant variability of kappa coefficients across the five experimentation sites.

 

The Sleep-EVAL system is designed to assess a variety of sleep disorders in the general population on the basis of two classificatory systems (i.e., DSM-IV and ICSD-90) and performs fuzzy reasoning.

Expert systems such as Sleep-Eval can be used to test classifications by assessing the symptomatic constellation underlying a diagnosis.

 

The use of Sleep-EVAL ensures that the full spectrum of the classification is covered, including rare diagnoses which do not necessarily receive the physician’s immediate attention.

Sleep-EVAL also ensures that at least the minimal criteria for a diagnosis are present and makes it possible to explore the symptomatic constellations of specific diagnoses.

One of the main advantages of such modeling consist in the ability to verify how suitable existing classifications are for general populations.

 

Inference models such as probabilistic and fuzzy systems can be used to integrate uncertainty in both symptomatic assessment and diagnostic attribution.

It therefore becomes possible to extend boundaries and attribute a degree of certainty to a diagnosis.

A probabilistic model can be easily computed from an existent binary data set.

 

A fuzzy model can also be calculated from an existent data set, but the model gains in precision if the data are expressed in categorical terms.

It should be noted that this model is more difficult to compute than the probabilistic model and requires the creation of a computer algorithm to calculate the degree of membership of each symptom involved in a diagnosis.

Nevertheless, this model is of greater interest than the probabilistic and binary models because it allows for a complete integration of the element of uncertainty in the process.

 

The inclusion of uncertainty in data should permit an improvement in classificatory systems such as DSM-IV and ICSD-90.

Indeed, weight and strength of the relationship between criteria within the same diagnosis can be improved, and at least the place of criteria within a certain category can be verified and eventually discarded.

This kind of result could validate classification and impart more legitimacy in their use in clinical practice and in pharmaceutical trials.

 

Content of this page is extracted from:

Ohayon MM. Improving decisionmaking processes with the fuzzy logic approach in the epidemiology of sleep disorders. J Psychosom Res 1999 Oct;47(4):297-311

 

More Information   

 

Sleep-EVAL Aims

When the first epidemiological survey was launched in 1992, we were looking for an assessment tool that could be used by interviewers with little knowledge about sleep disorders.

 

Knowledge Base
The inference engine uses its knowledge base to pose questions, to infer hypotheses and to deduce diagnostic conclusion

 

Inference Engine
Sleep-EVAL is a non-monotonic, level-2 expert system endowed with the ability to make logical connections based on patient information (causal reasoning mode).

 

Fuzzy Logic
Inference models such as probabilistic and fuzzy systems can be used to integrate uncertainty in both symptomatic assessment and diagnostic attribution. It therefore becomes possible to extend boundaries and attribute a degree of certainty to a diagnosis.
A probabilistic model can be easily computed from an existent binary data set.
A fuzzy model can also be calculated from an existent data set, but the model is obviously much more precise when the data are expressed in categorical terms.