Assessing the quality of annotations in asthma gene expression experiments
- Author(s): Lacson, Ronilda;
- Mbagwu, Michael;
- Yousif, Hisham;
- Ohno-Machado, Lucila
- et al.
Published Web Locationhttp://dx.doi.org/10.1186/1471-2105-11-S9-S8
Abstract Background The amount of data deposited in the Gene Expression Omnibus (GEO) has expanded significantly. It is important to ensure that these data are properly annotated with clinical data and descriptions of experimental conditions so that they can be useful for future analysis. This study assesses the adequacy of documented asthma markers in GEO. Three objective measures (coverage, consistency and association) were used for evaluation of annotations contained in 17 asthma studies. Results There were 918 asthma samples with 20,640 annotated markers. Of these markers, only 10,419 had documented values (50% coverage). In one study carefully examined for consistency, there were discrepancies in drug name usage, with brand name and generic name used in different sections to refer to the same drug. Annotated markers showed adequate association with other relevant variables (i.e. the use of medication only when its corresponding disease state was present). Conclusions There is inadequate variable coverage within GEO and usage of terms lacks consistency. Association between relevant variables, however, was adequate.