Enormous amounts of data are generated during clinical interactions across multiple-healthcare settings in the form of structured and unstructured EHRs. The data contains rich, longitudinal information on diagnoses, symptoms, medications and tests which can be used for research. However, EHR data is not primarily generated for research purposes; is stored in disparate sources often using different formats and requires a significant amount of pre-processing.
Our Phenotype Library
The UKhas established a,ɳone of thein the world. It is the only national whollyopen-access library of reproducible phenotyping algorithms for defining human disease, lifestylerisk factors and biomarkersusing diverse electronic health records.For each phenotype, the library curatesitsmetadata, implementation details, programmaticcodeand validation information. TheLibraryenables reproducible and transparent research using such complex data by the wider research and clinical community.
Researchers hoping to unlock the valuable data contained with EHRs need to spendconsiderable time creating the coding neededto work with data that often containsinconsistencies and is of varyingqualityanddetail.
The 51 ʳԴdzٲLibrary hasbeen created to assist researchers working with EHRs, by creating an openaccess national library of validated phenotyping algorithms,definitionsandmethods. Routine use of the libraryby researcherswill cut down on the duplication of effort by allowing re-use of algorithms, tools and methodsandwillensure reproducibility of research by creating a national standard for creating,evaluatingand representing phenotypes.
Are you a researcher that has developed a phenotyping algorithmthat:
- defines a disease,risk factor or biomarker,
- derivesinformation from one or more EHR sources,
- is associated with one or more peer-reviewed output and
- isalreadyvalidated?
Youcan contribute to the improvement ofhealth by depositing your algorithms in the Phenotype Libraryenabling their dissemination, re-use, evaluation, and citationto the benefit of the emergingphonemicsresearchcommunity.
The phenome national priority is developing toolswhichwillsupportthedefinitionand creation of computable phenotypes, which can be used to interrogate EHR datato enable healthresearchfor patients benefit.