Evaluation of Relational and NoSQL Approaches for Cohort Identification from Heterogeneous Data Sources in the National Sleep Research Resource

Ningzhou Zeng; Guo-Qiang Zhang; Xiaojin Li; Licong Cui

doi:10.4172/2157-7420.1000295

Evaluation of Relational and NoSQL Approaches for Cohort Identification from Heterogeneous Data Sources in the National Sleep Research Resource

Abstract

Ningzhou Zeng, Guo-Qiang Zhang, Xiaojin Li and Licong Cui

Patient cohort identification across heterogeneous data sources is a challenging task, which may involve a complicated process of data loading, harmonization and querying. Most existing cohort identification tools use a relational database model implemented in SQL for storing patient data. However, SQL databases have restrictions on the maximum number of columns in a table, which necessitates the breaking down of high dimensional data into multiple tables and as a consequence affects query performance. In this paper, we developed two NoSQL-based patient cohort query systems based on an existing SQL-based system for the cross-cohort query in the National Sleep Resource Research (NSRR). We used eight NSRR datasets in our experiment to evaluate the performance of the NoSQLbased and SQL-based systems in data loading, harmonization and query. Our experiment showed that NoSQL-based approaches outperformed the SQL-based and are rather promising for developing patient cohort query systems across heterogeneous data sources.

PDF

Share this article

Awards & Nominations

50+ Million Readerbase

Journal Highlights

Google Scholar citation report

Citations: 2700

Journal of Health & Medical Informatics received 2700 citations as per Google Scholar report

Journal of Health & Medical Informatics peer review process verified at publons

Indexed In

Index Copernicus
Google Scholar
Sherpa Romeo
Open J Gate
Genamics JournalSeek
Academic Keys
JournalTOCs
ResearchBible
Access to Global Online Research in Agriculture (AGORA)
Electronic Journals Library
RefSeek
Hamdard University
EBSCO A-Z
OCLC- WorldCat
Proquest Summons
Scholarsteer
SWB online catalog
Virtual Library of Biology (vifabio)
Publons
Geneva Foundation for Medical Education and Research
Euro Pub

Journal of Health & Medical Informatics

Evaluation of Relational and NoSQL Approaches for Cohort Identification from Heterogeneous Data Sources in the National Sleep Research Resource

Abstract

Awards & Nominations

50+ Million Readerbase

Journal Highlights

Google Scholar citation report

Citations: 2700

Journal of Health & Medical Informatics peer review process verified at publons

Indexed In

Related Links

Open Access Journals