平民
数据共享
数据科学
云计算
软件
数据发现
协调
数据管理
计算机科学
数据库
万维网
元数据
生物
生态学
程序设计语言
替代医学
病理
物理
操作系统
医学
声学
标识
DOI:10.1016/j.tig.2018.12.006
摘要
Data commons collate data with cloud computing infrastructure and commonly used software services, tools, and applications to create biomedical resources for the large-scale management, analysis, harmonization, and sharing of biomedical data. Over the past few years, data commons have been used to analyze, harmonize, and share large-scale genomics datasets. Data ecosystems can be built by interoperating multiple data commons. It can be quite labor intensive to curate, import, and analyze the data in a data commons. Data lakes provide an alternative to data commons and simply provide access to data, with the data curation and analysis deferred until later and delegated to those that access the data. We review software platforms for managing, analyzing, and sharing genomic data, with an emphasis on data commons, but also cover data ecosystems and data lakes.
科研通智能强力驱动
Strongly Powered by AbleSci AI