计算机科学
元数据
背景(考古学)
数据科学
计算模型
生物学数据
大数据
标识符
过程(计算)
软件
编码(集合论)
任务(项目管理)
万维网
数据挖掘
生物信息学
人工智能
程序设计语言
经济
生物
古生物学
管理
集合(抽象数据类型)
作者
Yo Yehudi,Lukas Hughes-Noehrer,Carole Goble,Caroline Jay
标识
DOI:10.1038/s41597-023-02627-9
摘要
Biological science produces "big data" in varied formats, which necessitates using computational tools to process, integrate, and analyse data. Researchers using computational biology tools range from those using computers for communication, to those writing analysis code. We examine differences in how researchers conceptualise the same data, which we call "subjective data models". We interviewed 22 people with biological experience and varied levels of computational experience, and found that many had fluid subjective data models that changed depending on circumstance. Surprisingly, results did not cluster around participants' computational experience levels. People did not consistently map entities from abstract data models to the real-world entities in files, and certain data identifier formats were easier to infer meaning from than others. Real-world implications: 1) software engineers should design interfaces for task performance, emulating popular user interfaces, rather than targeting professional backgrounds; 2) when insufficient context is provided, people may guess what data means, whether or not they are correct, emphasising the importance of contextual metadata to remove the need for erroneous guesswork.
科研通智能强力驱动
Strongly Powered by AbleSci AI