能见度
计算机科学
万维网
鉴定(生物学)
软件
资源(消歧)
搜索引擎索引
互联网
数据科学
钥匙(锁)
选择(遗传算法)
网络资源
情报检索
人工智能
生物
光学
物理
植物
计算机安全
程序设计语言
计算机网络
作者
Gael Pérez‐Rodríguez,Martín Pérez‐Pérez,Florentino Fdez‐Riverola,Anália Lourenço
标识
DOI:10.1016/j.ipm.2018.11.011
摘要
Internet, in general, and the WWW, in particular, have become an immediate, practical means of introducing software tools and resources, and most importantly, a key vehicle to attract the attention of the potential users. In this scenario, content organization as well as different development practices may affect the online visibility of the target resource. Therefore, the careful selection, organization and presentation of contents are critical to guarantee that the main features of the target tool can be easily discovered by potential visitors, while ensuring a proper indexation by automatic online systems and resource recognizers. Understanding how software is depicted in scientific manuscripts and comparing these texts with the corresponding online descriptions can help to improve the visibility of the target website. It is particularly relevant to be able to align online descriptions and those found in literature, and use the resulting knowledge to improve software indexing and grouping. Therefore, this paper presents a novel method for formally defining and mining software-related websites and related literature with the ultimate aim of improving the global online visibility of the software. As a proof of concept, the method was used to evaluate the online visibility of biomedical text mining tools. These tools have evolved considerably in the last decades, and are gathering together a heterogeneous development community as well as various user groups. For the most part, these tools are not easily discovered via general search engines. Hence, the proposed method enabled the identification of specific issues regarding the visibility of these online contents and the discussion of some possible improvements.
科研通智能强力驱动
Strongly Powered by AbleSci AI