术语
自然语言处理
计算机科学
中医药
人工智能
情报检索
语言学
医学
替代医学
哲学
病理
作者
Liang Hao Liang Hao,Wu Jiaze Wu Jiaze,Peng Qinghua Peng Qinghua,Duan Lunhui Duan Lunhui
标识
DOI:10.11922/sciencedb.j00001.00213
摘要
The dataset is based on an integration of the English terminology of Chinese medicine (internal draft) developed by the people's Health Publishing House (PMPH), the WHO International Standard Terminologies on Traditional Medicine in the Western Pacific Region formulated by the World Health Organization (WHO) and the International Standard Chinese-English Basic Nomenclature of Chinese Medicine formulated by the World Federation of Chinese Medicine Associations (WFCMS), which aims to promote the standardization of Chinese Medicine terms and international communication of TCM. Through Python pandas package and OCR technology, the dataset is cleaned, sorted and merged. Finally, it is divided into 56 categories. A total of 16189 records are sorted out and merged to 8975 terms. The dataset promotes the standardization of TCM terms, facilitates academic communications and inheritance and development of TCM, and is convenient for informatization construction of TCM.
科研通智能强力驱动
Strongly Powered by AbleSci AI