计算机科学
遮罩(插图)
命名实体识别
人工智能
学习迁移
脱敏(药物)
领域(数学)
标记数据
稳健性(进化)
模式识别(心理学)
机器学习
语音识别
数据挖掘
工程类
艺术
视觉艺术
受体
化学
系统工程
纯数学
基因
生物化学
数学
任务(项目管理)
作者
Yumeng Yang,Miaoqiong Wang,Yu Rong,Jingyang Fang,Guo Mei-ying,Kairu Lei,Lü Li
标识
DOI:10.1109/iucc-cit-dsci-smartcns55181.2021.00083
摘要
To address the deficiencies of traditional data masking techniques, this paper focuses on the identification of unstructured sensitive data, and constructs an adaptive data masking-named entity recognition model (Adm-NER). Based on the Bi-LSTM-CRF model, Adm-NER applies adversarial transfer learning to the field of data desensitization and introduces self-attention mechanism, which can effectively identify sensitive data in the lack of sample fields. The results of five comparative experiments show that Adm-NER has significantly improved the accuracy of identifying sensitive data. In addition, the transfer learning experiment proves that Adm-NER can adaptively learn common features by using large-scale labeled samples to achieve accurate positioning and recognition of sensitive data in the lack of sample field, which is conducive to subsequent data desensitization. Adm-NER provides a new idea for the intelligent design of big data masking systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI