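Computer science
Pipeline (software)
Usability
Data cleansing
XML
Software
Context (archaeology)
Data mining
Process (computing)
Data science
Task (project management)
Database
Software engineering
World Wide Web
Information retrieval
Data quality
Human-computer interaction
Programming language
Engineering
Paleontology
Metric (unit)
Operations management
Systems engineering
Biology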
Authors
Andreas Pointner, Martin Harrer
Identifier
DOI:10.1109/iceccme57830.2023.10253136
Abstract
Managing the member data of social clubs can be a tedious task. Software solutions are available that help streamline this process. However, this means that existing member data, which is often stored in text-based formats such as CSV, semi-structured formats such as XML, or Excel files, must be imported into those tools. Unfortunately, data in these formats may contain errors, inconsistencies, and missing values, which can compromise its usability. This work presents a rule-based data cleansing pipeline designed to clean, enrich, and transform social club member data into a format suitable for import into such software solutions. The approach is evaluated on a small data sample and shows promising results for this application scenario.
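To make the idea of a rule-based cleansing pipeline more concrete, the following is a minimal Python sketch of how such a pipeline might be structured for CSV input. It is not the authors' implementation: the field names (`name`, `email`, `birthdate`), the individual rules, the accepted date formats, and the function `run_pipeline` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import csv
import io
import re
from datetime import datetime

# All field names and rules below are hypothetical; the abstract does not
# describe the concrete rules used by the authors.

def strip_whitespace(record):
    """Cleaning rule: trim surrounding whitespace, treat missing values as ''."""
    return {k: (v or "").strip() for k, v in record.items() if k is not None}

def normalize_email(record):
    """Cleaning rule: lowercase the e-mail and blank it if it looks invalid."""
    email = record.get("email", "").lower()
    valid = re.fullmatch(r"[^@\s]+@[^@\s]+\.[^@\s]+", email) is not None
    return {**record, "email": email if valid else ""}

def normalize_date(record):
    """Transformation rule: convert a few assumed date formats to ISO 8601."""
    raw = record.get("birthdate", "")
    for fmt in ("%Y-%m-%d", "%d.%m.%Y", "%d/%m/%Y"):
        try:
            iso = datetime.strptime(raw, fmt).date().isoformat()
            return {**record, "birthdate": iso}
        except ValueError:
            continue
    return {**record, "birthdate": ""}  # unparseable or missing date

def split_name(record):
    """Enrichment rule: derive first and last name from the full name."""
    first, _, last = record.get("name", "").partition(" ")
    return {**record, "first_name": first, "last_name": last}

RULES = [strip_whitespace, normalize_email, normalize_date, split_name]

def run_pipeline(csv_text, rules=RULES):
    """Apply every rule, in order, to every record of the CSV text."""
    cleaned = []
    for record in csv.DictReader(io.StringIO(csv_text)):
        for rule in rules:
            record = rule(record)
        cleaned.append(record)
    return cleaned

# Made-up sample data, for illustration only.
SAMPLE = (
    "name,email,birthdate\n"
    " Anna Muster ,Anna.Muster@Example.org,03.04.1990\n"
    "Max Beispiel,not-an-email,1985-12-01\n"
)

if __name__ == "__main__":
    for row in run_pipeline(SAMPLE):
        print(row)
```

Keeping each rule as a small function that maps one record to a new record means rules can be added, removed, or reordered without touching the rest of the pipeline, which is one plausible reading of "rule-based" in this setting.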