语义压缩
计算机科学
语义相似性
无损压缩
语义计算
语义整合
语义网格
熵编码
情报检索
自然语言处理
显式语义分析
语义对等
人工智能
理论计算机科学
语义技术
数据压缩
语义网
作者
Prithwish Basu,Jie Bao,Mike Dean,James A. Hendler
标识
DOI:10.1109/percomw.2012.6197583
摘要
We show how semantic relationships that exist within an information-rich source can be exploited for achieving parsimonious communication between a pair of semantically-aware nodes that preserves quality of information. We extend the source coding theorem of classical information theory to encompass semantics in the source and show that by utilizing semantic relations between source symbols, higher rate of lossless compression may be achieved compared to traditional syntactic compression methods. We define the capacity of a semantic source as the mutual information between its models and syntactic messages, and show that it equals the average semantic entropy of its messages. We further show the duality of semantic redundancy and semantic ambiguity in compressing semantic data, and establish the semantic capacity of a source as the lower bound on semantic compression. Finally, we give a practical semantic compression algorithm that exploits the graph structure of a shared knowledge base to facilitate semantic communication between a pair of nodes.
科研通智能强力驱动
Strongly Powered by AbleSci AI