Petroleum Science Bulletin, 2018, Vol. 3, Issue (4): 446-451     DOI: 10.3969/j.issn.2096-1693.2018.04.040
Application of semantic similarity calculation in parameter matching of detection data
ZHANG Hewei, JIN Jian, DONG Shaohua, ZHANG Laibin, LI Ning
1 School of Mechanical and Transportation Engineering, China University of Petroleum-Beijing, Beijing 102249, China
2 China Petroleum Pipeline Co., Ltd. West Branch, Urumqi 830000, China

Abstract

Aligning inline inspection datasets helps improve the utilization of inspection data, and researchers in China and abroad have established a preliminary alignment workflow. However, solutions are still lacking for two problems that arise with pipeline big data: the large number of fields that must be matched, and the varied Chinese wording used to describe the same field in different reports. This paper applies Chinese semantic similarity calculation to compute the similarity between each report field and the template fields and thereby determine their degree of match, so that matching fields can be selected from a large number of candidates and inline inspection data from different sources can be aligned. The method improves on an existing calculation based on the Synonym Forest (Tongyici Cilin) and is evaluated on actual fields taken from inline inspection reports. The comparison shows that the improved method can distinguish the different fields in inline inspection reports and is well suited to aligning inline inspection data from multiple sources.

Key words: semantic similarity; inline inspection; data alignment; Synonym Forest; long-distance pipeline
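
The field-matching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' improved method, whose formula is not given on this page. It assumes a toy lexicon (`TOY_CODES`, with made-up five-level codes loosely modeled on the hierarchical coding of the Synonym Forest), scores two words by the fraction of leading code levels they share, and scores two fields by a symmetric average of best word matches. The function names, the English field names, and the 0.85 threshold are all hypothetical. Real Chinese field names would first need word segmentation, which is omitted here.

```python
from typing import Dict, List, Optional, Tuple

Code = Tuple[str, ...]

# Hypothetical toy lexicon: word -> made-up 5-level hierarchical code.
# Words sharing a full code are treated as synonyms.
TOY_CODES: Dict[str, Code] = {
    "defect":    ("D", "a", "05", "A", "01"),
    "anomaly":   ("D", "a", "05", "A", "01"),
    "depth":     ("D", "n", "01", "A", "01"),
    "length":    ("D", "n", "01", "B", "01"),
    "width":     ("D", "n", "01", "B", "02"),
    "thickness": ("D", "n", "01", "C", "01"),
    "wall":      ("B", "o", "03", "A", "01"),
}


def word_similarity(a: str, b: str) -> float:
    """Fraction of leading code levels shared by two words (simplified stand-in)."""
    if a == b:
        return 1.0
    code_a, code_b = TOY_CODES.get(a), TOY_CODES.get(b)
    if code_a is None or code_b is None:
        return 0.0  # unknown words contribute no similarity
    shared = 0
    for x, y in zip(code_a, code_b):
        if x != y:
            break
        shared += 1
    return shared / len(code_a)


def field_similarity(field_a: str, field_b: str) -> float:
    """Symmetric average of each word's best match in the other field."""
    words_a, words_b = field_a.split(), field_b.split()
    if not words_a or not words_b:
        return 0.0

    def directed(src: List[str], dst: List[str]) -> float:
        return sum(max(word_similarity(s, d) for d in dst) for s in src) / len(src)

    return (directed(words_a, words_b) + directed(words_b, words_a)) / 2


def match_fields(
    fields: List[str], templates: List[str], threshold: float = 0.85
) -> Dict[str, Optional[str]]:
    """Map each report field to its best template field, or None below threshold."""
    result: Dict[str, Optional[str]] = {}
    for field in fields:
        best = max(templates, key=lambda t: field_similarity(field, t))
        score = field_similarity(field, best)
        result[field] = best if score >= threshold else None
    return result


if __name__ == "__main__":
    templates = ["defect depth", "defect length", "wall thickness"]
    report_fields = ["anomaly depth", "anomaly length", "weld number"]
    for field, template in match_fields(report_fields, templates).items():
        print(f"{field!r} -> {template!r}")
```

In this sketch a field whose best score falls below the threshold is left unmatched rather than forced onto a template, which is one plausible way to handle report fields that have no counterpart in the template.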
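Received: 2018-04-20
Funding: Jointly funded by the National Key Basic Research and Development Program of China (2017YFC0805800) and the research project of China Petroleum Pipeline Co., Ltd. West Branch, "Research and Application of a Big Data Architecture Model and an Auxiliary Decision Analysis Model for Pipeline Integrity" (XG-JCGL-CX-KJXX-01-JL-03/201604)
Corresponding author: ZHANG Hewei, shdong@cup.edu.cn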
Cite this article:
ZHANG Hewei, JIN Jian, DONG Shaohua, ZHANG Laibin, LI Ning. Application of semantic similarity calculation in parameter matching of detection data. Petroleum Science Bulletin, 2018, 04: 446-451. doi: 10.3969/j.issn.2096-1693.2018.04.040