基于查询日志的语义关系发现算法
首发时间:2009-07-13
摘要:提出了基于查询日志的语义关系发现算法,通过对搜索引擎查询日志分析,构建查询词与URL的双向图,使用图距离结合同义相似度计算查询词的语义相似度,同时采用基于路径的高效图挖掘算法实现查询词的语义关系发现。采用sogou近2000万条记录用户查询点击记录的日志实验表明,该算法较子串扩展算法与日志挖掘算法有更好的准确率与查全率。
关键词: 查询日志 词语相似度 基于路径的图挖掘 语义关系发现
For information in English, please click here
Discovering Semantic Relations from Query Logs
Abstract:A discovering semantic relations algorithm from Query Logs is presented. First construct a weighted Query-URL bipartite graph from query log data. Then compute the semantic similarity of queries by distance of queries nodes and synonymy similarity and use a effective mining algorithm to discovering semantic related queries based on graph path. Experiments show that the algorithm is more effective than substring extending algorithm and log mining algorithm in recall and precision.
Keywords: query logs semantic similarity mining algorithm based on graph path semantic relations discovering.
论文图表:
引用
No.3379548457612474****
同行评议
共计0人参与
勘误表
基于查询日志的语义关系发现算法
评论
全部评论0/1000