论文部分内容阅读
Contextual advertising is a major revenue source for todays companies.Keyword extraction is a key step in this kind of advertising,through which appropriate advertising keywords are extracted from Web pages so that corresponding ads can be triggered.This paper describes a system that learns how to extract keywords from web pages for advertisement targeting.Firstly a text network for a single webpage is build,then PageRank is applied in the network to decide on the importance of a word,finally top-ranked words are selected as keywords of the webpage.The algorithm is tested on the corpus of blog pages,and the experimental results prove practical and effective.