A platform for research: civil engineering, architecture and urbanism
Improved algorithm for keywords extraction from documents without corpus
In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.
Improved algorithm for keywords extraction from documents without corpus
In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.
Improved algorithm for keywords extraction from documents without corpus
Jing Chen, (author) / Jianfeng Wu, (author)
2009-11-01
63286 byte
Conference paper
Electronic Resource
English
British Library Online Contents | 2004
British Library Online Contents | 2002
Wiley | 2023
|British Library Online Contents | 2002