Eine Plattform für die Wissenschaft: Bauingenieurwesen, Architektur und Urbanistik
Improved algorithm for keywords extraction from documents without corpus
In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.
Improved algorithm for keywords extraction from documents without corpus
In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.
Improved algorithm for keywords extraction from documents without corpus
Jing Chen, (Autor:in) / Jianfeng Wu, (Autor:in)
01.11.2009
63286 byte
Aufsatz (Konferenz)
Elektronische Ressource
Englisch
British Library Online Contents | 2004
British Library Online Contents | 2002
Wiley | 2023
|British Library Online Contents | 2002