Improved algorithm for keywords extraction from documents without corpus: Fid-Bau Portal

Fachinformationsdienst BAUdigital

A platform for research: civil engineering, architecture and urbanism

Improved algorithm for keywords extraction from documents without corpus

Jing Chen, / Jianfeng Wu,

In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.

Access

Check availability in my library

Order at Subito €

Page navigation

Document information

Export, share and cite

Improved algorithm for keywords extraction from documents without corpus

Jing Chen, / Jianfeng Wu,

In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.

Improved algorithm for keywords extraction from documents without corpus

Jing Chen, / Jianfeng Wu,

In this paper, an algorithm for extracting keywords without corpus is described. We use the co-occurrence information of the words and the biases of distribution to extract the more important words based on the most frequently appearing words so called reference words. Firstly, the most frequently terms are chosen from the document. Then due to keywords have a non-linear relationship with the set of frequently terms, the bias between words in documents and reference terms is measured. At last we prove that the algorithm is effective.

Access

Check availability in my library

Order at Subito €

Page navigation

Document information

Export, share and cite

Document information

Title:

Improved algorithm for keywords extraction from documents without corpus

Contributors:

Jing Chen, (author) / Jianfeng Wu, (author)

Published in:

2009 IEEE 10th International Conference on Computer-Aided Industrial Design & Conceptual Design ; 2339-2341

Publication date:

2009-11-01

Size:

63286 byte

ISBN:

978-1-4244-5268-2 , 978-1-4244-5266-8

DOI:

https://doi.org/10.1109/CAIDCD.2009.5375325

Type of media:

Conference paper

Type of material:

Electronic Resource

Language:

English

Similar titles

An index of IRG documents, 1969 - 1990 / R. Cockcroft ; Pt. 2: Keywords

Cockcroft, R. | TIBKAT | 1990

British Library Online Contents | 2004

British Library Online Contents | 2002

Weller, Bernhard ;Tasche, Silke | Wiley | 2023

British Library Online Contents | 2002