Improved algorithm for keywords extraction from documents without corpus

Jing Chen / Jianfeng Wu

This paper describes an algorithm for extracting keywords from a document without a reference corpus. The algorithm uses word co-occurrence information and the bias of the co-occurrence distribution to identify the more important words, measured against the most frequently appearing words, which are called reference words. First, the most frequent terms are selected from the document. Then, because keywords have a non-linear relationship with the set of frequent terms, the bias between each word in the document and the reference terms is measured. Finally, we show that the algorithm is effective.
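The two steps named in the abstract (choose reference words, then measure each word's distribution bias against them) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not say which statistic measures the bias, so a chi-square style score over sentence-level co-occurrence is assumed here, and the function names `split_sentences`, `reference_words`, and `bias_scores` are hypothetical.

```python
import re
from collections import Counter, defaultdict


def split_sentences(text):
    """Naive tokenizer (assumption): split on . ! ? and keep lowercase a-z words."""
    parts = re.split(r"[.!?]+", text)
    sentences = [re.findall(r"[a-z]+", p.lower()) for p in parts]
    return [s for s in sentences if s]


def reference_words(sentences, n):
    """Step 1: take the n most frequent terms in the document as reference words."""
    freq = Counter(w for s in sentences for w in s)
    return [w for w, _ in freq.most_common(n)]


def bias_scores(sentences, reference):
    """Step 2: score each word by how far its co-occurrence with the reference
    words deviates from what the overall co-occurrence shares would predict.
    The chi-square form is an assumption, not taken from the abstract."""
    cooc = defaultdict(Counter)  # cooc[w][g] = sentences containing both w and g
    for s in sentences:
        uniq = set(s)
        for w in uniq:
            for g in reference:
                if g in uniq and g != w:
                    cooc[w][g] += 1

    column = Counter()  # total co-occurrences per reference word
    for counts in cooc.values():
        column.update(counts)
    total = sum(column.values())

    scores = {}
    for w, counts in cooc.items():
        n_w = sum(counts.values())
        score = 0.0
        for g in reference:
            if g == w:
                continue
            expected = n_w * column[g] / total
            if expected > 0:
                score += (counts[g] - expected) ** 2 / expected
        scores[w] = score
    return scores


def extract_keywords(text, num_reference=10, top_k=5):
    """Rank words by bias score, highest first; ties broken alphabetically."""
    sentences = split_sentences(text)
    scores = bias_scores(sentences, reference_words(sentences, num_reference))
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
```

A call such as `extract_keywords(document_text, num_reference=10, top_k=5)` would return up to five candidate keywords. A practical version would also need stop-word removal and stemming, which this sketch leaves out.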

Document information

Title:

Improved algorithm for keywords extraction from documents without corpus

Contributors:

Jing Chen (author) / Jianfeng Wu (author)

Published in:

2009 IEEE 10th International Conference on Computer-Aided Industrial Design & Conceptual Design; 2339-2341

Publication date:

1 November 2009

Format / extent:

63286 bytes

ISBN:

978-1-4244-5268-2, 978-1-4244-5266-8

DOI:

https://doi.org/10.1109/CAIDCD.2009.5375325

Media type:

Conference paper

Format:

Electronic resource

Language:

English

Similar titles

An index of IRG documents, 1969 - 1990 / R. Cockcroft ; Pt. 2: Keywords

Cockcroft, R. | TIBKAT | 1990

British Library Online Contents | 2004

British Library Online Contents | 2002

Weller, Bernhard; Tasche, Silke | Wiley | 2023

British Library Online Contents | 2002