Semantic Indexing of 19th-Century Greek Literature Using 21st-Century Linguistic Resources
Manually classifying works of literature by genre/form concepts is time-consuming and requires domain expertise. Automated systems based on language understanding can help people do this work faster and more consistently. To this end, we present a case study on the automatic classification of 19th-century Greek literary books. The main challenges are the limited number of books and resources from that period and the quality of the source text. We propose an automated classification system based on the Bidirectional Encoder Representations from Transformers (BERT) model, trained on books from the 20th and 21st centuries. To work within BERT’s limit on the maximum sequence length of the input, we use the TextRank algorithm to construct representative sentences or phrases from each book. The results show that BERT trained on recent literature correctly classifies most of the 19th-century books despite the disparity between the two collections. Additionally, the TextRank algorithm improves the performance of BERT.
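The sentence-selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the word-overlap similarity of the original TextRank formulation, a naive punctuation-based sentence splitter, and a word budget (`max_words`) as a rough stand-in for BERT's limit, which is actually counted in subword tokens (512 for standard BERT models). The function names are hypothetical.

```python
import math
import re


def split_sentences(text):
    """Naive splitter (assumption): break after ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Lowercased set of word tokens; \\w also matches Greek letters."""
    return set(re.findall(r"\w+", sentence.lower()))


def similarity(a, b):
    """Word overlap normalised by log sentence lengths."""
    if not a or not b:
        return 0.0
    denom = math.log(len(a)) + math.log(len(b))
    if denom == 0:  # both sentences contain a single word
        return 0.0
    return len(a & b) / denom


def textrank(sentences, damping=0.85, iterations=50):
    """Score sentences with weighted PageRank over the similarity graph."""
    n = len(sentences)
    tokens = [tokenize(s) for s in sentences]
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            weights[i][j] = weights[j][i] = similarity(tokens[i], tokens[j])
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n) if out_sum[j] > 0
            )
            for i in range(n)
        ]
    return scores


def select_sentences(text, max_words=400):
    """Keep the highest-ranked sentences that fit the word budget,
    then restore their original order."""
    sentences = split_sentences(text)
    scores = textrank(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    chosen, used = [], 0
    for i in ranked:
        length = len(sentences[i].split())
        if used + length > max_words:
            continue  # skip sentences that would exceed the budget
        chosen.append(i)
        used += length
    return " ".join(sentences[i] for i in sorted(chosen))
```

In a pipeline of this kind, the text returned by `select_sentences` would be passed to the BERT tokenizer in place of the full book. Because words and subword tokens do not map one to one, truncation at the tokenizer level would still be needed as a safeguard.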
Dimitris Dimitriadis (author) / Sofia Zapounidou (author) / Grigorios Tsoumakas (author)
2021
Article (Journal)
Electronic Resource
Unknown
Metadata by DOAJ is licensed under CC BY-SA 1.0
THE MIDDLE 19TH CENTURY – EARLY 21ST CENTURY FUNCTIONING OF LVIV’S HORONYMS
DOAJ | 2017
Readying a 19th Century structure for the 21st
Online Contents | 1994
Readying a 19th Century structure for the 21st
IuD Bahn | 1994
Cool flames in 19th century English literature
SAGE Publications | 2018