Multimodal Technologies and the Dynamics of Unsupervised Keyphrase Extraction

Alessandra Morandi and Giulia Bianchi

Vol. 13 No. 5 (2024), Original Research Articles

Vol. 13 No. 5 (2024)

Multimodal Technologies and the Dynamics of Unsupervised Keyphrase Extraction

Original Research Articles

Published 2024-05-20

Alessandra Morandi and Giulia Bianchi⁺⁻

Alessandra Morandi and Giulia Bianchi

Department of Information Engineering and Computer Science, University of Trento, Via Sommarive 14, 38123 Povo (TN), Italy

PDF

Keywords

unsupervised automatic keyword extraction
clustering algorithms
word embedding models
Italian datasets
information retrieval
evaluation
word2vec
GloVe

How to Cite

[1]

A. M. and G. Bianchi, “Multimodal Technologies and the Dynamics of Unsupervised Keyphrase Extraction”, J. Comput. Eng., vol. 13, no. 5, May 2024, Accessed: Apr. 13, 2026. [Online]. Available: https://journalofcomputerengineering.com/index.php/jce/article/view/1716

Abstract

Increasingly, the web produces massive volumes of texts, alone or associated with images, videos, photographs, together with some metadata, indispensable for their finding and retrieval. Keywords/keyphrases that characterize the semantic content of documents should be, automatically or manually, extracted, and/or associated with them. The paper presents a novel method to address the problem of the automatic unsupervised extraction of keywords/phrases from texts, expressed both in English and in Italian. The main feature of this approach is the integration of two methods that have given interesting results: word embedding models, such as Word2Vec or GloVe able to capture the semantics of words and their context, and clustering algorithms, able to identify the essence of the terms and choose the more significant one(s), to represent the contents of a text. In the paper, the datasets used are presented, together with the method implemented and the results obtained. These results will be discussed, commented, and compared with those obtained in previous experimentations, using TextRank, Rapid Automatic Keyword Extraction (RAKE), and TF-IDF

PDF

This work is licensed under a Creative Commons Attribution 4.0 International License.

Multimodal Technologies and the Dynamics of Unsupervised Keyphrase Extraction

Keywords

Categories

How to Cite

Abstract

Similar Articles

Multimodal Technologies and the Dynamics of Unsupervised Keyphrase Extraction

Keywords

Categories

How to Cite

Download Citation

Abstract

Similar Articles