Figure 1 - uploaded by Renato Fernandes Correa
Content may be subject to copyright.
Steps executed by a Hybrid Document Organization System.

Steps executed by a Hybrid Document Organization System.

Source publication
Conference Paper
Full-text available
This paper presents and evaluates a new version of the Semantic Mapping method applied to the construction of a hybrid document organization system based on Self-Organizing Maps. The hybrid system uses reduced document vectors generated by Semantic Mapping to training the SOM map, thus reducing the training time without compromising the quality of...

Context in source publication

Context 1
... this work, the hybrid SOM-based document organization systems perform the following four steps (see Figure 1): document indexing, dimensionality reduction, construction of document map and construction of user interface. The document indexing step comprises preprocess the text documents and represent them statistically. ...

Citations

... Finally, from 2006 becomes the consolidation of the different research lines defined in the previous stages by methods' comparison aiming to obtain systems that generate document maps of good quality at a lower computational cost, as well as re-investigate how to construct an IRS based on document maps to specific domains. For instance, in the first research line the works [ [17] discuss and compare SOM-based hybrid systems to text document organization. Beside these works, others like [18] and [19] propose different documents' content representation by codifying content and structure at levels as document, pages, paragraphs and sentences. ...
Article
Full-text available
This paper proposes the application of Self-Organizing Maps (SOM) in the construction of an Information Retrieval System (IRS) to the Digital Library of Theses and Dissertations at Federal University of Pernambuco (BDTD-UFPE). The hypothesis is that the trained SOM and its graphical representation can help the user to obtain a general view of topics discussed in the document collection and also to perceive the similarity among documents, topics and graduate programs' knowledge areas and subjects. We present the system architecture and implementation's issues and evaluate the hypothesis through experimentation. The unsupervised organization performance of the proposed system was measured in terms of text categorization effectiveness and visual inspection of categories' distribution over graphics about trained SOM. The experimental results show that the proposed system generates relevant document map and confirms the hypothesis.