Information management in global environments: swarm intelligence in multilingual economic document repositories
Keywords:
clustering, ant-based algorithms, multilingual documents, text mining
Abstract
Information is a strategic resource of the first order for organizations, so they need methodologies and tools to manage it properly and extract knowledge from it. Organizations also need strategies for generating knowledge from unstructured textual information that comes from different sources and is written in different languages. This paper presents two bio-inspired approaches to clustering multilingual document collections in a particular field (economics and business). Solving this problem is important for organizing the huge volume of information managed within organizations in a global context characterized by the intensive use of Information and Communication Technologies. The proposed clustering algorithms take inspiration from the behaviour of real ant colonies and can be applied to identify groups of related multilingual documents in the field of economics and business. Several linguistic resources and tools are used to obtain a language-independent vector representation. The performance of the algorithms is analysed using a corpus of 250 documents in Spanish and English from different functional areas of the enterprise, and experimental results are presented. The results demonstrate the usefulness and effectiveness of the algorithms as clustering techniques.
Published
2013-05-10
How to Cite
Cobo, A., Rocha, R., & Vanti, A. A. (2013). Information management in global environments: swarm intelligence in multilingual economic document repositories. Informação & Sociedade, 23(1). Retrieved from https://periodicos.ufpb.br/ojs2/index.php/ies/article/view/15128
Section
Review Articles
License
Originals that are accepted and published become the property of INFORMAÇÃO & SOCIEDADE. Their total or partial reproduction is prohibited without due authorization from the Editorial Board, except for study and research purposes.