Using Wikipedia to develop language resources: WordNet 3.0 in Catalan and Spanish

Antoni Oliver, Salvador Climent


We describe the state of the art in the use of Wikipedia for natural language processing tasks and also describe three applications of our own that enrich a powerful language resource: WordNet version 3.0 in Catalan and Spanish. Researchers have for many years sought applications that would take account of world knowledge in a more or less structured way, as this kind of knowledge has proven to be crucial to satisfactorily solving certain language processing tasks. Wikipedia may be the answer to the provision of this kind of information, as it is constantly updated and access is free.  


Wikipedia; WordNet; Natural Language Processing; linguistic resources



  • There are currently no refbacks.



Digithum is an e-journal promoted by the Arts and Humanities Department of the UOC

Creative Commons License

The texts published in this journal are – unless indicated otherwise – covered by the Creative Commons Spain Attribution 3.0 licence. You may copy, distribute, transmit and adapt the work, provided you attribute it (authorship, journal name, publisher) in the manner specified by the author(s) or licensor(s). The full text of the licence can be consulted here: