Using Wikipedia to develop language resources: WordNet 3.0 in Catalan and Spanish

Antoni Oliver, Salvador Climent

Abstract


We describe the state of the art in the use of Wikipedia for natural language processing tasks and also describe three applications of our own that enrich a powerful language resource: WordNet version 3.0 in Catalan and Spanish. Researchers have for many years sought applications that would take account of world knowledge in a more or less structured way, as this kind of knowledge has proven to be crucial to satisfactorily solving certain language processing tasks. Wikipedia may be the answer to the provision of this kind of information, as it is constantly updated and access is free.  


Keywords


Wikipedia; WordNet; Natural Language Processing; linguistic resources



DOI: http://dx.doi.org/10.7238/d.v0i14.1474

Refbacks

  • There are currently no refbacks.


Universitat Oberta de Catalunya

 

 

 

 

Universidad de Antioquia







Digithum is an e-journal coedited by the Arts and Humanities Department of the UOC and the Faculty of Humanities and Social Sciences of the University of Antioquia, Colombia.

Creative Commons License

The texts published in this journal are – unless indicated otherwise – covered by the Creative Commons Spain Attribution 3.0 licence. You may copy, distribute, transmit and adapt the work, provided you attribute it (authorship, journal name, publisher) in the manner specified by the author(s) or licensor(s). The full text of the licence can be consulted here: http://creativecommons.org/licenses/by/3.0/es/deed.en.