WordNet is an online lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.
Following the success of the English WordNet, similar networks have been developed for a variety of languages. In particular, researchers at ITC-IRST in Trento, Italy have developed a methodology for parallel development of multilingual WordNets. The system, called MultiWordNet, contains information on several aspects of multilingual dictionaries, including lexical relationships between words, semantic relations over lexical concepts, several mappings of lexical concepts in different languages, etc.
MultiWordNet has a variety of applications, including:
- Information retrieval: lexical relations can significantly improve the performance of query answering systems, for example; multilingual relationships facilitate multilingual information extraction and retrieval.
- Semantic annotation: since words in the network are tagged by the semantic concepts to which to relate, a multilingual WordNet can be used for semantic annotation and classification of texts.
- Disambiguation: semantic relationships can assist in determining the semantic distance between words and concepts, thereby assisting in lexical disambiguation.
- Terminology: the system can be used for developing structured terminologies for specific applications.
Hebrew WordNet uses the MultiWordnet methodology and is aligned with the one developed at IRST (and, therefore, aligned with English, Italian and Spanish).
The system is distributed in two versions: a set of XML files and a set of SQL dumps.
- Noam Ordan and Shuly Wintner. "Hebrew WordNet: A Test Case of Aligning Lexical Databases Across Languages." International Journal of Translation 19(1):39-58, 2007. [BibTeX]
- Noam Ordan and Shuly Wintner. "Representing Natural Gender in Multilingual Lexical Databases." International Journal of Lexicography 18(3):357-370, September 2005.
- Researchers: Noam Ordan, Iris Eyal, Shuly Wintner, and Margalit Zabludowski.
- This project was joint with a team at the TCC Division ITC-IRST, Trento, Italy.
For non-commercial research purposes, this resource is licensed under the GNU General Public License (GPL). Any publications resulting from the use of this resource should refer to it as "The MILA Hebrew WordNet" and cite:
Noam Ordan and Shuly Wintner. "Hebrew WordNet: A Test Case of Aligning Lexical Databases Across Languages." International Journal of Translation 19(1):39-58, 2007. [BibTeX]