five

Hindi WordNet

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2008L02
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3> <p>Hindi WordNet, Linguistic Data Consortium (LDC) catalog number LDC2008L02 and isbn 1-58563-470-0, was developed by researchers at the Center for Indian Language Technology, Computer Science and Engineering Department, IIT Bombay. </p> <p>Hindi, a member of the Indo-Iranian language family, is the primary national language of India and is spoken by approximately 500 million people making it the fifth largest language in the world. Inspired by the well-known English language <a href="http://wordnet.princeton.edu/" rel="nofollow">Wordnet</a>, Hindi Wordnet is the first wordnet for an Indian language. Wordnets are systems for analyzing the different lexical and semantic relations between words. Specifically, a wordnet is a word sense network in which words are grouped into sematically equivalent units called synsets. Each synset represents a lexical concept, and synsets are linked to each other by semantic relations (between synsets) and lexical relations (between words). Similar in design to the Princeton Wordnet for English, Hindi Wordnet incorporates additional features to capture the complexities of Hindi. This release of Hindi Wordnet consists of 56,928 unique words and 26,208 synsets.</p> <p>Additional information about the development of Hindi Wordnet is available at the <a href="http://www.cfilt.iitb.ac.in/wordnet/webhwn/" rel="nofollow">Hindi WordNet </a> web site. </p> <h3>Data</h3> <p>Hindi WordNet contains nouns, verbs, adjectives and adverbs. Each entry consists of the following elements:</p> <ol><li> <p><b>Synset:</b> a set of synonymous words. For example, ?विद्यालय, पाठशाला, स्कूल? (vidyaalay, paaThshaalaa, skuul) represents the concept of school as <i>an educational institution</i>. The words in the synset are arranged according to the frequency of usage.</p> </li></ol><ol start="2"><li> <p><b>Gloss:</b> the concept. It consists of two parts:</p> </li></ol><p> <b><i>Text definition</i>:</b> It explains the concept denoted by the synset. For example, ?वह स्थान जहाँ प्राथमिक या माध्यमिक स्तर की औपचारिक शिक्षा दी जाती है? (vah sthaan jahaaM praathamik yaa maadhyamik star kii aupacaarik sikshaa dii jaatii hai) explains the concept of school as <i>an educational institution.</i></p> <p> <b><i>Example sentence</i>:</b> It gives the usage of the words in the sentence. Generally, the words in a synset are replaceable in the sentence. For example,<i>"</i>इस विद्यालय में पहली से पाँचवीं तक की शिक्षा दी जाती है? (is vidyaalay me pahalii se pancvii tak kii shikshaa dii jaatii hai) gives the usage for the words in the synset representing schoolas<i> an educational institution.</i></p> <ol start="3"><li> <p><b>Position in Ontology</b>: An ontology is a hierarchical organization of concepts, or more specifically, a categorization of entities and actions. A separate ontological hierarchy exists for each syntactic category (noun, verb, adjective adverb). Each synset is mapped into some place in the ontology.. </p> </li></ol><p>This release of Hindi WordNet is made available as a complete Java application along with an API to facilitate further development. </p> </br> Portions © 2007 IIT Bombay, © 2008 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作