POS Category and SWN
-
Upload
arkadiy-popov -
Category
Documents
-
view
36 -
download
2
description
Transcript of POS Category and SWN
Sanskrit Wordnet TeamIIT Bombay
POS Category and SWN
Roadmap Work Report
Interface
Publications
POS category and WN
POS of derived words
Form vs Function
Conclusion
04/20/232
Completed Synsets 12870Verb 970Noun 8857Adjective 2820 Adverb 223
04/20/233
Work Report (till 31st December 2011)
Interface of SWN
04/20/234
Slogan on the Interface
Let us collect words together
Publications and Presented Papers
04/20/235
Published Papers-2009 Kulkarni, M., and Bhattacharyya, P., Verbal roots in the
Sanskrit wordnet, In SanskritComputational Linguistics, Lecture Notes in Computer Science, edi. G. Huet, Amba Kulkarni, and Peter Scharf, Springer-Verlag, Berlin/Heidelberg, pp. 328-338
2010 Kulkarni, M., Kulkarni I., Dangarikar, C., Bhattacharyya P., and Nanda, A., Introducing Sanskrit Wordnet, 5th Global Wordnet Conference.
2010 Kulkarni, M., Kulkarni I., Dangarikar, C., Bhattacharyya P., Gloss in Sanskrit Wordnet, Sanskrit Computational Linguistic Symposium, Lecture Notes in Computer Science, 6465, (edi.) Jha, G. Berlin: Springer-Verlag/Heidelberg, pp. 190-198
2010 KulkarniM. et. al. Adverbs in Sanskrit wordnet, ICON 2010 Forthcoming paper
2012 Ajotikar, T., Kulkarni M. and Bhattachryya P., Verbs in Sanskrit Wordent, 6th Global Wordnet Conference, Japan
Verbs-Gloss
Unique task Following traditional grammar and Nyāya schoolTakes care of semantics as well as syntax
04/20/236
POS Category and WN
गमन = एकस्था�नवियो�गपू��क� पू�दविक्षे�पूयो�क्तः� अन्योस्था�नसं�यो�ग�न�क� लः� व्या�पू�रः�।
अपू�द�न कर्ता�� कम�
04/20/237
रः�म खा�न� पका� रहा� हा�
रः�म न� पूत्र लि�ख डा��� ।
पूक�न� Semantic Aspect
Progressive Aspect + Gender+ Number
Tense + Number
लिलःखान�Semantic Aspect
Number+ Gender+ Aspect
Tense, Aspect, Mood etc. expressed through a lexicalized units in Hindi
Structural Difference between Hindi and Sanskrit
Nature of verb in Sanskrit
04/20/238
रः�म� ग्रा�म� गच्छविर्ता।
गम$ विर्ता
क�रःकम$ क�लः� पू�रुषः� चनम$गमनम$
Tense, Number, Person and क�रःक Information fused in SUFFIX
Sanskrit does not use
lexical item to denote
tense, mood, aspect.
Issues related to link synsets of Verbs
04/20/239
Unable to link synsets expressing meaning such as
च�कन� (synset id 10809)
विकसं( क�यो� क� बा�क़ी+ न रःहन�
"क्यो� आपू खा�न� खा� च�क� “
To eat
Completion
सं�भा�न� ह�न�, संकन� (synset id 10149)
क�ई क�यो� ह� संकन�
आज बा�रिरःश ह�न� क+ सं�भा�न� ह6।
Expressed through a suffix.किंक8 भान्र्ता� अभा�क्षेर्ता।
Expressed through a suffix.अद्य षः�� भा�र्ता$।
सं�भा�न�ह�न�
स्रष्टा�- a creatorTwo synsets in HWN in different categories
04/20/2310
POS of derived words
7789 सं>जनकर्ता��, संज�क, सं>जक, रःच�र्ता�, स्रष्टा�, लिसंरःजनह�रः - रःचन� यो� बान�न��लः� "योह र्ता� सं>ष्टिष्टा क� सं>जनकर्ता�� ब्रह्मा�ज( भा( नहB ज�नर्ता� विक उन्हDन� सं>ष्टिष्टा क+ रःचन� क्योD क+" ADJECTIVE
6259 विनम��र्ता�, संज�क, सं>जक, सं>जनकर्ता��, स्रष्टा�, सं>जनह�रः, सं>ष्टिष्टाकर्ता��, लिसंरःजनह�रः, खा�लिलःक, ख़ा�लिलःक़ी, सं�स्था�पूक - विनम��ण करःन� यो� बान�न� �लः� "इसं विIल्म क� विनम��र्ता� आष्टिमरः ख़ा�न ह6“ NOUN
विनम��र्ता�, धा�र्ता�, स्रष्टा� - यो� विनम��विर्ता। "विहन्दुधाम��न�सं�रः�ण सं>ष्टा�� विनम��र्ता� ब्रह्माद�� अस्तिOर्ता।"
स्रष्टा� nominative singular of स्रष्टा>
स्रष्टा>
सं>ज$ र्ता>च$ (Ashtadhyayi 3.1.133) to create कर्ता>� of action
Derived Word Can be said as an adjective Lexicalized as a noun
04/20/2311
Creator
क्तः�
04/20/2312
Found in Vedic Literatu
re
Occurrence in all genders
as a noun
in classic
al Sanskr
it
04/20/2313
(synset id 29330)
वक्ता�, अ��प�, अमू�का, व�दका, आख्या�पका, आख्या�याका -
बा�लःन� �लः�
"कभा(- “कभा( क्तः� व्यालिक्तः क� झे�लःन� बाड़ा� म�स्तिRकलः ह� ज�र्ता� ह6
Adjective
• Cannot be done in SWN
Form vs Function
04/20/2314
Conflict lexical information is adjective syntactic feature is like a noun
When a word is lexically an adjective but behaves like a noun on syntactic level then it is called as nominal. The ability to take grammatical case marking.
Common Feature of Indian LanguagesAdjectives are declined according to gender and number of
a noun to which they modify in Sanskrit. श्वे�र्ता� धा�विर्ता।
Nominals should not be a part of WN which is semantic net.
Nouns behave as an adjective on syntactic level is an adjectival.
Synsets of adjectivals difficult to cover in Sanskrit
Literal translation ‘मह्यं� विहन्दUलिचत्रपूटाः�� रः�चन्र्ता�।’ किंह8दU विIल्मX – A compound in Sanskrit.It is dissolved in Sanskrit as विहन्दU(भा�षः�यो�म$) लिचत्रपूटाः�.It is संप्र्ताम( र्तात्पू�रुषः.It is not an adjective in Sanskrit Compound.
04/20/2315
12530 विहन्दU,किंह8दU किंह8दU भा�षः� क� यो� किंह8दU सं� सं�बा�ष्टिधार्ता
म^ किंह8दU विIल्मX द�खान� पूसं�द करःर्ता� हूँ`।
Adjective
04/20/2316
When a word is an adjective or a noun lexically but functions as an adverb syntactically then it is called as adverbial.ज�नज�र्ता$�ग�न
All these are nouns lexically.When appear in instrumental or ablative
case acts as an adverbShould they form a part of a synset of an
adverb?
Conclusion
Impact of structural differences of Hindi and
Sanskrit on development of SWN
Lexicalization of derived words needs to be
studied
Historical development of a word should be
considered while developing SWN
Nominals, Adjectivals and Adverbials will not
be considered in SWN
04/20/2317