CURRENT STATE OF THE ART PARTS OF SPEECH TAGGING FOR INDIAN LANGUAGES – A STUDY
Keywords:
Ambiguity, Tagset, Natural Language Processing, Part of Speech Tagging, Rule Based Approach, Statistical Approach, Hybrid Approach
Abstract
Part-of-speech (POS) tagging is a pipeline module for almost all application areas of natural language processing
(NLP), and a very important preprocessing task for language processing activities. This paper reports on
the POS tagging systems proposed for 15 Indian languages (Hindi, Panjabi, Marathi, Gujarati, Kannada, Tamil, Telugu,
Malayalam, Manipuri, Konkani, Bengali, Assamese, Odia, Sambalpuri, Sindhi). The approaches used in POS tagging
are also briefly discussed.