Please use this identifier to cite or link to this item: http://ir.inflibnet.ac.in/handle/1944/504
Full metadata record
DC FieldValueLanguage
dc.contributor.authorFatima, S Sameenen_US
dc.contributor.authorKrishnan, Ren_US
dc.date.accessioned2005-05-10T11:52:19Zen_US
dc.date.accessioned2010-04-08T08:47:56Z-
dc.date.available2005-05-10T11:52:19Zen_US
dc.date.available2010-04-08T08:47:56Z-
dc.date.issued2005-02-02en_US
dc.identifier.isbn81-902079-0-3en_US
dc.identifier.urihttp://hdl.handle.net/1944/504en_US
dc.description.abstractAn error in classification can occur due to an error of omission, statistically known as a false negative or an error of commission, statistically known as a false positive. In order to build a perfect classifier, the false negatives and false positives have to be zero. With this in mind, we propose a two-tier model for the classifier. The first tier will reduce false negatives to zero and pass the results to the second tier. The second tier will reduce false positives to zero. We demonstrate the working of this model for the task of classifying sentences in Hindi as passive formations. The first tier will consist of a simple pattern matching system for filtering out sentences with likely passive formations without committing errors of omission. This will reduce the size of the corpus considerably. The second tier will work on the reduced corpus and make a complete grammatical analysis of these filtered sentences in order to reduce the false positives to a zero. The Anusaraka System [Bharati 1995] is a very good example of such a system. This paper concentrates on building the first tier. A hill climbing algorithm is proposed, where the start state is a list of patterns commonly found in passive formations. Each step up the hill will update the list of patterns such that the next state will bring down the number of false negatives, thereby reducing errors of omission. The hill climbing algorithm terminates when the false negatives are zero.en_US
dc.format.extent223841 bytesen_US
dc.format.mimetypeapplication/pdfen_US
dc.language.isoenen_US
dc.publisherINFLIBNET Centreen_US
dc.subjectNatural Language Processingen_US
dc.subjectAutomated Language Processingen_US
dc.titleTwo-Tier Performance Based Classification Model for Low Level NLP tasksen_US
dc.typeArticleen_US
Appears in Collections:CALIBER 2005:Kochi

Files in This Item:
File Description SizeFormat 
05cali_13.pdf218.59 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.