Research Article Open Access

An Entropy Based Method for Removing Web Query Ambiguity in Hindi Language

S. K. Dwivedi and P. Rastogi


Problem statement: WSD is core problem of many Natural Language Processing (NLP) tasks; information retrieval is one of them. Information Retrieval in Hindi language also faces the similar problem of WSD. Hindi language is spoken by the major population in India. Natives from the rural area come across the setback of Hindi language information retrieval. WSD is one of them. End users do not understand that how the information retrieval system will remove the ambiguity in the queries. An automatic disambiguation system is required to rectify this problem. Various researchers have worked on it and given solutions. But none of them tried to detect the ambiguity in the query before its disambiguation. Approach: We followed entropy based selective query disambiguation approach for Hindi language information retrieval. The approach will identify the ambiguity in the query which will be further disambiguated. The approach is also stimulated by the feature of Google "Did you mean…" for English queries. This study summarizes the ambiguity detection approach as the prior ambiguity detection leads to conserve computation power. Results: We applied the selective query approach on the set of fifty queries. In our query set 35% queries were unambiguous. The survey of results concludes that several times even if the query consists of polysemous word, it is detected as unambiguous. Conclusions/recommendation: The study concludes that the detection of ambiguity is quiet important as it leads to saving computational time. Followed by ambiguity detection, final disambiguation can be done through human intervention based on google feature. 

Journal of Computer Science
Volume 4 No. 9, 2008, 762-767


Submitted On: 17 November 2008 Published On: 30 September 2008

How to Cite: Dwivedi, S. K. & Rastogi, P. (2008). An Entropy Based Method for Removing Web Query Ambiguity in Hindi Language. Journal of Computer Science, 4(9), 762-767.

  • 9 Citations



  • Word sense disambiguation
  • information retrieval
  • sense ambiguity
  • polysemous
  • hindi language
  • natural language processing