Stochastic Approach of Parsing Bengali Sentences

##plugins.themes.bootstrap3.article.main##

Ayesha Khatun
Khadiza-Tul-Kobra
Babe Sultana
Md. Jahidul Islam
Sumaiya Kabir

Abstract

The parsing technique based on associate grammar rules as well as probability is called stochastic parsing. This paper suggested a probabilistic method to eliminate the uncertainty from the sentences of Bangla. The technique of Binarization is applied to increase the precision of the parsing. CYK algorithm is used in this paper. The work mainly focused on intonation-based sentences, for these reasons PCFGs (Probabilistic Context-Free Grammars) is based on proposed. About 30324 words are used to test the proposed system; average 93% accuracy is achieved.

##plugins.themes.bootstrap3.article.details##

Area :
Articles

References

Schutze H, Manning C, Foundations of statistical NLP. MIT press; 1999 May 28.

Martin, J.H and Jurasky, D., 2000. Speech and Language Processing: Computational Linguistics and Speech Recognition, An introduction to natural language Processing. Prentice Hall, New Jersey.

M. Collins, “There are Three generative, lexicalized models for statistical parsing”, In Proceedings of International Conference of 8th Conference on European Chapter of the Association for Computational Linguistics, 7th July 1997.

M. M. Hoque, M. M. Ali, 2003, December. A parsing methodology for Bangla natural language sentences. In Proc. on Computer and Information Technology (ICCIT), Dhaka, Bangladesh (pp. 277-282).

Rabbi RZ, Shuvo MI, Hasan KA. Bangla grammar pattern recognition using shift reduce parser. In Proc. On 2016 5th International Conference on Informatics, Electronics and Vision Conference 2016 May 13 (pp. 229-234).

Chakraborty S, Sinha A, Nath S. a Bengali Sylheti rule-based dialect translation system: Proposal and preliminary system. In

Proceedings of the International Conference on Computing and Communication Systems 2018 (pp. 451-460). Springer, Singapore.

Dasgupta, S., Wasif, A. and Azam, S., 2004. An optimal way of machine translation from English to Bengali. In Proc. 7th International Conference on Computer and Information (ICCIT) (pp. 648-653).

Karim MA, editor. Technical challenges and design issues in bangla language processing. IGI Global; 2013 Apr 30.

Hoque, M.M., Faruk, M.O., Hasan, M.M., Hassan, M.K. and Karim, M.M.U., 2006. An empirical framework for statistical parsing of Bangla sentences. Computer Science & Engineering Research Journal, 4, pp.29-38.

Khatun, A. and Hoque, M.M., 2017, February. Statistical parsing of Bangla sentences by CYK algorithm. In 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE) (pp. 655-661). IEEE.

Purohit, P.P., Hoque, M.M. and Hassan, M.K., 2014, October. An empirical framework for semantic analysis of Bangla sentences. In 2014 9th International Forum on Strategic Technology (IFOST) (pp. 34-39). IEEE.

M. N. Hoque, M. H. Siddiqui , 2015, December. rule based analyzer and Bangla Parts-of-Speech tagging using Bangla stemmer. In 2015 18th International Conference on Computer and Information Technology (ICCIT) (pp. 440-444). IEEE.

Johnson M, Griffiths TL, Goldwater S. Bayesian inference for pcfgs via markov chain monte carlo. In Human Language

Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference 2007 Apr (pp. 139-146).

M.S. Arefin, M. M. Hoque, M. O. Rahman, and Arefin, M.S., 2015, May. interrogative and imperative sentences into English and the machine translation framework for translating Bangla assertive. In Proc. On 2015 International Conference on Electrical Engineering and Information Communication Technology (ICEEICT) (pp. 1-6).

Alam, L., Arefin, M.S., Hoque, M.M and Sharmin, S., 2015, November. For parsing Bangla assertive, interrogative and imperative sentences an empirical framework is designed.

In 2015 International Conference on Computer and Information Engineering (ICCIE) (pp. 122-125). IEEE.

Song, X., Ding, S. and Lin, C.Y., 2008, October. Better binarization for the CKY parsing. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (pp. 167-176).

Huang, L., Zhang, H., Gildea, D. and Knight, K., 2009. Binarization of synchronous context-free grammars. Computational Linguistics, 35(4), pp.559-595.