This article gives an overview of the state of the art in tools and resources for the syntactic analysis of Estonian. A morphosyntactic disambiguator, a surface-syntactic analyzer and a dependency parser are all based on the Constraint Grammar formalism. As for language resources, a 400,000-word manually annotated dependency treebank has been created; its annotation scheme is compatible with the output of the Constraint Grammar dependency parser. Part of the treebank has been converted to the Universal Dependencies annotation scheme. Our tools have also been tested in large-scale corpus annotation.