Remko Scha: Research on Data-Oriented Parsing


D. Ayuso, Y. Chow, A. Haas, R. Ingria, S. Roucos, R. Scha and D. Stallard: Integration of Speech and Natural Language. BBN Report No. 6813. April 1988.

S. Boisen, Y. Chow, A. Haas, R. Ingria, S. Roucos, R. Scha, D. Stallard, and M. Vilain: Integration of Speech and Natural Language: Final Report. Report No. 6991. Cambridge, Mass.: BBN Systems and Technologies Corporation. 1989.

Remko Scha: "Taaltheorie en taaltechnologie; competence en performance." In: R. de Kort and G.L.J. Leerdam (eds.): Computertoepassingen in de Neerlandistiek. Almere: LVVN, 1990, pp. 7-22. [Translated into English as: "Language Theory and Language Technology; Competence and Performance."]

Remko Scha: "Virtuele Grammatica's en Creatieve Algoritmes." Gramma/TTT 1, 1 (1992), pp. 57-77. [Translated into English as: "Virtual Grammars and Creative Algorithms."]

Jan Scholtes: Neural Networks in Natural Language Processing and Information Retrieval. January 21, 1993. Ph.D. Thesis, University of Amsterdam. (Promotor: Remko Scha.)

Khalil Sima'an, Rens Bod, Steven Krauwer and Remko Scha: "Efficient Disambiguation by means of Stochastic Tree Substitution Grammars." In: D. Jones (ed.): Proceedings of the International Conference on New Methods in Language Processing. University of Manchester, 1994, pp. 50-58.

Martin van den Berg, Rens Bod and Remko Scha: "A Corpus-Based Approach to Semantic Interpretation." In: P. Dekker and M. Stokhof (eds.): Proceedings of the Ninth Amsterdam Colloquium. ILLC, University of Amsterdam, 1994, pp. 141-160.

[Reprinted with minor additions as Chapter 8 ("Further Extensions of DOP: Semantics, Discourse, Recency") in: Rens Bod: Enriching Linguistics with Statistics: Performance Models of Natural Language (ILLC Dissertation Series 1995-14, University of Amsterdam, 1995), and as Chapter 8 ("An experience-based model for compositional semantic representations") in: Rens Bod: Beyond Grammar: an Experience-Based Theory of Language (CSLI Publications, Stanford, CA, 1998).]

Rens Bod and Remko Scha: "Prediction and Disambiguation by means of Data-Oriented Parsing." In: L. Boves and A. Nijholt (eds.): Speech and Language Processing. Universiteit Twente, Enschede, 1994, pp. 157-160.

Rens Bod: Enriching Linguistics with Statistics: Performance Models of Natural Language. 1995. (Promotor: Remko Scha.) ILLC Dissertation Series 1995-14.

Remko Bonnema, Remko Scha and Rens Bod: "A Data-oriented Approach to Semantic Interpretation." Proceedings Workshop on Corpus-Oriented Semantic Analysis, ECAI-96, Budapest, Hungary, 1996. cmp-lg/9606024.

K. Sima'an, R. Scha, R. Bonnema and R. Bod: Disambiguation and Interpretation of Wordgraphs using Data Oriented Parsing. Report #31, Probabilistic Natural Language Processing, NWO Priority Programme for Language and Speech Technology, Amsterdam, November 1996.

Rens Bod and Remko Scha: "Data-Oriented Language Processing. An Overview." Technical Report LP-96-13, Institute for Logic, Language and Computation, University of Amsterdam, 1997. cmp-lg/9611003.

R. Bod, R.L. Bonnema, R. Koeling, G.J. van Noord and R.J.H. Scha: Cooperation Data-oriented Grammar-based Natural Language Processing, NWO Priority Programme on Language and Speech Technology, Technical Report nr. 47, Amsterdam/Groningen, 1997.

Remko Bonnema, Rens Bod and Remko Scha: "A DOP Model for Semantic Interpretation." Proceedings 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics (July 7-12, 1997, Madrid, Spain), pp. 159-167.

R. Bod and R. Scha: "Data-Oriented Language Processing." In: S. Young and G. Bloothooft (eds.): Corpus-Based Methods in Language and Speech Processing. Boston: Kluwer Academic Publishers, 1997, pp. 137-173.

Reprinted in: Diana McCarthy and Geoffrey Sampson (eds.): Readings in Corpus Linguistics. London & New York: Continuum International, 2003.

R. Bod, R. Bonnema and R. Scha: "Data-Oriented Semantic Interpretation." Proceedings Second International Workshop on Computational Semantics (IWCS-II, Tilburg, The Netherlands, 1997), pp. 15-25.

Remko Scha: "Exacte Geesteswetenschappen." Athenæum Illustre, 9 (March 1997), pp. 39-43.

Remko Bonnema, Paul Buying and Remko Scha: "A New Probability Model for Data Oriented Parsing." In: Paul Dekker (ed.): Proceedings of the Twelfth Amsterdam Colloquium, December 18-21, 1999, pp. 85-90.

Remko Scha, Rens Bod and Khalil Sima'an: "A Memory-Based Model of Syntactic Analysis: Data-Oriented Parsing." Journal of Experimental and Theoretical Artificial Intelligence, 11, 3 (July 1999), pp. 409-440. (Special Issue on Memory-Based Language Processing, edited by Walter Daelemans.)

Rens Bod and Remko Scha: "What are the Productive Units of Language Processing? Some Evidence from Data-Oriented Parsing." Proceedings International Conference on Cognitive Science 1999, Tokyo, Japan.

L. Boves, J. Terken, J. Landsbergen, R. Scha and G. van Noord: NWO Priority Programme Language & Speech Technology. Progress Report 1997-1998. Netherlands Organisation for Scientific Research, The Hague, July 1999.

Khalil Sima'an: Learning Efficient Disambiguation. March 31, 1999. Utrecht University, Utrecht Institute of Linguistics OTS. ILLC Dissertation Series 1999-02. (Promotors: Jan Landsbergen and Remko Scha.) FoLLI 2000 Dissertation Award.

Remko Bonnema, Paul Buying, Remko Scha and Khalil Sima'an: "Data-Oriented Natural Language Understanding." In: Remko Scha (ed.): NWO Priority Programme Language and Speech Technology. Final Report. The Hague: Dutch National Science Foundation NWO. May 2000, pp. 149-178.

Remko Scha (ed.): NWO Priority Programme Language and Speech Technology. Final Report. The Hague: Dutch National Science Foundation NWO. May 2000. Contributions by Remko Bonnema, Loe Boves, Paul Buying, Danny Kersten, Esther Klabbers, Gertjan van Noord, Remko Scha, Khalil Sima'an, Jacques Terken, M. Theune, and Gert Veldhuijzen van Zanten.

Remko Bonnema, Paul Buying and Remko Scha: "Parse Tree Probability in Data Oriented Parsing." In: Alexander Gelbukh (ed.): CICLing-2000 International Conference Proceedings, IPN, Mexico City, January 2000, pp. 219-232.

Lars Hoogweg: Extending DOP1 with the insertion operation. M.A. Thesis, Department of Computational Linguistics, University of Amsterdam, 2000.

Rens Bod, Remko Scha and Khalil Sima'an (eds.): Data-Oriented Parsing. Stanford: CSLI Publications, 2003. 410 pp. Contributions by Rens Bod, Remko Bonnema, John Carroll, Jean-Cédric Chappelier, David Chiang, Ido Dagan, Guy De Pauw, Joshua Goodman, Lars Hoogweg, Aravind Joshi, Ronald Kaplan, Yuval Krymolowski, Günter Neumann, Arjen Poutsma, Martin Rajman, Anoop Sarkar, Remko Scha, Khalil Sima'an, Srinivas Bangalore, Andy Way, David Weir and Menno van Zaanen.

Rens Bod, Remko Scha and Khalil Sima'an: "Introduction." In: Rens Bod, Remko Scha and Khalil Sima'an (eds.): Data-Oriented Parsing. Stanford: CSLI Publications, 2003, pp. 1-9.

Rens Bod and Remko Scha: "A DOP Model for Phrase Structure Trees." In: Rens Bod, Remko Scha and Khalil Sima'an (eds.): Data-Oriented Parsing. Stanford: CSLI Publications, 2003, pp. 13-23.

Remko Bonnema and Remko Scha: "Reconsidering the Probability Model for DOP." In: Rens Bod, Remko Scha and Khalil Sima'an (eds.): Data-Oriented Parsing. Stanford: CSLI Publications, 2003, pp. 25-41.

Isaac Sijaranamual: Regular Treebank Generalisation. M.Sc. Thesis Artificial Intelligence, University of Amsterdam, August 19, 2007.