Mitchell P. Marcus




RCA Professor of Artificial Intelligence

Office Address:
Dept. of Computer & Information Science
University of Pennsylvania
3330 Walnut Street
503 Levine Hall
Philadelphia, PA 19104-6389
Email: mitch@cis.upenn.edu
Tel: (215) 898-2538
FAX: (215) 898-0587


RESEARCH

Research interests include: statistical and symbolic methods for automatic acquisition of linguistic structure; methodologies for annotation of linguistic structure in large text corpora; natural language processing; and cognitive science.


BACKGROUND

I Studied Linguistics and Applied Mathematics at Harvard University as an undergraduate (Magna cum Laude), and received a Ph.D. in Electrical Engineering and Computer Science in 1978 from MIT. From 1979-1987, I was a Member of Technical Staff at AT&T Bell Laboratories. At Penn, I hold the position of RCA Professor of Artificial Intelligence and was Chair of the Department of Computer and Information Science from 1994 to 2001. I also hold an appointment in Linguistics. I am currently (2002-2003) Chair of the Penn Faculty Senate. I was the principal investigator for the Penn Treebank Project through the mid-1990s, was President of the Association for Computational Linguistics in 1997, and have been a Fellow of the American Association of Artificial Intelligence since 1992. In 2002, I was the Chair of the 2nd Human Language Technologies Conference.


SELECTED PUBLICATIONS

``Text Chunking using Transformation-Based Learning'', Association for Computational Linguistics Workshop on Very Large Corpora, Boston, MA, June 1995 (with Lance Ramshaw). Reprinted in Natural Language Processing Using Very Large Corpora, Armstrong et al. (eds.), Kluwer, 1998.

``New Trends in Natural Language Processing: Statistical Natural Language Processing'', Proceedings of the National Academy of Science, Vol 92, pp. 10052-10059, 1995. Reprinted in Voice Communication Between Humans and Machines, Rowe and Wilpon (eds.), National Academy Press, 1994.

``Building a large annotated corpus of English: the Penn Treebank.'' Computational Linguistics, Vol.19, 1993. Reprinted in Using Large Corpora, S. Armstrong (ed.), MIT Press, 1994. (with Beatrice Santorini and Mary Ann Marcinkiewicz)

``The Penn Treebank: A Revised Corpus Design for Extracting Predicate Argument Structure,'' 1994 ARPA Human Language Technology Workshop, Princeton, NJ, March 1994, Morgan-Kaufman (with 6 others).

A Theory of Syntactic Recognition for Natural Language, MIT Press, 1980.


FORMER PhD STUDENTS

Jason Eisner 2001 (Asst Prof, Johns Hopkins)
Michael Collins 1999 (Asst Prof, MIT)
Dan Melamed 1998 (Asst Prof, NYU)
Adwait Ratnaparkhi 1998 (Microsoft Research)
Jeff Reynar 1998 (Lead Program Manager, Microsoft)
David Yarowsky 1996 (Assoc Prof, Johns Hopkins)
James Henderson 1994 (Maître-Assistant,Université de Genève)
Eric Brill 1993 (Senior Researcher, Microsoft Research)
Robert Rubinoff 1992
Michel Degraff 1992 (Assoc Prof, Linguistics, MIT)