Talk on active learning/statistical parsing--Rebecca Hwa-REMINDER

If you would like to meet with Rebecca please send me a message with a copy to
David Chiang (dchiang@linc).

Rebecca will come in the eve. of Tue. Jan 23. I will be away that eve. David is
arrangine a small dinner party, so please get intouch with him if you wouldlike
to join him.

Other opportunities are Th. Jan 24--lunch and in the afternoon.
Th eve. (Jan 24) we will go out for dinner. Let me know if you would like to

Announcement for the talk enclosed below.


Rebecca Hwa from the Univ. of Maryland will be speaking at next week's
XTAG meeting (24 Jan, 10:30 AM, IRCS fishbowl).

Sample selection for parser induction

Many corpus-based natural language processing systems rely on using
large quantities of annotated text as their training examples. Building
this kind of resource is an expensive and labor-intensive project.
Sample selection is a machine learning technique that attempts to
minimize the number of training examples by asking people to annotate
only those examples with the greatest potential to improve the system.
In this talk, I will address the challenges in applying sample selection
to training parsers: what is an effective metric for
selecting informative examples to train parsers; does the metric work
for different kinds of parsers; and are the selected examples good for
training other parsers?  I will present empirical results showing that
selection using the {\it tree-entropy} metric can significantly decrease
the number of training examples needed for both a history-based and an
EM-based parser.

------- End of Forwarded Message