[Ltg] LTG Seminar [Alexandre Rafalovitch, 2008-06-23, E6A 202, 11am]

Marc Tilbrook marct at ics.mq.edu.au
Thu Jun 19 14:57:16 EST 2008


----
  LTG Seminar
   - see: http://www.clt.mq.edu.au/Events/Seminars.html

   Monday, 23th June, 2008, 11am
   Macquarie University, E6A, Room 202
----


Title: UN General Assembly resolution corpus - challenges and discoveries
Speaker: Alexandre Rafalovitch

In the talk, I will discuss UN GA resolution corpus and properties that make
it NLP-interesting. I will talk about building corpus from MSWord documents
and challenges of tokenization, Multi-Word Named Entities, verb phrase
clustering and (hints of) information extraction. If there is time, I may
also showcase some of the other corpus aspects that I do not concentrate on,
but other people may find interesting.





More information about the LTG mailing list