Speaker: Thiago Pardo
He is a professor and researcher at the Instituto de Ciências Matemáticas e de Computação (ICMC) of the Universidade de São Paulo (USP), Brazil.
His research areas are automatic summarization, automatic discourse analysis, automatic text simplification, and machine translation. Particularly noteworthy are the enriched corpora and the tools for analyzing discourse structure that his research has made available to the scientific community.
Date: Thursday, February 27, 2014
Time: 3:30 p.m.
Where: Faculty of Informatics, Room 3.1
“Rhetorical Structure Theory: relational discourse structure annotation”
(Basque title: Egitura Erretorikoaren Teoria: diskurtsoaren egitura erlazionalaren anotazioa)
Thiago A.S. Pardo has taken part in building several tools for discourse analysis, and he will tell us about that experience. Among others, these are some of those tools:
- CSTNews interface – access to 50 clusters of news texts and their multidocument summaries, with texts annotated according to the Cross-document Structure Theory
- CSTTool – a semi-automatic editing tool for annotating texts according to the Cross-document Structure Theory
- DiZer 2.0 – an on-line version of DiZer, which is easily adaptable and portable to different text types/genres and languages
- RSTeval – tool for discourse parsing evaluation, following Marcu's (2000) evaluation method; it compares RST trees (automatically or manually produced) and reports precision and recall
- CorpusTCC – corpus of 100 Brazilian Portuguese scientific texts (from the Computer Science domain – introduction sections of theses), annotated with Marcu’s RSTTool (using this relation set), used for developing DiZer
- RhetDB – Rhetorical Database – an editing environment for handling the rhetorical analyses produced by Daniel Marcu’s RSTTool; it offers several facilities for both computational and linguistic purposes. (this is an old version of the software; for better and more advanced features, use RST Toolkit above)
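
To give a rough idea of what comparing two RST trees by precision and recall can look like, here is a minimal Python sketch. It is not RSTeval's implementation: it assumes each tree has already been flattened into a set of labeled spans over elementary discourse units (EDUs), and the `Span` type, the `precision_recall` function, and the relation labels are all hypothetical names chosen for illustration.

```python
from typing import NamedTuple, Set, Tuple


class Span(NamedTuple):
    """One internal node of a tree, flattened (assumed representation)."""
    start: int      # index of the first EDU covered by the node
    end: int        # index of the last EDU covered (inclusive)
    relation: str   # illustrative relation label, e.g. "elaboration"


def precision_recall(predicted: Set[Span], gold: Set[Span]) -> Tuple[float, float]:
    """Score a predicted tree against a reference tree.

    A predicted span counts as correct only if the reference tree has a
    span with the same boundaries and the same relation label.
    """
    matched = len(predicted & gold)
    precision = matched / len(predicted) if predicted else 0.0
    recall = matched / len(gold) if gold else 0.0
    return precision, recall


# Reference analysis of a 4-EDU text: (1-2) and (3-4) joined at the top.
gold = {
    Span(1, 2, "elaboration"),
    Span(3, 4, "contrast"),
    Span(1, 4, "background"),
}

# Automatic analysis of the same text: ((1-2) 3) 4.
predicted = {
    Span(1, 2, "elaboration"),
    Span(1, 3, "contrast"),
    Span(1, 4, "background"),
}

p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

In this toy case two of the three predicted spans also appear in the reference, so precision and recall coincide. They can diverge when the two analyses contain different numbers of spans, for example when the texts are segmented into EDUs differently.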