目 录
导读…1
Preface …17
Introduction …19
Anne Abeill??
1 BUILDING TREEBANKS …21
2 USING TREEBANKS …25
Part I BUILDING TREEBANKS
ENGLISH TREEBANKS
Chapter 1
THE PENN TREEBANK:AN OVERVIEW …5
Ann Taylor, Mitchell Marcus, Beatrice Santorini
INTRODUTION…5
1 THE ANNOTATION SCHEMES …6
2 METHODOLOGY …16
3 CONCLUSIONS …20
Chapter 2
THOUGHTS ON TWO DECADES OF DRAWING TREES …23
Geoffrey Sampson
1 HISTORICAL BACKGROUND …23
2 BUILDING TREEBANKS …26
3 EXPLOITING THE SUSANNE TREEBANK …29
4 SMALL IS BEAUTIFUL …33
5 ANNOTATING A SPOKEN CORPUS …35
6 USING THE CHRISTINE CORPUS …38
7 CONCLUSION …40
Chapter 3
BANK OF ENGLISH AND BEYOND …43
Timo J?rvinen
1 INTRODUCTION …43
2 ANNOTATING 200 MILLION WORDS …44
3 ENGCG SYNTAX …52
4 FDG PARSER …54
5 CONCLUSION …56
Chapter 4
COMPLETING PARSED CORPORA …61
Sean Wallis
1 INTRODUCTION …61
2 CONVENTIONAL POST-CORRECTION …63
3 A PARADIGM SHIFT: TRANSVERSE CORRECTION …65
4 CRITIQUE …68
GERMAN TREEBANKS
Chapter 5
SYNTACTIC ANNOTATION OF A GERMAN NEWSPAPER CORPUS …73
Thorsten Brants, Wojciech Skut, Hans Uszkoreit
1 INTRODUCTION …73
2 TREEBANK DEVELOPMENT …74
3 CORPUS ANNOTATION …77
4 APPLICATIONS …83
5 CONCLUSIONS …83
Chapter 6
ANNOTATION OF ERROR TYPES FOR A GERMAN
NEWSGROUP CORPUS…89
Markus Becker, Andrew Bredenkamp, Berthold Crysmann, Judith Klein
1 INTRODUCTION …89
2 CORPUS DESCRIPTION …90
3 ANNOTATION STRATEGY …91
4 ANNOTATION TOOLS …93
5 EVALUATION …96
6 FIRST RESULTS …98
7 CONCLUSION …99
SLAVIC TREEBANKS
Chapter 7
THE PRAGUE DEPENDENCY TREEBANK… 103
Alena B?hmov??, Jan Hajicˇ, Eva Hajicˇov??, Barbora Hladk??
1 THE PRAGUE DEPENDENCY TREEBANK …103
2 MORPHOLOGICAL LEVEL …104
3 ANALYTICAL LEVEL …106
4 MERGING THE MORPHOLOGICAL AND THE
ANALYTICAL SYNTACTIC LEVEL …114
5 TECTOGRAMMATICAL LEVEL …114
6 PDT VERSIONS 1.0 AND 2.0 …121
7 CONCLUSION …122
Chapter 8
AN HPSG-ANNOTATED TEST SUITE FOR POLISH …129
Malgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, Anna Kup
1 AIMS AND DESIGN CONSTRAINTS …129
2 CORRECTNESS AND COMPLEXITY MARKERS …130
3 LINGUISTIC PHENOMENA …131
4 ANNOTATION SCHEMA …136
5 IMPLEMENTATION ISSUES …137
6 CONCLUSION …143
TREEBANKS FOR ROMANCE LANGUAGES
Chapter 9
DEVELOPING A SYNTACTIC ANNOTATION SCHEME AND TOOLS
FOR A SPANISH TREEBANK …149
Antonio Moreno, Susana López, Fernando S??nchez, Ralph Grishman
1 INTRODUCTION …149
2 DATA SELECTION …150
3 ANNOTATION SCHEME …151
4 TOOLS …157
5 DEBUGGING AND ERROR STATISTICS …158
6 CURRENT STATE AND FUTURE DEVELOPMENT …159
Chapter 10
BUILDING A TREEBANK FOR FRENCH …165
Anne Abeill??, Lionel Cl??ment, Fran?ois Toussenel
INTRODUTION
1 THE TAGGING PHASE …166
2 THE PARSING PHASE …173
3 CURRENT STATE AND FUTURE WORK …180
4 CONCLUSION …181
Chapter 11
BUILDING THE ITALIAN SYNTACTIC-SEMANTIC TREEBANK …189
Simonetta Montemagni, Francesco Barsotti, Marco Battista, Nicoletta Calzolari, Ornella Corazzari, Alessandro Lenci. Antonio Zampolli, Francesca Fanciulli, Maria Massetani, Remo Raffaelli, Roberto Basili, Maria Teresa Pazienza, Dario Saracino, Fabio Zanzotto,Nadia Mana, Fabio Pianesi, Rodolfo Delmonte
1 INTRODUCTION …190
2 ISST ARCHITECTURE …190
3 ISST CORPUS …191
4 ISST MORPHO-SYNTACTIC ANNOTATION …191
5 ISST SYNTACTIC ANNOTATION …192
6 ISST LEXICO-SEMANTIC ANNOTATION …196
7 THE MULTI-LEVEL LINGUISTIC ANNOTATION TOOL …200
8 ISST EVALUATION …204
9 CONCLUSION …206
Chapter 12
AUTOMATED CREATION OF A MEDIEVAL PORTUGUESE
PARTIAL TREEBANK …211
Vitor Rocio. M??rio Amado Alves, J. Gabriel Lopes, Maria Francisca Xavier, Gra?a Vicente
1 INTRODUCTION …211
2 THE PARSED CORPUS OF MEDIEVAL
PORTUGUESE TEXTS …212
3 TOOLS AND COMPUTATIONAL RESOURCES …215
4 EVALUATION …222
5 CONCLUSION …224
TREEBANKS FOR OTHER LANGUAGES
Chapter 13
SINICA TREEBANK …231
Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang, Zhao-Ming Gao
1 INTRODUCTION …231
2 DESIGN CRITERIA …232
3 REPRESENTATION OF LEXICO-GRAMMATICAL
INFORMATION: ICG …233
4 ANNOTATION GUIDELINE …235
5 IMPLEMENTATION …239
6 REPRESENTATIONAL ISSUES: PROBLEMATIC CASES
AND HOW THEY ARE SOLVED …241
7 CURRENT STATUS OF THE SINICA TREEBANK AND
FUTURE WORK …243
Chapter 14
BUILDING A JAPANESE PARSED CORPUS …249
Sadao Kurohashi, Makoto Nagao
1 INTRODUCTION …249
2 OVERVIEW OF THE PROJECT …250
3 MORPHOLOGICAL ANALYZER JUMAN …253
4 DEPENDENCY STRUCTURE ANALYZER KNP …255
5 CONCLUSION …259
Chapter 15
BUILDING A TURKISH TREEBANK …261
Kemal Oflazer, Bilge Say, Dilek Zeynep Hakkani-Tür, G?khan Tür
1 TURKISH: MORPHOLOGY AND SYNTAX …262
2 WHAT INFORMATION NEEDS TO BE REPRESENTED? …263
3 THE ANNOTATION TOOL …270
4 SOME DIFFICULT ISSUES …272
5 CONCLUSIONS AND FUTURE WORK …273
Part II USING TREEBANKS
Chapter 16
ENCODING SYNTACTIC ANNOTATION …281
Nancy Ide, Laurent Romary
1 INTRODUCTION …281
2 XCES …283
3 SYNTACTIC ANNOTATION: CURRENT PRACTICE …284
4 A MODEL FOR SYNTACTIC ANNOTATION …286
5 USING THE XCES SCHEME …291
6 CONCLUSION …293
EVALUATION WITH TREEBANKS
Chapter 17
PARSER EVALUATION …299
John Carroll, Guido Minnen, Ted Briscoe
1 INTRODUCTION …299
2 GRAMMATICAL RELATION ANNOTATION …302
3 CORPUS ANNOTATION …308
4 PARSER EVALUATION …309
5 DISCUSSION …312
6 SUMMARY …313
Chapter 18
DEPENDENCY-BASED EVALUATION OF MINIPAR …317
Dekang Lin
1 INTRODUCTION …317
2 DEPENDENCY-BASED PARSER EVALUATION …318
3 EVALUATION OF MINIPAR WITH SUSANNE CORPUS …320
4 SELECTIVE EVALUATION …323
5 RELATED WORK …326
6 CONCLUSIONS …328
GRAMMAR INDUCTION WITH TREEBANKS
Chapter 19
EXTRACTING STOCHASTIC GRAMMARS FROM TREEBANKS …333
Rens Bod
1 INTRODUCTION …333
2 SUMMARY OF DATA-ORIENTED PARSING …335
3 SIMULATING STOCHASTIC GRAMMARS BY
CONSTRAINING THE SUBTREE SET …337
4 DISCUSSION AND CONCLUSION …344
Chapter 20
A UNIFORM METHOD FOR AUTOMATICALLY EXTRACTING
STOCHASTIC LEXICALIZED TREE GRAMMARS FROM
TREEBANKS AND HPSG …351
Günter Neumann
1 INTRODUCTION …351
2 RELATED WORK …352
3 GRAMMAR EXTRACTION …353
4 SLTG FROM TREEBANKS …355
5 SLTG FROM HPSG …359
6 FUTURE STEPS: TOWARDS MERGING SLTGS …362
Chapter 21
FROM TREEBANK RESOURCES TO LFG F-STRUCTURES …367
Anette Frank, Louisa Sadler, Josef van Genabith, Andy Way
1 INTRODUCTION …368
2 METHODS FOR AUTOMATIC F-STRUCTURE
ANNOTATION …370
3 TWO EXPERIMENTS …380
4 DISCUSSION AND CURRENT RESEARCH …383
5 SUMMARY …385
Contributing Authors …391
Index …398