Texts for Testing Texts Analysis Tools
This is the preliminary list of texts for use in testing. Please see the bottom of the page for a copy of the tool testing results to date.
Short Text
Selection from Oscar Wilde's
Prose Poems: "The Doer of Good."
Text version
HTML version
XML version
Metadata for this text:
| Element | Number |
| * Words | 486 (483) |
| Paragraphs | 23 |
| ** Sentences | 31 |
| Returns | 43 |
| Words Joined by Dashes | 3 |
| Commas | 20 |
| Single-Quotes | 16 |
| Parentheses | 2 |
| Colons | 1 |
| Question Marks | 7 |
| Periods | 21 |
| Paragraphs Not Ending in Punctuation | 7 |
| HTML Tag Pairs | 34 |
| Empty HTML Tags | 4 |
| Total HTML Tags | 72 |
| XML Tag Pairs | 183 |
| Empty XML Tags | 3 |
| Total XML Tags | 366 |
*
The larger number is the maximum number of distinct words a text analysis tool might count. The smaller number is the maximum count should words joined by dashes, such as 'sea-purple,' be considered one word.
**
Some sentences are counted twice due to page breaks in the text. This count also includes non-poem information such as page numbers and the source project's name.
Mid-Length Text
Oscar Wilde's poem
Ravenna
Text version
HTML version
XML version
Long Text
Jonathan Swift's
Gulliver's Travels.
Text version
HTML version
XML version
Findings from Testing
HelpImprovementRecommendationsforTAPoRTools.docx:
TAPoR Help Improvement Recommendations (Last uploaded November 18, 2011)
TAPoRTestingNotes.docx:
TAPoR Testing Notes (Last uploaded November 18, 2011)
--
AmyDyrbye - 18 Nov 2011