Create Textual Infrastructure
using Text Analysis Tools
This recipe uses text analysis tools to extract key words to create an index and table of contents from a body of text.
Ingredients
Steps
- Prepare your electronic text for processing
;
- Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
- Identify keywords for indexing;
- Explore keywords using TAPoR Find Words - Concordance Tool to clusters of related words;
- Group terms of associated relevance as they should appear logically in the index
;
- Identify collocated words using TAPoR Find Collocates Tool to determine usage patterns;
- Return to your word processing tool such as Microsoft Word and use the generated lists to search for and tag the words for automated creation of index and contents.
Discussion
Text Preparation
You can use tools such
TAPoR Extract Text to remove added material.
Grouping Words for Index Inclusion
- How are terms related?
- Does a concordance help to associate words that should be logically associated - clustering?
Glossary
A Complete Glossary
Next Steps/Further Information
--
ShawnDay – 14 April 2007