TAPoRware Online Documentation
TAPoRware is a collection tools that enable user to perform text analysis on XML, HTML and plain text files over the Web. Go to
http://taporware.mcmaster.ca to try them.
Description
TAPoRware tools are open tools written in Ruby (and in some cases Java) that can be used on the Web or downloaded for local use. Some of the features of TAPoRware are:
- Each of the tools can be used off our server (TAPoRware) on texts anywhere on the web by entering the URL. TAPoRware tools will retrieve the text, process it and return results to you. TAPoRware tools can also process an uploaded text.
- TAPoRware tools come in four flavours, those tools that will work on XML texts (including TEI encoded texts), those that work with HTML texts, those that work with plain texts, and those that can work on a combination (Other) of texts. Wherever possible we have similar tools for each type of text.
- TAPoRware tools typically return results in four forms.
- HTML that your browser renders for a readable results display.
- An XML results file that could be saved and used for further processing.
- The XML results converted so that it can be rendered as HTML in a browser.
- A Java client-side interactive visualization.
- You can download and install the TAPoRware suite. They will work on Windows, Mac OS X, and Linux. See instructions on the TAPoRware site.
History
- TAPoRware tools began as a project by Geoffrey Rockwell while on sabbatical at the University of Virginia Institute for Advanced Technology in the Humanities. They have have been redeveloped by Lian Yan to work within the context of the TAPoR project. Others have begun to contribute tools, notably the Fixed phrase tools contributed by NYU.
- In May of 2005 Matt Patey began work on the project. He developed a new interface, the help, and visualization tools like the Distribution, Word Rain, and Word Brush tools.
General Notes
The
TAPoRware tools generally work only on texts less than a Megabyte in size. As the text is not preindexed certain tools can take a long time, especially tools that use stemming or part-of-speech tagging.
We have a collection of recipes for doing research tasks with text analyis tools. Many of them use the
TAPoR versions of the
TAPoRware tools. The recipes are for things like identifying a theme in a text.
We encourage people to add comments to these pages on the
TAPoRware tools or to edit the pages themselves.
Individual Tools