| <<O>> Difference Topic WhatTA (r1.9 - 16 Mar 2011 - GeoffreyRockwell) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||||||
| Line: 14 to 14 | ||||||||
|---|---|---|---|---|---|---|---|---|
| ||||||||
| Changed: | ||||||||
| < < |
| |||||||
| > > |
| |||||||
| ||||||||
| <<O>> Difference Topic WhatTA (r1.8 - 11 Mar 2007 - GeoffreyRockwell) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||||||
| Line: 49 to 49 | ||||||||
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Added: | ||||||||
| > > |
||||||||
What are electronic texts and how can we analyze them? A longer answer.Based on Electronic Texts and Text Analysis by Geoffrey Rockwell and Ian Lancashire | ||||||||
| <<O>> Difference Topic WhatTA (r1.7 - 30 Jul 2006 - GeoffreyRockwell) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||||||
| Line: 7 to 7 | ||||||||
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Added: | ||||||||
| > > |
What is text analysis? A very short answer.by Geoffrey Rockwell Word processors have searching tools that allow you to find a word or phrase. Such finding tools can be used as a simple text analysis environment. Your word processor is not, however, suited to searching large texts interactively, nor does it show you the results of a search in a way that can help you understand a text. Computer assisted text analysis environments do three types of things beyond what the "Find" tool of a word processor might do:
| |||||||
Why bother with computer-assisted text analysis? A short answer. | ||||||||
| Changed: | ||||||||
| < < |
by Geoffrey Rockwell | |||||||
| > > |
by Geoffrey Rockwell | |||||||
Text analyis tools aide the interpreter asking questions of electronic texts | ||||||||
| Line: 36 to 49 | ||||||||
|
| ||||||||
| Deleted: | ||||||||
| < < |
What is text analysis? A short answer.by Geoffrey Rockwell Word processors have searching tools that allow you to find a word or phrase. Such finding tools can be used as a simple text analysis environment. Your word processor is not, however, suited to searching large texts interactively, nor does it show you the results of a search in a way that can help you understand a text. Computer assisted text analysis environments do three types of things beyond what the "Find" tool of a word processor might do:
| |||||||
What are electronic texts and how can we analyze them? A longer answer. | ||||||||
| Changed: | ||||||||
| < < |
Based on Electronic Texts and Text Analysis by Geoffrey Rockwell and Ian Lancashire | |||||||
| > > |
Based on Electronic Texts and Text Analysis by Geoffrey Rockwell and Ian Lancashire | |||||||
|
| ||||||||
| <<O>> Difference Topic WhatTA (r1.6 - 20 Apr 2006 - GeoffreyRockwell) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||
| <<O>> Difference Topic WhatTA (r1.5 - 30 Mar 2006 - GeoffreyRockwell) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||
| <<O>> Difference Topic WhatTA (r1.4 - 23 Feb 2006 - GeoffreyRockwell) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||||||
| Changed: | ||||||||
| < < |
TOC: No TOC in "Main.WhatTA" | |||||||
| > > |
Why bother with computer-assisted text analysis? A short answer.by Geoffrey RockwellText analyis tools aide the interpreter asking questions of electronic textsMuch of the evidence used by humanists is in textual form. One way of interpreting texts involves bringing questions to these texts and using various reading practices to reflect on the possible answers. Simple text analysis tools can help with this process of asking questions of a text and retrieving passages that help one think through the questions. The computer does not replace human interepretation, it enhances it. The concordance is an example of a research aide that predates computing. The concordance, like the index, allows the interpreter to find passages that share a common word about which you are asking. Unlike the index, the concordance is a presentation of the passages that "concord" together for reflection. Thus a Key Word In Context display shows one line of text with the key word searched for in the middle so that the reader can see if there are patterns in the way the word is used. Most text analysis tools build on the concordance. They break down the text (analysis) and then represent passages in new arrangements (synthesis) that aide the questioner list graphs of the distribution of a word.
Text analysis practices encourage reflection on the questions asked and formalization of queriesOne of the side-effects of computer-assisted text is that it forces the interpreter to think about their interpretative practices in order to use the tool. When you use an index or concordance you have ask yourself what word or concept you want to follow through the text. In order for a computer to aide in interpretation you need to describe the questions you bring to bear and then formalize them into queries that the computer can handle. To look for an abstract concept in a text with a concordance you have to ask what words or patterns would be indicative of the discussion of that concept. In the formalization the interpreter can learn about what they are asking and develop new questions. With interactive systems you start thinking you are asking about one question but in the formalization and refinement discover anomalies or other questions you want to ask. Interactive with a text through text analysis tools then becomes an interative practice of discovery with its own serindipidous paths comparable to, but not identical to, the serindipidous discovery that happen in rereading a text. Text analysis is a way of targeted rereading that tests intuitionsFor some, text analysis is a way of "thinking through" or targeted rereading. When interpreting a text you develop intuitions about what may be an interesting rereading of the evidence. Most intutions have implied quantitative or comparative elements of the sort "Rousseau is more interested in political issues while Diderot is interested in domestic philosophical issues." Such intuitions can be checked with text analysis. What language use would we expect in a philosopher interested in state issues rather than personal or domestic issues? Is there a higher frequency of certain words in Rosseau? In this formal thinking through of intuitions one doesn't necessarily prove the intuition, instead one uses interpretative aides in practices of reflection. Interestingly, these practices are often not shared as formal methods in the humanities. The resulting papers we write often don't mention the text analysis just as they don't mention the library practices. The practices are undertheorized in the humanities, which is one of the reasons for this Text Analysis Developers Alliance wiki. It was set up to provide a place for those interested in thinking about computer-assisted interpretative practices, associated tools, and how they can be developed. You are already doing text analysisAs an increasing amount of the evidence that we use is accessed electronically online, we are forced to use analytical techniques in everyday research. When you Google for documents and then follow trails through the web you are using sophisticated tools that rank results of queries for words or combinations of words. Few understand how Google operates, but we learn to use it. Likewise when we search bibliographic databases for articles and then read them online we are using computer research tools. To use large full-text databases we learn to search for keywords and we learn to interpret the resulting views. This is simple text analysis, aren't you bothered by not be able to ask more refined questions of an electronic text? Doesn't it bother you to not know how the search works when you don't get what you expected? Text analysis as a practice is reflective. Text analysis researchers and developers are asking about the tools and the way they constrain or make possible practices of reflection. Starting pointsOne resource TAPoR is adding to this wiki is a collection of Recipes which outline research practices that can be enabled by text analysis tools and give concrete examples. | |||||||
|
| ||||||||
| Line: 49 to 78 | ||||||||
|---|---|---|---|---|---|---|---|---|
| Electronic texts digitally represent oral or written language in a form suitable for analysis with a computer. Typically an electronic text is either | ||||||||
| Changed: | ||||||||
| < < |
a electronic version of a written work, an electronic | |||||||
| > > |
an electronic version of a written work, an electronic | |||||||
| version of a transcript of an oral event, or a document composed on the computer. In any case the information in an electronic text is meant | ||||||||
| Line: 139 to 168 | ||||||||
|
| ||||||||
| Changed: | ||||||||
| < < |
sciences use electronic text collections to | |||||||
| > > |
sciences use electronic text collections to find | |||||||
| passages where issues are discussed and to retrieve | ||||||||
| Changed: | ||||||||
| < < |
documents to their questions. | |||||||
| > > |
documents that are relevant to their questions. | |||||||
|
| ||||||||
| Line: 202 to 231 | ||||||||
|
| ||||||||
| Changed: | ||||||||
| < < |
which to teach languages. They work linguists | |||||||
| > > |
which to teach languages. They work with linguists | |||||||
|
to develop text collections with which to train
translation systems. | ||||||||
| Line: 298 to 327 | ||||||||
| towards interactive models that assumed that the scholar would have access to the tools and a collection of e-texts for personal study. It is the access | ||||||||
| Changed: | ||||||||
| < < |
to research electronic texts that we need to ensure through the preservation of research text data. | |||||||
| > > |
to research-quality electronic texts that we need to ensure. | |||||||
|
One of the best known examples of this shift from batch to interactive concording is TACT which was developed at the University of Toronto and | ||||||||
| Line: 374 to 402 | ||||||||
| for researchers. The DOE has a full-text database of all the significant works written in Old English. There is no organization at present | ||||||||
| Changed: | ||||||||
| < < |
in Canada that can provide access, let alone preserve, this important resource. The project | |||||||
| > > |
in Canada that can provide access to, let alone preserve over the long term, this important research resource. The project | |||||||
| had to look abroad for electronic text delivery support. Further, the dictionary itself, while it will be published on paper, will also exist | ||||||||
| Line: 393 to 421 | ||||||||
|
| ||||||||
| Changed: | ||||||||
| < < |
text-analysis tools TACT was developed at the | |||||||
| > > |
text-analysis tools, TACT, was developed at the | |||||||
| University of Toronto. Text and document tools | ||||||||
| Changed: | ||||||||
| < < |
are not only be created at Canadian universities; | |||||||
| > > |
are not only created at Canadian universities; | |||||||
| an important commercial document management environment, Livelink by OpenText?, benefited from research into text retrieval at the University | ||||||||
| Line: 472 to 500 | ||||||||
|
| ||||||||
| Changed: | ||||||||
| < < |
United Kingdom which has set up a Arts and Humanities | |||||||
| > > |
United Kingdom which has set up an Arts and Humanities | |||||||
| Data Service. What is at stake is the preservation of our textual heritage. The electronic texts of today will provide rich resources for future | ||||||||
| Changed: | ||||||||
| < < |
study including a works of interests to those | |||||||
| > > |
study including works of interest to those | |||||||
| who want to study our age and this transition from analogue to digital scholarship. | ||||||||
| <<O>> Difference Topic WhatTA (r1.3 - 30 Jun 2005 - VikaZafrin) |
What is Text Analysis?Short and longer discussions of what text analysis is. | ||||||||
| Line: 545 to 545 | ||||||||
|---|---|---|---|---|---|---|---|---|
|
Project Gutenberg: http://promo.net/pg/ Text Encoding Initiative: http://www.uic.edu/orgs/tei/ | ||||||||
| Changed: | ||||||||
| < < |
Humbol Humanities Hub: http://www.humbul.ac.uk/ | |||||||
| > > |
Humbul Humanities Hub: http://www.humbul.ac.uk/ | |||||||
|
| ||||||||
| <<O>> Difference Topic WhatTA (r1.2 - 17 May 2005 - GeoffreyRockwell) |
| ||||||||
| Changed: | ||||||||
| < < |
What is Text Analysis? | |||||||
| > > |
What is Text Analysis?Short and longer discussions of what text analysis is.What is text analysis? A short answer.by Geoffrey Rockwell Word processors have searching tools that allow you to find a word or phrase. Such finding tools can be used as a simple text analysis environment. Your word processor is not, however, suited to searching large texts interactively, nor does it show you the results of a search in a way that can help you understand a text. Computer assisted text analysis environments do three types of things beyond what the "Find" tool of a word processor might do:
What are electronic texts and how can we analyze them? A longer answer. | |||||||
| Based on Electronic Texts and Text Analysis by Geoffrey Rockwell and Ian Lancashire | ||||||||
| <<O>> Difference Topic WhatTA (r1.1 - 03 May 2005 - GeoffreyRockwell) |
| Line: 1 to 1 | ||||||||
|---|---|---|---|---|---|---|---|---|
| Added: | ||||||||
| > > |
What is Text Analysis?Based on Electronic Texts and Text Analysis by Geoffrey Rockwell and Ian Lancashire
| |||||||
|
Revision r1.1 - 03 May 2005 - 14:22 - GeoffreyRockwell Revision r1.9 - 16 Mar 2011 - 13:05 - GeoffreyRockwell |