Main.RecipeTwentyTwo (r1.1 vs. r1.13)
Diffs

 <<O>>  Difference Topic RecipeTwentyTwo (r1.13 - 22 Oct 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Add a French language Text to TAPoR

Line: 18 to 18

Ingredients

  • An electronic text in the French language
Changed:
<
<
>
>

  • A Text Editor capable of converting between character encodings

Line: 31 to 31

  1. Confirm that the French language web page that you wish to use is encoded properly ;
  2. Log in to TAPoR;
  3. Add your encoded French language text file to MyTexts ;
Changed:
<
<
  1. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
  2. Explore an accented word individually using Find Words - Concordance Tool
>
>
  1. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
  2. Explore an accented word individually using TAPoR Find Words - Concordance Tool

Discussion

Line: 71 to 71

Changed:
<
<
-- ShawnDay 21 June 2006
>
>
-- ShawnDay 20 October 2006

META FILEATTACHMENT 22PrintableRecipeTwentyTwo.pdf attr="h" comment="" date="1148401580" path="22PrintableRecipeTwentyTwo.pdf" size="196344" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic RecipeTwentyTwo (r1.12 - 27 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Add a French language Text to TAPoR

Line: 38 to 38

Text Editors

Changed:
<
<
You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through the *NotePad* and under Macintosh OSX through *TextEdit*. On Unix-based systems, you will find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.
>
>
You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through NotePad and under Macintosh OSX through TextEdit. On Unix-based systems, you will find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.

  • Instructions for saving as UTF-8 or Latin-1 using NotePad
  • Instructions for saving as UTF-8 or Latin-1 using TextEdit
  • Instructions for saving as UTF-8 or Latin-1 using MicrosoftWord

 <<O>>  Difference Topic RecipeTwentyTwo (r1.11 - 26 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Recipe 22:
Add a French language Text to TAPoR

>
>

Add a French language Text to TAPoR



This recipe takes a French language text and adds it to the TAPoR workspace for textual analysis.

Line: 23 to 23

  • A Text Editor capable of converting between character encodings

Changed:
<
<
wrench.gif This recipe is applied to a sample text in Exercise 22 Add a French language Text to TAPoR
>
>
wrench.gif This recipe is applied to a sample text in Exercise to Add a French language Text to TAPoR

Steps

Line: 62 to 62

Next Steps/Further Information

Changed:
<
<
>
>

Line: 71 to 71

Changed:
<
<
-- ShawnDay 21 May 2006
>
>
-- ShawnDay 21 June 2006

META FILEATTACHMENT 22PrintableRecipeTwentyTwo.pdf attr="h" comment="" date="1148401580" path="22PrintableRecipeTwentyTwo.pdf" size="196344" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic RecipeTwentyTwo (r1.10 - 23 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Recipe 22:
Add a French language Text to TAPoR

Line: 12 to 12

TOC: No TOC in "Main.RecipeTwentyTwo"

Changed:
<
<
This recipe and exercise will soon be available as a PDF download.
>
>
This recipe is available as a PDF download.

Ingredients

Line: 38 to 38

Text Editors

Changed:
<
<
You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through the *NotePad* and under Macintosh OSX through *TextEdit*. On a Unix-based systems, you will find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.
>
>
You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through the *NotePad* and under Macintosh OSX through *TextEdit*. On Unix-based systems, you will find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.

  • Instructions for saving as UTF-8 or Latin-1 using NotePad
  • Instructions for saving as UTF-8 or Latin-1 using TextEdit
  • Instructions for saving as UTF-8 or Latin-1 using MicrosoftWord

Web Page Encoding

Changed:
<
<
To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply got to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding. If you view the page source for your web page, it may contain the HTML line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
or
>
>
To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply go to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding. If you view the page source for your web page, it may contain the HTML line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />or

"<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>" Which will indicate that it is encoded properly for text analysis.
Line: 71 to 71

Changed:
<
<
-- ShawnDay – 1 May 2006
>
>
-- ShawnDay 21 May 2006

Added:
>
>
META FILEATTACHMENT 22PrintableRecipeTwentyTwo.pdf attr="h" comment="" date="1148401580" path="22PrintableRecipeTwentyTwo.pdf" size="196344" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic RecipeTwentyTwo (r1.9 - 01 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Recipe 22 :
Add a French language Text to TAPoR

>
>

Recipe 22:
Add a French language Text to TAPoR



Changed:
<
<
This recipe takes a French language text and adds it to the TAPoR workspace for further textual analysis.

ALERT!

This recipe will ensure that the fundamental task of loading text into a text analysis enviroment is accomplished correctly. For proper analysis of french text to be possible, the text must be interpreted by the computer in the same way in which you do, including accented characters. Because different operating systems and applications deal with text entry and storage differently there are a variety of ways in which text can be Encoded. Therefore, it is necessary to take steps to ensure that the computer sees the text in the same way that you do. This recipe will ensure that your text has been entered and encoded properly for analsysis and that you can enter search terms and parameters from your browser to complete the analytical tasks. The accompanying exercise also identifies things which can go wrong with this process.
>
>
This recipe takes a French language text and adds it to the TAPoR workspace for textual analysis.

Added:
>
>

ALERT!

This recipe ensures that the fundamental task of loading text into a text analysis environment is accomplished correctly. For proper analysis, the text must be interpreted by the computer in the same way in which you enter it, including accented characters. There are a variety of ways in which text can be encoded by operating systems and applications during text entry and storage. This recipe will ensure that your text has been entered and encoded properly for analysis and that you can enter search terms and parameters from your browser to complete analytical tasks.


TOC: No TOC in "Main.RecipeTwentyTwo"

Line: 20 to 20

Changed:
<
<
  • A Text Editor Capable of Converting between encoded languages
>
>
  • A Text Editor capable of converting between character encodings

wrench.gif This recipe is applied to a sample text in Exercise 22 Add a French language Text to TAPoR

Steps

Changed:
<
<
  1. Prepare Text in an external editor to ensure that it is encoded correctly ;

    or
  2. Confirm that the French language web page that you wish to use is endoded properly ;
>
>
  1. Prepare Text in an external editor to ensure that it is encoded correctly ;
    or
  2. Confirm that the French language web page that you wish to use is encoded properly ;

  1. Log in to TAPoR;
Changed:
<
<
  1. Add your encoded French language text file to MyTexts ;
>
>
  1. Add your encoded French language text file to MyTexts ;

  1. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
Changed:
<
<
  1. Explore the words found individually using Find Words - Concordance Tool
>
>
  1. Explore an accented word individually using Find Words - Concordance Tool

Discussion

Text Editors

Changed:
<
<
You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through the NotePad and under Macintosh OSX through TextEdit. On a Unix-based systems, you should find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analsysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.
  • Instructions for saving as UTF8 or Latin1 using NotePad
  • Instructions for saving as UTF8 or Latin1 using TextEdit
  • Instructions for saving as UTF8 or Latin1 using MicrosoftWord
>
>
You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through the *NotePad* and under Macintosh OSX through *TextEdit*. On a Unix-based systems, you will find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.
  • Instructions for saving as UTF-8 or Latin-1 using NotePad
  • Instructions for saving as UTF-8 or Latin-1 using TextEdit
  • Instructions for saving as UTF-8 or Latin-1 using MicrosoftWord

Web Page Encoding

Changed:
<
<
To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply got to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding. If you view the page source for your web page, it may contain the html line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
or
>
>
To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply got to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding. If you view the page source for your web page, it may contain the HTML line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
or

"<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>"
Added:
>
>
Which will indicate that it is encoded properly for text analysis.

Glossary


 <<O>>  Difference Topic RecipeTwentyTwo (r1.8 - 01 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Recipe 22 :
Add a French language Text to TAPoR

Line: 6 to 6

This recipe takes a French language text and adds it to the TAPoR workspace for further textual analysis.

Changed:
<
<
ALERT! This recipe will ensure that the fundamental task of loading text into a text analysis enviroment is accomplished correctly. For proper analysis of french text to be possible, the text must be interpreted by the computer in the same way in which you do, including accented characters. Because different operating systems and applications deal with text entry and storage differently there are a variety of ways in which text can be Encoded. Therefore, it is necessary to take steps to ensure that the computer sees the text in the same way that you do. This recipe will ensure that your text has been entered and encoded properly for analsysis and that you can enter search terms and parameters from your browser to complete the analytical tasks. The accompanying exercise also identifies things which can go wrong with this process.
>
>
ALERT!

This recipe will ensure that the fundamental task of loading text into a text analysis enviroment is accomplished correctly. For proper analysis of french text to be possible, the text must be interpreted by the computer in the same way in which you do, including accented characters. Because different operating systems and applications deal with text entry and storage differently there are a variety of ways in which text can be Encoded. Therefore, it is necessary to take steps to ensure that the computer sees the text in the same way that you do. This recipe will ensure that your text has been entered and encoded properly for analsysis and that you can enter search terms and parameters from your browser to complete the analytical tasks. The accompanying exercise also identifies things which can go wrong with this process.

TOC: No TOC in "Main.RecipeTwentyTwo"

Line: 46 to 46

Web Page Encoding

To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply got to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding.
Changed:
<
<
If you view the page source for your web page, it should contain the html line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

or
>
>
If you view the page source for your web page, it may contain the html line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
or

"<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>"

Glossary


 <<O>>  Difference Topic RecipeTwentyTwo (r1.7 - 01 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Recipe 22 : Add a French language Text to TAPoR

>
>

Recipe 22 :
Add a French language Text to TAPoR



This recipe takes a French language text and adds it to the TAPoR workspace for further textual analysis.

Changed:
<
<
ALERT! This recipe will ensure that the fundamental task of loading text into a text analysis enviroment is accomplished correctly. For proper analysis of french text to be possible, the text that you wish to analyse must be interpreted by the computer in the same way in which you do, including accented characters. Because different operating systems and applications deal with text entry and storage differently there are a variety of ways in which text can be Encoded. Therefore, it is necessary to take steps to ensure that the computer sees the text in the same way that you do. This recipe will ensure that your text has been entered and encoded properly for analsysis and that you can enter search terms and parameters from your browser to complete the analytical tasks.
>
>
ALERT! This recipe will ensure that the fundamental task of loading text into a text analysis enviroment is accomplished correctly. For proper analysis of french text to be possible, the text must be interpreted by the computer in the same way in which you do, including accented characters. Because different operating systems and applications deal with text entry and storage differently there are a variety of ways in which text can be Encoded. Therefore, it is necessary to take steps to ensure that the computer sees the text in the same way that you do. This recipe will ensure that your text has been entered and encoded properly for analsysis and that you can enter search terms and parameters from your browser to complete the analytical tasks. The accompanying exercise also identifies things which can go wrong with this process.

TOC: No TOC in "Main.RecipeTwentyTwo"

Line: 46 to 46

Web Page Encoding

To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply got to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding.
Changed:
<
<
If you view the page source for your web page, it should contain the html line:
""
>
>
If you view the page source for your web page, it should contain the html line:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

or
"<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>"

Glossary


 <<O>>  Difference Topic RecipeTwentyTwo (r1.6 - 01 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Recipe 22 : Add a French language Text to TAPoR


Changed:
<
<
This recipe takes a Franch language text and adds it to the TAPoR workspace for further textual analysis.
>
>
This recipe takes a French language text and adds it to the TAPoR workspace for further textual analysis.

ALERT! This recipe will ensure that the fundamental task of loading text into a text analysis enviroment is accomplished correctly. For proper analysis of french text to be possible, the text that you wish to analyse must be interpreted by the computer in the same way in which you do, including accented characters. Because different operating systems and applications deal with text entry and storage differently there are a variety of ways in which text can be Encoded. Therefore, it is necessary to take steps to ensure that the computer sees the text in the same way that you do. This recipe will ensure that your text has been entered and encoded properly for analsysis and that you can enter search terms and parameters from your browser to complete the analytical tasks.

TOC: No TOC in "Main.RecipeTwentyTwo"

Line: 61 to 64

Added:
>
>

Changed:
<
<
-- ShawnDay – 25 April 2006
>
>
-- ShawnDay – 1 May 2006


 <<O>>  Difference Topic RecipeTwentyTwo (r1.5 - 25 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Recipe 22 : Add a French language Text to TAPoR


Changed:
<
<
This recipe takes an appropriately encoded text and adds it to the TAPoR workspace for further textual analysis.
>
>
This recipe takes a Franch language text and adds it to the TAPoR workspace for further textual analysis.

TOC: No TOC in "Main.RecipeTwentyTwo"

Line: 17 to 17

Changed:
<
<
  • A Text Editor Capable of Converting between encoded languages
>
>
  • A Text Editor Capable of Converting between encoded languages

wrench.gif This recipe is applied to a sample text in Exercise 22 Add a French language Text to TAPoR

Steps

Changed:
<
<
  1. Prepare Text in an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX ;
>
>
  1. Prepare Text in an external editor to ensure that it is encoded correctly ;

    or
  2. Confirm that the French language web page that you wish to use is endoded properly ;

  1. Log in to TAPoR;
Changed:
<
<
  1. Add your French language text file to MyTexts;
>
>
  1. Add your encoded French language text file to MyTexts ;

  1. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
Changed:
<
<
  1. Explore the words found individually using Find Words - Concordance Tool to determine their context;
>
>
  1. Explore the words found individually using Find Words - Concordance Tool

Discussion

Added:
>
>

Text Editors

You may require a text editor to encode your text into UTF-8 or Latin-1 to maintain the accents and special characters in the textual language. On a Windows system, this can be done through the NotePad and under Macintosh OSX through TextEdit. On a Unix-based systems, you should find a text editor installed as part of the standard system install. Word processors typically provide a much deeper tool set for formatting text and generally save documents in their native format which is not appropriate for importing into a text analsysis environment. However, they too can be used to save a plain text file with appropriate encoding by following the appropriate steps.
  • Instructions for saving as UTF8 or Latin1 using NotePad
  • Instructions for saving as UTF8 or Latin1 using TextEdit
  • Instructions for saving as UTF8 or Latin1 using MicrosoftWord

Web Page Encoding

To verify that the web page that you wish to import into TAPoR is encoded in either UTF-8 or Latin-1, you need to check the browser settings. In Internet Explorer, simply got to the View Menu and select the Encoding Option. This should read Unicode (UTF-8). On Firefox, the option is Character Encoding under the View menu. This should also read Unicode (UTF-8). If this is not the case, then you can manually select the encoding you wish to use from this menu. On other web browsers, the process should be similar. Please consult their help files for specific instructions on character encoding. If you view the page source for your web page, it should contain the html line:
""

Glossary

Text Encoding
One of the most important aspects of the text input process is the encoding of the text which you are working with. It must be encoded as either UTF8 or Latin-1, which provides proper mapping of accented and other extended characters. See the links below for more background information on encoding processes. For example, when propoerly encoded the charcter 'e' is differentiated from the character 'é' and 'é' is not seen as the character 'e' + some symbol.

UTF-8 (8-bit Unicode Character Encoding)
Unicode character encoding is an evolution of the ASCII set to permit support of a greater number of alphanumeric characters including those with diacritical marks such as accents. More information on UTF-8 is available at:Wikipedia
Added:
>
>

Latin-1 (ISO 8859-1)
Latin-1 character encoding is an evolution of the ASCII character set to permit support of a greater number of alphanumeric characters including those with diacritical marks such as accents. It is being supplanted by the UTF-8 Character Encoding More information on Latin-1 is available at: Wikipedia

MyTexts
This is an area of the TAPoR in which you collect your private texts for analysis. It is also a portal to access publicly available texts which have been added by other users. In this area you can view the catalogue of texts available to you or add, edit, tag, and view the contents of specific texts.

A Complete Glossary

Line: 49 to 65

Changed:
<
<
-- ShawnDay – 23 April 2006
>
>
-- ShawnDay – 25 April 2006


 <<O>>  Difference Topic RecipeTwentyTwo (r1.4 - 25 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Recipe 22 : Add a French language Text to TAPoR

Line: 24 to 24

Steps

Changed:
<
<
  1. Prepare Text in an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX ;
>
>
  1. Prepare Text in an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX ;

  1. Log in to TAPoR;
  2. Add your French language text file to MyTexts;
  3. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
Line: 32 to 32

Discussion

Deleted:
<
<

Text Encoding

The most important aspect of this input process is the encoding of the text which you are working with. It must be encoded as either UTF8 or Latin-1, which allows for a proper mapping of accented charcaters and an extended character set. See the links below for more background information on encoding processes.


Glossary

Changed:
<
<
>
>

Text Encoding
One of the most important aspects of the text input process is the encoding of the text which you are working with. It must be encoded as either UTF8 or Latin-1, which provides proper mapping of accented and other extended characters. See the links below for more background information on encoding processes. For example, when propoerly encoded the charcter 'e' is differentiated from the character 'é' and 'é' is not seen as the character 'e' + some symbol.

UTF-8 (8-bit Unicode Character Encoding)
Unicode character encoding is an evolution of the ASCII set to permit support of a greater number of alphanumeric characters including those with diacritical marks such as accents. More information on UTF-8 is available at:Wikipedia

A Complete Glossary


 <<O>>  Difference Topic RecipeTwentyTwo (r1.3 - 24 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Recipe 22 : Add a French language Text to TAPoR

>
>

Recipe 22 : Add a French language Text to TAPoR



This recipe takes an appropriately encoded text and adds it to the TAPoR workspace for further textual analysis.

Line: 49 to 49

Added:
>
>

Changed:
<
<
-- ShawnDay – 20 April 2006
>
>
-- ShawnDay – 23 April 2006


 <<O>>  Difference Topic RecipeTwentyTwo (r1.2 - 23 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Recipe 22 : Add a French language Text to TAPoR

>
>

Recipe 22 : Add a French language Text to TAPoR



This recipe takes an appropriately encoded text and adds it to the TAPoR workspace for further textual analysis.

Line: 24 to 24

Steps

Changed:
<
<
  1. Prepare Text in an external editor such as [[][for a Windows-based PC]] or for Mac OSX?;
>
>
  1. Prepare Text in an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX ;

  1. Log in to TAPoR;
  2. Add your French language text file to MyTexts;
  3. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
Line: 32 to 32

Discussion

Changed:
<
<
#Encoding
>
>

Added:
>
>

Text Encoding


The most important aspect of this input process is the encoding of the text which you are working with. It must be encoded as either UTF8 or Latin-1, which allows for a proper mapping of accented charcaters and an extended character set. See the links below for more background information on encoding processes.

Glossary

Line: 50 to 51

Changed:
<
<
-- ShawnDay – 18 April 2006
>
>
-- ShawnDay – 20 April 2006


 <<O>>  Difference Topic RecipeTwentyTwo (r1.1 - 18 Apr 2006 - ShawnDay)
Line: 1 to 1
Added:
>
>
META TOPICPARENT TaporRecipes


Recipe 22 : Add a French language Text to TAPoR


This recipe takes an appropriately encoded text and adds it to the TAPoR workspace for further textual analysis.

This recipe and exercise will soon be available as a PDF download.

Ingredients

wrench.gif This recipe is applied to a sample text in Exercise 22 Add a French language Text to TAPoR

Steps

  1. Prepare Text in an external editor such as [[][for a Windows-based PC]] or for Mac OSX?;
  2. Log in to TAPoR;
  3. Add your French language text file to MyTexts;
  4. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
  5. Explore the words found individually using Find Words - Concordance Tool to determine their context;

Discussion

#Encoding

The most important aspect of this input process is the encoding of the text which you are working with. It must be encoded as either UTF8 or Latin-1, which allows for a proper mapping of accented charcaters and an extended character set. See the links below for more background information on encoding processes.

Glossary

A Complete Glossary

Next Steps/Further Information

-- ShawnDay – 18 April 2006


Topic: RecipeTwentyTwo . { View | Diffs | r1.13 | > | r1.12 | > | r1.11 | More }

Revision r1.1 - 18 Apr 2006 - 13:44 - ShawnDay
Revision r1.13 - 22 Oct 2006 - 21:13 - ShawnDay