Main.ExerciseTwentyTwo (r1.1 vs. r1.18)
Diffs

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.18 - 21 Mar 2007 - ShawnDay)

META TOPICPARENT TaporRecipes


Add a French language Text to TAPoR

Line: 10 to 10

TOC: No TOC in "Main.ExerciseTwentyTwo"

Deleted:
<
<
This recipe and exercise will soon be available as a PDF download.

Exercise Steps

Line: 43 to 41

Changed:
<
<
-- ShawnDay - 21 October 2006
>
>
-- ShawnDay - 21 March 2007

META FILEATTACHMENT frenchText.txt attr="h" comment="" date="1146493264" path="frenchText.txt" size="4493" user="ShawnDay" version="1.1"
META FILEATTACHMENT talker.gif attr="" comment="" date="1150122874" path="talker.gif" size="2419" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.17 - 23 Oct 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Add a French language Text to TAPoR

Line: 24 to 24

  1. Your text should now be saved in UTF8 format.
  2. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file. Add an appropriate tag then click the Add Text button at the bottom of the Coplet.
  3. When you receive the message that the text has been added successfully, click the button indicated to refresh your text list.
Changed:
<
<
  1. To test whether the encoding has been completed correctly, generate a word list using the TAPoR List Words Tool. Use the default parameters.
>
>
  1. To test whether the encoding has been completed correctly, generate a word list using the TAPoR List Words Tool. Use the default parameters.

  1. If the document has been successfully encoded and imported, you should obtain a results such as:

    Summary: There are 378 unique words
    and there are 744 words in total. 304 words occurred
    once and 27 words occurred twice.
    WordsCounts
    La------40
    De------39
    Pouvoir------23
    Soif------20
    Et------19
    Les------17
    L------15
    Des------15
    Du------15
    Le------11
    D------10
    Il------9
    Que------8
    Est------8
    à------7
    Une------6
    Qui------6
    En------6
    Dans------6
    S------5
    ------5
    Sa------5
    Un------5


Changed:
<
<
  1. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
>
>
  1. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.

  1. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
  2. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables


  3. Congratulations, you are all ready to work with French language text within the TAPoR environment.
Line: 43 to 43

Changed:
<
<
-- ShawnDay - 26 June 2006
>
>
-- ShawnDay - 21 October 2006

META FILEATTACHMENT frenchText.txt attr="h" comment="" date="1146493264" path="frenchText.txt" size="4493" user="ShawnDay" version="1.1"
META FILEATTACHMENT talker.gif attr="" comment="" date="1150122874" path="talker.gif" size="2419" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.16 - 05 Jul 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Add a French language Text to TAPoR

Line: 28 to 28

  1. If the document has been successfully encoded and imported, you should obtain a results such as:

    Summary: There are 378 unique words
    and there are 744 words in total. 304 words occurred
    once and 27 words occurred twice.
    WordsCounts
    La------40
    De------39
    Pouvoir------23
    Soif------20
    Et------19
    Les------17
    L------15
    Des------15
    Du------15
    Le------11
    D------10
    Il------9
    Que------8
    Est------8
    à------7
    Une------6
    Qui------6
    En------6
    Dans------6
    S------5
    ------5
    Sa------5
    Un------5


  2. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
  3. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
Changed:
<
<
  1. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables


>
>
  1. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables



  1. Congratulations, you are all ready to work with French language text within the TAPoR environment.

Next Steps/Further Information


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.15 - 27 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Add a French language Text to TAPoR

Line: 15 to 15

Exercise Steps

Changed:
<
<

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.


>
>

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.



  1. Download this text from FrenchSample and save to your desktop.
  2. Log in to TAPoR;
  3. Choose the MyTexts tab in the portal.
Changed:
<
<
  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.

    When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type". This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.


>
>
  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.

    When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type". This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.



  1. To re-encode this text, open a text editor on your computer and follow the instructions at Recipe 22 for your particular operating system. Save on your desktop with the filename MyFrenchText.txt.
  2. Your text should now be saved in UTF8 format.
  3. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file. Add an appropriate tag then click the Add Text button at the bottom of the Coplet.
Line: 28 to 28

  1. If the document has been successfully encoded and imported, you should obtain a results such as:

    Summary: There are 378 unique words
    and there are 744 words in total. 304 words occurred
    once and 27 words occurred twice.
    WordsCounts
    La------40
    De------39
    Pouvoir------23
    Soif------20
    Et------19
    Les------17
    L------15
    Des------15
    Du------15
    Le------11
    D------10
    Il------9
    Que------8
    Est------8
    à------7
    Une------6
    Qui------6
    En------6
    Dans------6
    S------5
    ------5
    Sa------5
    Un------5


  2. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
  3. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
Changed:
<
<
  1. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables comme


>
>
  1. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables



  1. Congratulations, you are all ready to work with French language text within the TAPoR environment.

Next Steps/Further Information


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.14 - 26 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Exercise 22
Add a French language Text to TAPoR

>
>

Add a French language Text to TAPoR



Changed:
<
<
This exercise uses Recipe 22 to import a French language into the TAPoR text analysis environment.
>
>
This exercise uses this Recipe to import a French language into the TAPoR text analysis environment.

This exercise applies the recipe to a textual example which is freely available on the Internet so you can complete the steps yourself and see the results.

Line: 15 to 15

Exercise Steps

Changed:
<
<

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.


>
>

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.



  1. Download this text from FrenchSample and save to your desktop.
  2. Log in to TAPoR;
  3. Choose the MyTexts tab in the portal.
Changed:
<
<
  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.
  2. When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type". This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.
>
>
  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.

    When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type". This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.



  1. To re-encode this text, open a text editor on your computer and follow the instructions at Recipe 22 for your particular operating system. Save on your desktop with the filename MyFrenchText.txt.
  2. Your text should now be saved in UTF8 format.
  3. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file. Add an appropriate tag then click the Add Text button at the bottom of the Coplet.
Line: 30 to 29

  1. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
  2. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
  3. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables comme


Added:
>
>
  1. Congratulations, you are all ready to work with French language text within the TAPoR environment.

Next Steps/Further Information

Line: 42 to 42

Deleted:
<
<

Comments on this Recipe

Use this box to quickly add a comment to the page.

Changed:
<
<
-- ShawnDay - 12 June 2006
>
>
-- ShawnDay - 26 June 2006

META FILEATTACHMENT frenchText.txt attr="h" comment="" date="1146493264" path="frenchText.txt" size="4493" user="ShawnDay" version="1.1"
META FILEATTACHMENT talker.gif attr="" comment="" date="1150122874" path="talker.gif" size="2419" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.13 - 12 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.12 - 12 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR

Line: 15 to 15

Exercise Steps

Changed:
<
<

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.


>
>

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.



  1. Download this text from FrenchSample and save to your desktop.
  2. Log in to TAPoR;
  3. Choose the MyTexts tab in the portal.
Line: 26 to 26

  1. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file. Add an appropriate tag then click the Add Text button at the bottom of the Coplet.
  2. When you receive the message that the text has been added successfully, click the button indicated to refresh your text list.
  3. To test whether the encoding has been completed correctly, generate a word list using the TAPoR List Words Tool. Use the default parameters.
Changed:
<
<
  1. If the document has been successfully encoded and imported, you should obtain a results such as:

    Summary: There are 378 unique words
    and there are 744 words in total. 304 words occurred
    once and 27 words occurred twice.
    WordsCounts
    La------40
    De------39
    Pouvoir------23
    Soif------20
    Et------19
    Les------17
    L------15
    Des------15
    Du------15
    Le------11
    D------10
    Il------9
    Que------8
    Est------8
    à------7
    Une------6
    Qui------6
    En------6
    Dans------6
    S------5
    ------5
    Sa------5
    Un------5


>
>
  1. If the document has been successfully encoded and imported, you should obtain a results such as:

    Summary: There are 378 unique words
    and there are 744 words in total. 304 words occurred
    once and 27 words occurred twice.
    WordsCounts
    La------40
    De------39
    Pouvoir------23
    Soif------20
    Et------19
    Les------17
    L------15
    Des------15
    Du------15
    Le------11
    D------10
    Il------9
    Que------8
    Est------8
    à------7
    Une------6
    Qui------6
    En------6
    Dans------6
    S------5
    ------5
    Sa------5
    Un------5



  1. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
  2. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
Changed:
<
<
  1. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables comme


>
>
  1. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables comme



Next Steps/Further Information


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.11 - 12 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR

Line: 15 to 15

Exercise Steps

Changed:
<
<
  1. This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
  2. The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.
>
>

This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.



  1. Download this text from FrenchSample and save to your desktop.
  2. Log in to TAPoR;
Changed:
<
<
  1. Choose the !MyTexts tab in the portal.
>
>
  1. Choose the MyTexts tab in the portal.

  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.
Changed:
<
<
  1. When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type".
  2. This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.
>
>
  1. When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type". This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.

  1. To re-encode this text, open a text editor on your computer and follow the instructions at Recipe 22 for your particular operating system. Save on your desktop with the filename MyFrenchText.txt.
  2. Your text should now be saved in UTF8 format.
  3. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file. Add an appropriate tag then click the Add Text button at the bottom of the Coplet.
Line: 32 to 30

  1. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
  2. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
  3. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables comme


Deleted:
<
<
  1. Conclude

Next Steps/Further Information

Line: 49 to 46

Use this box to quickly add a comment to the page.
Changed:
<
<
-- ShawnDay - 1 May 2006
  • (comment from GeoffreyRockwell - 24 May 2006 18:49:45): I would like to contribute to this.
>
>
-- ShawnDay - 12 June 2006

META FILEATTACHMENT frenchText.txt attr="h" comment="" date="1146493264" path="frenchText.txt" size="4493" user="ShawnDay" version="1.1"
Added:
>
>
META FILEATTACHMENT talker.gif attr="" comment="" date="1150122874" path="talker.gif" size="2419" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.10 - 12 Jun 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR

Line: 45 to 45

Added:
>
>

Comments on this Recipe

Use this box to quickly add a comment to the page.

-- ShawnDay - 1 May 2006
  • (comment from GeoffreyRockwell - 24 May 2006 18:49:45): I would like to contribute to this.

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.9 - 24 May 2006 - GeoffreyRockwell)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR

Line: 46 to 46

-- ShawnDay - 1 May 2006

Added:
>
>
  • (comment from GeoffreyRockwell - 24 May 2006 18:49:45): I would like to contribute to this.

META FILEATTACHMENT frenchText.txt attr="h" comment="" date="1146493264" path="frenchText.txt" size="4493" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.8 - 24 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR

Line: 16 to 16

Exercise Steps

  1. This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
Changed:
<
<
  1. The text is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.
>
>
  1. The text in this sample is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.

  1. Download this text from FrenchSample and save to your desktop.
  2. Log in to TAPoR;
Changed:
<
<
  1. Choose the MyTexts tab in the portal.
>
>
  1. Choose the !MyTexts tab in the portal.

  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.
  2. When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type".
  3. This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.7 - 01 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22
Add a French language Text to TAPoR

Line: 19 to 19

  1. The text is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.
  2. Download this text from FrenchSample and save to your desktop.
  3. Log in to TAPoR;
Changed:
<
<
  1. Choose the __MyTexts__ tab in the portal.
>
>
  1. Choose the MyTexts tab in the portal.

  1. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.
  2. When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type".
  3. This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.6 - 01 May 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Exercise 22

>
>

Exercise 22
Add a French language Text to TAPoR



Changed:
<
<
This exercise uses Recipe 22 to import a French language into the TAPoR text analysis environment.
>
>
This exercise uses Recipe 22 to import a French language into the TAPoR text analysis environment.

Changed:
<
<
This exercise applies the recipe to a textual example which is freely available on the internet so you can complete the steps yourself and see the results.
>
>
This exercise applies the recipe to a textual example which is freely available on the Internet so you can complete the steps yourself and see the results.

TOC: No TOC in "Main.ExerciseTwentyTwo"

Changed:
<
<
This recipe and exercise will soon be available as a PDF download.
>
>
This recipe and exercise will soon be available as a PDF download.

Exercise Steps

Changed:
<
<
  1. This exercise obtains French langiuage text from the Office de langue francaise document on the state of the language in Quebec. This doucument is available in a PDF format, so we will use the TAPoR PDF Transformer Tool to obtain an HTML document of the text.
>
>
  1. This exercise assumes that you have a text which is not encoded in a way that will allow for analysis to take place.
  2. The text is in French with accented characters but is encoded using Windows ASCII. Although it may appear properly on the screen of your word processor, it will not be interpreted properly when you attempt to analyse it in TAPoR.
  3. Download this text from FrenchSample and save to your desktop.

  1. Log in to TAPoR;
Changed:
<
<
  1. Select the TAPoR PDF Transformer Tool in the Workbench.
  2. Provide the URL http://www.oqlf.gouv.qc.ca/ressources/bibliotheque/sociolinguistique/oqlf_faslin_01_f_20050519.pdf when prompted for the document source.
  3. Choose to transform from page 7 to page 8 of the document.
  4. Save the text returned from the tool as a file on your computer. By default it will be named temporaryResults.xml.
  5. Although the text returned is saved in XML/UTF-8 format, many texts that you may wantt o work with may not be in this format. To encode these documents, complete with accents, so tat yuo may work with them they need to be encoded into the UTF-8 format.
  6. Launch an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX.
  7. Open the document, temporaryResults.xml, that you saved in the previous step.
  8. Although this document, as you can read in the first line, has been saved as UTF-8, for practise, save the file and choose UTF-8 as the save format from your text editor.
  9. Save the document as MyFrenchText?.txt.
  10. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file.
  11. *
  12. To test whether the encoding format will allow you to recognize the French accents, Generate a word list (sorted by frequency) using the TAPoR List Words Tool.
  13. If the document has been successfuly encoded and imported, you should obtain a results such as:

    Summary: There are 487 unique words and there are
    1135 words in total. 347 words occurred once
    and 61 words occurred twice.
    WordsCounts
    De------82
    La------54
    Les------36
    Et------34
    Des------25
    Langue------21
    à------19
    Le------18
    Du------15
    En------14
    A------12
    Par------12
    Un------10
    Québec------10
    Au------10
    Canada------10
    Langues------9
    Statistique------8
    Français------8
    Données------8
    Linguistiques------8
    été------8
    Région------2


  14. Build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. This will verify that you can enter diacritical characters correctly. In this case, we search for the word Données in the text.
  15. If you are capable of inputing characters correctly, you should see a result similar to:

    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

>
>
  1. Choose the __MyTexts__ tab in the portal.
  2. At the bottom of your list of texts, click the Add Text button. A tutorial on adding texts to TAPoR is available if you need more information on adding texts to TAPoR.
  3. When you choose to upload this text file, you will get the error message "Please correct these errors: File to upload: upload.invalid-type".
  4. This is because TAPoR checks this text and determines that it is encoded as Windows ASCII and cannot be used.
  5. To re-encode this text, open a text editor on your computer and follow the instructions at Recipe 22 for your particular operating system. Save on your desktop with the filename MyFrenchText.txt.
  6. Your text should now be saved in UTF8 format.
  7. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file. Add an appropriate tag then click the Add Text button at the bottom of the Coplet.
  8. When you receive the message that the text has been added successfully, click the button indicated to refresh your text list.
  9. To test whether the encoding has been completed correctly, generate a word list using the TAPoR List Words Tool. Use the default parameters.
  10. If the document has been successfully encoded and imported, you should obtain a results such as:

    Summary: There are 378 unique words
    and there are 744 words in total. 304 words occurred
    once and 27 words occurred twice.
    WordsCounts
    La------40
    De------39
    Pouvoir------23
    Soif------20
    Et------19
    Les------17
    L------15
    Des------15
    Du------15
    Le------11
    D------10
    Il------9
    Que------8
    Est------8
    à------7
    Une------6
    Qui------6
    En------6
    Dans------6
    S------5
    ------5
    Sa------5
    Un------5


  11. To test whether you can enter an accented word and search within this text, build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. In this example search for the word in the text.
  12. Do not copy and paste this search term. Enter this word using your normal keyboard technique for entering an accented character. If you are unsure of how to do this, a tutorial is available here.
  13. If you are capable of inputting characters correctly, you should see a result similar to:

    5 entries found.
    avant J . au moment les Juifs rêvaient encore d'une
    les programmes scolaires et partout cela est encore possible des
    La vie familiale d'abord , Chirac asservit les siens à
    d'une démocratisation mal négociée , le détenteur du pouvoir veut
    ouvrir des boîtes de pandore d'où surgissent des démons incontrôlables comme


  14. Conclude

Next Steps/Further Information

Line: 41 to 42

Added:
>
>

Added:
>
>
-- ShawnDay - 1 May 2006

Changed:
<
<
-- ShawnDay - 24 April 2006
>
>
META FILEATTACHMENT frenchText.txt attr="h" comment="" date="1146493264" path="frenchText.txt" size="4493" user="ShawnDay" version="1.1"

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.5 - 25 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22

Line: 15 to 15

Exercise Steps

Changed:
<
<
  1. This exercise obtains French langiuage text from the Office de langue francaise document on the state of the language in Quebec. This doucuemnt is available in a PDF format, so we will use the TAPoR PDF Transformer Tool to obtain an HTML document of the text.
>
>
  1. This exercise obtains French langiuage text from the Office de langue francaise document on the state of the language in Quebec. This doucument is available in a PDF format, so we will use the TAPoR PDF Transformer Tool to obtain an HTML document of the text.

  1. Log in to TAPoR;
Changed:
<
<
  1. Select the TAPoR PDF Transformer Tool in MyWorkbench?.
>
>
  1. Select the TAPoR PDF Transformer Tool in the Workbench.

  1. Provide the URL http://www.oqlf.gouv.qc.ca/ressources/bibliotheque/sociolinguistique/oqlf_faslin_01_f_20050519.pdf when prompted for the document source.
  2. Choose to transform from page 7 to page 8 of the document.
  3. Save the text returned from the tool as a file on your computer. By default it will be named temporaryResults.xml.

 <<O>>  Difference Topic ExerciseTwentyTwo (r1.4 - 24 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22

Line: 29 to 29

  1. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file.
  2. *
  3. To test whether the encoding format will allow you to recognize the French accents, Generate a word list (sorted by frequency) using the TAPoR List Words Tool.
Changed:
<
<
  1. If the document has been successfuly encoded and imported, you should obtain a results such as:

    Summary: There are 487 unique words and there are
    1135 words in total. 347 words occurred once
    and 61 words occurred twice.
    WordsCounts
    De------82
    La------54
    Les------36
    Et------34
    Des------25
    Langue------21
    à------19
    Le------18
    Du------15
    En------14
    A------12
    Par------12
    Un------10
    Québec------10
    Au------10
    Canada------10
    Langues------9
    Statistique------8
    Français------8
    Données------8
    Linguistiques------8
    été------8
    Région------2
>
>
  1. If the document has been successfuly encoded and imported, you should obtain a results such as:

    Summary: There are 487 unique words and there are
    1135 words in total. 347 words occurred once
    and 61 words occurred twice.
    WordsCounts
    De------82
    La------54
    Les------36
    Et------34
    Des------25
    Langue------21
    à------19
    Le------18
    Du------15
    En------14
    A------12
    Par------12
    Un------10
    Québec------10
    Au------10
    Canada------10
    Langues------9
    Statistique------8
    Français------8
    Données------8
    Linguistiques------8
    été------8
    Région------2


  2. Build a concordance using TAPoR Find Words - Concordance Tool and input a word that you know occurs within the text and includes diacritical marks. This will verify that you can enter diacritical characters correctly. In this case, we search for the word Données in the text.
  3. If you are capable of inputing characters correctly, you should see a result similar to:

    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX


Next Steps/Further Information

Line: 38 to 40

Added:
>
>

Changed:
<
<
-- ShawnDay - 20 April 2006
>
>
-- ShawnDay - 24 April 2006


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.3 - 23 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Changed:
<
<

Exercise 22 Exercise

>
>

Exercise 22



This exercise uses Recipe 22 to import a French language into the TAPoR text analysis environment.


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.2 - 23 Apr 2006 - ShawnDay)

META TOPICPARENT TaporRecipes


Exercise 22 Exercise

Line: 15 to 15

Exercise Steps

Changed:
<
<
  1. Prepare Text in an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX ;
>
>
  1. This exercise obtains French langiuage text from the Office de langue francaise document on the state of the language in Quebec. This doucuemnt is available in a PDF format, so we will use the TAPoR PDF Transformer Tool to obtain an HTML document of the text.

  1. Log in to TAPoR;
Changed:
<
<
  1. Add your French language text file to MyTexts;
  2. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
  3. Explore the words found individually using Find Words - Concordance Tool to determine their context;
>
>
  1. Select the TAPoR PDF Transformer Tool in MyWorkbench?.
  2. Provide the URL http://www.oqlf.gouv.qc.ca/ressources/bibliotheque/sociolinguistique/oqlf_faslin_01_f_20050519.pdf when prompted for the document source.
  3. Choose to transform from page 7 to page 8 of the document.
  4. Save the text returned from the tool as a file on your computer. By default it will be named temporaryResults.xml.
  5. Although the text returned is saved in XML/UTF-8 format, many texts that you may wantt o work with may not be in this format. To encode these documents, complete with accents, so tat yuo may work with them they need to be encoded into the UTF-8 format.
  6. Launch an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX.
  7. Open the document, temporaryResults.xml, that you saved in the previous step.
  8. Although this document, as you can read in the first line, has been saved as UTF-8, for practise, save the file and choose UTF-8 as the save format from your text editor.
  9. Save the document as MyFrenchText?.txt.
  10. Add this text file to MyTexts, by choosing the Add Text button in MyTexts and browse to find the saved file.
  11. *
  12. To test whether the encoding format will allow you to recognize the French accents, Generate a word list (sorted by frequency) using the TAPoR List Words Tool.
  13. If the document has been successfuly encoded and imported, you should obtain a results such as:

    Summary: There are 487 unique words and there are
    1135 words in total. 347 words occurred once
    and 61 words occurred twice.
    WordsCounts
    De------82
    La------54
    Les------36
    Et------34
    Des------25
    Langue------21
    à------19
    Le------18
    Du------15
    En------14
    A------12
    Par------12
    Un------10
    Québec------10
    Au------10
    Canada------10
    Langues------9
    Statistique------8
    Français------8
    Données------8
    Linguistiques------8
    été------8
    Région------2

Next Steps/Further Information

Changed:
<
<
  • Recipe 22 Add a French language Text to TAPoR?
>
>

Added:
>
>

-- ShawnDay - 20 April 2006


 <<O>>  Difference Topic ExerciseTwentyTwo (r1.1 - 23 Apr 2006 - ShawnDay)
Line: 1 to 1
Added:
>
>
META TOPICPARENT TaporRecipes


Exercise 22 Exercise


This exercise uses Recipe 22 to import a French language into the TAPoR text analysis environment.

This exercise applies the recipe to a textual example which is freely available on the internet so you can complete the steps yourself and see the results.

This recipe and exercise will soon be available as a PDF download.

Exercise Steps

  1. Prepare Text in an external editor such as UltraEdit for a Windows-based PC or BBedit for Mac OSX ;
  2. Log in to TAPoR;
  3. Add your French language text file to MyTexts;
  4. Generate a word list (sorted by frequency) using the TAPoR List Words Tool;
  5. Explore the words found individually using Find Words - Concordance Tool to determine their context;

Next Steps/Further Information

-- ShawnDay - 20 April 2006


Topic: ExerciseTwentyTwo . { View | Diffs | r1.18 | > | r1.17 | > | r1.16 | More }

Revision r1.1 - 23 Apr 2006 - 19:19 - ShawnDay
Revision r1.18 - 21 Mar 2007 - 23:30 - ShawnDay