Main.TAPoRwareHTMLListWords (r1.1 vs. r1.29)
Diffs

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.29 - 06 Jun 2008 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 86 to 86

Changed:
<
<
Here is the code for this function
>
>
Here is the code for this button interface:

<form method="post" name="htmlForm" enctype="multipart/form-data" target="_blank" action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/hlistword.cgi"


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.28 - 30 May 2008 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 128 to 128

    • sorting -- values are same as parameter "listsort" in the CGI interface above
    • outFormat -- values are same as parameter "listdisp" in the CGI interface above
Added:
>
>

REST Service Interface

The TAPoRware REST service uses plain HTTP, so you can submit a request with either the POST or the GET method.

  • Service URI: http://tapor1-dev.mcmaster.ca/~restserv/html/listwords.
  • Parameters:
    • htmlInput -- any HTML text
    • htmlTag -- any valid HTML tag in your submitted HTML text
    • listOption -- an HTML single-valued selection control (select or radio control). The values are the same as for parameter "range" in the CGI interface above.
    • optionSeletion -- values depend on the "list option"
    • sorting -- a select control. The values are the same as for parameter "listsort" in the CGI interface above
    • outFormat -- a select control for output format. The values are the same as for parameter "listdisp" in the CGI interface above
  • See sample interface in http://tapor1-dev.mcmaster.ca/~restserv/html/listwordtool
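A request to this service could be put together along the following lines. This is only a sketch: the page says the service accepts POST or GET but does not say how the parameters are encoded, so ordinary URL-encoded form data is assumed here. The function name build_listwords_request and the default values (copied from the CGI defaults) are illustrative, not part of the service. The request is built but not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Service URI as listed above; whether it is still reachable is not known.
SERVICE_URI = "http://tapor1-dev.mcmaster.ca/~restserv/html/listwords"


def build_listwords_request(html_text, tag="body", list_option="all",
                            option_selection="", sorting="2", out_format="2",
                            method="POST"):
    """Build (but do not send) a request for the List Words REST service.

    Assumption: parameters are sent as URL-encoded form data.
    """
    params = {
        "htmlInput": html_text,
        "htmlTag": tag,
        "listOption": list_option,           # all / patt / find / stop
        "optionSeletion": option_selection,  # spelled as in the parameter list
        "sorting": sorting,                  # 1-4, same codes as "listsort"
        "outFormat": out_format,             # 1-4, same codes as "listdisp"
    }
    query = urlencode(params)
    if method.upper() == "GET":
        # GET: parameters travel in the query string
        return Request(SERVICE_URI + "?" + query, method="GET")
    # POST: parameters travel in the request body
    return Request(
        SERVICE_URI,
        data=query.encode("ascii"),
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


request = build_listwords_request("<body><p>Hello hello world</p></body>")
print(request.get_method(), request.full_url)
print(request.data.decode("ascii"))
```

Passing the returned object to urllib.request.urlopen would actually submit it; that step is left out because the availability of the server is not confirmed here.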

Known Bugs

To Do


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.27 - 22 May 2008 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 78 to 78

Changed:
<
<
>
>

Tag:
Line: 102 to 102

<input type="hidden" name="listsort" value="2" />

Added:
>
>
<input type="hidden" name="sparkline" value="10" />

<input type="hidden" name="listdisp" value="2" />

<input type="hidden" name="taporface" value="same" />


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.26 - 15 May 2008 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 68 to 68

You can add a button to your web page that lists all the words in that page by calling the TAPoRware CGI script.

Changed:
<
<
>
>

Changed:
<
<
>
>

Changed:
<
<
>
>

Changed:
<
<
>
>

Added:
>
>
Tag:

Added:
>
>

Here is the code for this function

Changed:
<
<
<form method="post" name="textForm" enctype="multipart/form-data" target="_blank" action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/tlistwordstem.cgi" onsubmit="document.textForm.texturl.value=document.location.href">
>
>
<form method="post" name="htmlForm" enctype="multipart/form-data" target="_blank" action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/hlistword.cgi" onsubmit="document.htmlForm.htmlurl.value=document.location.href">

<input type="hidden" name="source" value="url" />

Changed:
<
<
<input type="hidden" name="texturl" />
>
>
<input type="hidden" name="htmlurl" />

<input type="hidden" name="freetext" value="yes"/>

<input type="hidden" name="range" value="all" />

Changed:
<
<
<input type="hidden" name="sorting" value="2" />
>
>
<input type="hidden" name="listsort" value="2" />

Changed:
<
<
<input type="hidden" name="display" value="1" />
>
>
<input type="hidden" name="listdisp" value="2" />

<input type="hidden" name="taporface" value="same" />

Changed:
<
<
<input type="submit" name="doIt" value="List All Words of the Page" />
>
>
<input type="submit" name="doIt" value="List All Words in The Tag" />

</form>


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.25 - 15 May 2008 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 68 to 68

You can add a button to your web page that lists all the words in that page by calling the TAPoRware CGI script.

Changed:
<
<
>
>

Changed:
<
<
>
>

Line: 86 to 86

Here is the code for this function

<form method="post" name="textForm" enctype="multipart/form-data" target="_blank"

Changed:
<
<
action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/tlistword.cgi"
>
>
action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/tlistwordstem.cgi"

onsubmit="document.textForm.texturl.value=document.location.href">

<input type="hidden" name="source" value="url" />


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.24 - 14 May 2008 - GeoffreyRockwell)

META TOPICPARENT TAPoRware

List Words

Line: 70 to 70

Changed:
<
<
>
>


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.23 - 31 May 2007 - LianYan)

META TOPICPARENT TAPoRware

List Words


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.22 - 19 Jul 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.21 - 19 Jul 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 53 to 52

Parameter Name Parameter Value Control Type Default Description
source url/local radio button url Let user select input text (either a url or upload local html text)
Changed:
<
<
htmlurl   text   A Valid URL that the pointed document should be an html text
>
>
htmlurl   text   A valid URL that the pointed document should be an html text

localFile   file   The path to your local html text file
tagword   text body Valid html element (tag) name, or multiple html element names separated by commas
range all/patt/find/stop radio button all Options that let user select the word list he/she want to see
Changed:
<
<
wpat   text   A unix styled pattern. This field corresponding ti the value "patt" in the radio button group named "range"
findstop typedin/textfile/glasgow trdio button glasgow The option are connected with value "find" and "stop" in the radio button group named "range"
>
>
wpat   text   A unix styled pattern. This field corresponding to the value "patt" in the radio button group named "range"
findstop typedin/textfile/glasgow radio button glasgow The option are connected with value "find" and "stop" in the radio button group named "range"

typedinword   text   This text field is corresponding to the value "typedin" of radio button group named "findstop"
wordfile   file   This field is corresponding to the value "textfile" of radio button group named "findstop"
listsort 1/2/3/4 selection 2 Sorting criteria which are alphabetically/by frequency/by order of first appearance/by reversed alphabetic order in the order of parameter values

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.20 - 18 Jul 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 113 to 113

TAPoRware provides web services to any non-profit organization. Here is the TAPoRware web service information:

Changed:
<
<
>
>

  • Service Method: list_Words_HTML
  • parameters:
    • htmlInput -- any html string

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.19 - 13 Jul 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 45 to 45

CGI Interface

If you want to use this tool from your web site, here is the CGI Interface:

Changed:
<
<
(Note: If you want to upload local html text to the tool, you need to use attribute name/value pair: enctype="multipart/form-data" within the form tag)
>
>
(Note: You need to use the attribute name/value pair enctype="multipart/form-data" within the form tag, because the tool was designed to allow local file uploading, even if you do not use this feature.)
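For callers that submit to the CGI script from a program rather than a browser form, the same encoding has to be produced by hand. The following is a minimal sketch of multipart/form-data encoding for plain text fields only (no file upload). The function name encode_multipart and the example URL are illustrative and not part of TAPoRware.

```python
import uuid


def encode_multipart(fields, boundary=None):
    """Encode a dict of text fields as a multipart/form-data body.

    Returns (content_type_header_value, body_bytes).
    """
    boundary = boundary or uuid.uuid4().hex
    lines = []
    for name, value in fields.items():
        # Each part: boundary line, disposition header, blank line, value
        lines.append("--" + boundary)
        lines.append('Content-Disposition: form-data; name="%s"' % name)
        lines.append("")
        lines.append(value)
    # Closing boundary, followed by a final line break
    lines.append("--" + boundary + "--")
    lines.append("")
    body = "\r\n".join(lines).encode("utf-8")
    return "multipart/form-data; boundary=" + boundary, body


content_type, body = encode_multipart(
    {"source": "url", "htmlurl": "http://example.org/page.html", "tagword": "body"},
    boundary="example-boundary",
)
print(content_type)
print(body.decode("utf-8"))
```

The returned content type goes into the Content-Type request header, and the bytes form the POST body.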


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.18 - 11 Jul 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 14 to 14

  • pattern: A sequence of characters used either with regular expression notation or for path name expansion, as a means of selecting various character strings or path names, respectively. Values are matched against patterns to see if they should be included/excluded. In patterns "*" matches any string, "?" matches any single character.
Changed:
<
<

Predefined parameter values in Tool Bar

>
>

Predefined Parameter Values in Tool Bar


  • Source: the page the user is currently in.
  • Element: body or set by site owner

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.17 - 05 Jul 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 14 to 14

  • pattern: A sequence of characters used either with regular expression notation or for path name expansion, as a means of selecting various character strings or path names, respectively. Values are matched against patterns to see if they should be included/excluded. In patterns "*" matches any string, "?" matches any single character.
Changed:
<
<

History

>
>

Predefined parameter values in Tool Bar

  • Source: the page the user is currently in.
  • Element: body or set by site owner
  • Words listed: all words except the glasgow stop-list
  • Stemmer: use inflectional stemmer to process all the words before listing
  • Sorting: by frequency
  • Display format: HTML

Pseudocode


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.16 - 21 Jun 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 8 to 8

Description

This tool can be used to list all of the words or user specified words found within a specified tag. For example, list all words matching a user entered pattern, list all words except user specified stop words. The query results can be displayed alphabetically, by frequency, by order of appearance, or in reversed alphabetical order. If no tag is specified, the <body> tag is used.
Added:
>
>

Term Definitions

  • stop words: Words ignored in a query because they are so commonly used that they can't contribute to relevancy. Includes conjunctions, prepositions, and articles such as and, to and a.

  • pattern: A sequence of characters used either with regular expression notation or for path name expansion, as a means of selecting various character strings or path names, respectively. Values are matched against patterns to see if they should be included/excluded. In patterns "*" matches any string, "?" matches any single character.
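The two terms can be illustrated with a short sketch. It assumes that the pattern matching behaves like shell-style wildcards, which Python's fnmatch module provides, and that matching ignores capitalization. The small stop list and the function names are made up for the example; the tool's own Glasgow stop list is not reproduced here.

```python
from fnmatch import fnmatchcase

# A tiny illustrative stop list; not the Glasgow stop list used by the tool.
STOP_WORDS = {"and", "to", "a", "the", "of"}


def drop_stop_words(words, stop_words=STOP_WORDS):
    """Return the words that are not stop words (case-insensitive)."""
    return [w for w in words if w.lower() not in stop_words]


def match_pattern(words, pattern):
    """Return the words matching a wildcard pattern (case-insensitive).

    "*" matches any string and "?" matches any single character.
    fnmatchcase additionally accepts [seq] character classes.
    """
    return [w for w in words if fnmatchcase(w.lower(), pattern.lower())]


words = ["The", "cat", "and", "a", "cart", "went", "to", "Canada"]
print(drop_stop_words(words))
print(match_pattern(words, "ca*"))
print(match_pattern(words, "ca?"))
```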

History

Pseudocode


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.15 - 19 Jun 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.14 - 28 Mar 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 56 to 56

You can add a button to your web page that lists all the words in that page by calling the TAPoRware CGI script.

Changed:
<
<
>
>

Line: 73 to 73

Here is the code for this function

Changed:
<
<
<form method="post" name="textForm" enctype="multipart/form-data"
>
>
<form method="post" name="textForm" enctype="multipart/form-data" target="_blank"

action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/tlistword.cgi" onsubmit="document.textForm.texturl.value=document.location.href">

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.13 - 27 Mar 2006 - GeoffreyRockwell)

META TOPICPARENT TAPoRware

List Words

Line: 36 to 36

Deleted:
<
<
put form code

Here are the parameters:

Parameter Name Parameter Value Control Type Default Description
Line: 60 to 58

Changed:
<
<
>
>

Changed:
<
<
>
>

Changed:
<
<
>
>

Here is the code for this function

Line: 79 to 77

action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/tlistword.cgi" onsubmit="document.textForm.texturl.value=document.location.href">
Changed:
<
<
<input type="hidden" name="source" value="url">
>
>
<input type="hidden" name="source" value="url" />

Changed:
<
<
<input type="hidden" name="texturl">
>
>
<input type="hidden" name="texturl" />

<input type="hidden" name="freetext" value="yes"/>

<input type="hidden" name="range" value="all" />

Changed:
<
<
<input type="hidden" name="sorting" value="2">
>
>
<input type="hidden" name="sorting" value="2" />

Changed:
<
<
<input type="hidden" name="display" value="1">
>
>
<input type="hidden" name="display" value="1" />

Changed:
<
<
<input type="hidden" name="taporface" value="new">
>
>
<input type="hidden" name="taporface" value="same" />

Changed:
<
<
<input type="submit" name="doIt" value="List All Words of the Page">
>
>
<input type="submit" name="doIt" value="List All Words of the Page" />

</form>


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.12 - 22 Mar 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 54 to 54

listdisp 1/2/3/4 selection 2 Display format which are XML tags in HTML/HTML/XML tree/tab Delimited Text in the order of parameter values
taporface   checkbox checked display result in a new window without graphics interface (default) or with taporware interface in the same window
Added:
>
>

Use List Words TAPoRware Tool in Your Web Page

You can add a button to your web page that lists all the words in that page by calling the TAPoRware CGI script.

Here is the code for this function

<form method="post" name="textForm" enctype="multipart/form-data" action="http://taporware.mcmaster.ca/~taporware/cgi-bin/prototype/tlistword.cgi" onsubmit="document.textForm.texturl.value=document.location.href">

<input type="hidden" name="source" value="url">

<input type="hidden" name="texturl">

<input type="hidden" name="freetext" value="yes"/>

<input type="hidden" name="range" value="all" />

<input type="hidden" name="sorting" value="2">

<input type="hidden" name="display" value="1">

<input type="hidden" name="taporface" value="new">

<input type="submit" name="doIt" value="List All Words of the Page">

</form>


Web Service Interface

TAPoRware provides web services to any non-profit organization. Here is the TAPoRware web service information:


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.11 - 21 Mar 2006 - GeoffreyRockwell)

META TOPICPARENT TAPoRware

List Words

Line: 36 to 36

Added:
>
>
put form code

Here are the parameters:

Parameter Name Parameter Value Control Type Default Description

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.10 - 06 Feb 2006 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 31 to 31

CGI Interface

Changed:
<
<
If you want to use this tool from you web site, here is the CGI Interface:
>
>
If you want to use this tool from your web site, here is the CGI Interface:

(Note: If you want to upload local html text to the tool, you need to use attribute name/value pair: enctype="multipart/form-data" within the form tag)

Line: 49 to 49

typedinword   text   This text field is corresponding to the value "typedin" of radio button group named "findstop"
wordfile   file   This field is corresponding to the value "textfile" of radio button group named "findstop"
listsort 1/2/3/4 selection 2 Sorting criteria which are alphabetically/by frequency/by order of first appearance/by reversed alphabetic order in the order of parameter values
Changed:
<
<
listdisp 1/2/3/4 selection 2 Display foemat which are XML tags in HTML/HTML/XML tree/tab Delimited Text in the order of parameter values
>
>
listdisp 1/2/3/4 selection 2 Display format which are XML tags in HTML/HTML/XML tree/tab Delimited Text in the order of parameter values

taporface   checkbox checked display result in a new window without graphics interface (default) or with taporware interface in the same window

Web Service Interface


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.9 - 20 Dec 2005 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 58 to 58

Changed:
<
<
  • Service Method: find_Collocation_Plain
>
>
  • Service Method: list_Words_HTML

  • parameters:
    • htmlInput -- any html string
    • htmlTag -- any html element (tag) name, or multiple html element names separated by commas

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.8 - 20 Dec 2005 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 12 to 12

Pseudocode

Added:
>
>
  • Obtain HTML string by URL or from user's local disk

  • Tokenize text into words using spaces and punctuation marks
  • Sort and count words with similar letters ignoring capitalization
  • Extract words based on user specified criteria if necessary

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.7 - 20 Dec 2005 - LianYan)

META TOPICPARENT TAPoRware

List Words


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.6 - 16 Dec 2005 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 14 to 14

  • Tokenize text into words using spaces and punctuation marks
  • Sort and count words with similar letters ignoring capitalization
Changed:
<
<
  • Extract words based on user specified criteria
>
>
  • Extract words based on user specified criteria if necessary

  • Generate output format based on user's selection

Ways of Using


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.5 - 16 Dec 2005 - LianYan)

META TOPICPARENT TAPoRware

List Words

Line: 6 to 6

Description

Changed:
<
<
This tool can be used to list all of the words found within a specified tag. The query results can be displayed alphabetically, by frequency, by order of appearance, or in reversed alphabetical order. If no tag is specified, the <body> tag is used.
>
>
This tool can be used to list all of the words or user specified words found within a specified tag. For example, list all words matching a user entered pattern, list all words except user specified stop words. The query results can be displayed alphabetically, by frequency, by order of appearance, or in reversed alphabetical order. If no tag is specified, the <body> tag is used.

History

Pseudocode

  • Tokenize text into words using spaces and punctuation marks
Changed:
<
<
  • Count words with similar letters ignoring capitalization
>
>
  • Sort and count words with similar letters ignoring capitalization
  • Extract words based on user specified criteria
  • Generate output format based on user's selection
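The steps above can be sketched roughly as follows. This is not the tool's actual code: the tokenizer is a simple regular expression, ties in the frequency ordering are broken alphabetically, and the function names are invented for the example. The sorting codes 1 to 4 follow the order given for the "listsort" parameter (alphabetical, by frequency, by first appearance, reversed alphabetical).

```python
import re
from collections import Counter


def tokenize(text):
    # Split on whitespace and punctuation, ignoring capitalization.
    # Note: \w+ also keeps digits and underscores as word characters.
    return re.findall(r"\w+", text.lower())


def list_words(text, sorting="2", keep=None):
    """Return (word, count) pairs in the requested order.

    keep is an optional predicate that selects which words to list.
    """
    tokens = [w for w in tokenize(text) if keep is None or keep(w)]
    counts = Counter(tokens)
    first_seen = {}
    for position, word in enumerate(tokens):
        first_seen.setdefault(word, position)
    if sorting == "1":      # alphabetical
        ordered = sorted(counts)
    elif sorting == "2":    # by frequency; ties alphabetical (an assumption)
        ordered = sorted(counts, key=lambda w: (-counts[w], w))
    elif sorting == "3":    # by order of first appearance
        ordered = sorted(counts, key=first_seen.get)
    elif sorting == "4":    # reversed alphabetical
        ordered = sorted(counts, reverse=True)
    else:
        raise ValueError("sorting must be one of '1', '2', '3', '4'")
    return [(w, counts[w]) for w in ordered]


def format_tab_delimited(rows):
    """Render (word, count) pairs as tab-delimited text, one pair per line."""
    return "\n".join("%s\t%d" % row for row in rows)


sample = "The cat saw the Cat. The dog saw nothing."
print(format_tab_delimited(list_words(sample)))
```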

Ways of Using

Added:
>
>
  • Enter a valid URL in the URL field or upload a local html text
  • Enter a valid html tag or a tag list separated by commas; the default is "body"
  • Select which list you want to get and enter the corresponding text if necessary
  • Select sorting criterion
  • Select output format
  • If you want the results displayed in the same window with taporware interface, uncheck the check box - "Open results in new window"
  • Finally, click the "Submit" button


CGI Interface

Added:
>
>
If you want to use this tool from you web site, here is the CGI Interface: (Note: If you want to upload local html text to the tool, you need to use attribute name/value pair: enctype="multipart/form-data" within the form tag)

Here are the parameters:

| Parameter Name | Parameter Value | Control Type | Default | Description |
| source | url/local | radio button | url | Let user select input text (either a url or upload local html text) |
| htmlurl | | text | | A Valid URL that the pointed document should be an html text |
| localFile | | file | | The path to your local html text file |
| tagword | | text | body | Valid html element (tag) name, or multiple html element names separated by commas |
| range | all/patt/find/stop | radio button | all | Options that let user select the word list he/she want to see |
| wpat | | text | | A unix styled pattern. This field corresponding ti the value "patt" in the radio button group named "range" |
| findstop | typedin/textfile/glasgow | trdio button | glasgow | The option are connected with value "find" and "stop" in the radio button group named "range" |
| typedinword | | text | | This text field is corresponding to the value "typedin" of radio button group named "findstop" |
| wordfile | | file | | This field is corresponding to the value "textfile" of radio button group named "findstop" |
| listsort | 1/2/3/4 | selection | 2 | Sorting criteria which are alphabetically/by frequency/by order of first appearance/by reversed alphabetic order in the order of parameter values |
| listdisp | 1/2/3/4 | selection | 2 | Display foemat which are XML tags in HTML/HTML/XML tree/tab Delimited Text in the order of parameter values |
| taporface | | checkbox | checked | display result in a new window without graphics interface (default) or with taporware interface in the same window |
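As a rough illustration of how the parameters fit together, the sketch below assembles a set of form fields with the defaults from the table and rejects values outside the listed choices. The function name build_cgi_fields and the validation rules are assumptions made for the example. The "taporface" checkbox is left out because the table does not state its submitted value.

```python
# Defaults and allowed values as listed in the parameter table.
DEFAULTS = {
    "source": "url",
    "tagword": "body",
    "range": "all",
    "findstop": "glasgow",
    "listsort": "2",
    "listdisp": "2",
}
ALLOWED = {
    "source": {"url", "local"},
    "range": {"all", "patt", "find", "stop"},
    "findstop": {"typedin", "textfile", "glasgow"},
    "listsort": {"1", "2", "3", "4"},
    "listdisp": {"1", "2", "3", "4"},
}


def build_cgi_fields(htmlurl, **overrides):
    """Return a dict of form fields for a URL source, with table defaults."""
    fields = dict(DEFAULTS, htmlurl=htmlurl)
    fields.update(overrides)
    for name, allowed in ALLOWED.items():
        if fields[name] not in allowed:
            raise ValueError("%s must be one of %s" % (name, sorted(allowed)))
    # "wpat" goes with the "patt" choice of the "range" group
    if fields["range"] == "patt" and not fields.get("wpat"):
        raise ValueError("range='patt' needs a 'wpat' pattern")
    return fields


fields = build_cgi_fields("http://example.org/page.html", listsort="1")
for name in sorted(fields):
    print(name, "=", fields[name])
```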

Web Service Interface

Added:
>
>
TAPoRware provides web services to any non-profit organization. Here is the TAPoRware web service information:

  • Endpoint URL: http://strange.mcmaster.ca:9982
  • Service URI: http://strange.mcmaster.ca/~taporware/webservice
  • Service Method: find_Collocation_Plain
  • parameters:
    • htmlInput -- any html string
    • htmlTag -- any html element (tag) name, or multiple html element names separated by commas
    • listOption -- values are same as parameter "range" in the CGI interface above
    • optionSeletion -- values are corresponding to the "list option"
    • sorting -- values are same as parameter "listsort" in the CGI interface above
    • outFormat -- values are same as parameter "listdisp" in the CGI interface above

Known Bugs

To Do


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.4 - 24 Nov 2005 - GeoffreyRockwell)

META TOPICPARENT TAPoRware

List Words

Line: 12 to 12

Pseudocode

Added:
>
>
  • Tokenize text into words using spaces and punctuation marks
  • Count words with similar letters ignoring capitalization

Ways of Using

CGI Interface

Line: 22 to 25

To Do

Added:
>
>
  • There should be a setting to preserve capitalization so that "Canadian" and "canadian" would be two different words.
  • We need to check multilingual texts - how are accents being handled?

-- MattPatey - 13 Oct 2005

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.3 - 15 Oct 2005 - MattPatey)

META TOPICPARENT TAPoRware

List Words

Changed:
<
<
See http://taporware/~taporware/htmlTools/listword.shtml
>
>
See http://taporware.mcmaster.ca/~taporware/htmlTools/listword.shtml

Description


 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.2 - 15 Oct 2005 - MattPatey)

META TOPICPARENT TAPoRware
Added:
>
>

List Words

See http://taporware/~taporware/htmlTools/listword.shtml

Description


This tool can be used to list all of the words found within a specified tag. The query results can be displayed alphabetically, by frequency, by order of appearance, or in reversed alphabetical order. If no tag is specified, the <body> tag is used.
Added:
>
>

History

Pseudocode

Ways of Using

CGI Interface

Web Service Interface

Known Bugs

To Do


-- MattPatey - 13 Oct 2005

 <<O>>  Difference Topic TAPoRwareHTMLListWords (r1.1 - 13 Oct 2005 - MattPatey)
Line: 1 to 1
Added:
>
>
META TOPICPARENT TAPoRware
This tool can be used to list all of the words found within a specified tag. The query results can be displayed alphabetically, by frequency, by order of appearance, or in reversed alphabetical order. If no tag is specified, the <body> tag is used.

-- MattPatey - 13 Oct 2005



Revision r1.1 - 13 Oct 2005 - 20:38 - MattPatey
Revision r1.29 - 06 Jun 2008 - 20:21 - LianYan