Skip to content.
TADA Contents
About TADA
About Text Analysis
Resources for Devs
Projects
News & Events
Publications
Find topic
Search text
Web tools
Recent changes
Topic list
Verbose topic list
Access statistics
Notify me of changes
Web preferences
Help
Text formatting rules
TWiki documentation
Sandbox (test) web
Tools
Analysis Tool Bar +
List Words
?
Concordance
?
Pattern:
Collocation
?
Pattern:
Co-occurrence
?
Pattern:
Co-pattern:
Summarizer
?
input type="hidden" name="distrib" value="1"<-->
Tokenizer
?
Date Finder
?
Principal Components Analysis
?
Pattern Distribution
?
Pattern:
Visual Collocator
?
Pattern:
Weighted Centroid
?
T-Bar from
TAPoR
More...
Taporize
Attach a file
Edit this page
Main
>
GlossaryUTF8
UTF8
UTF-8 (8-bit Unicode Character Encoding)
Unicode character encoding is an evolution of the ASCII set to permit support of a greater number of alphanumeric characters including those with diacritical marks such as accents. More information on UTF-8 is available at:
Wikipedia
--
ShawnDay
- 1 May 2006
Use this box to quickly add a comment to the page.
more options...