

User 2: Linguistic Studies Researcher

Sidney
Modern Languages Grad Student, University of Waterloo, Age: 25


Sidney is a graduate student in languages at the University of Waterloo.

He needs to identify and explore a series of colloquial terms unique to the Franco-Ontarian population.

He assembles a custom web spidering routine using JiTR's drag and drop spiderBuilder. He runs this spider to construct a collection of articles drawn from Franco-Ontarian sources. He then applies a recipe he found in the TAPoR Portal to identify colloquial word usage in bodies of text. The TAPoR tools are provided as a plug-in to the JiTR environment and this enables him to quickly isolate word groups meeting his needs. Sidney's results are added to his repository as a separate text. These isolated phrases then serve as a target for additional analysis to discover patterns in their usage with additional tools available from the JiTR dashboard.

Persona

Overview

Now in his mid-20s, Sidney grew up with computers and feels comfortable around them. As a student, he sees the potential of technology for understanding language. His colleagues are open to the idea, so he is often given the unofficial role of keeping track of computing developments and the ways they can assist in research.

Sidney's interests lie in French-Canadian relations and French diaspora beyond Quebec.

Scenarios

Scenario 1

Sidney needs to identify and explore a series of colloquial terms unique to the Franco-Ontarian population.

He assembles a custom web spider routine using JiTR's drag and drop spiderBuilder. He runs this spider to construct a collection of articles drawn from Franco-Ontarian sources. He then applies a recipe he found in the TAPoR Portal to identify colloquial word usage in bodies of text. The TAPoR tools are provided as a plug-in to the JiTR environment and this enables him to quickly isolate word groups meeting his needs. Sidney's results are added to his repository as a separate text. These isolated phrases then serve as a target for additional analysis to discover patterns in their usage with additional tools available from the JiTR dashboard.

  • Sidney is studying Franco-Ontarian colloquial terms
  • sets up a webspider using the drag and drop spiderBuilder
  • uses a TAPoR recipe to identify colloquial word usage
  • results are sent back to JiTR, into the repository
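
As a rough illustration of the term-isolation step in Scenario 1, the sketch below counts the words in a small set of collected articles and keeps those that recur but do not appear in a reference vocabulary. This is not the TAPoR recipe itself, whose details are not given here; the function name `candidate_terms`, the `min_count` threshold, and the sample sentences and reference word list are all assumptions made up for the example.

```python
import re
from collections import Counter


def candidate_terms(articles, reference_vocabulary, min_count=2):
    """Return words that recur in the articles but are not in the reference list.

    Hypothetical helper for illustration only; not part of JiTR or TAPoR.
    """
    counts = Counter()
    for text in articles:
        # [^\W\d_]+ matches runs of letters, including accented ones.
        counts.update(re.findall(r"[^\W\d_]+", text.lower()))
    return {
        word: n
        for word, n in counts.items()
        if n >= min_count and word not in reference_vocabulary
    }


# Made-up sample data standing in for the spidered articles.
sample_articles = [
    "Il a pogné son char hier.",
    "Son char est neuf, il a pogné un rhume.",
]
# Made-up reference vocabulary standing in for standard French.
sample_reference = {"il", "a", "son", "hier", "est", "neuf", "un", "rhume"}

print(candidate_terms(sample_articles, sample_reference))
```

The returned words could then be stored as a separate text in the repository, as the scenario describes, and passed on to further analysis tools.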

Scenario 2

Sidney is collecting French-language shareholder information from Canadian companies, in hopes of comparing it with the English-language counterparts.

To collect the documents, Sidney uses the manual item-add. Since they are all online, however, Sidney does not need to upload them. Rather, he adds the documents by entering the target URLs.

Since the collected documents are PDFs, Sidney uses the PDF-to-Text conversion process to turn them into text items he can work with. He then adds anchor targets to each section header in the documents. With the anchors in place, he links the French and English versions by making each section header link to its counterpart in the other language.

  • Sidney is collecting French and English Canadian shareholder information
    • he adds the items by pasting in their URLs
  • Sidney converts his PDFs to text items
  • he adds anchors to section headers (within the items), and makes each header a link to its other-language counterpart
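
The header-linking step in Scenario 2 can be sketched as follows, assuming the French and English documents have the same sections in the same order, so that headers can be paired by position. The function name `link_counterparts`, the anchor naming scheme (`fr-sec-1`, `en-sec-1`, and so on), and the sample headers are hypothetical; JiTR's actual linking mechanism is not described here.

```python
def link_counterparts(french_headers, english_headers):
    """Pair section headers by position and give each an anchor and a target.

    Hypothetical helper for illustration only; not part of JiTR.
    """
    if len(french_headers) != len(english_headers):
        raise ValueError("documents have different numbers of section headers")

    links = {}
    pairs = zip(french_headers, english_headers)
    for i, (fr, en) in enumerate(pairs, start=1):
        fr_anchor = f"fr-sec-{i}"
        en_anchor = f"en-sec-{i}"
        # Each anchor records its header text and the anchor it links to.
        links[fr_anchor] = {"header": fr, "target": en_anchor}
        links[en_anchor] = {"header": en, "target": fr_anchor}
    return links


# Made-up section headers standing in for the converted PDF texts.
sample_links = link_counterparts(
    ["Message aux actionnaires", "Résultats financiers"],
    ["Message to Shareholders", "Financial Results"],
)
for anchor, info in sample_links.items():
    print(anchor, "->", info["target"], "|", info["header"])
```

Pairing by position is the simplest possible choice; documents whose sections differ between languages would need the pairs entered by hand.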

Summary (see JiTRCollectiveSummaries)

Summary of what's needed for Sidney
  • extensibility - ability to use external processes on his texts
    • ability to call data back into the repository from external processes
  • web spider - customizable and capable of pulling articles from multiple specified sites
  • process to extract text from PDF documents
  • simple linking within items
  • manual add by URL

Wireframes

-- PeterOrganisciak - 19 Dec 2007

