miller

Presentation Transcript

Limber Up: For the Race to Provide Multi-lingual Access and Automatic Indexing: 

Limber Up: For the Race to Provide Multi-lingual Access and Automatic Indexing Lafayette Ragsdale - Limber Up ! A Limited Edition Civil War Print

Limber Up: For the Race to Provide Multi-lingual Access and Automatic Indexing
IASSIST 2000 - DATA IN THE DIGITAL LIBRARY: Charting the Future for Social, Spatial and Government Data
June 7-10, Northwestern University

Outline
Introductions - Myself, Project, Thesauri, RDF
The Challenge - To Facilitate Cross-European Comparative Data Analysis
The Dream - A Computer Data-ing Service
The Reality - The Limber Approach
The Progress - Metadata, Thesaurus, Workshop
The End - A Possible Sponsor and Questions ???

Introductions
Ken Miller - Head of Information Systems Development, The UK Data Archive, The University of Essex, England
EU Human Language Technologies IST Project - Language Independent Metadata Browsing of European Resources
CLRC Rutherford Appleton Laboratory, Intrasoft, UK Data Archive, Norwegian Data Archive; user group: other European national archives
To break down linguistic and discipline barriers to interoperability between European resources

Hierarchical Thesauri
List of specific synonyms
Broader and narrower relationships
Subject concept
Concept tree - hierarchies
Shift focus within concept
Ensured relevance
Browsing and KWIC listings
Apply to different parts of metadata
Ranking

NT EMPLOYMENT HISTORY, EMPLOYMENT PROGRAMMES, FULL-TIME EMPLOYMENT, HOME-BASED WORK, IRREGULAR ECONOMIC ACTIVITY, JOB CHANGING, JOB LOSSES, JOB SHARING, JOB VACANCIES, MILITARY SERVICE, OFF-SHORE EMPLOYMENT, PART-TIME EMPLOYMENT, SEASONAL EMPLOYMENT, SHELTERED EMPLOYMENT, SPOUSES' EMPLOYMENT, STUDENT EMPLOYMENT, SUBSIDIARY EMPLOYMENT, TEMPORARY EMPLOYMENT, UNEMPLOYMENT, WOMEN'S EMPLOYMENT, YOUTH EMPLOYMENT
RT CONDITIONS OF EMPLOYMENT, EMPLOYMENT ABROAD, EMPLOYMENT OPPORTUNITIES, EMPLOYMENT POLICY, EMPLOYMENT SERVICES, EMPLOYMENT STATUS, LABOUR (WORK), LABOUR ECONOMICS, OCCUPATIONS, PERSONNEL, PERSONNEL MANAGEMENT, RIGHT TO WORK, WORKING CONDITIONS
UF JOBS, WORK
BT LABOUR AND EMPLOYMENT
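The thesaurus entry on this slide can be held in a small data structure. The following is a minimal Python sketch, not the project's implementation. It assumes the head term is EMPLOYMENT (the slide does not name it) and shortens the NT and RT lists; `ThesaurusEntry` and `expand_query` are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class ThesaurusEntry:
    """One thesaurus record with the four relationship types on the slide."""
    term: str
    bt: list = field(default_factory=list)  # broader terms
    nt: list = field(default_factory=list)  # narrower terms
    rt: list = field(default_factory=list)  # related terms
    uf: list = field(default_factory=list)  # "used for": non-preferred synonyms


# Assumption: the head term is EMPLOYMENT. NT and RT lists are shortened.
employment = ThesaurusEntry(
    term="EMPLOYMENT",
    bt=["LABOUR AND EMPLOYMENT"],
    nt=["FULL-TIME EMPLOYMENT", "PART-TIME EMPLOYMENT", "UNEMPLOYMENT"],
    rt=["EMPLOYMENT POLICY", "OCCUPATIONS", "WORKING CONDITIONS"],
    uf=["JOBS", "WORK"],
)


def expand_query(word, entries, include_narrower=False):
    """Map a user's word to the preferred term, optionally adding narrower terms."""
    wanted = word.upper()
    for entry in entries:
        if wanted == entry.term or wanted in entry.uf:
            terms = [entry.term]
            if include_narrower:
                terms.extend(entry.nt)
            return terms
    return []  # the word is not covered by the thesaurus


print(expand_query("jobs", [employment]))
print(expand_query("work", [employment], include_narrower=True))
```

Searching on UF synonyms first and adding narrower terms only on request mirrors the idea of shifting focus within a concept.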

RDF - Resource Description Framework
World-Wide Web Consortium (W3C)
A framework within which statements about Web resources can be recorded
Uses XML as its physical syntax
Based around directed labelled graphs
Triples - Resource, Property, Value
Can construct more complex models
Statements about statements
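The triple model and "statements about statements" can be sketched in a few lines of Python. This is an illustration only: the `example.org` URIs, the shortened property names and the `assertedBy` property are assumptions made for the example. The four-triple pattern follows RDF reification (type Statement, subject, predicate, object).

```python
from collections import namedtuple

# An RDF statement is a triple: a resource, a property, and a value.
Triple = namedtuple("Triple", ["resource", "prop", "value"])

# Hypothetical resources; property names are shortened for readability.
graph = [
    Triple("http://example.org/dataset/1", "title", "Example Survey"),
    Triple("http://example.org/dataset/1", "language", "en"),
]


def reify(statement, statement_id):
    """Describe a statement with four triples so it can itself be a resource."""
    return [
        Triple(statement_id, "type", "Statement"),
        Triple(statement_id, "subject", statement.resource),
        Triple(statement_id, "predicate", statement.prop),
        Triple(statement_id, "object", statement.value),
    ]


# A statement about a statement: record who asserted the title triple.
stmt_id = "http://example.org/statements/1"
graph.extend(reify(graph[0], stmt_id))
graph.append(Triple(stmt_id, "assertedBy", "http://example.org/archive"))


def values(graph, resource, prop):
    """Return every value recorded for a resource/property pair."""
    return [t.value for t in graph if t.resource == resource and t.prop == prop]


print(values(graph, stmt_id, "object"))
print(values(graph, stmt_id, "assertedBy"))
```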

<web:Description about="http://elsst/concepts/CID_6">
  <web:type resource="http://rdf-dot/Thes/Thes.xrdf#Concept"/>
  <rdfs:isDefinedBy web:resource="http://elsst/concepts/"/>
  <thes:indicator web:resource="http://elsst/terms/TID_3"/>
  <thes:notation>620</thes:notation>
  <thes:scope web:resource="http://elsst/scopenotes/SN_12"/>
  <thes:relatedConcept web:resource="http://elsst/concepts/CID_15"/>
</web:Description>

<web:Description about="http://elsst/terms/TID_3">
  <web:type resource="http://rdf-dot/Thes/Thes.xrdf#Term"/>
  <thes:lang>en</thes:lang>
  <web:value>Friends</web:value>
  <thes:termType web:resource="http://rdf-dot/Thes/Thes.xrdf#preferred"/>
</web:Description>
<web:Description about="http://elsst/scopenotes/SN_12">
  <web:type resource="http://rdf-dot/Thes/Thes.xrdf#ScopeNote"/>
  <thes:lang>en</thes:lang>
  <web:value>To be used only for platonic relationships</web:value>
</web:Description>
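The concept, term and scope-note descriptions on these two slides link to each other by URI. A rough Python sketch of how a client might follow those links is given here. The triples are transcribed by hand from the slides, with property names shortened; `objects` and `describe_concept` are hypothetical helpers, not part of any Limber tool.

```python
# Triples transcribed from the slides (namespace prefixes dropped).
TRIPLES = [
    ("http://elsst/concepts/CID_6", "type", "Concept"),
    ("http://elsst/concepts/CID_6", "indicator", "http://elsst/terms/TID_3"),
    ("http://elsst/concepts/CID_6", "notation", "620"),
    ("http://elsst/concepts/CID_6", "scope", "http://elsst/scopenotes/SN_12"),
    ("http://elsst/concepts/CID_6", "relatedConcept", "http://elsst/concepts/CID_15"),
    ("http://elsst/terms/TID_3", "type", "Term"),
    ("http://elsst/terms/TID_3", "lang", "en"),
    ("http://elsst/terms/TID_3", "value", "Friends"),
    ("http://elsst/terms/TID_3", "termType", "preferred"),
    ("http://elsst/scopenotes/SN_12", "type", "ScopeNote"),
    ("http://elsst/scopenotes/SN_12", "lang", "en"),
    ("http://elsst/scopenotes/SN_12", "value",
     "To be used only for platonic relationships"),
]


def objects(subject, prop):
    """All values recorded for one subject/property pair."""
    return [o for s, p, o in TRIPLES if s == subject and p == prop]


def describe_concept(concept, lang):
    """Follow indicator and scope links to find the label and note in one language."""
    label = None
    for term in objects(concept, "indicator"):
        if lang in objects(term, "lang") and "preferred" in objects(term, "termType"):
            label = next(iter(objects(term, "value")), None)
    note = None
    for scope_note in objects(concept, "scope"):
        if lang in objects(scope_note, "lang"):
            note = next(iter(objects(scope_note, "value")), None)
    notation = next(iter(objects(concept, "notation")), None)
    return {"notation": notation, "label": label, "scope_note": note}


print(describe_concept("http://elsst/concepts/CID_6", "en"))
```

Because the concept carries no label of its own, adding a term in another language would only mean adding another `indicator` link; the concept record itself stays unchanged.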

The Challenge - To Facilitate Cross-European Comparative Data Analysis
A metadata model and representation to allow integration within and across data archives.
A multilingual thesaurus to index and access social science datasets in data archives.
A multilingual query and retrieval tool to allow queries to datasets in archives to be made in several languages with keyword and phrase translation in the retrieved metadata.
Tools to support the construction and maintenance of datasets in an archive using automatic indexing.
An XML metadata server using the metadata model and representation.

“Effect of Diet on Children’s Teeth”
Greenhouse effect, Diet, Diet and nutrition, Diet therapy, Children’s rights, False teeth, teeth
TEETH
nt DENTURES
rt DENTAL DISEASES
rt DENTAL HEALTH
“National Diet, Nutrition and Dental Survey”
Show me more

Computer dating service for datasets
Metadata elements to search for a perfect match
Metadata to display to confirm a perfect match
Analysing tools to ensure a perfect match
Resource Independent
Language Independent
Thesaurus Independent
Dream on

The Dream - A Computer Data-ing Service
Diagram labels: France, Germany, England, Spain; German, French, English, Spanish

The Reality - The Limber Approach
Discipline boundaries
RDF
Mappings between metadata standards
Linguistic boundaries
Multi-lingual Thesaurus
Interface
Relevance feedback

Convert DDI codeBook to RDF
Translate DDI Headings/Tag Library
Reduce HASSET
Add Methodology Terms
Translate Hierarchies
Check individual languages
Convert to XML/RDF
Multi-mapped 4 Language Thesaurus

The Progress - Metadata, Thesaurus, Workshop
Reduction of Thesaurus
Automation NOT possible
Removal of cultural specificity
Removal of institutional specificity
Removal of hierarchies in existing thesauri
Initial reduction of 20 major hierarchies
A top-down reduction to a very broad base
User group evaluation of reductions

Metadata Additions to Thesaurus
methodology
kind of data
universe
spatial unit
file structure, format, type
access conditions
age groups

Translation of Thesaurus
Reduced monolingual thesaurus
Specialist teams with social science background
Backwards and sideways translations
Additions of new terms
Allow for different structures
Allow for non-equivalence
Extensive use of synonyms and scope notes
Other EU projects (PACO, CHINTEX etc)

Thesaurus defined in XML and RDF
<thesTerm classCode="L10.10" type="pref">
  <termName xml:lang="en">ADDICTION</termName>
  <termName xml:lang="fr">DÉPENDANCE</termName>
  <scopeNote xml:lang="en" type="ambig">Use a more specific term if possible</scopeNote>
  <ufTerm fterm="L10.10NP"/>
  <ntTerm fterm="L10.10.10"/>
  <btTerm fterm="L10"/>
  <ttTerm fterm="L"/>
  <rtTerm fterm="R70.40"/>
</thesTerm>
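A record in this XML form can be read with Python's standard library. This is a sketch under assumptions: the slide's fragment has been given straight quotes and a closing `</thesTerm>` tag so that it parses, and `translate` is a hypothetical helper added for illustration.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the standard XML namespace URI.
XML_NS = "{http://www.w3.org/XML/1998/namespace}"

# The slide's record, with straight quotes and a closing tag added.
RECORD = """
<thesTerm classCode="L10.10" type="pref">
  <termName xml:lang="en">ADDICTION</termName>
  <termName xml:lang="fr">DÉPENDANCE</termName>
  <scopeNote xml:lang="en" type="ambig">Use a more specific term if possible</scopeNote>
  <ufTerm fterm="L10.10NP"/>
  <ntTerm fterm="L10.10.10"/>
  <btTerm fterm="L10"/>
  <ttTerm fterm="L"/>
  <rtTerm fterm="R70.40"/>
</thesTerm>
"""

term = ET.fromstring(RECORD)

# Term names keyed by language code.
names = {el.get(XML_NS + "lang"): el.text for el in term.findall("termName")}

# Relationship links keyed by element name, each a list of class codes.
links = {
    tag: [el.get("fterm") for el in term.findall(tag)]
    for tag in ("ufTerm", "ntTerm", "btTerm", "ttTerm", "rtTerm")
}


def translate(label, source, target):
    """Return the target-language name if the label matches the source name."""
    return names.get(target) if names.get(source) == label else None


print(term.get("classCode"), names)
print(links)
print(translate("ADDICTION", "en", "fr"))
```

Keeping every language's name inside one record, keyed by class code, means the relationship links are stated once and apply to all languages.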

Limber Workshop - April 17-18th 2000
AltaVista-style searches
Free text / controlled vocabulary option
Logic of multi-word combination explained
Interface and hit list in chosen language
Hit list ranked by where word found
Customised prioritising of ranking
Search history to refine and combine searches
Other EU projects (e.g. PACO, EPAG etc)
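Ranking a hit list by where the word was found, with user-adjustable priorities, could look something like the Python sketch here. The field names, weights and sample records are all assumptions made for illustration (only the first title comes from the slides); the workshop notes do not specify a scoring scheme.

```python
import re

# Assumed default priorities: a hit in the title counts most.
FIELD_WEIGHTS = {"title": 3.0, "keywords": 2.0, "abstract": 1.0}

# Hypothetical metadata records.
RECORDS = [
    {"id": "A", "title": "National Diet, Nutrition and Dental Survey",
     "keywords": "diet nutrition teeth", "abstract": "Survey of eating habits."},
    {"id": "B", "title": "Household Budget Study",
     "keywords": "income expenditure",
     "abstract": "Includes spending on diet products."},
    {"id": "C", "title": "Labour Force Study",
     "keywords": "employment", "abstract": "Quarterly employment data."},
]


def words(text):
    """Lower-cased set of alphabetic words in a field."""
    return set(re.findall(r"[a-z]+", text.lower()))


def score(record, word, weights):
    """Sum the weights of every field in which the word occurs."""
    return sum(w for f, w in weights.items()
               if word.lower() in words(record.get(f, "")))


def rank(records, word, weights=None):
    """Return ids of matching records, best score first (ties by id)."""
    weights = weights or FIELD_WEIGHTS
    scored = [(score(r, word, weights), r["id"]) for r in records]
    hits = [(s, i) for s, i in scored if s > 0]
    return [i for s, i in sorted(hits, key=lambda p: (-p[0], p[1]))]


print(rank(RECORDS, "diet"))
# A user who cares most about abstracts supplies different weights.
print(rank(RECORDS, "diet", {"abstract": 5.0, "title": 1.0}))
```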

Search via classification code
Automatic use of thesaurus only for synonyms
Complexity of thesaurus hidden unless requested
Simple alphabetic listings except for browsing
Searches performed first with suggestion list
Failed searches employ stemming and truncation
Keywords are useful for searching and ranking
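The search behaviour listed here (synonyms applied automatically, stemming and truncation only after a failed search) can be sketched as a simple fallback chain in Python. The index contents and the suffix-stripping rule are assumptions for illustration; a real system would use a proper stemmer.

```python
# Non-preferred synonym -> preferred term (JOBS and WORK are UF entries
# in the thesaurus example; the head term EMPLOYMENT is assumed).
SYNONYMS = {"jobs": "employment", "work": "employment"}

# Hypothetical keyword index: preferred term -> study identifiers.
INDEX = {
    "employment": ["study-1", "study-2"],
    "unemployment": ["study-3"],
    "teeth": ["study-4"],
}


def crude_stem(word):
    """Strip a few common suffixes; a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 4:
            return word[: -len(suffix)]
    return word


def search(word):
    """Try an exact (synonym-aware) match, then fall back to truncation."""
    word = word.lower()
    word = SYNONYMS.get(word, word)  # thesaurus used only for synonyms
    if word in INDEX:
        return "exact", sorted(INDEX[word])
    stem = crude_stem(word)
    hits = set()
    for key, ids in INDEX.items():
        if key.startswith(stem):  # right-hand truncation on the stem
            hits.update(ids)
    return ("truncated", sorted(hits)) if hits else ("none", [])


print(search("Jobs"))
print(search("employing"))
print(search("tooth"))
```

Returning the match type alongside the hits lets an interface tell the user when a looser match was used.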

Keywords are NOT so useful for relevance
Serious analysis requires translation
Title and summary in English
Controlled vocabularies on specific elements: access conditions, methodology, kind of data, universe and spatial unit
Languages of metadata displayed and selectable
Translation of DDI element headings

Great interest in Automatic Indexing Tool
Great scepticism as to whether it would work
To work from DDI metadata
Use linguistic techniques to draw concepts out
Assign keywords that express those concepts
Learn from metadata with manual keywords
Other EU projects (RENARDUS) + OCLC
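The slides do not describe how the indexing tool works internally. As a very rough illustration of the general idea of assigning thesaurus keywords from metadata text, the Python sketch here substitutes plain word matching for the linguistic techniques mentioned, and does no learning from manually indexed records. The trigger words and the sample text are assumptions.

```python
import re

# Hypothetical mapping: thesaurus keyword -> words that suggest it.
THESAURUS = {
    "TEETH": ["teeth", "dental"],
    "DIET": ["diet"],
    "EMPLOYMENT": ["employment", "jobs", "work"],
}


def suggest_keywords(text, thesaurus=THESAURUS):
    """Suggest keywords whose trigger words occur in the text, most frequent first."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = {}
    for keyword, triggers in thesaurus.items():
        n = sum(tokens.count(t) for t in triggers)
        if n:
            counts[keyword] = n
    # Sort by descending count, then alphabetically for stable output.
    return sorted(counts, key=lambda k: (-counts[k], k))


# Sample metadata: a title from the slides plus an invented description.
metadata = ("National Diet, Nutrition and Dental Survey: "
            "a survey of diet and teeth in children")
print(suggest_keywords(metadata))
```

Suggestions produced this way would still need checking by an indexer, which is consistent with the scepticism recorded at the workshop.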

Some doubts over RDF
Some doubts over DDI / NESSTAR
Some doubts over imbalances
Some doubts over multiple resources
Some doubts over mapping
IASSIST conference 7-10th June 2000
New doubts ????

The End - A Possible Sponsor and Questions ???
