Need for Language Technology:
Need for Language Technology Over 30,000 language professionals in the U.S. government
Many more at state and local levels
Extensive material in foreign language, often in legacy formats/encodings
More than 50 percent of Library of Congress is non-English
Many kinds of applications
Assimilation and dissemination
Extensive collaboration
Need for large number of languages
Disaster relief (e.g., Haiti) unpredicatable
Extensive multinational efforts
Language list now being reviewed
Short list as follows:
U.S. Government Languages(List being updated):
U.S. Government Languages (List being updated) Afrikaans
Albanian
Amharic
Arabic
Armenian
Ayamara
Azerbaijani
Bangla
Basque
Belarusian
Bengali
Bosnian
Bulgarian
Burmese
Cantonese
Catalan
Chinese
Croatian
Czech
Danish Dari
Dutch
English
Estonian
Farsi
Finnish
French
Georgian
German
Greek
Guarani
Haitian Creole
Hausa
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Italian
Japanese
Languages (Continued):
Languages (Continued) Kazakh
Khmer/Cambodian
Kinyarwanda
Kirundi
Korean
Kurdish
Lao
Latvian
Lithuanian
Macedonian
Mongolian
Nepali
Norwegian
Pashto
Polish
Portuguese
Romanian
Russian Serbian
Sinhalese
Slovak
Slovenian
Spanish
Swahili
Swedish
Tagalog
Thai
Tibetan
Tigrigna
Turkish
Ukranian
Urdu
Uzbek
Vietnamese
Slide5:
Today Translators and limited Machine Translation
Slide6:
Integrated Collaborative Translation
Space with Shared Tools Fast Routing Translated
and/or
Tagged
Types of Technology Needed:
Types of Technology Needed Browsers
Text Processors
Web Page Tools
OCR
MT
Search Engines
Translation Managers
Language Learning
Dictionaries
Thesauri
Developers Kits
Info Extraction andamp; Summarization
Knowledge Management
Visualization
Other
Special Requirements:
Special Requirements Unicode
UTF 8
Other major code sets
Code set conversions (extensible)
Language and encoding ID
Mixed languages
Per page
Per database
English interfaces
English sys admin
Work with Microsoft and/or Sun (non-localized)
Work well with other COTS applications
Easy training
Good U.S. support
Comply with W3C guidelines for accessibility
Enable easy extensibility by government