logging in or signing up tei contents Dora Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 91 Category: Education License: All Rights Reserved Like it (0) Dislike it (0) Added: February 13, 2008 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript An overview of TEI tagging or, Anyone for pizza?: An overview of TEI tagging or, Anyone for pizza?Basic concepts: Basic concepts The TEI is a modular system, built like a Chicago pizza Each module defines specific elements and attributes Elements are classified structurally and semanticallyTEI core modules: TEI core modules Infrastructure defines all named element classes and macros Core the TEI header elements “common to all kinds of text” Structure “book-like” structures of prose, verse, dramaOptional modules: Optional modules Alternative structures eg transcribed speech, dictionaries ... Specialist applications linking and alignment; analysis; feature structures; certainty; physical transcription; textual criticism, names and dates; language corpora; manuscript description.... Caution! Under Construction! There is NO SUCH THING as “the TEI dtd”: There is NO SUCH THING as “the TEI dtd” TEI Lite (http://www.tei-c.org/Lite/) is our guess at what most people want, most of the time realistic for existing texts, and for new document production, e.g. TEI technical documentation At P5 the task of making your own TEI schema is much simplifiedBasic structure(s): Basic structure(s) Every TEI-conformant document comprises a header followed by (at least one) text the header contains: mandatory file description optional encoding, profile and revision descriptions the header is essential for: bibliographic control and identification resource documentation and processingStructure of a TEI text: Structure of a TEI text A text may be unitary or composite a unitary text contains front matter back matter a body in a composite text, the body is a group of texts (or nested groups)TEI basic structure: TEI basic structure sA text usually has divisions : A text usually has divisions generic, hierarchic subdivisions vanilla or numbered type attribute associated head and trailer elements from the divtop classfor example...: for example...TEI global attributes : TEI global attributes Defined in the core module id for unique identification (to become xml:id) n for (non-unique) name or number rend for rendition (appearance) lang for language (to become xml:lang) Defined in the linking module corresp, synch, ana for specific association types next, prev for aggregating fragmented elements Character Encoding Recommendations: Character Encoding Recommendations non-normative extend, using standard entity sets or transliteration document transliteration scheme with formal Writing System DeclarationText components (prose base): Text components (prose base) What are divisions composed of? prose is mostly paragraphs (<p>) verse is mostly lines (<l>), sometimes in hierarchic groups (<lg>) drama is mostly speeches (<sp>) containing <p> or <l> and interspersed with stage directions (<stage>) These may be mixed, and may also appear directly within undivided texts.Verse: an example: Verse: an exampleDrama: an example: Drama: an exampleTexts are not just words...: Texts are not just words... … but probably only people know that an encoding may claim to capture just visual salience, just its assumed causes both encoding makes explicit one (or more) sets of interpretationsFor example...: For example... And this Indenture further witnesseth that the said Walter Shandy, merchant, in consideration of the said intended marriage...Slide18: And this Indenture further witnesseth that the said Walter Shandy, merchant, in consideration of the said intended marriage...Who does the work?: Who does the work? TEI scheme allows for close reading -- and the reverse can tag very detailed features of discourse function can normalise or simplify (e.g. dates numbers, names) … or leave well aloneCore phrase level elements include...: Core phrase level elements include... phrases that are conventionally typographically distinct “data-like” (names, numbers, dates, times, addresses) editorial intervention (corrections, regularizations, additions, omissions ...) cross references and linksfor example...: for example... <head>Of writing lives in general,and particularly of <title>Pamela </title>, with a word by the bye of <name>Colley Cibber</name> and others.</head> <p>It is a trite but true observation, that <q>examples work more forcibly on the mind than precepts</q>.… <p><name>Mr. Joseph Andrews</name>, <rs>the hero of our ensuing history</rs>, was esteemed to be ...Direct speech: Direct speech Use the who attribute to show speakers Speeches can be nested in other speechesForeign language phrases : Foreign language phrases The xml:lang attribute may be attached to any element Use <foreign> if nothing else is available Use ISO 639-2 code to identify languageNames and other referring strings: Names and other referring strings The <rs> (referring string) element is used for any kind of name or referenceCorrection and Regularization: Correction and Regularization <corr> marks a correction <sic> marks a (deliberate) non-correction <reg> and <orig> for normalization (or the reverse) use singly, or within <choice> if you want bothA table of green feelds: A table of green feelds Omissions, Deletions, Additions: Omissions, Deletions, Additions <gap> omission by transcriber <del> and <add> cancellation or addition in source <combine> used to group addition and deletion together <supplied> insertion by editor <unclear> material uncertain because illegible <damage> physical damage to text carrierThe multiple hierarchy problem: The multiple hierarchy problem SGML allows only one hierarchy at a time Is a document chapter-paragraph-phrase gathering-page-leaf or both? discontinuous segments links and milestonesBoundary markers: Boundary markers page, column, and line breaks (<pb>, <cb>, <lb>) generic <milestone>Some chunks are also phrases: Some chunks are also phrases <list> lists of all kinds <note> notes (authorial or editorial) <figure> pictures or figures <formula> formulae <table> tables <bibl> bibliographic descriptionsLists: Lists use <list> for lists of any kind (use type attribute to distinguish) use <label> in two-column lists as alternative to n attribute may be nested as necessaryfor example...: for example...Figures and graphics: Figures and graphics The presence of a graphic is indicated by the <figure> element The title of the graphic is tagged as a <head> A description of the graphic may be supplied (as a <figDesc>) for use by software unable to render the graphic The graphic itself is specified by an external link (URL) for example... : for example... Tables: Tables a <table> element contains <row>s of <cell>s spanning is indicated by rows and cols attributes role attribute indicates whether row or column holds data or a label embedded tables are permittedfor example...: for example...Bibliography: Bibliography Use simple <bibl> with optional subcomponents: <respStmt> (for any kind of responsibility) or <author>, <editor>, etc. <title> with optional level attribute <imprint> groups publication details <biblScope> adds page references etc. Use <listBibl> for list of referencesfor example...: for example...Notes : Notes Use <note> for notes of any kind (editorial or authorial) if in-line, use place attribute to specify location if out of line, either use target attribute to specify attachment point or mark attachment point as a <ref>for example...: for example...The Spoken texts module: The Spoken texts module components : <u> <event> <kinesic> <vocal> <pause> <shift> contextual information in header <settingDesc> <particDesc> facilities for synchronization and timing Features of speech: Features of speechUtterances: Utterances Basic unit of discourse, corresponding to speaker turns Optionally grouped into higher-level divisions (<div>s), e.g. to mark discourse function Linked by who attribute to <person> description in headerVocals and events: Vocals and events Empty elements are used to mark paralinguistic phenomenaVoice quality and prosody: Voice quality and prosody The <shift> element is used to mark changes in voice quality Other prosodic features may be marked using specific kinds of <seg> or entity refsAnother example: Another exampleParticipant Description: Participant DescriptionSetting Description: Setting Description eg from P2Timing: Timing Pausing use <pause> element Duration use dur attribute Overlap use trans attribute Overlap: OverlapNot covered here...: Not covered here... specialised front and back matter dictionaries and terminology analytic tagging segmentation interpretations linking the header tags for documentation You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
tei contents Dora Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 91 Category: Education License: All Rights Reserved Like it (0) Dislike it (0) Added: February 13, 2008 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript An overview of TEI tagging or, Anyone for pizza?: An overview of TEI tagging or, Anyone for pizza?Basic concepts: Basic concepts The TEI is a modular system, built like a Chicago pizza Each module defines specific elements and attributes Elements are classified structurally and semanticallyTEI core modules: TEI core modules Infrastructure defines all named element classes and macros Core the TEI header elements “common to all kinds of text” Structure “book-like” structures of prose, verse, dramaOptional modules: Optional modules Alternative structures eg transcribed speech, dictionaries ... Specialist applications linking and alignment; analysis; feature structures; certainty; physical transcription; textual criticism, names and dates; language corpora; manuscript description.... Caution! Under Construction! There is NO SUCH THING as “the TEI dtd”: There is NO SUCH THING as “the TEI dtd” TEI Lite (http://www.tei-c.org/Lite/) is our guess at what most people want, most of the time realistic for existing texts, and for new document production, e.g. TEI technical documentation At P5 the task of making your own TEI schema is much simplifiedBasic structure(s): Basic structure(s) Every TEI-conformant document comprises a header followed by (at least one) text the header contains: mandatory file description optional encoding, profile and revision descriptions the header is essential for: bibliographic control and identification resource documentation and processingStructure of a TEI text: Structure of a TEI text A text may be unitary or composite a unitary text contains front matter back matter a body in a composite text, the body is a group of texts (or nested groups)TEI basic structure: TEI basic structure sA text usually has divisions : A text usually has divisions generic, hierarchic subdivisions vanilla or numbered type attribute associated head and trailer elements from the divtop classfor example...: for example...TEI global attributes : TEI global attributes Defined in the core module id for unique identification (to become xml:id) n for (non-unique) name or number rend for rendition (appearance) lang for language (to become xml:lang) Defined in the linking module corresp, synch, ana for specific association types next, prev for aggregating fragmented elements Character Encoding Recommendations: Character Encoding Recommendations non-normative extend, using standard entity sets or transliteration document transliteration scheme with formal Writing System DeclarationText components (prose base): Text components (prose base) What are divisions composed of? prose is mostly paragraphs (<p>) verse is mostly lines (<l>), sometimes in hierarchic groups (<lg>) drama is mostly speeches (<sp>) containing <p> or <l> and interspersed with stage directions (<stage>) These may be mixed, and may also appear directly within undivided texts.Verse: an example: Verse: an exampleDrama: an example: Drama: an exampleTexts are not just words...: Texts are not just words... … but probably only people know that an encoding may claim to capture just visual salience, just its assumed causes both encoding makes explicit one (or more) sets of interpretationsFor example...: For example... And this Indenture further witnesseth that the said Walter Shandy, merchant, in consideration of the said intended marriage...Slide18: And this Indenture further witnesseth that the said Walter Shandy, merchant, in consideration of the said intended marriage...Who does the work?: Who does the work? TEI scheme allows for close reading -- and the reverse can tag very detailed features of discourse function can normalise or simplify (e.g. dates numbers, names) … or leave well aloneCore phrase level elements include...: Core phrase level elements include... phrases that are conventionally typographically distinct “data-like” (names, numbers, dates, times, addresses) editorial intervention (corrections, regularizations, additions, omissions ...) cross references and linksfor example...: for example... <head>Of writing lives in general,and particularly of <title>Pamela </title>, with a word by the bye of <name>Colley Cibber</name> and others.</head> <p>It is a trite but true observation, that <q>examples work more forcibly on the mind than precepts</q>.… <p><name>Mr. Joseph Andrews</name>, <rs>the hero of our ensuing history</rs>, was esteemed to be ...Direct speech: Direct speech Use the who attribute to show speakers Speeches can be nested in other speechesForeign language phrases : Foreign language phrases The xml:lang attribute may be attached to any element Use <foreign> if nothing else is available Use ISO 639-2 code to identify languageNames and other referring strings: Names and other referring strings The <rs> (referring string) element is used for any kind of name or referenceCorrection and Regularization: Correction and Regularization <corr> marks a correction <sic> marks a (deliberate) non-correction <reg> and <orig> for normalization (or the reverse) use singly, or within <choice> if you want bothA table of green feelds: A table of green feelds Omissions, Deletions, Additions: Omissions, Deletions, Additions <gap> omission by transcriber <del> and <add> cancellation or addition in source <combine> used to group addition and deletion together <supplied> insertion by editor <unclear> material uncertain because illegible <damage> physical damage to text carrierThe multiple hierarchy problem: The multiple hierarchy problem SGML allows only one hierarchy at a time Is a document chapter-paragraph-phrase gathering-page-leaf or both? discontinuous segments links and milestonesBoundary markers: Boundary markers page, column, and line breaks (<pb>, <cb>, <lb>) generic <milestone>Some chunks are also phrases: Some chunks are also phrases <list> lists of all kinds <note> notes (authorial or editorial) <figure> pictures or figures <formula> formulae <table> tables <bibl> bibliographic descriptionsLists: Lists use <list> for lists of any kind (use type attribute to distinguish) use <label> in two-column lists as alternative to n attribute may be nested as necessaryfor example...: for example...Figures and graphics: Figures and graphics The presence of a graphic is indicated by the <figure> element The title of the graphic is tagged as a <head> A description of the graphic may be supplied (as a <figDesc>) for use by software unable to render the graphic The graphic itself is specified by an external link (URL) for example... : for example... Tables: Tables a <table> element contains <row>s of <cell>s spanning is indicated by rows and cols attributes role attribute indicates whether row or column holds data or a label embedded tables are permittedfor example...: for example...Bibliography: Bibliography Use simple <bibl> with optional subcomponents: <respStmt> (for any kind of responsibility) or <author>, <editor>, etc. <title> with optional level attribute <imprint> groups publication details <biblScope> adds page references etc. Use <listBibl> for list of referencesfor example...: for example...Notes : Notes Use <note> for notes of any kind (editorial or authorial) if in-line, use place attribute to specify location if out of line, either use target attribute to specify attachment point or mark attachment point as a <ref>for example...: for example...The Spoken texts module: The Spoken texts module components : <u> <event> <kinesic> <vocal> <pause> <shift> contextual information in header <settingDesc> <particDesc> facilities for synchronization and timing Features of speech: Features of speechUtterances: Utterances Basic unit of discourse, corresponding to speaker turns Optionally grouped into higher-level divisions (<div>s), e.g. to mark discourse function Linked by who attribute to <person> description in headerVocals and events: Vocals and events Empty elements are used to mark paralinguistic phenomenaVoice quality and prosody: Voice quality and prosody The <shift> element is used to mark changes in voice quality Other prosodic features may be marked using specific kinds of <seg> or entity refsAnother example: Another exampleParticipant Description: Participant DescriptionSetting Description: Setting Description eg from P2Timing: Timing Pausing use <pause> element Duration use dur attribute Overlap use trans attribute Overlap: OverlapNot covered here...: Not covered here... specialised front and back matter dictionaries and terminology analytic tagging segmentation interpretations linking the header tags for documentation