Standard PDF Tags

Aus BIT inklusiv Wiki und Test-Case-Datenbank
Version vom 19. Oktober 2016, 14:02 Uhr von Petra (Diskussion | Beiträge) (Schützte „Standard PDF Tags“ ([Bearbeiten=Nur Administratoren erlauben] (unbeschränkt) [Verschieben=Nur Administratoren erlauben] (unbeschränkt)))
(Unterschied) ← Nächstältere Version | Aktuelle Version (Unterschied) | Nächstjüngere Version → (Unterschied)
Wechseln zu: Navigation, Suche

ISO 32000-1:2008 (Table 333 – Table 340)

  • Document - A complete document. This is the root element of any structure tree containing multiple parts or multiple articles.
  • Part - A large-scale division of a document. This type of element is appropriate for grouping articles or sections.
  • Art - (Article) A relatively self-contained body of text constituting a single narrative or exposition. Articles should be disjoint; that is, they should not contain other articles as constituent elements.
  • Sect – (Section) A container for grouping related content elements. Note 1: For example, a section might contain a heading, several introductory paragraphs, and two or more other sections nested within it as subsections.
  • Div - (Division) A generic block-level element or group of elements.
  • BlockQuote - (Block quotation) A portion of text consisting of one or more paragraphs attributed to someone other than the author of the surrounding text.
  • Caption - A brief portion of text describing a table or figure.
  • TOC - (Table of contents) A list made up of table of contents item entries (structure type TOCI) and/or other nested table of contents entries (TOC). A TOC entry that includes only TOCI entries represents a flat hierarchy. A TOC entry that includes other nested TOC entries (and possibly TOCI entries) represents a more complex hierarchy. Ideally, the hierarchy of a top level TOC entry reflects the structure of the main body of the document. Note 2: Lists of figures and tables, as well as bibliographies, can be treated as tables of contents for purposes of the standard structure types.
  • TOCI – (Table of contents item) An individual member of a table of contents. This entry’s children may be any of the following structure types: Lbl, Reference, NonStruct, P, TOC, Link
  • Index - A sequence of entries containing identifying text accompanied by reference elements (citations to content elsewhere in the document) that point out occurrences of the specified text in the main body of a document.
  • NonStruct - (Nonstructural element) A grouping element having no inherent structural significance; it serves solely for grouping purposes. This type of element differs from a division (structure type Div) in that it shall not be interpreted or exported to other document formats; however, its descendants shall be processed normally.
  • Private - (Private element) A grouping element containing private content belonging to the application producing it. The structural significance of this type of element is unspecified and shall be determined entirely by the conforming writer. Neither the Private element nor any of its descendants shall be interpreted or exported to other document formats.
  • P - Paragraph) A low-level division of text.
  • H - (Heading) A label for a subdivision of a document’s content. It should be the first child of the division that it heads.
  • H1 – H6 - Headings with specific levels, for use in conforming writers that cannot hierarchically nest their sections and thus cannot determine the level of a heading from its level of nesting.
  • L - (List) A sequence of items of like meaning and importance. Its immediate children should be an optional Caption (a brief portion of text describing a table or figure) followed by one or more list items (structure type LI).
  • LI - (List item) An individual member of a list. Its children may be one or more labels, list bodies, or both (structure types Lbl or LBody).
  • Lbl - (Label) A name or number that distinguishes a given item from others in the same list or other group of like items. Note: In a dictionary list, for example, it contains the term being defined; in a bulleted or numbered list, it contains the bullet character or the number of the list item and associated punctuation.
  • LBody - (List body) The descriptive content of a list item. In a dictionary list, for example, it contains the definition of the term. It may either contain the content directly or have other BLSEs (Block Level Structure Elements), perhaps including nested lists, as children.
  • Table - (Table) A two-dimensional layout of rectangular data cells, possibly having a complex substructure. It contains either one or more table rows (structure type TR) as children; or an optional table head (structure type THead) followed by one or more table body elements (structure type TBody) and an optional table footer (structure type TFoot). In addition, a table may have a caption (a brief portion of text describing a table or figure) as its first or last child.
  • TR - (Table row) A row of headings or data in a table. It may contain table header cells and table data cells (structure types TH and TD).
  • TH - (Table header cell) A table cell containing header text describing one or more rows or columns of the table.
  • TD - (Table data cell) A table cell containing data that is part of the table’s content.
  • THead - (Table header row group; PDF 1.5) A group of rows that constitute the header of a table. If the table is split across multiple pages, these rows may be redrawn at the top of each table fragment (although there is only one THead element).
  • TBody - (Table body row group; PDF 1.5) A group of rows that constitute the main body portion of a table. If the table is split across multiple pages, the body area may be broken apart on a row boundary. A table may have multiple TBody elements to allow for the drawing of a border or background for a set of rows.
  • TFoot - (Table footer row group; PDF 1.5) A group of rows that constitute the footer of a table. If the table is split across multiple pages, these rows may be redrawn at the bottom of each table fragment (although there is only one TFoot element.)
  • Span - (Span) A generic inline portion of text having no particular inherent characteristics. It can be used, for example, to delimit a range of text with a given set of styling attributes. Note 1: Not all inline style changes need to be identified as a span. Text colour and font changes (including modifiers such as bold, italic, and small caps) need not be so marked, since these can be derived from the PDF content (see “Font Characteristics” in 14.8.2.4, “Extraction of Character Properties”). However, it is necessary to use a span to apply explicit layout attributes such as LineHeight, BaselineShift, or TextDecorationType (see “Layout Attributes for ILSEs” in 14.8.5.4, “Layout Attributes”). Note 2: Marked-content sequences having the tag Span are also used to carry certain accessibility properties (Alt, ActualText, Lang, and E; see 14.9, “Accessibility Support”). Such sequences lack an MCIDproperty and are not associated with any structure element. This use of the Span marked-content tag is distinct from its use as a structure type
  • Quote - (Quotation) An inline portion of text attributed to someone other than the author of the surrounding text. The quoted text should be contained inline within a single paragraph. This differs from the block-level element BlockQuote (see 14.8.4.2, “Grouping Elements”), which consists of one or more complete paragraphs (or other elements presented as if they were complete paragraphs).
  • Note - An item of explanatory text, such as a footnote or an endnote, that is referred to from within the body of the document. It may have a label (structure type Lbl; see “List Elements” in 14.8.4.3, “Block-Level Structure Elements”) as a child. The note may be included as a child of the structure element in the body text that refers to it, or it may be included elsewhere (such as in an endnotes section) and accessed by means of a reference (structure type Reference). Tagged PDF does not prescribe the placement of footnotes in the page content order. They may be either inline or at the end of the page, at the discretion of theconforming writer.
  • Reference - A citation to content elsewhere in the document.
  • BibEntry – (Bibliography entry) A reference identifying the external source of some cited content. It may contain a label (structure type Lbl; see “List Elements” in 14.8.4.3, “Block-Level Structure Elements”) as a child. Although a bibliography entry is likely to include component parts identifying the cited content’s author, work, publisher, and so forth, no standard structure types are defined at this level of detail.
  • Code - A fragment of computer program text.
  • Link - (Link) An association between a portion of the ILSE’s content and a corresponding link annotation or annotations (see 12.5.6.5, “Link Annotations”). Its children should be one or more content items or child ILSEs and one or more object references (see 14.7.4.3, “PDF Objects as Content Items”) identifying the associated link annotations. See “Link Elements” in 14.8.4.3, “Block-Level Structure Elements,” for further discussion.
  • Annot - (Annotation; PDF 1.5) An association between a portion of the ILSE’s content and a corresponding PDF annotation (see 12.5, “Annotations”). Annot shall be used for all PDF annotations except link annotations (see the Link element) and widget annotations (see the Form element in Table 340). See “Annotation Elements”14.8.4.4, “Inline-Level Structure Elements,” for further discussion.
  • Ruby - (Ruby; PDF 1.5) A side-note (annotation) written in a smaller text size and placed adjacent to the base text to which it refers. A Ruby element may also contain the RB, RT, and RP elements. See “Ruby and Warichu Elements” in14.8.4.4, “Inline-Level Structure Elements,” for more details. 14.8.4.4: (Ruby) The wrapper around the entire ruby assembly. It shall contain one RB element followed by either an RT element or a three-element group consisting of RP, RT, and RP. Ruby elements and their content elements shall not break across multiple lines
  • RB - (Ruby base text) The full-size text to which the ruby annotation is applied. RB may contain text, other inline elements, or a mixture of both. It may have the RubyAlignattribute.
  • RT - (Ruby annotation text) The smaller-size text that shall be placed adjacent to the ruby base text. It may contain text, other inline elements, or a mixture of both. It may have the RubyAlign and RubyPosition attributes.
  • RP - (Ruby punctuation) Punctuation surrounding the ruby annotation text. It is used only when a ruby annotation cannot be properly formatted in a ruby style and instead is formatted as a normal comment, or when it is formatted as a warichu. It contains text (usually a single LEFT or RIGHT PARENTHESIS or similar bracketing character).
  • Warichu - (Warichu; PDF 1.5) A comment or annotation in a smaller text size and formatted onto two smaller lines within the height of the containing text line and placed following (inline) the base text to which it refers. A Warichu element may also contain the WT and WP elements. See “Ruby and Warichu Elements” in14.8.4.4, “Inline-Level Structure Elements,” for more details. 14.8.4.4: (Warichu) The wrapper around the entire warichu assembly. It may contain a three-element group consisting of WP, WT, and WP. Warichu elements (and their content elements) may wrap across multiple lines, according to the warichu breaking rules described in the Japanese Industrial Standard (JIS) X 4051-1995.
  • WT - (Warichu text) The smaller-size text of a warichu comment that is formatted into two lines and placed between surrounding WP elements.
  • WP - W (Warichu punctuation) The punctuation that surrounds the WT text. It contains text (usually a single LEFT or RIGHT PARENTHESIS or similar bracketing character). According to JIS X 4051-1995, the parentheses surrounding a warichu may be converted to a SPACE (nominally 1/4 EM in width) at the discretion of the formatter.
  • Figure - An item of graphical content. Its placement may be specified with the Placement layout attribute (see “General Layout Attributes” in 14.8.5.4, “Layout Attributes”).
  • Formula - A mathematical formula. This structure type is useful only for identifying an entire content element as a formula. No standard structure types are defined for identifying individual components within the formula. From a formatting standpoint, the formula shall be treated similarly to a figure (structure type Figure).
  • Form - (Form) A widget annotation representing an interactive form field (see 12.7, “Interactive Forms”). If the element contains a Role attribute, it may contain content items that represent the value of the (non-interactive) form field. If the element omits a Role attribute (see Table 348), it shall have only one child: an object reference (see 14.7.4.3, “PDF Objects as Content Items”) identifying the widget annotation. The annotations’ appearance stream (see 12.5.5, “Appearance Streams”) shall describe the appearance of the form element.

zur Liste der Prüfschritte