Catchwords

catchword forme work page break hyphenation
pb mw fw sic corr corr hi

Encoding of catchwords using mw type="catch", including how to handle discrepancies between the catchword and the matching word in the text

Catchwords should be encoded with fw type="catch". No other encoding is typically nested inside the fw element, with the exception of sic and hi as noted below.

Occasionally in older texts there may be discrepancies between the catchword and the corresponding word at the start of the next page, and it is useful to mark this discrepancy even if only to avoid having to check subsequently to make sure it is not a transcription error. The discrepancy can be marked using the sic element within fw as in Example 1. We do not consider it necessary to include a corrected reading, partly because in many cases it is normal to have the catchword differ slightly from its corresponding word in the text: it may be truncated, or in a different typeface, or it may be capitalized differently. The function of the sic element in this case is to mark the discrepancy, not to correct a clear error. For the same reason, it may be unnecessary to mark minor discrepancies which are common: for instance, truncation of the catchword.

It is more common for the catchword to be in error than for the word in the main text flow, and in this case the sic element should be applied to the reading in the main flow of text, with the corr attribute to supply a corrected reading. See Example 2.

Any hyphens within catchwords should be encoded as ordinary (hard) hyphens (using the hyphen character -). They should not be encoded as soft hyphens, since they are not intended to disappear if line breaks are removed. Quotation marks and other delimiters in the catchword are best encoded as #PCDATA rather than by using the rend attribute, on the principle that there are no circumstances under which their presentation would be varied.

Examples

Example 1.

An example of an error appearing in a catchword (discrepancy between catchword and text), where the last word on preceding page is un-; the catchword is pining; the first word on next page is repining

<p>...un&shy;
<mw type="catch"><sic>pining</sic></mw>
<pb>
repining...</p>

Example 2.

An example of an error in the text, where the last word on preceding page is un-; the catchword is repining; and the first word on next page is pining

<p>...un&shy;
<mw type="catch">repining</mw>
<pb>
<sic corr="re"></sic>pining...</p>

Example 3.

An example of an error in both catchword and text, where the last word on the preceding page is un-; the catchword is repinning; the first word on next page is pining

<p>...un&shy;
<mw type="catch"><sic>repinnig</sic></mw>
<pb>
<sic corr="re"></sic>pining...</p>