[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[bug #57556] Enforce left and right minimums as required by hyphenation
From: |
G. Branden Robinson |
Subject: |
[bug #57556] Enforce left and right minimums as required by hyphenation patterns |
Date: |
Sun, 17 Jan 2021 01:42:31 -0500 (EST) |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Firefox/78.0 |
Follow-up Comment #6, bug #57556 (project groff):
[comment #1 comment #1:]
> See
>
>
https://github.com/hyphenation/tex-hyphen/tree/master/hyph-utf8/tex/generic/hyph-utf8/patterns/tex
>
> for patterns with YAML metadata containing hyphenmin values.
Below I'm inlining what those look like. Integrating YAML parsing into the
troff executable seems like overkill. We just need a little state machine to
read and match the relevant chunk ("hyphenmins") of the comment header and
complain if one isn't found.
Design question: since a minimum hyphenation mode can be _computed_ based upon
this characteristic of the pattern files, should reading such a pattern file
effectively set .hy?
I'm thinking "probably not" and that such a thing can be rolled into bug
#55070, if that ever happens, but it's worth considering.
% title: Hyphenation patterns for American English
% copyright: Copyright (C) 1990, 2004, 2005 Gerard D.C. Kuiken
% notice: This file is part of the hyph-utf8 package.
% See http://www.hyphenation.org/tex for more information.
% language:
% name: English, American spelling
% tag: en-us
% version: 2005-05-30
% authors:
% -
% name: Gerard D.C. Kuiken
% licence:
% text: >
% Copying and distribution of this file, with or without
modification,
% are permitted in any medium without royalty provided the copyright
% notice and this notice are preserved.
% hyphenmins:
% typesetting:
% left: 2
% right: 3
% changes:
% March 1, 1990 Initial release
% May 30, 2005 Added copyright notice, no patterns change.
% texlive:
% encoding: ascii
% babelname: usenglishmax
% legacy_patterns: ushyphmax.tex
% message: Hyphenation patterns for American English
% package: english
% known_bugs:
% de-mo-c-rat: 'instead of dem-o-crat (see GitHub issue #15)'
% ==========================================
%
% ushyphmax.tex -- patterns for more hyphenation pattern memory (12000+).
% Also known as ushyphen.max.
%
% Needs extended pattern memory.
% Hyphenation trie becomes 7283 with 377 ops.
%
% These patterns are based on the Hyphenation Exception Log
% published in TUGboat, Volume 10 (1989), No. 3, pp. 337-341,
% and a large number of incorrectly hyphenated words not yet published.
% If added to Liang's before the closing bracket } of \patterns,
% the patterns run errorfree as far as known at this moment.
%
% These patterns find all admissible hyphens of the words in
% the Exception Log. ushyph2.tex is a smaller set.
%
% Please send bugs or suggestions to tex-live (at) tug.org.
%
% 2005-05-30 (karl): in the past, ushyphmax.tex was a file containing
% only the additional patterns, without the \patterns command, etc.
% This turned out not to be very useful, since in practice the TeX
% distributions need one self-contained file for a language. Therefore,
% ushyphmax.tex now contains both the additional patterns from
% Dr. Kuiken, and the original patterns and hyphenations from Knuth's
% hyphen.tex.
%
% The Plain TeX hyphenation tables.
\patterns{ % just type <return> if you're not using INITEX
.ach4
[...]
_______________________________________________________
Reply to this item at:
<https://savannah.gnu.org/bugs/?57556>
_______________________________________________
Message sent via Savannah
https://savannah.gnu.org/