Proposal to include Duployéan Shorthands and Chinook script and Shorthand Format Controls in Unicode/ISO-10646
file:///C|/Users/vanisaac/Desktop/Chinook/Proposal.html[2010-07-30 13:20:57]
Doc Type: Complete proposal with documentation and data files
Title: Proposal to include Duployan Shorthands and Chinook script and Shorthand Format Controls in UCS
Attached Files: UCD additions file, ShorthandFormat.txt, Encoding Guide, Test Encoding, keyboard layouts, Code Chart
Source: Van Anderson
Status: Submitted to UTC.
Action: For approval by UTC and WG2. Comments to Van Anderson
Date: 2010-07-29
Discussion list: Chinook in the UCS
Historical Overview of the Duployéan and adaptations
The Duployéan shorthands and Chinook script are used as a secondary shorthand for writing French, English, German, Spanish, Romanian, and as an alternate primary script for several first nations' languages of interior British Columbia, including the Chinook Jargon, Okanagan, Lilooet, Shushwap, and North Thompson. The original Duployéan shorthand was invented by Emile Duployé, published in 1860, as a stenographic shorthand for French. It was one of the two most commonly used French shorthands, being more popular in the south of France, and adjacent French speaking areas of other countries. Adapted Duployéan shorthands were also developed for English, German, Spanish, and Romanian. The basic inventory of consonant and vowel signs - all in the first two columns of the allocation - has been augmented over the years to provide more efficient shorthands for these languages and to adapt it to the phonologies of these languages and the languages using Chinook writing. There currently exists no encoding - PUA or otherwise - for the representation of the Duployan or Chinook. Indeed, the submission of the Duployan Shorthands and Chinook script to the Unicode Consortium has necessitated the creation, from scratch, of the first Duployéan/Chinook font, and the allocation is based solely on the internal logic of the script and affinity of usage among characters.
The Chinook script was an adaptation and augmentation of the Duployéan shorthand by Fr. Jean Marie Raphael LeJeune, used for writing the Chinook Jargon and other languages of 19th c. interior British Columbia. Its original use and greatest surviving attestation is from the run of the Kamloops Wawa, a (mostly) Chinook Jargon newsletter of the Catholic diocese of Kamloops, British Columbia, published 1891-1923. At the time, the Chinook Jargon pidgin was widely spoken from SE Alaska to northern California, from the Pacific to the Rockies, and sporadically outside this area. Although the Chinook Jargon was the lingua franca in many communities of the Pacific Northwest, it was generally a spoken, rather than written language. Most attempts at documentation used the Latin script to approximate Jargon words with English or French phonology, and indeed, dictionaries of the Chinook Jargon are still readily available in these Latinate orthographies. In contrast, the archives of the Kamloops Wawa, written in Chinook, include a considerable dictionary, but also constitute an unparalleled 3+ decade corpus of Chinook Jargon usage during the height of its spread and utility. The Chinook Script makes use of the basic Duployéan inventory, with the addition of several derived letterforms and compound letters.
In 1984, the "Students' Practical Encyclopedia" (Enciclopedia practicâ a copiilor) was published in Romania, containing the "Curs de Stenografie" by Margareta Sfinţescu. This shorthand was an adaptation of the Duployéan for Romanian, using a few of the Chinook and Duployan shorthand compound letters as basic letterforms, and several basic vowel forms with diacritics. It also makes use of a "doubling mark" to indicate a general duplication of a word or phonemic form.
The Pernin shorthand was first published by Helen M. Pernin as "Pernin's Universal Phonography" no later than 1882. There is an alternate version of the Pernin shorthand published as "Pernin's Practical Reporter", that has different affixes. The next year, John Mathew Sloan published the competing Sloan-Duployan method, which was expanded in 1918, when Denis R. Perrault published the Perrault-Duployan system. All three of the above, being the main English adaptations of Duployan, enjoyed some popularity, but never attained the reach of Pitman or Gregg shorthands. All three systems share many characters with Chinook and each other. The most significant anomalies of these systems are the invariant vowel signs in Pernin, the quarter-circle combined consonants, found in each system but with differing values, the extensive use of vowel diacritics in Sloan, and heavy shading of letters - as the voiced consonants in Pitman-based systems are - to indicate "r" flavored letters in Sloan.
Unsupported orthographies. Currently, materials are unavailable to attempt including Carl Brandt's English Duployéan adaptation or George Galloway's extension of the Sloan-Duployan in the current encoding. Similarly, documentation of the adaptations of Duployéan to German and Spanish is unavailable, so complete support for these orthographies is probably not offered in the current allocation. Allocation space has been set aside to reasonably accommodate additions for some of these extensions of the Duployéan script.
Typology
Duployéan is, at its core, an alphabetic (consonant & vowel) stenographic (simple line & curve) writing system (cf. Pitman shorthand, a stenographic abjad). It classifies under the geometric shorthands, in that the model letterforms are generally based on circles and lines (cf. Gregg's elliptical shorthand). In general, there is a visual and functional distinction between consonants, which are based either on lines or large semi-circles and have invariable orientation, i.e. consonants do not rotate to match with surrounding letters; and the vowels, which are generally based on circles, quarter arcs, and small semi-circles, and generally reshape and orient contextually. It is an LTR script, proceeding down the page in lines like most modern Western scripts, although individual letters may be written right-to-left.
Script Structure
The core repertoire of the Duployéan writing contains several classes of letters, differentiated primarily by visual form and stroke direction, and nominally by phonetic value. Letter classes include the line consonants (P, T, F, K, & L-type) and arc consonants (M, N, J, & S-type), circle vowels (A, O, & W- vowels), nasal vowels, and orienting vowels (U/Eu, I/E). In addition, the Chinook writing contains spacing letters, compound consonants, and a logograph. The extended Duployéan shorthand includes four other letter classes - the complex letters (multisyllabic symbols with consonant forms), and high, low, and connecting terminals for common word endings. The Romanian stenography, Pernin, Perrault, and Sloan orthographies add a few letters or letter forms, ideographs, and several combined letters. Most "core" letters have related variant forms, including the addition of ancillary dots and crosses, size variants, and the compounding of vowels.
Since the Duployéan was originally developed as a shorthand system, strings of letters are joined together cursively into words in Duployéan, Romanian, Pernin, Perrault, and Sloan, or nominally syllabic units in Chinook - usually with a single circle vowel for each unit. The original Duployéan and its offshoots all encourage overlapping for initialisms and abbreviations and many prescribe overlaps and raised or lowered text height for some morphemes or phonemes.
Character List
Supplemental Punctuation 2E00-2E7F
2E3C;STENOGRAPHIC PERIOD
Duployan Shorthands and Chinook 1BC00-1BC9F
1BC00 - DUPLOYAN LETTER H
1BC01 - DUPLOYAN LETTER P
1BC02 - DUPLOYAN LETTER T
1BC03 - DUPLOYAN LETTER F
1BC04 - DUPLOYAN LETTER K
1BC05 - DUPLOYAN LETTER L
1BC06 - DUPLOYAN LETTER M
1BC07 - DUPLOYAN LETTER N
1BC08 - DUPLOYAN LETTER J
1BC09 - DUPLOYAN LETTER S
1BC0A - DUPLOYAN LETTER O
1BC0B - DUPLOYAN LETTER A
1BC0C - DUPLOYAN LETTER I
1BC0D - DUPLOYAN LETTER U
1BC0E - DUPLOYAN LETTER OU
1BC0F - DUPLOYAN LETTER OW
1BC10 - DUPLOYAN LETTER X
1BC11 - DUPLOYAN LETTER B
1BC12 - DUPLOYAN LETTER D
1BC13 - DUPLOYAN LETTER V
1BC14 - DUPLOYAN LETTER G
1BC15 - DUPLOYAN LETTER R
1BC16 - DUPLOYAN LETTER VOCALIC M
1BC18 - DUPLOYAN LETTER NASAL I
1BC19 - DUPLOYAN LETTER NASAL U
1BC1A - DUPLOYAN LETTER NASAL O
1BC1B - DUPLOYAN LETTER NASAL A
1BC1C - DUPLOYAN LETTER E
1BC1D - DUPLOYAN LETTER EU
1BC1E - DUPLOYAN LETTER ROMANIAN I
1BC1F - DUPLOYAN LETTER ROMANIAN U
1BC20 - DUPLOYAN LETTER U N
1BC21 - DUPLOYAN LETTER P N
1BC22 - DUPLOYAN LETTER D S
1BC23 - DUPLOYAN LETTER F N
1BC24 - DUPLOYAN LETTER K M
1BC25 - DUPLOYAN LETTER R S
1BC26 - DUPLOYAN LETTER M S
1BC27 - DUPLOYAN LETTER N S
1BC28 - DUPLOYAN LETTER J S
1BC29 - DUPLOYAN LETTER S S
1BC2A - DUPLOYAN AFFIX HIGH ACUTE
1BC2B - DUPLOYAN AFFIX HIGH GRAVE
1BC2C - DUPLOYAN AFFIX HIGH DOT
1BC2D - DUPLOYAN AFFIX HIGH CIRCLE
1BC2E - DUPLOYAN AFFIX HIGH LINE
1BC2F - DUPLOYAN AFFIX HIGH WAVE
1BC30 - DUPLOYAN LETTER J N
1BC31 - DUPLOYAN LETTER J N S
1BC32 - DUPLOYAN LETTER M N
1BC33 - DUPLOYAN LETTER N M
1BC34 - DUPLOYAN LETTER J M
1BC35 - DUPLOYAN LETTER S J
1BC36 - DUPLOYAN LETTER M N S
1BC37 - DUPLOYAN LETTER N M S
1BC38 - DUPLOYAN LETTER J M S
1BC39 - DUPLOYAN LETTER S J S
1BC3A - DUPLOYAN AFFIX LOW ACUTE
1BC3B - DUPLOYAN AFFIX LOW GRAVE
1BC3C - DUPLOYAN AFFIX LOW DOT
1BC3D - DUPLOYAN AFFIX LOW CIRCLE
1BC3E - DUPLOYAN AFFIX LOW LINE
1BC3F - DUPLOYAN AFFIX LOW WAVE
1BC40 - DUPLOYAN AFFIX ATTACHED SECANT
1BC41 - DUPLOYAN AFFIX ATTACHED TANGENT
1BC42 - DUPLOYAN AFFIX ATTACHED TAIL
1BC43 - DUPLOYAN AFFIX ATTACHED E HOOK
1BC44 - DUPLOYAN AFFIX ATTACHED I HOOK
1BC46 - DUPLOYAN LETTER AOU
1BC47 - DUPLOYAN LETTER OA
1BC48 - DUPLOYAN LETTER J S WITH DOT
1BC49 - DUPLOYAN LETTER S WITH DOT BELOW
1BC4A - DUPLOYAN LETTER SHORT I
1BC4B - DUPLOYAN LETTER EE
1BC4C - DUPLOYAN LETTER IE
1BC4D - DUPLOYAN LETTER UI
1BC4E - DUPLOYAN LETTER YE
1BC4F - DUPLOYAN DOUBLE MARK
1BC50 - DUPLOYAN AFFIX LOW ARROW
1BC51 - DUPLOYAN AFFIX ATTACHED TANGENT HOOK
1BC52 - DUPLOYAN AFFIX ATTACHED LEFT-TO-RIGHT SECANT
1BC55 - DUPLOYAN LETTER J WITH DOTS INSIDE AND ABOVE
1BC56 - DUPLOYAN LETTER M WITH DOT
1BC57 - DUPLOYAN LETTER N WITH DOT
1BC58 - DUPLOYAN LETTER J WITH DOT
1BC59 - DUPLOYAN LETTER S WITH DOT
1BC5A - DUPLOYAN LETTER WO
1BC5B - DUPLOYAN LETTER WA
1BC5C - DUPLOYAN LETTER WI
1BC5D - DUPLOYAN LETTER WEI
1BC5F - DUPLOYAN LETTER WOW
1BC60 - DUPLOYAN LETTER XW
1BC61 - DUPLOYAN LETTER TH
1BC62 - DUPLOYAN LETTER DH
1BC63 - DUPLOYAN LETTER SLOAN DH
1BC66 - DUPLOYAN LETTER SLOAN J
1BC67 - DUPLOYAN LETTER KK
1BC68 - DUPLOYAN LETTER HL
1BC69 - DUPLOYAN LETTER LH
1BC6A - DUPLOYAN LETTER RH
1BC6E - DUPLOYAN SIGN O WITH CROSS
1BC6F - DUPLOYAN PUNCTUATION CHINOOK FULL STOP
1BC70 - DUPLOYAN LETTER W
1BC71 - DUPLOYAN LETTER LONG U
1BC72 - DUPLOYAN LETTER UH
1BC73 - DUPLOYAN LETTER OOH
1BC74 - DUPLOYAN LETTER SLOAN U
1BC75 - DUPLOYAN LETTER SLOAN OW
1BC76 - DUPLOYAN LETTER SLOAN EH
1BC77 - DUPLOYAN LETTER SLOAN EE
1BC78 - DUPLOYAN LETTER LONG I
1BC7A - DUPLOYAN LETTER PERNIN AN
1BC7B - DUPLOYAN LETTER PERNIN AM
1BC7C - DUPLOYAN LETTER SLOAN AN
1BC7D - DUPLOYAN LETTER SLOAN EN
1BC7E - DUPLOYAN LETTER SLOAN ON
1BC7F - DUPLOYAN THICK LETTER SELECTOR
1BC80 - DUPLOYAN AFFIX LOW VERTICAL SECANT
1BC81 - DUPLOYAN AFFIX MID VERTICAL SECANT
1BC82 - DUPLOYAN AFFIX HIGH VERTICAL SECANT
1BC83 - DUPLOYAN AFFIX HIGH LONG GRAVE
1BC84 - DUPLOYAN AFFIX HIGH VERTICAL
1BC85 - DUPLOYAN AFFIX HIGH TIGHT ACUTE
1BC88 - DUPLOYAN LETTER S T
1BC89 - DUPLOYAN LETTER S T R
1BC8A - DUPLOYAN LETTER S P
1BC8B - DUPLOYAN LETTER S P R
1BC8C - DUPLOYAN LETTER T S
1BC8D - DUPLOYAN LETTER T R S
1BC8E - DUPLOYAN LETTER WH
1BC8F - DUPLOYAN LETTER W R
1BC90 - DUPLOYAN AFFIX LEFT HORIZONTAL SECANT
1BC91 - DUPLOYAN AFFIX MID HORIZONTAL SECANT
1BC92 - DUPLOYAN AFFIX RIGHT HORIZONTAL SECANT
1BC93 - DUPLOYAN AFFIX LOW LONG GRAVE
1BC94 - DUPLOYAN AFFIX LOW VERTICAL
1BC95 - DUPLOYAN AFFIX LOW TIGHT ACUTE
1BC9A - DUPLOYAN LETTER S N
1BC9B - DUPLOYAN LETTER S M
1BC9C - DUPLOYAN LETTER K R S
1BC9D - DUPLOYAN LETTER G R S
1BC9E - DUPLOYAN LETTER S K
1BC9F - DUPLOYAN LETTER S K R
Shorthand Format Controls 1BCF0-1BCFF
1BCF0 - SHORTHAND FORMAT LETTER OVERLAP
1BCF1 - SHORTHAND FORMAT CONTINUING OVERLAP
1BCF2 - SHORTHAND FORMAT DOWN STEP
1BCF3 - SHORTHAND FORMAT UP STEP
Character names. For naming purposes, the Duployan Shorthands and Chinook script have two distinct sets of characters. The first set consists of most letters and letter based signs that generally interact cursively with each other, with the exception of a few spacing characters. The second set consists of affix signs that can be attached/overlapping or sit above or below the adjacent characters at the beginning and end of words, and the word signs. The first set have character names that indicate their primary phonetic value, while the second set are described graphically.
Support, Funding, and Thanks. This project was made possible in part by a grant from the U.S. National Endowment for the Humanities to the Universal Scripts Project (as part of the Script Encoding Initiative, UC Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.
This proposal has also been materially supported by the facilities and resources of Michael Everson's Evertype, Justin Cassidy, the United States' Library of Congress, and the Timberland Regional Library, and the views, findings, conclusions and recommendations expressed herein do not necessarily reflect those of said organizations.
Special thanks to Laurenţiu Iancu at Microsoft, Eric Muller at Adobe, Dave Robertson at University of Victoria (BC), and Michael Everson, for tips, information, documentation, and other intellectual support in this project. Thanks to Ken Whistler (Sybase), Rick McGowan (Unicode), Deborah Anderson (UC Berkeley), Justin Cassidy, Micah Ferrell, William Poser, and Asmus Freytag for feedback and logistical support. This proposal would not have been possible without the involvement of these people. Thank you to the members of the Microsoft VOLT user group for technical help with the test font for this project.
Character Ordering and Roadmap to the Duployan Shorthand and Chinook character block
Ordering of the characters in the Duployéan-based scripts is generally undefined - many cite in Latin alphabetical order - and the allocation order is based on usage and script logic. The currently proposed allocation ordering and its basis are as follows:
Columns 0 and 1 are occupied mostly by characters that make up the core inventory of the different Duployan shorthands and the Chinook script. Most Duployan orthographies will use all but a few of the characters in these two columns. Optimization algorithms may be able to take advantage of the fact that these characters constitute the most frequently used in any Duployan orthography. Columns 2 and 3 contain the French Duployéan compound letters and affixes. Several of these characters are also in the core and supplementary inventory of other orthographies. Column 4 is a mixture of diphthongs, affixes, and letters for several orthographies. Columns 5 and 6 contain the Chinook compound letters, and similarly constructed letters and signs from the Romanian shorthand and Sloan systems, the Chinook Full Stop and Likalisti signs, and a couple Romanian affixes. Column 7 contains vowels for the English Duployan systems, ending with the Sloan "R", which is a combining character that functions as a format. Columns 8 and 9 have two parts, each beginning with Pernin affixes, and containing the quarter-circle arcs for the English orthographies.
This allocation provides for all characters needed for French Duployéan in columns 0-4, Romanian shorthand in 0-5, Chinook in 0-6, with Pernin, Sloan, and Perrault using columns from 0 through 9. Seventeen code points have been left unallocated for any additions needed for the Brandt and Galloway systems or the Spanish and German adaptations of Duployéan, as no documents on or in these systems have been located.
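The column allocation above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the block range 1BC00-1BC9F and the column limits are taken from this proposal, while the names `chart_column`, `orthographies_covering`, and `MAX_COLUMN` are hypothetical helpers, not part of any attached data file.

```python
# Proposed block range, as given in the Character List of this proposal.
BLOCK_START = 0x1BC00
BLOCK_END = 0x1BC9F

# Highest code chart column each orthography draws on, per the allocation text.
# (Hypothetical table name; the values come from the paragraph above.)
MAX_COLUMN = {
    "French Duployan": 4,
    "Romanian": 5,
    "Chinook": 6,
    "Pernin": 9,
    "Sloan": 9,
    "Perrault": 9,
}


def chart_column(cp: int) -> int:
    """Return the 16-code-point chart column (0-9) of a proposed code point."""
    if not BLOCK_START <= cp <= BLOCK_END:
        raise ValueError(f"U+{cp:04X} is outside the proposed Duployan block")
    return (cp - BLOCK_START) >> 4


def orthographies_covering(cp: int) -> list[str]:
    """List the orthographies whose column span includes this code point.

    Falling inside the span does not mean the orthography uses the character.
    """
    column = chart_column(cp)
    return sorted(name for name, top in MAX_COLUMN.items() if column <= top)


print(chart_column(0x1BC6F))            # column of CHINOOK FULL STOP
print(orthographies_covering(0x1BC6F))
```

A font or keyboard targeting a single orthography could use such a check to decide which columns it needs to cover at a minimum.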
Collation
Information on collation of Duployan scripts is generally ambiguous and arbitrary. Many dictionaries and primers simply cite in that language's Latin alphabetical order with no attempt made at native collation. Other sources group words by novel alphabetization, no more or less canonical than any other. The Romanian "Curs de Stenografie" does make an effort at native collation, starting with vowels, and then in the general order of the consonants in this allocation. The collation algorithm prescribed herein is based on principles derived from the Chinook, but results in an order similar to the Romanian.
The most logical collation, given the structure of the script, is to collate by general shape, which places primacy on the consonants which, being invariant, tend to determine the shape of a word. Vowels have their own order, and clusters of one or more vowels should be collated as if they were a single vowel. Initial vowel clusters are ordered before the first consonant, medial and final clusters after the last.
Collation starts with consonants - initial vowels (i.e. no consonant) << H << P << T << F << K << L << M << N << J << S << combined consonants << medial/final vowels - then Affixes - attached << high << low - and finally signs. Secondary weight is given to diacritics, marks, and the bold R letters in the Sloan orthography - all characters which do not change the basic shape of the word form. Tertiary weight is given to the joiners, spaces, and format controls, some of which can indicate semantic content, but often indicate presentation form.
All variant and compound consonants are collated directly after their base letters, with voiced consonants and their variants after the last unvoiced variants. The vowels collate similarly - O, A, I, U, Ou, Ow, Nasals - with variants collated after their base letters.
This collation order, based on the numeric values of letters in Chinook, corresponds significantly with the order of words in the Romanian "Curs de Stenografie", except that F/V comes before K/G instead of after, and A comes after O instead of before.
Collation table The ornamental horizontal rules in this document show the general collation order in simplified form.
Primary collation: Initial vowel cluster < H < X < P < B < P N < T < TH < SLOAN DH < D < DH < D S < F < V < F N < K < KK < G < SLOAN J < K M < L < HL < LH < R < RH < R S < M < M N < M WITH DOT < M S < M N S < N < N M < N WITH DOT < N S < N M S < J < J M < J N < J WITH DOT < J WITH DOTS INSIDE AND ABOVE < J S < J M S < J N S < J S WITH DOT < S < S J < S WITH DOT < S WITH DOT BELOW < S S < S J S < S T < S T R < S P < S P R < T S < T R S < W < WH < W R < S N < S M < K R S < G R S < S K < S K R < medial/final vowel cluster.
< ATTACHED SECANT < ATTACHED TANGENT < ATTACHED TAIL < ATTACHED I HOOK < ATTACHED E HOOK < ATTACHED TANGENT HOOK < ATTACHED LTR SECANT < LOW VERTICAL SECANT < MID VERTICAL SECANT < HIGH VERTICAL SECANT < LEFT HORIZONTAL SECANT < MID HORIZONTAL SECANT < RIGHT HORIZONTAL SECANT < HIGH ACUTE ARC < HIGH TIGHT ACUTE < HIGH GRAVE ARC < HIGH LONG GRAVE < HIGH DOT < HIGH CIRCLE < HIGH LINE < HIGH WAVE < HIGH VERTICAL < LOW ACUTE ARC < LOW TIGHT ACUTE < LOW GRAVE ARC < LOW LONG GRAVE < LOW DOT < LOW CIRCLE < LOW LINE < LOW WAVE < LOW VERTICAL < LOW ARROW < O WITH CROSS < word/affix signs from outside the Duployan block - Less Than, Greater Than, Multiplication, Plus Sign, etc.
Vowel order: O < WO < AOU < A < WA < OA < Sloan OW < I < E < WI < WEI < Romanian I < Sloan EH < Sloan EE < Short I < EE < IE <UI < YE < Long I < U < EU < XW < U N < LONG U < UH < OOH < Sloan U < OU < OW < WOW < Romanian U < Vocalic M < Nasal I <Nasal U < Nasal O < Nasal A < Pernin AN < Pernin AM < Sloan AN < Sloan EN < Sloan ON;
Secondary collation: No marks < Combining R < Double Mark < Diacritics.
Tertiary collation: No format < Variation Selectors < SP/NBSP < ZWNJ < ZWJ; < shorthand formats Letter Overlap < Continuing Overlap <Down < Up; < following ZWSP < HSP < 6/MSP < THSP < 4/MSP < 3/MSP < NSP < MSP.
Irrelevant: All punctuation, including CHINOOK FULL STOP.
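As a rough illustration of the primary level only, the following Python sketch builds a sort key from the simplified consonant and vowel orders given above. It is a sketch under assumptions, not the prescribed algorithm: words are modelled as lists of short letter names, only the ten base consonants and six base vowels are included, secondary and tertiary weights are ignored, and using the vowel order as a tie-breaker after the consonant skeleton is one possible reading of "vowels have their own order". All identifiers are hypothetical.

```python
# Simplified orders taken from the prose above; the full tables are longer.
CONSONANT_ORDER = ["H", "P", "T", "F", "K", "L", "M", "N", "J", "S"]
VOWEL_ORDER = ["O", "A", "I", "U", "OU", "OW"]

INITIAL_CLUSTER = 0                       # initial vowel cluster sorts first
CONSONANT_RANK = {name: i + 1 for i, name in enumerate(CONSONANT_ORDER)}
FINAL_CLUSTER = len(CONSONANT_ORDER) + 1  # medial/final clusters sort last
VOWEL_RANK = {name: i for i, name in enumerate(VOWEL_ORDER)}


def collation_key(letters):
    """Build a two-part key: consonant skeleton first, vowel detail second.

    Each run of vowels is collapsed into a single primary element, so the
    consonants dominate the comparison (assumed reading of the text).
    """
    primary, clusters = [], []
    seen_consonant = False
    i = 0
    while i < len(letters):
        if letters[i] in VOWEL_RANK:
            cluster = []
            while i < len(letters) and letters[i] in VOWEL_RANK:
                cluster.append(VOWEL_RANK[letters[i]])
                i += 1
            primary.append(FINAL_CLUSTER if seen_consonant else INITIAL_CLUSTER)
            clusters.append(tuple(cluster))
        else:
            primary.append(CONSONANT_RANK[letters[i]])
            seen_consonant = True
            i += 1
    return tuple(primary), tuple(clusters)


words = [["P", "A"], ["A", "P"], ["T", "O"], ["P", "O"], ["H", "A"]]
for word in sorted(words, key=collation_key):
    print(" ".join(word))
```

With this key, a word beginning with a vowel sorts ahead of every consonant-initial word, and words sharing a consonant skeleton are separated by the vowel order (O before A).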
Input. A Basic Duployan keyboard layout has been devised for inputting Duployan text. This places the most common characters in the easiest to reach key positions. Keys are also defined for the basic nasal vowels with inherent joiners, which are the necessary encoding form in many orthographies. A character map, MSKLC file, and installation files are attached in keyboard.zip.
This keyboard layout should be considered informative as a base layout for the complete Duployan and Chinook. Other Duployan keyboards should not be constrained by this layout; specifically, a layout for a particular orthography should place the characters necessary for that language on the most convenient keys, regardless of the general layout, and should not necessarily provide for access to all Duployan characters or alternate forms of the nasal vowels.
Character Properties. Duployan and Chinook are uncased, and as such, letters and affix signs are gc=Lo. The Double Mark is gc=Mn, ccc=1. The O with Cross is gc=So; Chinook Full Stop and Stenographic Period are gc=Po. Shorthand Formats and Duployan Thick Letter Selector are gc=Cf. All character properties are contained in the attached UCDadditions.txt.
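The property assignments above can be summarised as a small lookup. The Python sketch below is illustrative only: the code points are those of the Character List in this proposal, the function name `proposed_general_category` is hypothetical, and for simplicity it also reports gc=Lo for the unallocated code points inside the block, which the attached UCDadditions.txt would not do.

```python
def proposed_general_category(cp: int):
    """Return the General_Category this proposal assigns, or None if not covered."""
    if cp == 0x1BC4F:                      # DUPLOYAN DOUBLE MARK (also ccc=1)
        return "Mn"
    if cp == 0x1BC6E:                      # DUPLOYAN SIGN O WITH CROSS
        return "So"
    if cp in (0x1BC6F, 0x2E3C):            # CHINOOK FULL STOP, STENOGRAPHIC PERIOD
        return "Po"
    if cp == 0x1BC7F or 0x1BCF0 <= cp <= 0x1BCF3:
        return "Cf"                        # Thick Letter Selector, Shorthand Formats
    if 0x1BC00 <= cp <= 0x1BC9F:           # remaining letters and affix signs
        return "Lo"                        # simplification: gaps are not excluded
    return None


for cp in (0x1BC01, 0x1BC4F, 0x1BC6F, 0x1BCF0):
    print(f"U+{cp:04X}", proposed_general_category(cp))
```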
Principles of the Duployan Shorthands and Chinook scripts
Rendering Duployan Characters. Duployan characters, like characters in most shorthand scripts, can cursively connect, combine, and change shape depending on their context. A character's appearance is affected by the presence of adjacent characters, ligaturing, the font used to render the character, and the application or system environment. These variables can cause the appearance of Duployan and Chinook characters to differ from their nominal glyphs (used in the code charts). Duployan and Chinook characters join to each other by default, except for the high and low affixes and where otherwise noted in the code chart. Characters marked as non-joining, and any characters from other blocks, are non-joining to Duployan and Chinook characters by default. Exceptions are Zero Width Joiner (U+200D), by definition, and the Shorthand Format Controls (U+1BCF0-U+1BCF3), which are tied to Duployan as Script_Extensions, and alter the joining characteristics of adjacent stenographic characters. Defined width spacing characters (U+2000-U+200B) preserve the height of the cursive stroke, so they should be treated as simple joining characters, with a blank glyph image.
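The default joining behaviour described above might be approximated as follows. This Python sketch rests on several assumptions: the high and low affixes are taken to be the code points named AFFIX HIGH ... and AFFIX LOW ... other than the vertical secants, the few letters marked non-joining in the code chart are not modelled, and `joins_by_default` is a hypothetical helper rather than a defined property.

```python
ZWJ = 0x200D

# Assumed set of high and low affixes, read off the Character List names.
NON_JOINING_AFFIXES = (
    range(0x1BC2A, 0x1BC30),  # AFFIX HIGH ACUTE .. HIGH WAVE
    range(0x1BC3A, 0x1BC40),  # AFFIX LOW ACUTE .. LOW WAVE
    range(0x1BC50, 0x1BC51),  # AFFIX LOW ARROW
    range(0x1BC83, 0x1BC86),  # AFFIX HIGH LONG GRAVE .. HIGH TIGHT ACUTE
    range(0x1BC93, 0x1BC96),  # AFFIX LOW LONG GRAVE .. LOW TIGHT ACUTE
)


def joins_by_default(cp: int) -> bool:
    """Approximate whether a code point takes part in cursive joining."""
    if cp == ZWJ:
        return True                      # joiner, by definition
    if 0x1BCF0 <= cp <= 0x1BCF3:
        return True                      # Shorthand Format Controls
    if 0x2000 <= cp <= 0x200B:
        return True                      # defined width spaces, blank glyph
    if 0x1BC00 <= cp <= 0x1BC9F:
        return not any(cp in affixes for affixes in NON_JOINING_AFFIXES)
    return False                         # characters from other blocks


print(joins_by_default(0x1BC01))  # DUPLOYAN LETTER P
print(joins_by_default(0x1BC2C))  # DUPLOYAN AFFIX HIGH DOT
```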
Invariant letters. The majority of characters in the Duployan shorthands and Chinook scripts are invariant letters. They have a static shape, orientation, and stroke direction, and the set of invariant letters is almost completely contiguous with the consonants. Each invariant has a size - as many as three; a shape - line, quarter-circle, semicircle; a static orientation - N/S, E/W, NE/SW, NW/SE; an inherent stroke direction - generally LTR or TopToBottom; and many have derived and compound variants with markings (crosses or dots). They will usually cursively connect - the end of the first character's stroke is the beginning of the second's - but will also overlap with a following character when shorthand formats are used. A few invariant letters and all of the high and low affixes are classified as non-joining characters, that interact typographically with adjacent characters like a word or text break, and only have a stroke direction when overridden by ZWJ (U+200D).
It can be assumed in the following that similar characters, like D, D-S, TH, and DH have the same cursive, overlapping, and other connecting properties as the character on which they are based, i.e. T. Likewise, variations of N - N-S, N-M, N-M-S, and Ng - connect like an N, and so on. The invariant letters can be generally classified as P type (line with N-S stroke direction), T type (line, W-E), F type (line, NW-SE), K type (line, NE-SW), L type (line, SW-NE), M type (N-E-S semicircle), N type (N-W-S semicircle), J type (W-N-E semicircle), and S types (W-S-E semicircle), and combined consonants (all quarter-circles, see code chart). Furthermore, the P, T, F, K, and L collectively constitute the Line consonants, and the M, N, J, and S types, as well as the combined consonants, are arc consonants.
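The type classification above lends itself to a small table. The Python sketch below encodes only what the paragraph states: the nine base types with their shape and stroke direction, and the derived letters it names as connecting like T or N. The table and function names are hypothetical, and the quarter-circle combined consonants are left out because their details are given only in the code chart.

```python
# Base letter types: (shape, stroke direction), as listed in the text.
LETTER_TYPES = {
    "P": ("line", "N-S"),
    "T": ("line", "W-E"),
    "F": ("line", "NW-SE"),
    "K": ("line", "NE-SW"),
    "L": ("line", "SW-NE"),
    "M": ("semicircle", "N-E-S"),
    "N": ("semicircle", "N-W-S"),
    "J": ("semicircle", "W-N-E"),
    "S": ("semicircle", "W-S-E"),
}

# Derived letters named in the text, mapped to the base letter they connect like.
CONNECTS_LIKE = {
    "D": "T", "D-S": "T", "TH": "T", "DH": "T",
    "N-S": "N", "N-M": "N", "N-M-S": "N", "Ng": "N",
}


def base_type(letter: str) -> str:
    """Resolve a letter to the base type whose connecting behaviour it shares."""
    return CONNECTS_LIKE.get(letter, letter)


def stroke_direction(letter: str) -> str:
    return LETTER_TYPES[base_type(letter)][1]


def consonant_class(letter: str) -> str:
    """Return 'line' for P/T/F/K/L types and 'arc' for M/N/J/S types."""
    shape = LETTER_TYPES[base_type(letter)][0]
    return "line" if shape == "line" else "arc"


print(consonant_class("DH"), stroke_direction("DH"))
print(consonant_class("N-M-S"), stroke_direction("N-M-S"))
```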
Orienting characters. Many vowels have a consistent shape, but rotate to align with the preceding character and will mirror to allow the following character to attach without crossing the vowel or preceding character. When adjacent to one non-joining and one joining character, these orienting vowels will rotate to align with the adjacent joining character, and mirror right/up or left/down based on their identity as a primary orienting or secondary orienting vowel. Likewise, when adjacent to two similar type characters, or if the following character allows mirroring either way, they will align with the preceding character and mirror according to their orientation. Directional affinities are preserved, even when preceded by a non-joining and followed by a joining character. Primary orientation indicates an affinity for a stroke direction towards the right, and up when lacking a right/left distinction. Conversely, secondary orientation is left/down. Many orienting vowels come in pairs, with opposite orientations but the same basic shape. Except for 'I' and 'E', orienting characters can be bracketed by ZWJ/ZWNJ (U+200D/U+200C) to make a joining or non-joining invariant version. 'I' and 'E' have related invariant characters encoded separately.
Table 1: Comparison of Primary and Secondary Orienting Vowels
Primary (right/up) Orienting Vowels-
Secondary (left/down) Orienting Vowels -
Related to the orienting vowels and invariant letters are the attached affixes. Many of these, noted in the charts with "dots [to] show position on and relative orientation to base glyph", act as spacing or non-spacing marks that do not affect joining of adjacent characters, but do rotate to match the angle of the base character. Some, noted in the charts with "dots [to] show position on base glyph", are non-spacing invariant marks.
Circle vowels. The most commonly encountered vowel letters are the circle vowels. These vowels connect to preceding and following characters, with the adjacent characters entering the circle vowel at a tangent, and most (except Ou U+1BC0E) exiting the vowel shape at a tangent. The circle vowels often take partial contextual forms, with the adjacent characters implicitly completing the circle by crossing tangents.
Circle vowels followed but not preceded by a joining letter have a clockwise stroke direction into line consonants and will lie inside the arc of an arc consonant. Circle vowels preceded but not followed by a joining character will again sit inside the arc of an arc consonant, as if followed by a T-type if following a line consonant, and above the end of a T-type consonant. Circle vowels adjacent to two line consonants will lie outside the angle created by the intersection of the two lines. When adjacent to same type line consonants, they will again lie as if followed by a T-type. When adjacent to an arc consonant and another invariant, the circle vowel will follow the angle rule as given above, but when the adjacent characters do not present an angle, the circle vowel will lie in the same position as if the following joining character were not there.
Many sequences of successive circle vowels default to ligature forms. Where a ligature is not available, or when overridden by an intervening ZWJ + ZWNJ + ZWJ (U+200D + U+200C + U+200D), successive circle vowels not preceded by a joining character will connect at the vertical tangent on the shared side. If a preceding joiner character is present, cursively connected circle vowels will sit on opposite sides of the end of the previous character, with the following character determining the position of the second character, as with a primary orienting vowel (see Table 5 and additions to Figure 16-3, below).
The Duployan Letter Sloan Ow (U+1BC75) and, in the Pernin and Sloan orthographies, discretionary ligatures of circle vowels, are classified as reverse circle vowels. These reverse circle vowels are opposite a regular circle vowel, i.e. they have a withershins stroke direction, will lie outside of arc vowels, inside the angle of two line consonants, &c. Reverse circle vowels are not known to interact typographically with other vowel characters.
Table 2: Circle Vowels and Reverse Circle Vowels
Circle Vowels Reverse Circle Vowels
Nasal vowels are the only Duployan characters that are positioned contextually. A fully implemented typeface will allow for three different renderings of the four basic nasal vowels (U+1BC18-U+1BC1B). When adjacent to two joining characters, the nasal vowels will render as a diacritic placed outside the angle of the adjacent characters, shadowing the position of circle vowels adjacent to two characters, explained above. When still preceded by a joining character, but followed by ZWJ (U+200D) + a joining character or by any non-joining character, the nasal vowel will render as a primary or secondary orienting vowel in relation to the preceding joining character. It will either join with the following character, if a ZWJ intervenes, or be unjoined with ZWNJ or a non-joining character. Likewise, when following a joining character + ZWJ/ZWNJ (U+200D/U+200C), the nasal vowel will render as a primary or secondary orienting vowel in relation to the following joining character. The Duployan Letter Vocalic M (U+1BC16) is always a primary orienting vowel, cutting backward in relation to the preceding character, and does not position diacritically. A nasal vowel not preceded by a joining character, and followed by a ZWJ + joining character will still orient in relation to the following joining character, allowing for consistent use of Nasal Vowels + ZWJ in orthographies that do not use diacritic positioning of nasals.
When bracketed by Zero Width Joiners, nasal vowels will render as combining invariant characters as per the nominal glyph images. ZWNJ (U+200C) can be used when the orienting or invariant nasal vowel is not to be connected to an adjacent joining character. The Pernin and Sloan nasal vowels (U+1BC7A-U+1BC7E) are always invariant, and the Vocalic M (U+1BC16) never. The orthography of the Romanian stenography uses the two U arc vowels (U+1BC0D, U+1BC1D) as nasals, however the Romanian stenography uses nasals as orienting vowels (+ZWJ), and no marking is needed for proper rendering.
Table 3: Nasal Vowels
+ + → F + An + T
+ ++ → F + Anj + T
+ ++ → F + Annj + T
+ ++ → F + Onnj + T
++ + → T + jAn + T
++ + → T + jOn + T
++ ++ → F + Ani + T
P.S. The logic behind the prescribed use of ZWJ/ZWNJ is that it deprives the nasal vowel of its surrounding context, specifying only whether the adjacent characters will join, as there are no known ligatures of nasal vowels. The joiner controls could be replaced by any non-joining character and result in the same rendering of the nasal.
Compound vowels. The default rendering of compound vowel sequences (or vowel clusters) depends on the nature of the vowels involved. Most orthographies prefer ligation to simple compounding of circle vowels. However, compounding that visually preserves each member is regularly encountered in sequences involving orienting vowels combined with a circle vowel or any number of other orienting vowels. As a rule, circle vowels act as if an adjacent orienting vowel were a line consonant whose orientation is determined by any joining characters adjacent to the vowel cluster. The entire sequence should be rendered as if it were an orienting vowel, although a circle vowel between a joining character and orienting vowel will sit opposite the orienting vowel, touching at the intersection of the orienting vowel and joining character. These vowel clusters have primary or secondary orientation determined usually by the first character of the sequence, but by the last character when not preceded by a joining character. When the vowel cluster is not adjacent to any joining characters, default rendering is along a horizontal mid-line, as with clusters of circle vowels (see circle vowels, above).
Table 4: Compound vowels
+ → A + I
+ → A + E
+ + → A + I + T
+ + → A + E + T
+ + → I + A + T
+ + → P + A + I
+ + + → OR P + A + I + T
+ + + → OR P + A + I + M
+ → E + I
+ → I + E
+ → O + I
Ligatures, Allographs, and Standard Variants. Ligaturing behaviour is fairly limited in the Duployan orthographies, especially in comparison with other cursive scripts like Arabic and Devanagari. As with Arabic, Devanagari, and other complex scripts, ligatures can be expressly requested by use of Zero Width Joiner (U+200D). Zero Width Non Joiner (U+200C) should break a ligature into its component characters, and the sequence ZWJ + ZWNJ + ZWJ (U+200D + U+200C + U+200D) would break a default ligature and render the characters by default joining behaviour (see circle vowels, above).
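The three joiner conventions just described amount to simple string constructions. The helpers below are a minimal sketch with hypothetical names; the K and W code points in the usage note are the values given in this proposal, not final assignments.

```python
ZWJ = "\u200d"   # Zero Width Joiner
ZWNJ = "\u200c"  # Zero Width Non-Joiner


def request_ligature(first, second):
    """Explicitly request a (discretionary) ligature of two characters."""
    return first + ZWJ + second


def break_ligature(first, second):
    """Break a ligature into its component characters with ZWNJ."""
    return first + ZWNJ + second


def join_without_ligature(first, second):
    """Suppress a default ligature but keep default cursive joining."""
    return first + ZWJ + ZWNJ + ZWJ + second
```

For example, `join_without_ligature("\U0001BC04", "\U0001BC70")` would encode K + W joined cursively but without the default hook form of W.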
Discretionary Features. All discretionary contextual/ligature forms can be requested in plain text by using ZWJ (U+200D).
The Pernin orthography makes use of a contextual form for repeated consonants, reducing the second consonant to a small blot (in writing, caused by increasing pen or pencil pressure) at the end of the previous character's stroke. This applies to both identical and similar consonants, with the first consonant represented by its full form, e.g. T+Dot = T+T or T+D or T+Th, &c.
Pernin also prescribes a ligature form of a circle vowel preceding the Pernin R (Duployan letter L, U+1BC05), unless it is followed by another circle vowel. The ligature form is an identically sized reverse circle vowel (see Circle Vowels, above). Similarly, in the Sloan orthography, an initial circle vowel preceded by an R (U+1BC15) will render as a reverse circle vowel.
Standard Variants. All standard variants are requested in plain text Duployan by using Variation Selector 1 (U+FE00).
Pernin prescribes a "slight upward tick inclining to the left" for an L (U+1BC05, Pernin R) following R (U+1BC15, Pernin L), and one "to the right" for an R after L. This upward tick can also sometimes be found, generally at word end, following other consonants. These ticks are a standard variation sequence of the Duployan Letter L and the Duployan Letter R, encoded as L/R + VS1 (U+FE00).
The Duployan Letter W (U+1BC70) is the most variable letter among Duployan scripts. In the Sloan and Perrault orthographies, it is a full quarter arc, written NE-SW, 12 o'clock to 9 o'clock. In Pernin, on the other hand, it is closer to a one-sixth arc, starting closer to the 11 o'clock position, though still roughly the same arc length (with a larger diameter) as the Sloan/Perrault variety. Following K and G (U+1BC04, U+1BC14), the Duployan Letter W takes the form of a hook - Perrault tending a bit more wave-like than Pernin. Sloan prescribes other characters for K/G+W, and does not have a hook-form of W. The Pernin variant of W can be accessed in plain text by the use of the variation sequence W + VS1 (U+1BC70 + U+FE00), and the hook form as a default ligature/contextual form. As with all default ligatures, the unligated, joined sequence of K/G + W can be requested in plain text with a medial ZWJ + ZWNJ + ZWJ (U+200D + U+200C + U+200D).
In Chinook usage, the letters M, N, J, and S (U+1BC06 - U+1BC09) can be used as numbers (see numbers below). When they are, they are smaller than the normal sized "letter" forms. These variants can be specified, again, by the variation sequence M/N/J/S + VS1 (U+1BC06/7/8/9 + U+FE00).
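The standard variants described in this section all take the form base + VS1. A minimal sketch of a helper that only accepts the bases named above follows; the function and table names are hypothetical, the code points are those used in this proposal, and the assignment of N and J to U+1BC07 and U+1BC08 is assumed from the order in which the range is listed.

```python
VS1 = "\ufe00"  # Variation Selector 1

# Bases for which this section describes a VS1 variant.
VS1_BASES = {
    0x1BC05: "L (upward tick)",
    0x1BC15: "R (upward tick)",
    0x1BC70: "W (Pernin one-sixth arc)",
    0x1BC06: "M (Chinook numeral size)",
    0x1BC07: "N (Chinook numeral size)",  # assumed from range order
    0x1BC08: "J (Chinook numeral size)",  # assumed from range order
    0x1BC09: "S (Chinook numeral size)",
}


def standard_variant(base):
    """Return base + VS1, refusing bases with no described variant."""
    if ord(base) not in VS1_BASES:
        raise ValueError(f"no standard variant described for U+{ord(base):04X}")
    return base + VS1
```

Rejecting other bases keeps the sketch from producing variation sequences that the proposal does not define.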
Other default ligatures and contextual forms. Default features are unmarked in plain text. Unligated forms of these character sequences can be requested with the joining sequence ZWJ + ZWNJ + ZWJ (U+200D + U+200C + U+200D).
Most orthographies have some means of indicating the junction of two same type line consonants. Usually, this comes in the form of a slight (≤ line width) jog at the intersection, or sometimes a short cross-tick at the intersection of the characters or an angle change of L/R characters. For the purposes of plain text, the jog is considered the unligated form of the character sequence, and is the neutral default rendering. An implementation can prescribe the cross tick, or other indicator, as a default rendering. ZWJ should always request the Pernin dotted form, above, and never the tick, angle, or jog.
The Romanian orthography prescribes contextual forms for the Romanian U character (U+1BC1F) and its compounds. The nominal form given in the code charts is for non-medial contexts. When medial, it takes the form of Duployan Letter Ow (U+1BC0F). Positional ligatures include the sequence O + Romanian U (U+1BC0A + U+1BC1F), when initial or final, taking the form of an elongated, oval shaped, plain circle vowel. Medially, A or O + Romanian U (U+1BC0A/U+1BC0B + U+1BC1F) exhibits the default joining behaviour of sequential circle vowels, sitting on opposite sides of the end of the previous character - Romanian U again appearing in its medial "Ow" form. Following other vowels, Romanian U appears in diminished form, as a sort of tail.
Romanian also prescribes a ligated form of the vowel sequence O + A (U+1BC0A + U+1BC0B) that is visually identical to the letter Wa (U+1BC5B).
Lastly, the Duployan thick letter selector (U+1BC7F, DTLS) does not have a visual form of its own, but causes the previous character to be rendered as a thick variant, representing the addition of an 'R' sound to a Sloan letter. A ligature with the Duployan Letter R (U+1BC15) cannot substitute for the DTLS, as the added 'R' sound can occur in the middle of a compound letter.
Table 5: Ligatures, Allographs, & Alternates
Discretionary features
+ → B + p
+ + + → B + Ar + T
+ + → rO + P
Standard variants
+ + VS1 → R + L variant
+ + VS1 → T + R variant
+ VS1 → W variant
+ VS1 → M variant
+ VS1 → N variant
+ VS1 → J variant
+ VS1 → S variant
Default features
+ → K + W
+ + → P + Rom U + T
+ + → B + O + Rom U
+ + + → B + O + Rom U + D
+ + → B + A + Rom U
+ + + → B + A + Rom U + D
+ + + → B + I + A + Rom U
+ → T + D
+ + → B + O + A
+ + → FR + A
Additions to Figure 16-1. Prevention of Joining
+ → 1BC11 1BC08
+ + → 1BC11 200C 1BC08
Additions to Figure 16-2. Exhibition of Joining Glyphs in Isolation
→ 1BC1F
+ + → 200D 1BC1F 200D
Additions to Figure 16-3. Effect of Intervening Joiners
CharacterSequences As Is
1BC11 1BC0B 1BC05 1BC02
1BC11 1BC0A 1BC0B 1BC02
1BC15 1BC05
Joined text. The most common form of character interaction is that of the cursive connection. The termination of a character stroke leads directly into the beginning of the next character. Vowel signs follow the dynamic shaping discussed above, but fundamentally are the same as other joining characters, joining at a tangent to adjacent characters. Non-joining characters - any character from other scripts, and those found in Duployan - have a small intervening space, as with standard alphabetic writing.
Unjoined text. The Duployan script has a cursive joining property that, as in Arabic, can be interrupted by the use of the Zero Width Non-Joiner (ZWNJ, U+200C). ZWNJ encodes a break within a word, turning an otherwise joining character into a non-joining character, and resetting the cursive stroke height to neutral. This break is usually found only at nominally syllabic boundaries in Chinook texts, and where a separated letter or letters indicates an affix in the Duployan shorthands. This break is smaller than a word space, in some instances involving negative kerning, and is not a word break. ZWNJ and Zero Width Joiner (U+200D) will also change the positioning of the nasal vowels (see nasal vowels, above).
Overlapping text. The use of overlapping letters to indicate abbreviations and initialisms is found in many systems of shorthand. As such, the current proposal allocates a block of shorthand format characters, which encode non-default text flow in any shorthand. Included are two overlap control characters: the first (U+1BCF0) indicating a single letter overlap, with the text continuing to flow as if that overlapping character did not exist, and the second (U+1BCF1) indicating a continuing overlap where the text flow proceeds from the overlapping character. In Duployan, this behaviour is limited to consonants, circle vowels, and orienting vowels overlapping consonants.
The overlapping behavior in Duployan shorthands and Chinook is fairly straightforward: for two line consonants, two arc consonants, or a vowel overlapping any consonant, the two characters overlap at approximately 3/5 along the stroke of the first consonant and 2/5 along the stroke of a second consonant or the middle of a vowel. For overlaps of arc and line consonants, the arc consonant is split into the first and second half of the arc, an arc overlapping a line taking place in the first half, line over arc in the second. The line consonant, again at the 3/5 / 2/5 point, will meet the arc at a perpendicular angle, or as close as possible, never beyond the middle of the arc, nor past the end.
It is unknown if or how M type and N type or J type and S type arc consonants would overlap each other until such a time as examples of this occurrence are documented. Default rendering should indicate the overlap in some way, either preserving control characters, or through an offset. Same type line consonants also will not overlap, necessitating similar default rendering; L-type and K-type consonants will not overlap each other, as well, due to their similar angle.
As indicated above, the flow of text continues either with the first character in the case of U+1BCF0, or with the second in the case of U+1BCF1. An overlapping letter can also take another overlapping letter before returning to the original text flow. Also, in the Romanian shorthand, long line consonants (U+1BC11-U+1BC15) can take two overlapping characters, indicated by two Letter-Overlap control characters (U+1BCF0 + U+1BCF0) followed by the two overlapping characters. With double overlaps, the first overlapping character overlaps at approximately 1/3 of the stroke length of the base character, the second at ~ 2/3. See Parsing of Shorthand Overlap Sequences, below.
Down step. The Romanian shorthand prescribes that a certain set of word endings be indicated by letters following not in the default direction of text flow (to the right), but below the word. Likewise, the Sloan-Duployan and Pernin methods prescribe contracted word endings, wherein the next word is started low, to signal a dropped sound at the end of the previous word. As such, a shorthand format has been defined (U+1BCF2) that indicates a following character should be rendered below the previous character, with any subsequent joined characters proceeding relative to the lowered glyph. At word boundaries, this causes the next word (or stenographic period) to be lowered. Because the lowering is a part of the previous word, the lowered word boundary should be indicated by the shorthand format down step, followed by a width defined space (U+2002-U+200B) and the next word, or period (U+2E3C?). Note that the step format control is found directly after the preceding word, as it encodes a phoneme missing from the end. When Cross' Eclectic shorthand is encoded, a space will come before the step format control, as the change in alignment represents a missing initial phoneme.
Up step. The Sloan-Duployan and Pernin methods also prescribe contracted word endings, where the next word is started high, signaling the dropped sound. A shorthand format has been defined (U+1BCF3) to indicate a following word (or stenographic period) to be raised. Even though the up control is only found at word boundaries, this boundary form is still indicated by the shorthand format up step, followed by a width defined space and the next word, or period.
Aligned text. The last form of contracted words in Sloan-Duployan and Pernin are non-stepping, with the two words even. As with distinctions in spacing with the Step formats, distinctions in spacing of aligned text are encoded with defined-width space characters (ZWSP, U+200B; HSP, U+200A; 6/MSP, U+2006; 4/MSP, U+2005; 3/MSP, U+2004; ENSP, U+2002; EMSP, U+2003) or the non-breaking counterparts thereof (hence, Word Joiner, U+2060, as well). Note that Thin Space (U+2009) is not used, due to its common equivalence to the Six-Per-Em Space (U+2006). The regular space characters (U+0020 and U+00A0) cause the following word to start at a neutral baseline, and cannot be used for aligned or stepped word boundaries. If different sized spaces are needed unaligned, again, the above space characters can be used, preceded by ZWNJ. Note that the natural letterspacing of unjoined characters is retained with step format controls, so a ZWSP (U+200B) will not cause the adjacent characters to touch, and will, in fact, appear identical to a ZWNJ (U+200C), except that alignment will be preserved.
Table 6: Text flow
Joined Text
+ + → PJH
+ + → DKX
Unjoined Text
++ ++ → P.J.H
++ ++ → D.K.X
Letter Overlaps
++ → LineXS
++ → SXLine
++ → BxR
++ → DxG
++ → VxD
++ → GxB
++ → RxV
++ → MxM
++ → MxS
++ → MxJ
+ + + + + +
→ KATxKAT
Continuing & Double Overlaps
+ + + + + +
→ KATX+KAT
++ ++ → SxBxJ
+++ + + → DxA+KUn
Under affixes
+ ++ + → MIn-SA
Under word
++ = + → D-_T
Over word
+ + = + → B+_Ie
Combining diacritical marks on vowels. Several Duployan orthographies use combining diacritical marks to distinguish vowels. These diacritics include acute, grave, breve, macron, under macron, over dot, under dot, diaeresis, under diaeresis, &c. They can appear on orienting vowels, circle vowels, and nasal vowels (On, and An). Although there are several vowel letters with marks included in the allocation, these are not decomposable as a combining sequence, as the diacritic marks change position along with their "base" orienting vowel. Combining diacritics indicate vowels with diacritics that consistently appear above or below the base character, no matter the adjacent joining characters.
Affixes. Except for Chinook, every Duployan orthography makes extensive use of a set of marks - often similar, in appearance, to diacritics - and letters to symbolize lexical affixes. The unattached high and low Duployan affixes (U+1BC2A-2F, U+1BC3A-3E, U+1BC50, U+1BC83, U+1BC93) act much like spacing characters - the marks are written next to the word root, and will be either higher or lower than the adjacent letter.
The attached affixes (U+1BC40, x41, x44, etc.) touch or cross the first or last letter of a word (again for prefixes or suffixes), with the location of crossing (and touching if not evident) symbolized by a dotted line in the charts. The character names list specifies if the character rotates to complement the angle of the base letter, or is invariant. An attached affix always attaches to a letter, never to an affix. Since affixes are encoded logically, and unattached affixes can logically occur between a root and an attached affix, the displayed order of affixes may be different from the encoded order.
Third, some orthographies use letters or sequences of letters to indicate affixes, some of which appear similar to the high or low affix signs. As a rule, signs that are similar to a letter but unmotivated - that is, they don't symbolize a sound of the affix - or that form a high and low pair in the orthography, are symbolized by affix signs, not letters. Signs that are motivated and aren't paired high/low should be represented by a letter, often separated by ZWNJ (U+200C) from the root, whether the affix usually appears lower or higher than the adjacent character or not. Some letter affixes are encoded with the shorthand format Continuing Overlap (U+1BCF1). For consistency, the shorthand format Letter Overlap (U+1BCF0) should not be used to combine an affix to a root - even if the root is a single character.
In the Sloan orthography, successive high and low affixes or letters-as-affix and high/low affixes are written joined together. These compound affixes always position like the first high or low affix in the compound. A compound is encoded as affix 1 + ZWJ + affix 2, whether it is affix sign + affix sign or letter + affix sign. Letter + letter affixes do not need to be joined by ZWJ, as they are already joining characters. As with other affixes, if the compound ends with a letter-affix, it must also be followed by ZWNJ if it does not cursively connect with the word root. Likewise, some high/low affix signs can be used as an attached affix, again encoded with ZWJ (U+200D).
Table 7: Diacritics and affixes
Diacritics and Precomposed Vowels
→ P + E + Underdot
+ → P + Rom I
→ T + E + Underdot
+ → T + Rom I
Affix Signs and Letters
+ + → arc D arc
+ + → Darcarc
+ ++ → Dline+arc
+ + → /arcS → arc/S
+ + ++ → DOK-M
+ ++ → KT-R
++ ++ → T +vert+ I
Numbers. The Duployan orthographies each have a distinct means of expressing numbers. Some number systems must utilize formatting requiring markup to represent all aspects of the number system, and as of this time, there is no expectation that a full transcription of all number forms should be representable in plain text. The Chinook number system uses Duployan characters and markup to indicate numbers. The Romanian shorthand and French Duployéan use regular European/Arabic numerals in conjunction with Duployan characters, combining marks, and markup to indicate magnitude and aspect. Sloan and Pernin use markup and non-Duployan characters in conjunction with regular European/Arabic numerals.
Chinook numbers. The Chinook number characters are 1 P, 2 T, 3 F, 4 K, 5 R, 6 M, 7 N, 8 J, 9 S, 0 O, 10 A, 100 Wa, and a 1000-enclosing circle handled with markup. The numbers can be indicated Hanzi-style with P-S combining with O, A, or Wa to indicate value, although an O, A, or Wa must be preceded by a P to indicate a single hundred or ten, unlike Hanzi numerals. P-S connects to O, A, and Wa the same as in running text. O is used unconnected to indicate a zero or connected for the tens with a following digit zero, while A is used when connecting the tens to a non-zero ones digit. The enclosing circle for thousands surrounds the entire group of up to five characters (P-S + Wa + P-S + A/O + P-S), and can nest inside itself to indicate millions - a separate circle surrounding a following thousands group. Chinook numbers can also be indicated Indian/Arabic style, with the digits 0-9 (O-S) having place value. This is especially common when writing years or when numbering items, as opposed to enumerating them. The digits generally connect cursively, the same as in Hanzi-style Chinook numbers. For most Chinook writers, the numeral forms of M, N, J, and S are about half the normal size, and are requested in plain text by M/N/J/S + VS1 (U+FE00).
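The Hanzi-style composition can be illustrated with a short sketch that works on letter names rather than code points. It reflects one reading of the description above and is not a normative algorithm: the function name is hypothetical, only values below 1000 are handled (the thousands circle is markup), and numbers with a zero tens place between hundreds and ones (e.g. 305) are rejected because the text does not describe them.

```python
# Letter names for the digits 1-9, as listed in the text above.
DIGITS = {1: "P", 2: "T", 3: "F", 4: "K", 5: "R", 6: "M", 7: "N", 8: "J", 9: "S"}


def chinook_hanzi_style(n):
    """Return the sequence of letter names for n written Hanzi-style."""
    if n == 0:
        return ["O"]
    if not 0 < n < 1000:
        raise ValueError("thousands need the enclosing circle, which is markup")
    hundreds, rest = divmod(n, 100)
    tens, ones = divmod(rest, 10)
    if hundreds and ones and not tens:
        raise ValueError("zero tens between hundreds and ones is not described")
    letters = []
    if hundreds:
        # A single hundred still needs its P: 100 is P + Wa.
        letters += [DIGITS[hundreds], "Wa"]
    if tens:
        # A links the tens to a non-zero ones digit; O marks a zero ones digit.
        letters += [DIGITS[tens], "A" if ones else "O"]
    if ones:
        letters.append(DIGITS[ones])
    return letters
```

Under this reading, 23 is spelled T + A + F and 20 is spelled T + O; a real implementation would also map M, N, J, and S to their VS1 numeral forms.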
Romanian numbers. The Romanian number system uses the European/Arabic numerals to indicate numbers 0-99, with marks to indicate further powers of ten: an overdot (U+0307) for hundreds, a preceding Middle Dot (U+00B7) for thousands, a dot below (U+0323) for millions, and a following Middle Dot for thousand millions. As with most systems using marks to indicate magnitude, these marks can be used in conjunction, e.g. a dot above and dot below for hundred millions. Multiplicative forms (with the prefix ân-) use the character A Nasal (U+1BC1B) before a number, percentages use Combining Ring Above (U+030A), and grade uses the degree sign (U+00B0). Ordinals are symbolized by a following T (U+1BC02), while fractions are written numerator over denominator, with no solidus or line. This representation of fractions constitutes a presentation form of already encoded fraction signs or can be explicitly expressed using markup, never with the shorthand format down step (U+1BCF2).
Pernin numbers. The Pernin number system uses the European/Arabic numerals to write numbers, although periods (U+002E) can be used instead of zeros. An underline (by markup) indicates ordinals (first, second...), while an overline (again by markup) indicates the numerical adverbs (once, twice...). The Pernin system suggests, however, that "when large numbers are to be written ... it is better to indicate ... us[ing] a corresponding shorthand contraction for thousand, million, etc.", such contractions left to the individual.
Sloan numbers. The Sloan number system uses the European/Arabic numerals to write numbers, and can be used for ordinals, iteratives, &c., e.g. 2: two, twice, second, secondly. The shorthand aspect in the Sloan system is the use of an overline, strikethrough, and underline (all represented with markup) for magnitude as follows: Overline: hundreds; Strikethrough: thousands; Underline: millions. Again, these can be used in conjunction with each other to indicate, for example, hundred millions with an overline and underline.
French Duployéan numbers. The French Duployéan number system, like the Romanian, uses the European/Arabic numerals with Duployan letters and affixes indicating magnitude and aspect. Magnitude is indicated as follows: Hundreds with an S (U+1BC09) after the number; Thousands with the Duployan Affix High Dot (U+1BC2C) following the number; Millions with the Duployan Affix Low Grave Arc (U+1BC3B) following; and Thousand Millions (Milliards) with a following R (U+1BC15) like a large solidus. As above, these indicators of magnitude can be combined, e.g. an S and high dot indicating hundred thousands. For ordinals, the Duployan Affix Low Dot (U+1BC3C) is used following any indications of magnitude; Adverbs with the Duployan High Acute Arc (U+1BC2A); Approximates (dizaine, douzaine, &c) with the Duployan High Grave Arc (U+1BC2B); Adverbials with the Duployan High Circle (U+1BC2D); Percents with the Duployan Low Circle (U+1BC3D), doubled for Per mill. Manuscripts will indicate the numbers 4 and 6 with an underline to distinguish these number forms from the words "quittance" and "mot" to which the regular number forms show affinity; this distinction should be handled with markup or by typeface choice.
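As an illustration of the French Duployéan magnitude and ordinal marking, the following sketch appends the indicators after a string of European digits. The function and table names are hypothetical, the code points are the values used in this proposal, and the order of combined magnitude indicators is simply taken from the caller, since the text gives only the example of S followed by high dot. Whether any joiner is needed between the digits and the indicators is not stated above and is not modelled.

```python
# Magnitude indicators, using the code points given in this proposal.
MAGNITUDE = {
    "hundred": "\U0001BC09",   # Duployan Letter S
    "thousand": "\U0001BC2C",  # Duployan Affix High Dot
    "million": "\U0001BC3B",   # Duployan Affix Low Grave Arc
    "milliard": "\U0001BC15",  # Duployan Letter R
}
ORDINAL = "\U0001BC3C"         # Duployan Affix Low Dot


def french_duployan_number(digits, magnitudes=(), ordinal=False):
    """Append magnitude indicators, then the ordinal mark, to the digits."""
    out = digits + "".join(MAGNITUDE[m] for m in magnitudes)
    if ordinal:
        # The ordinal mark follows any indications of magnitude.
        out += ORDINAL
    return out
```

For instance, `french_duployan_number("3", ["hundred", "thousand"])` would yield the digit 3 followed by S and the high dot, i.e. three hundred thousand.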
Confusability and usage
Given the complex shaping engine required to render Duployan text, there can be ambiguity as to which character or character sequence should be used to represent a given form. The full names list supplied can be consulted for known ambiguities, but this is not an exhaustive list. For dotted letters vs. diacritics, the determining factor is always whether the dot moves in relation to the letter contextually, as explained in diacritics, above. The dotted consonants should always be used and never decomposed; e.g. HL (U+1BC68) ≠ H (U+1BC00) + L (U+1BC05) and S with dot below (U+1BC49) ≠ S (U+1BC09) + Dot Below (U+0323). Other confusables are in the affixes, and the rule (as given above) is that an affix that is motivated uses the letters, generally unjoined to the word, e.g. Pernin Inter- = In (U+18) + T (U+1BC02) + ZWNJ, Magn- = M (U+1BC06) + ZWNJ, and Multi- = M (U+1BC06) + Continuing Overlap (U+1BCF1). When there is a positional distinction (high vs. low), the affix signs should always be used.
Romanian word signs. For the most part, the extensive list of Romanian word signs is unambiguous. The Duployan Letter Ow (U+1BC0F) should only be used in Romanian text as an overlapping character or as a word sign. In running text, the Ow shape represents the medial form of the Duployan Letter Romanian U (U+1BC1F). In numeric contexts, the Degree Sign (U+00B0) and Combining Ring Above (U+030A) should be used instead of the High Circle Affix (U+1BC2D) for indicating percentages and grade of Romanian numbers. Likewise, the Combining dots (U+0307 & U+0323), Combining Diaereses (U+0308 & U+0324), and Middle Dot (U+00B7) should be used to indicate powers of ten instead of the Dot affixes (U+1BC2C & U+1BC3C) and letter H (U+1BC00).
Proper Names. Most Duployan shorthands prescribe that proper names be marked, as there are no majuscule letters. Universally, they prescribe an underline, which should not be encoded in plain text, but handled through markup.
Stenographic Period
This proposal includes a Stenographic Period character for inclusion in the BMP Supplemental Punctuation block at or after U+2E3C. The Stenographic Period is used with shorthand/stenography systems in place of the normal period. Oftentimes, these systems will make use of a dot for a letter, word, or affix symbol, and the crossed period is used to avoid ambiguity. Due to its script=common attribute, and its unsuitability to any SMP blocks, this punctuation mark should be placed in the BMP.
Parsing of Shorthand Overlap sequences.
Parsing as a tree. Even though the handling of Duployan characters with Shorthand Format Overlap Controls is fairly simple, it is based on a more robust model with a few simple rules analysable as an N-ary tree: 1) each Overlap Control (branch) has as its base (parent node) the most recent character in the text stream; 2) each Overlap Control must take a single shorthand character as its "argument" (child node); 3) the argument of each Overlap Control is allocated by a preorder insertion, where the number of branches of a particular node (character) is defined by the number of consecutive Overlap Controls directly following the character in the text stream; 4) for a Continuing Overlap to be valid, its base (parent node) must be the original base character, or the argument (child node) of another valid Continuing Overlap.
Parsing as a stream. The structure of a shorthand overlap sequence can also be analysed as a simple stream: 1) each Overlap Control has as its base the most recent character in the text stream; 2) each Overlap Control must take a single shorthand character as its argument; 3) after the initial base, each character binds to one Overlap Control - the first unbound Overlap Control in the most recent group of Overlap Controls with an unbound member; 4) for a Continuing Overlap to be valid, its base must be the original base character, or a character bound to another valid Continuing Overlap.
Example. The example given below is for demonstration purposes only, counterfactually presuming that a Duployan character can take three Overlaps or be a third overlapping character. Known textual examples contain just a few overlaps associated with a single parent base character. Each character and overlap control is numbered identically in the text stream, parsing structures, and output image, and is color matched between the parsing structures and output image.
Character 1 is the highest base character. If there are no Continuing overlaps, the next non-overlapping character will cursively connect to this character.
Characters 2, 3, & 4 are Overlap Format Controls, with Character 1 as their base. In this case, Character 2 is a Continuing Overlap, and is the first (leftmost) overlap of Character 1, while 3 and 4 are the middle and rightmost overlaps.
Character 5 is the first character overlapping Character 1. Since Character 2, of which this character is the argument, is a valid Continuing Overlap, the next non-overlapping character will cursively connect to this character, if it is not the base for another Continuing Overlap.
Character 6 is a Continuing Overlap, with Character 5 as its base.
Character 7 is the character overlapping Character 5. Since Character 6 was a valid Continuing Overlap, the next non-overlapping character will cursively connect to this character.
Character 8 is the second character overlapping Character 1. It is the argument for Character 3.
Characters 9 and 10 are Letter Overlaps with Character 8 as their base.
Character 11 is the first character overlapping Character 8, and is the argument of Character 9.
Character 12 is the second character overlapping Character 8, and is the argument for Character 10.
Character 13 is the third character overlapping Character 1, and is the argument of Character 4.
Since there are no remaining unbound Overlap controls, Character 14 is not an overlapping character. Since the cursive connection was passed into the overlaps by the Continuing Overlap Format Controls (Characters 2 and 6), this character cursively connects to Character 7, instead of Character 1, and the rest of the word would continue from it.
Example Text Stream, Parsing Examples, and Sequence Rendering
number 1 2 3 4 5 6 7 8 9 10 11 12 13 14
glyph code point U+1BC22 xF1 xF0 xF0 x14 xF1 x03 x15 xF0 xF0 x03 x02 x01 x14
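The binding order described in this example can be sketched as a small recursive parser. This is only an illustration, not a normative algorithm: it assumes that "xF0" and "xF1" in the text stream abbreviate U+1BCF0 (Letter Overlap) and U+1BCF1 (Continuing Overlap), that the other abbreviated values are code points in the 1BC00 block, and that a base carries at most one Continuing Overlap. The function names `parse_cluster` and `connection_target` are hypothetical.

```python
# Assumed code points, read off the example stream ("xF0", "xF1").
LETTER_OVERLAP = 0x1BCF0
CONTINUING_OVERLAP = 0x1BCF1
OVERLAP_CONTROLS = {LETTER_OVERLAP, CONTINUING_OVERLAP}

# The 14-element example stream, with abbreviations expanded (assumption).
EXAMPLE_STREAM = [
    0x1BC22, 0x1BCF1, 0x1BCF0, 0x1BCF0, 0x1BC14, 0x1BCF1, 0x1BC03,
    0x1BC15, 0x1BCF0, 0x1BCF0, 0x1BC03, 0x1BC02, 0x1BC01, 0x1BC14,
]

def parse_cluster(stream, pos=0):
    """Parse one character and everything overlapping it.

    Returns (node, next_pos). `index` is the 1-based position used in the
    example's numbering. Malformed input (an overlap control where an
    argument is expected) is not diagnosed in this sketch.
    """
    node = {"index": pos + 1, "cp": stream[pos], "overlaps": []}
    pos += 1
    # First collect every overlap control that directly follows the base.
    controls = []
    while pos < len(stream) and stream[pos] in OVERLAP_CONTROLS:
        controls.append((pos + 1, stream[pos]))
        pos += 1
    # Then bind the controls, in order, to the characters that follow.
    # Each argument is parsed recursively, so its own overlaps are consumed
    # before the next control of this base is bound.
    for control_index, control in controls:
        if pos >= len(stream):
            break  # unbound control at end of stream
        child, pos = parse_cluster(stream, pos)
        node["overlaps"].append({
            "control": control_index,
            "continuing": control == CONTINUING_OVERLAP,
            "char": child,
        })
    return node, pos

def connection_target(node):
    """Index of the character the next non-overlapping character joins to."""
    for overlap in node["overlaps"]:
        if overlap["continuing"]:
            return connection_target(overlap["char"])
    return node["index"]

root, next_pos = parse_cluster(EXAMPLE_STREAM)
print([o["char"]["index"] for o in root["overlaps"]])  # characters overlapping 1
print(connection_target(root))  # character that 14 connects to
```

Under these assumptions the parse binds Characters 5, 8, and 13 to Character 1 and passes the cursive connection through Characters 2 and 6 to Character 7, matching the walkthrough above.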
Parsing of Shorthand Steps.
In contrast with overlaps, the Shorthand Step Format controls have a simple grammar: ZWNJ (U+200C), space and NBSP (U+0020 and U+00A0), and all non-shorthand characters return the text stream to a neutral baseline. Spacing characters other than space and NBSP, including ZWSP, ZWJ, and WJ (U+200B, U+200D, and U+2060), and all gc=Mn characters preserve the current baseline (i.e., the height of the cursive stroke) and advance the text stream, if appropriate. Shorthand Step Format controls always act in relation to the current stroke height, whether it is neutral or has been altered by preceding characters.
Given that future shorthands will need to be encoded with varying step heights, and the needs of those shorthands should take precedence, this proposal does not define whether multiple instances of an Up or Down Step are legal. Until such time as a determinative shorthand is encoded, a second (or subsequent) Up or Down Step Format control should be interpreted as raising or lowering the stroke height a second (or subsequent) time.
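The step grammar above can be illustrated with a short baseline tracker. It is a sketch under stated assumptions: the proposal gives only the range U+1BCF0-U+1BCF3 for the four format controls, so the specific values of `DOWN_STEP` and `UP_STEP` here are guesses; `is_shorthand` treats the whole range U+1BC00-U+1BCFF as shorthand characters, which is also an assumption; and each step moves the stroke height by exactly one unit.

```python
import unicodedata

DOWN_STEP = 0x1BCF2  # assumed position within U+1BCF0-U+1BCF3
UP_STEP = 0x1BCF3    # assumed position within U+1BCF0-U+1BCF3

RESET = {0x200C, 0x0020, 0x00A0}     # ZWNJ, space, NBSP
PRESERVE = {0x200B, 0x200D, 0x2060}  # ZWSP, ZWJ, WJ

def is_shorthand(cp):
    """Hypothetical test for 'shorthand character' (assumed range)."""
    return 0x1BC00 <= cp <= 0x1BCFF

def stroke_heights(code_points):
    """Return the stroke height in effect after each code point.

    0 is the neutral baseline; repeated steps accumulate, following the
    interim rule for a second or subsequent Up or Down Step.
    """
    height = 0
    heights = []
    for cp in code_points:
        if cp == UP_STEP:
            height += 1
        elif cp == DOWN_STEP:
            height -= 1
        elif cp in RESET:
            height = 0
        elif (is_shorthand(cp) or cp in PRESERVE
              or unicodedata.category(chr(cp)) == "Mn"):
            pass  # baseline preserved
        else:
            height = 0  # any other non-shorthand character resets
        heights.append(height)
    return heights

sample = [0x1BC01, UP_STEP, 0x1BC02, UP_STEP, 0x200D, 0x0301,
          0x1BC03, 0x0020, 0x1BC04, DOWN_STEP, 0x0041]
print(stroke_heights(sample))
```

In the sample, two Up Steps accumulate, ZWJ and a combining acute (gc=Mn) leave the height unchanged, the space resets to neutral, and the final Latin letter resets again after a Down Step.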
Shorthand Control level of implementation.
Different shorthands will need differing levels of implementation of the Shorthand Format Controls. Support of arbitrarily complex Overlap sequences shall not be required for Unicode conformance; therefore, the block description for each encoded shorthand should include specifications for the width of overlaps (maximum number of overlaps assignable to a single character), the depth (how many overlaps can "stack" on each other), and the breadth (how many overlaps are assignable to an already overlapping character). If the depth is only 1, then the breadth is, by default, 0. For example, the Duployan Shorthands and Chinook, as a whole, have a width of 2 (on the medium line consonants in Romanian), a depth of 2 (Chinook abbreviations), and a breadth of 1. Implementations for specific orthographies are: French Duployan: 1x1; Romanian: 2x1; Chinook: 1x2(1); Sloan: 1x1; Pernin: 1x1; Perrault: 0.
Individual characters may or may not be used with certain format controls. This information is contained in a new UCD data file, ShorthandFormat.txt, attached.
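The width, depth, and breadth limits described in this section could be checked mechanically once an overlap sequence has been parsed. The sketch below is illustrative only. It assumes the notation "WxD(B)" means width W, depth D, breadth B, with breadth 0 when omitted and "Perrault: 0" meaning no overlaps at all; the nested-list representation and the names `measure`, `conforms`, and `PROFILES` are hypothetical and not part of the proposal.

```python
# (width, depth, breadth) limits, read from the list of orthographies
# above under the assumed "WxD(B)" interpretation.
PROFILES = {
    "French Duployan": (1, 1, 0),
    "Romanian": (2, 1, 0),
    "Chinook": (1, 2, 1),
    "Sloan": (1, 1, 0),
    "Pernin": (1, 1, 0),
    "Perrault": (0, 0, 0),
}

def measure(overlaps):
    """Return (width, depth, breadth) for one non-overlapping base character.

    `overlaps` is a nested list: each element stands for one overlapping
    character and is itself the list of characters overlapping it.
    """
    def depth_of(node):
        # Number of overlap levels stacked below this character.
        return 1 + max(depth_of(child) for child in node) if node else 0

    def breadth_of(node):
        # Largest number of overlaps carried by any overlapping character.
        return max([len(child) for child in node] +
                   [breadth_of(child) for child in node], default=0)

    return len(overlaps), depth_of(overlaps), breadth_of(overlaps)

def conforms(overlaps, profile):
    """True if the measured cluster stays within the profile's limits."""
    return all(m <= limit
               for m, limit in zip(measure(overlaps), PROFILES[profile]))

# One overlap that itself carries one overlap (a stacked abbreviation).
stacked = [[[]]]
print(measure(stacked), conforms(stacked, "Chinook"),
      conforms(stacked, "French Duployan"))
```

A cluster with one overlap stacked on another measures as width 1, depth 2, breadth 1, which fits the Chinook limits but exceeds the French Duployan ones.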
References
Archives of the Kamloops Wawa 1891-1900 (subscription required), Fr. J.M.R. LeJeune, 1891-1923, Kamloops, BC
Dictionary of the Chinook Jargon, by George Gibbs, Echo Library, ISBN 1-40680-924-1
Chinook:.... A History and Dictionary, by Edward Harper Thomas, 1935, Metropolitan Press, Portland, OR
Cours de Sténographie Duployé Fondamentale, by A. Hautefeuille and C. Ramaude
Pernin's Practical Reporter, compiled and published by H. M. Pernin, 1882, O. S. Gulley Printing House, Detroit, MI
Pernin's Universal Phonography, 16th ed, by H. M. Pernin, 1902, Detroit, MI
Curs De Stenografie, published by Margareta Sfinţescu in Enciclopedia practică a copiilor, Editura Ion Creangă, 1984
Stenographie Integrale, http://www.stenographie.ch/stenographie_integrale.pdf
Modern Shorthand. the Sloan-Duployan Phonographic Instructor, 11th ed, by J. M. Sloan; 1st ed. 1882; Ramsgate, England; St. John's, NL; Brisbane, QLD
Modern Shorthand: the Sloan-Duployan system. Reporters' Rules, by John Mathew Sloan, 1892, London.
The Wawa Shorthand Instructor, first edition, by the Editor of the Kamloops "Wawa", 1896, Kamloops, BC
Perrault-Duployan Complete Elementary Course of Stenography in Six Lessons, Sixth Edition, by Denis R. Perrault, 1918, Montreal.
nouveau site duployé, http://cf.geocities.com/barouder396/
Documentation
Example 1: Basic Inventory of Chinook letters. Page 5 of Chinook Rudiments from the Kamloops Wawa. Circled are Duployan letters A, O, Ou, Ow, Wa, U, Nasals I/U/O/A; H, P, T, F, K, L, M, N, J, S; B, D, V, G, R, J/S/N with dot inside; Wo, Wow, We, Weyi; HL, LH, RH, X, and TH.
Example 2: Complex French consonants. Page 55, Cours de Sténographie Duployé Fondamentale. Circled are Duployan letters KM, PN, FN, DS, RS; MS, NS, JS, SS; MN, NM, JM, SJ; MNS, NMS, JMS, SJS; JN, and JNS.
Example 3: French Affixes, page 83, ibid. Circled are the Attached affixes Tail, E Hook, I Hook, Tangent, and Secant; High and Low affixes Acute, Grave, Dot, Circle, Wave, and Line.
Example 4: Wi/Weyi distinction, from the Kamloops Wawa. Circled are We and Weyi, and the Chinook Full Stop.
Example 5: Wow in use, ibid. Circled are Wow and Chinook Full Stop. Notice the overlapped T + K (God) and S + S (Holy Spirit) at the bottom.
Example 6: DH digraph, ibid. Circled is Duployan letter DH, with Latin English transliteration.
Example 7: Chinook Numbers, ibid.
Example 8: Primary/Secondary orientation. Page 8, Stenographie Integrale.
Example 9: I/E and U/Eu. Page 1, ibid.
Example 10: K+W and G+W nominal differences vs. actual implementation (Perrault above Pernin).
Example 11: Romanian arc consonant word signs. Page 18 (241), Curs de Stenografie. Circled are Duployan Signs J with dots inside and above, and M with Dot.
Example 12: Pernin Affixes. Page 26, Pernin's Practical Reporter. Examples of both Secant affixes.
Example 13: Pernin Affixes, page 29, ibid. Horizontal and Vertical Secant affixes.
Example 14: Pernin Affixes. Page 74, Pernin's Universal Phonography.
Example 15: Pernin Affixes. Page 75, Pernin's Universal Phonography. Pro- and Sub- signs contrast Affix High Acute and Affix High Right Acute.
Example 16: Pernin Suffixes. Page 82, ibid. Circled are Vertical Attached Affixes Up and Down.
Example 17: Pernin Prefix chart (note double prefix "precon-"). Page 32, Pernin's Practical Reporter. Circled are the Horizontal and Vertical Secant affixes.
Example 18: Circle vowels and Pernin "R" reverse circle vowels. Pp 19 & 23, Pernin's Universal Phonography.
Example 19: Pernin Vowels. Page 16, ibid. Circled are Duployan letters OA, Long U, IE, EE, UI, Short I; Pernin An, and Pernin Am.
Example 20: Perrault Consonants. Page 13, Perrault-Duployan ....
Example 21: Perrault combined consonants, circle and orienting vowels, pp 14 & 15, ibid. Circled are Duployan Letters TS, TRS, ST, STR, SP, SPR, WR, KRS, GRS, SK, SKR, SN, SM.
Example 22: Perrault nasal vowels, pp 16 & 17, ibid. Circled are Duployan Letters XW and Vocalic M.
Example 23: Romanian Affixes. Page 14 (232, 233), Curs de Stenografie.
Example 24: Page 15 (234, 235), ibid.
Example 25: Page 16 (236, 237), ibid.
Example 26: Unique Romanian arc consonants. Page 5 (212), ibid. Circled is Duployan Letter S with Dot Below.
Example 27: Romanian I. Page 11 (226), ibid. Circled is Duployan Letter Romanian I.
Example 28: Romanian Numbers. Page 13 (230, 231), ibid.
Example 29: Romanian U. Page 7 (217), ibid. Circled are Duployan Letter Romanian U, final and medial forms.
Example 30: Romanian overlaps, double overlaps, etc. Page 19 (242), ibid.
Example 31: Double mark. Page 17 (238), ibid.
Example 32: Romanian U & Ow in overlaps. Pp 19 (242) & 20 (244), ibid.
Example 33: Sloan Letters. Pages 6 & 7, Sloan-Duployan Phonographic Instructor. Circled are Duployan Letters Uh, Ooh, Sloan Eh, Sloan Ee; Sloan U, and Sloan Ow.
Example 34: Sloan combined consonants + combining R (note TRS & DRS). Page 8, ibid.
Example 35: Sloan Affixes. Page 16, Sloan-Duployan, Reporters' Rules.
Example 36: Page 17, ibid.
Example 37: Sloan Combined consonants. Page 5, ibid.
Example 38: Sloan Numbers. Page 8, ibid.
Example 39: The Sloan R rule. Page 12, ibid.
Example 40: Examples of the Stenographic Period from French Duployéan, Romanian, Sloan-Duployan, Pernin's Universal, and Pernin's Reporters' shorthands.
Example 41: The Sloan "vowel rule", showing the shorthand up/down control at word breaks.
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Form number: N3702-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11)
A. Administrative
1. Title: Proposal to include Duployan Shorthands and Chinook script in Unicode / ISO-10646.
2. Requester's name: Van Anderson [email protected]
3. Requester type (Member body/Liaison/Individual contribution): Individual contribution
4. Submission date: 2010-04-12
5. Requester's reference (if applicable):
6. Choose one of the following:
This is a complete proposal: X
(or) More information will be provided later:
B. Technical - General
1. Choose one of the following:
a. This proposal is for a new script (set of characters): Yes
Proposed name of script: 1) Duployan Shorthands and Chinook 2) Shorthand Format Controls
b. The proposal is for addition of character(s) to an existing block: Yes
Name of the existing block: Supplemental Punctuation
2. Number of characters in proposal: 148 (1 in Supplemental Punctuation, 4 in Shorthand Format Controls, 143 in Duployan Shorthands and Chinook)
3. Proposed category (select one from below - see section 2.2 of P&P document):
A-Contemporary
B.1-Specialized (small collection): X
B.2-Specialized (large collection)
C-Major extinct
D-Attested extinct
E-Minor extinct
F-Archaic Hieroglyphic or Ideographic
G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
a. If YES, are the names in accordance with the "character naming guidelines"? Yes
b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Van Anderson [email protected]
b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Van Anderson https://boil.afraid.org/Chinook/DuployanProp.ttf
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes, for some of repertoire
7. Special encoding issue:
Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
Information on presentation and collation is included in this document, above. Standard transliteration is superfluous due to the existence of Latin orthographies for all known languages using Duployan.
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
If YES explain
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
If YES, available relevant documents: Online forums: Forum du petit sténographe (http://forumsteno.vosforums.com/), Chinook Language List (http://listserv.linguistlist.org/archives/chinook.html)
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
Reference: Script will be used primarily by a small community of hobbyists and linguistic/historical scholars, with expected minor utility to legal and government historians, due to extensive usage of Duployan shorthands in Canada and France, and the historical use of shorthands to record legal and legislative proceedings.
4. The context of use for the proposed characters (type of use; common or rare): rare
Reference:
5. Are the proposed characters in current use by the user community? Yes
If YES, where? Reference: Still in use by small hobbyist community, mostly in France. Scholarly and historical/cultural preservation use.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No.
If YES, is a rationale provided?
If Yes, reference: Except for one character in Supplemental Punctuation, characters should be allocated in SMP (Plane 1) as per Roadmap.
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
If YES, is a rationale for its inclusion provided?
If Yes, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
If YES, is a rationale for its inclusion provided?
If Yes, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? Yes.
If YES, is a rationale for its inclusion provided? Yes
If Yes, reference: Any similarities in appearance are coincidental or a motivated adaptation of letter shapes to Duployan.
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
If YES, is a rationale for such use provided? Yes
If Yes, reference: Several orthographies use optional combining accents to distinguish similar vowel sounds. Further justification is contained in document, above.
Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? No.
If Yes, reference: Examples of several composite sequences are provided, and all other sequences can be trivially derived from those given.
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
If YES, describe in detail (include attachment if necessary)
The 4 Shorthand Format Control characters (U+1BCF0-U+1BCF3) and Duployan thick letter selector (U+1BC7F) are discussed above. See tables 5 & 6 for examples and preceding text for description. Parsing and syntax information for Shorthand Format Sequences is on page 10.
13. Does the proposal contain any Ideographic compatibility character(s)? No
If YES, is the equivalent corresponding unified ideographic character(s) identified?
If Yes, reference:
Printed using UniBook™(http://www.unicode.org/unibook/)
[Code chart: Supplemental Punctuation, 2E00-2E7F. Positions shown: 2E00-2E31 and 2E3C.]
Supplemental Punctuation (2E00-2E3C)
New Testament editorial symbols
2E00 ⸀ RIGHT ANGLE SUBSTITUTION MARKER
    → ⌜ top left corner
2E01 ⸁ RIGHT ANGLE DOTTED SUBSTITUTION MARKER
2E02 ⸂ LEFT SUBSTITUTION BRACKET
2E03 ⸃ RIGHT SUBSTITUTION BRACKET
2E04 ⸄ LEFT DOTTED SUBSTITUTION BRACKET
2E05 ⸅ RIGHT DOTTED SUBSTITUTION BRACKET
2E06 ⸆ RAISED INTERPOLATION MARKER
    → ⊤ down tack
2E07 ⸇ RAISED DOTTED INTERPOLATION MARKER
2E08 ⸈ DOTTED TRANSPOSITION MARKER
2E09 ⸉ LEFT TRANSPOSITION BRACKET
2E0A ⸊ RIGHT TRANSPOSITION BRACKET
2E0B ⸋ RAISED SQUARE
    • used as an opening raised omission bracket
2E0C ⸌ LEFT RAISED OMISSION BRACKET
    • used as an opening or closing raised omission bracket
2E0D ⸍ RIGHT RAISED OMISSION BRACKET
    • used as a closing or opening raised omission bracket
Ancient Greek textual symbols
2E0E ⸎ EDITORIAL CORONIS
    → ᾽ greek koronis
2E0F ⸏ PARAGRAPHOS
2E10 ⸐ FORKED PARAGRAPHOS
2E11 ⸑ REVERSED FORKED PARAGRAPHOS
2E12 ⸒ HYPODIASTOLE
2E13 ⸓ DOTTED OBELOS
    • glyph variants may look like ‘÷’ or ‘∸’
    → ⁒ commercial minus sign
2E14 ⸔ DOWNWARDS ANCORA
    • contrary to its formal name this symbol points upwards
2E15 ⸕ UPWARDS ANCORA
    • contrary to its formal name this symbol points downwards
2E16 ⸖ DOTTED RIGHT-POINTING ANGLE
    = diple periestigmene
Ancient Near-Eastern linguistic symbol
2E17 ⸗ DOUBLE OBLIQUE HYPHEN
    • used in ancient Near-Eastern linguistics
    • hyphen in Fraktur text uses - or ‐, but with a ‘⸗’ glyph in Fraktur fonts
    → - hyphen-minus
    → = equals sign
    → ‐ hyphen
General punctuation
2E18 ⸘ INVERTED INTERROBANG
    = gnaborretni
    → ‽ interrobang
2E19 ⸙ PALM BRANCH
    • used as a separator
Dictionary punctuation
These punctuation marks are used mostly in German dictionaries, to indicate umlaut or case changes with abbreviated stems.
2E1A ⸚ HYPHEN WITH DIAERESIS
    • indicates umlaut of the stem vowel of a plural form
2E1B ⸛ TILDE WITH RING ABOVE
    • indicates change in case for derived form
Brackets
2E1C ⸜ LEFT LOW PARAPHRASE BRACKET
2E1D ⸝ RIGHT LOW PARAPHRASE BRACKET
    • used in N’Ko
Dictionary punctuation
2E1E ⸞ TILDE WITH DOT ABOVE
    • indicates derived form changes to uppercase
2E1F ⸟ TILDE WITH DOT BELOW
    • indicates derived form changes to lowercase
Brackets
2E20 ⸠ LEFT VERTICAL BAR WITH QUILL
2E21 ⸡ RIGHT VERTICAL BAR WITH QUILL
Half brackets
These form a set of four corner brackets and are used editorially. They are distinguished from mathematical floor and ceiling characters. Occasionally quine corners are substituted for half brackets.
2E22 ⸢ TOP LEFT HALF BRACKET
    → ⌈ left ceiling
    → ⌜ top left corner
    → 「 left corner bracket
2E23 ⸣ TOP RIGHT HALF BRACKET
2E24 ⸤ BOTTOM LEFT HALF BRACKET
2E25 ⸥ BOTTOM RIGHT HALF BRACKET
Brackets
2E26 ⸦ LEFT SIDEWAYS U BRACKET
    → ⊂ subset of
2E27 ⸧ RIGHT SIDEWAYS U BRACKET
    → ⊃ superset of
2E28 ⸨ LEFT DOUBLE PARENTHESIS
    → ⦅ left white parenthesis
    → ｟ fullwidth left white parenthesis
2E29 ⸩ RIGHT DOUBLE PARENTHESIS
Historic punctuation
2E2A ⸪ TWO DOTS OVER ONE DOT PUNCTUATION
2E2B ⸫ ONE DOT OVER TWO DOTS PUNCTUATION
2E2C ⸬ SQUARED FOUR DOT PUNCTUATION
2E2D ⸭ FIVE DOT MARK
2E2E ⸮ REVERSED QUESTION MARK
    = punctus percontativus
    → ? question mark
    → ¿ inverted question mark
    → ؟ arabic question mark
2E2F ⸯ VERTICAL TILDE
    • used for Cyrillic yerik
    → ̾ combining vertical tilde
    → ꙿ cyrillic payerok
2E30 ⸰ RING POINT
    • used in Avestan
    → ∘ ring operator
    → ◦ white bullet
2E31 ⸱ WORD SEPARATOR MIDDLE DOT
    • used in Avestan, Samaritan, ...
    → · middle dot
Alternate Punctuation
2E3C ⸼ STENOGRAPHIC PERIOD
    • used in shorthands and stenographies
    → . period
[Code chart: Duployan Shorthands and Chinook, 1BC00-1BC9F. Positions shown: 1BC00-1BC16, 1BC18-1BC44, 1BC46-1BC52, 1BC55-1BC5D, 1BC5F-1BC63, 1BC66-1BC6A, 1BC6E-1BC78, 1BC7A-1BC85, 1BC88-1BC95, 1BC9A-1BC9F.]
Duployan Shorthands and Chinook (1BC00-1BC19)
Basic Duployan
1BC00 𛰀 DUPLOYAN LETTER H
    • Chinook, Pernin, Sloan, Perrault
    • non-joining character
1BC01 𛰁 DUPLOYAN LETTER P
    • Chinook number 1
1BC02 𛰂 DUPLOYAN LETTER T
    • Chinook number 2
1BC03 𛰃 DUPLOYAN LETTER F
    • Chinook number 3
1BC04 𛰄 DUPLOYAN LETTER K
    • Chinook number 4
    • written down and to the left
1BC05 𛰅 DUPLOYAN LETTER L
    • written up and to the right
    = Pernin letter R
1BC06 𛰆 DUPLOYAN LETTER M
    • Chinook number 6
1BC07 𛰇 DUPLOYAN LETTER N
    • Chinook number 7
1BC08 𛰈 DUPLOYAN LETTER J
    • Chinook number 8
    = Chinook letter SH
    = Pernin letter SH
1BC09 𛰉 DUPLOYAN LETTER S
    • Chinook number 9
    • French Hundreds
1BC0A 𛰊 DUPLOYAN LETTER O
    • Chinook number 0
1BC0B 𛰋 DUPLOYAN LETTER A
    • Chinook number 10s
1BC0C 𛰌 DUPLOYAN LETTER I
    • character rotates to match entry angle of preceding consonant
    • character has primary orientation (right and up)
    = Perrault letter long A, short E (with accents)
    → 𛰜 duployan letter e
    → 𛱃 duployan affix attached e hook
    → 𛱄 duployan affix attached i hook
    → 𛱊 duployan letter short i
    → 𛱋 duployan letter ee
    → 𛱌 duployan letter ie
    → 𛱍 duployan letter ui
1BC0D 𛰍 DUPLOYAN LETTER U
    • character rotates to match entry angle of preceding consonant
    • character has primary orientation (right and up)
    = Romanian stenographic letter EN
    → 𛰝 duployan letter eu
    → 𛰟 duployan letter romanian u
    → 𛱠 duployan letter xw
    → 𛱰 duployan letter w
    → 𛱱 duployan letter long u
1BC0E 𛰎 DUPLOYAN LETTER OU
    • should not be used for Perrault Ow
    ≈ <initial, final> 𛰏
    = Chinook letter OO
    → 𛰏 duployan letter ow
    → 𛰟 duployan letter romanian u
    → 𛱲 duployan letter uh
    → 𛱳 duployan letter ooh
    → 𛱴 duployan letter sloan u
1BC0F 𛰏 DUPLOYAN LETTER OW
    • should not be used for Romanian U
    ≈ <medial> 𛰟
    → 𛰎 duployan letter ou
1BC10 𛰐 DUPLOYAN LETTER X
    • Salishan
    • non-joining character
1BC11 𛰑 DUPLOYAN LETTER B
    → 𛲀 duployan affix low vertical secant
    → 𛲁 duployan affix mid vertical secant
    → 𛲂 duployan affix high vertical secant
1BC12 𛰒 DUPLOYAN LETTER D
    → 𛲐 duployan affix left horizontal secant
    → 𛲑 duployan affix mid horizontal secant
    → 𛲒 duployan affix right horizontal secant
1BC13 𛰓 DUPLOYAN LETTER V
1BC14 𛰔 DUPLOYAN LETTER G
    • written down and to the left
1BC15 𛰕 DUPLOYAN LETTER R
    • Chinook number 5
    • French number milliards
    • written up and to the right
    = Pernin letter L
    = Pernin Reporters word repeat sign
1BC16 𛰖 DUPLOYAN LETTER VOCALIC M
    • primary orienting vowel
    = Perrault letters Am, Em, Im, Um (with accents)
    → 𛰚 duployan letter nasal o
    → 𛱻 duployan letter pernin am
1BC17 <reserved>
1BC18 𛰘 DUPLOYAN LETTER NASAL I
    • character positions diacritically, as an orienting vowel, or as an invariant vowel
    • primary orientation
    • invariant direction down
    • Romanian multiplicative number prefix
    = Pernin letter IM
    = Consolidated Duployan affix INT-R-
    → 𛰖 duployan letter vocalic m
    → 𛰙 duployan letter nasal u
    → 𛰚 duployan letter nasal o
    → 𛰛 duployan letter nasal a
1BC19 𛰙 DUPLOYAN LETTER NASAL U
    • character positions diacritically, as an orienting vowel, or as an invariant vowel
    • secondary orientation
    • invariant direction down
    • French number 1
    = Pernin letter IN
    = Consolidated Duployan affix INT-R-
    → 𛰖 duployan letter vocalic m
    → 𛰘 duployan letter nasal i
    → 𛰚 duployan letter nasal o
    → 𛰛 duployan letter nasal a
Duployan Shorthands and Chinook (1BC1A-1BC2D)
1BC1A 𛰚 DUPLOYAN LETTER NASAL O
    • character positions diacritically, as an orienting vowel, or as an invariant vowel
    • neutral nasal vowel for transcription of an ambiguous secondary orienting nasal vowel
    • secondary orientation
    • invariant direction up
    = Pernin letter OM
    = Perrault letters An, En, In, Un (with accents)
    = Pernin letter IM
    = Consolidated Duployan affix INT-R-
    → 𛰘 duployan letter nasal i
    → 𛰙 duployan letter nasal u
    → 𛰛 duployan letter nasal a
1BC1B 𛰛 DUPLOYAN LETTER NASAL A
    • Perrault vocalic N - An, En, In, Un (with accents)
    • character positions diacritically, as an orienting vowel, or as an invariant vowel
    • neutral nasal vowel for transcription of an ambiguous primary orienting nasal vowel
    • primary orientation
    • invariant direction up
    = Pernin letter ON
    = Romanian stenographic letter YN
    → 𛰖 duployan letter vocalic m
    → 𛰘 duployan letter nasal i
    → 𛰙 duployan letter nasal u
    → 𛰚 duployan letter nasal o
    → 𛱺 duployan letter pernin an
    → 𛱻 duployan letter pernin am
    → 𛱼 duployan letter sloan an
    → duployan letter sloan en
    → duployan letter sloan on
1BC1C 𛰜 DUPLOYAN LETTER E
    • character rotates to match entry angle of preceding consonant
    • character has secondary orientation (left and down)
    = Sloan letter long A
    = Perrault letter short I, long E (with dot accent)
    → 𛰌 duployan letter i
    → 𛱃 duployan affix attached e hook
1BC1D 𛰝 DUPLOYAN LETTER EU
    • character rotates to match entry angle of preceding consonant
    • character has secondary orientation (left and down)
    • in French usage, may be rendered with a dot contextually
    = Romanian stenographic letter AN
    → 𛰍 duployan letter u
1BC1E 𛰞 DUPLOYAN LETTER ROMANIAN I
    • character rotates to match entry angle of preceding consonant, with dot maintaining relative position
    • secondary orienting (left and down)
    → 𛰜 duployan letter e
1BC1F 𛰟 DUPLOYAN LETTER ROMANIAN U
    → 𛰏 duployan letter ow
Duployéan compound letters
1BC20 𛰠 DUPLOYAN LETTER U N
    → 𛰍 duployan letter u
    → 𛰇 duployan letter n
1BC21 𛰡 DUPLOYAN LETTER P N
    = Sloan B B
    → 𛰁 duployan letter p
1BC22 𛰢 DUPLOYAN LETTER D S
    = Sloan D D
    → 𛰂 duployan letter t
1BC23 𛰣 DUPLOYAN LETTER F N
    = Sloan V V
    → 𛰃 duployan letter f
1BC24 𛰤 DUPLOYAN LETTER K M
    • written down and to the left
    = Sloan G G
    → 𛰄 duployan letter k
1BC25 𛰥 DUPLOYAN LETTER R S
    • written up and to the right
    = Sloan R R
    → 𛰅 duployan letter l
1BC26 𛰦 DUPLOYAN LETTER M S
    = Sloan shorthand letter M M
    → 𛰆 duployan letter m
1BC27 𛰧 DUPLOYAN LETTER N S
    = Pernin, Sloan, Perrault letter NG
    → 𛰇 duployan letter n
1BC28 𛰨 DUPLOYAN LETTER J S
    = Romanian stenographic letter Ge
    = Pernin, Perrault letter ZH
    = Sloan letter CH
    → 𛰈 duployan letter j
1BC29 𛰩 DUPLOYAN LETTER S S
    • French, Sloan
    = Romanian stenographic letter Ts
    = Pernin, Perrault letter Z
    → 𛰉 duployan letter s
Basic High affixes
1BC2A 𛰪 DUPLOYAN AFFIX HIGH ACUTE
    = French suffix -ment
    = Romanian suffix -mant
    = Pernin Sub-
    = Pernin Reporters' suffix Pro-
    → 𛲅 duployan affix tight high acute
    → ˊ modifier letter acute accent
1BC2B 𛰫 DUPLOYAN AFFIX HIGH GRAVE
    = French suffix -ien
    = Pernin suffix Con-
    → ˋ modifier letter grave accent
1BC2C 𛰬 DUPLOYAN AFFIX HIGH DOT
    • not Romanian hundreds - use U+0307 Combining Dot Above and U+0308 Combining Diaeresis
    • French number thousands
    = French suffix -eur
    = Romanian shorthand affix trans-/-lui
    → ˙ dot above
1BC2D 𛰭 DUPLOYAN AFFIX HIGH CIRCLE
    • not Romanian number grade or percent suffix
    • French ordinal number
    = French suffix -euse
    → ° degree sign
    → ˚ ring above
Printed using UniBook™(http://www.unicode.org/unibook/)
6
1BC4BDuployan Shorthands and Chinook1BC2E
Attached affixes1BC40 𛱀 DUPLOYAN AFFIX ATTACHED SECANT
• dots show position on and relative orientation to base glyph and are not rendered
• as a prefix, takes opposite relative position to following glyph
• generally crosses adjacent character at perpendicular, but has a bias towards SW/NE angle to contrast 𛱒
• default neutral secant affix= French suffix -anse= Pernin prefix Pre-= Sloan affix Ax-/-ext→ 𛱒 duployan affix attached ltr secant
1BC41 𛱁 DUPLOYAN AFFIX ATTACHED TANGENT• dots show position on and relative orientation
to base glyph and are not rendered• as a prefix, takes opposite relative position to
following glyph= French suffix -tan= Romanian shorthand letter Str-/-str
1BC42 𛱂 DUPLOYAN AFFIX ATTACHED TAIL• orienting character= French suffix -sionaire
1BC43 𛱃 DUPLOYAN AFFIX ATTACHED E HOOK• glyph is retrograde and opens up or down,
dependent on preceding letter• dots show position of preceding glyph and are
not rendered→ 𛰜 duployan letter e= French suffix -te
1BC44 𛱄 DUPLOYAN AFFIX ATTACHED I HOOK• glyph is retrograde and opens left or right,
dependent on preceding letter• dots show position of preceding glyph and are
not rendered→ 𛰌 duployan letter i= French suffix -tou= Sloan affix Irre-/-ary
1BC45 <reserved>
Variant letters1BC46 𛱆 DUPLOYAN LETTER AOU1BC47 𛱇 DUPLOYAN LETTER OA
= Pernin letter AW= Perrault letter AW
1BC48 𛱈 DUPLOYAN LETTER J S WITH DOT= Sloan letter hard CH= Pernin, Perrault letter Ch→ 𛰈 duployan letter j
1BC49 𛱉 DUPLOYAN LETTER S WITH DOT BELOW= Romanian Sh→ 𛰉 duployan letter s
1BC4A 𛱊 DUPLOYAN LETTER SHORT I• Pernin, Duployan shorthand• used as an invariant vowel and for orienting
word abbreviations consisting of only vowels→ 𛰌 duployan letter i= Consolidated Duployan letter R T R
1BC4B 𛱋 DUPLOYAN LETTER EE• Pernin, Duployan shorthand• used as an invariant vowel and for orienting
word abbreviations consisting of only vowels→ 𛰌 duployan letter i
1BC2E 𛰮 DUPLOYAN AFFIX HIGH LINE= French suffix -iste= Romanian shorthand affix -tor= Pernin affix Dis-→ ¯ modifier letter macron
1BC2F 𛰯 DUPLOYAN AFFIX HIGH WAVE= French suffix -ificatif→ ˜ small tilde
Duployéan compound letters1BC30 𛰰 DUPLOYAN LETTER J N
→ 𛰈 duployan letter j→ 𛰇 duployan letter n
1BC31 𛰱 DUPLOYAN LETTER J N S→ 𛰈 duployan letter j→ 𛰇 duployan letter n
1BC32 𛰲 DUPLOYAN LETTER M N• Romanian mai mult, not mult mai sign→ 𛰆 duployan letter m
1BC33 𛰳 DUPLOYAN LETTER N M• not Romanian nu nu shorthand sign→ 𛰇 duployan letter n
1BC34 𛰴 DUPLOYAN LETTER J M• not Romanian ceea ce shorthand sign→ 𛰈 duployan letter j
1BC35 𛰵 DUPLOYAN LETTER S J• not Romanian sa se shorthand sign→ 𛰉 duployan letter s
1BC36 𛰶 DUPLOYAN LETTER M N S→ 𛰆 duployan letter m
1BC37 𛰷 DUPLOYAN LETTER N M S→ 𛰇 duployan letter n
1BC38 𛰸 DUPLOYAN LETTER J M S→ 𛰈 duployan letter j
1BC39 𛰹 DUPLOYAN LETTER S J S→ 𛰉 duployan letter s
Basic Low affixes1BC3A 𛰺 DUPLOYAN AFFIX LOW ACUTE
= French suffix -cion= Pernin prefix ex-→ 𛲕 duployan affix tight low acute→ ˏ modifier letter low acute accent
1BC3B 𛰻 DUPLOYAN AFFIX LOW GRAVE= French suffix -ion• French number millions→ ˎ modifier letter low grave accent
1BC3C 𛰼 DUPLOYAN AFFIX LOW DOT= French suffix -ie• French iterative number= Romanian shorthand affix Inter-• not Romanian millions - see U+ ̣
Combining Dot Below and U+ ̤ Combining Diaeresis Below
1BC3D 𛰽 DUPLOYAN AFFIX LOW CIRCLE= French suffix -iere• French percent→ ˳ modifier letter low ring
1BC3E 𛰾 DUPLOYAN AFFIX LOW LINE= French suffix -isme= Pernin affix Mis-→ ˗ modifier letter minus sign
1BC3F 𛰿 DUPLOYAN AFFIX LOW WAVE= French suffix -ification→ ˷ modifier letter low tilde
1BC4C Duployan Shorthands and Chinook 1BC6D
1BC59 𛱙 DUPLOYAN LETTER S WITH DOT= Chinook TS= Chinook, Romanian, Sloan Z→ 𛰉 duployan letter s
Compound W vowels1BC5A 𛱚 DUPLOYAN LETTER WO
• Chinook→ 𛰊 duployan letter o
1BC5B 𛱛 DUPLOYAN LETTER WA• Chinook• Not Romanian O+A= Perrault letter OY• Chinook number 100s→ 𛰋 duployan letter a
1BC5C 𛱜 DUPLOYAN LETTER WI• Chinook→ 𛰌 duployan letter i
1BC5D 𛱝 DUPLOYAN LETTER WEI• Salishan
1BC5E <reserved>1BC5F 𛱟 DUPLOYAN LETTER WOW
• Salishan→ 𛰏 duployan letter ow
1BC60 𛱠 DUPLOYAN LETTER XW= Perrault Uh• not French Eu→ 𛰍 duployan letter u→ 𛰝 duployan letter eu
Dotted line consonants1BC61 𛱡 DUPLOYAN LETTER TH
• Chinook, Sloan, Pernin, Perrault→ 𛰂 duployan letter t
1BC62 𛱢 DUPLOYAN LETTER DH• Chinook→ 𛰒 duployan letter d
1BC63 𛱣 DUPLOYAN LETTER SLOAN DH→ 𛰂 duployan letter t
1BC64 <reserved>1BC65 <reserved>1BC66 𛱦 DUPLOYAN LETTER SLOAN J
→ 𛰔 duployan letter g1BC67 𛱧 DUPLOYAN LETTER KK
• Chinook• written down and to the left→ 𛰄 duployan letter k
1BC68 𛱨 DUPLOYAN LETTER HL• Chinook• written up and to the right→ 𛰅 duployan letter l
1BC69 𛱩 DUPLOYAN LETTER LH• Chinook• written up and to the right→ 𛰅 duployan letter l
1BC6A 𛱪 DUPLOYAN LETTER RH• Chinook• written up and to the right→ 𛰕 duployan letter r
1BC6B <reserved>1BC6C <reserved>1BC6D <reserved>
1BC4C 𛱌 DUPLOYAN LETTER IE• Duployan shorthand• used as an invariant vowel and for orienting
word abbreviations consisting of only vowels→ 𛰌 duployan letter i= Pernin letter A
1BC4D 𛱍 DUPLOYAN LETTER UI• Duployan shorthand• used as an invariant vowel and for orienting
word abbreviations consisting of only vowels→ 𛰌 duployan letter i= Pernin letter E
1BC4E 𛱎 DUPLOYAN LETTER YE
Shorthand double mark1BC4F 𛱏 DUPLOYAN DOUBLE MARK
• Dots show position on and relative orientation to base glyph and are not rendered
• Romanian, Sloan shorthands• Should be used with M, N, J, and S for the
Romanian word signs Mai mult, Nu nu, Ceea ce, and Sa se
• Can be doubled and tripled
Other affixes1BC50 𛱐 DUPLOYAN AFFIX LOW ARROW
= Romanian prefix Sub-• low affix
1BC51 𛱑 DUPLOYAN AFFIX ATTACHED TANGENT HOOK• attached affix• dots show position on and relative orientation
to base glyph and are not rendered= Romanian affix Ist-/-ism= Consolidated Duployan prefix T-R-
1BC52 𛱒 DUPLOYAN AFFIX ATTACHED LEFT-TO-RIGHT SECANT• dots show position on and relative orientation
to base glyph and are not rendered• generally crosses adjacent character at
perpendicular, but has a bias towards NW/SE angle to contrast 𛱀
• as a suffix, takes opposite relative position to following glyph
= Pernin prefix Per-→ 𛱀 duployan affix attached secant
1BC53 <reserved>1BC54 <reserved>
Dotted arc consonants1BC55 𛱕 DUPLOYAN SIGN J WITH DOTS INSIDE AND ABOVE
= Romanian sign Ici→ 𛰈 duployan letter j
1BC56 𛱖 DUPLOYAN SIGN M WITH DOT= Romanian sign Mijloc→ 𛰆 duployan letter m
1BC57 𛱗 DUPLOYAN LETTER N WITH DOT= Chinook NG= Romanian sign Nici→ 𛰇 duployan letter n
1BC58 𛱘 DUPLOYAN LETTER J WITH DOT= Chinook, Romanian CH= Sloan ZH= Chinook, Perrault J→ 𛰈 duployan letter j
1BC6E Duployan Shorthands and Chinook 1BC8C
Pernin additional affixes1BC80 𛲀 DUPLOYAN AFFIX LOW VERTICAL SECANT
= Pernin Reporters Sub-• dots show position on base glyph and are not
rendered → 𛰑 duployan letter b→ 𛲁 duployan affix mid vertical secant→ 𛲂 duployan affix high vertical secant
1BC81 𛲁 DUPLOYAN AFFIX MID VERTICAL SECANT= Pernin Reporters Trans-• dots show position on base glyph and are not
rendered → 𛰑 duployan letter b→ 𛲀 duployan affix low vertical secant→ 𛲂 duployan affix high vertical secant
1BC82 𛲂 DUPLOYAN AFFIX HIGH VERTICAL SECANT= Pernin Reporters Super-• dots show position on base glyph and are not
rendered → 𛰑 duployan letter b→ 𛲀 duployan affix low vertical secant→ 𛲁 duployan affix mid vertical secant
1BC83 𛲃 DUPLOYAN AFFIX HIGH LONG GRAVE= Pernin Contra-→ 𛰫 duployan affix high grave arc
1BC84 𛲄 DUPLOYAN AFFIX HIGH VERTICAL• also functions as attached affix vertical up with
ZWJ• this affix is about half as long as Duployan
Letter P• as a prefix, has falling stroke direction= Pernin ZWJ + -ime= Sloan Tele-→ 𛰁 duployan letter p
1BC85 𛲅 DUPLOYAN AFFIX HIGH TIGHT ACUTE= Pernin Pro-• as a suffix, placed above and to the right of the
following letter→ 𛰪 duployan affix high acute
1BC86 <reserved>1BC87 <reserved>
Down slope combined consonants1BC88 𛲈 DUPLOYAN LETTER S T
• Pernin, Perrault• written down= Sloan SM
1BC89 DUPLOYAN LETTER S T R• Pernin, Perrault• written down= Sloan SN
1BC8A DUPLOYAN LETTER S P• Pernin, Perrault• written down= Sloan KW
1BC8B DUPLOYAN LETTER S P R• Pernin, Perrault• written down= Sloan SKW
1BC8C DUPLOYAN LETTER T S• written down• Perrault= Sloan STD
Chinook non-letters1BC6E DUPLOYAN SIGN O WITH CROSS
• Chinook Likalisti1BC6F DUPLOYAN PUNCTUATION CHINOOK FULL STOP
Pernin and Sloan vowels1BC70 𛱰 DUPLOYAN LETTER W
• Sloan, Perrault, Pernin• written down• takes form of a hook or wave after K and G→ 𛲞 duployan letter sloan wh
1BC71 𛱱 DUPLOYAN LETTER LONG U• Pernin, Perrault• this vowel does not rotate to match entry angle
of preceding consonant→ 𛰎 duployan letter ou
1BC72 𛱲 DUPLOYAN LETTER UH• Sloan→ 𛰎 duployan letter ou
1BC73 𛱳 DUPLOYAN LETTER OOH• Sloan→ 𛰎 duployan letter ou
1BC74 𛱴 DUPLOYAN LETTER SLOAN U→ 𛰎 duployan letter ou
1BC75 𛱵 DUPLOYAN LETTER SLOAN OW• reverse circle vowel→ 𛰋 duployan letter a
1BC76 𛱶 DUPLOYAN LETTER SLOAN EH1BC77 𛱷 DUPLOYAN LETTER SLOAN EE1BC78 𛱸 DUPLOYAN LETTER LONG I
• Pernin• angles like an “F” when adjacent a K-type
consonant1BC79 <reserved>1BC7A 𛱺 DUPLOYAN LETTER PERNIN AN
• written down→ 𛰛 duployan letter nasal a
1BC7B 𛱻 DUPLOYAN LETTER PERNIN AM• written down→ 𛰛 duployan letter nasal a→ 𛰖 duployan letter vocalic m
1BC7C 𛱼 DUPLOYAN LETTER SLOAN AN→ 𛰛 duployan letter nasal a
1BC7D DUPLOYAN LETTER SLOAN EN→ 𛰛 duployan letter nasal a→ 𛰘 duployan letter nasal i
1BC7E DUPLOYAN LETTER SLOAN ON→ 𛰛 duployan letter nasal a→ 𛰚 duployan letter nasal o
Sloan R-form selector1BC7F DUPLOYAN THICK LETTER SELECTOR
• commonly abbreviated DTLS• Sloan R-flavored letters• Shape shown is arbitrary and is not visibly
rendered• Causes previous Duployan character to be
rendered bold
1BC8D Duployan Shorthands and Chinook 1BC9F
1BC9B DUPLOYAN LETTER S M• written up• Perrault= Pernin GRS= Sloan SL
1BC9C 𛲜 DUPLOYAN LETTER K R S• written up• Perrault
1BC9D 𛲝 DUPLOYAN LETTER G R S• written up• Perrault
1BC9E 𛲞 DUPLOYAN LETTER S K• written up• Perrault, Pernin= Sloan TS
1BC9F 𛲟 DUPLOYAN LETTER S K R• written up• Perrault, Pernin= Sloan DS
1BC8D DUPLOYAN LETTER T R S• written down• Perrault= Sloan SST
1BC8E DUPLOYAN LETTER WH• written down→ 𛱰 duployan letter w
1BC8F DUPLOYAN LETTER W R• written down• Perrault= Sloan SW
Pernin additional affixes1BC90 𛲐 DUPLOYAN AFFIX LEFT HORIZONTAL SECANT
= Pernin Reporters Extra-• dots show position on base glyph and are not
rendered → 𛰒 duployan letter d→ 𛲑 duployan affix mid horizontal secant→ 𛲒 duployan affix right horizontal secant
1BC91 𛲑 DUPLOYAN AFFIX MID HORIZONTAL SECANT= Pernin Reporters Inter-• dots show position on base glyph and are not
rendered → 𛰒 duployan letter d→ 𛲐 duployan affix left horizontal secant→ 𛲒 duployan affix right horizontal secant
1BC92 𛲒 DUPLOYAN AFFIX RIGHT HORIZONTAL SECANT= Pernin Reporters Contra-• dots show position on base glyph and are not
rendered → 𛰒 duployan letter d→ 𛲐 duployan affix left horizontal secant→ 𛲑 duployan affix mid horizontal secant
1BC93 𛲓 DUPLOYAN AFFIX LOW LONG GRAVE= Pernin Extra-→ 𛰻 duployan affix low grave arc
1BC94 𛲔 DUPLOYAN AFFIX LOW VERTICAL• functions as attached affix vertical down with
ZWJ• this affix is about half as long as Duployan
Letter P• as a prefix, has rising stroke direction= Pernin ZWJ + -ine→ 𛰁 duployan letter p
1BC95 𛲕 DUPLOYAN AFFIX LOW TIGHT ACUTE= Pernin Suf-, Sug-• as a suffix, placed under and to the right of the
following letter→ 𛰺 duployan affix low acute
1BC96 <reserved>1BC97 <reserved>
Up slope combined consonants1BC98 <reserved>1BC99 <reserved>1BC9A DUPLOYAN LETTER S N
• written up• Perrault= Pernin KRS= Sloan SP
Shorthand Format Controls1BCF0 SHORTHAND FORMAT LETTER OVERLAP
• shape shown is arbitrary and is not visibly rendered
1BCF1 SHORTHAND FORMAT CONTINUING OVERLAP• shape shown is arbitrary and is not visibly
rendered1BCF2 SHORTHAND FORMAT DOWN STEP
= Romanian shorthand affix -tsion-= Sloan contracted ending oo/o + ZWSP• shape shown is arbitrary and is not visibly
rendered1BCF3 SHORTHAND FORMAT UP STEP
= Sloan contracted ending uh/au/aui + ZWSP• shape shown is arbitrary and is not visibly
rendered1BCF4 <reserved>1BCF5 <reserved>1BCF6 <reserved>1BCF7 <reserved>1BCF8 <reserved>1BCF9 <reserved>1BCFA <reserved>1BCFB <reserved>1BCFC <reserved>1BCFD <reserved>1BCFE <reserved>1BCFF <reserved>
1BCF0 Shorthand Format Controls 1BCFF
[Code chart: column 1BCF, rows 0-F; glyph cells shown for 1BCF0, 1BCF1, 1BCF2, and 1BCF3]
This document contains three texts in three different Duployan shorthands: Sloan-Duployan, French
Duployéan, and Romanian stenographie. Each text is accompanied by a "transliteration" in standard Latin
orthography, either as part of the original text, or, in the case of the French text, the translation was
supplied on the French shorthand online forum in response to the posting of the original Duployéan text.
These texts have been parsed line by line, and supplied with the proposed encoding for that text. The
resulting Duployan has been supplied by printing the output of the Microsoft VOLT proofing tool, with
manual adjustment of some of the spacing and combining - the font is not perfect yet. Non-Duployan
characters are represented by symbols as follows: "_" = space (U+0020), "|" = Zero Width Space
(U+200B), "ZWNJ" = Zero Width Non Joiner (U+200C), "ZWJ" = Zero Width Joiner (U+200D), and "."
= Stenographic Period (proposed, U+2E3A). The use of Zero Width Space may or may not be necessary,
pending input from the UTC as to whether letters used as word signs in Sloan-Duployan that cursively
connect should have the Zero Width Space to indicate a word boundary since there is no visible word
spacing to represent word breaks.
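The notation described above can be turned back into code points mechanically. The following is a minimal sketch, not part of the proposal itself: it assumes that each two-digit "xNN" token is an offset into the proposed block starting at U+1BC00 (so that xF0 is 1BCF0), and that the symbols map exactly as listed in the legend. The names `DUPLOYAN_BASE`, `SYMBOLS`, and `decode_tokens` are hypothetical.

```python
# Assumption: "xNN" is shorthand for U+1BC00 + NN in the proposed allocation.
DUPLOYAN_BASE = 0x1BC00

# Non-Duployan symbols, as given in the legend of the test encoding.
SYMBOLS = {
    "_": 0x0020,     # space
    "|": 0x200B,     # Zero Width Space
    "ZWNJ": 0x200C,  # Zero Width Non Joiner
    "ZWJ": 0x200D,   # Zero Width Joiner
    ".": 0x2E3A,     # Stenographic Period (proposed code point per the legend)
}


def decode_tokens(line):
    """Convert one whitespace-separated line of test-encoding tokens
    into a list of integer code points."""
    code_points = []
    for token in line.split():
        if token in SYMBOLS:
            code_points.append(SYMBOLS[token])
        elif token.startswith("x") and len(token) == 3:
            # Two hex digits after the "x" give the offset within the block.
            code_points.append(DUPLOYAN_BASE + int(token[1:], 16))
        else:
            # Anything else is rejected rather than guessed at.
            raise ValueError(f"unrecognised token: {token!r}")
    return code_points
```

Tokens that do not fit the legend are rejected with a `ValueError`, so a malformed token in a transcribed line is flagged rather than silently reinterpreted.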
x0B x02 x7F x0C x11 _ x07 | x02 _ x03 x0B x05 x02 _ x70 x0B x09 _ x12 _ x02 _ x3E x03 x7F x02 x19 .
x28 | x15 | x88 | x0A | x02 _ x4D ZWNJ x01 x7F x18 _ x12 _ x0C x09 x11 _ x03 x7F x29 _ x03 _ x01 x7F x01 x02 _ x0A x01 x7F x1C
x08 _ x03 | x02 | x14 x7F _ x05A x11 x7F x0B x02 x7F x0C _ x0A _ x07 x1C x02 x7F .
x02 _ x3E x8C _ x02 _ x03 x1C _ x11 | x02 _ x3E ZWJ x2A x12 x0B x04 _ x0A x02 _ x03 | x70 | x11 | ZWJ x1B _ x4D ZWNJ x01 x0B
x02 _ x70 x0C _ x09 x7F ZWJ x1A _ x1B _ x0C ZWJ x2A ZWNJ x09 _ x70 x0C _ x8C x09 .
L’artichaut
Si le chardon son lointain cousin est l’emblème de l’Ecosse, l’artichaut n’en pourrait pas moins être considéré comme l’un des emblèmes
majeurs de la cuisine méditerranéenne au même titre que l’ail ou la tomate. L’artichaut est un chardon domestiqué et cultivé. On désigne
sous ce nom à la fois la plante entière et sa partie comestible. Le nom est apparu à la Renaissance et il viendrait de l’arabe où il signifie
épine de la terre.
x04 x0B x15 x08 x0A
x09 x0C _ x05 _ x08 x0B x15 x12 x1A _ x08 x1B _ x05 x0A x18 x02 x18 _ x04 x0A x08 x18 _ x4D _ x05 x18 ZWJ x11 x1C x06 _ x12
_ x05 x1C x04 x0A x08 _ x05 x0B x15 x08 x0A
x07 x1A _ x01 x0A x15 x1C _ x01 x0B _ x06 x0A x18 _ x0C x02 _ x04 x1B ZWJ x08 x15 x1C _ x24 _ x05 x19 _ x12 x0C _ x18 x11
x1C x06 _ x06 x0B x08 _ x12 x0B _ x04 x0C x08 x0C x07
x06 x12 x1C x15 x0B x07 x1C x07 _ x0A _ x06 x1C x06 _ x02 x15 _ x04 _ x05 x0B _ x4E _ x0E _ x05 x0B _ x02 x0A x06 x0B x02
_ x05 x0B x15 x08 x0A _ x4D _ x19 _ x08 x0B x15 x12 x1A
x12 x0A x25 x04 x1C _ x4D _ x04 x0D x05 x13 x1C _ x1A _ x22 x0C x07 _ x08 x0E _ x08 _ x07 x1A _ x0B x05 _ x03 x0B _ x05
x0B _ x01 x05 x41 _ x18 x02 x3D
x4D _ x08 x0B _ x0B x02 x0C _ x24 x1C x08 x11 _ x05 _ x06 x1A _ x4D _ x0B x01 x0B x15 x0D _ x0B x05 _ x15 x25 x40 _ x4D _
x4C _ x23 x12 x0C
x12 _ x05 x0B x15 x0B x11 _ x0E _ x4C _ x08 x0C x07 x0C x03 x1C _ x1C x21 _ x12 x0B _ x02 x1C x15
x09 x1C _ x41 x12 x1F x44 x44 _ x02 x4F _ x04 x05A x09 x0B _ x09 x0B _ x0C x0B _ x07 x0A x02 _ x06 x0B x15 x0C .
x19 _ x01 x1C x09 x04 x0B x15 _ x59 x0B x15 x44 x44 _ x19 _ x01 x0C x09 x04 .
x09 x0B x15 x0B _ x0B _ x0B x12 x1F x09 _ x19 _ x1F x15 x09 _ x1F x12 _ x05A _ x08 x15 x04 .
Guide to Duployan Shorthands and Stenographies
Each guide provides a) a list of the characters necessary to represent the shorthand (overlaps - xF0 & xF1 - not included), b) a list of affixes and their plain text encoding, c) the encoding of any word signs (may be a list, a statement of principles, or some combination thereof), d) the encoding of the nasal vowels and other characters needing ZWJ or ZWNJ, and e) any miscellaneous information, such as number forms, variant characters and their variation selectors, and
Guide to French Duployéan encoding
a. Columns 0-4.
x01-x0E: P, T, F, K, L, M, N, J, S, O, A, I, U, Ou; x11-x15: B, D, V, G, R; x18-x1D: In, Un, On, An, E, Eu; x20-x29: UN, PN, DS, FN, KM, RS, MS, NS, JS, SS; x2A-x2F: High acute, grave, dot, circle, line, and wave; x30-x39: JN, JNS, MN, NM, JM, SJ, MNS, NMS, JMS, SJS; x3A-x3F: Low acute, grave, dot, circle, line, and wave; x40-x44: Attached secant, tangent, tail, E-hook, and I-hook; x46-x47: Aou, Oa; x4A-x4E: short I, Ee, Ie, Ui, Ye.
b. Affixes: -cionnel/-cionnaire/-zionnel/-zionnaire, +x42; -té/-dé, +x43; -ta/-to/-tou/-da/-do/-dou, +x44; -tan/-dan/-anté/-ande, +x41; -anse/-inse/-onse/-ianse, +x40; -ment, +x2A; -cion, +x3A; -ing/-indre/-ian/-éen/-ien, +x2B; -ion/-zion, +x3B; -euil/-ieu/-ueu/-ieur/-eur, +x2C; -ié/-eil, +x3C; -ueuse/-ieuse/-euse, +x2D; -ière, +x3D; -ificative/-ificatif, +x2F; -ification, +x3F; -iste, +x2E; -isme, +x3E.
c. French Duployéan makes extensive use of conventional word signs. For the most part, these are single character word signs, or sequences of a few characters. Importantly, x0C & x1C (I & E) should not be used unless the word sign contains consonants which will determine the orientation of the vowels. Vowel-only word signs will instead use x4A-x4D (short I, Ee, Ie, and Ui), to orient the adjacent characters.
d. Nasal vowels are encoded In/Un/On/An + ZWJ.
e. Numbers: quantity one (not digit 1), x19; otherwise, uses European numbers with the characters for powers of ten: hundreds, + ZWNJ + x09; thousands, + x2C; millions, + x3B; milliards, + ZWNJ + x15; hundred thousand/million/milliard, + ZWNJ + x09 + [x2C / x3B / ZWNJ + x15]. Other forms follow the number and power signs: ordinals, + x3C; adverbials, x2A; approximate, x2B; percent, U+0325; permille, U+035A.
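The power-of-ten rule for French Duployéan numbers can be sketched as follows. This is an illustration under stated assumptions rather than a normative algorithm: it assumes that "xNN" means U+1BC00 + NN, that the bare "3B" given for millions means x3B, and that the power marks are simply appended after the European digits. The names `POWER_MARKS` and `french_number` are hypothetical.

```python
ZWNJ = 0x200C
BASE = 0x1BC00  # assumption: "xNN" means U+1BC00 + NN

# Power-of-ten marks that follow the European digits, per item e. above.
POWER_MARKS = {
    "hundred": [ZWNJ, BASE + 0x09],
    "thousand": [BASE + 0x2C],
    "million": [BASE + 0x3B],   # assumption: the guide's "3B" means x3B
    "milliard": [ZWNJ, BASE + 0x15],
}


def french_number(digits, *powers):
    """Append the marks for the given powers, in order, to a digit string.

    Compound powers such as "hundred thousand" are expressed by passing
    "hundred" followed by "thousand".
    """
    marks = [cp for power in powers for cp in POWER_MARKS[power]]
    return digits + "".join(chr(cp) for cp in marks)
```

For example, `french_number("3", "hundred", "thousand")` yields the digit 3 followed by ZWNJ, x09, and x2C, which matches the pattern given for hundred thousand.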
Guide to Chinook Pipa encoding
a. Columns 0, 1, 5, & 6.
x00-x0F: H, P, T, F, K, L, M, N, J, S, O, A, I, U, Ou, Ow; x10-x15: X, B, D, V, G, R; x18-x1C: In, Un, On, An, E; x57-x5D, x5F: N/J/S with dot, Wo, Wa, Wi, Wei, Wow; x60-x62, x67-x6A, x6E-x6F: Uh, Th, Dh, Kk, Hl, Lh, Rh, Likalisti, Chinook Full Stop.
b. No affixes are used in Chinook Pipa writing.
c. The only word signs are x6E, Likalisti sign, and abbreviations using letter overlap, xF0.
d. Nasal vowels encoded without plain. U (x0D) must be adjacent to a non-joining character or be followed by ZWNJ. Most non-adjacent vowels and all circle vowels - even when adjacent to each other - have an intervening non-joining character.
e. Numbers: x01-x04, x15, x06-x0A = digits 1-9 & 0; x0B = 10s, x5B = 100s, markup for thousands/millions/&c.: When enumerating (quantity), the numbers are used Hanzi-style, with x5B and x0B multiplying a preceding character. If there would be no ones digit, it should take x0A, instead of x0B, to indicate the digit 0. Markup should be used to circle groups of numbers to indicate ×1000, and the rule of the ones digit should apply to these circled groups as well. When numbering, as in years, &c., the numbers are used Indian/Arabic-style, using the above digits 1-9 & 0 with place value.
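The place-value ("numbering") usage is the simpler of the two Chinook number styles and can be sketched directly from the digit assignments above. This sketch covers only that style, not the Hanzi-style enumeration with x0B and x5B, and it assumes that "xNN" means U+1BC00 + NN. The names `DIGIT_OFFSETS` and `chinook_place_value` are hypothetical.

```python
BASE = 0x1BC00  # assumption: "xNN" means U+1BC00 + NN

# Digits 1-9 and 0 as listed in the guide: x01-x04, x15, x06-x0A.
DIGIT_OFFSETS = {
    "1": 0x01, "2": 0x02, "3": 0x03, "4": 0x04, "5": 0x15,
    "6": 0x06, "7": 0x07, "8": 0x08, "9": 0x09, "0": 0x0A,
}


def chinook_place_value(number):
    """Encode a string of decimal digits (for example a year) using the
    Chinook digit letters with ordinary place value."""
    return "".join(chr(BASE + DIGIT_OFFSETS[digit]) for digit in number)
```

Note that 5 is the one digit that falls outside the x01-x0A run, taking x15 instead, which is why a lookup table is used rather than simple arithmetic on the digit value.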
Guide to Romanian Stenografie encoding
a. Columns 0-5, Combining diacritical marks, Basic Latin, and Latin 1.
x00: H (as affix -tat, -tate); x01-x0B: P, T, F, K, L, M, N, J, S, O, A; x0D: U (as nasal); x0F: Ow (in word signs only); x11-x15: B, D, V, G, R; x18-x1F: In, Un, On, An, E, Eu (as nasal), Romanian I, Romanian U; x28-x29: JS (as ge/gi), SS (as ţ); x2A, x2C: High acute and dot; x32: MN (word sign); x3C: Low dot; x44: Attached I-hook; x49: S with dot; x4B: Ee; x4D: Ui; x4F: Double mark; x50-x51: Low arrow, attached tangent hook; x55-x59: J with dots above and below, N/M/J/S with dots; xF2: Shorthand control under; U+002B: + sign (prefix); U+003C-3E, U+00D7, U+2197: <, =, >, × signs, NE arrow (in word signs); U+0323: combining dot below.
b. Affixes: trans-, x2C+; inter-, x3C+; -ţiune, U+0323; -tiv/-ziv/-siv, +x3C; -lui, +x2C; -tat/-tate/-tâţe, x00+; -mant, +x2A; -lor/-ilor, +ZWNJ+x15; -ţion + ar/al/at/am/eazâ, + xF2 + [ x07 / x05 / x02 / x06 / x09+x0B ]; -mentar(e)/-mîntare, + ZWNJ + x06; -anţâ/enţâ/inţâ/onţ/unţ, +x18-x1B; -escu/-eṣti/-eascâ, +x44+x44; -ism, +x51; -titudine, +x12; (-)str/zdr/st(-), (+)x41(+); isto-, x51+; -ist, +x41; contra-, x04+x02+ZWNJ; ex-/exa-/exo-/exter-/extra-/extre-, x4D+; circum-/circu-, x4B; super-/supra-, x09+ZWNJ+; asupra-, x0B+ZWNJ+; electra-, x05+ZWNJ; ne-, x07+ZWNJ; nemai-, x07+x08+ZWNJ; -tor, +ZWNJ+x02; sub-, x50+; dra-/dre-/dru-, x12+ZWNJ; pra-/pro-/pre-/pri-/pru-, x01+ZWNJ; bra-/bri-/bro-/bru-, x11+ZWNJ; cre-/cra-/cro-/cru-, x04+ZWNJ; gra-/gre-/gri-/gru-, x14+ZWNJ; fru-/fra-/fre-/fri-, x03+ZWNJ; vre-/vra-, x13+ZWNJ; plus-, U+002B.
c. Word signs: The Latin-script-style word signs should be encoded as Basic Latin (U+0030..007F), with the script font specified by markup. Under no circumstances should the Mathematical Italic or Script letters (U+1D434..1D503) be used for Romanian Stenographic word signs. The multiplication, less than, equal, greater than signs, and north-east arrow (U+00D7, U+003C-U+003E, & U+2197) should be used for multiple/multe, mai mic, egal/acelaṣi, mai mare, & în. These characters can overlap Duployan characters, and even each other.
d. Nasal vowels are encoded In/Un/On/An + ZWJ, and U/Eu.
e. Numbers: Uses European numerals with diacritics and numbers for powers of ten: hundreds, + U+0307; thousands, + U+00B7; millions, + U+0323; milliards, U+00B7 +; hundred (milliards, &c.), + U+0307 + (U+00B7, &c.). Other forms precede or follow the number and power signs: multiplicative, x1B +; percent, + U+030A; grade, + U+00B0; ordinals, + x12.
Guide to Pernin encoding
a. Columns 0-9.
x00-x0F: H, P, T, F, K, L, M, N, J, S, O, A, I, U, Ou, Ow; x11-x15: B, D, V, G, R; x18-x1C: nasal I, U, O, A, letter E; x21-x29: PN, DS, FN, KM, RS, MS, NS, JS; x2A, x2B, x2E: High Affixes Acute, Grave, Line; x3A, x3B, x3E: Low Affixes Acute, Grave, Line; x48: JS w/ dot; x4A-x4D: short I, Ee, Ie, Ui; x58: J w/ dot; x61: Th; x70, x71, x78: W, Long U, Long I; x7A, x7B: Pernin An, Am; x80-x82: Affixes Low, Mid, & High Vertical Secant; x83-x85: High Affixes Long Grave, Vertical, Tight Acute; x88-x8D, x8F: ST, STR, SP, SPR, TS, TRS, WR; x90-x92: Affixes Left, Mid, & Right Horizontal Secant; x93-x95: Low Affixes Long Grave, Vertical, Tight Acute; x9A-x9F: SN, SM, KRS, GRS, SK, SKR.
b. Universal Affixes: con-, x2B; contr-/counter-, x83; dis-, x2E; ex-, x3B; extra-, x93; enter-/intro-, x19+x02+ZWNJ; mis-, x3E; nom-/non-/num-, x07+ZWNJ; magn-, x06+ZWNJ; por-/pro-/pru-, x86; multi-, x06+xF1; sub-/sur-, x2A; suf-/sug-, x96; trans-, x02+xF1; accom-/accoun-, x0B+ZWJ+x2B; concom-, x2B+x2B; encom-/encoun-/incog-, x18+ZWJ+x2B; uncon-/uncom-, x1B+ZWJ+x2B; uncontro-, x1B+ZWJ+x83; unencum-, x1B+ZWJ+x18+ZWJ+x2B; unaccoun-, x1B+x0B+ZWJ+x2B; unpro-, x1B+ZWJ+x86; recon-/recom-/recoun-/recog-, x15+ZWJ+x2B; compro-, x2B+ZWJ+x86; discon-, x2E+ZWJ+x2B; dismis-, x2E+x3E; misex-, x3E+ZWJ+x3B; noncom-, x07+ZWJ+x2B; nonsub-, x07+ZWJ+x2A; procon-, x86+ZWJ+x2B; propor-, x86+x86; subcom-, x2A+ZWJ+x2B; subcontra-, x2A+ZWJ+x83; unex-, x1B+ZWJ+x3B; enun-/enum-/innum-, x1A+ZWJ+x07+ZWNJ; insub-, x19+ZWJ+x2A; irrecon-, x4D+x15+ZWJ+x2B; acs-, x0B+ZWJ+x3B; aux-, x0A+ZWJ+x3B; per-/pre-/pur-, x01; retr+a/e/i/o-, x15; circum-, x9E; ever-, x4D+x13; every-, x4D+x13+x0C; for-/fore-, x03; just-, x48; out-, x0F; upper-, x0D+x01; after-, x0B+x03+x02; good-, x14;
under-, x0B; over-, x0A+x13; -ness, x07; -full, x03; -fully, x03+x0C; -fullness, x03+x07; -less, x05; -lessly, x05+x0C; -a/i/+ble, x11; -a/i/+bly, x11+x0C; -cian/-c/s/t+ion, x08; -ime, +ZWJ+x84; -imely, +ZWJ+x84+ZWJ+x0C; -ine, +ZWJ+x94; -inely, +ZWJ+x94+ZWJ+x0C; -ineness, +ZWJ+x94+ZWJ+x07; -ment, x06; -ing(s), xF3+ZWSP; -ingly, x0C; -ingness, x27+x0C; -n(d/g)ing, x27; -some, x29; -with, x61; -ship, x08+x0C; -after, x0B+x03+x02; -a/e/i+l/r+ity, x02+x0C; -sci/ti/de/ge+ousness, x08+x07; -e/i/a+tive, x13; -graph, x14; -graphy, x14+x0C; -graphic, x14+x0C+x04; -self, x09+x03; -selves, x09+x13; -(i)bleness, x11+x07.
Reporters' Affixes: pro-, x2A; per-, x52; pre-, x40; con-/coun-/com-, x2B; dis-/des-, x2E; mis-/mes-, x3E; sub-/sup-/surp-, x80; trans-, x81; super-/supre-, x82; extr+a/e/i-, x90; i/e+enter-/intro-, x91; contr+a/i/o-/counter-, x92; precon-, x40+x2B; unpre-, x1B+ZWJ+x40; discon-, x2E+x2B; indis-, x19+ZWJ+x2E; miscon-, x3E+x2B; uncon-, x1B+ZWJ+x2B; reco+m/n-, x15+ZWJ+x2B; irrecon-, x0C+x15+ZWJ+x2B; acco+m/un-, x2B+x0B+x04; for(e)-, x03; self-, x09+x03; just-, x58; circum-, x09+x05; retr+o/i-, x15+x1C+x02; repre-, x15+x01; -ness, x07; -full, x03; -ment, x06; -less, x05; -cian/-c/s/t+ion, x08; -a/i/+ble, x11; -a/i/+bly, x11+x0C; -lative, x05+x13; -tative, x02+x13; -bility, x11+x02; -fully, x03+x0C; -fullness, x03+x07; -les/-lou+sly, x05+x0C; -lessness, x05+x07; -iveness, x13+x07; -ousness, x09+x07; -ableness, x11+x07; -sci/-ti+ousness, x08+x07; -d/-g+eousness, x58+x07.
c. Word signs: Almost every letter used in the Pernin orthography can be used as a word sign for a common word. Pernin does not prescribe overlapping for abbreviations, although ad hoc abbreviations are encouraged in the reporters orthography.
d. Nasal vowels are encoded with x18-x1B+ZWJ, x7A, & x7B.
e. Pernin does not prescribe number usage.
Guide to Sloan-Duployan encoding
a. Columns 0-9.
x00-x0C: H, P, T, F, K, L, M, N, J, S, O, A, I; x11-x15: B, D, V, G, R; x19, x1B: nasal U, A; x22, x26-x29: DS, MS, NS, JS, SS; x48: JS w/ dot; x57-x58: S & J w/ dot; x61, x63: Th, Sloan Dh; x66: Sloan J; x70, x72-x77, x7C-x7F: W, Uh, Ooh, Sloan U, Ow, Eh, Ee; Sloan An, En, On; Combining R; x88-x8F: ST, STR, SP, SPR, TS, TRS, Wh, WR; x9A-x9F: SN, SM, KRS, GRS, SK, SKR.
b. Affixes: per-/pre-/pro, x01+x7F; co(l/m/n/un)-/econ-, x04+ZWNJ; acco+m/un-, x0B+x04+ZWNJ; reco+l/m/n/un-, x15+x0C+ZWNJ; contr+a/i/o-/counter, x2B; acc-/ax-/ex-, x40; ab-/ob-, x2C; des-/dis-, x2E; mes-/mis-, x3E; su(b/c/f/g/p/r/s)-, x09+ZWNJ; tra+m/n/ns-, x02+x7F; sup+er/ra/re-, x9A+ZWNJ; extr+a/e/i-, x1C+x04+ZWNJ; e/i+nt+er/ri/ro-, x19+x7F+ZWNJ; ant+a/e/i-, x19+ZWNJ; ind+e/i/is/us-, x18+ZWNJ; ultra-, x0A+ZWNJ; (in)tel+e/le/li-, x02+ZWNJ; sem+a/e/i-, x88+ZWNJ; det+er/ra/ri-, x12+ZWNJ; man(a/i/u)-/mon(o/u)-, x06+ZWNJ; nom-/non-/num-, x07+ZWNJ; irr+a/e/i/u-, x44; edi-/edu-, x4B+ZWNJ; i/u+nco+m/n/l-, x4D+ZWNJ; self-, x09+x05; just-, x8C; circu(l/m)-, x8B; -ness, x07; -ful, x03; -less, x05; -lessly, x05+x0C; -s/t+ion, x08; -s/t+ions, x08+x09; -cess/cis/sat/sess/sic/sit+ion, x09+x08; -a/i/+ble, x11; -ment, x06; -mony, x06+x0C; -ments/-most/-mous/-mus, x06+x09; -graph, x14+x7F; -graphy, x14+x7F+x0C; -ous, x09; -ously, x09+x0C; -(a/i)tiveness, x13+x07; -tively, x13+x0C; -tativeness, x02+x13+x07; -tatively, x02+x13+x0C; -lativeness, x05+x13+x07; -latively, x05+x13+x0C; -c/t/x+ious, x08+x0A; -ciously, x08+x0A+x0C;
-ciousness, x08+x0A+x07; -c/s/t+ial, x08+x0B; -c/t+ialness, x08+x0B+x07; -c/t+ially, x08+x0B+x0C; -mously, x06+x09+x0C; -mousness, x06+x09+x07; -nously, x07+x09+x0C; -nousness, x07+x09+x07; -somely, x88+x0C; -sally, x9B+x0C; -cessity, x8D+x0C; -a/e/i+lity, x05+x02; -a/e+nce/-uns, ZWJ+x7B; -self/-selves, x09+x03; -a/e/i/o/u+ry, x44; -monious, x06+x0C+x09; -ism/-some, x88; -ci/sia+sm, x89; -c/s+ity, x8C+x0C; -(o)logy, x05+x14; -cation, x04; -(a/i)tive, x13; -neous/-nesses/-nous, x07+ZWJ+x09; -(e/i)x(t), x40; -a/i/+bly, x11+x0C; -fully, x03+x0C; -fullness, x03+x07; -lessness, x05+x07; -ableness, x11+x07; -bility, x11+x02; -lative, x05+x13; -tative, x02+x13; -ousness, x09+x07; -someness, x88+x07.
Numerous double prefixes can be formed by joining simple prefixes, with ZWJ if needed. When a nasal sound precedes a prefix, it can be joined (+ZWJ) to the prefix.
c. Word signs: Almost every letter used in the Sloan orthography can be used as a word sign for a common word. When word signs are found in series, they can be visually connected, separated only by a Zero Width Space (U+200B).
d. Nasal vowels are encoded x19 & x1B+ZWJ, and x7C-x7E.
e. Sloan uses European numbers, with overstrike, strike-through, and understrike indicating ×100, ×1000, and ×1 000 000.
Guide to Perrault encoding
a. Columns 0-9.
x00-x0E: H, P, T, F, K, L, M, N, J, S, O, A, I, U, Ou; x11-x16, x1B: B, D, V, G, R, vowel M, A nasal; x27-x29: NS, JS, SS; x47-x48, x58: Oa, JS w/ dot, J w/ dot; x5B: Wa; x60-x61: Xw, Th; x70, x71, x78: W, Long U, Long I; x88-x8D, x8F: ST, STR, SP, SPR, TS, TRS, WR; x9A-x9F: SN, SM, KRS, GRS, SK, SKR; U+0300, U+0301, U+0304, U+0307, U+0317, U+0323: diacritics.
b. The Perrault shorthand does not have any known codified affixes.
c. The Perrault shorthand is not known to use word signs.
d. Nasal vowels are encoded x1B+(diacritic)+ZWJ, or x16+diacritic.
e. The Perrault shorthand does not prescribe number usage.
An oddity of the Perrault system is the use of diacritics to differentiate vowels: they are used with Vocalic M and A Nasal to indicate the inherent vowel of the nasal, and with I (x0C) to indicate an 'ei', 'eh', 'ee', or short I sound.