Commits · 9fa052733eb93a3ce1205f63ff8f74cb295cbe99 · OpenHarmony / Third Party Harfbuzz

24 Jul, 2012 1 commit

[Indic] Limit syllables to at most five consonants · 9fa05273

Committed by Behdad Esfahbod on Jul 23, 2012

Seems to be about what Uniscribe does.  Not exactly.  But close enough.
More consonants will start a new cluster.

A few scripts went way down in failures.  In particular:

  - Devanagari failures went down from 490 to 56.
  - Telugu went down from 113 to 49.

Other scripts went down slightly or didn't change.  New numbers:

BENGALI: 353908 out of 354285 tests passed. 377 failed (0.106412%)
DEVANAGARI: 693572 out of 693628 tests passed. 56 failed (0.00807349%)
GUJARATI: 366485 out of 366506 tests passed. 21 failed (0.00572978%)
GURMUKHI: 60750 out of 60809 tests passed. 59 failed (0.0970251%)
KANNADA: 950730 out of 951913 tests passed. 1183 failed (0.124276%)
KHMER: 298613 out of 299124 tests passed. 511 failed (0.170832%)
MALAYALAM: 1046881 out of 1048416 tests passed. 1535 failed (0.146411%)
ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%)
SINHALA: 271333 out of 271847 tests passed. 514 failed (0.189077%)
TAMIL: 1091837 out of 1091837 tests passed. 0 failed (0%)
TELUGU: 970524 out of 970573 tests passed. 49 failed (0.00504856%)

Some of the remaining Telugu and Devanagari issues seem to be Uniscribe
eating Anusvara when placed before a non-joiner.  Ouch!
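
As a rough illustration of the five-consonant rule described in this commit, the Python sketch below splits a run of code points so that no cluster carries more than five consonants, with a sixth consonant starting a new cluster. It is not HarfBuzz's actual Ragel grammar: the names `split_by_consonant_limit` and `is_consonant`, and the restriction to the basic Devanagari consonant range, are assumptions made for the example, and it models only the consonant cap rather than full syllable parsing.

```python
MAX_CONSONANTS = 5  # limit named in the commit message

def is_consonant(cp):
    # Assumption: only the basic Devanagari consonants KA..HA (U+0915..U+0939).
    return 0x0915 <= cp <= 0x0939

def split_by_consonant_limit(codepoints, limit=MAX_CONSONANTS):
    """Split code points so that each cluster holds at most `limit` consonants."""
    clusters, current, count = [], [], 0
    for cp in codepoints:
        if is_consonant(cp):
            if count == limit:
                # One consonant too many: close the cluster and start a new one.
                clusters.append(current)
                current, count = [], 0
            count += 1
        current.append(cp)
    if current:
        clusters.append(current)
    return clusters
```

Under these assumptions, six KA letters joined by Halants come back as two clusters, the second holding only the sixth consonant.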

21 Jul, 2012 2 commits

[Indic] Allow a ZWNJ after SM's · 5791f329

Committed by Behdad Esfahbod on Jul 20, 2012

Malayalam failures go way down.  Other scripts benefitted slightly too.
Sinhala had one or two test regressions, but...

[Indic] Break syllables at Halant,ZWNJ · 9e4f94a7

Committed by Behdad Esfahbod on Jul 20, 2012

That's really what Uniscribe does, and explains a lot of peculiarities of
Halant,ZWNJ before the base.

Sent Telugu from 1% failures to 0.03%.  Improved Kannada and Malayalam
slightly.  Fixed half of Bengali, and did NOT break anything!
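
A minimal sketch of the idea in this commit, assuming the break falls immediately after each Halant,ZWNJ pair: the function below cuts a code point sequence at those points. The name `break_at_halant_zwnj` is hypothetical, and the Devanagari Virama stands in for Halant purely as an example; the real shaper does this inside its syllable grammar, not as a separate pass.

```python
HALANT = 0x094D  # DEVANAGARI SIGN VIRAMA, used here as the example Halant
ZWNJ = 0x200C    # ZERO WIDTH NON-JOINER

def break_at_halant_zwnj(codepoints):
    """Split a code point sequence right after every Halant,ZWNJ pair."""
    syllables, current = [], []
    for cp in codepoints:
        current.append(cp)
        # A ZWNJ directly preceded by a Halant ends the current syllable.
        if cp == ZWNJ and len(current) >= 2 and current[-2] == HALANT:
            syllables.append(current)
            current = []
    if current:
        syllables.append(current)
    return syllables
```

A ZWNJ that does not follow a Halant leaves the sequence in one piece in this sketch.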

19 Jul, 2012 8 commits
- [Indic] Accept a forced Rakar sequence at the end of syllable · 422ecd2d
  Committed by Behdad Esfahbod on Jul 18, 2012
```
In Sinhala, Rakar is formed by Al-Lakuna,ZWJ,Ra.  If you put that at the
end of a Consonant,Matra syllable, you get a dotted-circle from
Uniscribe.  Apparently adding a ZWJ before the Al-Lakuna "fixes" that.
And people have been encoding that sequence...  So, allow a forced
"ZWJ,Virama,ZWJ,Ra" sequence at the end of syllables.

Fixes some 100 or more of Sinhala failures.  Now at 622 only (0.23%).
```
- [Indic] Allow joiners on both sides of Halant at the same time · 6fc17320
  Committed by Behdad Esfahbod on Jul 18, 2012
```
The sequence <ZWJ,Al-Lakuna,ZWJ> is used in Sinhala to explicitly ask
for Rakar.  Fixes two-thousand Sinhala tests.  Not many left.
```
- [Indic] Treat Register Shifters like Nukta · 552d19b7
  Committed by Behdad Esfahbod on Jul 18, 2012
```
Really this time.

Fixes another 18 Khmer tests.
```
- [Indic] Allow joiners before matras · dcb52724
  Committed by Behdad Esfahbod on Jul 18, 2012
```
Fixes 1 more Devanagari test!
```
- [Indic] Allow halant group in Vowel and placeholder syllables · 391cc033
  Committed by Behdad Esfahbod on Jul 18, 2012
```
Fixes 2 out of 560 Devanagari failures.  AND:
Fixes 1 out of 2 Tamil failures.
```
- [Indic] Streamline halant/joiner in grammar · ca4e3d3e
  Committed by Behdad Esfahbod on Jul 18, 2012
- [Indic] Minor · 418d00df
  Committed by Behdad Esfahbod on Jul 18, 2012
- [Indic] Hopefully minor! · 4c3691d2
  Committed by Behdad Esfahbod on Jul 18, 2012
```
Refactoring the Indic machine.  No semantic change.
```
18 Jul, 2012 4 commits
- [Indic] Position Khmer Robat · db8981f1
  Committed by Behdad Esfahbod on Jul 17, 2012
```
It's a visual Repha.

Still not positioning logical Repha as occurs in Malayalam.

Another 200 Khmer failures fixed.  547 to go.  That's better than
Devanagari!
```
- [Indic] Better categorize Register Shifters and Khmer Various signs · 25bc4894
  Committed by Behdad Esfahbod on Jul 17, 2012
```
Down another 500 or so Khmer failures!
```
- [Indic] Treat Khmer Register Shifters more like Nuktas · 34b57149
  Committed by Behdad Esfahbod on Jul 17, 2012
```
Except that there may be a ZWNJ before a Register Shifter.
```
- [Indic] Minor · 11e2a601
  Committed by Behdad Esfahbod on Jul 17, 2012
17 Jul, 2012 3 commits
- [Indic] Recategorize Khmer coeng sign as a separate category OT_Coeng · c50ed71e
  Committed by Behdad Esfahbod on Jul 17, 2012
```
Amend the syllable structure to allow a final subscripted consonant
(Coeng+C) and a final subscripted independent vowel (Coeng+V).
Fixes another 2k of Khmer failures.
```
- [Indic] Add a separate Coeng class · deb521de
  Committed by Behdad Esfahbod on Jul 17, 2012
```
No characters recategorized yet.  No semantic change.
```
- [Indic] Recognize Register Shifter marks · 7d09c98a
  Committed by Behdad Esfahbod on Jul 16, 2012
```
Fixes another 6% of the Khmer failures.
```
13 Jul, 2012 1 commit
- Make sure HB_BEGIN_DECLS / HB_END_DECLS is only used in public headers · a98d0ab1
  Committed by Behdad Esfahbod on Jul 13, 2012
```
So we can use them to switch default visibility to internal if desired,
and use these to make only declared symbols public.
```
25 May, 2012 1 commit
- Minor · 27aba594
  Committed by Behdad Esfahbod on May 24, 2012
12 May, 2012 3 commits

[Indic] Add Uniscribe bug feature for dotted circle · 18c06e18

Committed by Behdad Esfahbod on May 11, 2012

For dotted-circle independent clusters, Uniscribe does no Reph shaping
for the exact sequence Ra+Halant+25CC.  Which also is the only possible
sequence with 25CC at the end.

[Indic] Allow multiple Consonants in Vowel/NBSP syllables · 9c099289
Committed by Behdad Esfahbod on May 11, 2012
```
Uniscribe allows multiple Halant+Consonant after a Vowel.
Tests:
        * U+0905,U+094D,U+092B,U+094D,930,94d,930
```

[Indic] Allow two Nuktas per consonant · 8c0aa486

Committed by Behdad Esfahbod on May 11, 2012

Uniscribe allows up to two nuktas per consonant and one per matra. It does so
independently of whether the consonant already has a nukta in it.  Tests:

        * U+0916,U+093C,U+0941
        * U+0959,U+093C,U+0941
        * U+0916,U+093C,U+093C,U+0941
        * U+0959,U+093C,U+093C,U+0941
        * U+0916,U+093C,U+093C,U+093C,U+0941
        * U+0959,U+093C,U+093C,U+093C,U+0941
        * 915,93c,93c,,94d,U+0916,U+093C,U+093C,U+093e,93c,93c
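
The limits stated in this commit (two nuktas per consonant, one per matra, with precomposed nukta consonants such as U+0959 counted like any other consonant) can be sketched as a simple check. This is only an illustration: `classify` and `nuktas_within_limits` are hypothetical names, the classifier covers just enough of Devanagari for sequences like the tests above, and what the shaper actually does with an excess nukta is not modeled.

```python
NUKTA = 0x093C  # DEVANAGARI SIGN NUKTA
MAX_NUKTAS = {"consonant": 2, "matra": 1}  # limits stated in the commit message

def classify(cp):
    # Assumption: a tiny Devanagari-only classifier for illustration.
    if 0x0915 <= cp <= 0x0939 or 0x0958 <= cp <= 0x095F:
        return "consonant"  # includes precomposed nukta forms such as U+0959
    if 0x093E <= cp <= 0x094C:
        return "matra"      # dependent vowel signs
    if cp == NUKTA:
        return "nukta"
    return "other"

def nuktas_within_limits(codepoints):
    """True if every run of nuktas fits the limit for the character before it.

    A nukta that follows neither a consonant nor a matra counts as over the limit.
    """
    base, run = "other", 0
    for cp in codepoints:
        kind = classify(cp)
        if kind == "nukta":
            run += 1
            if run > MAX_NUKTAS.get(base, 0):
                return False
        else:
            base, run = kind, 0
    return True
```

With these assumptions, the two-nukta test sequences pass the check for both U+0916 and U+0959, while the three-nukta ones do not.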

11 May, 2012 4 commits
- [Indic] Fix U+0952 and similar classification to match Uniscribe · 3399a06e
  Committed by Behdad Esfahbod on May 11, 2012
```
See comments.
```
- [Indic] Don't use syllable serial value 0 · ff24d108
  Committed by Behdad Esfahbod on May 11, 2012
- [Indic] Fix state machine to backtrack · 4be46bad
  Committed by Behdad Esfahbod on May 11, 2012
- [Indic] Move syllable tracking from Indic to generic layer · cee71874
  Committed by Behdad Esfahbod on May 11, 2012
```
This is to incorporate it into GSUB/GPOS processing.
```
10 May, 2012 2 commits
- [Indic] Don't give up syllable parsing upon junk · 86e5dd38
  Committed by Behdad Esfahbod on May 09, 2012
- [Indic] Towards multi-cluster syllables and final reordering · ef24cc8c
  Committed by Behdad Esfahbod on May 09, 2012
17 Apr, 2012 2 commits
- Fix ragel regexp in vowel-based syllable · 9ceca3ae
  Committed by Behdad Esfahbod on Apr 16, 2012
```
As reported by datao zhang on the mailing list.
```
- Rewrite ragel expression to better match the one on MS spec · b870afcd
  Committed by Behdad Esfahbod on Apr 16, 2012
```
https://www.microsoft.com/typography/otfntdev/devanot/shaping.aspx
```
08 Apr, 2012 1 commit
- Move code around, in prep for Thai/Lao shaper · d4cc4471
  Committed by Behdad Esfahbod on Apr 07, 2012
02 Mar, 2012 1 commit

Fix cluster formation in Indic · 461b9b63

Committed by Behdad Esfahbod on Mar 01, 2012

Makes the number of failures against Uniscribe with the hi_IN dictionary from
OO.o go down from 6334 to 4290. Not bad for a one-line change!

Mozilla Bug 729626 - ASAN: heap-buffer-overflow HTML

30 Jul, 2011 1 commit

[Indic] Apply Indic features · 743807a3

Committed by Behdad Esfahbod on Jul 29, 2011

Find the base consonant and apply basic Indic features accordingly.
Nothing complete, but does something for now.  Specifically:
no Ra handling right now, and no ZWJ/ZWNJ.

Number of failing shape-complex tests goes from 174 down to 125.

Next: reorder matras.

08 Jul, 2011 1 commit
- Shuffle code around, remove shape_plan from complex shapers · 76f76812
  Committed by Behdad Esfahbod on Jul 07, 2011
05 Jul, 2011 1 commit
- [Indic] Well, at least finding syllables works now :) · d69d5cea
  Committed by Behdad Esfahbod on Jul 04, 2011
```
Still not much there.
```
25 Jun, 2011 1 commit
- [Indic] Some of the basic features are global; Mark them so · c7fe56a1
  Committed by Behdad Esfahbod on Jun 24, 2011
18 Jun, 2011 1 commit
- [indic] Add syllable recognition state machine · 867361c3
  Committed by Behdad Esfahbod on Jun 17, 2011
```
Using an incredible tool called Ragel.
```

OpenHarmony / Third Party Harfbuzz synced successfully about 1 year ago
