utf8proc/data
Keno Fischer 41c6b23aab Unicode 9 updates (#70)
* Updates for Unicode 9.0.0 TR29 Changes

- New rules GB10/(12/13) are used to combine emoji-zwj sequences/
  (force grapheme breaks every two RI codepoints). Unfortunately this
  breaks statelessness of grapheme-boundary determination. Deal with
  this by ignoring the problem in utf8proc_grapheme_break, and by
  hacking in a special case in decompose

- ZWJ moved to its own boundclass, update what is now GB9 accordingly.

- Add comments to indicate which rule a given case implements

- The Number of bound classes Now exceeds 4 bits, expand to 8 and
  reorganize fields

* Import Unicode 9 data

* Update Grapheme break API to expose state override

* Bump MAJOR version
2016-06-28 16:04:25 -04:00
..
charwidths.jl Fix deprecated warnings with Julia 0.4 2015-10-31 13:59:38 -04:00
data_generator.rb Unicode 9 updates (#70) 2016-06-28 16:04:25 -04:00
Makefile Add missing files to make clean 2015-10-30 14:56:03 -04:00