utf8proc

Commit Graph

Author	SHA1	Message	Date
Paul Smith	95fc75b839	Ensure generated const data tables are hidden via "static" (#100)	8 years ago
Michael Hatherly	eab97d16fb	Don't use cached version of UnicodeData.txt (#92) Ref: https://github.com/JuliaLang/julia/pull/19725, UnicodeData.txt is now being cached in JuliaLang/julia's build.	8 years ago
Steven G. Johnson	15e1819cdd	update to unifont 9.0.04	8 years ago
petercolberg	11b84e2de1	Use versioned Unicode data URLs (#78) This ensures the tests keep working when a new Unicode version is released.	8 years ago
Steven G. Johnson	c02ebd5a83	update to Unifont 9 (for Unicode 9 charwidths) (#75)	8 years ago
Benito van der Zander	eeebf70bcf	Smaller tables (#68) * convert sequences to utf-16 (saves 25kb) * store sequence length in properties instead using -1 termination (saves 10kb) * cache index for slightly faster data creation * store lower/upper/title mapping in sequence array (saves 25kb). Add utf8proc_totitle, as title_mapping cannot be used to get the title codepoint anymore. Rename xxx_mapping to xxx_seqindex, so programs assuming a value with the old meaning fail at compile time * change combination array data type to uint16 (saves 40kb) * merge 1st and 2nd comb index (saves 50kb) * kill empty prefix/suffix in combination array (saves 50kb) * there was no need to have a separate combination start array, it can be merged in a single array * some fixes * mark the table as const again * and regen	8 years ago
Keno Fischer	41c6b23aab	Unicode 9 updates (#70) * Updates for Unicode 9.0.0 TR29 Changes - New rules GB10/(12/13) are used to combine emoji-zwj sequences/ (force grapheme breaks every two RI codepoints). Unfortunately this breaks statelessness of grapheme-boundary determination. Deal with this by ignoring the problem in utf8proc_grapheme_break, and by hacking in a special case in decompose - ZWJ moved to its own boundclass, update what is now GB9 accordingly. - Add comments to indicate which rule a given case implements - The Number of bound classes Now exceeds 4 bits, expand to 8 and reorganize fields * Import Unicode 9 data * Update Grapheme break API to expose state override * Bump MAJOR version	8 years ago
Michaël Meyer	26436c9775	Reduce the size of the binary. Use integers instead of pointers in Unicode tables. Saves 226 kb / 716 kb in the compiled library.	9 years ago
Peter Colberg	b10b64dc10	Fix deprecated warnings with Julia 0.4	9 years ago
Peter Colberg	8f522ad8e7	Add missing files to `make clean`	9 years ago
Peter Colberg	0a20307c39	Set URLCACHE to JuliaLang cache server for Travis builds Download Unicode data from upstream server by default. Download GNU Unifont from reliable GNU mirror by default.	9 years ago
Peter Colberg	f35e18e4b5	Generate fontforge font files in makefile Revise the script to directly read fontforge font files, which are generated in the makefile. This permits overriding the fontforge path during the build, and executing fontforge in parallel with make -j. Avoid duplicating download URLs in the script, which ensures that the script itself works without network access, e.g., when downloading the data files on a developer machine with network access and executing the script on a build machine without network access.	9 years ago
Jiahao Chen	f0675f26f4	Update Unifont to 8.0.01	9 years ago
Steven G. Johnson	eefdaed218	sort keys to try to eliminate data dependence on Ruby version	10 years ago
Steven G. Johnson	6a7f92da64	fix #46 (make sure symbol-like codepoints have nonzero width even if they aren't in Unifont)	10 years ago
Jiahao Chen	d18963cc46	Minor fixes to work with Unicode 8.0.0 data	10 years ago
Tony Kelman	0a818c7003	Prefix other C99 typedefs with utf8proc_	10 years ago
Steven G. Johnson	a4c84d2063	fix #2: add charwidth function	10 years ago
Steven G. Johnson	90721f2d39	directory cleanup: move tests and data into subdirectories	10 years ago

19 Commits (acc204f1f15e879331c95096b6d4fac8bc2c906f)