blob: 605d1c097e217075a126b083a188b470d7041102 [file] [log] [blame]
This directory contains Python script to parse several of the Unicode tables
that are downloadable from the web and generate C header files from them.
These are the locations of the files which are parsed. You should download these
files and put them in this directory.
http://www.unicode.org/Public/5.1.0/ucd/extracted/DerivedGeneralCategory.txt
http://www.unicode.org/Public/5.1.0/ucd/extracted/DerivedCombiningClass.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/5.1.0/ucd/Scripts.txt
Then you can run the following python scripts to generate the header files:
python category-parse.py DerivedGeneralCategory.txt category-properties.h
python combining-class-parse.py DerivedCombiningClass.txt combining-properties.h
python grapheme-break-parse.py GraphemeBreakProperty.txt grapheme-break-properties.h
python scripts-parse.py Scripts.txt script-properties.h