Subject: Unicode library released
Date: Monday 21st May 2012 11:40:04 UTC (over 4 years ago)
Hi guys, A Unicode library for Gambit has been released. Many thanks to Florian Loitsch for releasing the Unicode library for Bigloo, of which this library in many respects is a port. The Unicode library for Gambit does the following Unicode handling: unicode-char-upper unicode-char-lower unicode-string-upper unicode-string-upper! ; "aBc" => "ABC" unicode-string-lower unicode-string-lower! ; "aBc" => "abc" unicode-string-capitalize-strict! unicode-string-capitalize-strict ; "aBc" => "Abc" string-unicode-ci string-unicode-ci<=? string-unicode-ci=? string-unicode-ci>? string-unicode-ci>=? It runs out of the box on vanilla Gambit and on Black Hole. License is New BSD. It's both released on the Dumping grounds http://dynamo.iro.umontreal.ca/~gambit/wiki/index.php/Dumping_Grounds and on GitHub at https://github.com/m-i-k-a-e-l/gambit-unicode . For some more info, please see the copy of the library headers below. Kind regards, Mikael - Gambit Unicode library 1.0 New BSD license Enables Gambit Scheme with Unicode handling features. Runs out of the box on vanilla Gambit and on Black Hole. The Gambit-specific code and distribution: Copyright (C) 2010-2012 Mikael More The unicode-data.scm generator and Bigloo unicode library this library was inspired by, found in the bigloo-unicode directory: Copyright (C) 2007-2012 Florian Loitsch The source data for the unicode-data.scm generator, found in the bigloo-unicode/data directory: Copyright (C) 1991-2011 Unicode, Inc. ## Exports unicode-char-upper unicode-char-lower unicode-string-upper unicode-string-upper! ; "aBc" => "ABC" unicode-string-lower unicode-string-lower! ; "aBc" => "abc" unicode-string-capitalize-strict! ; "aBc" => "Abc" unicode-string-capitalize-strict ; string-unicode-ci string-unicode-ci<=? string-unicode-ci=? string-unicode-ci>? string-unicode-ci>=? ## Relation with Bigloo Unicode library. Intended scope of use of this library. This library is based on the Unicode library for Bigloo Scheme by Florian Loitsch, as bundled with the js2scheme distribution as found on http://code.google.com/p/js2scheme/source/browse/ . A copy of this library is provided in this distribution in the bigloo-unicode directory, to enable you to easily generate Unicode constants based on any new Unicode Character Database versions. The vesion of the Bigloo Unicode library bundled here is was pulled from the abovementioned repo the 13:t of May 2012. Florian's last changes were the 12:th of March 2011. Florian's code is to be viewed as stable since at least 2010, to the best of my knowledge the only changes since then have been minor restructurings. Please note that the scope of this library and the Bigloo unicode library are different, in that Gambit has complete native Unicode character handling and charset encoding/decoding features built-in and thus none of this needs to be provided in a separate library, whereas the Bigloo Unicode library provides such functionality to Bigloo. ## Distribution overview unicode.scm - The Gambit Unicode handling library unicode-data.scm - The constants required to make unicode.scm spin. Generated out of the Unicode Character Database. bigloo-unicode/* - Florian Loitsch's unicode-data.scm generator, and Bigloo Unicode library that was used as inspiration for making this library. Use this to generate a new unicode-data.scm out of a Unicode Character Database update, if that would ever be released and relevant. Latest version as of the 13:th of May 2012, see more above. bigloo-unicode/data/* - The required files of the Unicode Character Database, see below. ## How to update unicode-data.scm with a new version of the Unicode Character Database In case by any reason this would ever be relevant, here is how to proceed: Update the bigloo-unicode/data/ files (namely SpecialCasing.txt and UnicodeData.txt) with the new Unicode Character Database files. As of the date of writing, these are released on http://www.unicode.org/Public/UNIDATA/SpecialCasing.txt and http://www.unicode.org/Public/UNIDATA/UnicodeData.txt , and you can see the release date of the latest update of them on http://www.unicode.org/Public/UNIDATA/ .