Luke-Jennings | 10 Jun 2010 22:40

Re: Writing new language modules

On Fri, 2010-06-11 at 00:12 +0430, Mehdi Omidali wrote:
> I forgot to mention the file utf8-test.xdy that should be created by running
> ./tamil.pl utf8
> 
I now have all the files now which area attached. I placed them
in /usr/share/xindy/lang/tamil and compiled out manual and the relevant
output was 
xindy  -L tamil -L tamil -C utf8 -I xindy -M "ubuntu-manual-ta" -t
"ubuntu-manual-ta.glg" -o "ubuntu-manual-ta.gls" "ubuntu-manual-ta.glo"
Cannot locate xindy module for language tamil in codepage utf8.

Have I got the correct number of files with the right content and are
they named correctly? Also when running the tamil.pl file I still get
the several lines of 
Wide character in print at make-rules.pl line 56.
Is this a problem?
How would I go about sorting out support for alphabets with 240 letters?

Many thanks

Luke Jennings

> On 6/10/10, Mehdi Omidali <mehdioa <at> gmail.com> wrote:
> > It should be OK; if you want to see the Tamil characters, open the
> > file with an appropriate unicode editor like Texworks. You now just
> > need utf8-lang.xdy. It is a simple file like this
> >
> > -------------------------------------------------------------------
> > (require "lang/english/utf8.xdy")
> > (define-sort-rule-orientations (forward backward forward forward))
> > (use-rule-set :run 0
> > 	      :rule-set ("en-alphabetize" "en-ignore-special"))
> > (use-rule-set :run 1
> > 	      :rule-set ("en-resolve-diacritics" "en-ignore-special"))
> > (use-rule-set :run 2
> > 	      :rule-set ("en-resolve-case" "en-ignore-special"))
> > (use-rule-set :run 3
> > 	      :rule-set ("en-resolve-special"))
> > ------------------------------------------------------------------------------------------------------
> > Just change
> > english-->tamil
> > and "en" in "en-alphabetize" (and all other phrases like this) to
> > "ta", where "ta" is the local abbreviation for your language. Then put
> > this file along with your utf8.xdy in /lang/tamil (side by side all
> > other language directories, create the directory). That's all.  Now
> > you can run
> > xindy -L tamil -C utf8 ....
> >
> > On 6/10/10, Luke-Jennings <ubuntujenkins <at> googlemail.com> wrote:
> >> On Thu, 2010-06-10 at 15:25 +0430, Mehdi Omidali wrote:
> >>> Hi,
> >>> Try
> >>> ./tamil.pl utf8
> >>> Then you obtain the result in utf8.xdy
> >> I have that file now which I have attached, for me I get loads of random
> >> letters and not tamil ones is that ok? I have attached a screenshot to
> >> how how it looks for me. I now also have a tuf8.raw which only contains
> >> (indexentry :key ("?") :locref "1")
> >> (indexentry :key ("!") :locref "2")
> >> (indexentry :key (".") :locref "3")
> >> (indexentry :key ("-") :locref "4")
> >> (indexentry :key ("'") :locref "5")
> >> (indexentry :key ("\/") :locref "6")
> >>
> >> The utf8-doc.tex does not have any alphabet in it still.
> >> Is that correct?
> >>
> >> Thanks for the help so far everyone this list is very helpful and quick
> >> to respond.
> >>
> >> Luke Jennings
> >>
> >>> On 6/10/10, Zdenek Wagner <zdenek.wagner <at> gmail.com> wrote:
> >>> > 2010/6/10 Luke-Jennings <ubuntujenkins <at> googlemail.com>:
> >>> >> Hello everyone,
> >>> >>
> >>> >> I am part of a team that uses LaTeX to write a manual for the ubuntu
> >>> >> operating system. We released our first manual a few months ago [1].
> >>> >> We
> >>> >> now have translators that are working hard to translate it.
> >>> >>
> >>> > I have almost working module for Hindi and Marathi. It needs some more
> >>> > work but it is written in such a way that it can be reused for other
> >>> > Indic languages. If you like, I can send it to you at the beginning of
> >>> > the next week.
> >>> >
> >>> >> The language that I am trying to write the rules for first is Tamil
> >>> >> which has up to 240 letters.I don't personally speak Tamil but am
> >>> >> working with someone who does. After lots of reading of the docs I
> >>> >> either missed or didn't find much on writing modules. Sorry if I have
> >>> >> missed them. I have found the alphabets/test1 file and I have a few
> >>> >> questions.
> >>> >>
> >>> >> Firstly the file says
> >>> >> "Currently the maximum number of letters is limited to 95. A future
> >>> >> expansion up to 223 letters should be no problem."
> >>> >> Is there now support for 223 letters and how easy would it be to make
> >>> >> it
> >>> >> 240?
> >>> >>
> >>> >> If in understand correctly I need to fill the $alphabet = [ section
> >>> >> as
> >>> >> follows.
> >>> >>
> >>> >> $alphabet = [
> >>> >> ['அ'],
> >>> >> ['ஆ'],
> >>> >> ['இ'],
> >>> >> ['ஈ'],
> >>> >> ['உ'],
> >>> >>
> >>> >> etc...
> >>> >>
> >>> >> The full Tamil alphabet can be found here [2].
> >>> >>
> >>> >> I have attached my tamil.pl file, if anyone can spot any errors that
> >>> >> would be great.
> >>> >>
> >>> >> When I run perl tamil.pl in the alphabets directory I get
> >>> >>
> >>> >> Alphabet has 228 elements.
> >>> >>
> >>> >> and several lines of
> >>> >>
> >>> >> Wide character in print at make-rules.pl line 56.
> >>> >>
> >>> >> I also get out a  "-test.xdy" file but it does not have any alphabet
> >>> >> in
> >>> >> the the section that should contain the alphabet. I have attached
> >>> >> both
> >>> >> of the output files I get ( "-test.tex" and "-doc.tex" ) Is there
> >>> >> anything that I am missing?
> >>> >>
> >>> >> I will also be trying to write rules for different languages as the
> >>> >> team
> >>> >> requires them. If it is any help I would be happy to release these
> >>> >> under
> >>> >> gpl.v3 so that they can be added into xindy.
> >>> >>
> >>> >> I look forward to any help you can give
> >>> >>
> >>> >> Regards
> >>> >>
> >>> >> Luke Jennings
> >>> >>
> >>> >> [1]http://ubuntu-manual.org/
> >>> >> [2]http://en.wikipedia.org/wiki/Tamil_script#Compound_table_of_Tamil_letters
> >>> >>
> >>> >> ------------------------------------------------------------------------------
> >>> >> ThinkGeek and WIRED's GeekDad team up for the Ultimate
> >>> >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the
> >>> >> lucky parental unit.  See the prize list and enter to win:
> >>> >> http://p.sf.net/sfu/thinkgeek-promo
> >>> >> ________________________________________________________________________
> >>> >> xindy-discuss mailing list -- general xindy announcements and
> >>> >> discussions
> >>> >> Unsubscribe via
> >>> >> https://lists.sourceforge.net/lists/listinfo/xindy-discuss
> >>> >>
> >>> >
> >>> >
> >>> >
> >>> > --
> >>> > Zdeněk Wagner
> >>> > http://hroch486.icpf.cas.cz/wagner/
> >>> > http://icebearsoft.euweb.cz
> >>> >
> >>> > ------------------------------------------------------------------------------
> >>> > ThinkGeek and WIRED's GeekDad team up for the Ultimate
> >>> > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the
> >>> > lucky parental unit.  See the prize list and enter to win:
> >>> > http://p.sf.net/sfu/thinkgeek-promo
> >>> > ________________________________________________________________________
> >>> > xindy-discuss mailing list -- general xindy announcements and
> >>> > discussions
> >>> > Unsubscribe via
> >>> > https://lists.sourceforge.net/lists/listinfo/xindy-discuss
> >>>
> >>> ------------------------------------------------------------------------------
> >>> ThinkGeek and WIRED's GeekDad team up for the Ultimate
> >>> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the
> >>> lucky parental unit.  See the prize list and enter to win:
> >>> http://p.sf.net/sfu/thinkgeek-promo
> >>> ________________________________________________________________________
> >>> xindy-discuss mailing list -- general xindy announcements and
> >>> discussions
> >>> Unsubscribe via
> >>> https://lists.sourceforge.net/lists/listinfo/xindy-discuss
> >>
> >>
> >
> 
> ------------------------------------------------------------------------------
> ThinkGeek and WIRED's GeekDad team up for the Ultimate 
> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the 
> lucky parental unit.  See the prize list and enter to win: 
> http://p.sf.net/sfu/thinkgeek-promo
> ________________________________________________________________________
> xindy-discuss mailing list -- general xindy announcements and discussions
> Unsubscribe via https://lists.sourceforge.net/lists/listinfo/xindy-discuss

Attachment (tamil.zip): application/zip, 6693 bytes
------------------------------------------------------------------------------
ThinkGeek and WIRED's GeekDad team up for the Ultimate 
GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the 
lucky parental unit.  See the prize list and enter to win: 
http://p.sf.net/sfu/thinkgeek-promo
------------------------------------------------------------------------------
ThinkGeek and WIRED's GeekDad team up for the Ultimate 
GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the 
lucky parental unit.  See the prize list and enter to win: 
http://p.sf.net/sfu/thinkgeek-promo

Gmane