Ray Saintonge | 19 Jul 2004 23:20

Re: UTF-8 and converting all wiktionaries

Gerard.Meijssen wrote:

>Ray Saintonge wrote:
>  
>
>>Gerard.Meijssen wrote:
>>    
>>
>>>On the beer parlour of en:wiktionary, Tim Starling says that it would be helpfull if all wiktionaries
work in the same way.  The article was about UTF-8.
>>>The opportunity now is to have all wiktionaries transfer to UTF-8. This is the only way to have articles
with titles like ჰოლანდიური or ភាសាហុលឍង់េដ or
네델란드어. As the purpose of wiktionaries is to have articles of words in any language, it should be
obvious that this is the way to go.
>>>
>>>PS I would like these articles in nl:wiktionary :) GerardM
>>>      
>>>
>>English Wiktionary shifted to UTF-8 at a very early stage of its development.  There are still a few minor
problems left over from the transfer, but none important enough to make us rush to clean them up.  nl ended up
with something else because the software was copied from the nl Wikipedia in the interest of having
something workable available quickly.  I don't think that there is much opposition to having all the
Wiktionaries on UTF-8.
>>
>>It is only speculation on my part to say that Tim would favour having every project on UTF-8 (he can speak
best for himself), but the Beer Parlour context had to do with first letter capitalization.  See
http://en.wiktionary.org/wiki/Wiktionary:Beer_parlour/case-sensitivity_vote at the very end.
>>
>>Please don't confuse the two issues.
>>
>>Ec
>>    
>>
>
>When you read the post, you will find that in the post about the case sensitivity vote, UTF-8 was mentioned.
In that context quote "I'd really much prefer it if all the Wiktionaries were converted, or none of them."
and later "... asked me to switch of first letter capitalisation for de.wiktionary.org, but I refused.
..." 
>
>Please read the article and you will see that the first part has to do with UTF-8 .  Read the effing article.. 
>
The first unsigned entry in that thread by Polyglot read

> When this was first proposed, it was said it was going to be 
> implemented across all Wiktionaries. Wiktionary DE had asked for it, 
> but we had to agree on Wiktionary EN, since the impact was going to be 
> the biggest due to the amount of pages we already have. I think the 
> other Wiktionaries and certainly NL have to implement this change as 
> well, especially now that it is possible to get a clean start over there.

What has that got to do with UTF-8?  You introduced that off-topic 
reference later to whinge about nl:Wiktionary not having UTF-8.  This 
was (as Timwi pointed out) an unfair criticism of the developers.

>>From a professional IT point of view having many environments all different is a disaster waiting to
happen. To prevent them, all situations should be/have to be tested. This requires scarce resources.
Silly backward compatability for no good reason makes the work of developpers and system management
people needlessly complicated. 
>
I don't know where you're quoting this from.

>PS I am AS/400 certified CISSP, MCP, CNE, Tivoli certified and know little of this environment but
basically the more different configurations the bigger the headache.
>GerardM
>
So?  Who cares about the alphabet soup certification?

>When you read the post, you will find that in the post abou
>
Yes, I read it ... including your attempts to change the topic.

Ec

Gmane