Tim Starling | 22 Dec 06:25 2010

Re: Old Wikipedia backups discovered

On 22/12/10 12:13, Anthony wrote:
> On Tue, Dec 21, 2010 at 7:51 PM, Tim Starling <tstarling <at> wikimedia.org> wrote:
>> In XML 1.1:
>> "Char       ::=       #x9 | #xA | #xD | [#x20-#xD7FF] |
>> [#xE000-#xFFFD] | [#x10000-#x10FFFF]    /* any Unicode character,
>> excluding the surrogate blocks, FFFE, and FFFF. */"
> Where are you reading that?  At http://www.w3.org/TR/xml11/#charsets I read:

Ah yes, that would be the XML 1.0 spec. My fault.
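
For anyone who wants to experiment, here is a minimal sketch of filtering text against the Char production quoted above (the XML 1.0 one). It is written in Python purely for illustration. The function name strip_invalid_xml10_chars is hypothetical and is not part of importDump.php or MediaWiki, and this is not necessarily how the cleaned-up version of the dump was produced.

```python
import re

# Matches any character that is NOT allowed by the XML 1.0 Char production:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_INVALID_XML10 = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml10_chars(text: str) -> str:
    """Return text with characters outside the XML 1.0 Char range removed.

    Hypothetical helper for illustration only. It drops the offending
    characters rather than escaping them.
    """
    return _INVALID_XML10.sub("", text)


if __name__ == "__main__":
    sample = "old\x00 wiki\x0b text\twith a tab\n"
    # NUL and vertical tab are dropped; tab and newline are kept.
    print(repr(strip_invalid_xml10_chars(sample)))
```

Dropping the characters loses information, so whether this is acceptable depends on how much the original bytes matter to you.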

>> Without this change, importDump.php gives a fatal error.
> Have you tried escaping them?  Does importDump.php work with XML 1.1,
> or only XML 1.0?  Is the file defined as XML 1.1 or XML 1.0?  If the
> file is designated as XML 1.1 (*), the control characters are escaped,
> and importDump.php still gives a fatal error, it sounds like a bug in
> importDump.php.

I provided both versions of the XML if you want to muck around with
that. I don't think there's much historical value in the control
characters.

Speaking of historical value, I found the argument between Lars
Aronsson and Larry Sanger, which caused Lars to quit and found
susning.nu. It happened on May 21.

Lars had just spent several days writing dictionary-like articles, and
he wrote [[Short words]] to organise the effort. At 12:48, Larry
complained about this on [[LA2]], and at 12:53, he created [[Wikipedia
is not a dictionary]], which was clearly an attack on what Lars was
doing. At 12:54, Lars announced that he was leaving, as a comment on
[[Wikipedia is not a dictionary]]. There were several responses.

On July 24, Larry erased all the comments from [[Wikipedia is not a
dictionary]], and on July 28, he rewrote the original text, toning
down the language. Before I found this backup, the earliest version we
had of this policy page was from August 17.

-- Tim Starling

WikiEN-l mailing list
WikiEN-l <at> lists.wikimedia.org