_ Forum Information Archive _ Wikipedia-sponsored denial of service attack to this site, organized from #wikipedia IRC channel

Posted by: Locke85

I just wanted to suggest that we do more to weed out fake usernames: both read-only accounts (created to get access to the pit and other places that are, or may in the future be, available only to members) and accounts used for write access but only to post things that are all but trolling. I know the blatant troll accounts are immediately blocked, as I have seen happen to admins who have tried to infiltrate this site just to spam us. But over IRC (#wikipedia on irc.freenode.net, for those of you who are interested) I heard a discussion about creating fake usernames specifically to spam this site and, if nothing else, to crowd the userlist with useless names with no posts.

Posted by: sgrayban

It's already being dealt with. And who are you anyway?

Posted by: Locke85

QUOTE(sgrayban @ Thu 25th May 2006, 12:49am)

It's already being dealt with. And who are you anyway?

For various reasons I would rather not give too much information: I'd rather not have people know who I am in real life, and especially not my Wikipedia username. I'll leave it at this: I am a Wikipedia editor, and although I don't agree with a lot of what's said here, I also agree with a lot of it, since it is quite valid criticism. I figure that nobody will have a problem with me leaving it at that.

Posted by: sgrayban

Fair enough, but you do sound like you have been a long-time lurker here in the first place.

Posted by: Blu Aardvark

We do tend to keep an eye out for suspicious accounts. On average, I tend to suspend anywhere between two and five accounts a week. Obvious trolls are usually picked out with ease, as are accounts created with temporary email addresses (a la mailinator.com). I may eventually go through the member list and suspend accounts older than two weeks with no posts, but I don't see the need at this point.

Posted by: Selina

Legal action will be taken against those who deliberately disrupt this website through spam, denial-of-service attacks, or otherwise.

I'm sure Daniel Brandt would love to help us match up IPs to names (although I can probably do most or all of it myself)

Deliberate vandalism to a website (forum or no) is punishable under international law as well as United States law, including Title 18 of the U.S. Code, which covers the Computer Fraud and Abuse Act of 1986 and the National Information Infrastructure Protection Act.

Changed the topic title to something a bit more descriptive.

"http://en.wikiquote.org/wiki/Leon_Trotsky#Attributed", eh?

It would be very much appreciated if you or anyone else could supply me or any other administrator with a log of that conversation, and /whois nickname (which shows IP) of the IRC usernames that said the incriminating things. Thank you anyone for any help.

Posted by: Selina

http://meta.wikimedia.org/wiki/IRC_Group_Contacts

QUOTE

The IRC Group Contacts are the people who are responsible for, and in charge of, all Wikimedia channels on the FreeNode network.

They are:

http://en.wikipedia.org/wiki/User:Fennec (IRC nicks "FennecFoxen" or "Fennecus")
http://meta.wikipedia.org/wiki/User:Jdforrester http://meta.wikipedia.org/wiki/User_talk:Jdforrester (IRC nicks "James_F", "James_F|Away", "James_F|Busy")
http://meta.wikipedia.org/wiki/User:Essjay http://meta.wikipedia.org/wiki/User_talk:Essjay http://meta.wikipedia.org/wiki/User:Essjay/Contact (IRC nick Essjay) (Inactive)
http://meta.wikipedia.org/wiki/User:MichaelDiederich (da_didi on IRC)
http://meta.wikipedia.org/wiki/User:Angela

Categories: http://meta.wikimedia.org/wiki/Category:IRC

If all else fails, we know who else could be legally to blame...

Oh, by the way anyone reading this can go in and chat, or log, what people say there...
(PERSONAL INFORMATION NOTICE: Your IP address will be visible to any and all Wikipedia users in the channel if you connect to IRC):
http://wikipediareview.com/chat/
(it's the "Live Chat" link up to the top of this page)

Posted by: Avillia

I am very doubtful that Wikipedia members will have anything close to the tools needed for fast paced flooding.

Posted by: Selina

QUOTE(Avillia @ Thu 25th May 2006, 4:21pm)

I am very doubtful that Wikipedia members will have anything close to the tools needed for fast paced flooding.

You're joking, right? Cyde for one definitely has; just check his deletion log on Userboxes.
In fact a lot of Wikipedia members make "bulk" scripts or programs for their personal use, like that recent changes monitor for example, or the one that reads hundreds of pages checking for spelling mistakes.

Plus you have plenty of people like Alkivar who are well involved in the open-source/hacking/warez community

Posted by: Daniel Brandt

Wget will do recursive crawling too. Looking at the site, I don't think the recursive crawl would work well. Even if it works, you'd end up with too much garbage.

I'd write a program that shells out to lynx -dump URL > outputfile

The first step is to make a text file that lists each of the 28 days, plus how many pages there are for each day.

Use this file to drive the program. This info is all you need to construct the URL for each fetch. By using Lynx, you already avoid about 35 percent of the characters in each file, because Lynx strips out the HTML.
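A minimal sketch of that driver step, assuming each line of the driver file is a day followed by its page count. The URL pattern, the output file names, and the function names are made up for illustration, since the real site layout isn't given here. Only the lynx -dump call itself is taken from the post.

```python
import subprocess

# Hypothetical URL pattern; the real site layout is not given in this thread.
URL_TEMPLATE = "http://example.org/logs/{day}/{page}"


def parse_driver(text):
    """Parse driver lines of the form '<day> <page count>' into (day, pages) pairs."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        day, pages = line.split()
        entries.append((day, int(pages)))
    return entries


def build_fetches(entries, template=URL_TEMPLATE):
    """Return one (url, output filename) pair per page of each day."""
    fetches = []
    for day, pages in entries:
        for page in range(1, pages + 1):
            url = template.format(day=day, page=page)
            fetches.append((url, f"{day}_{page:03d}.txt"))
    return fetches


def fetch_with_lynx(url, outfile):
    """Equivalent of: lynx -dump URL > outfile (needs lynx on the PATH)."""
    with open(outfile, "w", encoding="utf-8") as out:
        subprocess.run(["lynx", "-dump", url], stdout=out, check=True)
```

To run it, read the driver file, pass its text to parse_driver, and call fetch_with_lynx on each pair from build_fetches, ideally with a pause between fetches so the crawl stays polite.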

After you get all your files, you can write a routine to parse out more noise at the top and bottom of each file that Lynx didn't delete. Make each line flush left by deleting any leading tabs or white space. Add the date to the time if it isn't there already. (The latest files include the date, but the early ones I looked at have the time only on each line.) For each of the 28 days, concatenate that day's pages into a single file.
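The cleanup routine could look roughly like this. The timestamp formats are assumptions (time-only lines starting with HH:MM, optionally in brackets, and dated lines starting with YYYY-MM-DD), as are the function names. The actual log layout would need checking before relying on these patterns.

```python
import re

# Assumed formats, not confirmed by the thread: a time-only log line starts
# with HH:MM or HH:MM:SS (optionally bracketed); a dated line starts with
# YYYY-MM-DD followed by whitespace.
TIME_RE = re.compile(r"^\[?\d{1,2}:\d{2}(:\d{2})?\]?")
DATED_RE = re.compile(r"^\d{4}-\d{2}-\d{2}\s")


def is_log_line(line):
    """True if the line starts with a time or a date."""
    return bool(TIME_RE.match(line) or DATED_RE.match(line))


def clean_page(text, day):
    """Flush lines left, trim noise above and below the log, add missing dates."""
    lines = [ln.lstrip() for ln in text.splitlines()]  # drop leading tabs/spaces
    hits = [i for i, ln in enumerate(lines) if is_log_line(ln)]
    if not hits:
        return []
    body = lines[hits[0]:hits[-1] + 1]  # keep first through last log line
    cleaned = []
    for ln in body:
        if not ln:
            continue  # drop blank lines
        if TIME_RE.match(ln):
            ln = f"{day} {ln}"  # time only, so prepend the date
        cleaned.append(ln)
    return cleaned


def merge_day(pages, day):
    """Concatenate the cleaned pages of one day into a single text."""
    merged = []
    for text in pages:
        merged.extend(clean_page(text, day))
    return "".join(ln + "\n" for ln in merged)
```

Lines between the first and last timestamped line are kept even when they carry no timestamp, on the guess that they are wrapped continuations of a message rather than noise.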

It would take a day of work, but it's probably worth doing. I just saw some of the stuff they were saying about me today on #wikipedia, and it's not very kind. Keeping a log like this could be evidence of their intent.

I know that Wikipedia considers these logs private, but as far as I know, there is no legal standing behind this policy. Is anyone aware of any legal problems with logging this stuff and making it searchable?

Finally, does anyone have a Linux program that can run stand-alone (no browser) that will log the channel? I don't know much about IRC -- have only played with it for a few weeks, and only with ChatZilla.

Better PM me on this; I'm sure they're listening.