The Wikipedia Review: A forum for discussion and criticism of Wikipedia
Wikipedia Review Op-Ed Pages

Welcome, Guest! ( Log In | Register )

> Checkuser data retention
gomi
post Mon 15th September 2008, 5:12pm
Post #1


Member
********

Group: Members
Posts: 3,022
Joined: Fri 17th Nov 2006, 6:38pm
Member No.: 565




here:

QUOTE
Tim Starling tstarling at wikimedia.org
Thu Sep 11 03:11:52 UTC 2008

Jon wrote:
> I could not find this in the privacy policy... however, what is
> Wikimedia's current data retention policy? That is to ask, how long do
> projects keep data for use in tools such as checkuser?

CheckUser data used to be kept for 3 months, but Aaron recently increased
it to 5 months. I'm not sure why or on whose authority.

<http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/CheckUser/CheckUser.php?r1=39734&r2=40620>

-- Tim Starling



User is offlineProfile CardPM
Go to the top of the page
+Quote Post
 
Reply to this topicStart new topic
Replies
Rootology
post Fri 19th September 2008, 5:57pm
Post #2


Fat Cat
******

Group: Regulars
Posts: 1,489
Joined: Fri 26th Jan 2007, 11:11pm
Member No.: 877



Last night I poked around all the listed materials out of curiosity, and darned if I can find where wgCUDMaxAge and the retention for CU data is actually stored on the separate table. I have a feeling it's out of view. Since this is the principle "privacy" matter (especially after Poetgate) that everyone is always worried about, I'm honestly wondering if the benefits of hiding this information from being displayed in a clear fashion outweigh the possible harm.

This post has been edited by Rootology: Fri 19th September 2008, 5:58pm
User is offlineProfile CardPM
Go to the top of the page
+Quote Post
Krimpet
post Sat 20th September 2008, 2:46am
Post #3


Senior Member
****

Group: Regulars
Posts: 402
Joined: Mon 16th Jul 2007, 3:44am
From: Rochester, NY
Member No.: 1,975

WP user page - talk
check - contribs



QUOTE(Rootology @ Fri 19th September 2008, 1:57pm) *

Last night I poked around all the listed materials out of curiosity, and darned if I can find where wgCUDMaxAge and the retention for CU data is actually stored on the separate table. I have a feeling it's out of view. Since this is the principle "privacy" matter (especially after Poetgate) that everyone is always worried about, I'm honestly wondering if the benefits of hiding this information from being displayed in a clear fashion outweigh the possible harm.


From CheckUser.php:
CODE

    # Every 100th edit, prune the checkuser changes table.
    wfSeedRandom();
    if( 0 == mt_rand( 0, 99 ) ) {
        # Periodically flush old entries from the recentchanges table.
        global $wgCUDMaxAge;
        $cutoff = $dbw->timestamp( time() - $wgCUDMaxAge );
        $recentchanges = $dbw->tableName( 'cu_changes' );
        $sql = "DELETE FROM $recentchanges WHERE cuc_timestamp < '{$cutoff}'";
        $dbw->query( $sql );
    }


Also, while Wikimedia's LocalSettings does require() a file called "PrivateSettings.php" that's not world-viewable that contains database passwords and the like, it's loaded before the CheckUser extension is, meaning any attempts to hide CU settings in there would be overridden by the defaults when CheckUser is loaded.

I think it can be confidently said, then, that CU data on Wikimedia is deleted after the default 90 days. smile.gif
User is offlineProfile CardPM
Go to the top of the page
+Quote Post
tarantino
post Sat 20th September 2008, 2:44pm
Post #4


the Dude abides
******

Group: Regulars
Posts: 1,439
Joined: Mon 30th Jul 2007, 11:41pm
Member No.: 2,143



QUOTE(Krimpet @ Sat 20th September 2008, 2:46am) *

I think it can be confidently said, then, that CU data on Wikimedia is deleted after the default 90 days. smile.gif

This is confirmed by Tim Starling -
QUOTE

It's the same everywhere, it's three months. Neither the Board nor the
executive have expressed any desire to make that decision, but they are
free to weigh in if they want to. We chose the three month figure as a
compromise between privacy advocates and troll hunters.
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

Posts in this topic
gomi   Checkuser data retention   Mon 15th September 2008, 5:12pm
Rootology   Kelly said recently that the Checkuser data was mo...   Mon 15th September 2008, 5:25pm
Kelly Martin   Kelly said recently that the Checkuser data was mo...   Mon 15th September 2008, 5:38pm
gomi   I should hasten to add that there are probably sit...   Mon 15th September 2008, 5:48pm
C H   Note, Tim Starling has since reverted the change t...   Mon 15th September 2008, 8:59pm
anthony   Who is this developer Aaron Aaron Schulz a.k.a....   Tue 16th September 2008, 2:53am
jch   [quote name='C H' post='129925' date='Mon 15th Se...   Fri 19th September 2008, 6:29am
Lar   [quote name='anthony' post='130007' date='Tue 16t...   Fri 19th September 2008, 4:28pm
jch   [quote name='anthony' post='130007' date='Tue 16...   Sat 20th September 2008, 2:10am
Rootology   Links to the mail reversions?   Mon 15th September 2008, 9:01pm
C H   [quote name='Rootology' post='129926' date='Mon 1...   Mon 15th September 2008, 9:15pm
Kelly Martin   [quote name='Rootology' post='129926' date='Mon ...   Tue 16th September 2008, 1:02am
Rootology   The curious question here is: does each project ac...   Mon 15th September 2008, 9:11pm
Random832   The curious question here is: does each project a...   Tue 16th September 2008, 12:34am
Kelly Martin   Last night I poked around all the listed material...   Sat 20th September 2008, 2:31am


Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 

-   Lo-Fi Version Time is now: 22nd 5 13, 1:00pm