The Wikipedia Review: A forum for discussion and criticism of Wikipedia
Wikipedia Review Op-Ed Pages

> Size-per-view ratio
thekohser
post Thu 11th February 2010, 7:46pm
Post #1


Member
*********

Group: Regulars
Posts: 10,274
Joined: Thu 1st Feb 2007, 10:21pm
Member No.: 911



Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.
MBisanz
post Thu 11th February 2010, 8:06pm
Post #2


Senior Member
****

Group: Regulars
Posts: 478
Joined: Sun 13th Apr 2008, 6:00am
Member No.: 5,693




QUOTE(thekohser @ Thu 11th February 2010, 8:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.

Throwing out a few bad guesses: List of Advanced Dungeons & Dragons 2nd edition monsters, 390,000 characters and 1,000 views per month; Austrian legislative election, 2008, 299,000 characters and 1,000 views per month. I couldn't help but note the large size of List of mass murderers and spree killers by number of victims, 140,000 characters, but a good 16,000 views.
CharlotteWebb
post Thu 11th February 2010, 8:48pm
Post #3


Postmaster General
********

Group: Regulars
Posts: 2,740
Joined: Mon 18th Jun 2007, 2:09am
Member No.: 1,727




QUOTE(thekohser @ Thu 11th February 2010, 7:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.


I do know there's enough db dump data to answer this with absolute certainty. I have a good idea how to go about it but I think it would require more time than four days, plus a larger hard-drive.
radek
post Thu 11th February 2010, 9:31pm
Post #4


Über Member
*****

Group: Regulars
Posts: 699
Joined: Sat 28th Nov 2009, 10:40pm
Member No.: 15,651




QUOTE(MBisanz @ Thu 11th February 2010, 2:06pm) *

QUOTE(thekohser @ Thu 11th February 2010, 8:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.

Throwing out a few bad guesses: List of Advanced Dungeons & Dragons 2nd edition monsters, 390,000 characters and 1,000 views per month; Austrian legislative election, 2008, 299,000 characters and 1,000 views per month. I couldn't help but note the large size of List of mass murderers and spree killers by number of victims, 140,000 characters, but a good 16,000 views.


This mammoth of an article, which I created (for future expansion), got 0 views in 8/2009, which means that length/views=INFINITY!!!
MZMcBride
post Thu 11th February 2010, 11:33pm
Post #5


Über Member
*****

Group: Regulars
Posts: 671
Joined: Wed 25th Mar 2009, 5:02am
Member No.: 10,962




QUOTE(CharlotteWebb @ Thu 11th February 2010, 3:48pm) *

QUOTE(thekohser @ Thu 11th February 2010, 7:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.


I do know there's enough db dump data to answer this with absolute certainty. I have a good idea how to go about it but I think it would require more time than four days, plus a larger hard-drive.

It would only require a large hard drive if you tried to process the squid logs by yourself. Anyone with sense would just use Henrik's stats.grok.se. (Pro-tip: It has a JSON module.)
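
For anyone following that tip, here is a minimal sketch of totalling a month's views from a JSON response. The payload shape (a top-level `daily_views` object mapping dates to counts) is an assumption for illustration and is not confirmed anywhere in this thread; the function name and the sample numbers are likewise made up.

```python
import json


def total_monthly_views(payload: str) -> int:
    """Sum the per-day counts in a JSON document.

    Assumes (hypothetically) a top-level "daily_views" object that maps
    date strings to integer view counts. A missing key counts as zero views.
    """
    data = json.loads(payload)
    return sum(data.get("daily_views", {}).values())


# Invented sample payload, not real traffic data.
sample = json.dumps({
    "title": "Example_article",
    "month": "200912",
    "daily_views": {"2009-12-01": 30, "2009-12-02": 45, "2009-12-03": 25},
})

print(total_monthly_views(sample))
```

Parsing a string rather than fetching a URL keeps the sketch independent of whatever the real endpoint looks like.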
Eva Destruction
post Fri 12th February 2010, 12:17am
Post #6


Fat Cat
******

Group: Regulars
Posts: 1,735
Joined: Sun 30th Sep 2007, 7:22pm
Member No.: 3,301




QUOTE(thekohser @ Thu 11th February 2010, 7:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.


List of tablets on the Memorial to Heroic Self Sacrifice has got to be up there.
MBisanz
post Fri 12th February 2010, 12:37am
Post #7


Senior Member
****

Group: Regulars
Posts: 478
Joined: Sun 13th Apr 2008, 6:00am
Member No.: 5,693




QUOTE(Eva Destruction @ Fri 12th February 2010, 1:17am) *

QUOTE(thekohser @ Thu 11th February 2010, 7:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.


List of tablets on the Memorial to Heroic Self Sacrifice has got to be up there.

Not even close, 35,000 characters to 214 page views.
gomi
post Fri 12th February 2010, 5:32pm
Post #8


Member
********

Group: Members
Posts: 3,022
Joined: Fri 17th Nov 2006, 6:38pm
Member No.: 565



QUOTE(Eva Destruction @ Thu 11th February 2010, 4:17pm) *

Funny, I thought that this was the "List of tablets on the Memorial to Heroic Self Sacrifice". Silly me.
MZMcBride
post Fri 12th February 2010, 10:04pm
Post #9


Über Member
*****

Group: Regulars
Posts: 671
Joined: Wed 25th Mar 2009, 5:02am
Member No.: 10,962




QUOTE(thekohser @ Thu 11th February 2010, 2:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.

It seems I already had a script lying around that was pretty much capable of doing this without much modification. I queried the 6,000 longest non-redirects in the article namespace (according to the page.page_len field). This means there will be some mismatch if a page has dramatically grown since December or if a page has been moved, but it's good enough™ data.

The raw results are available here (sorted by page length): http://toolserver.org/~mzmcbride/yahoo-size-views-output.txt

The columns are page title, page length, total views in 2009-12, and ratio (page length / total views).

Dump the data into Excel or whatever if you want to calculate a better ratio.
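
For those who would rather not use a spreadsheet, a rough sketch of re-ranking rows by length per view. It assumes the file is tab-separated with title, length, and views as the first three columns; the actual delimiter of the linked file is not stated in the thread. The sample rows reuse the rough per-month figures quoted earlier in the thread, and the names `Row` and `rank_by_ratio` are hypothetical.

```python
import csv
import io
from typing import List, NamedTuple


class Row(NamedTuple):
    title: str
    length: int
    views: int
    ratio: float


def rank_by_ratio(text: str) -> List[Row]:
    """Parse title/length/views rows and sort by length per view, highest first.

    Rows with zero views get an infinite ratio, so they sort to the top
    where they are easy to spot and double-check.
    """
    rows = []
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for fields in reader:
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        title, length, views = fields[0], int(fields[1]), int(fields[2])
        ratio = length / views if views else float("inf")
        rows.append(Row(title, length, views, ratio))
    return sorted(rows, key=lambda r: r.ratio, reverse=True)


# Rough figures quoted earlier in the thread (assumed tab-separated layout).
sample = (
    "List of Advanced Dungeons & Dragons 2nd edition monsters\t390000\t1000\n"
    "Austrian legislative election, 2008\t299000\t1000\n"
    "List of mass murderers and spree killers by number of victims\t140000\t16000\n"
    "List of tablets on the Memorial to Heroic Self Sacrifice\t35000\t214\n"
)

for row in rank_by_ratio(sample):
    print(f"{row.ratio:10.2f}  {row.title}")
```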
Random832
post Sat 13th February 2010, 2:55am
Post #10


meh
*******

Group: Regulars
Posts: 1,933
Joined: Thu 14th Feb 2008, 8:52pm
Member No.: 4,844




QUOTE(MZMcBride @ Fri 12th February 2010, 10:04pm) *
I queried the 6,000 longest non-redirects in the article namespace (according to the page.page_len field).


Er. Is there a significant difference between this set and the 6,000 longest pages including redirects?
Sarcasticidealist
post Sat 13th February 2010, 3:19am
Post #11


Head exploded.
******

Group: Regulars
Posts: 1,662
Joined: Tue 22nd Jan 2008, 1:54am
From: Fredericton, New Brunswick, Canada
Member No.: 4,536




QUOTE(Random832 @ Fri 12th February 2010, 10:55pm) *
Er. Is there a significant difference between this set and the 6,000 longest pages including redirects?
I understand that [[pi]] has all kinds of redirects, including [[3.14]], [[3.142]], [[3.1415]], and so on to the six hundred thousandth decimal place.

Edit: Wait, that joke fell flat. For the redirect to be one of Wikipedia's longest pages, [[pi]] would have to redirect *to* some incredibly close approximation. Bah. Just pretend I got the joke right, and try to imagine how funny it would have been.

This post has been edited by Sarcasticidealist: Sat 13th February 2010, 3:21am
tarantino
post Sat 13th February 2010, 4:16am
Post #12


the Dude abides
******

Group: Regulars
Posts: 1,439
Joined: Mon 30th Jul 2007, 11:41pm
Member No.: 2,143



QUOTE(MZMcBride @ Fri 12th February 2010, 10:04pm) *

QUOTE(thekohser @ Thu 11th February 2010, 2:46pm) *

Interesting question, I think.

What do you think might be a candidate for the longest Wikipedia article that receives the fewest number of page views?

In other words, this would be the Wikipedia equivalent of an "FT2 Award" for tl;dr.

It seems I already had a script lying around that was pretty much capable of doing this without much modification. I queried the 6,000 longest non-redirects in the article namespace (according to the page.page_len field). This means there will be some mismatch if a page has dramatically grown since December or if a page has been moved, but it's good enough™ data.

The raw results are available here (sorted by page length): http://toolserver.org/~mzmcbride/yahoo-size-views-output.txt

The columns are page title, page length, total views in 2009-12, and ratio (page length / total views).

Dump the data into Excel or whatever if you want to calculate a better ratio.


It's hard to believe the mammoth article Astor House Hotel (Shanghai) (T-H-L-K-D) would have 0 page views for months at a time. There's another 99 articles on the list also with 0 page views. Something must be broken in the way the database keeps track of page views.
MZMcBride
post Sat 13th February 2010, 6:46am
Post #13


Über Member
*****

Group: Regulars
Posts: 671
Joined: Wed 25th Mar 2009, 5:02am
Member No.: 10,962




QUOTE(Random832 @ Fri 12th February 2010, 9:55pm) *

QUOTE(MZMcBride @ Fri 12th February 2010, 10:04pm) *
I queried the 6,000 longest non-redirects in the article namespace (according to the page.page_len field).

Er. Is there a significant difference between this set and the 6,000 longest pages including redirects?

I doubt it. Though people have a tendency to put unusual shit below redirects. It's mostly new editors (or clueless ones). The parser doesn't really parse most content below the initial line if it's a redirect. Though it takes exception to certain aspects like template links, category links, and maybe other types of links. This level of bizarreness is par for the course for MediaWiki. There's a whole list of this phenomenon here, if you're interested: Wikipedia:Database reports/Redirects obscuring page content. It doesn't look as though any of those titles are long enough to be in the top 6,000, but saying "non-redirects in the article namespace" is a good level of specificity, all things considered. At least in my opinion!
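
To make "non-redirects in the article namespace" concrete, here is a toy sketch of that kind of selection, run against an in-memory SQLite table rather than the real database. The column names besides `page_len` (`page_title`, `page_namespace`, `page_is_redirect`) follow the MediaWiki `page` table as I understand it and should be treated as assumptions; the rows are invented, and this is not the script actually used.

```python
import sqlite3

# Toy stand-in for the MediaWiki "page" table; the rows are invented.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE page (page_title TEXT, page_namespace INTEGER,"
    " page_is_redirect INTEGER, page_len INTEGER)"
)
conn.executemany(
    "INSERT INTO page VALUES (?, ?, ?, ?)",
    [
        ("Long_article", 0, 0, 390000),
        ("Redirect_with_junk_below", 0, 1, 500000),  # redirect: excluded
        ("Some_talk_page", 1, 0, 450000),            # not namespace 0: excluded
        ("Short_article", 0, 0, 1200),
    ],
)


def longest_articles(conn: sqlite3.Connection, limit: int):
    """Return (title, length) for the longest non-redirect articles."""
    return conn.execute(
        "SELECT page_title, page_len FROM page"
        " WHERE page_namespace = 0 AND page_is_redirect = 0"
        " ORDER BY page_len DESC LIMIT ?",
        (limit,),
    ).fetchall()


print(longest_articles(conn, 6000))
```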

QUOTE(tarantino @ Fri 12th February 2010, 11:16pm) *

It's hard to believe the mammoth article Astor House Hotel (Shanghai) (T-H-L-K-D) would have 0 page views for months at a time. There's another 99 articles on the list also with 0 page views. Something must be broken in the way the database keeps track of page views.

From the article's page history:
QUOTE

7 February 2010 Smjwalsh (talk | contribs) m (204,227 bytes) (moved Astor House, Shanghai to Astor House Hotel (Shanghai): Change to actual name and putting disambiguation location.) (undo)

That'd be your answer there. The actual value for that page should be 1697. Any 0 value is likely the result of a page move or a new article. Though, given the length requirement, a new article is pretty unlikely. So I calculated about 5,900 entries instead of 6,000; you can handle the other 100. :P
Enric_Naval
post Sat 13th February 2010, 8:27am
Post #14


Member
***

Group: Contributors
Posts: 105
Joined: Thu 15th May 2008, 1:31am
Member No.: 6,149




QUOTE(MZMcBride @ Sat 13th February 2010, 7:46am) *


QUOTE(tarantino @ Fri 12th February 2010, 11:16pm) *

It's hard to believe the mammoth article Astor House Hotel (Shanghai) (T-H-L-K-D) would have 0 page views for months at a time. There's another 99 articles on the list also with 0 page views. Something must be broken in the way the database keeps track of page views.

From the article's page history:
QUOTE

7 February 2010 Smjwalsh (talk | contribs) m (204,227 bytes) (moved Astor House, Shanghai to Astor House Hotel (Shanghai): Change to actual name and putting disambiguation location.) (undo)

That'd be your answer there. The actual value for that page should be 1697. Any 0 value is likely the result of a page move or a new article. Though, given the length requirement, a new article is pretty unlikely. So I calculated about 5,900 entries instead of 6,000; you can handle the other 100. :P


Another example: the List of restriction enzyme cutting sites article was created on 4 January and moved on 7 January. The visit data is from December (I think), so the article shows zero visits because it didn't exist yet during the period the data covers.
tarantino
post Sat 13th February 2010, 5:08pm
Post #15


the Dude abides
******

Group: Regulars
Posts: 1,439
Joined: Mon 30th Jul 2007, 11:41pm
Member No.: 2,143



QUOTE(MZMcBride @ Sat 13th February 2010, 6:46am) *

That'd be your answer there. The actual value for that page should be 1697. Any 0 value is likely the result of a page move or a new article. Though, given the length requirement, a new article is pretty unlikely. So I calculated about 5,900 entries instead of 6,000; you can handle the other 100. :P


Ahh, my mistake. I didn't think to look for page moves. Of course, some of the 5,900 may have been moved during December, so they would show fewer views than they actually received.
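
One way to correct for moves, sketched under the assumption that you have a mapping of old titles to new titles (for instance, compiled from the move log) and per-title view counts: fold the counts recorded under old titles into the current title. The function name and data layout are hypothetical; the example reuses the Astor House move and the 1697 figure mentioned above, and treats that count as having been logged under the old title.

```python
from typing import Dict


def merge_moved_views(views: Dict[str, int], moves: Dict[str, str]) -> Dict[str, int]:
    """Fold view counts recorded under old titles into the current titles.

    `moves` maps old title -> new title. Chains (A -> B -> C) are followed
    until a title with no further move is reached; the `seen` set guards
    against accidental cycles in the mapping.
    """
    merged: Dict[str, int] = {}
    for title, count in views.items():
        seen = set()
        while title in moves and title not in seen:
            seen.add(title)
            title = moves[title]
        merged[title] = merged.get(title, 0) + count
    return merged


views = {
    "Astor House, Shanghai": 1697,       # count assumed logged under the old title
    "Astor House Hotel (Shanghai)": 0,   # the new title shows nothing for the month
}
moves = {"Astor House, Shanghai": "Astor House Hotel (Shanghai)"}

print(merge_moved_views(views, moves))
```

The same approach covers pages moved partway through the month, since the partial counts under both titles are simply added together.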
thekohser
post Sat 13th February 2010, 6:34pm
Post #16


Member
*********

Group: Regulars
Posts: 10,274
Joined: Thu 1st Feb 2007, 10:21pm
Member No.: 911



Some enterprising member of Yahoo! Answers could get an easy 10 points about now.

Thanks, MZMcBride! (Please don't tell the world about your other data run that I got from you a few days ago. Its awesomeness for me depends in part on it not becoming public.)

Greg
