by Hendrik.schriefer.
• Pick a Wikipedia language edition.
• Collect, for each article: the number of distinct editors who have edited it, U; the number of revisions, E; the length of the article in bytes, L; and the length of the talk page associated with the article, T.
• Produce the histogram and probability density function for all four variables in linear scale and logarithmic scale, and discuss them.
• Try to remove "redirect" pages from your data. How would that change the result?
SQL
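USE simplewiki_p;
-- Sample 100 non-redirect articles and return U (distinct editors),
-- E (revisions), L (article bytes) and T (talk page bytes, NULL if no talk page).
SELECT subq.article_title, subq.page_id, subq.U, subq.E, subq.L, talk.page_len AS T
FROM (
  SELECT page_title AS article_title, page_id, page_len AS L,
         COUNT(DISTINCT rev_user) AS U, COUNT(rev_id) AS E
  FROM revision
  JOIN page ON rev_page = page_id
  -- rev_user is a numeric ID, so the bot-name filter has to use rev_user_text.
  -- MediaWiki stores names as binary strings; convert before LOWER() so it takes effect.
  -- Newer schema versions keep editor names in the actor table (via rev_actor) instead.
  WHERE LOWER(CONVERT(rev_user_text USING utf8)) NOT LIKE '%bot%'
    AND page_is_redirect = 0
    AND page_namespace = 0
  GROUP BY page_id
  ORDER BY RAND()
  LIMIT 100
) AS subq
-- LEFT JOIN keeps articles that have no talk page (namespace 1).
LEFT JOIN page AS talk
  ON talk.page_title = subq.article_title AND talk.page_namespace = 1;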
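The task asks for a histogram and a probability density function of U, E, L and T in both linear and logarithmic scale. The following is a minimal sketch of that step, not part of the original query: it assumes the query result has already been exported to a list of numbers, and the helper names (`linear_edges`, `log_edges`, `histogram_pdf`) and the values in `sample_E` are made up purely for illustration. Plotting is left out.

```python
import bisect


def linear_edges(lo, hi, n_bins):
    """Equal-width bin edges from lo to hi."""
    step = (hi - lo) / n_bins
    edges = [lo + i * step for i in range(n_bins)]
    edges.append(hi)  # pin the last edge so rounding cannot drop the maximum
    return edges


def log_edges(lo, hi, n_bins):
    """Bin edges equally spaced in log scale; requires lo > 0."""
    ratio = (hi / lo) ** (1 / n_bins)
    edges = [lo * ratio ** i for i in range(n_bins)]
    edges.append(hi)
    return edges


def histogram_pdf(values, edges):
    """Return (counts, density) for bins [edges[i], edges[i+1]).

    The last bin also includes its right edge. Density is
    count / (total * bin_width), so it integrates to 1 over the bins.
    """
    n = len(edges) - 1
    counts = [0] * n
    for v in values:
        i = bisect.bisect_right(edges, v) - 1
        if v == edges[-1]:
            i = n - 1
        if 0 <= i < n:  # values outside the range are ignored
            counts[i] += 1
    total = sum(counts)
    density = [
        c / (total * (edges[i + 1] - edges[i])) if total else 0.0
        for i, c in enumerate(counts)
    ]
    return counts, density


# Hypothetical revision counts (E) for ten articles; not real query output.
sample_E = [1, 1, 2, 3, 5, 8, 13, 40, 120, 1000]

lin = linear_edges(min(sample_E), max(sample_E), 4)
lin_counts, lin_density = histogram_pdf(sample_E, lin)

log = log_edges(min(sample_E), max(sample_E), 3)
log_counts, log_density = histogram_pdf(sample_E, log)
```

With heavy-tailed data like this, linear bins put almost every article in the first bin, while logarithmic bins spread the counts out, which is why the task asks for both. Variables that can be zero (for example T for an empty talk page) need to be shifted or filtered before log binning.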
All SQL code is licensed under the CC0 License.