SQL
AخA
SELECT
CONCAT('https://en.wikipedia.org/w/index.php?title=Talk:', REPLACE(page_title, ' ', '_')) AS page_url,
page_len,
actor.actor_name AS creator,
CASE WHEN user.user_id IS NULL THEN 'Unknown'
WHEN user.user_editcount > 1000 AND actor.actor_name LIKE '%Bot' THEN 'Likely Bot'
WHEN user.user_editcount > 1000 AND actor.actor_name RLIKE '(bot|Bot)$' THEN 'Likely Bot'
WHEN ug.ug_group = 'bot' THEN 'Confirmed Bot'
ELSE 'Likely Human'
END AS creator_type
FROM page
JOIN revision ON page_id = rev_page AND rev_parent_id = 0 -- First revision
JOIN actor ON rev_actor = actor_id
LEFT JOIN user ON actor.actor_user = user.user_id
LEFT JOIN user_groups AS ug ON user.user_id = ug.ug_user AND ug.ug_group = 'bot'
WHERE page_namespace = 1 -- Talk namespace
AND page_title LIKE '%/Archive%'
AND page_is_redirect = 0
AND page_len >= 1
ORDER BY page_len ASC
LIMIT 1000;
By running queries you agree to the Cloud Services Terms of Use and you irrevocably agree to release your SQL under CC0 License.
All SQL code is licensed under CC0 License.