Toggle navigation
Home
New Query
Recent Queries
Discuss
Database tables
Database names
MediaWiki
Wikibase
Replicas browser and optimizer
Login
History
Fork
Fork of
nlwiki External links with double http, excluding archives
by
Bdijkstra
This query is marked as a draft
This query has been published
by
Smile4ever
.
Toggle Highlighting
SQL
SELECT page_title, el_to_domain_index AS domain, el_to_path AS path FROM externallinks, page WHERE el_from=page_id AND #page_namespace IN (0,1,4,6,8,10,12,14,100,828) AND page_namespace = 0 AND page_title NOT REGEXP '/[Aa]rchief' AND el_to_domain_index REGEXP '^https?://' AND el_to_domain_index NOT REGEXP '^https?://(' 'at\\.ac\\.onb\\.webarchiv|' 'au\\.gov\\.nla\\.webarchive|' 'ca\\.gc\\.bac-lac\\.webarchive|' 'de\\.webwiki\\.www|' 'com\\.archive-(be|nl|org)|' 'com\\.wikiwix\\.archive|' 'edu\\.stanford\\.swap|' 'edu\\.unt\\.cybercemetery|' 'eu\\.archiefweb|' 'gov\\.loc\\.webarchive|' 'is\\.vefsafn\\.wayback|' 'nl\\.presurf\\.zeeland|' 'nl\\.sitearchief\\.archief[0-9]+|' 'nl\\.pagefreezer\\.publiek|' 'org\\.(bibalex\\.)?(archive\\.web|waybackmachine)(\\.replay)?|' 'org\\.archive\\.wayback|' 'org\\.archive-it\\.wayback|' 'org\\.ghostarchive|' 'org\\.mementoweb\\.timetravel|' 'org\\.wayback\\.archive|' 'pt\\.arquivo|' 'uk\\.gov\\.nationalarchives\\.webarchive|' 'uk\\.org\\.webarchive\\.www' ')\\.' AND #el_to_domain_index NOT REGEXP '^(https?|http)://archive\\.(fo|is|li|md|ph|today|vn)$' AND el_to_domain_index NOT REGEXP '^(https?|http)://(fo|is|li|md|ph|today|vn)\\.archive\\.$' AND el_to_path REGEXP '[^\\?#]*https?://' AND el_to_path NOT LIKE '%=http%' el_to_path NOT LIKE '%\?http%' ORDER BY el_to_domain_index, el_to_path LIMIT 1000
By running queries you agree to the
Cloud Services Terms of Use
and you irrevocably agree to release your SQL under
CC0 License
.
Submit Query
Stop Query
All SQL code is licensed under
CC0 License
.
Checking query status...