Fork of Mainspace page titles mixing Latin with Cyrillic/Greek characters by Certes
This query is marked as a draft This query has been published by Certes.

SQL

AخA
 
SELECT P1.page_title Title, rd_title RedirectTarget,
 REGEXP_REPLACE(REGEXP_REPLACE(CONVERT(P1.page_title USING utf8), "(\\p{Greek}+)", "[Greek \\1]"), "(\\p{Cyrillic}+)", "[Cyrillic \\1]") AnnotatedTitle
FROM page P1
LEFT JOIN templatelinks TL1 ON TL1.tl_from=page_id AND TL1.tl_target_id IN (3933 /*Chembox*/, 7831 /*Drugbox*/, 25401 /*Infobox_gene*/, 10018 /*Infobox_enzyme*/, 6127 /*Starbox_begin*/)
LEFT JOIN redirect ON rd_from=P1.page_id AND rd_namespace=0
WHERE P1.page_namespace=0 AND CONVERT(P1.page_title USING utf8) NOT REGEXP "μ(SA|F|m|g|s)(\\b|_)"
AND CONVERT(P1.page_title USING utf8) REGEXP "[\\p{Greek}\\p{Cyrillic}]\\p{Latin}|\\p{Latin}[\\p{Greek}\\p{Cyrillic}]"
AND NOT EXISTS (SELECT 1 
                FROM page P2
                JOIN templatelinks TL2 ON TL2.tl_from=page_id AND TL2.tl_target_id IN (3933 /*Chembox*/, 7831 /*Drugbox*/, 25401 /*Infobox_gene*/, 10018 /*Infobox_enzyme*/, 6127 /*Starbox_begin*/)
                WHERE P2.page_title=rd_title AND P2.page_namespace=0)
AND TL1.tl_from IS NULL
ORDER BY P1.page_title
By running queries you agree to the Cloud Services Terms of Use and you irrevocably agree to release your SQL under CC0 License.
All SQL code is licensed under CC0 License.

Checking query status...