This includes:
- new keyword in CirrusSearch to remove them from search results
- new option in AdvancedSearch to do so
This includes:
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T304939 Translation pages cluttering various page listings | |||
Open | Feature | None | T304977 Option to remove translation pages from search results | ||
Resolved | Reedy | T347429 Store whether a page is a "translation" (sub) page in the database to allow varying SQL queries |
CirrusSearch already provides inlanguage:en: https://www.mediawiki.org/wiki/Help:CirrusSearch#Inlanguage
Reopen.
Thanks for filing. Just so I better understand your comments above, are you saying that the CirrusSearch language filter is not sufficient because translation pages have English subpages and still show up in search results? I'm trying to better understand why the language filter workaround doesn't work for you.
Firstly translation pages have English subpages; secondly even uncommon, it is possible for any page in any wiki to be in any languages and there is interface (Special:PageLanguage) to change it.
Examples:
A big source of frustration here is that for certain keywords on wikis like mediawiki.org or Meta you will get a ton of half-translated Extension:Translate subpages, all containing the English original and this having high specificity scores while actually being utterly irrelevant. Sometimes search is borderline useless because of that.
I think the ideal solution there would be to not index untranslated text on translated subpages, or maybe somehow coalesce translations of the same page into a single search result.