| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
|
| |
Reason for the change is in the Phabricator task.
Bug: T380047
Change-Id: If7f8b07674851e028583fd76b1014208d3ee598e
|
|
|
|
|
| |
Bug: T304802
Change-Id: Iccacf6288d758e526c95df69861385a4ac2d0f55
|
|
|
|
|
|
|
|
|
|
|
| |
Follows-up I301f471f86ba2.
For ease of navigation, move Converter subclasses to a group called
"Languages", which for documentation purposes is a subgroup of
"Language". The next commit does the same for Messages* files,
and Language subclasses (done separately for ease of review).
Change-Id: If1cef9aa15f536ebaedd4477ad7453426e7f3b85
|
|
|
|
|
|
|
|
|
| |
'LEFT-TO-RIGHT MARK' (U+200E) and 'RIGHT-TO-LEFT MARK' (U+200F) are not
needed in the source code comments.
c8caf26ffd9 removed already the LRM/RLM from Names.php.
Change-Id: I7d5e170af688fb74b8f040be43bd2be0a7622dee
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This is a first pass at Latin/Cyrillic translitertion for Crimean
Tatar (crh).
Includes transliteration tables, prefix/suffix mappings, regex
mappings, and exceptions lists for words and abbreviations.
Regularize CRH language name in messages/* files.
Fix "varient" typos in qqq.json.
Add unit tests for CRH transliteration.
Bug: T23582
Change-Id: I424703f99adf837f6217872b882d1ea26bfdd068
|
|
|
|
|
|
|
| |
Performed using
find . -name \*.php -exec sed -i -e :a -e '/./,$!d;/^\n*$/{$d;N;};/\n$/ba' {} \;
Change-Id: I5d0627f94c73690cf3a8a453539c22c760c2aa60
|
|
|
|
|
|
|
|
|
|
| |
Per wikitech-l consensus:
https://lists.wikimedia.org/pipermail/wikitech-l/2016-February/084821.html
Notes:
* Disabled CallTimePassByReference due to false positives (T127163)
Change-Id: I2c8ce713ce6600a0bb7bf67537c87044c7a45c4b
|
|
|
|
|
|
|
|
|
|
| |
- use tab as indent instead of spaces
- Added space after closures "function"
- Added spaces around string_concat
- Added newline inside empty blocks
- Removed four spaces after comma
Change-Id: I4425b0c6a69b36f40acfea6511b8950cf09ce2b2
|
|
|
|
|
|
|
|
| |
* Interface strings are now elsewhere
* MessagesQQQ no longer exists
* Prefer https for translatewiki.net
Change-Id: I76652ea94cca80441cd5d978029e4707ee41c4fd
|
|
|
|
|
|
| |
Largest patch set in the history of MediaWiki!
Change-Id: I918cfdc46c099017ca3622d1bcbbdd30c4fd00f6
|
|
|
|
| |
Change-Id: I18970accbce3b383e4ad89df2b5cd0d638861e26
|
|
|
|
| |
Change-Id: I71e13fdbb541c2fccbe303849a3c78f0c1260bd2
|
|
|
|
| |
Change-Id: I9ed903822cabaf2a845d01f1a68ee34ac444d980
|
|
|
|
| |
Change-Id: Iaee33fbd265807b18879b6f9873d1d4344dfb878
|
|
|
|
| |
Change-Id: I8d9a05eb6ce4136d424c4f2ce7cdc65e8c09b0f6
|
|
|
|
| |
Change-Id: I78257c1482d2a8a5a36693082c3228a63d0f4b8f
|
|
|
|
| |
Change-Id: Id4b89fcef61ccf1e329710f6beff9398669fdadb
|
|
|
|
| |
Change-Id: Id32d5c87533081016e6a00312703f450782a1d6d
|
|
|
|
| |
Change-Id: I91af046a42d410457d6ccb6ce44657a29d184434
|
|
|
|
| |
Change-Id: Ic4c5e76d04630a9da5bde0590db09b55d30132ec
|
|
|
|
| |
Change-Id: Iac7415ec17a0a30026e8c542f5ddd98d67b35d59
|
|
|
|
| |
Change-Id: I0eb353cd0cadf0bed7cd17d4c3eb2c7e108f6aa6
|
|
|
|
| |
Change-Id: I5a74c20de10a3203ff81d453de6c739a8a476081
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
The existing "linkprefix" message is unlikely to be accurately
customized by message translators (as shown by the fact that, of the 10
distinct customizations prior to Iaa7eaa44 (which made them even more
complicated), 3 were broken or entirely ineffective, 1 was half
ineffective, and 2 more seem to have included the Latin-1 Supplement by
accident) or by local wiki admins. So, like linktrail before it, let's
move it out of the system messages and into a separate language
variable.
At the same time, let's make it a simple character set (like
$wgLegalTitleChars) rather than a complicated regular expression. The
complicated regex now lives in the parser.
This also adjusts the output of the API's action=query&meta=siteinfo and
adds an accessor parallel to the linkTrail accessor to Language.
Note the following changes that are not simply extracting the existing
charset from the linkprefix message for $linkPrefixCharset:
* The En message matched all non-ASCII UTF-8 characters by matching the
component bytes (\\x80-\\xff). The new character set is equivalent.
* Various languages were identical to En and so have no $linkPrefixCharset
set. These are: Ary Az Ce Ga Id Ka Kiu Km Ltg Mk Ms Ne Nn Ro Roa_tara Sc Si
Sr_ec Sr_el Tl Tt_cyrl Tt_latn Ug_arab War
* Cu, Uk, and Udm are changed to match any number of „ or « in the prefix.
* Cv tried to include "«" that was redundant to the range \\x80-\\xff
(see En comment). This was removed.
* Diq was entirely bogus, and so was removed.
* Gu included many additional UTF-8 characters that are redundant to the
range \\x80-\\xff (see En comment). These were removed, and the
resulting character set is equivalent to En.
* Mt has been broken since it was introduced in r37242. The charset used is
equivalent to the broken regex.
Bug: 56031
Change-Id: I3369851b33113fc118a1bace38f3ac310cdd9725
|
|
|
|
| |
Change-Id: I7d7dfd1a1e1a395c639b30489bacceedfbfc6330
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
The regular expression in the linkprefix message is run against the
entire page up to each wikilink, and is expected to capture one group
having everything except the prefix and another having only the prefix.
For long pages this winds up being a lot of text, so inefficient regular
expressions are going to cause problems.
The current regex is this:
/^(.*?)([a-zA-Z\\x80-\\xff]+)$/sD
This is not efficient: it will scan through the string trying to match
against every run of one or more letters/non-ASCII characters,
backtracking at every one except possibly the last. The only reason this
hasn't been a huge problem everywhere is because only a few languages
have this feature enabled.
This change replaces this with this regex:
/^((?>.*(?<![a-zA-Z\\x80-\\xff])))(.+)$/sD
This is rather more efficient: it will grab the whole string (which is
actually fast even for huge strings), then back off character by
character until it finds one that isn't a letter/non-ASCII.
Note that the above could be simplified somewhat:
/^((?>.*[^a-zA-Z\\x80-\\xff]|))(.+)$/sD
The performance improvement here is minor, and Gujarati, Church Slavic,
Udmurt, and Ukrainian would still need the other style for their current
implementations.
For Gujarati, we also use another regex trick: a look-behind assertion
in PCRE must be fixed length, so something like (?<!a|bb) won't work.
But that regex fragment is equivalent to (?<!a)(?<!bb) which is allowed,
so we use that instead.
Bug: 52865
Change-Id: Iaa7eaa446b3f045a9ce970affcb2a889f44bdefd
|
|
|
|
| |
Change-Id: I4f52889299ffc8ae8d49634fd8b36d73c66d8b5a
|
|
|
|
| |
Change-Id: I5e7da5913a10c7f75cdf223f2166cc7873134949
|
|
|
|
| |
Change-Id: I332982e76aa7f846c42e81fbd30141d85326f8b8
|
|
|
|
| |
Change-Id: Iad6b538be6d172d232b8e593ff98faf3d5707abe
|
|
|
|
| |
Change-Id: I9a8b8769d0077d4d406c27910b8f94d9f3a2ceeb
|
|
|
|
| |
Change-Id: Idfa4331722672da932fe8a124bc9618de0e31fea
|
|
|
|
| |
Change-Id: I4e865f40a8c69fdc7520bd3a89ec61ca9af57337
|
|
|
|
| |
Change-Id: I599770c4c8b53d0d46c74f1426168a5c3d963a81
|
|
|
|
| |
Change-Id: Ic862c638ed6ce33d3fe4dad9743c88136d832433
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Format the three messages in header as one paragraph with three
sentences, instead of a paragraph and two split unordered lists with
one item each and inconsistent full stops.
Message changes: In 'wlheader-enotif' and 'wlheader-showupdated',
remove initial bullet point if present and add final full stop if
missing. First used the regexes below, then went through each language
file and manually changed the messages if applicable (e.g., Thai not
using full stops at all, Asian languages using '。', Devanagari
languages using '।' etc.)
Find: ('wlheader-[^']+'\s*=>\s*)(['"])(?:\*\s*)?([\s\S]+?)\.?\2,\n
Replace with: $1$2$3.$2,\n
Bug: 48615
Change-Id: I856f71f36d7f4b4baff5e968d88e4d3f7aeecce2
|
|
|
|
| |
Change-Id: Id18fb2721534f365dbcab26c7531fca87f859b28
|
|
|
|
| |
Change-Id: Ib5e4b598df854da0bf279190047508a5f276eaab
|
|
|
|
| |
Change-Id: I36833fa0e34d515906005009fcfe7ffb7f59a3e9
|
|
|
|
| |
Change-Id: Ie28bd2d1bce13b5a78eeb8573d8b5b9527b1afeb
|
|
|
|
| |
Change-Id: If7185aa1d59b6c4207765127204270ce2bc91217
|
|
|
|
| |
Change-Id: Ia14126efb5937b32606d7abb2334a6537eae8544
|
|
|
|
| |
Change-Id: If3901d13b72f526be319d8626e3d101767eccdbc
|
|
|
|
| |
Change-Id: Iecaef86923bb32e102dd69ec12651824565bc3ed
|
|
|
|
| |
Change-Id: I554504cd3d622f5eccc27feaf55057c1af443212
|
|
|
|
| |
Change-Id: I1832224e316757869dc2071d0f52b9a376ad0a1c
|
|
|
|
|
|
|
|
| |
Update the order of parts in messages files. Not done for all files. Order set as:
fallback, encoding, namespace related, special pages, magic words, other (no fixed
ordering after magic words).
Change-Id: Ide5ec747ba62a8c2bca8040a14d0aeea8e6c79b9
|
|
|
|
| |
Change-Id: Ie33d83a4d96477c2ed53bba8ee01bc760b980351
|
|
|
|
| |
Change-Id: I1b2cf70c578511ef44a2381e506ee168cc57c7ec
|
|
|
|
| |
Change-Id: I766bdb2f0fd630144b1bd56f0fa5135af8fbdf5d
|