This is an old revision of this page, as edited by Grufo (talk | contribs) at 05:20, 18 October 2024 (→Protected edit request on 18 October 2024). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Revision as of 05:20, 18 October 2024 by Grufo (talk | contribs) (→Protected edit request on 18 October 2024)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)This module was considered for merging with Module:HTMLDecode on 2020 May 8. The result of the discussion was "no consensus". |
Text and/or other creative content from this version of Module:String was copied or moved into incubator:Module:Wp/nod/String with this edit. The former page's history now serves to provide attribution for that content in the latter page, and it must not be deleted as long as the latter page exists. |
This page has archives. Sections older than 90 days may be automatically archived by Lowercase sigmabot III when more than 5 sections are present. |
Extract number
I am trying to extract a number from a string and ignore any units. I have been using {{first word}} which works in some cases.
- {{first word|1={{convert|45|m}}}} -> 45
- {{first word|1={{convert|18.6|nmi|km mi|abbr=on}}}} -> 18.6 nmi
In the second case it doesn't work. I think because a nbsp is being used instead of a space. Is there any template which will work for both? — Martin (MSGJ · talk) 08:20, 11 April 2022 (UTC)
- You can use Special:ExpandTemplates to see that convert outputs
18.6 nmi (34.4 km; 21.4 mi)
for the second case above. What is the purpose of this? If it's for convert and if you can control the parameters, using abbr=values would make selection easier. For the second case above, that would give18.6 (34.4; 21.4)
. Johnuniq (talk) 11:01, 11 April 2022 (UTC)- It's for importing data into Wikidata from infobox fields. I can't control the parameters because convert is used separately on each article. — Martin (MSGJ · talk) 19:36, 11 April 2022 (UTC)
Template:Str rep pattern match irregularities
Editors interested in this module may be interested in the following discussion: Template talk:Str rep#Pattern match irregularities. Your feedback would be welcome. Mathglot (talk) 08:14, 8 October 2022 (UTC)
How to do string.replace of level3 headers in transcluded wikitext
I am trying to use replace to strip out the level3 headers from a transclusion of Dublin North-West (Dáil constituency)#TDs.
However {{#invoke:String|replace|{{trim|{{#section-h:Dublin North-West (Dáil constituency)|TDs}}}}|%=%=%=*%=%=%=||plain=false}}
leaves the level3 headers in place -- see my /sandbox test page at special:permalink/1164910747
Any suggestions? Feel free to edit that sandbox if it helps. BrownHairedGirl (talk) • (contribs) 21:13, 11 July 2023 (UTC)
- I have never seen the #section-h construction before. Would you mind explaining where that comes from? — Martin (MSGJ · talk) 21:50, 11 July 2023 (UTC)
- Turns out this has nothing to do with Lua: when using an equals sign in an argument to a template or module you need to name the parameter explicitly or escape it with {{=}}:
{{#invoke:String|replace|{{trim|{{#section-h:Dublin North-West (Dáil constituency)|TDs}}}}|pattern=%=%=%=*%=%=%=|replace=|plain=false}}
should work. * Pppery * it has begun... 22:09, 11 July 2023 (UTC)- Bless you @Pppery. That's done it. Thanks also to @Trappist the monk for appplyig the fix to my sandbox -- I am not sure which of you did what first, but you have both been a great help, and mighty promptly.
- I case you are curious, this is for my development of the {{Constituency Teachtaí Dála navbox}} meta-template. When I used it to make {{Kerry (Dáil constituency)/TDs}}, the level3 headers were an annoying intrusion, but now I can make 'em vanish from the navbox. Thanks again. BrownHairedGirl (talk) • (contribs) 22:35, 11 July 2023 (UTC)
- This isn't quite what you were asking, but related enough you might be interested in it: {{Conditional heading}}. Mathglot (talk) 04:50, 12 July 2023 (UTC)
- Thanks, @Mathglot. I want to remove the headings rather than to promote or demote them, so it's not what I need in this case. But it's good to know about this, for future refrence. BrownHairedGirl (talk) • (contribs) 23:11, 12 July 2023 (UTC)
- This isn't quite what you were asking, but related enough you might be interested in it: {{Conditional heading}}. Mathglot (talk) 04:50, 12 July 2023 (UTC)
Protected edit request on 21 September 2023
This edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
On Latin Misplaced Pages I have created a tiny module named Module:Translitteratio, which contains a single function named abecedaria
. The function accepts three parameters and works exactly like the translit
M4 macro (but with the order of arguments changed to chars, replacement, string). For example,
{{#invoke:translitteratio|abecedaria|aeiou|äëì|Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nam mollis, metus ac volutpat auctor, sem ligula interdum nulla, ac pellentesque nibh erat sed nisi. Integer nec auctor orci. Quisque nibh metus, viverra et viverra nec, laoreet in mi. Integer aliquet, mauris vitae tristique fringilla, ante odio ultrices quam, et fermentum orci purus nec lorem. Integer quis vehicula arcu, et laoreet enim. Nulla euismod neque velit, eget iaculis massa tempor ut. Nulla convallis maximus neque et porttitor. In nunc eros, convallis id posuere ac, egestas et arcu. Aenean faucibus sollicitudin ipsum, in condimentum mauris maximus in. Etiam facilisis eros vel tortor tincidunt iaculis. Vestibulum lorem lectus, posuere sed enim in, varius pharetra lorem. Morbi nisl lacus, eleifend ut urna vitae, interdum consequat ipsum. Cras ut imperdiet ante. Sed quis luctus purus, quis porta erat. Ut gravida gravida augue a euismod.}}
generates
- Lrëm ìpsm dlr sìt ämët, cnsëctëtr ädìpìscìng ëlìt. Näm mllìs, mëts äc vltpät äctr, sëm lìglä ìntërdm nllä, äc pëllëntësqë nìbh ërät sëd nìsì. Intëgër nëc äctr rcì. Qìsqë nìbh mëts, vìvërrä ët vìvërrä nëc, lärëët ìn mì. Intëgër älìqët, märìs vìtäë trìstìqë frìngìllä, äntë dì ltrìcës qäm, ët fërmëntm rcì prs nëc lrëm. Intëgër qìs vëhìclä ärc, ët lärëët ënìm. Nllä ëìsmd nëqë vëlìt, ëgët ìäclìs mässä tëmpr t. Nllä cnvällìs mäxìms nëqë ët prttìtr. In nnc ërs, cnvällìs ìd psërë äc, ëgëstäs ët ärc. Aënëän fäcìbs sllìcìtdìn ìpsm, ìn cndìmëntm märìs mäxìms ìn. Etìäm fäcìlìsìs ërs vël trtr tìncìdnt ìäclìs. Vëstìblm lrëm lëcts, psërë sëd ënìm ìn, värìs phärëträ lrëm. Mrbì nìsl läcs, ëlëìfënd t rnä vìtäë, ìntërdm cnsëqät ìpsm. Cräs t ìmpërdìët äntë. Sëd qìs lcts prs, qìs prtä ërät. Ut grävìdä grävìdä ägë ä ëìsmd.
(You can experiment with it in the Latin Misplaced Pages sandbox page)
This allowed to write the {{la:Sine notis diacriticis}} template, which transforms vowels with various accents into simple vowels, and the {{la:Ufrac}} template, which writes fractions using only Unicode characters (no HTML).
I think that Module:String would benefit from this function, with its name changed from abecedaria
to translit
.
Translated into English, the code to add to this module would be:
str.translit = function( frame ) local args = frame.args if args == nil or args == '' or args == nil or args == '' then return '' end local npairs local text = mw.text.split(args, '', true) local removenda = mw.text.split(args, '', true) local substituenda if args ~= nil then substituenda = mw.text.split(args, '', true) else substituenda = {} end if #removenda > #substituenda then npairs = #substituenda else npairs = #removenda end for idx1 = 1, npairs do for idx2 = 1, #text do if text == removenda then text = substituenda end end end for idx1 = npairs + 1, #removenda do for idx2 = 1, #text do if text == removenda then table.remove(text, idx2) end end end return table.concat(text) end return str
Grufo (talk) 17:40, 21 September 2023 (UTC)
- This feels sufficiently obscure that it doesn't belong in this module, unless some actual use case can be presented here unrelated to what other wikis do. * Pppery * it has begun... 22:02, 21 September 2023 (UTC)
- What is obscure, what it does or its code? What it does is well explained by the documentation of M4's
translit
(this is just a clone). As for what can be done with it, I believe template {{la:Ufrac}} has no languages. --Grufo (talk) 01:28, 22 September 2023 (UTC)- Yes, and why should we care? Why is the Unicode system for fractions superior to the one currently in use? Why should we invent more ways to do the same thing? * Pppery * it has begun... 01:33, 22 September 2023 (UTC)
- I believe this discussion started with the wrong foot. This is a generic function proposal, it is not a proposal about the {{la:Ufrac}} template. In fact, the other template that uses this proposal ({{la:Sine notis diacriticis}}) does something completely different (i.e. removing accents from vowels). I have no problem answering questions about {{la:Ufrac}}, but I would kindly ask you to stay on topic on this function proposal. Re:
Why should we invent more ways to do the same thing?
Because there are many cases where you have no alternatives. For instance, you can write,100 {{Abbr|carats|A carat is ¹⁄₁₇₂₈ of a pound.}}
- ↳ 100 carats
- but you cannot use the CSS/HTML solution used by {{Fraction}} inside {{Abbr}}. But again, since it is a generic function proposal, frankly I think it will find most of its applications in the transliteration of alphabets rather than in fractions (that was indeed only one example). It is such a basic functionality that it has been part of Unix since the old days. For instance, if you look at the code of {{la:Ufrac/par}} you will find
{{#invoke:translitteratio|abecedaria|0123456789|⁰¹²³⁴⁵⁶⁷⁸⁹|{{{1}}}}}
- All that does is mapping each digit to its superscript unicode character. If we wanted to do the same using {{#invoke:string|replace}} we would need ten module invocations. --Grufo (talk) 01:55, 22 September 2023 (UTC)
- We already have Module:MultiReplace for that. Yes, the syntax you used is slightly terser than that module's syntax, but there's no need to shake up the world. * Pppery * it has begun... 02:09, 22 September 2023 (UTC)
- I have not done a speed test, but I suspect that syntax will not be the only thing where this function will greatly beat Module:MultiReplace. In fact, I suspect that performance will be tremendously less expensive here. This function is less powerful (the mapping happens verbatim character by character), hence, exactly for this reason, it is possible to map without problems strings that are page-long – while I suspect that Module:MultiReplace will break the servers in that case. If all this seems unrealistic, think about the amount of Unicode characters: just to map four types of accents in {{la:Sine notis diacriticis}} I had to write the following string:
ĀĒĪŌŪȲĂĔĬŎŬÀÈÌÒÙỲÁÉÍÓÚÝÂÊÎÔÛŶāēīōūȳăĕĭŏŭàèìòùỳáéíóúýâêîôûŷ̄̆
- I assure you, there can be way more complex transliterations than this. And last but not least: in terms of both computational complexity and ease of use, what you call “shaking up the world” is rather a going back to the old days. --Grufo (talk) 02:26, 22 September 2023 (UTC)
- Yes, and do we really care? Don't make speculative claims about breaking the servers - see WP:PERF. I'm still not convinced that we need any of this level of complexity instead of just letting people type what they want to type rather than relying on templates to fix it for them and it's clear we won't convince each other at this point. * Pppery * it has begun... 02:33, 22 September 2023 (UTC)
- Fine. Continuing with this spirit interface-wise Latin Misplaced Pages will soon beat English Misplaced Pages. --Grufo (talk) 02:41, 22 September 2023 (UTC)
- Yes, and do we really care? Don't make speculative claims about breaking the servers - see WP:PERF. I'm still not convinced that we need any of this level of complexity instead of just letting people type what they want to type rather than relying on templates to fix it for them and it's clear we won't convince each other at this point. * Pppery * it has begun... 02:33, 22 September 2023 (UTC)
- We already have Module:MultiReplace for that. Yes, the syntax you used is slightly terser than that module's syntax, but there's no need to shake up the world. * Pppery * it has begun... 02:09, 22 September 2023 (UTC)
- I believe this discussion started with the wrong foot. This is a generic function proposal, it is not a proposal about the {{la:Ufrac}} template. In fact, the other template that uses this proposal ({{la:Sine notis diacriticis}}) does something completely different (i.e. removing accents from vowels). I have no problem answering questions about {{la:Ufrac}}, but I would kindly ask you to stay on topic on this function proposal. Re:
- Yes, and why should we care? Why is the Unicode system for fractions superior to the one currently in use? Why should we invent more ways to do the same thing? * Pppery * it has begun... 01:33, 22 September 2023 (UTC)
- What is obscure, what it does or its code? What it does is well explained by the documentation of M4's
Protected edit request on 25 October 2023
This edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
Please, add r to the word fist (resulting in first), line number 61. Nishimoto, Gilberto Kiyoshi (talk) 18:11, 25 October 2023 (UTC)
Protected edit request on 3 September 2024
This edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
All of the Lua pseudo-regex special characters are in the ASCII range. See en:UTF-8#Encoding. Therefore, we don't need at all to use the (costly) mw.ustring.*
functions in some parts I have reviewed.
My request is to replace:
function str._escapePattern( pattern_str ) return mw.ustring.gsub( pattern_str, "(])", "%%%1" ) end
with:
function str._escapePattern( pattern_str ) return ( string.gsub( pattern_str, "]", "%%%0" ) ) end
(I am also removing the capture group, which is unneeded as we can use the "%0" whole capture)
(edit: I am also taking the opportunity, for extra robustness, to add parentheses in order to discard the 2nd value (number of replacements) returned by these gsub() functions, then subsequently by _escapePattern(). The more I encounter this "multiple values returned" Lua feature, the more I think it was a terrible design idea)
Second change: line 409, we can similarly replace:
replace = mw.ustring.gsub( replace, "%%", "%%%%" ) --Only need to escape replacement sequences.
with:
replace = string.gsub( replace, "%%", "%%%%" ) --Only need to escape replacement sequences.
These changes would significantly decrease the overhead of having the "plain mode" enabled in this module's functions.
Od1n (talk) 03:26, 3 September 2024 (UTC)
- 1243840019, thanks. Od1n (talk) 22:38, 3 September 2024 (UTC)
Protected edit request on 18 October 2024
It is requested that an edit be made to the fully protected module at Module:String. (edit · history · last · links · sandbox · edit sandbox · sandbox history · sandbox last edit · sandbox diff · test cases · transclusion count · protection log) This template must be followed by a complete and specific description of the request, so that an editor unfamiliar with the subject matter could complete the requested edit immediately.
Edit requests to fully protected pages should only be used for edits that are either uncontroversial or supported by consensus. If the proposed edit might be controversial, discuss it on the protected page's talk page before using this template. Consider making changes first to the module's sandbox and test them thoroughly here before submitting an edit request. To request that a page be protected or unprotected, make a protection request. When the request has been completed or denied, please add the |
The value returned by a module function must always be a string, however some functions here return numbers (these are len
, str_find
, find
and count
). Could you please apply this diff? You can just copy and paste the code at this permanent link.
Although unnoticeable when used in normal wikitext, this can create problems when Module:String is invoked using other modules.
For instance, focusing on the len
function, for each argument passed, a template named mytemplate
containing the following code
{{#invoke:params|mapping_by_invoking|string|len|mapping_by_replacing|^.*$|%0 mod 3|1|for_each|}}
should print
The code above invokes {{#invoke:string|len|...}}
for each parameter passed. Then it attempts to replace the lengths saved with %0 mod 3
, i.e. by adding mod 3
at the end of each parameter. And so, for instance, {{mytemplate|hello|world|foo|bar}}
should print
However, since {{#invoke:string|len|...}}
returns a number, any attempt to do string manipulation with the number returned will generate an error. --Grufo (talk) 05:17, 18 October 2024 (UTC)