User talk:Citation bot/Archive 33

Page contents not supported in other languages.
Source: Wikipedia, the free encyclopedia.
Archive 30 Archive 31 Archive 32 Archive 33 Archive 34 Archive 35 Archive 38

Suggestion: Could citebot learn from its mistakes? (cont)

As a quick update to an earlier discussion here, I now see there's been under way for the last couple of years a WM project along these lines (of keeping a centralised citation db) called Shared Citations. — Guarapiranga  01:59, 20 July 2022 (UTC)

Regular expression failures

I'm seeing of lot of "Regular expression failures" but I don't see what's getting things flagged. Example:

!Regular expression failure in Jules de Cuverville when extracting Templates The following text might help you figure out where the error on the page is (Look for lone { and } characters)

# # # CITATION_BOT_PLACEHOLDER_TEMPLATE 2 # # #

# # # CITATION_BOT_PLACEHOLDER_TEMPLATE 3 # # ## # # CITATION_BOT_PLACEHOLDER_TEMPLATE 4 # # #

Jules de Cuverville (28 July 1834 – 14 March 1912) was a French naval officer who rose to become Chief of Staff of the French Navy. He entered politics in later life, elected to the senate where between 1901 and 1912 he represented Finistère.[1]

Jules Marie Armand de Cuverville was born at Allineuc, a small village a short distance inland from Saint-Brieuc in Brittany. His father was Louis-Paul de Cuverville who represented the locality on the monarchist benches in the National Assembly between 1849 and 1853. Louis-Paul's family was descended from the lords of the manor at Maucomble in Normandy, some of whom had been Squires to French kings. Other kinsmen included sailors and naval officers, such as his grandfather, the Rear Admiral Louis-Hyacinthe Cavelier de Cuverville.

Jules Marie attended school at

Siege of Sevastopol (1854–55), where he was badly wounded. There were further missions in Africa and in the Crimea. He served in Algeria as deputy to Vice-admiral de Gueydon between 1871 and 1873. He was briefly given command of the avisos "Kleber" and "Cuvier" before being seconded to the diplomatic service, serving as naval attache at the French embassy in London during the middle 1870s. He then returned to France, serving aboard the Infernet as a commander with the South Atlantic Naval Division between 1875 and 1879, and promoted to the rank of ship's captain in 1878, taking command of a succession of training ships. He was promoted to the rank of rear admiral in 1888 and appointed a member of the Admiralty Council. Between 1890 and 1892 he served as head of the North Atlantic Naval division and was involved in the Pacification of Dahomey.[1]

He became a vice-admiral in 1893, and then

Maritime Prefect for Cherbourg, a member of the Upper Admiralty Council, commander of the Reserve Mediterranean Squadron in 1897 and inspector-general of the marine in 1898. He was Chief of Staff of the French Navy between 1898 and 1899.[1]

Jules de Cuverville was elected to the senate on 31 March 1901 in a bye-election caused by the death of the previous incumbent, General Arsène Lambert, who had died. He was re-elected in the general election of 4 January 1903. He lost his seat to Maurice Fenoux by a narrow margin on 7 January 1912, however.[1]

Brittany, then as subsequently, was relatively conservative in religious terms, and during the summer of 1902 Jules de Cuverville was among those at

Laïcité (separation of secular and religious institutions).[2]

A friend and political ally of

Catholic and steadfast in his commitment to the Third Republic, Admiral Count de Cuverville had two principal political priorities: defence of the church and support for the navy.[1]

Several months after losing his seat in the senate, Jules de Cuverville was crushed by a truck while crossing the street in Paris. He survived long enough to be taken to his home at 15, rue Dugay-Trouin, but died a few hours later.[4]

  1. # # CITATION_BOT_PLACEHOLDER_COMMENT 0 # # #
References 

  1. ^ a b c d e # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 5 # # #
  2. ^ # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 6 # # #
  3. ^ # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 7 # # #
  4. ^ # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 8 # # #
  1. # # CITATION_BOT_PLACEHOLDER_TEMPLATE 9 # # #
  1. # # CITATION_BOT_PLACEHOLDER_TEMPLATE 0 # # #
  1. # # CITATION_BOT_PLACEHOLDER_TEMPLATE 1 # # #
  !Report this problem please about page Jules de Cuverville

>No changes required.

   # # # 
Done with page. Abductive (reasoning) 15:18, 21 July 2022 (UTC)
I came here to echo that I have the same problem here's the error from "Los_Angeles_Unified_School_District"

> The following text might help you figure out where the error on the page is (Look for lone { and } characters) > # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 17 # # # # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 18 # # # # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 19 # # # # # # CITATION_BOT_PLACEHOLDER_TEMPLATE 23 # # # Mason (talk)


{{fixed}} Sorry, AManWithNoPlan (talk) 15:24, 21 July 2022 (UTC)

The Irish Times is a newspaper

Status
{{fixed}}
Reported by
BrownHairedGirl (talk) • (contribs) 07:48, 22 July 2022 (UTC)
What happens
<ref>{{Cite web|url=https://www.irishtimes.com/life-and-style/people/50-people-to-watch-in-2022-the-best-young-talent-in-ireland-1.4757081|title=<ref>{{Cite journal|url=https://www.irishtimes.com/life-and-style/people/50-people-to-watch-in-2022-the-best-young-talent-in-ireland-1.4757081|title=
What should happen
<ref>{{Cite web|url=https://www.irishtimes.com/life-and-style/people/50-people-to-watch-in-2022-the-best-young-talent-in-ireland-1.4757081|title=<ref>{{Cite news|url=https://www.irishtimes.com/life-and-style/people/50-people-to-watch-in-2022-the-best-young-talent-in-ireland-1.4757081|title=
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Rebecca_Moynihan&diff=prev&oldid=1099722759
We can't proceed until
Feedback from maintainers


Cleanup of pubmed urls

Status
 Fixed
Reported by
b
}
01:58, 20 July 2022 (UTC)
What should happen
[1] / [2]


All have PMCs set (or free DOIs), and thus should be overriden by autolinking.

b
} 01:58, 20 July 2022 (UTC)

Cleanup JSTOR URLs

Status
 Fixed
Reported by
b
}
10:29, 23 July 2022 (UTC)
What should happen
[3], [4], [5], [6], [7], etc...


Unhandled write error

Status
new bug
Reported by
Horsesizedduck (talk) 21:48, 23 July 2022 (UTC)
What happens
!API call failed: Invalid CSRF token.. Will sleep and move on.
  !Unhandled write error.  Please copy this output and report a bug
We can't proceed until
Feedback from maintainers


Unfixable. Although since it seems be more common, I might have the code sleep and try again. AManWithNoPlan (talk) 22:39, 23 July 2022 (UTC)

Addition of page publication dates

Citation Bot adds page publication dates that are stated in the page's HTML metadata but not on the page itself.

For instance, the publication dates for just about every page on LeatherLicensePlates.com:

Is it legit to add these dates when they're not actually stated on the page itself, even if they *are* correct?

Klondike53226 (talk) 00:54, 17 June 2022 (UTC)

Yes.
b
}
02:18, 17 June 2022 (UTC)
Uh... anyone going to back Headbomb up here?
All I want here is to be 110% sure that it *is* acceptable to add page publication dates when they're stated in the metadata but not on the page itself, being someone who is so used to adding such dates when they *are* stated on the page (as with news or magazine articles).
Nothing more, nothing less. And, as ever, no disrespect intended. :)
Thanks, Klondike53226 (talk) 22:48, 17 June 2022 (UTC)
No, it is not acceptable in cases like this where there is a range of dates between when the web site was first published and when it was last modified. The bot has no way of knowing which date is applicable to the version of the site used as a source. That's what access dates are for, but even if an access date is provided, and lands within that publication-modification range, it is impossible to infer the correct date of the cited version of the source, because it could have been modified multiple times within that range. The only reasonable thing to do with web sites with this sort of date issue is to flag them for human attention so that a human can check that the current version still sources the content and update the access-date. But even if you did that, you'd be very likely to run into "helpful" gnomes (bot-like humans) who run through lists of these flags, check only that the web page exists and not that it is still accurate, and set a bad access-date. —David Eppstein (talk) 21:13, 18 June 2022 (UTC)
@David Eppstein: Well, I was already far from 110% sure that it was acceptable to add publication dates in cases such as this one with LeatherLicensePlates.com, when the discussion two below this one popped up - and now I'm as far from 110% sure as one can get. That is, 0% sure.
I find myself agreeing with you that it's better to leave a source undated when the corresponding page/site does not display a visible publication date, even if the publication and modification dates stated in the metadata are correct.
I've already put the "deny Citation Bot" comment in the date parameter for some of the LeatherLicensePlates.com sources used, and I shall waste no time in tackling the rest of these sources. Klondike53226 (talk) 22:18, 18 June 2022 (UTC)
Have to agree that the date should be shown on the displayed page or there is no easy way to verify that you are looking at the source that it is claiming to be. You should not add just from the HTML code. Keith D (talk) 16:08, 19 June 2022 (UTC)

The bot's still doing this - compare this edit to this source. That date being in the HTML isn't ironclad proof that that's truly the proper publishing date. I'm seeing that I'm the fourth person here to question this - is there a consensus somewhere that pulling these dates from the HTML is proper? Hog Farm Talk 22:20, 23 June 2022 (UTC)

This is one reason why the bot never adds |access-date= and it never adds a |date= that is newer than the |access-date=. AManWithNoPlan (talk) 13:23, 24 June 2022 (UTC)

TNT Journal=JSTOR

Status
 Fixed, also title=jstor while I was at it
Reported by
b
}
01:22, 25 July 2022 (UTC)
What should happen
[8]
We can't proceed until
Feedback from maintainers


bot adds |chapter= to cite journal and to cite news

Status
new bug
Reported by
Trappist the monk (talk) 14:14, 30 June 2022 (UTC)
What happens
|chapter= (and its aliases) is not supported by {{cite journal}} and {{cite news}} (also not supported in {{cite magazine}} and {{cite web}})
Relevant diffs/links
  • diff|chapter= added to {{cite journal}} (this source is actually a book)
  • diff|chapter= added to {{cite news}} (this source is actually a conference proceedings)
We can't proceed until
Feedback from maintainers


i watch that error category and fix them by hand. almost universaly, the bot points out that the cite was broken to begin with. i will add code to log these pages. AManWithNoPlan (talk) 11:40, 24 July 2022 (UTC)

Mostly {{fixed}}, many will still require human interaction to clean-up. But they are now being logged to a file when they are found. The error category does not include Draft/sandboxes/etc AManWithNoPlan (talk) 15:04, 25 July 2022 (UTC)

Umm, cs1|2 errors in draft namespace are categorized in all appropriate cs1|2 categories except when the page is a /sandbox, /testcases, /log, or /archive subpage; see line 12 et seq. If you have evidence to the contrary, I would like to see it.
Trappist the monk (talk) 15:10, 25 July 2022 (UTC)
That is news to me. I am glad to hear that. I will keep the logging so that I can correct these by hand when they occur. AManWithNoPlan (talk) 17:27, 25 July 2022 (UTC)

New Google Books

Status
 Fixed
Reported by
Jonatan Svensson Glad (talk) 13:41, 1 August 2022 (UTC)
What happens
The bot is not configured for the "new" Google Books
What should happen
It should fetch author name. It should clean the uRL, change from .se to .com, etc..
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User:Josve05a/sandbox&diff=1101722271&oldid=1101722254


Side note, the old version of Google Books did not generate a publisher in this example, while the new version added |publisher=Berghahn Books. Jonatan Svensson Glad (talk) 13:42, 1 August 2022 (UTC)

CS1 errors

This bot routinely causes CS1 errors - for example, [this edit] caused the article to be listed at https://en.wikipedia.org/wiki/Category:CS1_maint:_numeric_names:_authors_list. It happens often, making unnecessary busywork to those of us who do CS1 maintenance. The bot should be modified so that it doesn't make any CS1 errors and cause pages to be listed at either https://en.wikipedia.org/wiki/Category:CS1_maintenance or https://en.wikipedia.org/wiki/Category:CS1_errors

Ira Leviton (talk) 20:37, 28 July 2022 (UTC)

Bad metadata is an ongoing issue yes. There's no way to anticipate all cases.
b
}
21:37, 28 July 2022 (UTC)
As with any other script, users of Citation bot are responsible for verifying that edits made using that feature are correct. --Redrose64 🌹 (talk) 22:46, 28 July 2022 (UTC)
New York added to non-human list. AManWithNoPlan (talk) 17:33, 31 July 2022 (UTC)

World Bird List

The bot has repeatedly changed the "cite web" to "cite journal" in this reference though it is clearly a web page and not a journal (<> and {} brackets removed so it shows here): cite web ref name=IOC12.1Gill, F.; Donsker, D.; Rasmussen, P. (July 2021). "IOC World Bird List (v 12.1)".

doi:10.14344/IOC.ML.11.2. Retrieved January 15, 2022. /ref . Craigthebirder (talk
) 17:45, 27 July 2022 (UTC)

Not only in bird lists, in individual species' pages. Craigthebirder (talk) 18:52, 27 July 2022 (UTC)
Why are you citing version 12.1 in the title and version 11.2 in the doi (which doesn't actually link to version 11.2 of the list)? And for specific bird species why aren't you linking to the particular place in the list for that species – shouldn't you link to https://www.worldbirdnames.org/new/bow/buttonquail/ from Plover?
Trappist the monk (talk) 19:01, 27 July 2022 (UTC)
First question - because I messed up; have corrected for future and will try to correct earlier entries. Second question - because that page is the link to the whole species list. But will link to family sections in the future. Thanks for pointing these out. Craigthebirder (talk) 20:34, 27 July 2022 (UTC)

I flagged 10.14344 as a non-journal DOI. AManWithNoPlan (talk) 20:01, 27 July 2022 (UTC)

Thank you. Craigthebirder (talk) 20:34, 27 July 2022 (UTC)

DOING MORE HARM THAN GOOD! Stop this bot!

Status
 Fixed
Reported by
Arminden (talk) 06:17, 24 July 2022 (UTC)
What happens
bot removing essential page number and replacing it with incorrect or insufficient ones
What should happen
deactivate the bot
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Hezekiah_(governor)&diff=next&oldid=1044424095


Your bot MUST be removed! You are damaging articles with no real benefit to balance that.

At Hezekiah (governor), this edit added data of little importance (doi, jstor, s2cid) since the URL is already indicated, while removing the very much essential page number and replacing it with incorrect or insufficient ones:

  • "page= 122–126 [125]" is correct data, in both form and content. It means that the entire article covers pp. 122–126, while the relevant ref page is 125 only.
  • "pages= 8, 13" are the correct, relevant ref pages. The bot replaced them with who knows what ("109–118"), maybe the general pages of the article.

What really matters is the actual page the ref is citing. Some sources can't be fully accessed online, but the ref page can be. If one can indicate the start & finish pages, that can benefit some who do have access (subscribers, access to hard copies), but is not essential. The form "x-y [z]" is well accepted in academic publications and transparent enough for the user: "the article goes from x to y, but look up only z." I have also seen it as "x-y (z)". Now you have forced me to remove the start & end pages of the article, in order to preserve the relevant page number(s) I'm actually citing. What's far worse, elsewhere the intervention of the bot has most likely not been reverted and now the cited page is missing. Now you must go and manually fix all the damage already done.

The bot harms the user. We're far better off without it. Arminden (talk) 06:17, 24 July 2022 (UTC)

|page=122–126 [125] should be kept when converting to the proper |pages=, yes. But |pages=8, 13 is definitely incorrect. Those are the pages of the preprint PDF, not the pages of the published book, which is what the reader [thus bot] is expecting.
Also, you were not harmed by the bot. Don't be dramatic.
b
}
06:52, 24 July 2022 (UTC)
The template is problematic, as there's only one page of reference, 125, so definitely singular, page. But it can be useful to indicate the size and position of the article, 122–126. I'm not coding, those who are should find a solution.
As I wrote on your page: your manual edits are absolutely perfect, detailed, and most welcome; the bot however is not ready to be let go. Unless and until it is, it is totally wrong to activate it.
"Being dramatic" is a touch too personal, if you don't mind. Wiki IS better off without the unfixed bot, broken bots tend to damage lots of articles, and those in charge aren't always quick to react, so clear words can only help. Arminden (talk) 10:45, 24 July 2022 (UTC)
fixed- now detects square brackets or commas AManWithNoPlan (talk) 11:37, 24 July 2022 (UTC)
AManWithNoPlan, great, thanks! Now that I know you can do it, let's see all aspects.
  • "page= X–Y [Z]" means that only Z is directly relevant, so "page" in the singular. Or not? Ideally the template should lead to smth like "pp. X-Y (see p. Z)", or if needed "pp. X-Y (see pp. V, W, Z)".
  • "page= X–Y (Z)" is also used by some. I also seem to remember that in some templates or contexts, straight brackets create a lot of chaos, so round ones are an important option. Arminden (talk) 11:56, 24 July 2022 (UTC)
PS: All the edits produced by the bot before it reaching the final, mature and correct form must be fixed retroactively. Does that happen automatically, or must they be identified and fixed by hand, one by one? If the latter, how and who? Arminden (talk) 12:00, 24 July 2022 (UTC)
round paras add too. The bot has been fully approved for well over a decade. People are expected to report problems as they see them. AManWithNoPlan (talk) 12:10, 24 July 2022 (UTC)
Why on Earth would the reader be expecting the pages of the published book and not the linked copy? On the various templates we provide links to preprints, partial google books previews, and chapter excerpts, in spite of the bulk of the citation indicating the full published work, entirely because we fully expect the bulk of the readers are not going to have journal subscriptions or a complete next-door library and thus, should they want to verify a statement, they should be given as direct a route as possible.
Changing page numbers in any form is absolutely uncalled for unless you, personally, verify they still point to the sourced information. SamuelRiv (talk) 12:17, 24 July 2022 (UTC)
""Why on Earth would the reader be expecting the pages of the published book and not the linked copy?" because that is the published version. I'm the one that added the preprint link. It wasn't linked before. If I tell you look up something from Hoogenboom, BJ; Manske, RC (October 2012), "How to write a scientific article", International Journal of Sports Physical Therapy, 7 (5): 512–7,
b
} 13:52, 24 July 2022 (UTC)
SamuelRiv, hi. We are almost of the same opinion, but just almost. Indicating the pages of the actually linked PDF (a preprint) is what we had and what's most needed, the bot changed that, and then the editor who apparently activated it, Headbomb, set in both infos.
Why the full info can be of some use? Because links rot, people do have access to hard or digital copies, and I for instance often bump into JSTOR docs I can only read online and knowing if the needed page is not too far in (not, say, on the 20th page of the doc) helps me decide right away if to bother clicking through the doc. I do read some of the referenced sources, as they educate me about topics I'm interested in, and often bump into "failed verification" cases. For some important titles I do have & use hard copies. So it's not all that hypothetical. ANd if it does no harm by replacing potentially useful, valid data with bad, let it be, it's a bot, doesn't cause sweat.
What I didn't realise: apparently, the bot isn't automatically crawling through the entire enWiki and doing edits, but needs to be activated by a specific editor for each article. Or not? Very different concepts. Arminden (talk) 13:27, 24 July 2022 (UTC)
Both modes are authorized. Generally a user has a specific issue that needs fixe, such as all pages where the jstor url has #meta_blah_blah_blah included, or all pages with a certain invalid DOI, etc. AManWithNoPlan (talk) 13:42, 24 July 2022 (UTC)
In the case where both the page range and pinpoint page(s) are given, the pinpoint should be preferred as the page range is almost always easily determined with the rest of the citation information, and the pinpoint is far more important for verifiability. If the bot can't distinguish in a template whether the editor was referencing page range, pinpoint, both, or neither, the proper action is to leave it alone or, if the metadata are corrupted, convert to |at=. SamuelRiv (talk) 18:06, 27 July 2022 (UTC)

Parameter with only a non-breaking space

Status
 Fixed once deployed
Reported by
BrownHairedGirl (talk) • (contribs) 07:15, 2 August 2022 (UTC)
What happens
{{cite web|author=&nbsp; left untouched
What should happen
|anyparameter=&nbsp; should be treated as empty, and removed.
Relevant diffs/links
The {{cite web|author=&nbsp; was added in this
WP:REFLINKSedit[9] to Tony Oller
, and I didn't spot the glitch before saving. I thought that CB would remove it
We can't proceed until
Feedback from maintainers


More generally, starting or ending with one.
AManWithNoPlan (talk) 11:27, 2 August 2022 (UTC)

Edit Summary: "Upgrade ISBN10 to 13" but no ISBNs changed

Status
 Fixed
Reported by
  — Chris Capoccia 💬 19:03, 2 August 2022 (UTC)
What happens
minor issue where edit summary includes "Upgrade ISBN10 to 13" but no ISBNs changed
What should happen
Should only include ISBN in summary when ISBN changes are happening
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Interfacial_thermal_resistance&type=revision&diff=1101965603&oldid=1101965336


There's not even an ISBN anywhere in the article!

b
} 23:14, 2 August 2022 (UTC)

The bot detects the number of "978" present in the article before and after. One PMID added had that. The code now counts "978-", since that is what the bot adds. The mistake will still happen if for example the title of a journal article added is "The 978- area code: demographics", but that is rare enough to live with. AManWithNoPlan (talk) 12:49, 3 August 2022 (UTC)

Fails to run on Scintillator

Status
Red X Won't fix since not easy and pretty rare
Reported by
b
}
23:04, 2 August 2022 (UTC)
What happens
Failure to run
Replication instructions
Try to run on Scintillator
We can't proceed until
Feedback from maintainers


I've looked everywhere for a stray bracket, and I just can't find what the issue is.

b
} 23:04, 2 August 2022 (UTC)

The fix was to make this template-looking-thing in a math block less template looking.
b
}
23:20, 2 August 2022 (UTC)
The inner brackets around the \tau are unnecessary: \tau_f and \tau_s would have worked. —David Eppstein (talk) 00:09, 3 August 2022 (UTC)
indeed {{math stuff} more math} does confuse the bot. About once a week I add spaces to markup on a page. All such page failures are logged. AManWithNoPlan (talk) 11:14, 3 August 2022 (UTC)

Mark 10.4249/... DOI as free access.

Like so.

The doi prefix

b
} 11:11, 3 August 2022 (UTC)

 Fixed once deployed. AManWithNoPlan (talk) 12:32, 3 August 2022 (UTC)

Caps: La Salle

Status
 Fixed once deployed
Reported by
b
}
23:48, 3 August 2022 (UTC)
What should happen
[10]
We can't proceed until
Feedback from maintainers


Adds pseudonymous collective author, already listed, as redundant editor

Status
Flagging as Red X Won't fix since it is obscure.
Reported by
David Eppstein (talk) 02:08, 4 August 2022 (UTC)
What happens
Special:Diff/1102235876
What should happen
Not that. The book does not list an editor on its cover or copyright page: it is formatted as an authored work. Note that the supposed author/editor, M. Lothaire, is actually a pseudonym for the collective of contributors, is already listed as an author, and is an accurate collective name for the group of people who wrote the chapters of the book (that is, its authors). Also, in the bot's version the "." in "M." is missing. According to the foreword, the actual editor of this volume is Dominique Perrin.
We can't proceed until
Feedback from maintainers


Please see publishers page (linked via DOI for 2nd edition) and the MR review for first edition. AManWithNoPlan (talk) 13:07, 4 August 2022 (UTC)

The book itself and a correct description of its content is more definitive than what the publisher lists it as in some database. Lothaire is a name for the group of people who wrote the content (its author). It is not a name of an editor. —David Eppstein (talk) 16:37, 4 August 2022 (UTC)
I see that now. I have flagged the page to avoid the using of the bad meta-data. AManWithNoPlan (talk) 16:44, 4 August 2022 (UTC)
Flag all pages with this DOI. AManWithNoPlan (talk) 16:56, 4 August 2022 (UTC)
Thanks. Since this appears to be bad publisher metadata rather than a bot bug, I think there is nothing more to do. —David Eppstein (talk) 19:48, 4 August 2022 (UTC)

Acryonym changed to lower case

Status
{{fixed}}
Reported by
45154james
What happens
Acronym changed from entirely upper-case (e.g. LRTAP) to incorrect capitalized form of the same thing (i.e. Lrtap)
What should happen
Acronym should be left as is, whatever else is changed.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Brynjulf_Ottar&diff=prev&oldid=1102403203
We can't proceed until
Feedback from maintainers


Robert Koch-Institut is not the name of an author

Status
{{fixed}} by adding Institut to non-human list
Reported by
ElLutzo (talk) 21:48, 7 August 2022 (UTC)
What happens
Robert Koch-Institut is not the name of an author
Relevant diffs/links
[11]
We can't proceed until
Feedback from maintainers


Nor is the cited source a journal...

Trappist the monk (talk) 22:05, 7 August 2022 (UTC)

Error processing the List of new religious movements

Status
{{
notabug
}}
Reported by
Guarapiranga  22:19, 7 August 2022 (UTC)
What happens
Running CITEBOT on List of new religious movements produces a long error msg with # # # CITATION_BOT_PLACEHOLDER_TEMPLATE n # # # repeated many times. Is this bc of Harvard refs?
Relevant diffs/links
https://citations.toolforge.org/process_page.php?edit=toolbar&slow=1&page=List_of_new_religious_movements
We can't proceed until
Feedback from maintainers


Fixed the wiki-problems. https://en.wikipedia.org/w/index.php?title=List_of_new_religious_movements&type=revision&diff=1103126764&oldid=1102985631 BUT a topic expert needs to look over and probably double check. AManWithNoPlan (talk) 12:13, 8 August 2022 (UTC)

The BBC is not a newspaper

Status
Red X Won't fix, just random zotero oddity
Reported by
BrownHairedGirl (talk) • (contribs) 05:36, 7 August 2022 (UTC)
What happens
bare URL filled as {{cite news | url=https://www.bbc.com/russian/news-62419765 | title=Леонид Кучма: "Путин хотел уничтожить Украину, а получит наше второе рождение" | newspaper=BBC News Русская Служба }}
What should happen
use |work=: {{cite news | url=https://www.bbc.com/russian/news-62419765 | title=Леонид Кучма: "Путин хотел уничтожить Украину, а получит наше второе рождение" | work=BBC News Русская Служба }}
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=2003_Tuzla_Island_conflict&diff=prev&oldid=1102841451->
We can't proceed until
Feedback from maintainers


Yeah, only a minor thing. I am mostly puzzled about how the bot came to do this, because it's so odd that it suggests the possibility of a wider glitch in how the bot decides that a URL is that of a newspaper. --BrownHairedGirl (talk) • (contribs) 05:36, 7 August 2022 (UTC)

It is Citoid: [ { "key": "4HM5G9UK", "version": 0, "itemType": "newspaperArticle", "creators": [], "tags": [ { "tag": "Война России с Украиной", "type": 1 }, { "tag": "Украина", "type": 1 } ], "title": "Леонид Кучма: \"Путин хотел уничтожить Украину, а получит наше второе рождение\"", "publicationTitle": "BBC News Русская служба", "url": "https://www.bbc.com/russian/news-62419765", "abstractNote": "Экс-президент Украины Леонид Кучма встретил российское вторжение у себя дома в Киеве и уезжать не захотел. BBC News Украина удалось взять у него первое и пока единственное интервью за время войны.", "language": "ru", "libraryCatalog": "www.bbc.com", "accessDate": "2022-08-08", "shortTitle": "Леонид Кучма" } ]

caps: Séries A et B

Status
 Fixed
Reported by
b
}
23:53, 6 August 2022 (UTC)
What should happen
[12]


Caps: The Newspaper for IT Leaders

Status
{{fixed}}
Reported by
b
}
10:58, 11 August 2022 (UTC)
What should happen
[13]
We can't proceed until
Feedback from maintainers


Hi

Hi bot i give you a barnstar 😁 Einahr (talk) 11:23, 12 August 2022 (UTC)

Citation bot tries to fetch from known dead URLs

Status
Red X Won't fix, since people do not use template correctly
Reported by
Guy Harris (talk
) 00:53, 3 August 2022 (UTC)
What happens
If Citation Bot sees a template with archive-url= and a url-status other than live, it tries fetching using the url= value, even though the url-status indicates that it's not working.
What should happen
Unless there's url-status=live, if there's an archive-url parameter, it shouldn't bother looking at what url= specifies.
Relevant diffs/links
VAX
Replication instructions
Run Citation Bot in thorough mode on VAX, and note that there is a "could not resolve" message for http://www.openvms.compaq.com/doc/73final/4515/4515pro_020.html#index_x_790, the reference for which has an archive-url link and does not have url-status=live. Other pages have gotten timeouts for dead URLs with archive-url and no url-status=live.
We can't proceed until
Feedback from maintainers


The citation that matches that URL has no url-status flag set. Neither does the the other citation to openvms.compaq.com. Do you get the same error if you set |url-status=dead? Sideswipe9th (talk) 01:01, 3 August 2022 (UTC)
First not flagging a url as live does not mean the url is dead. Second, even if it was flagged dead, websites often from back to life, so in thourough mode, it should definitely check the url if it can.
b
}
01:41, 3 August 2022 (UTC)
"First not flagging a url as live does not mean the url is dead" The documentation for those parameters is not of the highest quality.
Help:Citation Style 1 says:
  • url-status: To change the order with the title retaining the original link and the archive linked at the end, set |url-status=live
...
When the original URL has been usurped for the purposes of spam, advertising, or is otherwise unsuitable, setting |url-status=unfit or |url-status=usurped suppresses display of the original URL (but |url= and |archive-url= are still required).
and Template:Cite web says:
By default, if "archive-url" is used, the parameter |url-status=dead is assumed and the resulting main link is to the archived version:
and Template:Citation Style documentation/url says of url-status:
this optional parameter is ignored if archive-url is not set. If omitted, or with null value, the default value is |url-status=dead.
(and also mentions a "deviated" value for url-status).
So either not flagging a url as live does mean it's dead, if there's an archive-url parameter, or the documentation needs to be fixed to indicate that omitting url-status isn't the same as saying url-status=dead.
Guy Harris (talk
) 02:05, 3 August 2022 (UTC)
"Second, even if it was flagged dead, websites often from back to life, so in thourough mode, it should definitely check the url if it can." ...but not complain if the check fails, because it was warned that it's (probably) dead). ) 02:09, 3 August 2022 (UTC)
So either not flagging a url as live does mean it's dead No it doesn't. It's an assumption the template makes as far as the presentation of links go. It does not mean the URL is dead. Plenty of people simply archive links preemptively, but don't bother as flagging the url as live, mostly because it's rather pointless to do so.
...but not complain if the check fails The bot doesn't a complain about it, it reports that the check failed.
> Could not resolve URL http://www.openvms.compaq.com/doc/73final/4515/4515pro_012.html#basic_arch_sect
The message is in gray, because it's not a very important thing to report. Unlike messages that are in red/orange/other colors. ce
b
}
03:00, 3 August 2022 (UTC)

Just a heads up that my little handy dandy guide on Citation bot finally got in the Signpost, as intended years ago. Feel free to leave a comment (or make suggestions for other guides on different topics).

b
} 20:04, 1 August 2022 (UTC)

Thank you. AManWithNoPlan (talk) 00:05, 2 August 2022 (UTC)
Just read it, and it is good. AManWithNoPlan (talk) 00:14, 2 August 2022 (UTC)
Good work, @Headbomb. And it's probably just as well that this fine guide did not appear sooner, 'cos until Citation bot got a massive amount of extra capacity a few months ago, the result of the extra attempts to use CB would ave been a lot of frustrated editors as requests timed out. BrownHairedGirl (talk) • (contribs) 07:09, 2 August 2022 (UTC)
Thanks for the words, though
b
} 07:35, 2 August 2022 (UTC)

RfC: Should you use cite web, or cite magazine, or cite news?

@Sideswipe9th: Please don't hold RfCs in user talk space (the practice of holding RfCs to discuss user conduct ceased some years ago). This matter should be discussed at Help talk:Citation Style 1. --Redrose64 🌹 (talk) 21:29, 1 July 2022 (UTC)
@Redrose64: I had asked previously where to hold this and there was an even split between this talk page, and the CS1 talk page being the appropriate venue. I'm not opposed to moving it, if it was to be relaunched, so to speak, with the same question, would copy and pasting what has already been discussed and !voted on be acceptable?
I'm also not opposed to using this opportunity to address the concerns raised by both @XOR'easter and David Eppstein:, which may result in a different question being asked however. I'd just like to check what the options are. Sideswipe9th (talk) 21:41, 1 July 2022 (UTC)
I've moved the whole RfC unchanged, apart from a slight adjustment to the section heading and also omitting the two posts above. --Redrose64 🌹 (talk) 08:14, 2 July 2022 (UTC)

RFC closed

The RFC has been closed[14] by @ScottishFinnishRadish:

There is a strong consensus for Citation bot to use {{cite news}} and {{cite magazine}} in cases where online content doesn't appear in a print edition of a publication.

It is helpful to have this community endorsement of Citation bot's good work. This outcome was clearly foreseeable, but it is a relief that the saga is finally over.

However, I remain very sad that so much time and effort was spent in resolving the complaints of the 12 Angry Marvelites who created the drama. Their refusal to listen to more experienced editors led to this vast time-sink, and their aggressive rudeness poisoned the atmosphere. I hope that the 12 Angry Marvelites will reflect on their conduct, and pursue andy further concerns without the groupthink and assumptions of bad faith which characterised their approach to this issue.

And thanks to @Sideswipe9th for their hard work in preparing the RFC. I wish that the RFC had not been needed, but when the 12 Angry Marvelites refused to listen, it was the only way to end the drama. --BrownHairedGirl (talk) • (contribs) 07:35, 2 August 2022 (UTC)

Thank you all. AManWithNoPlan (talk) 12:52, 3 August 2022 (UTC)
Some of the people on other side to you seem more experienced with the websites involved with dispute which is more relevant, even though you seem to be more experienced with references. No need to be uncivil, we were acting in good-faith, don't believe we were rude, this is not the place to discuss user conduct. Indagate (talk) 13:34, 3 August 2022 (UTC)
There is nothing uncivil in pointing out the time wasted by this drama. The rudeness is well-documented, and even extended to falsely accusing the bot of vandalism, an outrageous allegation which was repeated even after the rude Marvelite was pointed to
WP:NOTVAND
. Not one of the tag-teaming 12 Angry Marvelites objected to that smear, but instead I was attacked for asking for a retraction. And Indagate may not have been actually rude, but it was not civil for Indagate to waste the time of other editors by a silly claim that 9 to 8 amounted to a consensus for their view, and it was not civil for Indagate to falsely claim experience which they actually lacked.
I still hope that the Angry Marvelites may learn from this saga, and raise any further concerns more colegially. BrownHairedGirl (talk) • (contribs) 20:04, 3 August 2022 (UTC)
@BrownHairedGirl: I have no desire to get into this again, but I'd like to note a few things:
  • This RfC was not a "waste of time" to address our concerns. The lack of consensus in the prior discussion left us with no other option to resolve the dispute, unless you would have preferred that the discussion drag on forever and Citation bot's edits continue to be reverted, or the bot be blocked outright on a large number of pages. I too am deeply saddened that you continue to regard our concerns as petty and insignificant.
  • Just because a group of editors has more "experience" does not mean the opinion of editors with less experience do not matter. We did not refuse to listen to you, we provided very clear counterclaims (magazine websites have different content, etc.) and questioned why Citation bot should continue making these changes despite there being no consensus at the discussion. Do you recall what response we received? "Online magazines are magazines." (Not the point, we're talking about magazine websites.) "There is consensus because most edits are not reverted." (We're talking about consensus in the discussion, not in CB's edits.) And for the record, some of the editors who disagreed with you had comparable levels of experience, so your repeating that you have "more experience" is beginning to feel condescending.
  • Darkwarriorblake is not a "Marvelite". I've almost never seen them pop up on a Marvel-related article, nor are they part of our taskforce.
  • it was not civil for Indagate to waste the time of other editors by a silly claim that 9 to 8 amounted to a consensus for their view – 9 to 8? Really? Unless my counting is significantly off, the participants of the prior discussion were pretty evenly split. If it really were 9 to 8 I wouldn't have called for an RfC, because consensus would have been clear in that case.
  • You continue to admonish us for being rude and uncivil, yet you're once again attacking Indagate, who did nothing wrong. Yes, maybe they shouldn't have claimed experience based on their account age, but that's not the definition of "uncivil", which implies malice. Mind retracting that statement?
To repeat, I do not want another lengthy debate, so I won't reply further. Also, I'd like to thank Sideswipe9th for their efforts as well. But I'll close my comment with this: while it's totally appropriate to reflect on the results of the RfC, do you really think it's appropriate to dance on someone's grave with this patronizing subsection of yours? InfiniteNexus (talk) 05:56, 4 August 2022 (UTC)
@InfiniteNexus: there is no grave to dance on. Nobody has died or been banned or blocked.
Your choice to reply with absurdly dramatic hyperbole is another reminder of how unpleasant the whole saga was made by the 12 Angry Marvelites. And no, I will not alter my view that making false claims to try to sway a discussion is uncivil. BrownHairedGirl (talk) • (contribs) 07:14, 4 August 2022 (UTC)

Pages vs Issue vs at vs ...

Status
 Fixed with recognition of () and []
Reported by
SteveMcCluskey (talk) 16:40, 31 July 2022 (UTC)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Aberration_(astronomy)&diff=1101370619&oldid=1101370242
We can't proceed until
Feedback from maintainers


The edit at Aberration_(astronomy) changed the pages= entry to the issue number (e.g., A91), removing the page range (e.g. 1-6) of the article. This happened several times. I recently corrected the pages= entries.

You're actually reintroducing errors the bot is fixing (full proper fix is here, there were some GIGO issues). A&A and similar journals use an 'article ID' instead of a page number. This is what goes in |page= and how people cite these journals. There are no issue numbers for those.
b
}
17:07, 31 July 2022 (UTC)
I'm puzzled by this usage. There are clearly page numbers in the pdf version of the article; where should they be provided if not in the page(s)= entry of the citation template. The article ID seems analogous to an issue number. Could you point me to appropriate reference supporting this strange usage of the citation template. --SteveMcCluskey (talk) 18:08, 31 July 2022 (UTC)
OK, I see that you're using the page= value to place the ArticleID without parentheses, which are used for issue numbers. It's a bit of a kludge but it reproduces the style used in AandA. However, it leaves no place for the page numbers (which this historian of science finds standard for a complete citation). I'm not going to make a big deal of it. --SteveMcCluskey (talk) 18:31, 31 July 2022 (UTC)
So when I tag those citations with {{Page needed}}, what is the suggested remedy? SamuelRiv (talk) 18:51, 31 July 2022 (UTC)
Don't misuse parameters to hold something that they are not intended to hold. An article number is not an in-source location so |page=, |pages=, and |at= are not the right place for an article number. cs1|2 does not have any support for article numbers. I think that there has been some discussion at Help talk:Citation Style 1 but never to the point of action. Quite often I see article numbers in |number= (an alias of |issue=) which, really, is also incorrect. I suppose that if you must include an article number |number= is the best place for it pending some decision to add support for article numbers – but don't be surprised when someone like me comes along and removes it.
Trappist the monk (talk) 19:07, 31 July 2022 (UTC)
Perhaps use the |id= parameter. --Redrose64 🌹 (talk) 22:41, 31 July 2022 (UTC)
|id= is for bibliographic identifiers like {{
b
} 22:46, 31 July 2022 (UTC)
Trappist has already stamped on |at=. I'm trying to find a parameter which would not be misused. --Redrose64 🌹 (talk) 23:33, 31 July 2022 (UTC)
I think Trappist is wrong. For journals that identify articles by article numbers rather than by page numbers, the article number must be included in the citation, in the position that the page numbers would go for page-number-using journals, and page= or at= are the only ways to put it there. id= is no good because it puts the key article-identifying information in the wrong part of the citation. —David Eppstein (talk) 23:50, 31 July 2022 (UTC)
Trappist is wrong indeed.
b
}
00:15, 1 August 2022 (UTC)
I've been using |id= for press release numbers but reading this I think that may have been wrong. Perhaps I should have been using |number= instead, but there is no documentation to support this. Hawkeye7 (discuss) 01:26, 1 August 2022 (UTC)
Yes. Article numbers go in the same slot as page numbers; the journals themselves even do it that way (Philosophical Transactions of the Royal Society springs to mind as an example that I've seen recently).
talk
) 16:09, 1 August 2022 (UTC)
I'm pleased to see that I wasn't the only editor disturbed by changes involving article IDs. Articles with article IDs also have page numbers within the article. There should be a separate place for both within the citation template. It's beyond my abilities but those involved in maintaining the citation templates should be asked to deal with this new citation style, which seems to be proliferating in the science journals. --SteveMcCluskey (talk) 21:06, 1 August 2022 (UTC)
Standard practice is to use brackets |page=A9 [17] if you want to refer to a specific page/page range.
b
}
21:10, 1 August 2022 (UTC)
Another convention I've seen (working better when the article numbers are short) is to use artnum:firstpage–artnum:lastpage as the range of pages, so Headbomb's example would become |pages=A9:1–A9:17. That's how they're numbered by LIPIcs (an open-access publisher of many computer science conference proceedings), for instance. —David Eppstein (talk) 07:22, 4 August 2022 (UTC)

Suggestion: convert Google books URLs to GBurl templates

Status
Red X Won't fix, since I think this would require a very strong discussion that will result in pitchforks
Reported by
RDBrown (talk) 10:04, 11 August 2022 (UTC)
What happens
Google books URLs have changed
We can't proceed until
Feedback from maintainers


Instead of modifying Google books URLs, use the GBurl template where the search parameters are encompassed by the template. This change may need an RFC. Possibly the Citer tool (python) may provide more documentation on the parameters of the Query part of the URL, for which there doesn't seem to be easily findable documentation. In *Template.php:expand_by_google_books_inner* the current code doesn't seem to cope with the munged title now being part of the string, so that may be worth enhancing.

      if (preg_match("~^https?://www\.google\.(?:[^\./]+)/books/edition/_/(.+)$~", $url, $matches)) {

Here's some perl snippets since I'm not PHP literate.

use URI;
use URI::Split qw(uri_split uri_join);

LINE:
while (<>) {
    chomp;
    next if /^$/;
    if (/^\s*#/) {
	print $_, "\n";
	next;
    }
    my($uri) = URI->new($_);
    my($sch, $aut, $pth, $qry, $frg) = uri_split($uri);
    if (defined($sch) && ($sch eq 'http' || $sch eq 'https')
    && defined($aut) && defined($pth)) {
	my($template);
	substr($aut, -3, 3) = '' if substr($aut, -3) eq ':80';
...
	if ($aut =~ /^www\.google\.c(?:a|o(?:m|m\.au|\.uk|\.in))$/
	&& $pth =~ m|^/books/edition/[^/]+/([-A-Za-z0-9_]{12})$|) {
	    my($gb_arg) = $1;
	    foreach my $gb_restrict (split('&', $qry)) {
		if ($gb_restrict =~ /^(pg|dq)=/) {
		    if (substr($gb_restrict, 0, 5) eq 'pg=PA') {
			$gb_arg .= '|p=' . substr($gb_restrict, 5);
		    }
		    else {
			$gb_arg .= '|' . $gb_restrict;
		    }
		}
		elsif ($gb_restrict =~ /^bsq=(.+)/) {
		    $gb_arg .= '|dq=' . $1;
		}
		# Safe to skip: hl= (Normally en), gbpv=
		# printsec= - skip only if there's a pg=
		# Shouldn't convert if there's anything else?
	    }
	    $template = "{{GBurl|$gb_arg}}";
	}
---
Example URLs
https://www.google.com.au/books/edition/D%C3%BCsenj%C3%A4ger/r_cuAAAACAAJ?hl=en
https://www.google.com.au/books/edition/%EC%8A%A4%ED%83%80%ED%83%80%EC%9D%B4%EB%93%9C_%EB%9D%BC%EC%9D%B4%EC%A7%95_2_%EC%96%91%EC%9E%A5%EB%B3%B8_Ha/Cn_9jgEACAAJ?hl=en
https://www.google.com.au/books/edition/Machiavelli_The_Prince/05R7kYOKD0cC?hl=en&gbpv=1&dq=Cambridge%2520University&pg=PT12&printsec=frontcover
https://www.google.com.au/books/edition/Machiavelli_The_Prince/05R7kYOKD0cC?hl=en&gbpv=1&pg=PR4&printsec=frontcover&dq=Cambridge
https://www.google.com.au/books/edition/Machiavelli_The_Prince/05R7kYOKD0cC?hl=en&gbpv=1&pg=PT12&printsec=frontcover&dq=Cambridge%20University
https://www.google.com.au/books/edition/The_Existential_Graphs_of_Charles_S_Peir/Q4K30wCAf-gC?hl=en&gbpv=0
https://www.google.com.au/books/edition/The_Existential_Graphs_of_Charles_S_Peir/Q4K30wCAf-gC?hl=en&gbpv=1&pg=PA113&printsec=frontcoverG
https://www.google.com.au/books/edition/The_Golden_Enclaves/7kRYEAAAQBAJ?hl=en
https://www.google.com.au/books/edition/When_Computers_Went_to_Sea/Mi8MhzheOokC?hl=en&gbpv=0
https://www.google.co.uk/books/edition/When_Computers_Went_to_Sea/Mi8MhzheOokC?hl=en&gbpv=0
https://www.google.co.in/books/edition/Conflict_and_Conquest_in_the_Islamic_Wor/jBBYD2J2oE4C?hl=en&gbpv=1
https://www.google.com/books/edition/Playwriting_for_Profit/jhwLAAAAMAAJ?hl=en&gbpv=1&bsq=%22show%20not%20tell%22
https://www.google.com/books/edition/_/A-c7AAAAIAAJ?hl=en&gbpv=1&pg=PA419&dq=%27Barnstead%20charter%27 Provincial
https://www.google.com.au/books/edition/Provincial_and_State_Papers/A-c7AAAAIAAJ?hl=en&gbpv=1&dq='Barnstead%20charter'%20Provincial&pg=PA419&printsec=frontcover
https://www.google.ca/books/edition/Principes_d_exp%C3%A9ditive_fran%C3%A7aise_pour/FgVJ55_weywC?&gbpv=0

{{GBurl|r_cuAAAACAAJ}}
{{GBurl|Cn_9jgEACAAJ}}
{{GBurl|05R7kYOKD0cC|dq=Cambridge%2520University|pg=PT12}}
{{GBurl|05R7kYOKD0cC|pg=PR4|dq=Cambridge}}
{{GBurl|05R7kYOKD0cC|pg=PT12|dq=Cambridge%20University}}
{{GBurl|Q4K30wCAf-gC}}
{{GBurl|Q4K30wCAf-gC|p=113}}
{{GBurl|7kRYEAAAQBAJ}}
{{GBurl|Mi8MhzheOokC}}
{{GBurl|Mi8MhzheOokC}}
{{GBurl|jBBYD2J2oE4C}}
{{GBurl|jhwLAAAAMAAJ|dq=%22show%20not%20tell%22}}
{{GBurl|A-c7AAAAIAAJ|p=419|dq=%27Barnstead%20charter%27%20Provincial}}
{{GBurl|A-c7AAAAIAAJ|dq='Barnstead%20charter'%20Provincial|p=419}}
{{GBurl|FgVJ55_weywC}}

No, this is a bad idea, especially for a bot. --Izno (talk) 01:53, 15 August 2022 (UTC)

Postscript again

Status
{{
notabug
}}, please put outside of template
Reported by
– Arms & Hearts (talk) 16:45, 5 August 2022 (UTC)
What happens
Citation bot removes a necessary non-breaking space from the |postscript= parameter in {{cite journal}} and {{cite web}}
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Whites,_Jews,_and_Us&diff=1102312419&oldid=1098444133
We can't proceed until
Feedback from maintainers


This was marked as fixed in February 2020 – either that was in error or it's subsequently been broken again. – Arms & Hearts (talk) 16:45, 5 August 2022 (UTC)

|postscript= specifies the citation's terminal character. For cs1 templates, the default character is a dot; for cs2 templates, the default is no terminal punctuation. When the value assigned to |postscript= has more than one character, Module:Citation/CS1 emits a CS1 maint: postscript message and adds the article to Category:CS1 maint: postscript. At Whites, Jews, and Us § External links, both cs1 templates have |postscript=&nbsp;– which Module:Citation/CS1 sees as 7 characters. The exception to this one-character limit is the keyword none which suppresses the normal cs1 terminal dot (not allowed in cs2 templates because redundant). Consider writing:
{{cite ... |postscript=none}}&nbsp;– <descriptive text>
for cs2 templates:
{{citation ... }}&nbsp;– <descriptive text>
Trappist the monk (talk) 19:01, 5 August 2022 (UTC)
Thanks Trappist the monk, now fixed in the article as per. I see that the documentation for {{cite journal}} and {{cite web}} both say Additional text, or templates that render more than a single terminating punctuation character, will generate a maintenance message, but that doesn't seem to have happened here. – Arms & Hearts (talk) 12:17, 6 August 2022 (UTC)
Are you sure? When I look at this citation in your revert of the bot edit, I see both maintenance messages. Remember that maintenance messages are hidden by default. Editors who wish to see the maint messages must enable them (see Help:CS1 errors § Controlling error message display).
You came to this discussion with the complaint that the bot [removed] a necessary non-breaking space yet in this edit you did the same thing?
Trappist the monk (talk) 13:13, 6 August 2022 (UTC)
I didn't realise maintenance messages were opt-in – from the documentation I assumed it would be like the red "Text 'Foo' ignored" and similar messages that I see without having to enable anything. Re the space, it only needed to be a non-breaking space while in the postscript parameter, as a leading space is ignored otherwise. If it's not in the template a normal space is fine (but per
MOS:DASH en dashes still need to be spaced). – Arms & Hearts (talk
) 14:19, 6 August 2022 (UTC)
cs1|2 emits preview messaging in the preview-warning box at the top of a previewed page which advertises the existence of cs1|2 error and maintenance messages and links to the instructions that describe how to display or hide those messages.
The
MOS:DASH
says (in part):
Ideally, use a non-breaking space before the en dash, which prevents the en dash from occurring at the beginning of a line (markup: {{
snd
}}
or &nbsp;&ndash;)
There is no reason to believe that a citation in Whites, Jews, and Us § External links won't wrap at a place that would place the en dash at the beginning of a new line; where a wrap occurs depends on user screen size, window size, font size, zoom level, ...
Trappist the monk (talk) 14:57, 6 August 2022 (UTC)
There are sometimes good reasons not to use non-breaking spaces, i.e. to minimise potentially confusing and easily breakable code for the benefit of newer editors using the source editor. I assume that's why that bit of the guideline's framed as it is ("ideally") and isn't that commonly adhered to. But no objection to using them in this case. – Arms & Hearts (talk) 15:42, 6 August 2022 (UTC)

Caps: før

Status
 Fixed
Reported by
b
}
04:35, 17 August 2022 (UTC)
What should happen
[15]


Adds broken DOI

Status
Red X Not a bug
Reported by
– Arms & Hearts (talk) 20:36, 13 August 2022 (UTC)
What happens
The bot adds a DOI that's broken and leads nowhere.
What should happen
The bot should refrain from adding DOIs it knows don't take the reader to the source cited.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Adam_Curle&curid=4379273&diff=1104124822&oldid=1102574848


This has been discussed in the past, and there was a very strong agreement that adding these was a good idea. The DOI can be found via google, etc. and that makes it a useful identifier, even if broken. Also, it encourages people to report and get them fixed. Note that the dx.doi.org reporting method does not necessarily work often, since the DOI might be owned by a different company than the journal owner. I have actually reached out to various publishers and gotten many of them fixed. AManWithNoPlan (talk) 13:08, 15 August 2022 (UTC)

We do remove/block some DOIs, such as the Angle Orthodontist DOIs, since the old ones will never be registered (I have verified that with the publisher via email). AManWithNoPlan (talk) 13:39, 16 August 2022 (UTC)

Cleanup of date

Status
Red X Won't fix because risk of false positives with Issues that are the date, and it is too rare
Reported by
b
}
22:31, 8 June 2022 (UTC)
What should happen
[16]
We can't proceed until
Feedback from maintainers


Will do, if not book, and if no existing |date=, and id year matches any existing |year=. Now to right the code, and unit tests. AManWithNoPlan (talk) 11:35, 13 June 2022 (UTC)

Please make this work only with the |volume= parameter, and not |title=.
A date in the title may be genuinely part of the title, e.g. "Hospital opening delayed to August 2023". --BrownHairedGirl (talk) • (contribs) 14:48, 13 June 2022 (UTC)
I am still not sure how to distinguish dates and Issue numbers/names that are dates. AManWithNoPlan (talk) 18:55, 22 June 2022 (UTC)
Issue is for issue number, not the issue date/issue name.
  • Smith, J. (2007). "Title". Magazine. No. Summer 2007. p. 23.
should be converted to
  • Smith, J. (Summer 2007). "Title". Magazine. p. 23.{{cite magazine}}: CS1 maint: date and year (link)
Same in cite journal, etc.
b
}
04:31, 23 June 2022 (UTC)
Please remove |year= if you add |date= in this case. Izno (talk) 05:15, 23 June 2022 (UTC)
It we take time to get these right, so I will program in tests before implementing it.AManWithNoPlan (talk) 13:17, 24 June 2022 (UTC)
How common is this problem? AManWithNoPlan (talk) 13:13, 12 August 2022 (UTC)

Double slashes in DOIs

Status
 Fixed
Reported by
b
}
13:28, 18 August 2022 (UTC)
What should happen
doi with // search reveals several DOIs with // after their prefix.

The vast majority of those are functional if you remove the second slash.

Citation bot should attempt to cleanup these DOIs. Remove the slash, if still active, it's a good fix. If it doesn't resolve, bad fix and keep the original DOI.

b
} 13:28, 18 August 2022 (UTC)


I will look into doing that. I have run across at least one DOI where the first character in the suffix is a slash. Absolutely nuts. AManWithNoPlan (talk) 13:47, 18 August 2022 (UTC)

I have fixed this and am now running the bot over those pages. Here is a better REGEX. Note that the difference is that it first limits to pages with the phrase "doi" on them. This shrinks the number of pages to REGEX so you get more hits. https://en.wikipedia.org/w/index.php?title=Special:Search&limit=500&offset=0&ns0=1&ns118=1&search=insource%3Adoi++insource%3A%2F10%5C.%5B0-9%5D%5B0-9%5D%5B0-9%5D%5B0-9%5D%5B0-9%5D%3F%5C%2F%5C%2F%2F. AManWithNoPlan (talk) 14:18, 18 August 2022 (UTC)

403 unauthorized

Status
 Fixed
Reported by
Jc3s5h (talk) 15:45, 19 August 2022 (UTC)
What happens
adds false title
What should happen
nothing
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Identity_document&curid=364578&diff=1105152453&oldid=1104207073


The site in question seems to have some kind of browser validation. Perhaps it doesn't like Citation bot and directs it to a different page than normal users. Jc3s5h (talk) 15:45, 19 August 2022 (UTC)

Thank you for reporting. Interesting that this is the "title" for the page, since the page sends that as the valid title instead of sending an actual 403 code. AManWithNoPlan (talk) 17:18, 19 August 2022 (UTC)

Suggestions

Is it possible to add ghostarchive.org and archive.ph alongside the current website/(s) this bot uses? Jenkowelten (talk) 08:35, 19 August 2022 (UTC)

 Fixed. Hopefully, no CAPTCHA or such will block us. AManWithNoPlan (talk) 13:27, 19 August 2022 (UTC)

Adding a blank "title" parameter

Status
 Fixed
Reported by
Centcom08 (talk) 21:49, 19 August 2022 (UTC)
What happens
Bot adds a blank "title" parameter to citations with an existing "script-title" and "trans-title" parameters already
Relevant diffs/links
1, 2


|script-title= was not noticed and so a title was added, but then it was removed for being basically the same. AManWithNoPlan (talk) 23:11, 19 August 2022 (UTC)

Shouldn't NYT render as newspaper, and be linked, instead of work (with no link)? (bug or improvement?)

Status
Red X Not a bug/Red X Won't fix too risky
Reported by
Guarapiranga  01:48, 1 August 2022 (UTC)
What happens
When generating citations,
NYT is classed as work instead of newspaper (e.g.[1]
)
What should happen
e.g.[2]
We can't proceed until
Feedback from maintainers


Whether to link a work is a page-specific choice. As for work versus newspaper, |work= is (effectively) an alias of the other without possibly mis-stating what the value of the parameter actually is. In CS1, it is sufficient. (In CS2, it would require a bit more work than to use |work= in general.) --Izno (talk) 04:14, 1 August 2022 (UTC)

I see.
Whether to link a work is a page-specific choice.
What does that choice typically depend on? Guarapiranga  07:50, 1 August 2022 (UTC)
Couldn't say since I feel no inclination to do so in my work, but either way it's not an appropriate bot action. :) Izno (talk) 15:54, 1 August 2022 (UTC)
Can you point me to any policy on this,
WP:REF. — Guarapiranga 
05:57, 2 August 2022 (UTC)
b
} 05:59, 2 August 2022 (UTC)
"this"? That it is a per-page matter?
WP:DUPLINK is a good start; Citations stand alone in their usage, so there is no problem with repeating the same link in many citations within an article; e.g. |work=The Guardian. However, this is in the overall context of "don't make duplicate links", so editors should have local control over which works to link in citations, and whether to link them in multiple or not at all. Izno (talk
) 06:01, 2 August 2022 (UTC)
There's also collected bibliographies (structured lists of works cited, usually only the first instance is linked) vs unstructured lists (where often all instances are linked). The bot can't distinguish between these.
b
}
06:05, 2 August 2022 (UTC)
editors should have local control over which works to link in citations
Absolutely, but is there a downside to CITEBOT suggesting notable newspapers, magazines, news channels, etc, to be linked by default? —
Guarapiranga  09:20, 2 August 2022 (UTC)
Violations of
b
} 12:00, 2 August 2022 (UTC)
Are you referring to this,
RS sense of source, not the actual cited work), when generating a new one. Whether this should be included in a requested run, is another question. I'd say yes, if there are other improvements, similarly to what the bot does renaming parameters (from |first= and |last= to |first1= and |last1=, for example). — Guarapiranga 
03:41, 12 August 2022 (UTC)
Linking the source is a change in citations. People got blocked for edit warring over it. This is not a good task to be done by bots.
b
}
05:50, 12 August 2022 (UTC)
I'm talking about the citation tool in the editor. Am I asking at the wrong place? — Guarapiranga  22:48, 13 August 2022 (UTC)
Yes, you are. Izno (talk) 01:53, 15 August 2022 (UTC)
(Which doesn't make your suggestion doable or desirable in that context either.) Izno (talk) 01:53, 15 August 2022 (UTC)

recognize Foobar and Barfoo / Foobar & Barfoo as equivalent in journal/publisher

Status
Mostly  Fixed, will remove when adding journal
Reported by
b
}
20:26, 19 August 2022 (UTC)
What should happen
[17]
We can't proceed until
Feedback from maintainers


Same for Foobar / The Foobar.

b
} 20:27, 19 August 2022 (UTC)

Bot removed archive-url

Status
 Fixed
Reported by
BrownHairedGirl (talk) • (contribs) 10:59, 21 August 2022 (UTC)
What happens
Bot removed the URL from malformed parameter: |archive-url|https://web.archive.org/web/20041022022448/http://www.constancedemby.com/spain_f.html|archive-url
What should happen
nothing.
Or ideally, replace the pipe character with an equals sign: |archive-url| with |archive-url=
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Constance_Demby&diff=prev&oldid=1105704081


Yes, this is

GIGO, and this sort of junk is tricky to handle. But I think that it might be possible to fix this particular piece garbage rather than making things worse. --BrownHairedGirl (talk) • (contribs
) 10:59, 21 August 2022 (UTC)

Surprising that having an |archive-date= without an |archive-url= is not reported as a CS1 error. User-duck (talk) 12:54, 21 August 2022 (UTC)
The bot should not remove the URL. It is a separate malformed parameter. User-duck (talk) 12:54, 21 August 2022 (UTC)

Replacement characters are being put into citation titles.

Status
 Fixed == all pages cleaned up, including ones where the archives actually have rubbish titles to start with. Will watch that category
Reported by
User-duck (talk) 03:20, 21 August 2022 (UTC)
What happens
"replacement character in |title= at position ..."
We can't proceed until
Feedback from maintainers


About 200 new entries appeared in Category:CS1 errors: invisible characters. This is an example: International Young Design Entrepreneur of the Year

It apparently occurred with this edit: " curprev 23:03, August 19, 2022‎ Citation bot talk contribs‎ 22,440 bytes +778‎ Alter: title. | Use this bot. Report bugs. | Suggested by BrownHairedGirl | #UCB_webform 2196/3663 undo ". — Preceding unsigned comment added by User-duck (talkcontribs)

An actual diff would be more useful than copy-pasting the thing you see on your watchlist.
b
}
03:32, 21 August 2022 (UTC)
Is this what you want: https://en.wikipedia.org/w/index.php?title=International_Young_Design_Entrepreneur_of_the_Year&type=revision&diff=1105372455&oldid=1093163191 . "diff" means something different to a programmer. User-duck (talk) 12:44, 21 August 2022 (UTC)
PS: The title in the archive page is "Don’t trash this". The apostrophe is probably an extended version of a single quote. (My browser's "Find" no longer detects this difference.) User-duck (talk) 12:44, 21 August 2022 (UTC)
Ah dreaded replacement characters. Took me a while to figure out how to detect those. If you want my solution which is not pretty but is reliable let me know. PhP might have better support than my language does. -- GreenC 03:52, 21 August 2022 (UTC)
Even CrossRef gives us those sometimes. We accept a certain amout of them since often you know that f<<crap character>>r is some languge's word for for. I have a added a bunch of new code to detect character sets and such, so this might be fixed already. Depending upon when an archive was made, web.archive.org formats their html very differently. AManWithNoPlan (talk) 11:40, 21 August 2022 (UTC)
With archived pages, is the bot's new code smart enough to tell when the archive site itself and the page being archived use different character sets/encodings (and switch between charsets/encodings accordingly)? Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 13:34, 21 August 2022 (UTC)

There are over 300 articles in the category now, I have been fixing CS1 errors for some time now. This is the first time I have seen so many new articles in this category so I knew it was the result of a bot run. Either the bot has changed or the input to the bot is garbage. User-duck (talk) 12:44, 21 August 2022 (UTC)

A bunch more fixes implemented. AManWithNoPlan (talk) 18:35, 21 August 2022 (UTC)
Special bot run that fixes these. https://en.wikipedia.org/w/index.php?title=Minamib%C5%8Ds%C5%8D&diff=prev&oldid=1105777971 AManWithNoPlan (talk) 19:19, 21 August 2022 (UTC)

gadget

Status
Red X Not a bug - added to docs
Reported by
Gnuish (talk) 22:39, 24 August 2022 (UTC)
What happens
I tried to follow the instructions for enabling CitationBot, by checking its box in my personal Preferences and hitting Save. However, when I go to a Wikipedia page, hit Edit on the page, and then click the button for [<Checkmark> Citations], it immediately responds with a pop-up box saying "en.wikipedia.org Error: Citations request failed". So I tried logging out and back in again; same results. So I tried it from the left column "Expand citations" link. This produced an endless set of OAuth requests to allow the bot to edit pages. Whenever I approved one request, it would just pop up another. I think I let it go 10 times before I gave up.
Replication instructions
It was no trouble to reproduce it, just by following the instructions. I am surprised that this bot works for anybody. (See below, the cause is that the user's browser uses the NoScript extension.)
We can't proceed until
Feedback from maintainers


Ah -- my browser runs with NoScript enabled. The bot would not work until I enabled Javascript from citations.toolforge.org to run in my browser. After I enabled that in NoScript, it worked. This was not obvious from the documentation of how to enable CitationBot. Gnuish (talk) 22:39, 24 August 2022 (UTC)

Hebrew sources fault

Hello. I've noticed that the bot adds titles like ??????? ???? ??? when fetching Hebrew source titles (e.g. this). Not sure if this is a fault specific to Hebrew, but seen it on a few articles. Cheers, Number 57 20:53, 22 August 2022 (UTC)

https://en.wikipedia.org/w/index.php?title=User%3AAManWithNoPlan%2Fsandbox5&diff=prev&oldid=1106163921 Fixed. That was obscure PHP failure. AManWithNoPlan (talk) 12:38, 23 August 2022 (UTC)
If anyone finds an others, please report them. PHP is not 100% perfect on all character sets. I believe most of these have now been fixed. On rare occasions, the actual web archive has the the title of ???????? which usually means try to find/make a better archive. AManWithNoPlan (talk) 16:24, 23 August 2022 (UTC)

Replaces legit citations with Chinese links to porno sites

Status
Red X Not a bug
Reported by
120.18.222.26 (talk) 06:41, 25 August 2022 (UTC)
What happens
Legitimate references replaced by Chinese links to porn sites. Refer Reference #4 on the Keszthely_Synagogue page.
Relevant diffs/links
https://en.wikipedia.org/wiki/Keszthely_Synagogue


The bot did nothing that wasn't already there. The link still is there too.
b
}
08:01, 25 August 2022 (UTC)

Caps: NZ/N.Z.

Status
 Fixed
Reported by
b
}
14:54, 26 August 2022 (UTC)
What should happen
[18]


Caps: NY/N.Y.

Status
 Fixed
Reported by
b
}
15:03, 26 August 2022 (UTC)
What should happen
[19]


cite web and twitter

Status
{{
wontfix
}}
Reported by
Trappist the monk (talk) 15:30, 25 August 2022 (UTC)
What happens
Here is the history of a Twitter cite:
What should happen
What the bot should have done is something like this:
{{Cite web|url=https://twitter.com/matthew_d_green/status/893110747991093250|access-date=2021-08-24|website=Twitter|language=en|archive-date=2021-08-24|archive-url=https://web.archive.org/web/20210824174254/https://twitter.com/matthew_d_green/status/893110747991093250|url-status=live|author=Matthew Green |title=The pragmatist in me says "who cares, this will obviously work better if it has a few big nodes". The cypherpunk in me says "F that".}}
Matthew Green. "The pragmatist in me says "who cares, this will obviously work better if it has a few big nodes". The cypherpunk in me says "F that"". Twitter. Archived from the original on 2021-08-24. Retrieved 2021-08-24.
Or, perhaps even better, used {{cite tweet}}. (isn't there a bot that already converts {{cite web}} to {{cite tweet}}?)
We can't proceed until
Feedback from maintainers


Apparently Citation bot can get the author and the tweet so why didn't it use separate parameters |author=Matthew Green and |title=The pragmatist in me says ...? And where did … HTTPS://T.co/KOZNOLgyKk come from? That shortened url isn't part of the original or the archived tweets – the url doesn't point to anything of value (unless a poodle in a chair has value...). It is interesting to me that the author text in |title= has the non-English 'Twitterren' (Google translate thinks that it's Basque for 'on Twitter' which is what I normally see: |title=Matthew Green on Twitter: "The pragmatist in me says...". This odd use of 'Twitterren' is what provoked me to investigate the history of this citation.

Trappist the monk (talk) 15:30, 25 August 2022 (UTC)

This is from the web archive title. AManWithNoPlan (talk) 16:22, 25 August 2022 (UTC)

Caps: IFAC

Status
{{fixed}}
Reported by
b
}
11:38, 27 August 2022 (UTC)
What should happen
[20]
We can't proceed until
Feedback from maintainers


Removal of cite id params

Status
{{fixed}}
Reported by
Invasive Spices (talk), 27 August 2022 (UTC)
What happens
Removes |id= params leaving the field with broken params |id=..
What should happen
Don't remove |id= params. I don't think there is reason to do this in any case but certainly not in this case.
Relevant diffs/links
[21]
We can't proceed until
Feedback from maintainers


Remove url where identifier already exist for the same thing

The bot should treat |url=https://www.ncbi.nlm.nih.gov/labs/pmc/articles/PMC6077754/ the same ay as |url=https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6077754/
I.e. remove it from a {{cite journal}} if |pmc=6077754 is already present. Jonatan Svensson Glad (talk) 04:59, 31 August 2022 (UTC)

 Fixed AManWithNoPlan (talk) 15:36, 31 August 2022 (UTC)

Update arxiv bibcodes

Status
 Fixed and "tmp" ones also
Reported by
b
}
13:05, 28 August 2022 (UTC)
What should happen
[22] [23]
We can't proceed until
Feedback from maintainers


Any bibcode that starts with ####arXiv... should be updated when possible.

b
} 13:05, 28 August 2022 (UTC)

Okina replaced by apostrophe

Status
 Fixed
Reported by
GA-RT-22 (talk) 21:44, 30 August 2022 (UTC)
What happens
okina has been replaced by apostrophe
What should happen
okina is correct, it should not have been replaced by apostrophe
Relevant diffs/links
[24]


The original citation at [25] incorrectly uses a left single quotation mark U+2018 where okina U+02BB was intended. This was corrected when the citation was added to the article. The bot should have left this alone. GA-RT-22 (talk) 21:44, 30 August 2022 (UTC)

Mislabeling Associated Press and Reuters as a "work" rather than an "agency"

Status
 Fixed
Reported by
Dawnseeker2000 22:34, 25 June 2022 (UTC)
What happens
[26] Here, both AP and Reuters are changed from using the "agency" parameter to the "work" parameter.
  1. Usage of the "work" parameter is limited to news sources that are websites, newspapers, journals, or magazines. Agencies follow the formatting style for publishers.
  2. From the initial sentence of our Reuters article: "Reuters (/ˈrɔɪtərz/ (listen)) is an international news agency". See also that our article title is not italicized. That is correct and consistent with ref formatting for news agencies.
  3. From the initial sentence of our Associated Press article: The Associated Press (AP) is an American non-profit news agency. See also that our article title is not italicized. That is correct and consistent with ref formatting for news agencies.
What should happen
Recommend that Associated Press and Reuters be switched to use the "Agency" parameter.


Not a bug. See the template documentation.

Trappist the monk (talk) 23:03, 25 June 2022 (UTC)

What TMK is referring to is the statement "Do not use for sources published on the agency's own website; e.g. apnews.com or reuters.com; instead, use work or publisher". Using the work parameter presents in italics while publisher does not. I wonder why the indifference. I suggest we use the publisher parameter (again, for consistency with "agency" styling). Dawnseeker2000 23:25, 25 June 2022 (UTC)

I agree with Trappist. In these cases, the news agency is acting as a publisher rather than as an agency. Neutral on whether to use |work= or |publisher=. --BrownHairedGirl (talk) • (contribs) 13:03, 26 June 2022 (UTC)

When the publisher and work are the same, publisher is not usually used. AManWithNoPlan (talk) 14:29, 26 June 2022 (UTC)
Associated Press and Reuters are not works though. They are agencies/publishers, and should not be converted to works.
b
}
15:09, 26 June 2022 (UTC)
Review the template documentation. Realize accordingly that your comment here is in direct contradiction to your comments above about Cite magazine. Yes, these are the same exact issue. Izno (talk) 16:40, 26 June 2022 (UTC)
If this was directed at me... which contradiction? Dawnseeker2000 00:03, 27 June 2022 (UTC)
AP and Reuters are both news agencies, and news works in their own right. When citing the AP or Reuters website directly they are works. When citing a story on another news organisation's website that says that the reporting is from AP or Reuters, then they are acting as an agency. Sideswipe9th (talk) 17:06, 26 June 2022 (UTC)
When used as the name of a work (for an article directly from the AP web site), it should have |work=AP News or |work=Associated Press News. That is the name they give to that part of their site, that is, the work. It is incorrect to list |work=AP or |work=Associated Press. That is the name of the organization, not the name of their web site, and should appear in |publisher= or |agency= instead. —David Eppstein (talk) 19:51, 26 June 2022 (UTC)
Yes, that's what I'm saying. News outlets (regardless of media type) are considered publishers Dawnseeker2000 01:18, 27 June 2022 (UTC)

Dead links

Status
Red X Not a bug
Reported by
Nicolas.hammond (talk) 19:56, 6 September 2022 (UTC)
What happens
you removed dead links
What should happen
leaving them alone
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Joann_Glasson&type=revision&diff=1108877979&oldid=1102652393&diffmode=source


You need to look more closely at the edits. AManWithNoPlan (talk) 20:42, 6 September 2022 (UTC)

Caps: Tae Kwon Do Life Magazine

Status
 Fixed
Reported by
b
}
01:55, 7 September 2022 (UTC)
What should happen
[27]


creates bogus |title=value

Status
{{
wontfix
}}, since these are replacing "archived title", and so exiting data is bad, and new data is an improvement.
Reported by
Trappist the monk (talk) 13:40, 6 September 2022 (UTC)
What happens
yeah, I know, it's in <title>...</title>, but the bot should never add a title with a url to |title= in a cs1|2 template. Further, if a 'title' contains pipe characters (or any of its html equivalents) the bot should probably not use whatever it found in <title>...</title> because most of those things between pipes are not the title.
Relevant diffs/links
diff
We can't proceed until
Feedback from maintainers


titles added included website name

Status
{{
wontfix
}}, since the alternative is worse (not fixing the title), and figuring out the mapping from titles to work/website is too hard.
Reported by
Keith D (talk) 18:44, 7 September 2022 (UTC)
What happens
Adds the website to the end of the title field.
What should happen
In this case the website is already in the website field so just omit from title. If no website field create one and place detail there not in title.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Amr_Zaki&curid=4256847&diff=1109034329&oldid=1106336881
We can't proceed until
Feedback from maintainers


unneeded "issue=" field added

Status
{{fixed}} - will now drop volume
Reported by
UtherSRG (talk) 15:56, 15 September 2022 (UTC)
What happens
added an "issue=" field identical to the "volume=" field
What should happen
nothing
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Robinson%27s_mouse_opossum&diff=prev&oldid=1110459360
We can't proceed until
Feedback from maintainers


Job stalled, won't die

My category run (Category:Genes on human chromosome X) stalled at 11:54 this morning after this edit to item 218 of 542, Collagen, type IV, alpha 5. Around 15:10, I tried to kill the stalled job with https://citations.toolforge.org/kill_big_job.php, which responded with Existing large job flagged for stopping, but, as of now, nearly an hour and a half later, my attempts to start a new category run are still throwing up Run blocked by your existing big run.. Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 21:39, 15 September 2022 (UTC)

 Fixed both hung job and what killed it. AManWithNoPlan (talk) 16:53, 18 September 2022 (UTC)

Caps: N.Y. L. Sch. L. Rev.

Status
 Fixed
Reported by
b
}
20:44, 9 September 2022 (UTC)
What should happen
[28]


Another stalled job

Another one of my category runs's stalled (Category:Corrosion inhibitors this time), having made this edit to Benzotriazole (item 26 of 32); the reason I know it's stalled rather than simply having finished running through that category is that, when I attempted to start a new category run, it threw up Run blocked by your existing big run., which it continued to do even after I tried to kill the stalled job with https://citations.toolforge.org/kill_big_job.php (which obligingly came back with Existing large job flagged for stopping). Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 19:36, 21 September 2022 (UTC)

You should be unlocked now. No idea why. I see no error messages. AManWithNoPlan (talk) 01:15, 22 September 2022 (UTC)
@AManWithNoPlan: And now I'm being locked out again by yet another stalled job (I'm not sure what it is this time!). Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 19:50, 22 September 2022 (UTC)
@Whoop whoop pull up: try yet again. AManWithNoPlan (talk) 21:02, 22 September 2022 (UTC)
Working now, thanx! Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 21:26, 22 September 2022 (UTC)
@AManWithNoPlan: Actually, scratch that; now I'm being locked out even after the run in question's finished! (Category:Cultural depictions of Xaviera Hollander, in case you were wondering.) Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 21:54, 22 September 2022 (UTC)

@AManWithNoPlan: Sorry to pester you, but I'm still being locked out of starting a new run most of a day later. Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 12:12, 23 September 2022 (UTC)

 Fixed AManWithNoPlan (talk) 13:43, 23 September 2022 (UTC)

Thanx! :-) Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 16:59, 23 September 2022 (UTC)

Change parameter for cite web but not cite news?

Why does it change {{cite web |via=NYTimes.com}} to {{cite news |newspaper=The New York Times}}
but not {{cite news |via=NYTimes.com}}to {{cite news |newspaper=The New York Times}}
or am I missing something? Jonatan Svensson Glad (talk) 22:07, 12 September 2022 (UTC)

Can you point me to some specific examples AManWithNoPlan (talk) 12:51, 21 September 2022 (UTC)
See diff of the above. Sidenote, it did not remove |via= when |work= had content. Jonatan Svensson Glad (talk) 17:48, 22 September 2022 (UTC)

{{fixed}} AManWithNoPlan (talk) 19:46, 24 September 2022 (UTC)

Caps: SMPTE

Status
 Fixed
Reported by
b
}
23:48, 24 September 2022 (UTC)
What should happen
[29]


Caps: Proceedings of SPIE

Status
 Fixed
Reported by
b
}
23:53, 24 September 2022 (UTC)
What should happen
[30]


Caps: CA

Status
 Fixed
Reported by
b
}
01:38, 25 September 2022 (UTC)
What should happen
[31]


Caps: Rev Sci Tech Off Int Epiz

Status
 Fixed
Reported by
b
}
23:43, 24 September 2022 (UTC)
What happens
[32]
What should happen
[33]


Caps: SAGE Open

Status
 Fixed
Reported by
b
}
23:51, 24 September 2022 (UTC)
What should happen
[34]


Error Adding Category:Articles lacking reliable references

Status
Red X Not a bug
Reported by
Skeet Shooter (talk) 01:00, 27 September 2022 (UTC)
What happens
The bot fixed a handful of parameter syntax errors (appreciated) in the references, but it also added the hidden category "Articles lacking reliable references". I couldn’t tell what issue caused the addition of this category, but I disagree with the bot’s assessment.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=List_of_SETP_Members&type=revision&diff=1112536444&oldid=1112256880


Umm, that was you. You included {{Better source needed}} when you created the article. That template adds Category:Articles lacking reliable references (see info at the top of the category page).
Trappist the monk (talk) 01:07, 27 September 2022 (UTC)
Thank you, Trappist. I missed that connection, but it makes perfect sense. I wanted to keep these entries for a while that were in the previous version of the list to give other editors a chance to provide reliable references that I have so far been unable to find. I concur the bot’s behavior is correct and not a bug. I appreciate the quick response. Skeet Shooter (talk) 01:30, 27 September 2022 (UTC)

wrong doi!

The bot added

bibcode = 1960Natur.186Q.211. to the

But the Nature article (the doi) is only the review of the real article (the url) --Stone (talk) 08:54, 29 September 2022 (UTC)

The exising S2CID points to nature https://en.wikipedia.org/w/index.php?title=Iridium&type=revision&diff=1112758294&oldid=1112673378 GIGO AManWithNoPlan (talk) 11:30, 29 September 2022 (UTC)
Also the URL is wrong. AManWithNoPlan (talk) 13:06, 30 September 2022 (UTC)

Reuters

Status
Red X Not a bug
Reported by
Mannofthomas (talk) 00:11, 21 September 2022 (UTC)
What happens
The bot incorrectly changes the parameter from 'agency' to 'work' when Reuters is being used, resulting in 'Reuters' being italicized, even though Reuters, an agency, is not italicized in English.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Elon_Musk&diff=1111432762&oldid=1111432493


I don't think this is a news article syndicated by agency Reuters and published by someone else, so the use of |agency=Reuters is incorrect. However, turning it into |work=Reuters is also incorrect. The name of the work is not "Reuters". I think that the correct parameters for this citation would be |work=Reuters Graphics and/or |publisher=Reuters. —David Eppstein (talk) 00:20, 21 September 2022 (UTC)
I totally agree. The main issue for me is the italicization of 'Reuters', which is just wrong, at least by utilizing the 'agency' parameter, the name won't be italicized even though it seems not to be the correct option either. — Mannofthomas (talk) 00:43, 21 September 2022 (UTC)
|work= is always italicized, and should be. If Reuters (the company) published this content on a site (work) whose name was also "Reuters", then italicizing it as the name of a work would be correct. The only problem here is that the name of the site is not "Reuters" but instead "Reuters Graphics". —David Eppstein (talk) 00:59, 21 September 2022 (UTC)
What I've learned is that not all websites are considered works. This is illustrated by the numerous articles we have about websites that don't employ the {{italic title}} template. Also, when an editor considers whether a particular source should be a publisher or work it's helpful to look at things with a focus on the publisher/work relationship.
In old media, there was a pretty consistently identifiable relationship between newspapers and their associated publishing companies, but with new media, that is becoming a thing of the past. Going back to the numerous articles we have about websites that do not have their titles italicized: I think that that is correct because of this lack of a publishing company. Yes, websites can have a parent company, but those are not the same as a publishing company, so it can be confusing. But what I think the editors of these articles have realized, as have I, is that if there's no identifiable publishing company then the website itself is the publisher.
In my work, I try to be consistent with how I decide whether a source should be italicized or not, and this idea that Reuters should be italicized only sometimes is way too perplexing. Whoever deemed that the right way to go needs to think again. Consider the numerous other news agencies that don't have that rule applied. Also consider that every other news agency or publisher is not italicized. Our own article on Reuters is not even italicized. Dawnseeker2000 01:57, 21 September 2022 (UTC)
The rule is very straightforward. When a name is used as the name of a company or organization (for instance, as a publisher or agency), it is upright Roman. When a name is a name of a collective body of work, like a periodical or an edited volume or a web site, it is italic. Some names are used for both kinds of things and depending on how they are used can be formatted differently. Trying to choose a parameter based on how you would like to see it formatted is a mistake. Choose a parameter based on what kind of thing you are naming and let the formatting handle itself. —David Eppstein (talk) 03:59, 21 September 2022 (UTC)
The main things to understand is that editors are indeed noticing the discrepancy of Reuters being italicized and that there will be more who speak up in the future. Dawnseeker2000 04:14, 21 September 2022 (UTC)
There are many places in Wikipedia where details are imperfect and can be improved. But your focus on whether it is italicized or not is not a useful way to think about it. Focus on whether the name Reuters is used to mean the company Reuters or the web site maintained by the company. They are two different things and our citation formatting (when used with the correct parameters, based on meaning rather than intended appearance) helps readers understand which of those two meanings is intended by the name. —David Eppstein (talk) 04:22, 21 September 2022 (UTC)
The bot's edit was more-or-less correct. When Reuters (the corporate entity) publishes something on its eponymously named website (the work) then the 'Reuters' name goes in |work=. In this specific case, at the bottom of the source is this:
Rocket illustrations by Wen Foo
Additional work by Ashlyn Still and Travis Hartman
Editing by Christine Chan and Thomas Brown
REUTERS GRAPHICS
so |work=Reuters Graphics is more correct than |work=Reuters. Use |agency=Reuters when Reuters licences another corporate entity to publish a Reuters article in the 'other corporate entity's' work (commonly a newspaper, or other news source). Example: Reuters grants a licence to The New York Times Company to publish a Reuters article in The New York Times so: |agency=Reuters |newspaper=The New York Times. |agency= should never be used in the absence of |work= (or an alias) – I wonder if cs1|2 should enforce that; it might reduce the number of occasions that this topic must be discussed...
In periodical cs1|2 templates ({{cite journal}}, {{cite magazine}}, {{cite news}}, {{cite web}}) |publisher= should be omitted most of the time. Do not use |publisher=Reuters to avoid italicizing 'Reuters'.
cs1|2 templates create metadata that is used by reference management software (Zotero, for example). The standard that cs1|2 uses does not have support for |publisher= in periodical metadata and no support for |agency= at all. |agency= is only rendered in the visual. Those who consume cs1|2 citations through the metadata have no idea where an article comes from if that information is not included in the correct cs1|2 parameter (|work= or an alias). Do not deprive those consumers of this essential information.
Trappist the monk (talk) 14:03, 21 September 2022 (UTC)

Gimme Some Lovin’

Thanks for the help.😁 Hamsterbird (talk) 01:01, 1 October 2022 (UTC)

Did not fix author

Status
Red X Won't fix - too dangerous for a bot
Reported by
85.238.101.64 (talk) 00:55, 5 October 2022 (UTC)
What happens
That edit was almost great except bot did not analyze existing of "first" field filled, which, in fact, did not resolve a missing last name problem.
What should happen
I don't mean your bot have to analyze "first" field content (as it obviously not easy to do always right way, i.e. to pick out last name from it if it have place there), but would be great in such particular case when "last" field is empty but "first" field is not to at least not only remove "last" field from "cite web" template, but also renaming "first[1, 2, ...]" field name to just "author[1, 2, ...]" which don't have such a strict meaning as "first" or "last" fields, but still resolve the error issue.
Relevant diffs/links
missing last name problem do not being fixed
We can't proceed until
Feedback from maintainers


This would not generally be a good solution to the error noted. --Izno (talk) 01:14, 5 October 2022 (UTC)

@Izno:Then what was a purpose of deleting empty "last" field at all? While doing proposed action it makes something really useful: despite it's not a best solution - it still a solution and not some action that nor fix anything nor do something useful as it now. 85.238.101.64 (talk) 01:19, 5 October 2022 (UTC)
@Ira Leviton:It would be good for you to join here as expert. What do you thing about current automated proposition? Will it ease ediitng some way by resolving "no last name" automatically instead of placing it ot appropriate category or not? 85.238.101.64 (talk) 01:24, 5 October 2022 (UTC)
If any author parameters are set, then all empty author parameters are dropped. Same with editors. Also all aliases: such as if issue is set then number is dropped. It is all about removing empty extra parameters. AManWithNoPlan (talk) 14:13, 6 October 2022 (UTC)

Double edit needed

Status
{{
notabug
}} - user was running in special purpose URL focused mode
Reported by
b
}
03:43, 10 October 2022 (UTC)
What happens
[35]
What should happen
same + [36] (the Frontiers journal specifically)
We can't proceed until
Feedback from maintainers


User-activated instance of bot removes disambiguators from publication year

Status
Red X Not a bug
Reported by
David Eppstein (talk) 05:37, 12 October 2022 (UTC)
What happens
In critical graph, there are two papers by Gallai, published in 1963, disambiguated as Gallai 1963a and as Gallai 1963b so that harv linking to them will still work. User:RDBrown, in what seems to have been a user-activated instance of Citation bot, removed those years and broke the links. If this is something the bot is doing (rather than user error, which it might be instead, I can't tell) it needs to stop. The same edit also changed a valid page number range 194–197 into an abbreviated range 194–7 which the bot should also not be doing, but again it might be user error rather than bot error.
What should happen
Not that.
Relevant diffs/links
Special:Diff/1115537315


That was me, not the bot. I didn't see harv in the notes or refs and forgot to check in the text. RDBrown (talk) 12:41, 12 October 2022 (UTC)

Ok, thanks for clarifying. I'll tag this "not a bug". —David Eppstein (talk) 14:37, 12 October 2022 (UTC)

Edit comment of "Formatted dashes" is not accurate

Status
 Fixed
Reported by
UtherSRG (talk) 18:35, 11 October 2022 (UTC)
What happens
Bot added "pages=281–292" and reported in edit comment that it added pages and formatted dashes
What should happen
Bot adds "pages=281–292" and reports in edit comment that it added pages
Relevant diffs/links
Special:Diff/1115481187


The bot should only report "Formatted dashes" if the dashes it is formatting were already in place. It shouldn't report this for dashes it added. - UtherSRG (talk) 18:35, 11 October 2022 (UTC)

Untitled_new_bug

Status
 Fixed
Reported by
Timrollpickering (talk) 18:48, 14 October 2022 (UTC)
What happens
The web interface refuses to process a new category


Earlier this afternoon I ran the bot on several small categories, all now complete. However hours later when I try to run it on a new category I just get "Run blocked by your existing big run." with no explanation as to what that is when the run should have completed ages ago. Timrollpickering (talk) 18:48, 14 October 2022 (UTC)

Volume vs Issues

Status
{{fixed}}. now correctly prefers issue over volume for Bulletin of the United States National Museum
Reported by
UtherSRG (talk) 10:55, 17 October 2022 (UTC)
What happens
duplicate volume & issue
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Fraser%27s_dolphin&diff=prev&oldid=1116596207
We can't proceed until
Feedback from maintainers


Error in ref re: comma

Status
{{
notabug
}} - that is a single quote, not a comma
Reported by
NOLA1982 (talk) 01:22, 20 October 2022 (UTC)
What happens
I'm new at reporting these types of errors, so please be kind. The comma between city and state abbreviation in a ref title parameter was changed in error. In this example, "Quincy, CA" was changed by the bot in a ref title to: "Quincy' CA"
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=High_Sierra_Music_Festival&type=revision&diff=1117088149&oldid=1023012736
We can't proceed until
Feedback from maintainers


that is really odd. the original character is a single quote - spanish style. AManWithNoPlan (talk) 11:28, 20 October 2022 (UTC)

Small error

Strange error, in this edit Zehnder's first paper was marked with year 2010, even though it is from 1975. Gumshoe2 (talk) 01:11, 23 October 2022 (UTC)

The journal put the wrong data in CrossRef

https://search.crossref.org/?from_ui=&q=10.1002%2Fcpa.3160280104 AManWithNoPlan (talk) 11:19, 23 October 2022 (UTC)

Citation bot treats citations from NIST that are not journal articles as if they were journal articles


Removal of "format" parameter in cs1 citations

Status
Red X Not a bug
Reported by
XAM2175 (T) 19:14, 17 October 2022 (UTC)
What happens
The |format=PDF parameter is being removed unnecessarily from references using the {{Cite book}} template.
What should happen
It should be left in place, per the advice at
HELP:CS1#Using |format=
Relevant diffs/links
[37]


Incidentally, it also says it upgraded an ISBN10 to 13 but all it did was add dashes to an ISBN13. --Izno (talk) 19:30, 17 October 2022 (UTC)

Removing |format=PDF is correct, as the cite template now knows to display the PDF sign regardless. - UtherSRG (talk) 21:01, 17 October 2022 (UTC)
The reason it should not be removed is because |format=PDF displays to everyone, whereas the icon is not available to all users. Izno (talk) 21:07, 17 October 2022 (UTC)
(PDF) displays regardless of whether or not format is declared. It can be removed without any difference to the output.
b
}
23:03, 17 October 2022 (UTC)
How does it determine that it's a PDF? If it's from the file extension in the URL, then is it not possible that the automatic determination could miss PDFs that are reached via URLs that don't have that extension?
Regardless, it strikes me as odd to go to such lengths to remove a piece of human-verified metadata that is 1) potentially useful and 2) absolutely harmless when it isn't be useful, simply because some uses of it are redundant. XAM2175 (T) 00:11, 18 October 2022 (UTC)

Free journal

10.1210/jendso/* should be marked as |doi-access=free. Example: |doi=10.1210/jendso/bvab049. Jonatan Svensson Glad (talk) 19:26, 24 October 2022 (UTC)

 Fixed

NOAA NCEI Natural Hazards databases should be web, not journal

Status
 Fixed
Reported by
Worldbruce (talk) 05:28, 24 October 2022 (UTC)
What happens
Citation bot changes {{cite web}} to {{cite journal}} in 1762 Arakan earthquake, but the linked NOAA NCEI Natural Hazards database looks to me like a web database, not a journal. Meteorology is not my specialist subject, but I believe 10.7289 is a non-journal DOI. If I'm mistaken, what is the name of the journal?
Relevant diffs/links
[38]


Caps: CytoJournal

Status
 Fixed
Reported by
b
}
15:16, 23 October 2022 (UTC)
What should happen
[39]


url in |title=

Status
 Fixed - I will be more careful
Reported by
Trappist the monk (talk) 15:44, 23 October 2022 (UTC)
What happens
No doubt, GIGO, the bot replaced |title=Archived copy with text that contains a url
What should happen
Prospective replacement 'title' contains a url? Abandon the edit. URLs are allowed only in url-holding parameters (|url=, |chapter-url=, etc); pretty sure that I've made this point before...
Relevant diffs/links
Diff
We can't proceed until
Feedback from maintainers


Restricting urls to something-url parameters in the citation templates is, as usual, too mechanical, prescriptive, and procrustian to handle real-world citations. It is frequently necessary to put urls in other parameters including id, page, at, etc. The only parameters for which urls should be excluded are the ones for which it is possible to make a link using a different parameter. However, title is one of those otherwise-linkable parameters, so I agree that it should not have urls. —David Eppstein (talk) 18:28, 23 October 2022 (UTC)
Sorry about that. I am running a special version of the bot that allows me to override the normal GIGO prevention systems to deal with titles of "archived copy". Each and every title has to be manually approved. That was obviously a mistake on my part. AManWithNoPlan (talk) 14:02, 24 October 2022 (UTC)

Bad journal title

Status
 Fixed
Reported by
Pi.1415926535 (talk) 19:56, 28 September 2022 (UTC)
What happens
Incorrectly converts from {{cite book}} to {{cite journal}} and adds obviously incorrect (and badly formatted) journal title
Relevant diffs/links
Special:Diff/1112917250


Another instance Special:Diff/1114768642 "journal=Includes:reports from Commissioners, Inspectors and Others" jnestorius(talk) 06:26, 14 October 2022 (UTC)

Thank you for reporting. Investigating how to detect this bad meta-data. AManWithNoPlan (talk) 12:41, 21 October 2022 (UTC)

Duplicates hdl from inside url parameter

Status
Red X Not a bug
Reported by
Acad Ronin (talk) 16:58, 21 October 2022 (UTC)



On 23 September 2022, on Partridge (1814 ship) the Citation Bot ignored the parameter url=, which had a fully functional URL in it, and added hdl= , with a url. The result was that the citation displayed both with the clickable icon for a url, and the full, redundant hdl= url. I had fixed this untidy semiduplication before, but apparently to no avail. Clearly the bot is ignoring the url parameter, even when it is populated. Can this be fixed? Regards, Acad Ronin (talk) 16:58, 21 October 2022 (UTC)

Looks like it was 8 Sep 2022: Special:Diff/1109275325 - UtherSRG (talk) 17:07, 21 October 2022 (UTC)
This is not a bug. The standard for references is to add identifier, etc.. More links the better. AManWithNoPlan (talk) 15:22, 22 October 2022 (UTC)

Big run doesn't stop

I haven't been able to use the bot in a while, apparently there's a big run done for me, but I can't seem to kill it or find what edit it's wanting to do.

b
} 02:00, 25 October 2022 (UTC)

The server got rebooted and corrupted the files. I have deleted everyones sessions. AManWithNoPlan (talk) 11:41, 25 October 2022 (UTC)

Template change error

Status
 Fixed with special code for 10.5962/bhl.title. DOIs
Reported by
Mikeblas (talk) 20:06, 27 October 2022 (UTC)
What happens
Citation bot changes {{cite web}} to {{cite journal}} without specifying a journal name, and causes a referencing error.
Relevant diffs/links
[40]
We can't proceed until
Feedback from maintainers


And the |p= changes should have been to |pages=. --Izno (talk) 20:24, 27 October 2022 (UTC)
 Fixed the page to pages issue. AManWithNoPlan (talk) 21:30, 27 October 2022 (UTC)

bot adds |chapter= to cite journal

Status
Red X Won't fix
Reported by
Trappist the monk (talk) 23:10, 26 October 2022 (UTC)
What happens
bot added |chapter= to {{
cite ssrn}}, {{cite web
}}.
Relevant diffs/links
Diff


This appears to be an entry in a book, A Dictionary of Political Biography. The metadata for the book, showing it to be a book and giving its title, is what one gets from the doi database. So the correct change here would be to add the book title and change the citation template to {{cite book}} or maybe {{cite encyclopedia}}. —David Eppstein (talk) 23:25, 26 October 2022 (UTC)
Those are actually logged by the bot and I go in and fix them. They are almost universally GIGO that needs some human TLC. AManWithNoPlan (talk) 14:13, 27 October 2022 (UTC)