Wikipedia:Wikipedia Signpost/2017-06-09/Op-ed
Wikipedia's lead sentence problem
In the 9th edition of the Encyclopædia Britannica, editor Thomas Spencer Baynes introduced the convention of including a person's birth and death year after their name in all biographical articles:
CAMPBELL, John, LL.D. (1708–1775), a miscellaneous author, was born at Edinburgh, March 8, 1708.
This allowed a reader to more easily distinguish between the 100+ notable people named John Campbell (only one of whom was actually lucky enough to get an article in the 9th edition). Although this convention was a bit awkward and redundant, it served a useful purpose (in the absence of disambiguation pages), and was kept in all subsequent editions.
When Wikipedia was created in 2001, it sought to emulate the successful model of the Encyclopædia Britannica and many editors adopted the convention of including birth and death years in the lead sentence.[1] Here is the lead sentence for Christopher Columbus as it appeared on June 13, 2001:
Christopher Columbus (1451?–1506) was a probably Genovian sailor who crossed the Atlantic in service of Spain.
Little did Thomas Spencer Baynes realize, Wikipedia editors would eventually expand on his convention, including not only birth and death years, but entire birth and death dates, birth and death dates in alternate calendars, birth and death locations, alternate names, maiden names, foreign names, pronunciations, foreign pronunciations, and transliterations. Fifteen years later, here's what Christoper Columbus's lead sentence had become:
Christopher Columbus (
Latin: Christophorus Columbus; born between 31 October 1450 and 30 October 1451 in Genoa – died on 20 May 1506 in Valladolid) was an Italian explorer, navigator, colonizer, and citizen of the Republic of Genoa.
What began as a concise, encyclopedic sentence had slowly grown into a sprawling mess of multiplying metadata—a sentence so complicatingly packed as to render it unreadable.[2] This isn't just a subjective opinion, either. If you chart the Flesch Reading Ease score of the sentence over the years, you'll see an almost continuous decline since 2002. This is by no means an isolated example, either. The metadata virus has spread from biographical articles to other subjects as well, like geography:
Israel (
Arabic: دَوْلَة إِسْرَائِيل Dawlat Isrāʼīl [dawlat ʔisraːˈʔiːl]), is a country in the Middle East, on the southeastern shore of the Mediterranean Sea and the northern shore of the Red Sea.
The problem has become so noticeable that many reusers of Wikipedia content (including the WMF itself) have started stripping out parenthetical phrases from the lead sentence in certain contexts. If you search for "Christopher Columbus" on Google, you'll see a much more digestible description, both in the
If we don't take significant steps to address this problem, the metadata disease is only going to keep multiplying and spreading. If left unchecked, I fear this is what our future will look like:
[Excerpt from the Americapedia article about Wikipedia, copyright 2034, used with permission.]
...Like frogs in a pot of boiling water, the proliferation of lead sentence metadata happened so slowly that no one noticed until 2021 when John Seigenthaler's son published a devastating video on ClickNews in which he read aloud the lead sentence of his Wikipedia article, and then wept for 3 minutes.
John Michael SeigenthalerEnglish pronunciation: /ˈdʒɑn ˈmaɪkəl ˈsiːɡənθɔːlər/ ⓘ; German pronunciation: [ˈjuːˈan ˈmaɪkəl ˈziːkənθɔːlər] ⓘ; born December 21, 1955 in Nashville, Tennessee , current resident of Weston, Connecticut (as of 2008), not yet deceased), also known as John Seigenthaler Jr. (English pronunciation: /ˈdʒɑn ˈsiːɡənθɔːlər ˈdʒunjəɹ/ ⓘ; German: John Seigenthaler jünger, pronounced [ˈjuːˈan ˈziːkənθɔːlər ˈdʒunjəɹ] ⓘ), is an American news anchor, most recently working for ClickNews.
(
Seigenthaler's video caught the attention of the recently re-elected

Let's avoid this sorry fate and make Wikipedia great again!
- dagger (called a "Kreuz" in German), which has led to endless debates about whether or not the symbol is Christian and thus inappropriate to use for non-Christian biographies. Luckily, such a convention doesn't seem to exist in English encyclopedias!
- largestcontiguous empire in history after his death.
Discuss this story
Join the RfC in response to this article. —A L T E R C A R I ✍ 06:59, 19 June 2017 (UTC) [reply]
Brilliant insight
Sometimes a problem is right in front of you, but you don't notice it until someone else points it out, at which point you see it everywhere. This essay is that sort of eye-opener. --Guy Macon (talk) 22:26, 8 May 2017 (UTC)[reply]
Solutions
Some {{infobox medical condition}} introduced a
|pronounce=
a while ago, which I think is a good solution. Alternate names/languages could be handled the same way in articles with infoboxes, e.g., as documented at Template:Infobox settlement#Name and transliteration.Etymology is an endless problem (e.g., in anatomy articles), with some editors wanting it to be the first thing that you read, others wanting it last, and others not wanting it included at all. WhatamIdoing (talk) 18:53, 12 May 2017 (UTC)[reply]
{{u|Talk} 05:36, 9 June 2017 (UTC)[reply ]
- Really? Have you nothing better to do?
Carl Fredrik talk 07:12, 9 June 2017 (UTC)[reply ]
- I can only assume that this request is some sort of meta-joke. No, we are not going to oversight that part of the article, as it does not qualify under the oversight criteria. Lankiveil (speak to me) 08:56, 11 June 2017 (UTC).[reply]
to reply to me 17:42, 16 June 2017 (UTC)[reply]