Infobox
An infobox is a digital or physical
An infobox may be implemented in another document by
Wikipedia
An infobox may be used to summarize the information of an article on Wikipedia.[3] They are used on similar articles to ensure consistency of presentation by using a common format.[4][2] Originally, infoboxes (and templates in general) were used for page layout purposes.[2] An infobox may be transcluded into an article by specifying the value for some or all of its parameters.[5] The parameter name used must be the same as that specified in the infobox template, but any value may be associated to it.[5] The name is delimited from the value by an equals sign.[5] The parameter name may be regarded as an attribute of the article's subject.[6]
{{Infobox prepared food
| name =
| image =
| image_size =
| caption =
| alternate_name =
| country =
| region =
| creator =
| course =
| type =
| served =
| main_ingredient =
| variations =
| calories =
| other =
}}
|
{{Infobox prepared food
| name = Crostata
| image = Crostata limone e zenzero 3.jpg
| image_size =
| caption = Crostata with lemon ginger filling
| alternate_name =
| country = [[Italy]]
| region = [[Lombardia]]
| creator =
| course = [[Dessert]]
| type = [[Tart]]
| served =
| main_ingredient = Pastry crust, [[jam]] or [[ricotta]], fruit
| variations = ''Crostata di frutta'', ''crostata di ricotta'', many other sweet or savoury variations
| calories =
| other =
}}
|
The infobox Infobox prepared food used on Wikipedia food-related articles, with no values specified for its parameters (attributes).
|
The same infobox as implemented in the article wiki markup: entries enclosed in square brackets (e.g. [[Tart]]) will be rendered as a link to the respective Wikipedia article (e.g. Tart ), and the linked file will be transcluded into the article in place of its markup.
|
On Wikipedia, an infobox is transcluded into an article by enclosing its name and attribute–value pairs within a double set of
Placement of an infobox within the
Baeza-Yates and King say that some editors find templates such as infoboxes complicated,[10] as the template may hide text about a property or resource that the editor wishes to change; this is exacerbated by chained templates, that is templates transcluded within other templates.[10]
As of August 2009, English Wikipedia used about 3,000 infobox templates that collectively used more than 20,000 attributes.[11] Since then, many have been merged, to reduce redundancy. As of June 2013, there were at least 1,345,446 transclusions of the parent Infobox template,[12] used by some, but not all, infoboxes, on 4,251,127 articles.
The name of an Infobox is typically "Infobox [genre]"; however, widely used infoboxes may be assigned shorter names, such as "taxobox" for taxonomy.[8]
Machine learning
About 44.2% of Wikipedia articles contained an infobox in 2008,[13] and about 33% in 2010.[14] Automated semantic knowledge extraction using machine learning algorithms is used to "extract machine-processable information at a relatively low complexity cost".[1] However, the low coverage makes it more difficult, though this can be partially overcome by complementing article data with that in categories in which the article is included.[13] The French Wikipedia initiated the project Infobox Version 2 in May 2011.[15][16]
Knowledge obtained by machine learning can be used to improve an article, such as by using automated software suggestions to editors for adding infobox data.[13] The iPopulator project created a system to add a value to an article's infobox parameter via an automated parsing of the text of that article.[11]
DBpedia uses structured content extracted from infoboxes[2] by machine learning algorithms to create a resource of linked data in the Semantic Web; it has been described by Tim Berners-Lee as "one of the more famous" components of the linked data project.[17]
Machine extraction creates a triple consisting of a subject, predicate or relation, and object.[1] Each attribute-value pair of the infobox is used to create an RDF statement using an ontology.[18] This is facilated by the narrower gap between Wikipedia and an ontology than exists between unstructured or free text and an ontology.[19]
The semantic relationship between the subject and object is established by the predicate.[1] In the example infobox, the triple ("crostata", type, "tart") indicates that a crostata is a type of tart. The article's topic is used as the subject, the parameter name is used as the predicate, and the parameter's value as the object.[18][1] Each type of infobox is mapped to an ontology class, and each property (parameter) within an infobox is mapped to an ontology property.[20] These mappings are used when parsing a Wikipedia article to extract data.
Metadata
By presenting the basic facts of an article within an infobox, also allows the facts to be presented in a machine-friendly way allowing extra functionality such as when a link to a wikipedia is pasted into a compatible program, instead of just the link itself being posted other information such as the articles image is posted too.
Citations
- ^ a b c d e Baeza-Yates & King 2009, p. 31.
- ^ a b c d e Liyang 2011, p. 385.
- ^ a b Broughton 2008, p. 357.
- ^ a b Broughton 2008, p. 17.
- ^ a b c Broughton 2008, p. 18.
- ^ Baeza-Yates & King 2009, p. 30.
- ^ Broughton 2008, p. 234.
- ^ a b Broughton 2008, p. 235.
- wikitext layout is specified at Wikipedia:Manual of Style/Layout and Wikipedia:Manual of Style/Lead section.
- ^ a b Baeza-Yates & King 2009, p. 345.
- ^ a b Lange, Böhm & Naumann 2010, p. 7.
- List of Wikimedia projects. 7 June 2013. Retrieved 7 June 2013.
- ^ a b c Baeza-Yates & King 2009, p. 28.
- ^ Lange, Böhm & Naumann 2010, p. 5.
- ^ Geertman, Reinhardt & Toppen 2011, p. 493.
- ^ The project is hosted on the French Wikipedia page Infobox/V2.
- ^ Miller 2008.
- ^ a b Liyang 2011, p. 387.
- ^ Virvou & Matsuura 2012, p. 315.
- ^ Liyang 2011, p. 395.
Works cited
- Baeza-Yates, Ricardo; King, Irwin, eds. (2009). Weaving services and people on the World Wide Web. Springer. LCCN 2009926100.
- Broughton, John (2008-07-14). Barber, Nan; Meyers, Peter (eds.). ISBN 9780596553777.
- Geertman, Stan; Reinhardt, Wolfgang; Toppen, Fred, eds. (2011). Advancing geoinformation science for a changing world. Lecture Notes in Geoinformation and Cartography. Vol. 1. Springer. LCCN 2011925152.
- Lange, Dustin; Böhm, Christoph; Naumann, Felix (2010). Extracting Structured Information from Wikipedia Articles to Populate Infoboxes. Universitätsverlag Potsdam. ISBN 9783869560816. Archived from the original on 2014-06-06. Retrieved 2013-06-02.)
{{cite book}}
:|work=
ignored (help - Yu, Liyang (2011). A Developer's Guide to the Semantic Web. Springer. ISBN 9783642159695.
- Miller, Paul (2008-02-07). "Sir Tim Berners-Lee Talks with Talis about the Semantic Web". Transcription by CastingWords. Talis Group. Archived from the original on 2013-05-10. Retrieved 2013-06-02.
- Virvou, Maria; Matsuura, Saeko, eds. (2012). Knowledge-based Software Engineering: Proceedings of the Tenth Joint Conference on Knowledge-Based Software Engineering. Frontiers and Artificial Intelligence and Applications. Vol. 240. IOS Press. LCCN 2012943674.
Further reading
- Kiran, Kumar N.; Santosh, G.S.K.; Varma, Vasudeva (June 2011). Multidisciplinary Information Retrieval. Lecture Notes in Computer Science. Vol. 6653. Springer Berlin Heidelberg. S2CID 9901886.
- Chutiporn, Anutariya; Domingue, John, eds. (2008). The Semantic Web: 3rd Asian Semantic Web Conference, ASWC 2008, Bangkok, Thailand, December 8-11, 2008. Proceedings. Lecture Notes in Computer Science. Vol. 5367. Springer. ISSN 0302-9743.
- Wu, Fei; Hoffmann, Ralph; Weld, Daniel s. (2008). "Information extraction from Wikipedia". Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. S2CID 7781746.