Non-breaking space
Non-breaking space | |
---|---|
U+00A0 NO-BREAK SPACE ( ,  ) | |
See also | |
U+202F NARROW NO-BREAK SPACE Other types of spaces |
In
Uses and variations
Despite having layout and uses similar to those of
Non-breaking behavior
Text-processing software typically assumes that an automatic line break may be inserted anywhere a space character occurs; a non-breaking space prevents this from happening (provided the software recognizes the character). For example, if the text "100 km" will not quite fit at the end of a line, the software may insert a line break between "100" and "km". An editor who finds this behavior undesirable may choose to use a non-breaking space between "100" and "km". This guarantees that the text "100 km" will not be broken: if it does not fit at the end of a line, it is moved in its entirety to the next line.
Non-collapsing behavior
A second common application of non-breaking spaces is in
In contrast, non-breaking spaces are not merged with neighboring whitespace characters when displayed. They can, therefore, be used by an author to simply insert additional visible space in the resulting output without using spans styled with peculiar values of the
Width variation
Other non-breaking variants, defined in Unicode:
- U+202F NARROW NO-BREAK SPACE (
NNBSP
) - Due to the tighter binding of value and unit as a continuous visual element NNBSP is recommended for usage in the SI-standard.[6] It was introduced in Unicode 3.0 for Mongolian,[7] to separate a suffix[8] from the word stem without indicating a word boundary. It is also required for big punctuation in French where it is called espace fine insécable and sometimes inaccurately referred to as "double punctuation" (before
;
,?
,!
,»
,›
and after«
,‹
; today often also before:
) and in German between multi-part abbreviations (e.g., "z. B.", "d. h.", "v. l. n. r.").[9] When used with Mongolian, its width is usually one third of the normal space; in other contexts, its width is about 70% of the normal space but may resemble that of the thin space (U+2009), at least with some fonts.[10] Also starting from release 34 of Unicode Common Locale Data Repository (CLDR) the NNBSP is used in numbers as thousands group separator for French and Spanish locale.[11][12]
- U+2007 FIGURE SPACE ( )
- Produces a space equal to the figure (0–9) characters.
- U+2060 WORD JOINER (⁠ ·
WJ
) - Encoded in Unicode since version 3.2. The word joiner does not produce any space and prohibits a line break at its position.
Example
On browsers, resizing the window will demonstrate the effect of non-breaking spaces on the texts below.
To show the non-breaking effect of the non-breaking space, the following words have been separated with non-breaking spaces:
Lorem Ipsum Dolor Sit Amet Consectetur Adipiscing Elit Sed Do Eiusmod Tempor Incididunt Ut Labore Et Dolore Magna Aliqua Ut Enim Ad Minim Veniam Quis Nostrud Exercitation Ullamco Laboris Nisi Ut Aliquip Ex Ea Commodo Consequat Duis Aute
To show the non-collapsing behavior of the non-breaking space, the following words have been separated with an increasing number of non-breaking spaces:
Lorem Ipsum Dolor Sit Amet Consectetur Adipiscing Elit Sed Do Eiusmod Tempor Incididunt Ut Labore Et Dolore Magna Aliqua Ut Enim Ad Minim
In contrast, the following words are separated with ordinary spaces:
Lorem Ipsum Dolor Sit Amet Consectetur Adipiscing Elit Sed Do Eiusmod Tempor Incididunt Ut Labore Et Dolore Magna Aliqua Ut Enim Ad Minim Veniam Quis Nostrud Exercitation Ullamco Laboris Nisi Ut Aliquip Ex Ea Commodo Consequat Duis Aute
Encodings
Preview | | |||||||
---|---|---|---|---|---|---|---|---|
Unicode name | NO-BREAK SPACE | FIGURE SPACE | NARROW NO-BREAK SPACE | WORD JOINER | ||||
Encodings | decimal | hex | dec | hex | dec | hex | dec | hex |
Unicode | 160 | U+00A0 | 8199 | U+2007 | 8239 | U+202F | 8288 | U+2060 |
UTF-8 | 194 160 | C2 A0 | 226 128 135 | E2 80 87 | 226 128 175 | E2 80 AF | 226 129 160 | E2 81 A0 |
Numeric character reference |   |
  |
  |
  |
  |
  |
⁠ |
⁠ |
Named character reference | ,   |   | ⁠ | |||||
ECMA-94 |
160 | A0 | ||||||
Windows code pages: 1250, 1251, 1252, 1253, 1254, 1255, 1256, 1257, 1258 | 160 | A0 | ||||||
KOI8-R, KOI8-U | 154 | 9A | ||||||
255 | FF | |||||||
HP Roman-9 |
160 | A0 | ||||||
LICS | 154 | 9A | ||||||
TeX | ~ |
\phantom{9} |
\, |
\nobreak
| ||||
EBCDIC | hex: 41 – RSP, Required Space |
In Unicode, the byte order mark (BOM), U+FEFF, may be interpreted as a "zero width no-break space", but is a deprecated alternative to word joiner (U+2060).
Keyboard entry methods
It is rare for national or international standards on
Typically, authors of keyboard drivers and application programs (e.g., word processors) have devised their own keyboard shortcuts for the non-breaking space. For example:
System/application | Entry method |
---|---|
Microsoft Windows | Alt+0160 or Alt+255 (does not always work) |
macOS | ⌥ Opt+⇧ Shift+Space |
X11
|
Compose, Space, Space or AltGr+Space |
AmigaOS | Alt+Space |
GNU Emacs | Ctrl+X 8 Space |
Vim | Ctrl+K, Space, Space; or Ctrl+K, ⇧ Shift+N, ⇧ Shift+S |
Ctrl+⇧ Shift+Space[c] | |
FrameMaker, LyX (non-Mac), OpenOffice.org (before 3.0), WordPerfect |
Ctrl+Space |
Mac Adobe InDesign | ⌥ Opt+⌘ Cmd+X |
Python programming language | '\N{NO-BREAK SPACE}' [14] or '\xa0' /'\u00a0'
|
Apart from this, applications and environments often have
See also
- Hyphens in computing, for information about hard and non-breaking hyphens
- List of XML and HTML character entity references
- Non-breaking hyphen – Punctuation mark used to join words
- Punctuation – Marks to indicate pacing of written text
- Plenken
- Sentence spacing in digital media – Horizontal width of inter-sentence space
- Space (punctuation) – Blank area that separates text
- Space (punctuation) § Non-breaking space, for applications
- Zero-width space – Special character in text processing, a non-spacing break
- Widows and orphans – In typography, an isolated line of text starting/ending a page
- Non-printing character in word processors – Formatting marks for content design
- Typographic alignment § Justified
Notes
- ^ For presentation reasons, representations of a regular space in this article are replaced with a no-break space
- Copy/pasting the symbol shown here will result in the light blue color being transferred along with the symbol. This can be avoided by using one of the techniques described at § Keyboard entry methods.
- ^ On Windows 10 systems, Ctrl+⇧ Shift will by default switch keyboard configurations if more than one keyboard has been configured for the current language.
References
- ^ IBM (1998) [1995]. "Windows, Latin 1". REGISTRY, Graphic Character Sets and Code Pages. CPGID 01252.
- .
- ^ "Special Characters". The Chicago Manual of Style Online.
- ^ "Structure", HTML 4.01, W3, 1999-12-24.
- ^ "Text", CSS 2.1, W3.
- ^ "SI-Standard mentioning the insertion of non-breaking space between value and unit" (PDF).
- ^ ISO/IEC 10646-1:1993/FDAM 29:1999(E)
- ^ "Mongolian NNBSP-connected suffixes" (PDF). W3. December 28, 2014.
- ^ Solbrig, Amelie (30 January 2008). "Zweisprachige Mikrotypografie" (PDF) (in German). Hochschule für Technik, Wirtschaft und Kultur Leipzig. p. 58 (PDF p. 113). Archived from the original (PDF) on 2016-03-11. Retrieved 10 June 2018.
Alle Abkürzungen mit Binnenpunkten werden im Deutschen mit einem gFL [geschütztes flexibles Leerzeichen] spationiert. [...] Die englische Schreibweise sieht keine Abstände zwischen einzelnen Buchstaben vor. Nach einem Binnenpunkt folgt demnach ohne gFL sofort der nächste Buchstabe.
- ^ "Writing Systems and Punctuation" (PDF). The Unicode Standard 7.0. Unicode Inc. 2014. Retrieved 2014-11-02.
- ^ "CLDR Chart: Numbers".
- ^ RAE; RAE. "punto | Diccionario panhispánico de dudas". «Diccionario panhispánico de dudas» (in Spanish). Retrieved 2022-01-07.
- CSC – IT Center for Science, archived from the original on 2011-07-27). Drafts of the Finnish multilingual keyboard.
{{citation}}
: CS1 maint: location missing publisher (link - ^ Python Software Foundation. "Unicode HOWTO¶". Python 3.10.6 documentation. Retrieved 19 August 2022.