ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for html documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ A B C D E F G H I J K L M N O P Q R S T U V W X Y Z [ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯ ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡')
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 "Compose" key combinations;
- CHR: the charater itself, when HTML-printable;
- MEANING: the ISO-8859-1 (and ASCII) definition.
HTML OCTL HEX CMP CHR MEANING ------ + ---- + --- + --- + --- + ------------------------------ � | 00 | =00 | | | Invalid (ASCII NUL)  | 01 | =01 | | | Unused (ASCII SOH)  | 02 | =02 | | | Unused (ASCII STX)  | 03 | =03 | | | Unused (ASCII ETX)  | 04 | =04 | | | Unused (ASCII EOT)  | 05 | =05 | | | Unused (ASCII ENQ)  | 06 | =06 | | | Unused (ASCII ACK)  | 07 | =07 | | | Unused (ASCII BEL, audible bell)  | 10 | =08 | | | Unused (ASCII BS, backspace) 	 | 11 | =09 | | | Horizontal tab (ASCII HT) 
 | 12 | =0A | | | Line feed (ASCII NL, newline)  | 13 | =0B | | | Unused (ASCII VT, vertical tab)  | 14 | =0C | | | Unused (ASCII NP, new page) 
 | 15 | =0D | | | Carriage Return (ASCII CR)  | 16 | =0E | | | Unused (ASCII SO)  | 17 | =0F | | | Unused (ASCII SI)  | 20 | =10 | | | Unused (ASCII DLE)  | 21 | =11 | | | Unused (ASCII DC1)  | 22 | =12 | | | Unused (ASCII DC2)  | 23 | =13 | | | Unused (ASCII DC3)  | 24 | =14 | | | Unused (ASCII DC4)  | 25 | =15 | | | Unused (ASCII NAK)  | 26 | =16 | | | Unused (ASCII SYN)  | 27 | =17 | | | Unused (ASCII ETB)  | 30 | =18 | | | Unused (ASCII CAN)  | 31 | =19 | | | Unused (ASCII EM)  | 32 | =1A | | | Unused (ASCII SUB)  | 33 | =1B | | | Unused (ASCII ESC, escape)  | 34 | =1C | | | Unused (ASCII FS)  | 35 | =1D | | | Unused (ASCII GS)  | 36 | =1E | | | Unused (ASCII RS)  | 37 | =1F | | | Unused (ASCII US)   | 40 | =20 | | ( ) | Space (ASCII SP) ! | 41 | =21 | | (!) | Exclamation mark " | 42 | =22 | | (") | Quotation mark (") # | 43 | =23 | | (#) | Number sign $ | 44 | =24 | | ($) | Dollar sign % | 45 | =25 | | (%) | Percent sign & | 46 | =26 | | (&) | Ampersand (&) ' | 47 | =27 | | (') | Apostrophe (right single quote) ( | 50 | =28 | | (() | Left parenthesis ) | 51 | =29 | | ()) | Right parenthesis * | 52 | =2A | | (*) | Asterisk + | 53 | =2B | | (+) | Plus sign , | 54 | =2C | | (,) | Comma - | 55 | =2D | | (-) | Hyphen . | 56 | =2E | | (.) | Period (fullstop) / | 57 | =2F | | (/) | Solidus (slash) 0 | 60 | =30 | | (0) | Digit 0 . . . 9 | 71 | =39 | | (9) | Digit 9 : | 72 | =3A | | (:) | Colon ; | 73 | =3B | | (;) | Semi-colon < | 74 | =3C | | (<) | Less than (<) = | 75 | =3D | | (=) | Equals sign > | 76 | =3E | | (>) | Greater than (>) ? | 77 | =3F | | (?) | Question mark @ | 100 | =40 | | (@) | Commercial at-sign A | 101 | =41 | | (A) | Uppercase letter A . . . Z | 132 | =5A | | (Z) | Uppercase letter Z [ | 133 | =5B | | ([) | Left square bracket \ | 134 | =5C | | () | Reverse solidus (backslash) ] | 135 | =5D | | (]) | Right square bracket ^ | 136 | =5E | | (^) | Caret _ | 137 | =5F | | (_) | Horizontal bar (underscore) ` | 140 | =60 | | (`) | Reverse apostrophe (left single quote) a | 141 | =61 | | (a) | Lowercase letter a . . . z | 172 | =7A | | (z) | Lowercase letter z { | 173 | =7B | | ({) | Left curly brace | | 174 | =7C | | (|) | Vertical bar } | 175 | =7D | | (}) | Right curly brace ~ | 176 | =7E | | (~) | Tilde  | 177 | =7F | | | Unused (ASCII DEL) € | 200 | =80 | | | Unused  | 201 | =81 | | | Unused ‚ | 202 | =82 | | | Unused ƒ | 203 | =83 | | | Unused „ | 204 | =84 | | | Unused … | 205 | =85 | | | Unused † | 206 | =86 | | | Unused ‡ | 207 | =87 | | | Unused ˆ | 210 | =88 | | | Unused ‰ | 211 | =89 | | | Unused Š | 212 | =8A | | | Unused ‹ | 213 | =8B | | | Unused Œ | 214 | =8C | | | Unused  | 215 | =8D | | | Unused Ž | 216 | =8E | | | Unused  | 217 | =8F | | | Unused  | 220 | =90 | | | Unused ‘ | 221 | =91 | | | Unused ’ | 222 | =92 | | | Unused “ | 223 | =93 | | | Unused ” | 224 | =94 | | | Unused • | 225 | =95 | | | Unused – | 226 | =96 | | | Unused — | 227 | =97 | | | Unused ˜ | 230 | =98 | | | Unused ™ | 231 | =99 | | | Unused š | 232 | =9A | | | Unused › | 233 | =9B | | | Unused œ | 234 | =9C | | | Unused  | 235 | =9D | | | Unused ž | 236 | =9E | | | Unused Ÿ | 237 | =9F | | | Unused   | 240 | =A0 | | ( ) | Non-breaking space ( ) ¡ | 241 | =A1 | ! ! | (¡) | Inverted exclamation ¢ | 242 | =A2 | c / | (¢) | Cent sign £ | 243 | =A3 | l - | (£) | Pound sterling ¤ | 244 | =A4 | o x | (¤) | General currency sign ¥ | 245 | =A5 | y - | (¥) | Yen sign ¦ | 246 | =A6 | | | | (¦) | Broken vertical bar § | 247 | =A7 | s o | (§) | Section sign ¨ | 250 | =A8 | " " | (¨) | Umlaut (dieresis) © | 251 | =A9 | c o | (©) | Copyright ª | 252 | =AA | - a | (ª) | Feminine ordinal « | 253 | =AB | < < | («) | Left angle quote, guillemotleft ¬ | 254 | =AC | - , | (¬) | Not sign ­ | 255 | =AD | - - | () | Soft hyphen ® | 256 | =AE | r o | (®) | Registered trademark ¯ | 257 | =AF | ^ - | (¯) | Macron accent ° | 260 | =B0 | ^ * | (°) | Degree sign ± | 261 | =B1 | + - | (±) | Plus or minus ² | 262 | =B2 | ^ 2 | (²) | Superscript two ³ | 263 | =B3 | ^ 3 | (³) | Superscript three ´ | 264 | =B4 | | (´) | Acute accent µ | 265 | =B5 | / u | (µ) | Micro sign ¶ | 266 | =B6 | P ! | (¶) | Paragraph sign · | 267 | =B7 | ^ . | (·) | Middle dot ¸ | 270 | =B8 | , , | (¸) | Cedilla ¹ | 271 | =B9 | ^ 1 | (¹) | Superscript one º | 272 | =BA | _ o | (º) | Masculine ordinal » | 273 | =BB | > > | (») | Right angle quote, guillemotright ¼ | 274 | =BC | 1 4 | (¼) | Fraction one-fourth ½ | 275 | =BD | 1 2 | (½) | Fraction one-half ¾ | 276 | =BE | 3 4 | (¾) | Fraction three-fourths ¿ | 277 | =BF | ? ? | (¿) | Inverted question mark À | 300 | =C0 | A ` | (À) | Capital A, grave accent Á | 301 | =C1 | A ' | (Á) | Capital A, acute accent  | 302 | =C2 | A ^ | (Â) | Capital A, circumflex accent à | 303 | =C3 | A ~ | (Ã) | Capital A, tilde Ä | 304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark Å | 305 | =C5 | A * | (Å) | Capital A, ring Æ | 306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature) Ç | 307 | =C7 | C , | (Ç) | Capital C, cedilla È | 310 | =C8 | E ` | (È) | Capital E, grave accent É | 311 | =C9 | E ' | (É) | Capital E, acute accent Ê | 312 | =CA | E ^ | (Ê) | Capital E, circumflex accent Ë | 313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark Ì | 314 | =CC | I ` | (Ì) | Capital I, grave accent Í | 315 | =CD | I ' | (Í) | Capital I, acute accent Î | 316 | =CE | I ^ | (Î) | Capital I, circumflex accent Ï | 317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark Ð | 320 | =D0 | D - | (Ð) | Capital Eth, Icelandic Ñ | 321 | =D1 | N ~ | (Ñ) | Capital N, tilde Ò | 322 | =D2 | O ` | (Ò) | Capital O, grave accent Ó | 323 | =D3 | O ' | (Ó) | Capital O, acute accent Ô | 324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent Õ | 325 | =D5 | O ~ | (Õ) | Capital O, tilde Ö | 326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark × | 327 | =D7 | x x | (×) | Multiply sign Ø | 330 | =D8 | O / | (Ø) | Capital O, slash Ù | 331 | =D9 | U ` | (Ù) | Capital U, grave accent Ú | 332 | =DA | U ' | (Ú) | Capital U, acute accent Û | 333 | =DB | U ^ | (Û) | Capital U, circumflex accent Ü | 334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark Ý | 335 | =DD | Y ' | (Ý) | Capital Y, acute accent Þ | 336 | =DE | P | | (Þ) | Capital THORN, Icelandic ß | 337 | =DF | s s | (ß) | Small sharp s, German (sz ligature) à | 340 | =E0 | a ` | (à) | Small a, grave accent á | 341 | =E1 | a ' | (á) | Small a, acute accent â | 342 | =E2 | a ^ | (â) | Small a, circumflex accent ã | 343 | =E3 | a ~ | (ã) | Small a, tilde ä | 344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark å | 345 | =E5 | a * | (å) | Small a, ring æ | 346 | =E6 | a e | (æ) | Small ae dipthong (ligature) ç | 347 | =E7 | c , | (ç) | Small c, cedilla è | 350 | =E8 | e ` | (è) | Small e, grave accent é | 351 | =E9 | e ' | (é) | Small e, acute accent ê | 352 | =EA | e ^ | (ê) | Small e, circumflex accent ë | 353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark ì | 354 | =EC | i ` | (ì) | Small i, grave accent í | 355 | =ED | i ' | (í) | Small i, acute accent î | 356 | =EE | i ^ | (î) | Small i, circumflex accent ï | 357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark ð | 360 | =F0 | d - | (ð) | Small eth, Icelandic ñ | 361 | =F1 | n ~ | (ñ) | Small n, tilde ò | 362 | =F2 | o ` | (ò) | Small o, grave accent ó | 363 | =F3 | o ' | (ó) | Small o, acute accent ô | 364 | =F4 | o ^ | (ô) | Small o, circumflex accent õ | 365 | =F5 | o ~ | (õ) | Small o, tilde ö | 366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark ÷ | 367 | =F7 | - : | (÷) | Division sign ø | 370 | =F8 | o / | (ø) | Small o, slash ù | 371 | =F9 | u ` | (ù) | Small u, grave accent ú | 372 | =FA | u ' | (ú) | Small u, acute accent û | 373 | =FB | u ^ | (û) | Small u, circumflex accent ü | 374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark ý | 375 | =FD | y ' | (ý) | Small y, acute accent þ | 376 | =FE | p | | (þ) | Small thorn, Icelandic ÿ | 377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
Notes for HTML documents
-
HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.
-
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
-
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
-
The non-breaking space ( ) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: | |.
Other notes
-
Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).
-
Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...
Additional references
- Martin Ramsch's iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML3.0, HTML+, with Postscript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html