Home | Crosswinds Info | Tutorials | Articles | Resources | About Us | Contact Us | Acknowledgments

Character Entities for HTML

Because the symbols " & < > (double quotationmark, ampersand, lessthansymbol and greaterthansymbol) are integral to HTML code itself, you should be replacing all instances of " & < > (double quotationmark, ampersand, lessthansymbol and greaterthansymbol) that are used for anything other than the code itself.

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples

It is also useful to employ character entities to replace characters in e-mail addresses that are entered on your webpages. This helps greatly in preventing robots from mining e-mail addresses for spam purposes. In your HTML coding, replace the character with &#numberofcharacter; (ampersand, hashmark, numberofcharacter, semicolon). Instead of &, you would type &#038;. There are examples of HTML code using character entities at the bottom of the page.

ASCII stands for "American Standard Code Information Interchange". The numbers of the characters can be found by referring to the character map on your computer. The characters numbered from #032 to #126 are common to all keyboard systems. (#032 is the spacebar and is not the recommended character entity to depict a non-breaking space.)

Recommended for Encoding

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples

characters to be replaced with entities
for display in HTML - strongly recommended

Name & Character     Character Entity
_____________________________________
 ampersand     &         &amp;       
 lessthan      <         &lt;        
 greaterthan   >         &gt;        
 quotationmark "         &quot;      
 at            @         &#064;      
 Copyright     ©         &copy;      

Use &nbsp; to indicate a nonbreaking space in HTML.
encoding examples using some of the above characters
  

Standard Characters

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples

standard characters

These characters will look the same from computer to computer.
(Use a fixed-width font for ASCII-art and charts)

Courier,FixedSys, Monaco and the generic monospace are examples of non-proportional or fixed width fonts and are used for drawing ASCII-art and making charts. Times New Roman, Times and Symbol are examples of proportional width fonts and are not used for drawing ASCII-art or making charts.


&#032; SP    &#051; 3    &#070; F    &#089; Y    &#108; l
&#033; !     &#052; 4    &#071; G    &#090; Z    &#109; m
&#034; "     &#053; 5    &#072; H    &#091; [    &#110; n
&#035; #     &#054; 6    &#073; I    &#092; \    &#111; o
&#036; $     &#055; 7    &#074; J    &#093; ]    &#112; p
&#037; %     &#056; 8    &#075; K    &#094; ^    &#113; q
&#038; &     &#057; 9    &#076; L    &#095; _    &#114; r
&#039; '     &#058; :    &#077; M    &#096; `    &#115; s
&#040; (     &#059; ;    &#078; N    &#097; a    &#116; t
&#041; )     &#060; <    &#079; O    &#098; b    &#117; u
&#042; *     &#061; =    &#080; P    &#099; c    &#118; v
&#043; +     &#062; >    &#081; Q    &#100; d    &#119; w
&#044; ,     &#063; ?    &#082; R    &#101; e    &#120; x
&#045; -     &#064; @    &#083; S    &#102; f    &#121; y
&#046; .     &#065; A    &#084; T    &#103; g    &#122; z
&#047; /     &#066; B    &#085; U    &#104; h    &#123; {
&#048; 0     &#067; C    &#086; V    &#105; i    &#124; |
&#049; 1     &#068; D    &#087; W    &#106; j    &#125; }
&#050; 2     &#069; E    &#088; X    &#107; k    &#126; ~

#032 is the space bar.
Some people like to use named character entities rather than numbered.
&#034;  or  &quot;  "  quotation mark
&#038;  or  &amp;   &  ampersand     
&#060;  or  &lt;    <  less than     
&#062;  or  &gt;    >  greater than  
  

Extended Characters - A Cautionary Tale

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples

The characters numbered from #127 to #255 are not common to all keyboard systems and can look quite different on different operating systems. Please bear that in mind when you use these characters.

Here is an example using the character #188. These will look different depending on whether you are viewing this page on a Mac or a PC or a....

#188 in "Symbol" font: ¼
(On my PC, I see a "..." symbol)

"TimesRoman", "Roman" or "serif" #188: ¼
"Arial", "Helvetica" or "sans-serif" #188: ¼
(In both those cases, on my PC, I see a "1/4" symbol, but depending on your OS, you might be seeing an asterisk, or the symbol for Pi, or ???)

"WingDings" or "Zapf Dingbats" #188: ¼
(On my PC, I see a "clock" symbol)

If you really want the characters you use on your website to be viewed relatively globally, it's a very good idea to follow the guidelines at www.w3.org. If you plan on using extended characters, it may be a good idea to use images rather than entities unless you know categorically that your viewers will be able to see them.

Extended Characters

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples

Extended Characters

You may see something entirely different, depending on your character set. Some people feel that named character entities are safer to use than numbered. I have shown both in the following table. However, some browsers may not render the named character entities. For instance, Windows95 (yes, yes, Win95 is an ancient system but there are still a few floating around) does not render the circumflex ˆ with &circ; (one sees the character entity rather than the character) but it does render ˆ using &#710;. Windows95 does not render tilde ˜ with &tilde; but does render ˜ using &732;. (Those last sentences may look like nonsense to you because your system does render the letter entities.)

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters

These are arranged with the numbered entities to the left of the symbol and named entities to the right. The number entities on the left are supported by NS4+ and IE4+. Some letter entities on right are NOT supported by older OS. If the character used has a number entity greater than #255, it may be advisable to use a numbered entity rather than named entity.

When using named entities, bear in mind that they are case-sensitive. Using the wrong case can produce an unintended result. Note the difference that an uppercase D can make:

&dagger; produces
&Dagger; produces

When using the following codes, please let copy and paste be your friend.

copyright symbols

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters

&#169;   ©   &copy;     Copyright                 
&#174;   ®   &reg;      Registered Trademark      
&#8482;     &trade;    Trademark                 

editorial symbols

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters

&#160;       &nbsp;       nonbreaking space        
&#171;   «   &laquo;      left double angled quote 
&#187;   »   &raquo;      right double angled quote
&#8216;     &lsquo;      left single quote        
&#8217;     &rsquo;      right single quote       
&#8218;     &sbquo;      low single quotation mark
&#8220;     &ldquo;      left double quote        
&#8221;     &rdquo;      right double quote       
&#8222;     &bdquo;      low double quotation mark
&#8249;     &lsaquo;     left single angled quote 
&#8250;     &rsaquo;     right single angled quote
&#161;   ¡   &iexcl;      inverted exclamation mark
&#166;   ¦   &brvbar;     Broken Vertical Bar      
&#167;   §   &sect;       Section                  
&#168;   ¨   &uml;        Umlaut                   
&#173;   ­    &shy;        Soft Hyphen              
&#180;   ´   &acute;      Accute Accent            
&#182;      &para;       Paragraph                
&#183;   ·   &middot;     Middle Dot               
&#184;   ¸   &cedil;      Cedilla                  
&#191;   ¿   &iquest;     Inverted QuestionMark    
&#710;   ˆ   &circ;       circumflex               
&#732;   ˜   &tilde;      tilde                    
&#8211;     &ndash;      en dash                  
&#8212;     &mdash;      em dash                  
&#8230;     &hellip;     horizontal ellipses      
&#8224;     &dagger;     dagger mark              
&#8225;     &Dagger;     double dagger mark       
&#8226;     &bull;       bullet                   

arrows

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters

&#8592;     &larr;      left arrow               
&#8593;     &uarr;      upwards arrow            
&#8594;     &rarr;      right arrow              
&#8595;     &darr;      downwards arrow          
&#8596;     &harr;      leftright arrow          

mathematical symbols

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters

&#172;   ¬   &not;        Not                      
&#175;   ¯   &macr;       Macron                   
&#176;   °   &deg;        Degree                   
&#177;   ±   &plusmn;     Plus or Minus            
&#185;   ¹   &sup1;       Superscript One          
&#178;   ²   &sup2;       Superscript Two          
&#179;   ³   &sup3;       Superscript Three        
&#170;   ª   &ordf;       Feminine Ordinal         
&#186;   º   &ordm;       Masculine Ordinal        
&#181;   µ   &micro;      Micro                    
&#188;   ¼   &frac14;     One Quarter              
&#189;   ½   &frac12;     One Half                 
&#190;   ¾   &frac34;     Three Quarters           
&#215;   ×   &times;      Multiplication symbol    
&#247;   ÷   &divide;     Division symbol          
&#8240;     &permil;     per thousand             

more mathematical symbols

currency symbols

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters | more extended characters

&#162;   ¢   &cent;       cent                     
&#163;   £   &pound;      Pound Sterling           
&#164;   ¤   &curren;     currency                 
&#165;   ¥   &yen;        Yen                      
&#402;   ƒ   &fnof;       forte, florin            
&#8364;     &euro;       Euro (not on older OS)   

accented letters

accented letters | copyright symbols | mathematical symbols | editorial symbols | arrows | currency symbols | more extended characters

&#192;   À   &Agrave;     UpperCase grave accent A 
&#193;   Á   &Aacute;     UpperCase acute accent A 
&#194;   Â   &Acirc;      UpperCase circumflex A   
&#195;   Ã   &Atilde;     UpperCase tilde A        
&#196;   Ä   &Auml;       UpperCase umlaut A       
&#197;   Å   &Aring;      UpperCase ring A         
&#198;   Æ   &AElig;      UpperCase AE ligature    
&#224;   à   &agrave;     LowerCase grave accent a 
&#225;   á   &aacute;     LowerCase acute accent a 
&#226;   â   &acirc;      LowerCase circumflex a   
&#227;   ã   &atilde;     LowerCase tilde a        
&#228;   ä   &auml;       LowerCase umlaut a       
&#229;   å   &aring;      LowerCase ring a         
&#230;   æ   &aelig;      LowerCase ae ligature    

&#199;   Ç   &Ccedil;     UpperCase cedilla C      
&#231;   ç   &ccedil;     LowerCase cedilla c      

&#208;   Ð   &ETH;        UpperCase ETH            
&#240;   ð   &eth;        LowerCase eth            

&#200;   È   &Egrave;     UpperCase grave accent E 
&#201;   É   &Eacute;     UpperCase acute accent E 
&#202;   Ê   &Ecirc;      UpperCase circumflex E   
&#203;   Ë   &Euml;       UpperCase diuresis E     
&#232;   è   &egrave;     LowerCase grave accent e 
&#233;   é   &eacute;     LowerCase acute accent e 
&#234;   ê   &ecirc;      LowerCase circumflex e   
&#235;   ë   &euml;       LowerCase diuresis e     
&#204;   Ì   &Igrave;     UpperCase grave accent I 
&#205;   Í   &Iacute;     UpperCase acute accent I 
&#206;   Î   &Icirc;      UpperCase circumflex I   
&#207;   Ï   &Iuml;       UpperCase diuresis I     
&#236;   ì   &igrave;     LowerCase grave accent i 
&#237;   í   &iacute;     LowerCase acute accent i 
&#238;   î   &icirc;      LowerCase circumflex i   
&#239;   ï   &iuml;       LowerCase diuresis i     

&#209;   Ñ   &Ntilde;     UpperCase tilde N        
&#241;   ñ   &ntilde;     LowerCase tilde n        

&#210;   Ò   &Ograve;     UpperCase grave accent O 
&#211;   Ó   &Oacute;     UpperCase acute accent O 
&#212;   Ô   &Ocirc;      UpperCase circumflex O   
&#213;   Õ   &Otilde;     UpperCase tilde O        
&#214;   Ö   &Ouml;       UpperCase umlaut O       
&#216;   Ø   &Oslash;     UpperCase slash O        
&#338;   Œ   &OElig;      UpperCase OE ligature    
&#339;   œ   &oelig;      Lowercase oe ligature    
&#242;   ò   &ograve;     LowerCase grave accent o 
&#243;   ó   &oacute;     LowerCase acute accent o 
&#244;   ô   &ocirc;      LowerCase circumflex o   
&#245;   õ   &otilde;     LowerCase tilde o        
&#246;   ö   &ouml;       LowerCase umlaut o       
&#248;   ø   &oslash;     LowerCase slash o        

&#222;   Þ   &THORN;      UpperCase Thorn          
&#254;   þ   &thorn;      LowerCase thorn          

&#223;   ß   &szlig;      sz ligature (double S)   
&#352;   Š   &Scaron;     UpperCase caron hacek S  
&#353;   š   &scaron;     LowerCase caron hacek s  

&#217;   Ù   &Ugrave;     UpperCase grave accent U 
&#218;   Ú   &Uacute;     UpperCase acute accent U 
&#219;   Û   &Ucirc;      UpperCase circumflex U   
&#220;   Ü   &Uuml;       UpperCase umlaut U       
&#249;   ù   &ugrave;     LowerCase grave accent u 
&#250;   ú   &uacute;     LowerCase acute accent u 
&#251;   û   &ucirc;      LowerCase circumflex u   
&#252;   ü   &uuml;       LowerCase umlaut u       

&#221;   Ý   &Yacute;     UpperCase acute accent Y 
&#376;   Ÿ   &Yuml;       UpperCase diruesis Y     
&#253;   ý   &yacute;     LowerCase acute accent y 
&#255;   ÿ   &yuml;       LowerCase diuresis y     

&#381;   Ž   &Zcaron;     UpperCase caron hacek Z  
&#382;   ž   &zcaron;     LowerCase caron hacek z  

Greek alphabet

Your system may show that character #128 is the Euro sign. This sign will likely appear as an empty box or different character altogether in other OS. Please see currency symbols for the correct entity. The same goes for characters #142 and #158. You may see Z's with carons but other OS very likely will not. Please see accented letters for the correct entity.


other character entities

may not be rendered by older OS

With the extended characters on this chart, it may be a good idea to use images rather than entities unless you know categorically that your viewers' OS support these character entities.

more mathematical symbols

mathematical symbols | Greek alphabet | card suits | other extended characters

please note that these characters may not be displayed on older OS
&#8242;     &prime;      prime, minutes, feet          
&#8243;     &Prime;      double prime, seconds, inches 
&#8260;     &frasl;      fraction slash                
&#8706;     &part;       partial differential          
&#8719;     &prod;       n-ary product, product sign   
&#8721;     &sum;        n-ary summation               
&#8730;     &radic;      square root, radical sign     

&#8734;     &infin;      infinity                      
&#8745;     &cap;        intersection, cap             
&#8747;     &int;        integral                      
&#8804;     &le;         less-than or equal to         
&#8805;     &ge;         greater-than or equal to      
&#8722;     &minus;      minus                         
&#8776;     &asymp;      almost equal to, asymptotic to
&#8800;     &ne;         not equal to                  
&#8801;     &equiv;      identical to, equivalent      
&#9674;     &loz;        lozenge                       

other mathematical symbols | Greek alphabet | www.w3.org - more complete list

Greek alphabet

mathematical symbols | Greek alphabet | card suits | other extended characters

For use in technical documents employing the occasional Greek character - please note that these characters may not be displayed on older OS
&#913;   Α   &Alpha;      Uppercase Alpha          
&#914;   Β   &Beta;       Uppercase Beta           
&#915;   Γ   &Gamma;      Uppercase Gamma          
&#916;   Δ   &Delta;      Uppercase Delta          
&#917;   Ε   &Epsilon;    Uppercase Epsilon        
&#918;   Ζ   &Zeta;       Uppercase Zeta           
&#919;   Η   &Eta;        Uppercase Eta            
&#920;   Θ   &Theta;      Uppercase Theta          
&#921;   Ι   &Iota;       Uppercase Iota           
&#922;   Κ   &Kappa;      Uppercase Kappa          
&#923;   Λ   &Lambda;     Uppercase Lambda         
&#924;   Μ   &Mu;         Uppercase Mu             
&#925;   Ν   &Nu;         Uppercase Nu             
&#926;   Ξ   &Xi;         Uppercase Xi             
&#927;   Ο   &Omicron;    Uppercase Omicron        
&#928;   Π   &Pi;         Uppercase Pi             
&#929;   Ρ   &Rho;        Uppercase Rho            
&#931;   Σ   &Sigma;      Uppercase Sigma          
&#932;   Τ   &Tau;        Uppercase Tau            
&#933;   Υ   &Upsilon;    Uppercase Upsilon        
&#934;   Φ   &Phi;        Uppercase Phi            
&#935;   Χ   &Chi;        Uppercase Chi            
&#936;   Ψ   &Psi;        Uppercase Psi            
&#937;   Ω   &Omega;      Uppercase Omega          
          
&#945;   α   &alpha;      lowercase alpha          
&#946;   β   &beta;       lowercase beta           
&#947;   γ   &gamma;      lowercase gamma          
&#948;   δ   &delta;      lowercase delta          
&#949;   ε   &epsilon;    lowercase epsilon        
&#950;   ζ   &zeta;       lowercase zeta           
&#951;   η   &eta;        lowercase eta            
&#952;   θ   &theta;      lowercase theta          
&#953;   ι   &iota;       lowercase iota           
&#954;   κ   &kappa;      lowercase kappa          
&#955;   λ   &lambda;     lowercase lambda         
&#956;   μ   &mu;         lowercase mu             
&#957;   ν   &nu;         lowercase nu             
&#958;   ξ   &xi;         lowercase xi             
&#959;   ο   &omicron;    lowercase omicron        
&#960;   π   &pi;         lowercase pi             
&#961;   ρ   &rho;        lowercase rho            
&#962;   ς   &sigmaf;     lowercase final sigma    
&#963;   σ   &sigma;      lowercase sigma          
&#964;   τ   &tau;        lowercase tau            
&#965;   υ   &upsilon;    lowercase upsilon        
&#966;   φ   &phi;        lowercase phi            
&#967;   χ   &chi;        lowercase chi            
&#968;   ψ   &psi;        lowercase psi            
&#969;   ω   &omega;      lowercase omega          

other mathematical symbols

Card Suits

mathematical symbols | Greek alphabet | card suits | other extended characters

please note that these characters may not be displayed on older OS
&#9824;    &spades;     spade suit                
&#9827;    &clubs;      club suit, shamrock       
&#9829;    &hearts;     heart suit                
&#9830;    &diams;      diamond suit              
  
  

Encoding Examples

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples

The following example shows how to encode some characters on an HTML page - note the extra space between "address" and the e-mail address itself. (The characters displayed with character entities are in bold).

HTML code using numbered entities for
[e-mail address   <alias@ip.com>]

e-mail address &#160; &#060;<a href="mailto:alias&#064;ip&#046;com">alias&#064;ip&#064;com</a>&#062;

Here it is again using named entities. (The equivalent named entities for #046; and #064; are not supported by all OS.)

HTML code using named and numbered entities for
[e-mail address   <alias@ip.com>]

e-mail address &nbsp; &lt;<a href="mailto:alias&#064;ip.com">alias&#064;ip&#046;com</a>&gt;

Here is another example using named character entities.

HTML code for
[¾ English & ¼ français]:

&frac34; English &amp;&frac14; fran&ccedil;ais

Here is another chart with named entities. Also of interest is HTML Special Characters and Browser Compatibility.

It may also be a good idea to identify (with a meta tag) what character system you are using in the head section of your page. For this page, the following is placed just before </head>:

<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

Other character sets can be found at www.w3.org - Internationalization page.

recommended for encoding | standard characters | extended characters | cautionary tale about extended characters | encoding examples