Dig.do - Find top sites
Find the most accessed sites on the Internet
Top Internet Sites: list all sites | by country | by category


dig.do - Internet's most used character sets - Stats

Character set encoding stats used by the Internet's most accessed websites. A few years ago, the ISO-8859-1 encoding was the most common character set used by western websites. Now UNICODE (UTF-8) is the standard that is used by most websites. Other still common character codes are:

Below are the stats of the most used character codes (charsets) created by the analysis of Internet's most accessed websites.

Number of websites (sites visited to generate this stats): 1754139
Charset list stats generated in February/2013.

Charset encoding Number of sites % of websites
utf-8116732866.5 %
iso-8859-136304920.7 %
windows-1251426812.4 %
gb2312381762.2 %
336941.9 %
shift_jis219621.3 %
windows-1252203091.2 %
gbk135290.8 %
iso-8859-269820.4 %
windows-125661010.3 %
euc-jp60910.3 %
iso-8859-1552640.3 %
iso-8859-944620.3 %
euc-kr38590.2 %
windows-125033970.2 %
windows-125433680.2 %
big523790.1 %
us-ascii18930.1 %
windows-87414330.1 %
utf89700.1 %
iso-8859-76510 %
tis-6206490 %
windows-12555380 %
x-sjis5250 %
windows-12534840 %
iso8859-14420 %
shift-jis3660 %
koi8-r3400 %
cp12512950 %
windows-12572270 %
utf-162140 %
unicode1230 %
utf-71190 %
latin1950 %
windows-31j860 %
iso840 %
iso-8859-5790 %
"utf-8"760 %
ks_c_5601-1987730 %
win-1251700 %
cp-1251650 %
uft-8620 %
iso-2022-jp530 %
none470 %
gb18030440 %
iso-8895-1410 %
iso_8859-1350 %
340 %
latin5300 %
iso-8559-1290 %
iso-8859280 %
utf270 %
macintosh250 %
x-euc-jp250 %
iso-8859-1;250 %
iso-8859-6240 %
8859-1230 %
iso-8859-4230 %
iso8859-2220 %
sjis220 %
charset200 %
cp1252200 %
latin-1170 %
"utf-8";160 %
x-user-defined160 %
iso-8859-16160 %
iso-utf-8160 %
ansi150 %
iso8859-15140 %
windows-utf-8130 %
iso-8859-13130 %
iso-8859-3120 %
utf-8"/>120 %
ascii110 %
utf8_general_ci110 %
-110 %
gb_2312-80110 %
koi8-u110 %
iso-8859-8100 %
window-125190 %
x-mac-roman90 %
tis62070 %
euc70 %
cp125070 %
ansi_x370 %
is0-8859-170 %
gbk231270 %
gb70 %
"iso-8859-1"60 %
iso8859_160 %
iso-5589-160 %
utf-8560 %
windows60 %
windows-8859-160 %
utf-960 %
euckr60 %
big-550 %
window-87450 %
iso-104050 %
text50 %
cp125650 %
gb213250 %
iso-8859-8-i50 %
big5-hkscs50 %
ksc_560150 %
utf_850 %
urf-850 %
windows-cp125150 %
latin250 %

| dig.do - home | list world top sites | top sites by category | top sites by country | blog | stats | terms of use | contact |