
Unicode

Unicode is a character encoding used on computers. It assigns every character of every language a single, unique binary code, in order to meet the requirements of cross-language and cross-platform text conversion and processing. Development began in 1990 and the standard was officially announced in 1994. As the processing power of computers has increased, Unicode has become widespread in the more than ten years since its release.

The latest version of Unicode is Unicode 4.1.0, released on 31 March 2005.

XML and Unicode

XML uses UTF-8 as its standard character set. By using specific numeric codes, a page written in any local script can be displayed in any browser that supports the XML standard, as long as the computer itself has the appropriate font files installed.
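As a minimal sketch of this behaviour, the following Python snippet parses a small UTF-8 encoded XML document with the standard library. The element name `p` and the sample text are illustrative assumptions, not taken from any particular document.

```python
import xml.etree.ElementTree as ET

# A small XML document serialized as UTF-8 bytes.
# The element name "p" and the sample text are hypothetical.
document = '<?xml version="1.0" encoding="UTF-8"?><p>Δ Й あ</p>'.encode("utf-8")

# The parser decodes the bytes according to the declared encoding.
root = ET.fromstring(document)
parsed_text = root.text

# In UTF-8, each of these non-ASCII characters occupies more than one byte.
delta_bytes = "Δ".encode("utf-8")
hiragana_a_bytes = "あ".encode("utf-8")

print(parsed_text, delta_bytes, hiragana_a_bytes)
```

Whether the characters then appear correctly on screen still depends on the fonts installed, as noted above.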

In the past, computer encodings were 8-bit standards, so each country could only define its own coding system for the characters used nationally. For languages with fairly complex writing systems, such as Vietnamese, or for the large character repertoires of the East Asian countries, an 8-bit environment cannot represent the script well. Such encodings could not always display even their own language properly, let alone the scripts of other countries. In HTML and XML, however, a specific character can now be written in the form &#nnn;, where nnn is the decimal Unicode code of the character. To use a hexadecimal code instead, the character x must be added in front of the code.
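The two reference forms can be sketched in Python as follows. The helper names `decimal_reference` and `hex_reference` are hypothetical and introduced only for illustration; `html.unescape` is the standard-library function that decodes such references.

```python
import html

def decimal_reference(char):
    """Return the decimal numeric character reference for one character."""
    return f"&#{ord(char)};"

def hex_reference(char):
    """Return the hexadecimal form, marked by an 'x' before the code."""
    return f"&#x{ord(char):X};"

delta_decimal = decimal_reference("Δ")
delta_hex = hex_reference("Δ")

# Both forms decode back to the same character.
decoded = html.unescape(delta_decimal + delta_hex)

print(delta_decimal, delta_hex, decoded)
```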

Support for hexadecimal codes in text is fairly recent, so older browsers may have problems displaying these characters. The first problem usually encountered is simply the display of Unicode characters whose codes do not fit in 8 bits. The common workaround is still to convert the hexadecimal code into a decimal code, that is, to write the decimal reference in place of the hexadecimal one.
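One possible way to automate this workaround is sketched below. The function name and the sample string are assumptions chosen for illustration; the code only rewrites hexadecimal references and leaves everything else untouched.

```python
import re

# Matches hexadecimal numeric character references such as "&#x3B2;".
HEX_REFERENCE = re.compile(r"&#[xX]([0-9A-Fa-f]+);")

def hex_to_decimal_references(markup):
    """Rewrite every hexadecimal reference as the equivalent decimal one."""
    return HEX_REFERENCE.sub(lambda m: f"&#{int(m.group(1), 16)};", markup)

# Sample input (hypothetical): two hexadecimal references and one decimal one.
converted = hex_to_decimal_references("&#x3B2; and &#946; and &#x4E2D;")
print(converted)
```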

In the Unicode standard, every code point is written as U+hhhh, where hhhh is a hexadecimal number.
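A short sketch of converting between characters and this notation in Python follows. The helper names are hypothetical, and padding to at least four hexadecimal digits is the usual convention assumed here.

```python
def to_u_notation(char):
    """Format a character's code point as U+hhhh (at least four hex digits)."""
    return f"U+{ord(char):04X}"

def from_u_notation(notation):
    """Parse a string such as 'U+0394' back into the character it denotes."""
    if not notation.upper().startswith("U+"):
        raise ValueError(f"not a U+ notation: {notation!r}")
    return chr(int(notation[2:], 16))

notations = [to_u_notation(c) for c in ("A", "Δ", "あ", "\U0001D11E")]
print(notations)
```

Code points beyond FFFF simply use more than four digits, as the last sample shows.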

Some character set standards also omit certain commonly used symbols, so you may need to use a textual symbolic notation to represent a character such as the long dash: even if the code of that character is already in use elsewhere, these standards do not contain the character itself.

However, many browsers can display only a small subset of the complete UCS-2 character set. The following table shows how your browser displays various Unicode codes:

Code | Standard character name (English) | Displayed in browser
A | Latin capital letter "A" | A
ß | Latin small letter "Sharp S" | ß
þ | Latin small letter "Thorn" | þ
Δ | Greek capital letter "Delta" | Δ
Й | Cyrillic capital letter "Short I" | Й
ק | Hebrew letter "Qof" | ק
م | Arabic letter "Meem" | م
๗ | Thai digit 7 | ๗
ቐ | Ethiopic syllable "Qha" | ቐ
あ | Japanese Hiragana "A" | あ
ア | Japanese Katakana "A" | ア
叶 | Simplified Chinese character "Ye" | 叶
葉 | Traditional Chinese character "Ye" | 葉
엽 | Korean syllable "Yeob" | 엽
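A table of this kind can be generated programmatically. The sketch below uses Python's standard `unicodedata` module to look up the formal standard names for a few of the characters listed above; the `samples` list and the row layout are illustrative choices, not part of the original table.

```python
import unicodedata

# A few of the characters from the table; the Hebrew and Arabic letters are
# written as escapes only to avoid bidirectional-text confusion in the source.
samples = ["A", "ß", "þ", "Δ", "Й", "\u05E7", "\u0645", "あ", "ア"]

# Each row holds: code point in U+ notation, formal name, the character itself.
rows = [(f"U+{ord(c):04X}", unicodedata.name(c), c) for c in samples]

for code, name, char in rows:
    print(f"{code} | {name} | {char}")
```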

Some web browsers with multilingual support, for instance Internet Explorer 5.5 on Microsoft Windows and the cross-platform browser Mozilla/Netscape 6, can dynamically use the appropriate character sets as needed. Once the appropriate language packs have been installed, they can display all kinds of Unicode characters on a page at the same time. MSIE 5.5 also prompts the user to install a new font when one is needed, so that it can be used as soon as it is installed. Other browsers, such as Netscape Navigator 4.77, can display only the script of the character set that corresponds to the encoding of the page. When you use the latter kind of browser, you are unlikely to have installed all the fonts in advance, and even if the fonts are present, the browser cannot necessarily make full use of them. The likely result is that such a browser displays only part of the text. Because the characters are encoded according to the standard, however, they can in theory be displayed correctly on any compatible system that has the corresponding fonts. One workaround is to write certain rare characters as named entity references.
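As a hedged illustration of named entity references, the sketch below resolves a few HTML entity names with Python's standard `html` and `html.entities` modules. The helper `named_reference_to_char` is a hypothetical name; note that the table used here is the HTML entity table, whereas XML itself predefines only `amp`, `lt`, `gt`, `apos` and `quot`.

```python
import html
from html.entities import name2codepoint

def named_reference_to_char(name):
    """Look up an HTML entity name (without '&' and ';') and return its character."""
    return chr(name2codepoint[name])

delta = named_reference_to_char("Delta")

# html.unescape decodes named references embedded in text.
sharp_s = html.unescape("&szlig;")
thorn = html.unescape("&thorn;")

print(delta, sharp_s, thorn)
```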

Unicode code charts
0000-0FFF | 8000-8FFF | 10000-10FFF | 20000-20FFF | 28000-28FFF
1000-1FFF | 9000-9FFF |             | 21000-21FFF | 29000-29FFF
2000-2FFF | A000-AFFF |             | 22000-22FFF | 2A000-2AFFF
3000-3FFF | B000-BFFF |             | 23000-23FFF |
4000-4FFF | C000-CFFF | 1D000-1DFFF | 24000-24FFF | 2F000-2FFFF
5000-5FFF | D000-DFFF |             | 25000-25FFF |
6000-6FFF | E000-EFFF |             | 26000-26FFF |
7000-7FFF | F000-FFFF |             | 27000-27FFF | E0000-E0FFF
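Each range above covers 4096 (hexadecimal 1000) consecutive code points. As a rough sketch, the chart range that contains a given code point can be computed as follows; the function names are hypothetical, and the code does not check whether a chart for that range is actually listed above.

```python
def chart_range(code_point):
    """Return the 4096-wide range label (e.g. '4000-4FFF') containing a code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("code point outside the Unicode range")
    start = code_point & ~0xFFF          # clear the lowest 12 bits
    return f"{start:04X}-{start + 0xFFF:04X}"

def plane_number(code_point):
    """Return the plane (0 = Basic Multilingual Plane) of a code point."""
    return code_point >> 16

ranges = [chart_range(cp) for cp in (ord("A"), 0x4E2D, 0x1D11E, 0xE0001)]
print(ranges, plane_number(0x1D11E))
```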

Relationship between Unicode and ISO 10646

External links


Unicode related topics
ISO 10646 Universal Character Set | UTF-7 | UTF-8 | UTF-16/UCS-2 | UTF-32/UCS-4
Unicode code charts | Basic Multilingual Plane | Supplementary planes | Second plane characters | CJK Unified Ideographs | IICore

 
