是否有办法将统一代码字典的字体转换到其等效法8的实体?
e. 0xf067 (
as unicode html entities) to greek small letter gamma (U+03B3 aka γ
in standard utf8)
是否有办法将统一代码字典的字体转换到其等效法8的实体?
e. 0xf067 (
as unicode html entities) to greek small letter gamma (U+03B3 aka γ
in standard utf8)
最后,我设法利用文号在统一编码图上填写。
rel=“noreferer” http://www.stat.auckland.ac.nz/~paul/R/CM/AdobeSym.html。
然后从单编码转换为与iconv(作为单编码与设计;>utf8)相转换。
function symbol_sanitize_string($string) {
return preg_replace_callback( /([x{f020}-x{f0fe}]{1})/u ,
symbol_unicode_to_utf8_entity , $string);
}
function symbol_unicode_to_utf8_entity($wchar) {
static $_SYMBOL_FONT_MAP = array (
15696032 => 32,
15696033 => 33,
15696034 => 8704,
15696035 => 35,
15696036 => 8707,
15696037 => 37,
15696038 => 38,
15696039 => 8715,
15696040 => 40,
15696041 => 41,
15696042 => 8727,
15696043 => 43,
15696044 => 44,
15696045 => 8722,
15696046 => 46,
15696047 => 47,
15696048 => 48,
15696049 => 49,
15696050 => 50,
15696051 => 51,
15696052 => 52,
15696053 => 53,
15696054 => 54,
15696055 => 55,
15696056 => 56,
15696057 => 57,
15696058 => 58,
15696059 => 59,
15696060 => 60,
15696061 => 61,
15696062 => 62,
15696063 => 63,
15696256 => 8773,
15696257 => 913,
15696258 => 914,
15696259 => 935,
15696260 => 916,
15696261 => 917,
15696262 => 934,
15696263 => 915,
15696264 => 919,
15696265 => 921,
15696266 => 977,
15696267 => 922,
15696268 => 923,
15696269 => 924,
15696270 => 925,
15696271 => 927,
15696272 => 928,
15696273 => 920,
15696274 => 929,
15696275 => 931,
15696276 => 932,
15696277 => 933,
15696278 => 962,
15696279 => 937,
15696280 => 926,
15696281 => 936,
15696282 => 918,
15696283 => 91,
15696284 => 8756,
15696285 => 93,
15696286 => 8869,
15696287 => 95,
15696288 => 63717,
15696289 => 945,
15696290 => 946,
15696291 => 967,
15696292 => 948,
15696293 => 949,
15696294 => 966,
15696295 => 947,
15696296 => 951,
15696297 => 953,
15696298 => 981,
15696299 => 954,
15696300 => 955,
15696301 => 956,
15696302 => 957,
15696303 => 959,
15696304 => 960,
15696305 => 952,
15696306 => 961,
15696307 => 963,
15696308 => 964,
15696309 => 965,
15696310 => 982,
15696311 => 969,
15696312 => 958,
15696313 => 968,
15696314 => 950,
15696315 => 123,
15696316 => 124,
15696317 => 125,
15696318 => 8764,
15696544 => 8364,
15696545 => 978,
15696546 => 8242,
15696547 => 8804,
15696548 => 8260,
15696549 => 8734,
15696550 => 402,
15696551 => 9827,
15696552 => 9830,
15696553 => 9829,
15696554 => 9824,
15696555 => 8596,
15696556 => 8592,
15696557 => 8593,
15696558 => 8594,
15696559 => 8595,
15696560 => 176,
15696561 => 177,
15696562 => 8243,
15696563 => 8805,
15696564 => 215,
15696565 => 8733,
15696566 => 8706,
15696567 => 8226,
15696568 => 247,
15696569 => 8800,
15696570 => 8801,
15696571 => 8776,
15696572 => 8230,
15696573 => 63718,
15696574 => 63719,
15696575 => 8629,
15696768 => 8501,
15696769 => 8465,
15696770 => 8476,
15696771 => 8472,
15696772 => 8855,
15696773 => 8853,
15696774 => 8709,
15696775 => 8745,
15696776 => 8746,
15696777 => 8835,
15696778 => 8839,
15696779 => 8836,
15696780 => 8834,
15696781 => 8838,
15696782 => 8712,
15696783 => 8713,
15696784 => 8736,
15696785 => 8711,
15696786 => 63194,
15696787 => 63193,
15696788 => 63195,
15696789 => 8719,
15696790 => 8730,
15696791 => 8901,
15696792 => 172,
15696793 => 8743,
15696794 => 8744,
15696795 => 8660,
15696796 => 8656,
15696797 => 8657,
15696798 => 8658,
15696799 => 8659,
15696800 => 9674,
15696801 => 9001,
15696802 => 63720,
15696803 => 63721,
15696804 => 63722,
15696805 => 8721,
15696806 => 63723,
15696807 => 63724,
15696808 => 63725,
15696809 => 63726,
15696810 => 63727,
15696811 => 63728,
15696812 => 63729,
15696813 => 63730,
15696814 => 63731,
15696815 => 63732,
15696817 => 9002,
15696818 => 8747,
15696819 => 8992,
15696820 => 63733,
15696821 => 8993,
15696822 => 63734,
15696823 => 63735,
15696824 => 63736,
15696825 => 63737,
15696826 => 63738,
15696827 => 63739,
15696828 => 63740,
15696829 => 63741,
15696830 => 63742,
);
$conv = hexdec(bin2hex($wchar[1]));
$utf8 = $_SYMBOL_FONT_MAP[$conv];
return $utf8 ? &# . $utf8 . ; : $wchar[1];
}
The Adobe Symbol encoding to unicode mappings can be found here: ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/ADOBE/symbol.txt
I made a python3 class that lets you convert at the char level. Check it out on github: https://github.com/Oufattole/Adobe-Symbol-Encodings-to-Unicode
看来,你只能拿Adobe编码符号的最后两位数,而单编码制图的象征(从第一个链接)可以绘制成正确的单代码。
http://www.stat.auckland.ac.nz/%7Epaul/R/CM/AdobeSym.html” rel=“nofollow noretinger”AdobeSymb。
function symbol_unicode_to_utf8_entity($wchar) {
static $_SYMBOL_FONT_MAP = array (
15696032 => "#32",
15696033 => "#33",
15696034 => "#8704",
15696035 => "#35",
15696036 => "#8707",
15696037 => "#37",
15696038 => "#38",
15696039 => "#8715",
15696040 => "#40",
15696041 => "#41",
15696042 => "#8727",
15696043 => "#43",
15696044 => "#44",
15696045 => "#8722",
15696046 => "#46",
15696047 => "#47",
15696048 => "#48",
15696049 => "#49",
15696050 => "#50",
15696051 => "#51",
15696052 => "#52",
15696053 => "#53",
15696054 => "#54",
15696055 => "#55",
15696056 => "#56",
15696057 => "#57",
15696058 => "#58",
15696059 => "#59",
15696060 => "#60",
15696061 => "#61",
15696062 => "#62",
15696063 => "#63",
15696256 => "#8773",
15696257 => "#913",
15696258 => "#914",
15696259 => "#935",
15696260 => "#916",
15696261 => "#917",
15696262 => "#934",
15696263 => "#915",
15696264 => "#919",
15696265 => "#921",
15696266 => "#977",
15696267 => "#922",
15696268 => "#923",
15696269 => "#924",
15696270 => "#925",
15696271 => "#927",
15696272 => "#928",
15696273 => "#920",
15696274 => "#929",
15696275 => "#931",
15696276 => "#932",
15696277 => "#933",
15696278 => "#962",
15696279 => "#937",
15696280 => "#926",
15696281 => "#936",
15696282 => "#918",
15696283 => "#91",
15696284 => "#8756",
15696285 => "#93",
15696286 => "#8869",
15696287 => "#95",
15696288 => "#63717",
15696289 => "#945",
15696290 => "#946",
15696291 => "#967",
15696292 => "#948",
15696293 => "#949",
15696294 => "#966",
15696295 => "#947",
15696296 => "#951",
15696297 => "#953",
15696298 => "#981",
15696299 => "#954",
15696300 => "#955",
15696301 => "#956",
15696302 => "#957",
15696303 => "#959",
15696304 => "#960",
15696305 => "#952",
15696306 => "#961",
15696307 => "#963",
15696308 => "#964",
15696309 => "#965",
15696310 => "#982",
15696311 => "#969",
15696312 => "#958",
15696313 => "#968",
15696314 => "#950",
15696315 => "#123",
15696316 => "#124",
15696317 => "#125",
15696318 => "#8764",
15696544 => "#8364",
15696545 => "#978",
15696546 => "#8242",
15696547 => "#8804",
15696548 => "#8260",
15696549 => "#8734",
15696550 => "#402",
15696551 => "#9827",
15696552 => "#9830",
15696553 => "#9829",
15696554 => "#9824",
15696555 => "#8596",
15696556 => "#8592",
15696557 => "#8593",
15696558 => "#8594",
15696559 => "#8595",
15696560 => "#176",
15696561 => "#177",
15696562 => "#8243",
15696563 => "#8805",
15696564 => "#215",
15696565 => "#8733",
15696566 => "#8706",
15696567 => "#8226",
15696568 => "#247",
15696569 => "#8800",
15696570 => "#8801",
15696571 => "#8776",
15696572 => "#8230",
15696573 => "#63718",
15696574 => "#63719",
15696575 => "#8629",
15696768 => "#8501",
15696769 => "#8465",
15696770 => "#8476",
15696771 => "#8472",
15696772 => "#8855",
15696773 => "#8853",
15696774 => "#8709",
15696775 => "#8745",
15696776 => "#8746",
15696777 => "#8835",
15696778 => "#8839",
15696779 => "#8836",
15696780 => "#8834",
15696781 => "#8838",
15696782 => "#8712",
15696783 => "#8713",
15696784 => "#8736",
15696785 => "#8711",
15696786 => "#63194",
15696787 => "#63193",
15696788 => "#63195",
15696789 => "#8719",
15696790 => "#8730",
15696791 => "#8901",
15696792 => "#172",
15696793 => "#8743",
15696794 => "#8744",
15696795 => "#8660",
15696796 => "#8656",
15696797 => "#8657",
15696798 => "#8658",
15696799 => "#8659",
15696800 => "#9674",
15696801 => "#9001",
15696802 => "#63720",
15696803 => "#63721",
15696804 => "#63722",
15696805 => "#8721",
15696806 => "#63723",
15696807 => "#63724",
15696808 => "#63725",
15696809 => "#63726",
15696810 => "#63727",
15696811 => "#63728",
15696812 => "#63729",
15696813 => "#63730",
15696814 => "#63731",
15696815 => "#63732",
15696817 => "#9002",
15696818 => "#8747",
15696819 => "#8992",
15696820 => "#63733",
15696821 => "#8993",
15696822 => "#63734",
15696823 => "#63735",
15696824 => "#63736",
15696825 => "#63737",
15696826 => "#63738",
15696827 => "#63739",
15696828 => "#63740",
15696829 => "#63741",
15696830 => "#63742",
36 => "exist",
38 => "amp",
39 => "ni",
42 => "lowast",
45 => "minus",
60 => "lt",
62 => "gt",
64 => "cong",
65 => "Alpha",
66 => "Beta",
67 => "Chi",
68 => "Delta",
69 => "Epsilon",
70 => "Phi",
71 => "Gamma",
72 => "Eta",
73 => "Iota",
74 => "thetasym",
75 => "Kappa",
76 => "Lambda",
77 => "Mu",
78 => "Nu",
79 => "Omicron",
80 => "Pi",
81 => "Theta",
82 => "Rho",
83 => "Sigma",
84 => "Tau",
85 => "Upsilon",
86 => "sigmaf",
87 => "Omega",
88 => "Xi",
89 => "Psi",
90 => "Zeta",
92 => "there4",
94 => "perp",
97 => "alpha",
98 => "beta",
99 => "chi",
100 => "delta",
101 => "epsilon",
102 => "phi",
103 => "gamma",
104 => "eta",
105 => "iota",
106 => "ϕ",
107 => "kappa",
108 => "lambda",
109 => "mu",
110 => "nu",
111 => "omicron",
112 => "pi",
113 => "theta",
114 => "rho",
115 => "sigma",
116 => "tau",
117 => "upsilon",
118 => "piv",
119 => "omega",
120 => "xi",
121 => "psi",
122 => "zeta",
126 => "sim",
160 => "euro",
161 => "upsih",
162 => "prime",
163 => "le",
164 => "frasl",
165 => "infin",
166 => "fnof",
167 => "clubs",
168 => "diams",
169 => "hearts",
170 => "spades",
171 => "harr",
172 => "larr",
173 => "uarr",
174 => "rarr",
175 => "darr",
176 => "deg",
177 => "plusmn",
178 => "Prime",
179 => "ge",
180 => "times",
181 => "prop",
182 => "part",
183 => "bull",
184 => "divide",
185 => "ne",
186 => "equiv",
187 => "asymp",
188 => "hellip",
191 => "crarr",
192 => "alefsym",
193 => "image",
194 => "real",
195 => "weierp",
196 => "otimes",
197 => "oplus",
198 => "empty",
199 => "cap",
200 => "cup",
201 => "sup",
202 => "supe",
203 => "nsub",
204 => "sub",
205 => "sube",
206 => "isin",
207 => "notin",
208 => "ang",
209 => "nabla",
210 => "reg",
211 => "copy",
212 => "trade",
213 => "prod",
214 => "radic",
215 => "sdot",
216 => "not",
217 => "and",
218 => "or",
219 => "hArr",
220 => "lArr",
221 => "uArr",
222 => "rArr",
223 => "dArr",
224 => "loz",
225 => "lang",
229 => "sum",
241 => "rang",
242 => "int",
);
$conv = hexdec(bin2hex($wchar));
$utf8 = $_SYMBOL_FONT_MAP[$conv];
return $utf8 ? & . $utf8 . ; : $wchar;
}
// Usage.
// Assume letter from format like:
// <span style="font-family:Symbol;">m</span>
// Intermediate processing: "m" => "μ"
$intermediate = symbol_unicode_to_utf8_entity( m );
// Output UTF character "μ"
echo html_entity_decode($intermediate, ENT_COMPAT | ENT_HTML401, UTF-8 );
www.un.org/Depts/DGACM/index_spanish.htm 以前的文字并没有改变某些特性,如......。 因此,我撰写了一份新文字,将非单条代码符号特性的直接贬值与单条编码(摘自rel=“nofollow noreferer”>。
function symbol_sanitize_string($string) {
//important if you re use DOM to load html
$string = preg_replace( /charset=iso-8859-1/ , charset=utf-8 , $string );
//replace font symbols
$string = preg_replace_callback( /&#(61d+?);/i , symbol_alone2utf8 , $string);
return $string;
}
function symbol_alone2utf8( $match ){
return symbol2utf8( $match[1] );
}
function symbol2utf8( $decimal ) {
$_Symbol = array(
61472 => 020 ,
61473 => 021 ,
61474 => 022 ,
61475 => 023 ,
61476 => 024 ,
61477 => 025 ,
61478 => 026 ,
61479 => 027 ,
61480 => 028 ,
61481 => 029 ,
61482 => 02A ,
61483 => 02B ,
61484 => 02C ,
61485 => 02D ,
61486 => 02E ,
61487 => 02F ,
61488 => 030 ,
61489 => 031 ,
61490 => 032 ,
61491 => 033 ,
61492 => 034 ,
61493 => 035 ,
61494 => 036 ,
61495 => 037 ,
61496 => 038 ,
61497 => 039 ,
61498 => 03A ,
61499 => 03B ,
61500 => 03C ,
61501 => 03D ,
61502 => 03E ,
61503 => 03F ,
61504 => 040 ,
61505 => 041 ,
61506 => 042 ,
61507 => 043 ,
61508 => 044 ,
61509 => 045 ,
61510 => 046 ,
61511 => 047 ,
61512 => 048 ,
61513 => 049 ,
61514 => 04A ,
61515 => 04B ,
61516 => 04C ,
61517 => 04D ,
61518 => 04E ,
61519 => 04F ,
61520 => 050 ,
61521 => 051 ,
61522 => 052 ,
61523 => 053 ,
61524 => 054 ,
61525 => 055 ,
61526 => 056 ,
61527 => 057 ,
61528 => 058 ,
61529 => 059 ,
61530 => 05A ,
61531 => 05B ,
61532 => 05C ,
61533 => 05D ,
61534 => 05E ,
61535 => 05F ,
61536 => 060 ,
61537 => 061 ,
61538 => 062 ,
61539 => 063 ,
61540 => 064 ,
61541 => 065 ,
61542 => 066 ,
61543 => 067 ,
61544 => 068 ,
61545 => 069 ,
61546 => 06A ,
61547 => 06B ,
61548 => 06C ,
61549 => 06D ,
61550 => 06E ,
61551 => 06F ,
61552 => 070 ,
61553 => 071 ,
61554 => 072 ,
61555 => 073 ,
61556 => 074 ,
61557 => 075 ,
61558 => 076 ,
61559 => 077 ,
61560 => 078 ,
61561 => 079 ,
61562 => 07A ,
61563 => 07B ,
61564 => 07C ,
61565 => 07D ,
61566 => 07E ,
61601 => 0A1 ,
61602 => 0A2 ,
61603 => 0A3 ,
61604 => 0A4 ,
61605 => 0A5 ,
61606 => 0A6 ,
61607 => 0A7 ,
61608 => 0A8 ,
61609 => 0A9 ,
61610 => 0AA ,
61611 => 0AB ,
61612 => 0AC ,
61613 => 0AD ,
61614 => 0AE ,
61615 => 0AF ,
61616 => 0B0 ,
61617 => 0B1 ,
61618 => 0B2 ,
61619 => 0B3 ,
61620 => 0B4 ,
61621 => 0B5 ,
61622 => 0B6 ,
61623 => 0B7 ,
61624 => 0B8 ,
61625 => 0B9 ,
61626 => 0BA ,
61627 => 0BB ,
61628 => 0BC ,
61629 => 0BD ,
61630 => 0BE ,
61631 => 0BF ,
61632 => 0C0 ,
61633 => 0C1 ,
61634 => 0C2 ,
61635 => 0C3 ,
61636 => 0C4 ,
61637 => 0C5 ,
61638 => 0C6 ,
61639 => 0C7 ,
61640 => 0C8 ,
61641 => 0C9 ,
61642 => 0CA ,
61643 => 0CB ,
61644 => 0CC ,
61645 => 0CD ,
61646 => 0CE ,
61647 => 0CF ,
61648 => 0D0 ,
61649 => 0D1 ,
61650 => 0D2 ,
61651 => 0D3 ,
61652 => 0D4 ,
61653 => 0D5 ,
61654 => 0D6 ,
61655 => 0D7 ,
61656 => 0D8 ,
61657 => 0D9 ,
61658 => 0DA ,
61659 => 0DB ,
61660 => 0DC ,
61661 => 0DD ,
61662 => 0DE ,
61663 => 0DF ,
61664 => 0E0 ,
61665 => 0E1 ,
61666 => 0E2 ,
61667 => 0E3 ,
61668 => 0E4 ,
61669 => 0E5 ,
61670 => 0E6 ,
61671 => 0E7 ,
61672 => 0E8 ,
61673 => 0E9 ,
61674 => 0EA ,
61675 => 0EB ,
61676 => 0EC ,
61677 => 0ED ,
61678 => 0EE ,
61679 => 0EF ,
61681 => 0F1 ,
61682 => 0F2 ,
61683 => 0F3 ,
61684 => 0F4 ,
61685 => 0F5 ,
61686 => 0F6 ,
61687 => 0F7 ,
61688 => 0F8 ,
61689 => 0F9 ,
61690 => 0FA ,
61691 => 0FB ,
61692 => 0FC ,
61693 => 0FD ,
61694 => 0FE
);
$key = $decimal;
if( array_key_exists( $key, $_Symbol ) ){
if( $key <= 61487 )
$c = 0 ;
else
$c = f ;
$char = json_decode( "u . $c . $_Symbol[ $key ] . " );
return $char;
}else{
echo "char not found <br>
";
return "&#$decimal;";
}
}
这一文字经过测试,将大剂量文档转换为html,数学符号很高。
利用该法典从html档案中测试:
$string = file_get_contents( testfile.html );
$string = symbol_sanitize_string( $string );
echo $string;
//ensure that string is UTF-8 encoded
echo mb_detect_encoding($string) . "<br>";
I am trying to write a script to prevent brute-force login attempts in a website I m building. The logic goes something like this: User sends login information. Check if username and password is ...
<?php $con=mysql_connect("localhost","mts","mts"); if(!con) { die( unable to connect . mysql_error()); } mysql_select_db("mts",$con); /* date_default_timezone_set ("Asia/Calcutta"); $date = ...
I found this script online that creates a thumbnail out of a image but the thumbnail image is created with poor quality how can I improve the quality of the image. And is there a better way to create ...
如何确认来自正确来源的数字。
Most sites need some way to show the dates on the site in the users preferred timezone. Below are two lists that I found and then one method using the built in PHP DateTime class in PHP 5. I need ...
I wonder there is a way to post a message to a facebook business page with cURL? thanks
I want to create text as a watermark for an image. the water mark should have the following properties front: Impact color: white opacity: 31% Font style: regular, bold Bevel and Emboss size: 30 ...
How does php cast boolean variables? I was trying to save a boolean value to an array: $result["Users"]["is_login"] = true; but when I use debug the is_login value is blank. and when I do ...