<?php //Non-English Character Matching: //The "character set" block (square brackets) allow us to match characters, but since we can only range english chars, one trick is to use ASCII or UNICODE matching like this: //ASCII matching can be performed like this: preg_match('/[\x00-\x80]+/', $str); //Unicode matching can be performed like this: preg_match('/[^\u0000-\u0080]+/', $str); //To our case, to match only non-english chars use: preg_match('/[^\x00-\x80]+/', $str); //To match ALL chars (both english & non english & some non-chars as well, perhaps) use: preg_match('/[a-zA-Z\x00-\xFF]+/', $str); // Chinese characters range in unicode (verify?) preg_match('/^[u4e00-u9fa5],{0,}$/', $str); ?>
Tuesday, January 19, 2010
Only Non-English Character Matching
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment