regex - How to match a hex sequence of characters and replace it with white space in PHP -


i have text need clean characters. characters showed in pictures attached question. want replace them white space x20.

first hex sequence

second hex sequence

my attempt use preg_replace.

$result = preg_replace("/[\xef\x82\xac\x09|\xef\x81\xa1\x09]/", "\x20", $string); 

for particular case approach works, cases won't, because example had text comma , matched x82 , removed text.

how write regex search exact sequence ef 82 ac 09, or other 1 ef 81 a1 09, , not each pair separately ef 82 ac 09?

1.) match of 6 different hex bytes or pipe character in character class. wanted use group (?:...|...) matching different byte sequences.

2.) byte sequences not match image. seems messed 2 bytes. picture shows: ef 82 a1 09 , ef 81 ac 09 vs try: \xef\x82\xac\x09 | \xef\x81\xa1\x09

3.) when testing input sample

$str = "de la nouvelle;      fourniture $         option :";  foreach(preg_split("//u", $str) $v) {   var_dump($v, bin2hex($v)); echo "\n"; } 

it turned out, 09 much. characters removed ef81ac , ef82a1. right regex (?:\xef\x81\xac|\xef\x82\xa1)

$result = preg_replace("/(?:\xef\x81\xac|\xef\x82\xa1)/", "\x20", $string); 

see test @ eval.in


Comments

Popular posts from this blog

javascript - Karma not able to start PhantomJS on Windows - Error: spawn UNKNOWN -

Nuget pack csproj using nuspec -

c# - Display ASPX Popup control in RowDeleteing Event (ASPX Gridview) -