The character B can be either of two different code points:
HEX code = 0x42 (real letter) or 0x412 (fake letter)
DEC code = 66 (real letter) or 1042 (fake letter)
HTML with named char ref = B (real letter) or &Vcy; (fake letter)
Java string = B (real letter) or \u0412 (fake letter)
When I fetch content from a remote URL with cURL, both variants look like the letter B on macOS, but the character may not be the real Latin B. I use an online tool to check whether a letter is real or not.
This code fixes it for one letter:
$content = str_replace("В", "B", $content);
But how can I do the same in PHP for all the other illegal (look-alike) characters?
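One possible direction, as a minimal sketch only: extend the single str_replace call into a lookup table and apply it with strtr. The function name replaceLookalikes and the table are hypothetical. The table covers only a handful of Cyrillic letters that resemble Latin ones, so it is not a complete list of look-alikes, and the code assumes $content is UTF-8.

```php
<?php
// Hypothetical helper: maps a few Cyrillic look-alike letters to Latin ones.
// The table is illustrative and incomplete; it assumes UTF-8 input.
function replaceLookalikes(string $content): string
{
    static $map = [
        "\u{0410}" => 'A', // Cyrillic А
        "\u{0412}" => 'B', // Cyrillic В
        "\u{0415}" => 'E', // Cyrillic Е
        "\u{041A}" => 'K', // Cyrillic К
        "\u{041C}" => 'M', // Cyrillic М
        "\u{041D}" => 'H', // Cyrillic Н
        "\u{041E}" => 'O', // Cyrillic О
        "\u{0420}" => 'P', // Cyrillic Р
        "\u{0421}" => 'C', // Cyrillic С
        "\u{0422}" => 'T', // Cyrillic Т
        "\u{0425}" => 'X', // Cyrillic Х
        "\u{0430}" => 'a', // Cyrillic а
        "\u{0435}" => 'e', // Cyrillic е
        "\u{043E}" => 'o', // Cyrillic о
        "\u{0440}" => 'p', // Cyrillic р
        "\u{0441}" => 'c', // Cyrillic с
        "\u{0445}" => 'x', // Cyrillic х
    ];

    // strtr with an array replaces every key found in the string in one pass.
    return strtr($content, $map);
}

// Usage: the Cyrillic В (U+0412) becomes the Latin B (0x42).
echo bin2hex(replaceLookalikes("\u{0412}")); // prints 42
```

Note that a blanket replacement like this would also rewrite genuine Cyrillic text, so it only makes sense when the content is expected to be Latin-only.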