We have to search and replace high-bit characters sometimes. Here’s what we do:

 

Find/Replace in Files

Enable the Regular expression option

Use this code:

[^\x00-\x7F]

If that gets garbled in posting, it’s

left square bracket

caret

backslash

x00-

backslash

x7F

right square bracket

 

On 31.01.2021 00:30, Roger L Costello wrote:

> Hi Folks,

> 

> I have a large XHTML file. Apparently, somewhere in the file is an invalid Unicode character because when I drag-and-drop the file into Oxygen I get the error message:

> 

> Cannot open the specified file: Got a character encoding exception. File encoding (UTF-8) does not support all characters from the current file.

> 

> The problem is this: Where is the invalid UTF-8 character?

> 

> Is there a way to get Oxygen to tell me where -- what character position -- it detected the invalid Unicode character?

>