I'm sure this is an encoding issue, but I can't figure it out.
I exported a spreadsheet from Excel as a UTF-8 CSV. This produced a CSV in the UTF-8-BOM character encoding. Opening this file in Notepad++, most of the characters were rendered correctly - including non ANSI characters like ø. However, a hyphen ( ‐ ) is displayed as ☐.
I believe the character is U+2010 ‐ HYPHEN.
If I open the file in Notepad, the hyphen displays correctly. It also displays correctly if I use Vim to read the file or cat to print it out to the terminal.
Finally, the octal dump of the file reveals the hex bytes e2 80 90, which is the UTF-8 encoding of the U+2010 - HYPHEN Unicode character.
So why is Notepad++ displaying this character as ☐?