Forum: RetroDigital BBS

Encoding issue

From Nelgin@VERT/EOTLBBS to All on Wed Aug 30 12:42:36 2023

Hi all,

I have a problem with character encoding and wonder if someone who deals with this sort of thing more often than I do can help.

On my Linux system I have two files that the file command reports as "Unicode text, UTF-8 text".

While one file correctly displays é (e with an acute accent), the other file displays an A with a tilde above followed by the copyright symbol (Ã©).

Obviously, other characters are also displayed incorrectly.

I've tried various iconv invocations to try to correct the output of the incorrectly displaying file, but with no success.

A hexdump shows that the incorrectly displaying file has the following:
c3 83 c2 a9

Whereas the correctly displayed file has:
c3 a9
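
To make the difference between the two hexdumps concrete, here is a small Python sketch that decodes each byte sequence as UTF-8 and lists the resulting code points. The byte values are copied from the hexdumps above; the helper name codepoints is just for illustration.

```python
# Byte sequences copied from the two hexdumps in the post.
bad = bytes.fromhex("c3 83 c2 a9")   # from the incorrectly displaying file
good = bytes.fromhex("c3 a9")        # from the correctly displaying file


def codepoints(raw: bytes) -> list:
    """Decode raw as UTF-8 and return each character as a U+XXXX string."""
    return [f"U+{ord(ch):04X}" for ch in raw.decode("utf-8")]


print(codepoints(bad))   # ['U+00C3', 'U+00A9']  (A with tilde, copyright sign)
print(codepoints(good))  # ['U+00E9']            (e with acute accent)
```

Both sequences are valid UTF-8, which would explain why file reports the same type for both. The first one simply encodes two characters where the second encodes one.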

So, I'm open to suggestions on how to fix this using some native program rather than having to do search and replace.

Thanks,
---
■ Synchronet ■ End Of The Line BBS - endofthelinebbs.com

From Digital Man@VERT to Nelgin on Wed Aug 30 11:39:30 2023

Re: Encoding issue
By: Nelgin to All on Wed Aug 30 2023 12:42 pm

> Hi all,
>
> I have a problem with character encoding and wonder if someone who deals with this sort of thing more often than I do can help.
>
> On my Linux system I have two files that the file command reports as "Unicode text, UTF-8 text".
>
> While one file correctly displays é (e with an acute accent), the other file displays an A with a tilde above followed by the copyright symbol (Ã©).
>
> Obviously, other characters are also displayed incorrectly.
>
> I've tried various iconv invocations to try to correct the output of the incorrectly displaying file, but with no success.
>
> A hexdump shows that the incorrectly displaying file has the following:
> c3 83 c2 a9

That sounds correct. See the table here: https://www.utf8-chartable.de/unicode-utf8-table.pl

> Whereas the correctly displayed file has:
> c3 a9

That's also correct.

> So, I'm open to suggestions on how to fix this using some native program rather than having to do search and replace.

More background on the problem is needed here, as it sounds like both files contain the correct UTF-8 sequences for the Unicode code points you say are being displayed.
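
One possibility consistent with those bytes, though the thread does not confirm it, is that the second file was encoded twice: the UTF-8 bytes c3 a9 were at some point read as ISO-8859-1 (Latin-1) and then written out as UTF-8 again. Under that assumption, a rough Python sketch of the reverse step could look like this. The function name undo_double_encoding and the default of Latin-1 as the intermediate encoding are assumptions for illustration, not something taken from the original files.

```python
def undo_double_encoding(raw: bytes, legacy: str = "latin-1") -> bytes:
    """Reverse one round of 'UTF-8 misread as `legacy`, then re-encoded as UTF-8'.

    Raises UnicodeError if the data does not fit that pattern, so an
    unaffected file is rejected rather than silently damaged.
    """
    text = raw.decode("utf-8")        # c3 83 c2 a9 -> U+00C3 U+00A9
    repaired = text.encode(legacy)    # U+00C3 U+00A9 -> bytes c3 a9
    repaired.decode("utf-8")          # sanity check: result must be valid UTF-8
    return repaired


fixed = undo_double_encoding(bytes.fromhex("c3 83 c2 a9"))
print(fixed.hex(" "))  # c3 a9
```

If the assumption holds, the same conversion on the command line would be iconv -f UTF-8 -t ISO-8859-1 applied to the bad file, since the output bytes are then the intended UTF-8. If the intermediate encoding was really Windows-1252 rather than Latin-1, some characters would fail to convert, which might be one reason earlier iconv attempts did not work.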
--
digital man (rob)

Synchronet "Real Fact" #20:
Michael Swindell was directly responsible for Synchronet's commercial success
Norco, CA WX: 93.0°F, 33.0% humidity, 1 mph ESE wind, 0.00 inches rain/24hrs
---
■ Synchronet ■ Vertrauen ■ Home of Synchronet ■ [vert/cvs/bbs].synchro.net
