Re: Japanese '者' (U+8005) is replaced with \350\200
From: Karl Berry
Subject: Re: Japanese '者' (U+8005) is replaced with \350\200
Date: Fri, 8 Jan 2021 19:35:19 -0700
UnicodeDecodeError: 'utf-8' codec can't decode bytes in position
78-79: invalid continuation byte$ ip
Sure, I don't doubt the output is invalid utf8 for you. I don't get that
output. Probably something in all these bytes is not being transmitted
properly in email. In any case, I can't take the time to dig into the
wdiff source to figure out what's actually going on, so I'm afraid I'm
of no help.
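For reference, the symptom in the subject line is consistent with a truncated UTF-8 sequence: U+8005 encodes to three bytes, and \350\200 is only the first two. The following is a minimal Python sketch of that arithmetic. It does not use wdiff and says nothing about where the third byte is lost; the helper names `octal_escapes` and `try_decode` are hypothetical, made up for this illustration.

```python
# Illustration only: shows why "\350\200" on its own is invalid UTF-8.
# Nothing here is taken from wdiff; the helper names are hypothetical.

CHAR = "\u8005"  # the character '者'

# UTF-8 encoding of U+8005 is three bytes: E8 80 85.
encoded = CHAR.encode("utf-8")


def octal_escapes(data: bytes) -> str:
    """Render bytes as backslash-octal escapes, e.g. \\350\\200\\205."""
    return "".join(f"\\{b:03o}" for b in data)


def try_decode(data: bytes):
    """Return the decoded string, or the UnicodeDecodeError if decoding fails."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError as exc:
        return exc


# Full sequence: all three bytes are present and decode cleanly.
print(octal_escapes(encoded))
print(try_decode(encoded))

# Truncated sequence: drop the third byte and follow it with ordinary text,
# which is roughly what the reported output looks like.
truncated = encoded[:2] + b" rest of line"
print(octal_escapes(encoded[:2]))
print(try_decode(truncated))
```

If the third byte (\205) is dropped anywhere along the way, whatever follows the remaining two bytes is not a valid continuation byte, and a strict UTF-8 decoder rejects the input.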
In general, since the last release of wdiff was in 2014, it would not be
surprising if there were bugs in the utf-8 handling in the support code
or in wdiff itself. Denver or Martin, are you there? --best, karl.