[Scspamcop] Internationalize SC reports - a little bit...?
Patto
nobody at devnull.spamcop.net
Wed May 2 03:34:01 EDT 2007
I don't know if there is anybody still doing maintenance on the SC code.
If so, I would like to suggest a small effort to stop distorting
non-latin spam source text. For instance...
http://www.spamcop.net/sc?id=z1292523895z968197bd510d7f9a0b1390e94506f070z
If you go and look at the full message body, what do you see? This is
the first paragraph.
[quote]
?SEX?????????!!
????????????????????????????????
???????????????????????????
??????????????????????DHEA????????????????????????????????????(????)???????????????????????????
????????????????????????????????
?????????????????????????????
?????????????????????????
[/quote]
If I look at the original spam message, I see the following:
[quote]
◆SEXをすると健康になる!!
セックスによってテストステロンという男性ホルモンの分泌が増えると
骨や筋肉がじょうぶになり、善玉コレステロールも増える。
あるいはオルガスムスに達する直前に分泌されるDHEAというホルモンには、
刺激に対する認知能力を高め免疫システムを強化し、腫瘍(しゅよう)の成長を
押さえたり、骨の育成を助けたりする働きがある。
女性の場合は、エストロゲンが分泌され、骨と心臓の血管を強くする。
以上のように、セックスによって各種ホルモンの分泌が促され、
心臓を守り寿命を延ばす効果が確認されているのです。
[/quote]
The recipient(s) of the SC report will only get a bunch of question
marks, which is *not* what the original spam contained. The recipient(s)
may or may not correctly guess if the message is or is not spam. They
have no way of resurrecting the original data.
There would be such a tiny little effort to preserve the original
encoding, as found in this header line
"Content-Type: text/plain; charset=ISO-2022-JP"
If this is set to the same in the outgoing SC report, then the
recipients would see the *original* source data, not question marks.
Now I realize that in this particular example the headers themselves are
distorted by incorrectly inserted blank lines. But I doubt if it would
make any difference if these were omitted.
More information about the SCspamcop
mailing list