[PLUG] Fwd: Cron <michael at centos6> ~/spam_check.bash
Paul Heinlein
heinlein at madboa.com
Thu Jul 18 22:47:37 UTC 2019
On Thu, 18 Jul 2019, Michael C Robinson wrote:
> I wrote a simple script that greps an mbox for the subjects of every
> email in it. Problem is, a lot of these subjects are htmlized or
> something similar and are not plain text. Any suggestions on
> alternative approaches to extracting the subjects of every email in
> the spam box and emailing them to myself?
Python, among other tools, will decode the UTF-8 headers for you,
e.g.,
Python 2.7.10 (default, Oct 6 2017, 22:29:07)
[GCC 4.2.1 Compatible Apple LLVM 9.0.0 (clang-900.0.31)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import email
>>> from email.header import decode_header
>>> decode_header(u'=?UTF-8?B?U3RpY2sgdGhpcyB0byB5b3VyIHNraW4gYW5kIG1lbHQgMWxiL2RheS4uLg==?=')
[('Stick this to your skin and melt 1lb/day...', 'utf-8')]
>>>
--
Paul Heinlein
heinlein at madboa.com
45°38' N, 122°6' W
More information about the PLUG
mailing list