Hi,
I've been trying to make a regex to match the charset of mime multipart emails so as I can decode them correctly. However I've found that there are some differences in the format that I can't seem to work out a regex for, as I'm no expert.
currently I'm using (?<=charset=).*(?=;)
however the examples I've found by sending emails from different clients are:
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
charset=US-ASCII;
Content-Type: text/plain; charset=iso-8859-1
So my regex works on first two but not the last, however if I remove (?=;)
then I will also match the format=flowed
part, which I don't want.
any ideas?