I've written a Ruby script that reads a file (File.read()) containing Unicode characters, and it works fine from the command line.
However, when I put it into an Automator Workflow (Mac OS X), I get this error:
2009-12-23 17:55:15 -0500: /Users/jeffreyaylesworth/bin/symbols:19:in `split': invalid byte sequence in US-ASCII (ArgumentError)
(traceback)
So when run from Automator, split suddenly doesn't like non-ASCII characters. As far as I can tell, both environments use the same version of Ruby (the version number is the same).
I'm not too concerned about why they behave differently (though if someone knows, that's great), but I would like a way to make split accept non-ASCII characters.
If it helps, I need to split text at a single character into two pieces, so if something that's similar to C's tokenizer would work, I can use that.
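For reference, here is a minimal sketch of the kind of thing I'm after. It assumes the file is UTF-8 and that the difference is Automator running without a UTF-8 locale, so Ruby tags the file contents as US-ASCII; I haven't confirmed that this is the cause. The helper name split_file_once and the sample text are made up for illustration and are not from my script.

```ruby
# encoding: utf-8
require "tempfile"

# Hypothetical helper (not from the original script): read the file as
# UTF-8 regardless of the locale, then split it once at the separator.
def split_file_once(path, separator)
  # Assumption: the file really is UTF-8. Passing the encoding explicitly
  # avoids relying on Encoding.default_external.
  text = File.read(path, encoding: "UTF-8")
  # A limit of 2 yields at most two pieces: before and after the first match.
  text.split(separator, 2)
end

# Usage with made-up sample data.
Tempfile.create("symbols") do |f|
  File.binwrite(f.path, "α→β")
  left, right = split_file_once(f.path, "→")
  puts left
  puts right
end
```

If the string has already been read, calling force_encoding("UTF-8") on it before split should have a similar effect, since it only relabels the bytes without converting them.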