ansaurus

Question

Answer 1

The code you posted is presumably the result of botched cut-and-paste operations, because it's clearly wrong in both versions (`f.read()` fails because there's no barename `f` defined).

In Py3, `ur = response.decode('utf8')` works perfectly well for me, as does the following `json.loads(ur)`. Maybe the botched copy-and-pastes affected your 2-to-3 conversion attempts.
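A minimal sketch of that decode-then-parse step in Py3. The `response` bytes below are a made-up stand-in for whatever `urlopen(...).read()` returned in the question, so the snippet runs without network access:

```python
import json

# Hypothetical payload: stands in for the bytes read from the HTTP response.
response = '{"title": "caf\u00e9", "score": 2}'.encode('utf8')

ur = response.decode('utf8')   # bytes -> str
data = json.loads(ur)          # str -> dict

# ascii() escapes non-ASCII characters, so this print works on any console.
print(ascii(data))
```

If this much works, the data itself has been fetched and parsed correctly, and any remaining error comes from a later step such as printing.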

Alex Martelli 2010-06-28 00:06:03

Whoops, I will fix the code mistakes... I tried reformatting it for display but screwed it all up in the process. :P Regardless, I can't view the data after I parse it (using a simple `print(data)`) because it gives me charmap errors.

Daniel Lew 2010-06-28 00:08:02

@Daniel, the problems _after_ you've gotten the data seem to be a separate question from this one about getting the data (which my answer, it appears, responded to -- though seemingly you don't agree, since you didn't even upvote it!). If by `data` you mean the `json.loads(response)`, I can `print` it without any problem (on my Mac Terminal.app, which supports UTF-8). What's your `sys.stdout.encoding`? Have you properly set the environment variable `PYTHONIOENCODING` (documented as "Encoding[:errors] used for stdin/stdout/stderr") before starting Python 3? Etc, etc -- totally different issues, see.

Alex Martelli 2010-06-28 01:26:41

Sorry if I was unclear at first. The core problem is I can't *use* the data after parsing, for whatever reason (the print is just the beginning of it; if I can't print it, then somewhere down the line I'm going to run into trouble reading the data). I'll check out the encoding; suffice it to say it doesn't work on my Windows 7 machine.

Daniel Lew 2010-06-28 13:17:03

Alex Martelli 2010-06-28 13:58:42

If it were just the output capability of the Windows terminal, then why does the code work in Python 2?

Daniel Lew 2010-06-28 14:16:26

@Daniel, perhaps by a different setting of `sys.stdout.encoding` (e.g. via `PYTHONIOENCODING`, etc) -- I've already asked about that and I've heard nothing from you in response in this interminable thread of comments you insist on perpetuating. Why not just `print(repr(data))` in both cases and check if anything is different? If not, then you **know** it's all about output/terminal issues, as I suspect it may well be -- if there are specific differences, then of course let us know (by editing your Q please, **not** in yet another cramped comment!-).
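A rough sketch of that check, with made-up sample data rather than the real reddit payload. One thing to keep in mind: in Python 3, `repr()` leaves printable non-ASCII characters unescaped (PEP 3138), whereas `ascii()` escapes them the way Python 2's `repr()` did, so `ascii()` is the variant that is safe to print on a limited console:

```python
# Hypothetical sample: a dict containing characters that cp437 cannot represent.
data = {"title": "\u4e2d\u6587"}

# ascii() yields pure-ASCII text, so printing it cannot raise a charmap error.
safe = ascii(data)
print(safe)

# Encode the plain string form the way a cp437 console would have to.
try:
    str(data).encode("cp437")
    encodable = True
except UnicodeEncodeError:
    encodable = False

print("encodable in cp437:", encodable)
```

If the escaped output matches between the two interpreters, the parsed data is the same and the difference lies purely in how each version writes it to the terminal.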

Alex Martelli 2010-06-28 14:32:30

I can't test the code at the moment anyways because reddit itself is down; once I can I'll edit the question with details. I do know that the sys.stdout.encoding is the same between my 2.6 and 3.1 instances (cp437, which I could try setting to something else).

Daniel Lew 2010-06-28 14:40:04

@Daniel, CP437 (like most code pages) can only show a tiny subset of Unicode characters. Type `chcp 65001` into the Windows console (this sets the code page to UTF-8) and change the terminal font to a Unicode font: right-click the title bar, Properties, Font, Lucida Console; then `SET PYTHONIOENCODING=utf8`.
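To confirm the variable actually takes effect, here is a small sketch that starts a child interpreter with `PYTHONIOENCODING` set and asks it for its `sys.stdout.encoding`. Running it through `subprocess` is just a convenience for the illustration; setting the variable in the console before launching Python has the same effect:

```python
import os
import subprocess
import sys

# Copy the current environment and add the override for the child process only.
env = dict(os.environ, PYTHONIOENCODING="utf8")

result = subprocess.run(
    [sys.executable, "-c", "import sys; print(sys.stdout.encoding)"],
    env=env,
    capture_output=True,
    text=True,
    check=True,
)

# The exact spelling reported ("utf8" or "utf-8") can vary between versions.
child_encoding = result.stdout.strip()
print(child_encoding)
```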

Alex Martelli 2010-06-28 15:14:21

The PYTHONIOENCODING solved the problem, but I still want to know why it worked in P2 but not P3.

Daniel Lew 2010-06-29 15:10:31

Good luck finding the answer to your philosophical question as the 10th or later answer of this absurd comment thread. What I know for sure is never to even look at a Q of yours again, after this ridiculous series of events: I spot your coding mistakes, correctly spot that your claim "I cannot get the data into a usable state" is instead all about mis-set IO, show you how to set it correctly (all in comments, **incredibly** inconvenient), and _still_ no accept because you're apparently too stubborn to admit this really needs a new Q. What an utter, total waste of my time.

Alex Martelli 2010-06-29 15:28:32

Python 2 vs. Python 3 - urllib formats
