ansaurus

Question

C++ and Embedded Python - NUL Terminated Strings

Answer 1

+1 A:

A few points:

Don't use strings. You might even be able to make them work here with some contortions on *_StringAndSize() functions, but it won't be what you want. You should store your data in a custom data structure (or a buffer) that is just a sequence of bytes (do you really want clients performing string operations on this data in Python?). If your object really is a buffer object, you should use the Buffer API.
Your imported module has a refcount of 2 because it's being held in sys.modules (for efficiency for the next time you try to import it). Never decref references you don't own or you'll crash your program. The Importing Modules section of the documentation should really cover this, but it doesn't.
It's pretty expensive to initialize Python and tear it down every time you do these operations. You should try to reorganize your use case such that you can call Py_Initialize only once when your application starts (or the first time it needs Python), and then only call Py_Finalize when your application is definitely done with Python, or when it quits.
You're being very lazy with error checking - most of the Python C/API functions can return NULL to indicate that an exception has been thrown, and you're almost never checking this value. If something fails you're going to start crashing in very odd places. You can read about this in the Exception Handling section of the C/API manual.

Nick Bastin 2010-10-08 18:03:45

g.d.d.c 2010-10-08 18:53:58

Nick - Also on 1 - The library that's doing PDF Generation expects a file handle to put data into. I'm using a StringIO Instance in Python to stay off the disk, then when I've completed the PDF Generation I retrieve the buffer contents with `outfile.getvalue()`. I am so far unable to convert this to a bytearray - I get an encoding error.

g.d.d.c 2010-10-08 18:55:23

You could implement a very small object in C++ that would look like a file-like object to Python (just by implementing the proper protocol methods) and use that instead of a StringIO instance. That way you can keep the data in memory, but not have to worry about null bytes or API handling, as it'll already be a native C++ data structure when the client returns it to you.

Nick Bastin 2010-10-08 19:55:37

Also, for testing purposes I'd use a real file and make sure you can make that work (and make it robust), before complicating the situation by avoiding writing to disk.

Nick Bastin 2010-10-08 19:56:07

I have working code that uses real files and does not involve any embedding - the PDF Generation is a stand-alone executable compiled with Py2Exe. Using this approach we have bottlenecks related to Disk I/O. Hence the attempt to keep things in memory. Passing the data in as a String seems to work fine, it's only retrieving it that gives me fits. I've tried converting PyString `pOutput` into a PyByteArray as well. `PyByteArray_Size` returns the correct size, but I still can't get it back to a char * to hand it back to the C++ down stream - I only get about 2044 bytes of the 35000 present.

g.d.d.c 2010-10-08 23:03:41

This isn't a problem with Python (and is why you should generally avoid the string API for non-string data on the C side). The problem is that the default std::string constructor stops reading the `char` array at the first `null`, because that's what the spec says it should do. The entire data is *there*, you just need to use the proper constructor: `std::string(const Char *str, size_type length);` (I'm not 100% sure that the C++ spec promises that this conversion will work - it's entirely possible that there is no way to get this data into a C++ string without constructing an iterator)

Nick Bastin 2010-10-08 23:32:59

I was able to get this working using the PyString_AsStringAndSize function and then using assign to place the data into the the variable that gets moved downstream. Also, pulling the call to Py_Initialize() out into a global scope so it's not setup and torn down resolved the crash on files after the first. Appreciate the help.

g.d.d.c 2010-10-11 18:19:40

ansaurus

tags:

views:

answers:

C++ and Embedded Python - NUL Terminated Strings

related questions