ansaurus

Question

Machine dependent _write failures with EINVAL error code

Answer 1

A:

Two thoughts come to mind.. Either you are walking past the end of the buffer, and trying to write that data out, or the allocation of the buffer failed. Problems that, in debug mode, will not be as visible as they are in release mode.

It's probably a bad idea to allocate 250 meg of memory anyways. You'd do better to allocate a fixed size buffer, and do your writing in chunks.

Have you looked for things like Virus Scanners that might have a hold on the file in between your write operations, thus making the write fail?

I know of no limit to the amount of data you can pass to write in a single call, unless (like I said), you are writing data (as part of the buffer) that does not belong to you...

Since most of these functions wrap the Kernel call WriteFile(), (Or NtWriteFile()), there COULD be the condition that there isn't enough Kernel memory to handle the buffer to write. But, THAT I'm not certain of, since I don't know WHEN exactly the code makes the jump from UM to KM.

Don't know if any of this will help, but hope it does...

If you can provide any more details, please do. Sometimes just telling someone about the problem will trigger your brain to go "Wait a minute!", and you'll figure it out. heh..

LarryF 2009-02-25 01:11:40

As far as allocating the memory goes, we do a substantial amount of work to get the malloc off in the first place, and we do handle the cases where malloc fails (which is far prior to this point in the code). So the buffer should be allocated fine.

Matt Jordan 2009-02-25 01:19:27

What does your call to open the file look like? Just curious if that might lend any clues. Do you have a case where the write will always fail the first time, but work the second? Is it a case were IF you set the code up to loop until the write is successful, would it work? Not a fix, but a test?

LarryF 2009-02-25 18:41:40

Unfortunately - and I have to stress that this error is occurring in a 3rd party library that we have to use - they don't use a simple, standard FILE* object or the like. They've actually created their own - which yes, means all bets are off on whats happening.

Matt Jordan 2009-02-25 19:12:49

Maybe not, but they had to use SOMETHING that ended up in a call to CreateFile()... Or, as you describe, probably a call to _open, or open, or whatever, but even those wrap CreateFile... And EINVAL just means an invalid parameter was passed somewhere.

LarryF 2009-02-25 20:26:37

Answer 2

A:

You could be trashing your own stack by accidentally misusing a pointer somewhere else - if you can find a repro machine, try running your app under Application Verifier with all the memory checks turned on

Paul Betts 2009-02-25 01:47:48

Answer 3

+1 A:

Did you look at the implementation of _write() in the CRT (C runtime) source that was installed with Visual Studio (C:\Program Files\Microsoft Visual Studio 8\VC\crt\src\write.c)?

There are at least two conditions that cause _write() to set errno to EINVAL:

buffer is NULL, as you mentioned.
count parameter is odd when the file is opened in text mode in UTF-16 format (or UTF-8? the comments don't match the code). Is this a text or binary file? If it's text, does it have a byte order mark?
Perhaps another function that _write() calls also sets errno to EINVAL?

If you can reliably reproduce this problem, you should be able to narrow down the source of the error by putting breakpoints in the parts of the CRT source that set the error code. It also appears that the debug version of the CRT is capable of asserting when the error occurs, but it might require tweaking some options (I haven't tried it).

bk1e 2009-02-25 05:52:31

Its a binary file. I'll take a look at write.c and see if anything seems obvious.

Matt Jordan 2009-02-25 13:00:37

Alas - nothing jumped out. Since its binary, the UTF-16 error shouldn't apply. The only thing I can think of is that WriteFile itself is setting the EINVAL flag - which isn't something we'd be able to look at.

Matt Jordan 2009-02-25 19:12:04

Answer 4

A:

I've got a similiar problem, which would probably confirm that the buffer parameter as such is fine.

My problem is, that fflush returns an error and sets EINVAL in errno.

Now the likely cluprit is an fflush onto a Samba provided network drive, sometimes (but not reproducable), fflush (which most certainly does call _write internally), does an EINVAL.

Assuming that the stdio MS implementation has no grave bugs, this should not happen. (Any other errors, I would understand, but how can fflush set EINVAL? I do not give it any parameters, and all data is managed internally by libc, so any EINVAL would be a bug outside my code.)

Andreas

Andreas Kostyrka 2009-03-18 15:33:55

We never solved this, and more or less became OBE. However, we did run a lot of tests, and found that our problem only occurred on slower drives, e.g., our problem didn't occur on RAID 0, but did on RAID 1, on slow hard drives, etc. When we limited the amount of data to 32 MB, it also solved it.

Matt Jordan 2009-03-18 15:55:43

Potentially, you may want to try and limit the amount of data you send in a single shot to less than 64 MB (32 always worked, 64+ would usually work but fail in some circumstances) - or barring that, go for a HW solution that increases the write speed.

Matt Jordan 2009-03-18 15:56:40

Answer 5

+3 A:

Hi,

We had a very similar problem which we managed to reproduce quite easily. We first compiled the following program:

#include <stdlib.h>
#include <stdio.h>
#include <io.h>
#include <sys/stat.h>
#include <fcntl.h>

int main(int argc, char *argv[])
{ int len = 70000000;
  int handle= creat(argv[1], S_IWRITE | S_IREAD);
  setmode (handle, _O_BINARY);
  void *buf = malloc(len);
  int byteswritten = write(handle, buf, len);
  if (byteswritten == len)
    printf("Write successful.\n");
  else
    printf("Write failed.\n");
  close(handle);
  return 0;
}

Now, let's say you are working on the computer mycomputer and that C:\inbox maps to a shared folder \\mycomputer\inbox. Then the observe the following effect:

C:\>a.exe C:\inbox\x
Write successful.

C:\>a.exe \\mycomputer\inbox\x
Write failed.

Note that if len is changed to 60000000, there is no problem...

Based on this web page support.microsoft.com/kb/899149, we think it is a "limitation of the operating system" (the same effect has been observed with fwrite). Our work around is to try to cut the write in 63 MB pieces if it fails. This problem has apparently been corrected on Windows Vista.

I hope this helps! Simon

2009-05-29 13:01:09

Definitely did - or at least would have, if the program hadn't been OBE'd by other, economic-related factors. Definitely appears to have been the same problem. Thanks!

Matt Jordan 2009-06-07 21:17:03

ansaurus

tags:

views:

answers:

Machine dependent _write failures with EINVAL error code

related questions