ansaurus

Question

Answer 1

+6 A:

Use the std::string::substr library function.

std::string s = "12345678";
std::cout << s.substr (1) << '\n'; // => 2345678
s = "123456789";
std::cout << s.substr (1) << '\n'; // 23456789

Vijay Mathew 2010-03-12 06:10:11

Thanks for telling me about that function! It fixed the problem! Edit: nvm, I see others' answers :)

wrongusername 2010-03-12 06:14:15

Answer 2

+4 A:

The behavior you're describing would be expected if you copy the characters into the string but forget to tack a null character at the end to terminate the string. Try adding a null character to the end after the loop, and make sure you allocate enough space (one more character) for the null character. Or, better, use the string constructor overload which accepts not just a char * but also a length.

Or, even better std::string::substr -- it will be easier and probably more efficient.

string after(int after, string word) { 
  return word.substr (after);
}

BTW, you don't need an after method, since exactly what you want already exists on the string class.

Now, to answer your specific question about why this only showed up on the 8th and later characters, it's important to understand how "C" strings work. A "C" string is a sequence of bytes which is terminated by a null (0) character. Library functions (like the string constructor you use to copy temp into a string instance which takes a char *) will start reading from the first character (temp[0]) and will keep reading until the end, where "the end" is the first null character, not the size of the memory allocation. For example, if temp is 6 characters long but you fill up all 6 characters, then a library function reading that string to "the end" will read the first 6 characters and then keep going (past the end of the allocated memory!) until it finds a null character or the program crashes (e.g. due to trying to access an invalid memory location).

Sometimes you may get lucky: if temp was 6 characters long and the first byte in memory after the end of your allocation happened to be a zero, then everything would work fine. If however the byte after the end of your allocation happened to be non-zero, then you'd see garbage characters. Although it's not random (often the same bytes will be there every time since they're filled by operations like previous method calls which are consistent from run to run of your program), but if you're accessing uninitialized memory there's no way of knowing what you'll find there. In a bounds checking environment (e.g. Java or C# or C++'s string class), an attempt to read beyond the bounds of an allocation will throw an exception. But "C" strings don't know where their end is, leaving them vulnerable to problems like the one you saw, or more nefarious problems like buffer overflows.

Finally, a logical follow-up question you'd probably ask: why exactly 8 bytes? Since you're trying to access memory that you didn't allocate and didn't initialize, whats in that RAM is what the previous user of that RAM left there. On 32-bit and 64-bit machines, memory is generally allocated in 4- or 8-byte chunks. So it's likely that the previous user of that memory location stored 8 bytes of zeroes there (e.g. one 64-bit integer zero) zeros there. But the next location in memory had something different left there by the previous user. Hence your garbage characters.

Moral of the story: when using "C" strings, be very careful about your null terminators and buffer lengths!

Justin Grant 2010-03-12 06:10:47

Thank you! Could you please explain why it stopped working when it reached 8 chars in length?

wrongusername 2010-03-12 06:21:23

Answer 3

+2 A:

Your string temp is not NULL terminated. You requite temp[a] = '\0'; at the end of loop. Also you need to allocate word.size() - after + 1 chars so as to accomodate the NULL character.

Naveen 2010-03-12 06:14:05

Answer 4

A:

You're not null-terminating your char array. C-style strings (i.e., char arrays) need to have a null character (i.e., '\0') at the end so functions using them know when to stop.

I think this is basically your after() function, modulo some fudging of indexes:

string after(int after, string word) {
  return word.substring(after);
}

Jack 2010-03-12 06:15:08

@Jack the function name is substr, not substring.

Vijay Mathew 2010-03-12 07:00:47

ansaurus

tags:

views:

answers:

Weird problem with string function

related questions