views:

518

answers:

8

Spending my time on high level languages it suddenly occurred to me that I did not know the difference between a Char Array and a String. I think they are the same thing but not sure. Is there a difference? Is it just a Char Array with some abstraction?

+4  A: 

a character array is simply an array of characters

a string is data structure that uses an array of characters

some string representations use a null-terminator (like C), others use a length prefix

Steven A. Lowe
I think "data structure" is misleading. A classic C string has characters placed one after another in memory with the last byte containing a binary zero. Exactly the same as an array with the exception of that last byte.
Jay
Kind of a C-centric answer. There are languages out there where strings and arrays are very different things.
T.E.D.
@jay: is that not a structure? Do bad things happen when part of the structure is removed? A String in most languages *is* a structure even if that's nothing more than an array of chars and either a terminator or other mechanism for specifying the length.
SuperMagic
A string is an abstraction, not a data structure. It is *implemented* in terms of data structures such as arrays or lists, but that is a detail that differs from langauge to language, whereas the abstraction -- a data type that represents a chunk of text -- is largely the same everywhere.
Porculus
I have to say that I prefer Draemon's emphasis on a string as an abstraction. There have been many, many different implementation used over the years.
dmckee
@[Todd Moses]: I have to agree that Draemon's answer is more accurate; suggest it be the accepted answer instead of this one.
Steven A. Lowe
@Steven. Perhaps SO needs a "humility" or "gentleman" badge?
Draemon
+2  A: 

In C these are almost the same, though a string will have an additional null character at the end.

In other languages (Java, C# etc), a string is an object, whereas a character array is an array of ... chars (which are primitive data types).

Normally, strings are implemented with character arrays.

Oded
+10  A: 

A String is an abstraction, but of a sequence of characters. It says nothing of implementation. If you wanted to create a String implementation based on a linked list of characters there's nothing stopping you.

In a language such as C, there is very little difference - just that a c string is a pointer to a null-terminated series of characters at sequential addresses.

In an OOP language, a String will be an object of some String class. This will probably hold the data in a character array internally, but you don't need to know that. A character array can only be a simple array, but a String class can provide many operations (substrings, regex, etc) on strings if the implementer decides to.

Draemon
Haskell is an example of a language where the native string implementation is based on a linked list of characters.
Porculus
@Porculus thanks for the example
Draemon
My main problem with this answer, is that there are languages where that isn't true. For instance, in Fortran "string" is a proper type, totally separate from array types. It is no more of an "abstraction" there than "array" is.
T.E.D.
+1  A: 

String is a class in java. So it has attributes e.g. length. So when you ask for the size of string it simply returns that instead of computing the value each time. It also other methods e.g. indexOf, substring, etc to make life easy so you don't have to do that yourself.

fastcodejava
length() is a method not an attribute.
Draemon
@Draemon - +1 for mentioning it is a method, but internally it just returns an attribute.
fastcodejava
+1  A: 

The answer to some extent depends on what language your talking about. In the .Net/C# world strings are immutable objects, whereas a char array you can add/change values easily in the array. Strings can be treated as char arrays in a read-only fashion, as you can iterate over the characters in a string.

In the abstract I think the biggest difference is in how you want to work with them. Are you wanting to work with a chunk of text, say to show a message to an end user, or are you looking at a sequence of characters, doing some processing on the list? It's all rather subjective at a certain level.

ckramer
A: 

A C-style string is internally represented by array of character with the '\0' at end, which indiciates the end of string.

In C++, there's a string container class defined in string.h which provides some typical string operations to manipulate the string.

Dave18
The C++ class is in `string` , not `string.h` , and while it is a container, it is not a container for a C-style string (C++ strings are not null-terminated).
Porculus
+4  A: 

I used to teach programming, and this is how I used to explain this particular issue.

First, focus on what both things have in common: both a char array and a string consist of a sequence of characters. Being a sequence implies that the characters are ordered and that they can be enumerated, for example.

Now focus on what each of the two things add, in their particular different ways, to this common ground.

A char array adds what any array is known to add: indexing and random access to individual items.

A string, on the other hand, adds the fact that the sequence of chars is seen as a whole thing with its own properties. In some implementations, achieving this means altering the way the chars are stored (adding a terminating null in C strings, for example).

This approach (look at the commonalities, then at how things diverge from them) has proven useful in a variety of situations.

Hope this helps.

CesarGon
Exactly: properties of the abstraction versus properties of the implementation.
dmckee
A: 

It depends on the language. In C-ish languages, they are pretty much synonomous. You could claim the difference is that "strings" have an implicit terminating nul, but that would be splitting hairs.

Fortran is the other extreme. There character arrays and character strings are entirely different types, with different operations available for them.

T.E.D.