Spending my time on high level languages it suddenly occurred to me that I did not know the difference between a Char Array and a String. I think they are the same thing but not sure. Is there a difference? Is it just a Char Array with some abstraction?
a character array is simply an array of characters
a string is data structure that uses an array of characters
some string representations use a null-terminator (like C), others use a length prefix
In C these are almost the same, though a string will have an additional null character at the end.
In other languages (Java, C# etc), a string is an object, whereas a character array is an array of ... chars (which are primitive data types).
Normally, strings are implemented with character arrays.
A String is an abstraction, but of a sequence of characters. It says nothing of implementation. If you wanted to create a String implementation based on a linked list of characters there's nothing stopping you.
In a language such as C, there is very little difference - just that a c string is a pointer to a null-terminated series of characters at sequential addresses.
In an OOP language, a String will be an object of some String class. This will probably hold the data in a character array internally, but you don't need to know that. A character array can only be a simple array, but a String class can provide many operations (substrings, regex, etc) on strings if the implementer decides to.
String
is a class
in java. So it has attributes e.g. length
. So when you ask for the size of string it simply returns that instead of computing the value each time. It also other methods e.g. indexOf
, substring
, etc to make life easy so you don't have to do that yourself.
The answer to some extent depends on what language your talking about. In the .Net/C# world strings are immutable objects, whereas a char array you can add/change values easily in the array. Strings can be treated as char arrays in a read-only fashion, as you can iterate over the characters in a string.
In the abstract I think the biggest difference is in how you want to work with them. Are you wanting to work with a chunk of text, say to show a message to an end user, or are you looking at a sequence of characters, doing some processing on the list? It's all rather subjective at a certain level.
A C-style string is internally represented by array of character with the '\0' at end, which indiciates the end of string.
In C++, there's a string container class defined in string.h which provides some typical string operations to manipulate the string.
I used to teach programming, and this is how I used to explain this particular issue.
First, focus on what both things have in common: both a char array and a string consist of a sequence of characters. Being a sequence implies that the characters are ordered and that they can be enumerated, for example.
Now focus on what each of the two things add, in their particular different ways, to this common ground.
A char array adds what any array is known to add: indexing and random access to individual items.
A string, on the other hand, adds the fact that the sequence of chars is seen as a whole thing with its own properties. In some implementations, achieving this means altering the way the chars are stored (adding a terminating null in C strings, for example).
This approach (look at the commonalities, then at how things diverge from them) has proven useful in a variety of situations.
Hope this helps.
It depends on the language. In C-ish languages, they are pretty much synonomous. You could claim the difference is that "strings" have an implicit terminating nul, but that would be splitting hairs.
Fortran is the other extreme. There character arrays and character strings are entirely different types, with different operations available for them.