ansaurus

Question

Answer 1

+2 A:

You can use the `is` operator.

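a = 'aaaaa'
b = 'bbbbb'

print(a is b)  # False: two different objects
a = b
print(a is b)  # True: both names now refer to the same object

c = a[:]
print(c is a)  # True in CPython: a full slice of a string returns the same object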

This works because `a is b` if and only if `id(a) == id(b)`. In CPython at least, `id(foo)` is just the memory address at which `foo` is stored. Hence if `foo is bar`, then `foo` and `bar` are literally the same object. It's interesting to note that

a = 'aaaaa'
b = 'aaaaa'
a is b

is `True`. This is because CPython interns some strings (including short literals like this one) so that it doesn't waste memory storing the same string twice and, more importantly, can compare interned strings by comparing fixed-length pointers in the C implementation instead of comparing the strings byte by byte.
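As a rough illustration of the `is` / `id()` relationship described above, here is a minimal sketch in Python 3 syntax. The variable names are only for illustration, and `c` is built with `"".join` so that it is assembled at run time instead of being written as a literal.

```python
a = "aaaaa"
b = a                        # a second name for the same object
c = "".join(["aa", "aaa"])   # an equal string assembled at run time

for left, right in [(a, b), (a, c)]:
    same_object = left is right
    same_id = id(left) == id(right)
    # For objects that are alive at the same time, these two checks always agree.
    print(left == right, same_object, same_id)
```

For the pair `(a, b)` every check is `True`. For `(a, c)` the contents are equal, but whether the two names share one object is up to the implementation.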

aaronasterling 2010-08-12 08:31:02

`c is a` gives me `True` (I was expecting `False`: `c` is a copy of `a`, stored elsewhere, so `c` and `a` should not be the same object).

Vaibhav Bajpai 2010-08-12 08:39:32

@Vaibhav Bajpai CPython returns the same object for `a[:]`. Because it is the same immutable string, there is no point in storing two copies of it.

aaronasterling 2010-08-12 08:41:14

@Vaibhav Bajpai `a = [1, 2, 3]; b = a[:]` creates a shallow copy of `a` because lists are mutable, so it is necessary to keep track of changes that might be made to `b`. Strings are not mutable.

aaronasterling 2010-08-12 08:44:09

@aaronasterling this completely clears my doubt, thanks!

Vaibhav Bajpai 2010-08-12 08:46:38
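The list-versus-string difference discussed in the comments above can be sketched as follows (Python 3 syntax; the names `nums`, `nums_copy`, `text` and `same` are made up for this example):

```python
nums = [1, 2, 3]
nums_copy = nums[:]        # slicing a list builds a new list
nums_copy.append(4)

print(nums)                # [1, 2, 3]: the original is untouched
print(nums_copy)           # [1, 2, 3, 4]
print(nums_copy is nums)   # False

text = "aaaaa"
same = text[:]             # strings are immutable, so no copy is needed
print(same == text)        # True
print(same is text)        # implementation detail: CPython returns the same object
```

Because a list can change after it is copied, the slice has to be a separate object. A string cannot change, so an implementation is free to hand back the original.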

Answer 2

+1 A:

It's still possible to have the same string stored in more than one place - see this example.

>>> a = "aaaaa"
>>> b = a[:]
>>> b is a
True
>>> b = a[:-1] + a[-1]
>>> b is a
False
>>> b == a
True

gnibbler 2010-08-12 11:02:26

It's good to know that. I've often wondered about just always checking strings with `is`. Do you know why Python does that in this case?

aaronasterling 2010-08-12 11:09:41

@aaronasterling, string literals should always be safe. Seems that constructed strings are not tested though. I don't know why it's done that way.

gnibbler 2010-08-12 12:00:50

It is definitely *not* safe to assume that identical string literals are stored as the same string. Python makes no guarantees at all in this case. The current CPython implementations happen to optimise string literals in the same compilation unit, and string literals that look like identifiers are interned, but these are both just details of the implementation.

Duncan 2010-08-12 13:04:30

@Duncan, indeed. The compiler in CPython matches up occurrences of the same literal string and can use the same piece of memory for them (as they are immutable). As far as I can tell, though, the language spec does not require this to happen.

gnibbler 2010-08-12 20:48:04
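Since the comments above conclude that identity of equal strings is not guaranteed, a cautious approach is to compare contents with `==`, and to intern explicitly when identity really is wanted. This is only a sketch in Python 3 syntax, where interning is exposed as `sys.intern`; the helper `build` is hypothetical and exists just to assemble strings at run time.

```python
import sys


def build(parts):
    """Hypothetical helper: assemble a string at run time."""
    return "".join(parts)


x = build(["aa", "aaa"])
y = build(["aaa", "aa"])

print(x == y)                          # True: contents are equal
print(sys.intern(x) is sys.intern(y))  # True: interning yields one shared object
```

Whether `x is y` holds without the `sys.intern` calls depends on the implementation, so it is not something to rely on.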

b = a vs b = a[:] in strings|lists
