It depends. First, let's lay a little groundwork on what GoogleShare is.
Consider your searches
"Tom Cruise" + "Scientology"
"John Travolta" + "Scientology"
What you're computing when you compute the GoogleShare here is the percentage of searches for "Scientology"
that also contain "Tom Cruise"
versus the percentage of searches for "Scientology"
that also contain "John Travolta"
. So the way to compute this is as follows:
Google search for "Scientology": 4,730,000 hits
Compare to:
Google search for "Tom Cruise" and "Scientology": 825,000 hits
Google search for "John Travolta" and "Scientology": 340,000 hits
Therefore, the "Tom Cruise"
GoogleShare of "Scientology"
is 17.44%. The "John Travolta"
GoogleShare of "Scientology"
is 7.18%. We say that relative to "Tom Cruise"
is more connected to "Scientology"
than "John Travolta"
is connected to "Scientology"
. Thus I note that your initial calculations of the GoogleShare of "Tom Cruise"
versus the GoogleShare of "John Travolta"
in "Scientology"
were incorrect. The key is figuring out what your base search is (here it is "Scientology"
) and what the terms are that you want to see what their share of this space is (here it is "Tom Cruise"
versus "John Travolta"
).
Now Consider a search
"Scientology" + "Tom Cruise" + keyword
and
"Scientology" + "John Travolta" + keyword.
There are two ways to view this. Are you trying to measure the share of "Tom Cruise"
and "John Travolta"
in the space of ("Scientology"
+ keyword
) or are you trying to measure the share of "Tom Cruise"
+ keyword
in the space of "Scientology"
? These are different.
Google search for "Scientology" + "StackOverflow": 34,300
Google search for "Tom Cruise" and "Scientology" and "StackOverflow": 1,360
Google search for "John Travotla" and "Scientology" and "StackOverflow": 1,660
If you want the share of "Tom Cruise"
and "John Travolta"
in the space of ("Scientology"
+ "StackOverflow"
) you'd compute:
"Tom Cruise": 1360 / 34300 = 3.97%
"John Travolta": 1660 / 34300 = 4.84%
If you want the share of "Tom Cruise" + "StackOverflow"
and "John Travolta" + "StackOverflow"
in the space of "Scientology"
you'd compute:
"Tom Cruise" + "StackOverflow": 1360 / 4730000 = .0029%
"John Travolta" + "StackOverflow" : 1660 / 4730000 = .0035%
You see, it all depends on what your base search is and what the terms are that you are trying find their share of this base term. In the first version our base search is "Scientology" + "StackOverflow"
and we are seeing what share "Tom Cruise"
and "John Travolta"
have of this space. In the second version our base search is "Scientology"
and we are seeing what share "Tom Cruise" + "StackOverflow"
and "John Travolta" + "StackOverflow"
have in this space.