|
#1
|
|||
|
|||
|
Does New Google Patent Validate Sandbox Theory?
Must read
Probably one of the best bits of information released by them in a patent. Large number of inventors listed on here, even Matt Cutts that guy who attends those SE conferences . Explains a bit about what is already known through experience as well as comments made by search engine representatives.Example: Quote:
Last edited by msgraph : 03-31-2005 at 12:02 PM. |
|
#2
|
|||
|
|||
|
Bloody interesting read. Not only does it look like the Sandbox Uncovered, it's got Matt Cutt's name on it!!
|
|
#3
|
|||
|
|||
|
Brilliant find, does explain a few things, but then again not everything. I wonder why we haven't found this sooner. Last sentence in the document is particularly interesting in how it explains the "boost" new sites normally receive after initially getting indexed. You do well for awhile, and then you crash later. There is for a "lag" on scoring for this it seems.
Quote:
__________________
Globalpromoter.com - Search Engine Optimization & Tools |
|
#4
|
|||
|
|||
|
Quote:
__________________
Globalpromoter.com - Search Engine Optimization & Tools |
|
#5
|
|||
|
|||
|
Quote:
I'm particularly interested in this part: Quote:
__________________
Bill Hartzer is an internet marketing consultant in Dallas and has been practicing organic SEO since 1996. |
|
#6
|
|||
|
|||
|
But what does this all mean. its a patent and that doesent mean that this is how the SE works. Correct me if I am wrong but...this dosent mean anything.
|
|
#7
|
|||
|
|||
|
If they've taken the time to patent this, it strongly suggest they are using some of what's described. You're correct that it doesn't explain exactly how the search engine works completely or that any or all of what's described is being used. But part of what are described fit in with things some people have been scratching their heads over -- the new info seems to neatly explain some of what's been seen.
|
|
#8
|
|||
|
|||
|
I see what you mean Danny. Great points. I am so glad I joined this forum filled with great knowledgeble members.
|
|
#9
|
|||
|
|||
|
For what it's worth, I wrote a long paper about network theory which covers some of this (Filthy Linking Rich).
I'm with my dear friend and supporter Dr. Edel Garcia. If we don't start and pay attention to information retrieval science, which Edel is an expert in, and even listen to my own musings on network theory - we're screwed! If we talk about anecdotal nonsense over and over again, just to point clients in the direction of some substance. And then fail miserably over and over again... It's just as I said in New York about "do we need to know how a search engine really works." And this supports the article Edel wrote about keyword density analysis yesterday (I can't find where it is in the forum - help moderator!). This industry is growing - we better learn to grow with it. |
|
#10
|
||||
|
||||
|
Quote:
Matt Cutts at SES NYC told me, we did not become a register to register domain names. He went on to explain exactly what is written above. Why didn't I say anything? Because we all knew it, but now its on paper. |
|
#11
|
|||
|
|||
|
The user behavior parts of the document also fits right in with the purchase of Urchin.
|
|
#12
|
|||
|
|||
|
Check out 114-116 - toolbar data usage for ranking purposes.
|
|
#13
|
|||
|
|||
|
Quote:
|
|
#14
|
|||
|
|||
|
From the DETAILED DESCRIPTION section
Quote:
![]() |
|
#15
|
|||
|
|||
|
So basically they are analyzing register information for how many years you are dedicating to the domain and because they want to see the interconnectivity of all your domains?
Then they are looking at fresh index types of data such as staleness of a page and how many links point to it. Plus if those links stay pointing to it or not? Last edited by Relevancy : 03-31-2005 at 10:26 PM. |
|
#16
|
|||
|
|||
|
Quote:
There is also mention on something I have studied more closely about how in the patent it tries to detemine if the domain has changed owners. It says "[0128] a significant change over time in the set of topics assoicated with a document may indicate that the document has changed owners and previous document indicators, such as score, anchor text, etc..are no longer reliable." I know a few people that have avoided the "sandbox" so to so, by "rehabing" older domains in order to set up new content or pages on the domain. For the most part this works if done without extending the domain to much as much as I have discovered. As it says "Similarly, a spike in the number of topics could indicate spam...that the document has been taken over as a "doorway" document". I don't think this was something before you would consider when rehabing a domain, but it sure will be know. At least for me.
__________________
Globalpromoter.com - Search Engine Optimization & Tools |
|
#17
|
|||
|
|||
|
A little deeper look and what do we find?
Google is also acknowledging the following while describing how it gathers "User Maintained/Generated Data" in its patent filings
Quote:
Last edited by whistleman : 04-01-2005 at 01:00 AM. Reason: corrected URL |
|
#18
|
|||
|
|||
|
I wouldn't worry too much about anything in this patent. Pragmatically, it conflicts it's own aims, and if I were a betting man, Google floated that purely to lay claim to any conceptual-based arguments to be made by competitors going forward.
It really sounds like a bunch of engineers got high on the Tahoe ski trip and whiteboarded their thoughts. |
|
#19
|
|||
|
|||
|
I agree, it does sound like a lot of clutter to mask a few key ideas. Most of it is "ifs" and "in some ways". The basic idea is they seem to be analyzing register information and then they go into how the value fresh vs stale pages.
Last edited by Marcia : 04-01-2005 at 05:12 AM. Reason: Unnecessary URL removed. |
|
#20
|
|||
|
|||
|
It gives me the chills too Whistleman but there are a couple of questions I have regarding this patent, the first of which is why is it not in Googles name or at least assigned to Google?
Secondly the idea of rankings based at least partially on how often a page is accessed from bookmarks reminds me of Alexa, just too easy to manipulate.
__________________
Mel Nelson Expert SEO Dont settle for average SEO Singapore Search Engine Optimization and web design |
|
#21
|
||||
|
||||
|
Quote:
|
|
#22
|
|||
|
|||
|
True Mikkel, but some are harder than others and clicking on a link in favorites is among the easier ones, while adding relevant anchor text links for example is much more difficult.
__________________
Mel Nelson Expert SEO Dont settle for average SEO Singapore Search Engine Optimization and web design |
|
#23
|
||||
|
||||
|
Mel, my point is just that it has been like this forever in SEO - new algoritmic changes happend all the time, we decode it and then manipulate it. Thats the nature of SEO and thats what makes it so much fun
![]() |
|
#24
|
|||
|
|||
|
About the sandbox
I originally posted this in the wrong place and everything, so posting it here instead:
It seems that at first glance people are thinking that the fact that historical data will be taken into consideration automatically means that new sites with no historical data will be included. "The method of claim 2, wherein the generating a score for the document includes scoring the document based, at least in part, on an elapsed time measured from the inception date corresponding to the document. 5. The method of claim 2, wherein the inception date corresponding to the document is based on at least one of a date when a search engine first discovers the document, a date when a search engine first discovers a link to the document, and a date when the document includes at least a predetermined number of pages." This shows that brand new sites are not going to suffer. Based on these conditions its pretty clear that sites are given some time to prove themselves. Also notice the careful wording " at least in part (on an elapsed time measured from the inception date corresponding to the document). " Not everything is based on historical data, the rest of the patent should settle that. Lets not have another "PageRank is everything" rumour! I am not allowed to submit large posts so the lowdown on the patent is at Search Science |
|
#25
|
|||
|
|||
|
I haven't read all of the patent, but from the chunks that I read it looks to me as though they decided to patent the idea of bringing aspects of time into the ranking of documents, and all they've done is sat down and come up with every possible measurement of time, with respect to a document, and included them all in the patent.
I think the best that can said for it is that it shows a way in which they are thinking (were thinking over a year ago), and that one or more aspects of time may be incorporated into the ranking procedures. The whole patent doesn't look like a set of intentions - it just covers all the bases in case they decide to use any of them - which they may already have done. |
|
#26
|
|||
|
|||
|
After reading the patent (but I am afraid not digesting it all completly) there are two things that stand out:
Why is a search engine patent by a group which includes several high level Google employees not registered or assigned to Google? I would characterize it more as a wish list of all the things G might like to do some day so lets patent all the ideas now before someone else does.
__________________
Mel Nelson Expert SEO Dont settle for average SEO Singapore Search Engine Optimization and web design |
|
#27
|
|||
|
|||
|
Quote:
Nothing explains the sandbox phenomenon completely - it's too unpredictable and there's so many features associated with it, but this is some good information, sandbox or no. I should have a comprehensive write-up complete by 12pm PST, 3pm EST. It takes forever to get through this thing. |
|
#28
|
|||
|
|||
|
April fools to all you SEOers!
.. jk but that would be evil |
|
#29
|
||||
|
||||
|
Quote:
|
|
#30
|
|||
|
|||
|
I've said this before but I'll say it again....
What do you know about a domain *before* you even visit it? |
![]() |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
|
|