<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss'><id>tag:blogger.com,1999:blog-3474847450266445164</id><updated>2009-10-17T17:27:42.131-07:00</updated><title type='text'>Hacking with Google data and services</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://googlehack.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3474847450266445164/posts/default'/><link rel='alternate' type='text/html' href='http://googlehack.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>GAL</name><uri>http://www.blogger.com/profile/13875312689720417550</uri><email>noreply@blogger.com</email></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>2</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3474847450266445164.post-2467384080805613481</id><published>2007-01-01T04:12:00.000-08:00</published><updated>2007-01-01T04:18:26.809-08:00</updated><title type='text'>02. Clean a Snapshot of Google in Time</title><content type='html'>&lt;span class="docEmphBold"&gt;Google Zeitgeist provides a weekly,  monthly, and yearly ove&lt;/span&gt;&lt;span class="docEmphBold"&gt;rview of what the Web was interested in.&lt;/span&gt;&lt;p class="docText"&gt; &lt;/p&gt; &lt;p class="docText"&gt;Turning to Google itself for a definition of &lt;span class="docEmphasis"&gt;zeitgeist&lt;/span&gt; (&lt;tt&gt;define:zeitgeist&lt;/tt&gt;), there's  consensus that it refers to "the spirit of the times." And &lt;a name="ID-ID-d771e0-993777"&gt;&lt;/a&gt;Google Zeitgeist (&lt;a class="docLink" href="http://www.google.com/press/zeitgeist.html" target="_blank"&gt;http://www.google.com/press/zeitgeist.html&lt;/a&gt;) is just that: a  mirror that the Web (according to Google) holds up to us, providing a snapshot  of the week, month, or year that was.&lt;a name="ID-ID-d774e0-993779"&gt;&lt;/a&gt; &lt;/p&gt; &lt;p class="docText"&gt;A typical weekly Google Zeitgeist, shown in &lt;span class="docLink"&gt;Figure 1-8&lt;/span&gt;, lists the top 15 gaining  queries.&lt;/p&gt;&lt;a name="ID-47156:FigureLabel:Figure_1-8"&gt;&lt;/a&gt; &lt;p&gt; &lt;/p&gt;&lt;center&gt; &lt;h5 class="docFigureTitle"&gt;Figure 1-8. The week's top 15 gaining queries&lt;/h5&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_oi6mainhcZA/RZj7F-5EjCI/AAAAAAAAAAk/5Fz3PDlsS9g/s1600-h/test.JPG"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://2.bp.blogspot.com/_oi6mainhcZA/RZj7F-5EjCI/AAAAAAAAAAk/5Fz3PDlsS9g/s320/test.JPG" alt="" id="BLOGGER_PHOTO_ID_5015034265277336610" border="0" /&gt;&lt;/a&gt; &lt;/center&gt;&lt;br /&gt;&lt;p class="docText"&gt;It takes only a few moments of visiting Google Zeitgeist before  you're itching to go back a little further in time: the week your second child  was born, the month during which the Olympics were held, the year you graduated  from high school. Click the Archive link to choose any year from the Google  Zeitgeist Archive and display links such as those shown in &lt;span class="docLink"&gt;Figure 1-9&lt;/span&gt; for every week, month,  and year since January 2001.&lt;/p&gt; &lt;p&gt; &lt;table align="center" bg border="0" cellpadding="1" cellspacing="0" width="90%" style="color:black;"&gt; &lt;tbody&gt; &lt;tr&gt; &lt;td&gt; &lt;table bg border="0" cellpadding="6" cellspacing="0" width="100%" style="color:white;"&gt; &lt;tbody&gt; &lt;tr&gt; &lt;td valign="top" width="60"&gt;&lt;span style="font-weight: bold; color: rgb(255, 0, 0);font-size:180%;" &gt;!&lt;/span&gt;&lt;br /&gt;&lt;/td&gt; &lt;td valign="top"&gt; &lt;p class="docText"&gt;&lt;a name="ID-ID-d777e0-1002538"&gt;&lt;/a&gt;Weekly Zeitgeist updates  actually started in June 2001, at the same time the monthlies switched from PDF  to HTML format. In August 2005, Google stopped listing declining queries and  started listing 5 more of the top gaining queries, bringing the total to  15.&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;/p&gt;&lt;br /&gt;&lt;a name="ID-92486:FigureLabel:Figure_1-9"&gt;&lt;/a&gt; &lt;p&gt; &lt;/p&gt;&lt;center&gt; &lt;h5 class="docFigureTitle"&gt;Figure 1-9. The Zeitgest Archive pages, displaying  weekly, monthly, and year-end reports dating back to 2001&lt;/h5&gt;&lt;h5 class="docFigureTitle"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_oi6mainhcZA/RZj72-5EjEI/AAAAAAAAAA0/WKX9-qmXjmk/s1600-h/test.JPG"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://2.bp.blogspot.com/_oi6mainhcZA/RZj72-5EjEI/AAAAAAAAAA0/WKX9-qmXjmk/s320/test.JPG" alt="" id="BLOGGER_PHOTO_ID_5015035107090926658" border="0" /&gt;&lt;/a&gt;&lt;/h5&gt; &lt;/center&gt;&lt;br /&gt;&lt;p class="docText"&gt;Monthly reports provide some information about Google News  queries and Google Image Search queries, and you can find monthly reports for  countries around the world by clicking the Zeitgeist Around the World link on  the front page. Year-end reports provide even more detail with trend graphs.&lt;/p&gt; &lt;p class="docText"&gt;While Google Zeitgeist's statistics aren't earth-shattering  (e.g., searches for &lt;tt&gt;iraq&lt;/tt&gt; more than doubled on March 19, 2003, the date  that Operation Iraqi Freedom beganimagine that!), it does provide a snapshot of  what the world in aggregate found interesting enough to look up.&lt;/p&gt;&lt;a name="ID-95486:HeadB:See_Also"&gt;&lt;/a&gt; &lt;h4 class="docSection2Title" id="title-IDAXBUJ"&gt;See Also&lt;/h4&gt; &lt;ul&gt;&lt;li&gt; &lt;p class="docList"&gt;If Google Zeitgeist piques your interest, you might also try  the &lt;a name="ID-ID-d780e0-993805"&gt;&lt;/a&gt;Yahoo! Buzz Index (&lt;a class="docLink" href="http://buzz.yahoo.com/" target="_blank"&gt;http://buzz.yahoo.com&lt;/a&gt;), a similar  collection of statistics around popular Yahoo! Searches: the day's top movers  (overall and by various Yahoo! categories), most viewed and emailed Yahoo! news  items, and a market trendlike chart (click the View Complete Chart... link  associated with any of the buzz listings on the front page) of leaders and  movers, according to &lt;span class="docEmphasis"&gt;&lt;a name="ID-ID-d783e0-1002539"&gt;&lt;/a&gt;buzz score&lt;/span&gt; (&lt;a class="docLink" href="http://help.yahoo.com/help/us/buzz/#buzz-04" target="_blank"&gt;http://help.yahoo.com/help/us/buzz/#buzz-04&lt;/a&gt;).&lt;/p&gt; &lt;/li&gt;&lt;li&gt; &lt;p class="docList"&gt;Google Trends (&lt;a class="docLink" href="http://www.google.com/trends" target="_blank"&gt;http://www.google.com/trends&lt;/a&gt;) is a new product from the Google  Labs that graphs the mentions of words of phrases over time. Type in two words  separated by commas to get a quick visual sense of the popularity. For example,  "Google, Yahoo" shows you which search engine is mentioned more across time,  regions, news stories, and languages.&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3474847450266445164-2467384080805613481?l=googlehack.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://googlehack.blogspot.com/feeds/2467384080805613481/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3474847450266445164&amp;postID=2467384080805613481' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3474847450266445164/posts/default/2467384080805613481'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3474847450266445164/posts/default/2467384080805613481'/><link rel='alternate' type='text/html' href='http://googlehack.blogspot.com/2007/01/02-clean-snapshot-of-google-in-time.html' title='02. Clean a Snapshot of Google in Time'/><author><name>GAL</name><uri>http://www.blogger.com/profile/13875312689720417550</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='13057539991190661575'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_oi6mainhcZA/RZj7F-5EjCI/AAAAAAAAAAk/5Fz3PDlsS9g/s72-c/test.JPG' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3474847450266445164.post-4206459203225184321</id><published>2007-01-01T02:35:00.000-08:00</published><updated>2007-01-01T07:01:22.610-08:00</updated><title type='text'>01. Browse the Google Directory</title><content type='html'>&lt;p class="docText"&gt;Google's&lt;a name="ID-ID-d747e0-993713"&gt;&lt;/a&gt; Web Search&lt;a name="ID-ID-d750e0-993715"&gt;&lt;/a&gt; indexes billions of pages, which means it isn't  suitable for all searches. When you have a search that you can't narrow downfor  example, if you're looking for information on a person about whom you know  nothingbillions of pages will get very frustrating very quickly.&lt;/p&gt; &lt;p class="docText"&gt;But you don't have to limit your searches to the Web. Google  also has a searchable subject index, the Google Directory, at &lt;a class="docLink" href="http://directory.google.com/" target="_blank"&gt;http://directory.google.com&lt;/a&gt;. Instead of indexing the entirety  of billions of pages, the directory describes sites instead, indexing about five  million URLs. This makes it a much better search for general topics.&lt;/p&gt; &lt;p class="docText"&gt;Does Google spend time building a searchable subject index in  addition to a full-text index? No, Google bases its directory on the &lt;a name="ID-ID-d753e0-993721"&gt;&lt;/a&gt;Open Directory Project data at &lt;a class="docLink" href="http://dmoz.org/" target="_blank"&gt;http://dmoz.org/&lt;/a&gt;. Unlike the results  at the standard Google Web Search, the collection of URLs at the Open Directory  Project is gathered and maintained by a group of human volunteers rather than  automatic algorithms, but Google does add some of its own Googlish magic to  it.&lt;/p&gt; &lt;p class="docText"&gt;As you can see in &lt;span class="docLink"&gt;Figure 1-6&lt;/span&gt;, the front of the site is  organized into several topics. To find what you're looking for, you can either  do a keyword search or &lt;span class="docEmphasis"&gt;drill down&lt;/span&gt; through the  hierarchies of subjects.&lt;/p&gt;&lt;a name="ID-54570:FigureLabel:Figure_1-6"&gt;&lt;/a&gt; &lt;p&gt; &lt;/p&gt;&lt;center&gt; &lt;h5 class="docFigureTitle"&gt;Figure 1-6. The Google Directory&lt;/h5&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_oi6mainhcZA/RZjkvu5EjAI/AAAAAAAAAAM/wxQDSF3oydg/s1600-h/test.JPG"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://1.bp.blogspot.com/_oi6mainhcZA/RZjkvu5EjAI/AAAAAAAAAAM/wxQDSF3oydg/s320/test.JPG" alt="" id="BLOGGER_PHOTO_ID_5015009693769436162" border="0" /&gt;&lt;/a&gt; &lt;/center&gt;&lt;br /&gt;&lt;p class="docText"&gt;Beside most listings, as shown in &lt;span class="docLink"&gt;Figure 1-7&lt;/span&gt;, you'll see a green bar.  The green bar is an approximate indicator of the site's PageRank in the Google  search engine. (Not every listing in the Google Directory has a corresponding  PageRank in the Google web index.) Web sites are listed in the default order of  Google &lt;a name="ID-ID-d756e0-1002535"&gt;&lt;/a&gt;PageRank, but you also have the option  to list them in alphabetical order.&lt;/p&gt;&lt;a name="ID-44646:FigureLabel:Figure_1-7"&gt;&lt;/a&gt; &lt;p&gt; &lt;/p&gt;&lt;center&gt; &lt;h5 class="docFigureTitle"&gt;Figure 1-7. Individual listings under Science Physics  Quantum Mechanics People Feynman, Richard&lt;/h5&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_oi6mainhcZA/RZjlAO5EjBI/AAAAAAAAAAU/0TxWC-evF7E/s1600-h/test.JPG"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://3.bp.blogspot.com/_oi6mainhcZA/RZjlAO5EjBI/AAAAAAAAAAU/0TxWC-evF7E/s320/test.JPG" alt="" id="BLOGGER_PHOTO_ID_5015009977237277714" border="0" /&gt;&lt;/a&gt; &lt;/center&gt;&lt;br /&gt;&lt;p class="docText"&gt;One thing you'll notice about the Google Directory is how the  annotations and other information vary between categories. This is because the  information in the directory is maintained by a small army of thousands of  volunteers who are each responsible for one or more categories. For the most  part, annotation is pretty good.&lt;/p&gt;&lt;a name="ID-82486:HeadB:Searching_Versus_Browsing"&gt;&lt;/a&gt; &lt;h4 class="docSection2Title" id="title-IDARRWRG"&gt;Searching Versus Browsing&lt;/h4&gt; &lt;p class="docText"&gt;There are two different kinds of shoppers, and they illustrate  the difference between &lt;span class="docEmphasis"&gt;&lt;a name="ID-ID-d759e0-1002536"&gt;&lt;/a&gt;searching&lt;/span&gt; and &lt;span class="docEmphasis"&gt;browsing&lt;/span&gt;. Some shoppers know exactly what they're  after, and they want to find a store with the item, locate the item, and  purchase it as quickly as possible. As with a web search, it helps to know a bit  about what you're looking for if this is your style. &lt;/p&gt; &lt;p class="docText"&gt;Other shoppers want to explore a particular store, see what the  store offers, and choose an item if the right one comes along. This style of  browsing is suited for people who want to get a larger survey of items in a  particular category before they necessarily know what they're looking for. &lt;/p&gt; &lt;p class="docText"&gt;If you were interested in looking at sites about child  psychology, you might try a search at &lt;a class="docLink" href="http://search.google.com/" target="_blank"&gt;http://search.google.com&lt;/a&gt; with  the query &lt;tt&gt;child&lt;/tt&gt; &lt;tt&gt;psychology&lt;/tt&gt;. You would find thousands of sites  in the search results, along with news articles about child psychology, college  papers about the topic, and even pages that mention the terms &lt;span class="docEmphasis"&gt;child&lt;/span&gt; and &lt;span class="docEmphasis"&gt;psychology  &lt;/span&gt;without relating to the topic. But browsing the Child Psychology category  in the Google Directory (&lt;a class="docLink" href="http://directory.google.com/Top/Science/Social_Sciences/Psychology/Child_Psychology/" target="_blank"&gt;http://directory.google.com/Top/Science/Social_Sciences/Psychology/Child_Psychology/&lt;/a&gt;)  gives you hundreds of links selected by Open Directory volunteers as being  relevant to the topic. &lt;/p&gt; &lt;p class="docText"&gt;There are still times when you need to search the directory,  and Google has provided a couple ways to accomplish this.&lt;/p&gt;&lt;a name="ID-30006:HeadB:Searching_the_Google_Directory"&gt;&lt;/a&gt; &lt;h4 class="docSection2Title" id="title-IDA2SWRG"&gt;Searching the Google Directory&lt;/h4&gt; &lt;p class="docText"&gt;Because the &lt;a name="ID-ID-d762e0-1002537"&gt;&lt;/a&gt;Google Directory  is a far smaller collection of URLs, ideal for more general searching, it does  not have the various complicated special syntaxes for searching that the Web  Search does. However, there are a couple of special syntaxes that you should  know about:&lt;/p&gt; &lt;dl class="docList"&gt;&lt;dt&gt;&lt;br /&gt;&lt;p&gt;&lt;span class="docPubcolor"&gt;&lt;span class="docPubcolor"&gt;&lt;span class="docMonofont"&gt;intitle:&lt;/span&gt;&lt;/span&gt; &lt;/span&gt;&lt;/p&gt; &lt;/dt&gt;&lt;dd&gt; &lt;p class="docList"&gt;Just like the Google web special syntax, &lt;tt&gt;intitle:&lt;/tt&gt;  restricts the query word search to the title of a page.&lt;/p&gt; &lt;/dd&gt;&lt;dt&gt;&lt;br /&gt;&lt;p&gt;&lt;span class="docPubcolor"&gt;&lt;span class="docPubcolor"&gt;&lt;span class="docMonofont"&gt;inurl:&lt;/span&gt;&lt;/span&gt; &lt;/span&gt;&lt;/p&gt; &lt;/dt&gt;&lt;dd&gt; &lt;p class="docList"&gt;&lt;tt&gt;inurl:&lt;/tt&gt; restricts the query word search to the URL of a  page.&lt;/p&gt;&lt;/dd&gt;&lt;/dl&gt; &lt;p class="docText"&gt;When you're searching on Google's web index, your overwhelming  concern is probably how to reduce your list of search results to something  manageable. With that in mind, you might start with the narrowest possible  search.&lt;/p&gt; &lt;p class="docText"&gt;That's a reasonable strategy for the web index, but because you  have a narrower pool of sites in the Google Directory, you want that search to  be more general.&lt;/p&gt; &lt;p class="docText"&gt;For example, say you were looking for information on author P.  G. Wodehouse. A simple search on &lt;tt&gt;P.&lt;/tt&gt; &lt;tt&gt;G.&lt;/tt&gt; &lt;tt&gt;Wodehouse&lt;/tt&gt; in  Google's web index gets you over 1,100,000 results, possibly compelling you to  immediately narrow your search. But doing the same search in the Google  Directory returns only 176 results. You might consider that a manageable number  of results, or you might want to carefully narrow your results further.&lt;/p&gt; &lt;p class="docText"&gt;The Directory is also good for searching for events. A Google  web search for &lt;tt&gt;Korean&lt;/tt&gt; &lt;tt&gt;War&lt;/tt&gt; will find over 24 million results,  while searching the Google Directory will find just over 138,000. This is a case  where you will probably need to narrow your search. Use general words indicating  what kind of information you want&lt;tt&gt;timeline&lt;/tt&gt;, for example, or  &lt;tt&gt;archives&lt;/tt&gt;, or &lt;tt&gt;lesson&lt;/tt&gt; &lt;tt&gt;plans&lt;/tt&gt;. Don't narrow your search  with names or locations; that's not the best way to use the &lt;a name="ID-ID-d765e0-1002615"&gt;&lt;/a&gt;Google Directory. &lt;a name="ID-ID-d768e0-1002614"&gt;&lt;/a&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3474847450266445164-4206459203225184321?l=googlehack.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://googlehack.blogspot.com/feeds/4206459203225184321/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3474847450266445164&amp;postID=4206459203225184321' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3474847450266445164/posts/default/4206459203225184321'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3474847450266445164/posts/default/4206459203225184321'/><link rel='alternate' type='text/html' href='http://googlehack.blogspot.com/2007/01/01-browse-google-directory.html' title='01. Browse the Google Directory'/><author><name>GAL</name><uri>http://www.blogger.com/profile/13875312689720417550</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='13057539991190661575'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_oi6mainhcZA/RZjkvu5EjAI/AAAAAAAAAAM/wxQDSF3oydg/s72-c/test.JPG' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry></feed>