snowclone_google.pl - gathers snowclone statistics from Google
snowclone_google.pl [options] "snowclone text"
Options:
    -h, --help            print this help message
    -v, --verbose         increase verbosity level
    -l, --langs "en"      set language list
    -p, --per 100         set results per query
    -r, --results 100     set total number of results
    -k, --key ""          Google API key
    -c, --caps            keep capitalization
    -u, --punctuation     keep punctuation
    -s, --safe            turn on SafeSearch
    -f, --filter          turn on filtering of similar search results
        --proxy           HTTP proxy
This program searches Google for the specified snowclone string, which is written in Google's globbing format with * marking the variable part. It then extracts the strings that satisfy the snowclone from the snippets Google returns and compiles statistics on them.
Capitalization and punctuation are, by default, discarded.
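The extraction and counting step described above can be sketched roughly as follows. This is an illustration in Python, not the script's own Perl code. The function names are hypothetical, and the choice to let * stand for exactly one word is an assumption made to keep the sketch short.

```python
import re
import string
from collections import Counter

def normalize(text, keep_caps=False, keep_punct=False):
    """Mimic the documented defaults: drop capitalization and punctuation."""
    if not keep_caps:
        text = text.lower()
    if not keep_punct:
        # Keep '*' so the wildcard in the pattern survives normalization.
        drop = string.punctuation.replace("*", "")
        text = text.translate(str.maketrans("", "", drop))
    # Collapse runs of whitespace left behind by removed punctuation.
    return " ".join(text.split())

def pattern_to_regex(pattern):
    """Turn a glob-style snowclone into a regex (assumption: '*' is one word)."""
    parts = [re.escape(part) for part in pattern.split("*")]
    return re.compile(r"(\w+)".join(parts))

def count_variants(pattern, snippets):
    """Count how often each filler of the wildcard appears in the snippets."""
    regex = pattern_to_regex(normalize(pattern))
    counts = Counter()
    for snippet in snippets:
        for match in regex.finditer(normalize(snippet)):
            counts[" ".join(match.groups())] += 1
    return counts

if __name__ == "__main__":
    snippets = [
        "In space, no one can hear you scream.",
        "in space no one can hear you SCREAM!",
        "Remember: in space, no one can hear you yodel",
    ]
    for variant, n in count_variants("In space, no one can hear you *", snippets).most_common():
        print(n, variant)
```

Normalizing both the pattern and the snippets in the same way is what lets differently capitalized or punctuated snippets count as the same variant.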
The Google API key is read from the file ~/.googlekey. A key is required to access the Google API, so it must be supplied either in this file or with the -k/--key option.
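A key lookup along these lines might look like the sketch below. It is written in Python for illustration and is not taken from the script. The function name load_api_key is hypothetical, and letting the command-line value take precedence over the file is an assumption; the text above does not say which one wins.

```python
from pathlib import Path

def load_api_key(cli_key="", key_file=None):
    """Return the API key from the command line, else from the key file."""
    # Assumption: a key given with -k/--key overrides the file.
    if cli_key:
        return cli_key
    path = Path(key_file) if key_file else Path.home() / ".googlekey"
    try:
        key = path.read_text().strip()
    except OSError:
        # Missing or unreadable file: treat it as "no key found".
        key = ""
    if not key:
        raise SystemExit("no Google API key: use -k/--key or create ~/.googlekey")
    return key
```

Failing early with a clear message avoids sending a request that the API would reject anyway.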
The HTTP_PROXY environment variable is honored, but note that the Google API docs warn that many proxies have difficulty with the SOAP API which underlies the Google API.
snowclone_google.pl -r 500 "In space, no one can hear you *"
As of 2004-02-15, Google allows only the first 1000 results of a search to be retrieved. The number of results per query is also limited to 10.
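To see what these limits mean for a run such as -r 500, the sketch below works out the queries that would be needed. It is a Python illustration only. The function name query_offsets is hypothetical, zero-based start offsets are an assumption, and whether the script clamps its own settings this way is not stated here.

```python
def query_offsets(total=100, per_query=100, max_per_query=10, max_results=1000):
    """Return (start, count) pairs covering the requested results.

    The two caps default to the limits quoted above: at most 10 results
    per query and at most the first 1000 results overall.
    """
    per_query = min(per_query, max_per_query)
    total = min(total, max_results)
    return [
        (start, min(per_query, total - start))
        for start in range(0, total, per_query)
    ]

if __name__ == "__main__":
    plan = query_offsets(total=500)
    print(len(plan), "queries, first:", plan[0], "last:", plan[-1])
```

Under these assumptions, requesting 500 results costs 50 separate queries, and no request can usefully ask for more than 1000 results.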
Make it possible to list all of the URLs and full snippets, separated by which variant of the snowclone they match. This should probably be in HTML format for ease of use.
Add real error handling for rejected keys, overlimit conditions, etc.
Sometimes reports * as a hit.
Probably a bunch of others. Let me know if you find any.
Copyright 2004 Michael Leuchtenburg.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
Michael Leuchtenburg <michael@slashhome.org>