NAME

snowclone_google.pl - gathers snowclone statistics from google


SYNOPSIS

snowclone_google.pl [options] ``snowclone text''

 Options:
   -h, --help                 prints this help message
   -v, --verbose              increases verbosity level
   -l, --langs      "en"      set language list
   -p, --per        100       set results per query
   -r, --results    100       set total number of results
   -k, --key        ""        google API key
   -c, --caps                 keep capitalization
   -u, --punctuation          keep punctuation
   -s, --safe                 turn on SafeSearch
   -f, --filter               turn on filtering of similar search results
       --proxy                HTTP proxy


OPTIONS

-l, --langs
Set languages. Space seperated. Example: ``en ja'' to request English and Japanese results.

-k, --key
Google API key. You can request one from Google: http://www.google.com/apis/


DESCRIPTION

This program will search using Google for the specified snowclone string, given using Google's globbing format. It will then extract the strings that satisfy the snowclone from the snippets provided by Google and compile statistics on them.

Capitalization and punctuation are, by default, discarded.

The Google API key will be read from the file ~/.googlekey. It is necessary to supply this key either in this file or via the command-line options, as it is required to access the Google API.

The HTTP_PROXY environment variable is honored, but note that the Google API docs warn that many proxies have difficulty with the SOAP API which underlies the Google API.


EXAMPLE

snowclone_google.pl -r 500 ``In space, no one can hear you *''


NOTE

As of 2004-02-15, Google only allows the first 1000 results to be retrieved. Also, the number of results per query is limited to 10.


TODO

Make it possible to have all of the URLs and full snippets listed, seperated by which variant of the snowclone they are. This should probably be HTML format for ease of use.

Actual error handling for rejected keys, overlimit, etc.


BUGS

Reports * as a hit sometimes.

Probably a bunch of others. Let me know if you find any.


COPYRIGHT

Copyright 2004 Michael Leuchtenburg.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.


AUTHOR

Michael Leuchtenburg <michael@slashhome.org>