[PLUG-TALK] Google Search Grumbles

Dwight Hubbard dwight_hubbard at yahoo.com
Mon Jan 30 10:15:20 PST 2012


Have you looked at YQL (http://developer.yahoo.com/yql/)? It provides a SQL-like interface for web scraping.


So you can run a YQL query like:
select * from search.web where query="linux cifs setup"


And the result is a parsable XML document.
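
As a rough sketch of how that query might be sent and the XML picked apart from Python: the endpoint URL below is the public YQL endpoint as I understand it, and the sample response (its <result>, <title> and <url> elements) is made up for illustration, so check the YQL docs for the real layout before relying on any of this. The helper names are hypothetical too.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Assumption: the public YQL endpoint; verify against the YQL documentation.
YQL_ENDPOINT = "http://query.yahooapis.com/v1/public/yql"


def build_yql_url(query, fmt="xml"):
    """Return a GET URL carrying the YQL statement in the 'q' parameter."""
    return YQL_ENDPOINT + "?" + urlencode({"q": query, "format": fmt})


def extract_results(xml_text):
    """Return one {tag: text} dict per child element of <results>."""
    root = ET.fromstring(xml_text)
    results = root.find("results")
    if results is None:
        return []
    return [{field.tag: field.text for field in item} for item in results]


# Hypothetical response body, only here to exercise the parser offline.
SAMPLE = """<query><results>
  <result><title>CIFS howto</title><url>http://example.com/cifs</url></result>
</results></query>"""

if __name__ == "__main__":
    print(build_yql_url('select * from search.web where query="linux cifs setup"'))
    print(extract_results(SAMPLE))
```

Fetching the URL with urllib.request.urlopen() and passing the body to extract_results() would be the obvious next step; it is left out here so the sketch runs without network access.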

(Note: I work for Yahoo, so I may be a bit biased, but I still think YQL is cool :-) )


>________________________________
> From: Rich Shepard <rshepard at appl-ecosys.com>
>To: plug-talk at pdxlinux.org 
>Sent: Saturday, January 28, 2012 12:18 PM
>Subject: [PLUG-TALK] Google Search Grumbles
> 
>   Others have vented here about the poor quality of search results using
>Google. My frustration with it is requesting linux-specific results and
>having most hits for Windows, even when I specify '- Windows' in the search
>term.
>
>   I'm trying now to find a tool that will export .dbf table data in ASCII
>format so I can modify the data prior to import into postgres and R. Even
>some bold-text headlines that say 'linux' have only Windows content on the
>referenced page.
>
>   Off to find another search engine.
>
>Rich
>_______________________________________________
>PLUG-talk mailing list
>PLUG-talk at lists.pdxlinux.org
>http://lists.pdxlinux.org/mailman/listinfo/plug-talk
>
>
>