[PLUG] Has google outsmarted the linux text browsers?
Tom
tgrom.automail at nuegia.net
Mon Apr 1 01:11:40 UTC 2019
On Sun, 24 Mar 2019 15:19:51 -0700
Tomas Kuchta <tomas.kuchta.lists at gmail.com> wrote:
> If you check the page source, it explains it all. The link doesn't
> point to the original page, it points to Google, with a reference for
> the actual page. Google then redirects you.
>
> So, you never request the page directly, you are always redirected by
> Google. That way they are in full control of your browsing as well as
> being able to get all the info about your visit through the embedded
> JavaScript on the destination page, when they include it.
>
> I imagine that, if the destination wants the traffic from Google,
> they need to embed their code. Otherwise it would old not show up in
> G searches at the first place.
>
> Tomas
>
> On Sat, Mar 23, 2019, 3:11 PM logical american
> <website.reader3 at gmail.com> wrote:
>
> > Today I was trying 3 different linux text browsers, lynx, w3m and
> > links, but found that all 3 could not fully capture the links in
> > the google search engine page after doing a search query to
> > www.google.com
> >
> > For example Google would give as the header line
> >
> > We have found our results
> >
> > then the URL line:
> >
> > http://www.foundourresults.com/our/data/.../is/here
> >
> > and some explanation below that...
> >
> > But notice the ellipsis ... in the url line. All 3 linux browsers
> > would faithfully record the url line with the ellipsis, and thus
> > the search results were unresolvable when used for a brand new
> > lookup. However google carefully embeds the correct url in the
> > header line which is where the browser actually goes next, after
> > manually mouse clicking on that find.
> >
> > I noticed that Startpage doesn't use ellipsis in the url line, so
> > it is simple to capture the url's using the -dump option to a file
> > and later parse these for further internet lookup and they all work.
> >
> > How would one go about recovering the full urls from the Google
> > search results so that a text browser successfully captures the
> > fully specified URL reference?
> >
> > Randall
> >
> >
> > --
> > *CONFIDENTIAL:*/This email message and/or any attachments is for the
> > sole use of the intended recipient(s) and may contain confidential
> > information. _Any unauthorized review, use, copying, dissemination,
> > disclosure, retention or distribution is strictly prohibited._ If
> > you are not the intended recipient, please contact the sender by
> > reply email and destroy all copies of the original message along
> > with any attachments. This communication (including attachments) is
> > covered by the Electronic Communication Privacy Act, U.S. Code
> > Title 18 §2510-2521./
> > _______________________________________________ PLUG mailing list
> > PLUG at pdxlinux.org
> > http://lists.pdxlinux.org/mailman/listinfo/plug
> >
> _______________________________________________
> PLUG mailing list
> PLUG at pdxlinux.org
> http://lists.pdxlinux.org/mailman/listinfo/plug
just use https://duckduckgo.com/lite or wiby.me if Google is doing
shady stuff the their results
--
________________________________________
/ A halted retreat Is nerve-wracking and \
| dangerous. To retain people as men -- |
\ and maidservants Brings good fortune. /
----------------------------------------
\
\
/\ /\
//\\_//\\ ____
\_ _/ / /
/ * * \ /^^^]
\_\O/_/ [ ]
/ \_ [ /
\ \_ / /
[ [ / \/ _/
_[ [ \ /_/
More information about the PLUG
mailing list