List: wget
Subject: Re: Not detected hyperlink in recursive downloading (wget 1.9.1)
From: nemeth () mokk ! bme ! hu
Date: 2005-04-27 21:41:08
Message-ID: 1114638068.427006f422816 () kelt ! mokk ! bme ! hu
Quoting Hrvoje Niksic <hniksic@xemacs.org>:
Dear Hrvoje,
You are right; now I can't reproduce the bug either. I just realized
that my second download had missed a different file.
When I downloaded the pages, the web server was a little slow and also
made long (>10 s) pauses. Perhaps it reached wget's time limit;
I don't know.
Sorry for disturbing you!
Thank you for your quick response,
Best regards,
László Németh
> nemeth@mokk.bme.hu writes:
>
> > I tried to download the European Constitution in English from
> >
> > http://europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/12004V.html
> >
> > with the following wget command:
> >
> > wget -r -l 2 http://europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/12004V.html
> >
> > In this file the "20. Protocol on the position of Denmark" link is
> > not detected, and the file at
> >
> > http://europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/C2004310EN.01035601.htm
> > was not downloaded. This link works in the Firefox and Links browsers.
>
> That link seems to work for me. For example:
>
> $ wget -rl1 -A C2004310EN.01035601.htm http://europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/12004V.html
> --11:33:48-- http://europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/12004V.html
> --11:33:48-- http://europa.eu.int/robots.txt
> Removing europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/12004V.html since it should be rejected.
> --11:33:49-- http://europa.eu.int/eur-lex/lex/en/treaties/dat/12004V/htm/C2004310EN.01035601.htm
> FINISHED --11:33:49--
> Downloaded: 20,998 bytes in 3 files
>
> I used -A in the example to download only that one link, but the link
> seems to be detected and should be downloaded in any case.
>
> Can you send a debug log of your download? Do you have something in
> .wgetrc that might be affecting Wget's behavior?
>