
List:       wget
Subject:    Re: Sym link following
From:       Micah Cowan <micah () cowan ! name>
Date:       2008-01-25 9:05:28
Message-ID: 4799A658.3080903 () cowan ! name

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Wayne Connolly wrote:
> Hi all,
> 
> I love wget - so sweet.
> 
> Anyway, i am testing this on http://staging.measanctum.com as the wget
> dump target with this command:
> 
> wget -m -nH -p -k --retr-symlinks http://measanctum.com
> /srv/www/htdocs/staging.measanctum.com
> 
> My problem is that http://staging.measanctum.com
> <http://staging.measanctum.com/> is missing images and some styles that
> actually reside inside symlinked directories.
> 
> wget doesnt seem to follow into the sym dirs and retrieve the files.

HTTP has no way to distinguish a symlink from a regular path, so
symlinks can't affect how Wget behaves over HTTP. --retr-symlinks is an
FTP option and has no effect on HTTP URLs.

The real source of the problem is that, while Wget knows how
to parse HTML to find links, it doesn't know how to parse CSS. In
particular, it won't find the stylesheets that are only linked via CSS
"@import" directives, or images specified only in CSS "url(...)"
expressions.
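
To make concrete what kind of references are being missed, here is a
rough Python sketch of pulling "@import" and "url(...)" targets out of a
stylesheet. This is only an illustration, not how Wget works or will
work; the function name css_links and the example.com URLs are made up,
and the regular expressions ignore CSS comments and escapes.

```python
import re
from urllib.parse import urljoin

# url(...) with optional matching quotes around the target.
_URL_RE = re.compile(r"""url\(\s*(['"]?)(.*?)\1\s*\)""", re.IGNORECASE)
# @import followed directly by a quoted string (no url() wrapper).
_IMPORT_RE = re.compile(r"""@import\s+(['"])(.*?)\1""", re.IGNORECASE)


def css_links(css_text, base_url):
    """Return absolute URLs referenced by a stylesheet (hypothetical helper).

    base_url is the URL of the stylesheet itself, since relative
    references in CSS resolve against the stylesheet, not the page.
    """
    found = []
    for pattern in (_IMPORT_RE, _URL_RE):
        for match in pattern.finditer(css_text):
            ref = match.group(2).strip()
            # Skip empty targets and inline data: URIs.
            if not ref or ref.startswith("data:"):
                continue
            absolute = urljoin(base_url, ref)
            if absolute not in found:
                found.append(absolute)
    return found


if __name__ == "__main__":
    sample = """
    @import "print.css";
    @import url(base.css);
    body { background: url("../img/bg.png"); }
    """
    for link in css_links(sample, "http://example.com/styles/main.css"):
        print(link)
```

A page whose images appear only in rules like the last one above would
show up without them in a mirror made by a tool that reads HTML alone.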

Unfortunately, there's not really much you can do about that. :\
However, CSS support in Wget is a top priority, and we hope to include
it for the 1.12 release. This may be a 6-months-to-a-year wait. :(

Wish I had better news...

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer...
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFHmaZY7M8hyUobTrERAlGJAJ4sGRFuHKuKJjcSWaLgqFClReJCfgCdGS8c
+rZvc+4iZB58372zSP3gh4o=
=pVqK
-----END PGP SIGNATURE-----