[prev in list] [next in list] [prev in thread] [next in thread] 

List:       fedora-list
Subject:    Re: downloading a complete web page without using a browser...
From:       D&R <dwoody5654 () gmail ! com>
Date:       2021-07-06 5:24:25
Message-ID: 20210706002425.419d5ab3 () star1 ! home ! com
[Download RAW message or body]

On Sat, 3 Jul 2021 20:25:04 -0700
users@lists.fedoraproject.org wrote:

> On 7/3/21 8:02 PM, dwoody5654@gmail.com wrote:
> > I have been using a shell script called save-page-as.sh to download a
> > complete web page. This has been working as expected.
> > The relevant line the the script is:
> > "${browser}" "-new-window" "${url}" &>/dev/null
> > 
> > I now need the ability to run this program or another program via email to
> > my computer from other locations. I do not have the option to login
> > remotely.
> > 
> > The save-page-as.sh program runs firefox. I have not been able to get this
> > to work using email. env shows DISPLAY=:0.0. I have added each of the
> > Display commands as below:
> > 
> > export DISPLAY:0
> > export DISPLAY:0.0
> > export DISPLAY:0.1
> > 
> > None of those have worked.
> > 
> > the url I am trying to download does not have an extension ie. no '.htm'
> > such as:
> > https://my.acbl.org/club-results/details/338288
> > 
> > wget does not download the correct web page.
> > 
> > Appreciate any pointers to get the save-page-as.sh working using a browser
> > or a different command line program.
> > 
> > David
> 
> Hi David,
> 
> Try this
> 
> 
> $ curl https://my.acbl.org/club-results/details/338288 --output> 
> 
>    % Total    % Received % Xferd  Average Speed   Time    Time     Time 
>   Current
>                                   Dload  Upload   Total   Spent    Left 
>   Speed
> 100  463k    0  463k    0     0   193k      0 --:--:--  0:00:02 --:--:-- 
>   193k
> 
> 
> I opened eraseme.html and the 338288 web page right
> next to each other in Firefox and they look exactly
> the same to me.
> 
There are spacing and alignment differences and apparently other differences.
Also if you then run:

html2txt eraseme.html

or

html2text eraseme.html

it does not display any of the text(content).

> I use curl almost exclusively for download web site.
> wget has its issues.
> 
> HTH,
> -T
> _______________________________________________
> users mailing list -- users@lists.fedoraproject.org
> To unsubscribe send an email to users-leave@lists.fedoraproject.org
> Fedora Code of Conduct:
> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List
> Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List
> Archives:
> https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org
> Do not reply to spam on the list, report it:
> https://pagure.io/fedora-infrastructure
_______________________________________________
users mailing list -- users@lists.fedoraproject.org
To unsubscribe send an email to users-leave@lists.fedoraproject.org
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org
Do not reply to spam on the list, report it: https://pagure.io/fedora-infrastructure

[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic