Static HTML for older conference sites

As part of the admin team sprint this week (to consolidate/simplify/move servers), I flipped the switch this morning so that 2017.ploneconf.org and 2018.ploneconf.org are now being served as static HTML. I used httrack to convert the Plone sites to HTML; that is a lot nicer than wget! Those two sites join the other two static HTML 2015.ploneconf.org and 2016.ploneconf.org.

1 Like

...which reminds me: I'd started looking at archive.org to see how to extract the 2014 site and clean up the HTML. TBD!

Speaking of a lot nicer than wget... I had good results with https://github.com/website-scraper/demo

1 Like