Nothing is ever truely lost (C4AG Archive)

Anything about cars, as long as it's clean...
No offensive language, no profanity, no nudity, nothing that your mom will slap you for...
Be nice to others.
User avatar
daveskatesallday
Club4AG Pro
Posts: 501
Joined: Thu Jan 10, 2013 10:35 pm

Re: Nothing is ever truely lost (C4AG Archive)

Postby daveskatesallday » Mon Feb 18, 2013 6:43 am

SidekickChuck wrote:Looks like there was a wayback archive created on December 7th, 2012. Links work and content preserved:

http://web.archive.org/web/201204181304 ... ub4ag.com/


Great tool for reference and content scraping. Just thought I would pass it on.



guess im slightly late on catching this but how TF ?!?! anyway glad we're able to go back and get all the good info that was on there!
Voici mon secret. Il est très simple: on ne voit bien qu'avec le cœur. L'essentiel est invisible pour les yeux.

User avatar
Red
Club4AG Expert
Posts: 475
Joined: Thu Jan 10, 2013 3:28 pm

Re: Nothing is ever truely lost (C4AG Archive)

Postby Red » Mon Feb 18, 2013 7:00 am

Dave, "htf" is really easy. There are programs that are called "web snatchers" or "web walkers" which start by going to each IP address, or each top level registered domain, and looking at the top (index, home) page. Then they look for any link on that page and visit the linked page, and repeat the process on each page.

You can find freeware that will do this on your computer. Adobe Acrobat used to offer the same option, they took that out about five yeas ago. The proble today is that web pages are so bloated, with so much code and so many advertising links, that "snatching" a web site can generate a tremendous amount of data now. The copy of the old forum pages that is on the wayback machine runs well over 10GB in size and can take a broadband customer DAYS to download. Possibly a week, running 24x7. ALong the way you'll pull down every page from eery idiot advertiser who linked their own web sites. "OOpsie".

But the theory is really easy, and the besster software will let you adjust how many levels down it goes, etc. That's gotten harder to find.

--Red
-- Original owner, 1985 GT-S

User avatar
daveskatesallday
Club4AG Pro
Posts: 501
Joined: Thu Jan 10, 2013 10:35 pm

Re: Nothing is ever truely lost (C4AG Archive)

Postby daveskatesallday » Mon Feb 18, 2013 8:36 am

Red wrote:Dave, "htf" is really easy. There are programs that are called "web snatchers" or "web walkers" which start by going to each IP address, or each top level registered domain, and looking at the top (index, home) page. Then they look for any link on that page and visit the linked page, and repeat the process on each page.

You can find freeware that will do this on your computer. Adobe Acrobat used to offer the same option, they took that out about five yeas ago. The proble today is that web pages are so bloated, with so much code and so many advertising links, that "snatching" a web site can generate a tremendous amount of data now. The copy of the old forum pages that is on the wayback machine runs well over 10GB in size and can take a broadband customer DAYS to download. Possibly a week, running 24x7. ALong the way you'll pull down every page from eery idiot advertiser who linked their own web sites. "OOpsie".

But the theory is really easy, and the besster software will let you adjust how many levels down it goes, etc. That's gotten harder to find.

--Red


i gotcha didn't know about that, cool stuff!
i will say though i tried clicking on a few threads and they wouldn't open but they were there so idk
Voici mon secret. Il est très simple: on ne voit bien qu'avec le cœur. L'essentiel est invisible pour les yeux.

User avatar
Red
Club4AG Expert
Posts: 475
Joined: Thu Jan 10, 2013 3:28 pm

Re: Nothing is ever truely lost (C4AG Archive)

Postby Red » Mon Feb 18, 2013 8:43 am

Sometimes it can be terribly slow to open, or load.
-- Original owner, 1985 GT-S