don't need a CD, just google "teleport pro" and create a browsable copy on your hard disk.
Wow - using a spider crawler to pull down a mirror of the site will bog things way down if, say, 2 folks try it at once.
Well, it's slow: I'm at 2.12GB, 48,101 files, 36 folders, after about 24 hours.
I'm at a point where it's starting to skip what it thinks are binary files:
listerengine.com/smf/index.php?action=printpage;topic=1157.0
11:28:22 Info: engine: transfer-status: link added: listerengine.com/smf/index.php?topic=3231.msg40869 -> t:/My Web Sites/Lister Engine Forum/listerengine.com/smf/index32e0.html
11:28:22 Warning: File not parsed, looks like binary: listerengine.com/smf/index.php?topic=1157.msg15132
11:28:22 Info: engine: transfer-status: link added: listerengine.com/smf/index.php?topic=3231.msg40698 -> t:/My Web Sites/Lister Engine Forum/listerengine.com/smf/indexbf6d-5.html
11:28:23 Warning: File not parsed, looks like binary: listerengine.com/smf/index.php?topic=1157.msg18545
And yet it looks like a valid page when pasted into a browser. And it's still crunching.
UPDATE
Finished the crawl. 2.48GB, 66,872 files, 45 folders. Should be all text, and compressible by at least 50%, if not more.
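For anyone who wants to check the "at least 50%" guess on their own copy rather than take it on faith, here is a minimal Python sketch. It only estimates: it runs each file through zlib on its own and adds up the sizes, so a real archive (zip, 7z, tar.gz) may do better or worse. The function name `estimate_compression` is made up for illustration, and it reads each file fully into memory, which should be fine for forum pages.

```python
import os
import zlib


def estimate_compression(root):
    """Walk `root` and return (raw_bytes, compressed_bytes).

    Each file is compressed independently with zlib level 6, so the
    result is a rough estimate, not what a real archiver would give.
    """
    raw = 0
    packed = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            with open(os.path.join(dirpath, name), "rb") as fh:
                data = fh.read()
            raw += len(data)
            packed += len(zlib.compress(data, 6))
    return raw, packed


# Example use (point it at wherever your mirror lives):
#   raw, packed = estimate_compression("path/to/mirror")
#   print(f"{raw} -> {packed} bytes, saving {1 - packed / raw:.0%}")
```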
HTTrack Website Copier/3.33 mirror complete in 1 days, 5 hours 51 minutes 10 seconds : 67833 links scanned, 66788 files written (1977475170 bytes overall), 66649 files updated [589582870 bytes received at 5486 bytes/sec], 1975030644 bytes transfered using HTTP compression in 65832 files, ratio 28%, 15.3 requests per connection
(142 errors, 33076 warnings, 66654 messages)
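The numbers in that summary line can be sanity-checked with a few lines of Python. This is just arithmetic on the figures HTTrack printed. One assumption on my part: I'm reading "bytes received" as what came over the wire (compressed) and "bytes overall" as what landed on disk, which is why the last figure is only a rough guide. I'm not trying to reproduce HTTrack's own "ratio 28%", since I don't know exactly how it computes that.

```python
# Figures copied from the HTTrack summary line above.
elapsed_s = ((1 * 24 + 5) * 60 + 51) * 60 + 10  # 1 day, 5 h, 51 min, 10 s
bytes_received = 589_582_870     # "bytes received"
bytes_written = 1_977_475_170    # "bytes overall"
files_written = 66_788           # "files written"

# Average transfer rate over the whole crawl, in bytes per second.
rate = bytes_received / elapsed_s

# Average size of a saved file, in bytes.
avg_file = bytes_written / files_written

# Fraction of the on-disk size that actually crossed the wire
# (assumes "received" is compressed and "overall" is uncompressed).
wire_fraction = bytes_received / bytes_written

print(f"elapsed:       {elapsed_s} s")
print(f"rate:          {rate:.0f} bytes/sec")
print(f"avg file size: {avg_file / 1024:.1f} KiB")
print(f"wire fraction: {wire_fraction:.0%}")
```

The rate works out to about 5,486 bytes/sec, matching what HTTrack reported. If the assumption above holds, roughly 30% of the on-disk size came over the wire, which fits with the pages being mostly compressible text.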