inamerrata

I Want

Posted on 2003-11-13, 00:03, by aj, under meta.

One feature I’d really like for my blog is a micro web.archive.org that just caches the pages I link to (along with any graphics, frames, embedded junk, stylesheets and whatever else they might contain that affects how they display), so that I can have blosxom automagically redirect the link to my cached copy if and when the link goes stale.

Only problem is there doesn’t seem to be any software around that can just spider a single page and all the gumph that’s on it, but not anything it links to. Worse, that’s a Hard Problem, requiring a real HTML parser. Oh well.

UPDATE 2003/11/14:

So Clinton pointed me at wget’s -p option, which does what I want. How cool! A modicum of futzing around is required but this is actually doable. Sweet.

Comment (RSS) | Trackback

Recent Comments
- Crown on Putting the B in BTC
- oz on Putting the B in BTC
- MrItly on Liquid and Taproot Activation
- Yancy on Bitcoin in 2021
- aj on Bitcoin in 2021
- 0xB10C on Bitcoin in 2021
- 0xB10C on Bitcoin in 2021
November 2003

S M T W T F S

1

2 3 4 5 6 7 8

9 10 11 12 13 14 15

16 17 18 19 20 21 22

23 24 25 26 27 28 29

30

« Oct Dec »
Blogroll
Planets
- Debian
- HUMBUG
- Kernel.org
- Linux Australia
- Ubuntu
- WordPress
Pages
- About
Categories
- btc (13)
- copyright (40)
- debian (87)
- ecash (21)
- links (7)
- linux-aus (12)
- mac (4)
- marketsw (2)
- meta (40)
- neo-con (34)
- osdc2007 (1)
- philosophy (9)
- poli-mics (63)
- products (5)
- random (45)
- redhat (2)
- rockets (2)
- startup (1)
- tech (50)
- travelblog (8)
- Uncategorized (8)
Meta

I Want

Leave a Reply

Recent Comments

Blogroll

Planets

Pages

Categories

Meta