WebGet (web page and content grabber)
Did you ever find a website with a bunch of cool files, but they were
scattered on tons of pages and each page popped up ads?
I hate that!
WebGet lets you list a page and find its links and its content.
It sorts the content and then lets you list off THOSE pages, too.
Once you come across a decent site in IE (or whatever you browse the web with),
click in the address bar and hit CTRL-C to copy the URL to the clipboard.
After clicking the LIST button (the find folder button), you'll see
that page's content listed off at the bottom and its links listed at the
top with "tocheck" status. The page you just listed shows "done" status.
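
Listing a page boils down to pulling the src= and href= values out of its
HTML (the recap further down says as much). Here's a rough Python sketch of
that idea. It is NOT WebGet's actual code. The names LinkLister and list_page
are made up, and treating href= as "links" and src= as "content" is just an
assumption about how the two lists get split.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkLister(HTMLParser):
    """Collects every href= and src= value found in a page (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []    # href= targets: assumed to be more pages to list
        self.content = []  # src= targets: assumed to be pics, sounds, etc.

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if not value:
                continue
            # Turn relative URLs into absolute ones against the listed page.
            url = urljoin(self.base_url, value)
            if name == "href":
                self.links.append(url)
            elif name == "src":
                self.content.append(url)


def list_page(base_url, html):
    """Return (links, content) for one page's HTML text."""
    lister = LinkLister(base_url)
    lister.feed(html)
    return lister.links, lister.content
```

Feed it the page's URL and its HTML text, and you get back two lists of
absolute URLs.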

Click the LIST button again to read each of those pages in and get their
links and content, too.
This can take a while, and WebGet's screen just freezes the whole time.
It's downloading stuff, but you can't see status on the screen.
You -can- see status with DebugView.
(Hokey? Yes. But for now just DEAL with it,
till I get paid enough to get motivated enough to change this situation ;)


After LISTing for a few iterations, all the "tocheck" status links will be
turned into "done" status and you'll have a list of all the content.
Click the DOWNLOAD button (green down arrow) to download all the content
to a particular folder on your hard drive.
Other buttons:
The green check button toggles a "tocheck" into a "done" and vice versa.
The green check with a 20 does that for 20 URLs.
Use this if you don't want to bother loading these pages.
(Cuz you know the content they have is worthless, etc.)
The red X button deletes a URL so it isn't downloaded to your hard drive.
Same deal with the 20 one. (Derrr)
Might not want to use this till you're DONE listing pages, cuz they might
pop right back in.
Might want to delete the crappy HTML pages that you really don't care about
(except for their content: pics, WAVs, etc).
The + button adds a web root URL to the filter list.
If the site uses another web root to store stuff,
highlight the external URL and click the + to add that web root, too.
Save and Load save and restore intermediate WebGet lists (so you can come back to them later).
The blue E will boot IE and load the highlighted URL.
(handy:)
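
For the curious, the + button's filter list can be pictured like this. It's
only a sketch under assumptions: "web root" is taken to mean scheme plus host,
and web_root and in_filter are hypothetical names, not anything WebGet
exposes.

```python
from urllib.parse import urlsplit


def web_root(url):
    """Assumed meaning of "web root": the scheme and host part of a URL."""
    parts = urlsplit(url)
    return f"{parts.scheme}://{parts.netloc}/"


def in_filter(url, roots):
    """True if the URL lives under one of the web roots in the filter list."""
    return any(url.startswith(root) for root in roots)


# The filter list starts with the root of the first page you listed.
roots = [web_root("http://example.com/files/index.html")]

# Clicking + on a highlighted external URL would add its root as well.
roots.append(web_root("http://cdn.example.net/sounds/a.wav"))
```

With both roots in the list, URLs on either host pass the filter and anything
else gets skipped.
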
So to recap:
Copy a web site URL to the clipboard and click LIST.
That page is loaded and each src= and href= thingy is listed off.
(Basically, a file system is built.)
Keep:
1) deleting junky links with the delete button.
(Usually from javascript code, etc.)
There's also an OH so handy delete*20 button.
2) changing "tocheck" entries you don't care about to "done"
with the check button
3) listing again
until all the "tocheck"s turn into "done"s.
Then download stuff to a temp dir with the green down button.
You can also save and load intermediate WebGet states with save/load.
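
The "list again until everything is done" routine from the recap looks roughly
like this in Python. Again, just a sketch: list_once, list_until_done and
fetch_links are invented names, and fetch_links stands in for whatever really
downloads a page and returns its links.

```python
def list_once(entries, fetch_links):
    """One click of LIST: read every "tocheck" page, then mark it "done"."""
    pending = [url for url, status in entries.items() if status == "tocheck"]
    for url in pending:
        for link in fetch_links(url):
            # New links show up as "tocheck"; known URLs keep their status.
            entries.setdefault(link, "tocheck")
        entries[url] = "done"


def list_until_done(entries, fetch_links):
    """Keep listing until no "tocheck" entries are left; return the pass count."""
    passes = 0
    while "tocheck" in entries.values():
        list_once(entries, fetch_links)
        passes += 1
    return passes


# A pretend three-page site, so the sketch runs without touching the network.
site = {"a": ["b", "c"], "b": ["a"], "c": []}
entries = {"a": "tocheck"}
passes = list_until_done(entries, lambda url: site.get(url, []))
```

Marking an entry "done" by hand before the next pass (the green check button)
just means the loop never reads that page.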