Artificial intelligent assistant

wget: Follow custom URL attributes wget is a great tool to make a quick snapshot of a small site. As far as I know (I really hope I just couldn't find it in `wget --help`), wget can follow well-known HTML URL attributes only, like `<a href=...`, `<img src=...` and so on. However, sometimes a particular site might use non-standard attributes that represent real URLs that do not look like URLs for wget. Let' say, if a site has a "static" gallery with zoomed images, a particular image page might have something like this: <div zoomed_img="/gallery/image.jpg"> <img src="/gallery/image_small.jpg"/> </div> Thus, wget ignores the `zoomed_img` attribute with `/gallery/image.jpg`. My wget command is: wget --recursive \ --domains domain \ --no-parent \ --page-requisites \ --no-clobber \ --html-extension \ --convert-links \ Is it possible to make wget follow custom URL HTML attributes?

wget (at least 1.16.3) does not allow a user to specify custom attributes. There has been a suggestion to extend the the `--follow-tags` option with a syntax like `--follow-tags=a/href`, but nobody followed up on that.

If you do not mind applying a quick hack to wget to support your scenario, see this commit for the work needed to add custom tags or attributes.

Alternatively, you can write a post-processor yourself.

xcX3v84RxoQ-4GxG32940ukFUIEgYdPy 69f12fab37418125a216d16fe7b17820