usage: get_list <base_url> <base_file> <num_list>
This routine grabs pages from sequential sets of URLs and saves them to disk. <base_url> is the url to grab from. <base_file> is the filename to store that which is pointed at by <base_url>. <num_list> is a list of numbers and number sequences to be plugged into <base_url> and <base_file>. <base_url> and <base_file> should each include one printf style escape code ('%' followed by options and a type). Since only integers will be passed in, probably one of the integer types like %d or with modifiers like %03d. get_list.pl will iterate over each number represented by <num_list> and use that to format <base_url> and <base_file>. <num_list> is a comma separated list of integers and integer sequences with no spaces included. An integer sequence is two integers separated by a hyphen. Examples: Let's say we want to get: http://abc.de/blah/do/img02.jpg http://abc.de/blah/do/img05.jpg http://abc.de/blah/do/img06.jpg http://abc.de/blah/do/img07.jpg http://abc.de/blah/do/img12.jpg http://abc.de/blah/do/img13.jpg http://abc.de/blah/do/img14.jpg http://abc.de/blah/do/img15.jpg http://abc.de/blah/do/img16.jpg http://abc.de/blah/do/img20.jpg and save them as do_img02.jpg, do_img05.jpg, do_img06.jpg, do_img07.jpg, do_img12.jpg, do_img13.jpg, do_img14.jpg, do_img15.jpg, do_img16.jpg, and do_img20.jpg. We could use the command: get_list 'http://abc.de/blah/do/img%02d.jpg' 'do_img%02d.jpg' 2,5-7,12-16,20