A PDF Harvester in 25 Lines of Python
The goal of this article is to develop a utility that does three things: retrieve the HTML of a webpage; parse that HTML and extract every link that points to a PDF; and, for each PDF link, download the document and save it locally. Plenty ...
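The article's own 25-line implementation is not part of this excerpt, so what follows is only a rough sketch of the three steps using the Python standard library. The names `PDFLinkParser`, `extract_pdf_links`, `harvest`, and the `pdfs` output directory are hypothetical, and the sketch assumes that PDF links appear as `<a href="...">` targets whose path ends in `.pdf`.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class PDFLinkParser(HTMLParser):
    """Collects absolute URLs of <a href> targets whose path ends in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page URL.
        absolute = urljoin(self.base_url, href)
        # Assumption: a PDF link is one whose URL path ends in ".pdf".
        if urlparse(absolute).path.lower().endswith(".pdf"):
            self.links.append(absolute)


def extract_pdf_links(html, base_url):
    """Step 2: parse the HTML and return the PDF links in document order."""
    parser = PDFLinkParser(base_url)
    parser.feed(html)
    return parser.links


def harvest(page_url, out_dir="pdfs"):
    """Steps 1 and 3: fetch the page, then download each PDF into out_dir."""
    os.makedirs(out_dir, exist_ok=True)
    with urlopen(page_url) as resp:
        html = resp.read().decode("utf-8", errors="replace")
    saved = []
    for link in extract_pdf_links(html, page_url):
        target = os.path.join(out_dir, os.path.basename(urlparse(link).path))
        with urlopen(link) as pdf, open(target, "wb") as fh:
            fh.write(pdf.read())
        saved.append(target)
    return saved
```

Calling `harvest("https://example.com/reports/")` would fetch that page and write each linked PDF into `pdfs/`. Keeping link extraction in its own function means it can be checked against a literal HTML string without touching the network. The sketch does not handle name collisions, retries, or pages that serve PDFs from URLs without a `.pdf` suffix.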