I maintain a website showing my university group's publications.
I have written a quick-and-dirty Ruby script that parses a CSV file containing this data (which I download manually from the ISI Web of Science website) and presents it as nicely formatted HTML.
The CSV file contains no direct link to a PDF. Instead, it has information I can use to go to doi.org, which forwards to the real page (hosted by the journal) containing a link to the PDF.
For each publication in the CSV file, I want to go to that web page and grab the PDF.
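To make the redirect step concrete, here is a rough sketch of how it might look in Ruby, using only the standard library. It assumes the information in the CSV is a DOI; `doi_url` and `final_url` are hypothetical helper names, and the DOI in the comment is only a placeholder.

```ruby
require "net/http"
require "uri"

# Hypothetical helper: builds the doi.org address for a DOI string.
# Assumes the DOI needs no extra escaping.
def doi_url(doi)
  "https://doi.org/#{doi}"
end

# Hypothetical helper: follows HTTP redirects from `url` and returns the
# address of the page that finally answers without redirecting.
def final_url(url, limit = 10)
  raise ArgumentError, "too many redirects" if limit.zero?

  response = Net::HTTP.get_response(URI(url))
  return url unless response.is_a?(Net::HTTPRedirection)

  # The Location header may itself be relative, so resolve it against `url`.
  next_url = URI.join(url, response["location"]).to_s
  final_url(next_url, limit - 1)
end

# Example usage (needs network access; placeholder DOI):
#   puts final_url(doi_url("10.1000/182"))
```

Some journal sites may redirect differently depending on cookies or the user agent, so this may not work everywhere as written.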
I've never done this before. Using wget in a terminal works fine, except that the PDF link in the journal page's HTML is a relative one, simply "/link info", without the scheme and domain name.
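As far as I can tell, a relative link like that has to be resolved against the address of the page it was found on. A minimal sketch with Ruby's standard `uri` library, where `absolute_link` is a hypothetical helper and both addresses are made-up examples:

```ruby
require "uri"

# Hypothetical helper: turns an href found on a page into a full URL by
# resolving it against the URL of that page.
def absolute_link(page_url, href)
  URI.join(page_url, href).to_s
end

# Made-up journal page and relative PDF link:
puts absolute_link("https://journal.example.org/articles/123", "/content/123.pdf")
# prints "https://journal.example.org/content/123.pdf"
```

The resulting full URL could then be passed to wget or downloaded from the script itself.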
Can anyone recommend a simple way of going about this, please?