The gist: the small company I work for advertises its products through Google Merchant. We upload the products in an XML file as per Google's requirements.
The problem: manually formatting thousands of products into XML is an arduous task. What I want is a rapid-fire way to convert the relevant information on each product page into formatted XML. I'm looking for a (semi-)automatic way to go from bigHTMLSourceCode --> formattedXML.
If I'm not being clear, imagine wanting to format an Amazon product page into XML. You want the cost, description, weight, etc., arrayed in a certain way, with the appropriate XML tags, etc., and doing so for thousands of products isn't tenable.
I've Googled extensively, but haven't had any luck finding programs that can help with this.