I am trying to automate scientific PDF to XML using Python. I would like to know: is there any Python library that can generate XML from text or from PDF documents?
            Asked
            
        
        
            Active
            
        
            Viewed 1,022 times
        
    1 Answers
2
            
            
        You can use textract to read a pdf file, for details: Extracting text from a PDF file using Python once you have your text in a variable use ElementTrees to write properly formatted XML.
 
    
    
        Hood Khizer
        
- 115
- 7
