Questions tagged [html-xml-utils]
11 questions
                    
                    4
                    
            votes
                
                1 answer
            
        Parse local HTML python (lxml)
I'm trying to parse a local HTML with lxml, but I'm getting an error, but I don't know why (sorry in advance for the bad code, I'm new to this).
from lxml import etree, html
from StringIO import StringIO
parser = etree.HTMLParser()
doc =…
         
    
    
        Lara M.
        
- 855
- 2
- 10
- 23
                    2
                    
            votes
                
                1 answer
            
        xml2csv package error '_IterParseIterator' object has no attribute 'next'
I am using xmlutils package to convert xml file to csv. My code is following :
from xmlutils.xml2csv import xml2csv as x
input_path='/media/ishan/Local Disk/doc.xml'
output_path='media/ishan/Local…
         
    
    
        Ishan
        
- 996
- 3
- 13
- 34
                    1
                    
            vote
                
                2 answers
            
        How can I use hxselect to generate array-ish result?
I'm using hxselect to process a HTML file in bash.
In this file there are multiple divs defined with the '.row' class.
In bash I want to extract these 'rows' into an array. (The divs are multilined so simply reading it line-by-line is not…
         
    
    
        Daniel
        
- 2,318
- 2
- 22
- 53
                    1
                    
            vote
                
                1 answer
            
        Middle Selectors Ignored in hxselect
I'm attempting to extract some text from a webpage using hxselect from html-xml-utils 7.4.  According to the man, hxselect will accept a comma delimited list of CSS selectors.  I have three selectors:
/usr/local/bin/hxnormalize -x -i 0 -l 5000…
         
    
    
        A Bowler Cat
        
- 305
- 1
- 2
- 9
                    1
                    
            vote
                
                1 answer
            
        Groovy: keeping of CR/LF of attributes while serializing XMLSlurper
I'm new with groovy (a few weeks of experience). Currently I'm trying to process some visual studio .vcproj files using groovy: replacing some paths, that will be found by a regexp patterns. This works fine for me.
To write the changes to the file,…
         
    
    
        Alfred Meier
        
- 47
- 4
                    0
                    
            votes
                
                1 answer
            
        Vbscript/UFT how to get the Tag elements name to validate Xpath of tag elements
Attached XML sample, In the attached XML I want to verify the tag Elements presence
for ex: PayloadList/IFXResp/IFX/GeneralStatus/StatusCode
it would be great if anyone can help me to get the above xpath code. Looking for code in vbscript/UFT to…
         
    
    
        Anil
        
- 3
- 2
                    0
                    
            votes
                
                1 answer
            
        Parsing XML in Python
I have a large XML file and I need to format it to get some needed data from particular elements in it and print out only data needed into another file. In the XML file I have a number of text tags belonging to different conversations with id's and…
         
    
    
        T. A
        
- 1
- 3
                    0
                    
            votes
                
                1 answer
            
        html-xml-utils Get a specific  value
        I am using Xubuntu 16.04.2 with html-xml-utils
I tried this command
hxnormalize -x site/sciz.html | hxselect 'table.vitals-table:first-child'
but it doesn't work, it outputs nothing. There are about 3-5 of these tables in the html.
Doing this…
        
            
            
                
                    
    
    
         knuxyl
    
    
        knuxyl
        
            - 129
- 3
- 12
0
                    votes
                
            
            
                2 answersmake[1]: exec(f77) failed (No such file or directory) on DragonFly BSD
        I get this error when trying to compile html-xml-utils-6.9 on DragonFly BSD. Funny thing is, when doing grep -r f77 in the source directory, the result is empty. So no such option is in the constructed Makefile after ./configure.
This is my…
        
            
            
                
                    
    
    
         Olle Härstedt
    
    
        Olle Härstedt
        
            - 3,799
- 1
- 24
- 57
0
                    votes
                
            
            
                1 answerExtract HTML table content based on "thead"
        Here is a basic HTML table :
  
    bar 
  
  
    rows 
    …
  
Suppose there are several such tables in the source file.  Is there an option of hxextract, or a CSS3…
        
            
            
                
                    
    
    
         Skippy le Grand Gourou
    
    
        Skippy le Grand Gourou
        
            - 6,976
- 4
- 60
- 76
-1
                    votes
                
            
            
                1 answerReplace anything inside a HTML tag ID
        Have searched all over Stackoverflow and Google, and the closest answer:
sed -i -e 's|\([0-9]\{0,\}\) |'"$(date -d @${contents})"' |g' /var/www/html/index.html
works only when the…
        
            
            
                
                    
    
    
         dani 'SO learn value newbies'
    
    
        dani 'SO learn value newbies'
        
            - 151
- 1
- 15
 
I am using Xubuntu 16.04.2 with html-xml-utils
I tried this command
hxnormalize -x site/sciz.html | hxselect 'table.vitals-table:first-child'
but it doesn't work, it outputs nothing. There are about 3-5 of these tables in the html.
Doing this…
         
    
    
        knuxyl
        
- 129
- 3
- 12
                    0
                    
            votes
                
                2 answers
            
        make[1]: exec(f77) failed (No such file or directory) on DragonFly BSD
I get this error when trying to compile html-xml-utils-6.9 on DragonFly BSD. Funny thing is, when doing grep -r f77 in the source directory, the result is empty. So no such option is in the constructed Makefile after ./configure.
This is my…
         
    
    
        Olle Härstedt
        
- 3,799
- 1
- 24
- 57
                    0
                    
            votes
                
                1 answer
            
        Extract HTML table content based on "thead"
Here is a basic HTML table :
  
    
Suppose there are several such tables in the source file.  Is there an option of hxextract, or a CSS3…
        | bar | rows… | 
 
    
    
        Skippy le Grand Gourou
        
- 6,976
- 4
- 60
- 76
                    -1
                    
            votes
                
                1 answer
            
        Replace anything inside a HTML tag ID
Have searched all over Stackoverflow and Google, and the closest answer:
sed -i -e 's|\([0-9]\{0,\}\) |'"$(date -d @${contents})"' |g' /var/www/html/index.html
works only when the…
         
    
    
        dani 'SO learn value newbies'
        
- 151
- 1
- 15