I read an HTML file into R and removed all HTML tags based on previous answers. However, the text still contains \r\n sequences. How can I replace them with " "? Any input would be appreciated. Thanks.
    2 Answers
3
            
            
        Simple:
> x <- 'abc\r\n123\r\n'
> gsub('[\n\r]',' ',x)
[1] "abc  123  "
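
A minimal sketch of how the pattern choice affects the result, assuming the input really contains carriage-return and line-feed control characters. The variable names are illustrative only. A character class matches one character at a time, so each `\r\n` pair is replaced twice; matching the pair as a unit, or collapsing a run of such characters, yields a single space per line break.

```r
x <- "abc\r\n123\r\n"

# Character class: \r and \n are each replaced on their own,
# so every CRLF pair turns into two spaces.
each_char <- gsub("[\r\n]", " ", x)

# The pair matched as one unit: one space per CRLF.
as_pair <- gsub("\r\n", " ", x)

# A run of CR/LF characters collapsed into one space.
# Note that this also merges consecutive blank lines.
as_run <- gsub("[\r\n]+", " ", x)

print(each_char)
print(as_pair)
print(as_run)
```

Which variant fits depends on whether blank lines in the source text should survive as separate gaps.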
 
    
    
        Thomas
        
- 
                    Maybe `gsub('[\n\r]',' ',x)` might be better? Or `gsub('\r\n', ' ', x)` – Rich Scriven May 07 '14 at 19:53
- 
                    @RichardScriven Yes, definitely. I wrote this answer without testing it. – Thomas May 07 '14 at 19:56
- 
                    @lucyh Then please read about how to supply [a reproducible example](http://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example). – Thomas May 07 '14 at 19:59
- 
                    None of these seem to work. Any suggestions? `> x <- "\\nDécor is fun" > gsub('[\\n]',' ',x) [1] " Décor is fu " > gsub('[\n\r]',' ',x) [1] "\\nDécor is fun"` – torenunez Apr 02 '20 at 23:44
- 
                    Ended up using a hacky solution in this post: https://stackoverflow.com/questions/51384784/how-to-replace-r-n-characters-in-a-text-string-specifically-in-r `> x <- "\\nDécor is fun" > gsub('__n', '', gsub('[\\\\]', '__', x)) [1] "Décor is fun"` – torenunez Apr 02 '20 at 23:56
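
The case in the last two comments differs from the original question: the string there holds a literal backslash followed by `n` (two ordinary characters), not a newline control character. A hedged sketch of two ways to handle that, using an illustrative ASCII input rather than the commenter's exact string:

```r
# Assumption: the text holds the two characters backslash + "r"/"n",
# not real carriage-return / line-feed control characters.
x <- "abc\\r\\n123"

# fixed = TRUE treats the pattern as a plain string, so "\\r\\n" means
# backslash, r, backslash, n with no regex interpretation.
literal_fixed <- gsub("\\r\\n", " ", x, fixed = TRUE)

# The regex equivalent needs each backslash escaped twice:
# once for the R string, once for the regex engine.
literal_regex <- gsub("\\\\r\\\\n", " ", x)

print(literal_fixed)
print(literal_regex)
```

Using `fixed = TRUE` avoids the double escaping and avoids placing a backslash inside a bracket expression, where it may be treated as a literal character to match.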
0
            
            
        None of these worked for me; I had to use:
> x <- 'abc\r\n123\r\n'
> gsub('[\r\n][\r\n]',' ',x,perl=TRUE)
[1] "abc 123 "
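
One caveat, shown as a sketch under the assumption that the input may mix line-ending styles: `[\r\n][\r\n]` only matches two consecutive line-break characters, so a lone `\n` or `\r` is left in place. An alternation that tries `\r\n` first covers all three conventions. The sample string and variable names are made up for illustration.

```r
# Hypothetical input mixing Windows (\r\n), Unix (\n) and old Mac (\r) endings.
x <- "a\r\nb\nc\rd"

# Two-character pattern: only the CRLF pair is replaced.
pairs_only <- gsub("[\r\n][\r\n]", " ", x, perl = TRUE)

# Alternation: CRLF is tried first, then a lone CR or LF.
any_ending <- gsub("\r\n|\r|\n", " ", x, perl = TRUE)

print(pairs_only)
print(any_ending)
```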
 
    
    
        R_User123456789
        
