283

I have a text file and want to keep lines started with <Path>, and delete all the other lines.

How can I do it?

wonea
  • 1,877

11 Answers11

418

There is an easy way to achieve this. You need to perform 2 steps.

  1. Go to Search menu > Find... > Select "Mark" Tab. Activate regular expressions. Search for ^<Path> (^ is for line start). Don't forget to check "Bookmark lines" and Press "Mark All"

    ==> All Rows you want to keep got a Bookmark

  2. Go to Menu "Search - Bookmark - Remove unmarked lines"

    ==> All lines that aren't Bookmarked are deleted.

stema
  • 4,614
70

This can actually be done in two steps as of 6.3. I think it can be done earlier than that as I had 5.9 when I first tried it.

Using stema's post as the basis of this answer. There's one less step now. Mark lines and remove unmarked lines. Done. Detailed instructions follow.

  1. Search menu "Find". In the Find dialog, click the "Mark" tab. Activate regular expressions. Search for ^<Path> (^ is for line start). Don't forget to check "Bookmark lines" and Press "Mark All"

    ==> All Rows you want to keep now have a Bookmark

  2. Search Menu -> Bookmark -> Remove Unmarked Lines.

    ==> All NON Bookmarked lines are deleted.

37

Clean regex only solution

Two step variant

  1. regex replace

    (?!^.*test.*$)^.+
    

    replace test with your requested text

  2. replace

    [\r\n]{2,}
    

    with \r\n

Single step variant

Use ^(?!<Path>).*\r\n to replace matches with empty string. Generalized version would be ^(?!.*?test).*\r\n. This won't remove empty line at the end of the file. All other lines are removed, including multiple consecutive empty lines.

Explanation:

  1. (?!) is a negative look up. ^.*test.*$ selects the whole line that contains the requested text.

  2. [\r\n]{2,} matches any \r\n that occurs more then once this is Windows New line. if you have Linux or another operating system you might need to mess with this. the second is to replace it with one return line.

sonar0m
  • 471
7

It seems to me that the easiest way is to just use the "Find All in Current Document" feature and then either copy the results into a new file or select all and replace in the current one.

This would find all lines containing your text and list them at the bottom. Just right click on the search result and copy / paste.

Gubbins
  • 81
  • 1
  • 1
5

Better solution with regex replace:

(?!^.*SOMETEXT.*$)^.+\r?\n

And replace with nothing

LoneDev
  • 51
4

Go to menu Search -> Find... -> Activate regular expressions. Search for "^Path" (^ is for line start).

Click on the "Find all in Current Document" button.

The "Find result" window will appear with all the lines the the pattern. Select copy/paste them to a new tab in Notepad++.

In this new tab, got to: menu Search -> Replace... -> Activate regular expressions.

In the "Find what:" field, use the pattern: "Line \d+: ". Leave the "Replace with:" field blank.

Click on the "Replace all" button.

Luis
  • 41
3

It is clumsy, but copy it all to Excel, and then use =IF(LEFT(A1,6)="<Path>",A1,"") and copy that formula all the way down. Then copy that back to Notepad++. It's not ideal, but it's pretty easy (if you have Excel). Warning: It will not work well with indented lines (Excel will shift the columns, etc.).

soandos
  • 24,600
  • 29
  • 105
  • 136
3

Providing that you actually want to match <Path> and not a file system path, you can try this from a command line using Perl:

perl -pe " if ($_ !~ /<Path>/) { s/$_// } " < in.txt > out.txt

It worked with Strawberry Perl on Windows, so adjust accordingly if the results are not what you expect.

Joe Internet
  • 5,355
2

There is no easy way to do what you want with Notepad++. You'll need to either download a program to your computer or script something in VB (I assume you're on Windows).

You can do what you want one of two ways with sed. The sed utility is a favorite on *nix and can be found for Windows from the great people at GnuWin (http://gnuwin32.sourceforge.net/packages/sed.htm). You would download this program, and then run your command from the command prompt.

Delete all lines not containing :

sed -i '/^<PATH>/!d' file

Print all lines containing to a new file:

sed -n '/^<PATH>/p' file > newfile

I suggest you use print the lines you want to a new file. The reason for this is that you probably won't get the regex statement for <PATH> the first time around. The sed utility uses Regular Expression Basic Syntax (view the reference at http://www.regular-expressions.info/reference.html). If <PATH> is something like a *nix path (/var/www) then you'll need to escape the / character for your regex to work.

Example: sed -n '/^\/var\/www/p' file > newfile
This will print out all lines that start with /var/www. If I filed to escape the / character, then the command would have thrown an error. You can escape a special character (such as /) with the backslash character \.

phuclv
  • 30,396
  • 15
  • 136
  • 260
Chris Ting
  • 1,579
1

2025-06-19 update: the LineFilter2 plugin is now called LineFilter3:

enter image description here

enter image description here

outputs:

enter image description here


As Karsten suggested in a comment, one can use the LineFilter2 plugin, which is straightforward to install and use.

enter image description here

To install:

enter image description here

To launch:

enter image description here

Franck Dernoncourt
  • 24,246
  • 64
  • 231
  • 400
0

Use Search->Replace and enter a regular expression like ^[^ ].* and replace all with an empty string using Regular expression. Next step is to find empty lines searching for \n\n replacing with \n using Extended multiple times until 0 occurrences were found. (use \r\n\r\n and \r\n depending on file format). If you have very many empty lines in a row, it is quickier to use \n\n\n\n\n\n\n or even more \n:s in the search string.