241

One of my favorite BASH commands is:

find . -name '*.*' -exec grep 'SearchString' {} /dev/null \;

which searches the contents of all of the files at and below the current directory for the specified SearchString. As a developer, this has come in handy at times.

Due to my current project, and the structure of my codebase, however, I'd like to make this BASH command even more advanced by not searching any files that are in or below a directory that contains ".svn", or any files that end with ".html"

The MAN page for find kind of confused me though. I tried using -prune, and it gave me strange behavior. In an attempt to skip only the .html pages (to start), I tried :

find . -wholename './*.html' -prune -exec grep 'SearchString' {} /dev/null \;

and did not get the behavior I was hoping for. I think I might be missing the point of -prune. Could you guys help me out?

Thanks

Oliver Salzburg
  • 89,072
  • 65
  • 269
  • 311
Cody S
  • 2,684

5 Answers5

328

You can use the negate (!) feature of find to not match files with specific names:

find . ! -name '*.html' ! -path '*.svn*' -exec grep 'SearchString' {} /dev/null \;

So if the name ends in .html or contains .svn anywhere in the path, it will not match, and so the exec will not be executed.

Stephen S
  • 628
Paul
  • 61,193
13

I've had the same issue for a long time, and there are several solutions which can be applicable in different situations:

  • ack-grep is a sort of "developer's grep" which by default skips version control directories and temporary files. The man page explains how to search only specific file types and how to define your own.
  • grep's own --exclude and --exclude-dir options can be used very easily to skip file globs and single directories (no globbing for directories, unfortunately).
  • find . \( -type d -name '.svn' -o -type f -name '*.html' \) -prune -o -print0 | xargs -0 grep ... should work, but the above options are probably less of a hassle in the long run.
l0b0
  • 7,453
12

The following find command does prune directories whose names contain .svn, Although it does not descend into the directory, the pruned path name is printed ...(-name '*.svn' is the cause!) ..

You can filter out the directory names via: grep -d skip which silently skips such input "directory names".

With GNU grep, you can use -H instead of /dev/null. As a slight side issue: \+ can be much faster than \;, eg. for 1 million one-line files, using \; it took 4m20s, using \+ it took only 1.2s.

The following method uses xargs instead of -exec, and assumes there are no newlines \n in any of your file names. As used here, xargs is much the same as find's \+.

xargs can pass file-names which contain consecutive spaces by changing the input delimiter to '\n' with the -d option.

This excludes directories whose names contain .svn and greps only files which don't end with .html.

find . \( -name '*.svn*' -prune  -o ! -name '*.html' \) |
   xargs -d '\n' grep -Hd skip 'SearchString'
Peter.O
  • 3,093
8

By using "-not" and "-and":

find . -type f \( \
    -not -name "${1}" \
    -and -not -name "${2}" \
\)

In one line:

find . -type f \( -not -name "${1}" -and -not -name "${2}" \)
3

This example excludes files that have the "test" in their names from the search. The search itself looks for "ProductReplacement" for XML files only.

find . ! -name '*test*.*' -name '*.xml' -exec grep -i 'ProductReplacement' {} \; -print

You can specify more exclusion patetrns with additional

! -name 'file_pattern' entries.