How to ignore certain filenames using "find"?

Question

One of my favorite BASH commands is:

find . -name '*.*' -exec grep 'SearchString' {} /dev/null \;

which searches the contents of all of the files at and below the current directory for the specified SearchString. As a developer, this has come in handy at times.

Due to my current project, and the structure of my codebase, however, I'd like to make this BASH command even more advanced by not searching any files that are in or below a directory that contains ".svn", or any files that end with ".html"

The MAN page for find kind of confused me though. I tried using -prune, and it gave me strange behavior. In an attempt to skip only the .html pages (to start), I tried :

find . -wholename './*.html' -prune -exec grep 'SearchString' {} /dev/null \;

and did not get the behavior I was hoping for. I think I might be missing the point of -prune. Could you guys help me out?

Thanks

score 328 · Accepted Answer · edited Aug 28 '16 at 02:03

328

You can use the negate (!) feature of find to not match files with specific names:

find . ! -name '*.html' ! -path '*.svn*' -exec grep 'SearchString' {} /dev/null \;

So if the name ends in .html or contains .svn anywhere in the path, it will not match, and so the exec will not be executed.

edited Aug 28 '16 at 02:03

Stephen S

628

answered Mar 06 '12 at 00:40

Paul

61,193

l0b0 · Answer 2 · 2012-03-06T14:15:50.713

I've had the same issue for a long time, and there are several solutions which can be applicable in different situations:

ack-grep is a sort of "developer's grep" which by default skips version control directories and temporary files. The man page explains how to search only specific file types and how to define your own.
grep's own --exclude and --exclude-dir options can be used very easily to skip file globs and single directories (no globbing for directories, unfortunately).
find . \( -type d -name '.svn' -o -type f -name '*.html' \) -prune -o -print0 | xargs -0 grep ... should work, but the above options are probably less of a hassle in the long run.

Peter.O · Answer 3 · 2012-03-06T13:32:28.487

The following find command does prune directories whose names contain .svn, Although it does not descend into the directory, the pruned path name is printed ...(-name '*.svn' is the cause!) ..

You can filter out the directory names via: grep -d skip which silently skips such input "directory names".

With GNU grep, you can use -H instead of /dev/null. As a slight side issue: \+ can be much faster than \;, eg. for 1 million one-line files, using \; it took 4m20s, using \+ it took only 1.2s.

The following method uses xargs instead of -exec, and assumes there are no newlines \n in any of your file names. As used here, xargs is much the same as find's \+.

xargs can pass file-names which contain consecutive spaces by changing the input delimiter to '\n' with the -d option.

This excludes directories whose names contain .svn and greps only files which don't end with .html.

find . \( -name '*.svn*' -prune  -o ! -name '*.html' \) |
   xargs -d '\n' grep -Hd skip 'SearchString'

score 8 · Answer 4 · edited Jan 23 '24 at 08:08

8

By using "-not" and "-and":

find . -type f \( \
    -not -name "${1}" \
    -and -not -name "${2}" \
\)

In one line:

find . -type f \( -not -name "${1}" -and -not -name "${2}" \)

edited Jan 23 '24 at 08:08

Vitor Wdson

3

answered May 27 '21 at 16:58

Alberto Salvia Novella

360

score 3 · Answer 5 · answered Jun 03 '20 at 23:50

This example excludes files that have the "test" in their names from the search. The search itself looks for "ProductReplacement" for XML files only.

find . ! -name '*test*.*' -name '*.xml' -exec grep -i 'ProductReplacement' {} \; -print

You can specify more exclusion patetrns with additional

! -name 'file_pattern' entries.

How to ignore certain filenames using "find"?

5 Answers5

Linked