99

What command do I use to find the size of all the files (recursively) in a Linux or Mac OS X directory?

Journeyman Geek
  • 133,878

5 Answers5

105

The BSD version of du used in OS X reports size with 512-byte blocks -- the sizes are essentially rounded up to the next 512-byte value. This tells you the space on disk, which is larger than the amount of data. If you have a lot of small files, the difference can be large.

Here's an example.

This is the value with regular du. It's in 512-byte blocks:

$ du -s
248   .

The -h flag results in a more readable number, in kilobytes. As expected, it's half the number of 512-byte blocks:

$ du -hs
124K  .

Finally, you can use find and awk to give you the sum of actual bytes in the files. This is kind of slow, but it works:

$ find . -type f -exec ls -l {} \; | awk '{sum += $5} END {print sum}'
60527

This value matches exactly the number reported by Finder's Get Info window. (There are no weird forks or xattrs in this set of files.) It's significantly smaller than the value reported by du.

Here's how it works: it gets a list of all the files, and passes them to ls -l; then awk is used to count up the bytes. The -type f flag is there so that only files (and not directories) get sent to ls. Without that flag, it'll also send directory names to ls, and each file will be listed twice : once as an individual file, and once as an item in the directory.

The GNU version of du can give values in actual bytes instead of blocks. It's unfortunate that the BSD version of du is not as flexible.

wch
  • 1,166
82

Show the size of a single file

du -h path_to_a_file

Show the size of the contents of a directory, each sub-directory, and each individual file:

du -h path_to_a_directory

Show the size of the contents of a directory:

du -sh path_to_a_directory

14

You can use du -ah . which displays sizes of all files and directories recursively.

This can be combined with sort, so you'll see the top-20 biggest directories in the current folder:

du -ah . | sort -rh | head -20

Note: Option -h for sort is not available on OSX/BSD, so you've to install sort from coreutils (e.g. via brew) and apply the bin path to PATH, e.g.

export PATH="/usr/local/opt/coreutils/libexec/gnubin:$PATH" # Add a "gnubin" for coreutils.

Otherwise use:

du -a . | sort -rn | head -20
kenorb
  • 26,615
11

du - tells the disk use not the file size.

find . -type f -print0 | xargs -0 stat -f%z | awk '{b+=$1} END {print b}'

above terminal code (im on osx 10.6) offers for me the best result and is waaay faster than "find ... -exec"

a quick benchmark

time find . -type f -print0 | xargs -0 stat -f'%z' | awk '{b+=$1} END {print b}'
4744010970

real    0m0.086s
user    0m0.029s
sys 0m0.073s

time find . -type f -exec ls -l {} \; | awk '{sum += $5} END {print sum}'
4744010970

real    0m18.515s
user    0m2.929s
sys 0m9.339s
Acid
  • 111
2

I combined all your approuches and combined it with a human readable output the result is:

#!/bin/sh
find $1 -type f -print0 | xargs -0 stat -f'%z' | awk '{b+=$1} END {print b}' | awk '{ sum=$1 ; hum[1024**3]="Gb";hum[1024**2]="Mb";hum[1024]="Kb"; for (x=1024**3; x>=1024; x/=1024){ if (sum>=x) { printf "%.2f %s\n",sum/x,hum[x];break } }}'

Link to the gist: https://gist.github.com/mlegenhausen/9365461

malte
  • 121