86

I have a scanned course and it has two pages, consecutive are showing as one page. How can I automatically split all the pages in one pass? Usually this is done by cropping odd and even pages and then merge them back together but this could take very much time.

How can I split pages on scanned PDF in a single pass?

Traveler
  • 370

6 Answers6

61

You could use MuPDF's mutool:

mutool poster -x 2 in.pdf out.pdf
DG'
  • 859
59

There's an excellent, free and open source tool called Briss. It is very simple, user friendly and effective. It works on multiple operating systems through Java.

Load your PDF into the app. The app will group similar pages together and lay them on top of each other. Draw rectangles on top of your pages so that they cover what you want included. It will look like this:

enter image description here

Even if your PDF has multiple categories of layout within a single document, Briss will handle it. For example, let's say some parts are in portrait and others in landscape. Briss will group them into different categories and let you draw different rectangles on them, and then process it all in a single pass, into a single document. Briss is very good at deciding which pages should be grouped together. It typically takes me less then a minute of manual work to get Briss started. Thus, a document of hundreds or even thousands of pages can be done in a couple of minutes thanks to this brilliant program.

When it looks good, select Action, then Crop PDF.

Truly a very neat tool.

Note: I realize this answer reads like I'm a Briss developer or something, but I'm really not. I just love the tool.

Fiksdal
  • 865
36

After looking on some internet answers (it is a question often asked ) I discovered that this can be done easily using the Poster option from the Print menu.

Steps (for Adobe Acrobat XI):

  1. Choose Print from File Menu or Ctrl+P
  2. Select Printer as Adobe PDF
  3. Select Poster tab.
  4. Change Overlap to 0 inch
  5. Adjust the Tile scale to your needs, 100%(99%) if the result printed PDF page size is same as the current PDF page size, 75% if the printed pdf page size is half the current pdf. Tinker with "Tile scale" percents if necessary to obtain your desired result. To check the the result printed PDF page size go right of "Adobe PDF" combo-box select Properties and change Adobe PDF Page Sizecombo-box if necessary.
  6. You can hit Print button when the page looks split like you desired, check the dotted line in the guiding preview:

enter image description here

Here is a print screen for the described settings:

enter image description here

30

Sejda.com can split scanned PDF documents in half, down the middle. Works on all desktop platforms.

Here's a short how to:

How to split scanned PDF documents in half with Sejda.com

If it's a booklet scan and the pages are not in their natural order anymore it can reorder them for you too.

I'm a developer on the project. Open source.

Edi
  • 794
  • 5
  • 9
7

There are two problem with automating splitting scanned books in a single pass:

  • Automation is not always accuracy
  • Making a scanned book comfortably read is more than just splitting pages

For everything related with scanned books, I highly recommend using ScanTailor Advanced. It has features such as:

  • Turn skewed pages vertically,
  • Select content to reduce the page size,
  • Increase/decrease margin (for notetaking, maybe),
  • Whiten the result for better reading experience.

You must export the PDF into images to use this, and recombine the output images back. The processed images may be very small in file size (up to only 6% of the origin), but excellent in quality.

From its original GitHub repo:

Scan Tailor is Free Software (which is more than just freeware). It’s written in C++ with Qt and released under the General Public License version 3. We develop both Windows and GNU/Linux versions.

Other tips

To complete the task satisfactorily, I recommend you to use PDF-Xchange Viewer for extracting images and adding OCR, i2pdf for merging the outputs. In my experience, you can set the JPG quality to the lowest and it doesn't seem much different, but you have a trade-off between the final output's size and image quality. All programs are free. The whole process takes around 1 hour in background, with occasional checks.

I also have a complete guide to process scanned books, you may want to check it out: The ultimate guide to process scanned books.


FYI: How to create hierarchical bookmarks on scanned PDF files?

Ooker
  • 2,199
1

The free (as in freedom) pdfarranger can do that.

Just select the pages you want to split, right click and select "Split pages":

enter image description here

enter image description here