Questions tagged [xpdf]

Xpdf is a viewer for Portable Document Format (PDF) files. (These are also sometimes also called 'Acrobat' files, from the name of Adobe's PDF software.) Xpdf runs under the X Window System on UNIX, VMS, and OS/2.

Source: http://linux.die.net/man/1/xpdf

13 questions
10
votes
6 answers

Extracting background images from a PDF file?

I have a PDF file containing maps of the building I work in, here: http://www.libsys.und.edu/dev/FloorPlans_All.pdf The original source files have been lost, and I've been asked to extract the map images, preferably without the text and icons that…
4
votes
2 answers

"No such file or directory" when the file is there

I'm trying to run XPDF on a linux (probably red hat) OVH shared server. I've managed to have ftp ssh access and put the 64 bits binaries onto a folder. The problem is : even though the files are there with the right permissions, if I try running it…
4
votes
1 answer

pdftotext - Error: Illegal entry in bfchar block in ToUnicode CMap

I am running pdftotext on a bunch of pdfs, and some of them throw this error: Error: Illegal entry in bfchar block in ToUnicode CMap I took a look at the outfiles, and they seem to look ok, so I'm not sure if it's a significant error, but I am…
Tony Stark
  • 2,470
1
vote
1 answer

xpdf copy & paste and unicode

Since updating to debian jessie I have a problem with copy and paste from PDF files using xpdf: only letters/symbols contained in ASCII are copied correctly. All other symbols/letters are in some way garbeld. For example I have a PDF file containg…
1
vote
1 answer

How do I use a for loop and output to a different directory

usage: pdftotext [options] [] I'm using pdftotext (xpft, usage above) to covert all pdfs in a directory (and subdirectories) to textfiles. It doesn't matter if structure is preserved or not I just want to write the files to…
Some_Guy
  • 794
1
vote
2 answers

how to extract text from pdf with embedded subset fonts

Pdftotext of xpdf is working fine for normal embedded fonts file , but fails where embedded subsets fonts are there . Is there any workaround for this issue ?
1
vote
2 answers

How to navigate using h, j, k, l (Vi-like) in Xpdf?

man xpdf says the following: Previous versions of xpdf included a "viKeys" X resource. It is no longer available, but the following bindings are equivalent: bind h any scrollLeft(16) bind l any scrollRight(16) bind k any scrollUp(16) bind j any…
1
vote
0 answers

Text from pdftotext and alignment

I used texttopdf on linux to extract all the text from a multipage pdf, everything is fine, except that each page get a different alignment, although in the original pdf this is not the case, here is a sample from the first 3 pages: …
1
vote
0 answers

Cygwin's xpdfReader's File open dialogue box takes several dozen seconds to appear

I use XpdfReader 4.0.4 on 2 corporate-imaged Windows 11 machines at work. Approximately 2025-05-15 to 16, XpdfReader's file open dialogue box on one of them started to take several dozen seconds to appear after pressing Ctrl+O. This is working off…
user2153235
  • 1,543
1
vote
2 answers

How to install Xpdf on Centos 5.5

I'd like to install xPDF on a CentOS 5.5 server. No GUI, I really just need the console apps. Can anyone give me a brief guide?
Jason
  • 1,909
  • 7
  • 31
  • 40
1
vote
1 answer

PowerShell Unexpected Token Error

Question: I am running the following script in Windows 10 via the PowerShell ISE cd C:\Users\xyz\Desktop\pdfs $FILES= ls *.pdf foreach ($f in $FILES) { "C:\Program Files\xpdf-tools-win-4.01.01\bin32\pdftotext.exe" -enc UTF-8 "$f" } I am…
Anthony
  • 153
  • 1
  • 7
1
vote
2 answers

How to start xpdf(1) without the outline pane?

I am trying to configure xpdf(1) so that it doesn't show the outline pane by default. I've managed to set up a keybinding in my ~/.xpdfrc to toggle the visibility of the outline pane with this command: bind t any toggleOutline I wasn't able to…
0
votes
1 answer

pdftotext giving scrambled output

if anyone could help I'd appreciate I'm trying to output text via pdftotext from number of pdf files. Unfortunately my output keeps ending up like this: "* * * $ * # 2 …
looser
  • 1