|
Post by alcyone on Sept 22, 2012 15:05:55 GMT -6
It looks like anything that is bold or a header in the Delving Deeper PDFs is not in the plain text of the PDF document, and thus is not searchable. I've also extracted the text using pdftotext and I get the same problem; only the normal text in the document exists in the searchable part of the document, not the bold headings.
|
|
|
Post by Otto Harkaman on Sept 22, 2012 16:05:40 GMT -6
I am guessing the heading is a graphic? If you OCR the pdf it should be searchable and exportable, but if the pdf is locked you can't do this.
|
|
|
Post by talysman on Sept 22, 2012 16:41:25 GMT -6
Hmm, yeah, I just looked at the Monster/Treasure PDF, and the text describing each monster can be selected and copied, but not the name of each monster. Although the headers "Monsters" and "Explanation of Monsters" do seem to be copyable, so this doesn't seem to be an issue with headers being images. A very weird problem.
|
|
|
Post by waysoftheearth on Sept 22, 2012 17:47:29 GMT -6
I've noticed this problem too.
The bold headings are part of the main text, just with a bold style applied.
It is strange that they are treated differently to the rest of the text by the PDF reader, but there it is.
I will try to fix this problem in the production of the "free to public" PDFs (more on those shortly).
|
|
|
Post by talysman on Oct 2, 2012 19:48:04 GMT -6
This may be an issue with the PDF creation app, because I found another odd behavior: on my Android tablet (Nook Tablet,) I can't select just a single word or sentence on most pages. Tapping and holding on a word causes the entire page or most of the page to be selected. In contrast, the Swords & Wizardry Whitebox PDF doesn't behave this way on the Nook.
There's two differences between the PDFs: application and library used to create the PDFs, and Acrobat version (1.4 for Delving Deeper, 1.3 for Whitebox.) I'll look around to see if I have some other 1.4 or 1.5 PDFs to test if it's just a problem with the Nook's PDF reader. Another possibility is that it's a side effect of the unselectable headings issue.
I did notice that there are a total of four font types in the PDF (Time New Roman, Futura, Arial, and Anonymous. One of the Futura fonts embedded isn't a TrueType font. I don't know if that has something to do with the problem.
|
|
|
Post by waysoftheearth on Oct 2, 2012 20:51:01 GMT -6
I've been working on redoing the layout of Vol 3 in my (cough) "spare" time.
I've redone almost all of the monster part now, so I will give some attention to how the PDF gets generated by the layout software and see if I can figure it out. I am not a PDF guru by any measure, but I should be able to spot any "obvious" issues.
I'll give an update shortly...
|
|
|
Post by waysoftheearth on Oct 11, 2012 16:00:52 GMT -6
For those that may be following this thread...
I've now completely redone the layout of Vol 3 with everything in alphabetical order, and have started looking into why the headings are not selectable in the PDFs.
My initial impression is that it is all to do with the font license. Apparently, there is no bold style for the font we have used, so it becomes "read only" in the PDF. One possible solution seems to be to use a different font (which does have a bold style) for the headings.
I'm hopeful that I will be able to spend some time on it this weekend.
|
|
|
Post by waysoftheearth on Oct 12, 2012 16:37:38 GMT -6
Further to this, the "bold" issue is now fixed in three volumes of the Reference Rules PDFs.
As there was no bold variation of the font used for the main body text, PagePlus was using a "synthetic" font for the bold style (not sure exactly what that actually is) and it was treated differently by PDF readers.
I've replaced all bold text and headings with a heavier weight font from the same font family and it works perfectly well.
|
|