Print to PDF not allowing selectable text on some sites
Moderator: trava90
Forum rules
This board is for technical/general usage questions and troubleshooting for the Pale Moon browser only.
Technical issues and questions not related to the Pale Moon browser should be posted in other boards!
Please keep off-topic and general discussion out of this board, thank you!
Print to PDF not allowing selectable text on some sites
PM 29.0.1 x64, Win10 20H2.
If I try to print to PDF using PDFfactory or MS Print to PDF, the resulting PDF has vector outlines of characters that zoom well, but no selectable text.
This is reproducible in a clean install of PM on a clean Windows machine on different hardware.
Some discussion of this is here-
https://community.adobe.com/t5/acrobat/ ... p/10998138
and this page also works as a testing page.
FF, Chrome, and Chromium Edge all have the same problem, but get around it by having a built-in "Save to PDF" feature in the browser's print window.
Here was a similar thread describing this behavior-
viewtopic.php?f=3&t=18303&p=134776&hili ... ts#p134762
The workaround was to disable custom web fonts on the page before printing, but this is not always desirable and also requires quite a few clicks.
I have tried using uBlock Origin's remote font blocker, but that doesn't solve it.
My main use for Print to PDF is for web receipts. So I need to be able to copy text out of them frequently or search for keywords within a folder full of them.
Any ideas on how to improve this print functionality?
Re: Print to PDF not allowing selectable text on some sites
Solution: use a different PDF printer.
e.g. if I print this page to PDF using my PDF printer (PrimoPDF), I get PDFs with selectable text.
- Attachments
- Print to PDF not allowing selectable text on some sites - Pale Moon forum.pdf
- (150.24 KiB) Downloaded 28 times
"Sometimes, the best way to get what you want is to be a good person." -- Louis Rossmann
"Seek wisdom, not knowledge. Knowledge is of the past; wisdom is of the future." -- Native American proverb
"Linux makes everything difficult." -- Lyceus Anubite
Re: Print to PDF not allowing selectable text on some sites
No problem with Primo.
- Attachments
- Solved_ Acrobat DC_ Unable to select text when print from ...pdf
- (263.5 KiB) Downloaded 21 times
"Sometimes, the best way to get what you want is to be a good person." -- Louis Rossmann
"Seek wisdom, not knowledge. Knowledge is of the past; wisdom is of the future." -- Native American proverb
"Linux makes everything difficult." -- Lyceus Anubite
Re: Print to PDF not allowing selectable text on some sites
I can't reproduce your results on a clean test VM. I installed latest PrimoPDF and tried setting it to both PDF 1.3 (default) and PDF 1.5. The resulting PDF only has Times New Roman embedded and only the header and footer selectable as text. Your PDF has numerous fonts embedded if I look at its Properties.
I tried CutePDF Writer and BullZip PDF Printer as well. Same thing... just vector outlines, but no embedded fonts.
I also have the problem when printing to Acrobat XI's virtual printer on my main machine.
Any reason why PM wouldn't pass the fonts to the printer?
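For anyone comparing results, the PDF Properties check mentioned above (which fonts are embedded) can be roughly approximated with a script. This is only a heuristic sketch in Python, not how any PDF viewer does it: it scans the raw file, tries to inflate Flate-compressed streams, and looks for /BaseFont names and /FontFile, /FontFile2 or /FontFile3 entries. The function name list_fonts is made up for illustration, and encrypted files or streams using other filters will be missed.

```python
import re
import zlib


def list_fonts(pdf_bytes: bytes) -> dict:
    """Heuristically report font names and whether any font program is embedded."""
    chunks = [pdf_bytes]
    # Try to inflate every stream so names inside compressed object streams show up.
    for match in re.finditer(rb"stream\r?\n(.*?)endstream", pdf_bytes, re.DOTALL):
        try:
            # decompressobj tolerates the trailing newline before "endstream".
            chunks.append(zlib.decompressobj().decompress(match.group(1)))
        except zlib.error:
            pass  # not Flate data, or a filter this sketch does not handle
    data = b"\n".join(chunks)
    names = sorted(
        {m.decode("latin-1") for m in re.findall(rb"/BaseFont\s*/([^\s/<>\[\]()]+)", data)}
    )
    # A font descriptor with a /FontFile* entry points at an embedded font program.
    embedded = re.search(rb"/FontFile[23]?\b", data) is not None
    return {"fonts": names, "embedded": embedded}


if __name__ == "__main__":
    import sys

    with open(sys.argv[1], "rb") as handle:
        print(list_fonts(handle.read()))
```

A result with no font names and embedded set to False would be consistent with the outlines-only PDFs described in this thread, though it does not prove it.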
Re: Print to PDF not allowing selectable text on some sites
I have no idea why it doesn't work for you.
I'm using a pretty old version of PrimoPDF, and it defaults to PDF 1.4 -- there don't seem to be any settings to control this font embedding behaviour. Of note: when I print, I tend to pick the "eBook" format to retain as much information as possible while keeping the size small. I'm using the standard Print function from within the browser.
Clearly this isn't something the browser controls because then it would be the same result in my case.
"Sometimes, the best way to get what you want is to be a good person." -- Louis Rossmann
"Seek wisdom, not knowledge. Knowledge is of the past; wisdom is of the future." -- Native American proverb
"Linux makes everything difficult." -- Lyceus Anubite
- billmcct
- Keeps coming back
- Posts: 959
- Joined: 2012-09-04, 15:19
- Location: Costa Rica & Union City Georgia USA
Re: Print to PDF not allowing selectable text on some sites
Here's one from "Print pages to PDF 0.5.0.2.1" that does have selectable text and clickable links. It used to save scrollable code boxes but alas no longer does.
--------------------------------------------------------------------------------------------------------------
The difference between the Impossible and the Possible lies in a man's Determination.
Tommy Lasorda
Re: Print to PDF not allowing selectable text on some sites
Can you also try a test using the Adobe forum link further up in this thread?
I don't think that extension is viable in Win10-64?
I've noticed if I disable gfx.downloadable_fonts.enabled, I can make a PDF with embedded fonts.
Also, I did a test in IE and that embeds the fonts.
PM v28 64-bit and 32-bit both have the problem under Win 10 20H2 64-bit. As does FF v52 ESR 32-bit.
Any ideas?
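If someone wants to repeat the gfx.downloadable_fonts.enabled test without clicking through about:config each time, one option is to put the preference into a user.js file in a test profile. The sketch below is Python and rests on assumptions: that Pale Moon reads user.js from the profile folder at startup like other Mozilla-derived browsers, and that you pass in your own profile path. The function name is hypothetical. Note that this turns web fonts off for every page, which is the trade-off already mentioned at the top of the thread.

```python
from pathlib import Path

# The preference name comes from this thread; the user_pref() line is the
# usual Mozilla-style syntax for user.js.
PREF_LINE = 'user_pref("gfx.downloadable_fonts.enabled", false);'


def disable_downloadable_fonts(profile_dir: str) -> Path:
    """Append the pref to user.js in the given profile folder, at most once."""
    user_js = Path(profile_dir) / "user.js"
    existing = user_js.read_text(encoding="utf-8") if user_js.exists() else ""
    if PREF_LINE not in existing:
        # Keep existing lines intact and make sure our line starts on a new line.
        separator = "" if existing == "" or existing.endswith("\n") else "\n"
        user_js.write_text(existing + separator + PREF_LINE + "\n", encoding="utf-8")
    return user_js
```

Run it against a throwaway profile with the browser closed, then restart the browser and print again to compare the PDFs.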
Re: Print to PDF not allowing selectable text on some sites
Confirmed here, using a different browser and different PDF printer. It's gotta be Adobe purposely preventing their pages from being saved to a useful PDF format.
Re: Print to PDF not allowing selectable text on some sites
It isn't specific to the Adobe website. I can reproduce it on any page with custom fonts-
https://fonts.google.com/
Re: Print to PDF not allowing selectable text on some sites
For the record I have also tested on Windows 10 20H2 x64, Pale Moon 29.0.1 x64. The result is what I attached previously. Yes, I do enable downloadable fonts by default because they are part of webdesign. It is not an inherent problem with the OS or the browser, as far as I can tell.
Please include troubleshooting information so we know your environment this occurs in for you. See sticky in this board.
"Sometimes, the best way to get what you want is to be a good person." -- Louis Rossmann
"Seek wisdom, not knowledge. Knowledge is of the past; wisdom is of the future." -- Native American proverb
"Linux makes everything difficult." -- Lyceus Anubite
Re: Print to PDF not allowing selectable text on some sites
I fired up a Win7-32 VM and installed PM v28 and v29 32-bit. Both versions embedded the fonts correctly. (Specifically, on the Google Fonts web page I used for testing, according to PDF Document Properties.)
So maybe this is something with how browsers are sending web fonts for printing in Win x64? It isn't sending fonts, but outlines only.
And as I mentioned before, Chrome, Chromium Edge, and FF are getting around this by including a "Save to PDF" option in their custom print dialog. I'm not sure where to look for any relevant discussion on how these browsers are (not) handling this. Maybe that would yield some clues if something was changed in Win x64 printing.
Re: Print to PDF not allowing selectable text on some sites
Here, Windows 10 Pro 64-bit 2004 built-in Microsoft Print to PDF worked on the Adobe page. The text is selectable. Using the Ghostscript PDF printer: not selectable.
Update: The Ghostscript PDF printer works. I did not have it at its defaults earlier.
PrimoPDF: I found an old manual. According to it, their Print preset does not embed fonts. Unclear if that is still true today. They have other settings that embed (but the resulting file size could be larger than what you want).
Update: I thought I had closed out my browser after changing the preference, and had reloaded the page, but clearly not. In a clean profile, neither worked. My workaround has always been to OCR the PDF.
Re: Print to PDF not allowing selectable text on some sites
I did the following:
Brand new profile.
Went to the Adobe site.
After page loads, this is under Console - CSS:
I searched on this: "downloadable font: hhea: bad caret offset: -59".
Found this:
Firefox throws error "downloadable font: hhea: bad caret offset:" [...] in console
"So in each case, OTS issues a warning message to advise that the font resource is in violation of the OpenType spec requirements (although in these cases it is still usable)."
https://bugzilla.mozilla.org/show_bug.cgi?id=1603367
I deleted this:
Code: Select all
<link rel="stylesheet" href="https://use.typekit.net/csa7wef.css">
I printed the page using the MS Print to PDF and Ghostscript PDF. Selectable text in both.
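The step of deleting the Typekit stylesheet link can also be tried on a saved copy of a page. The following is a minimal Python sketch of that idea, not what was done in the test above (which presumably edited the live page): it removes any stylesheet link tag whose href points at a given host. The function name drop_stylesheet_links is made up, and the regex-based matching is a simplification that will not cope with every way HTML can be written.

```python
import re
from urllib.parse import urlparse


def drop_stylesheet_links(html: str, host: str) -> str:
    """Remove <link rel="stylesheet"> tags whose href points at the given host."""
    link_tag = re.compile(r"<link\b[^>]*>", re.IGNORECASE)

    def keep_or_drop(match: re.Match) -> str:
        tag = match.group(0)
        is_stylesheet = re.search(r"rel\s*=\s*[\"']?stylesheet", tag, re.IGNORECASE)
        href = re.search(r"href\s*=\s*[\"']([^\"']*)[\"']", tag, re.IGNORECASE)
        # Relative hrefs have no hostname, so they are always kept.
        if is_stylesheet and href and urlparse(href.group(1)).hostname == host:
            return ""
        return tag

    return link_tag.sub(keep_or_drop, html)
```

For example, calling drop_stylesheet_links(saved_html, "use.typekit.net") on a saved copy of the page and printing the result would show whether the web font stylesheet is what turns the text into outlines.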
Re: Print to PDF not allowing selectable text on some sites
So, in other words, your PDF printers rejected a bad font and converted the text to shapes, and mine did not.
I still don't see how that is our problem. I think typekit.net needs to fix their fonts and not rely on browsers to sanitize them for them.
"Sometimes, the best way to get what you want is to be a good person." -- Louis Rossmann
"Seek wisdom, not knowledge. Knowledge is of the past; wisdom is of the future." -- Native American proverb
"Linux makes everything difficult." -- Lyceus Anubite
Re: Print to PDF not allowing selectable text on some sites
I don't think it is just Adobe. Can you reproduce the problem at this Google page? (Test print just pages 2-3.)
https://fonts.google.com/
Re: Print to PDF not allowing selectable text on some sites
Just having brief snippets of text won't have the fonts embedded because it would be wasteful to do so. I don't think you can blame PDF printers for making a best effort to keep file sizes small and be efficient in that case. Keep in mind that PDF is primarily a visual output format. I've had to explain that to people many times in my translation profession too: "Please translate this PDF" isn't that simple, but most people don't seem to understand why. It's closer to a PostScript format than a source document.
"Sometimes, the best way to get what you want is to be a good person." -- Louis Rossmann
"Seek wisdom, not knowledge. Knowledge is of the past; wisdom is of the future." -- Native American proverb
"Linux makes everything difficult." -- Lyceus Anubite
Re: Print to PDF not allowing selectable text on some sites
I am not the person to argue for what a PDF is or isn't supposed to be... I am just trying to save receipts from the internet with selectable/searchable text.
If this can't be fixed, can you implement a "Save to PDF" feature as FF and Chrome do to work around this issue?
Most PDF writers will embed a subset of a font if less than 100% of the character set is used. This is how they save space. But that's not the case here. The PDF printer doesn't seem to be receiving the fonts from the browser, yes?
Re: Print to PDF not allowing selectable text on some sites
Keep in mind, this is working correctly in Win7-32 according to my testing. So something has changed in Win10-64.
I can offer a bounty to get this fixed, if that helps.