Text in PDF recognized as gibberish in any PDFium viewer due to invalid bfrange definitions in ToUnicodeMap #1498

orzFly · 2024-02-24T14:03:12Z

Bug Report

Description of the problem

Lines 269 to 271 in 485b7e6

    
           1 beginbfrange 
        
           <0000> <${toHex(entries.length - 1)}> [${entries.join(' ')}] 
        
           endbfrange

Currently, our code generates all ToUnicodeMap entries on a single line. This yields invalid text mapping on any PDFium base viewers (and maybe others).

https://source.chromium.org/chromium/_/pdfium/pdfium.git/+/master:core/fpdfapi/font/cpdf_tounicodemap.cpp;l=171-172;drc=61bda438f9071586c92f8f626c29021524a8d0b0

    uint32_t lowcode = lowcode_opt.value();
    uint32_t highcode = (lowcode & 0xffffff00) | (highcode_opt.value() & 0xff);

Related Chromium bug: https://bugs.chromium.org/p/pdfium/issues/detail?id=1339#c1

The PDF spec doesn't give too much detail about beginbfrange. I looked around and found the doc below. Based on section 1.4.1 in that doc, the <19ff><1a00><63cf> beginbfrange entry is illegal. The first byte values should be the same for the two source range values in the entry.
https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/5411.ToUnicode.pdf

The link is moved or removed at this moment. I found another copy at http://www.audentia-gestion.fr/ADOBE/5411.ToUnicode.pdf

Screenshots

Google Chrome 122.0.6261.69 Linux x86_64
Chromium 122.0.6261.69 (Official Build) Arch Linux (64-bit)
WPS Office for Linux 11.1.0.11698
Firefox (pdf.js) - CORRECT
Adobe Acrobat Reader 2023.008.20533 64-bit on Windows 11 - CORRECT

Code sample

https://replit.com/@orzFly/pdfkit-tounicode?v=1
test.pdf

I used 258 glyphs in the document, so only the first two (258 % 256 = 2) glyphs is correct - yields "AB" correctly. All the rest are incorrect.

Your environment

pdfkit version: 0.12.3, or master
Node version: 12.22.9
Browser version:
- Google Chrome 122.0.6261.69 Linux x86_64
- WPS Office for Linux 11.1.0.11698
- Chromium 122.0.6261.69 (Official Build) Arch Linux (64-bit)
Operating System: Linux x86_64

orzFly · 2024-02-24T14:05:41Z

I have a possible fix - will send a pull request later. However, I am not sure how to add unit test about this particular behavior.

This resolves foliojs#1498.

orzFly added a commit to orz-forks/pdfkit that referenced this issue Feb 24, 2024

Generate ToUnicodeMap bfrange in multiple ranges (foliojs#1498)

f4dd1a8

orzFly mentioned this issue Feb 24, 2024

Generate ToUnicodeMap bfrange in multiple ranges (#1498) #1499

Merged

4 tasks

orzFly added a commit to orz-forks/pdfkit that referenced this issue Feb 26, 2024

Generate ToUnicodeMap bfrange in multiple ranges (foliojs#1498)

ba09658

This resolves foliojs#1498.

blikblum closed this as completed in 946f9cf Feb 26, 2024

FaroukAmr mentioned this issue May 25, 2024

[Snyk] Upgrade pdfkit from 0.13.0 to 0.15.0 FaroukAmr/Web-Application#1014

Open

PRAVEENKUMAR6130 mentioned this issue May 26, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 PRAVEENKUMAR6130/juice-shop#6

Open

ezelf86 mentioned this issue May 26, 2024

[Snyk] Upgrade pdfkit from 0.12.3 to 0.15.0 ezelf86/dvws-node#5

Open

denk247 mentioned this issue May 31, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 denk247/juice-shop#4

Open

yangpulse-snyk mentioned this issue Jun 2, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 yangpulse-snyk/juice-shop#4

Open

ZolotarenkoDan mentioned this issue Jun 7, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 ZolotarenkoDan/juice-shop#4

Open

kostatuno mentioned this issue Jun 8, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 kostatuno/juice-shop#4

Closed

cicdpipeline1 mentioned this issue Jun 9, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 cicdpipeline1/juice-shop-master#5

Open

valentina-creator mentioned this issue Jun 9, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 valentina-creator/juice-shop#5

Open

arunkp mentioned this issue Jun 15, 2024

[Snyk] Upgrade pdfkit from 0.12.3 to 0.15.0 arunkp/juice-shop#4

Open

ShishiraKashyap mentioned this issue Jun 15, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 ShishiraKashyap/juice-shop#4

Open

bkiranraj mentioned this issue Jun 15, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 bkiranraj/juice-shop#4

Open

Suas1901 mentioned this issue Jun 15, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 Suas1901/juice-shop-lnw#4

Open

AmoghGowda mentioned this issue Jun 16, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 AmoghGowda/juice-shop-amogh#4

Open

KrissWorks mentioned this issue Jun 16, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 KrissWorks/juice-shoptesting#4

Open

vinaychandranac mentioned this issue Jun 16, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 vinaychandranac/juice-shop#4

Open

grimmgit mentioned this issue Jun 21, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 grimmgit/SAST-Snyk-juice-shop#4

Open

NadavSha1 mentioned this issue Jun 21, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 NadavSha1/juice-shop#4

Open

frath76 mentioned this issue Jun 25, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 frath76/juice-shop#4

Open

ferreiralam mentioned this issue Jun 26, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 ferreiralam/juice-shop#4

Open

Ciriusz1 mentioned this issue Jun 26, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 Ciriusz1/juice-shop#4

Open

limaleonardo mentioned this issue Jun 28, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 limaleonardo/juice-shop#4

Open

Peteti-Nagendra mentioned this issue Jun 28, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 Peteti-Nagendra/juice-shop#4

Open

Matheus156 mentioned this issue Jun 28, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 Matheus156/juice-shop#4

Open

srithreepo mentioned this issue Aug 24, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 srithreepo/juice-shop-scan#4

Open

benjma mentioned this issue Aug 24, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 benjma/juice-shop#9

Open

PowrSlave mentioned this issue Aug 24, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 PowrSlave/total-recall#4

Open

felipemaragao mentioned this issue Aug 25, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 felipemaragao/juiceshop#4

Open

itsarraj mentioned this issue Aug 26, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 itsarraj/juice-shop#4

Open

boosef-snyk mentioned this issue Aug 28, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 boosef-snyk/juice-shop#6

Open

kiranvodnala mentioned this issue Aug 28, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 kiranvodnala/DevSec-juice-shop#4

Open

elanonc mentioned this issue Aug 28, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 elanonc/juice-shop#4

Open

cybersec111 mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 cybersec111/juice-shop#5

Open

ukmaker mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 ukmaker/juice-shop#4

Open

aleksander-acc mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 aleksander-acc/juice-shop#5

Open

naweedjaulim mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 naweedjaulim/juice-shop#5

Open

chuanl-accenture mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 chuanl-accenture/juice-shop#5

Open

mateuszsalata-acc mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 mateuszsalata-acc/juice-shop#6

Open

czhu314 mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 czhu314/juice-shop#8

Open

prestonbateman mentioned this issue Aug 30, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 prestonbateman/juice-shop#4

Open

SindhuThammineni mentioned this issue Aug 31, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 SindhuThammineni/juice-shop#4

Open

anvitaakr mentioned this issue Aug 31, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 anvitaakr/juice-shop#4

Open

IT-MASTERMINDS mentioned this issue Sep 4, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 IT-MASTERMINDS/juice-shop#4

Open

M6xbom1 mentioned this issue Sep 6, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 M6xbom1/juice-shop#4

Open

aravind-optiv mentioned this issue Sep 7, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 aravind-optiv/juice-shop#5

Open

Haliq0841 mentioned this issue Sep 8, 2024

[Snyk] Upgrade: jimp, pdfkit Haliq0841/nAo#6

Open

FaroukAmr mentioned this issue Sep 8, 2024

[Snyk] Upgrade: , dotenv, express, express-rate-limit, express-session, helmet, moment, mongoose, nodemailer, pdfkit, ws FaroukAmr/Web-Application#1141

Open

snykfrog mentioned this issue Sep 9, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 snykfrog/juice-shop-gh-fix#10

Open

kapilrk-04 mentioned this issue Sep 9, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 kapilrk-04/juice-shop#4

Open

boduk000 mentioned this issue Sep 12, 2024

[Snyk] Upgrade pdfkit from 0.11.0 to 0.15.0 boduk000/juice-shop#26

Open

caishenwong mentioned this issue Sep 13, 2024

[Snyk] Upgrade: express-robots-txt, feature-policy, file-stream-rotator, i18n, jsonwebtoken, node-pre-gyp, pdfkit, sanitize-html, unzipper caishenwong/juice-shop#3

Open

FaroukAmr mentioned this issue Sep 14, 2024

[Snyk] Upgrade: , dotenv, express, express-rate-limit, express-session, helmet, moment, mongoose, nodemailer, pdfkit, ws FaroukAmr/Web-Application#1146

Open

IT21221064 mentioned this issue Sep 14, 2024

[Snyk] Upgrade: concurrently, config, express-jwt, fs, mongoose, nodemon, pdfkit, validator IT21221064/SSD-Vulnerabilities#4

Open

FaroukAmr mentioned this issue Sep 19, 2024

[Snyk] Upgrade: , dotenv, express, express-rate-limit, express-session, helmet, moment, mongoose, nodemailer, pdfkit, ws FaroukAmr/Web-Application#1149

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text in PDF recognized as gibberish in any PDFium viewer due to invalid bfrange definitions in ToUnicodeMap #1498

Text in PDF recognized as gibberish in any PDFium viewer due to invalid bfrange definitions in ToUnicodeMap #1498

orzFly commented Feb 24, 2024 •

edited

Loading

orzFly commented Feb 24, 2024

Text in PDF recognized as gibberish in any PDFium viewer due to invalid bfrange definitions in ToUnicodeMap #1498

Text in PDF recognized as gibberish in any PDFium viewer due to invalid bfrange definitions in ToUnicodeMap #1498

Comments

orzFly commented Feb 24, 2024 • edited Loading

Bug Report

Description of the problem

Screenshots

Code sample

Your environment

orzFly commented Feb 24, 2024

orzFly commented Feb 24, 2024 •

edited

Loading