Found an issue with certain PDFs that use embedded custom fonts. The parser returns garbled text for some fields; it looks like the character encoding is getting confused.

The PDFs display fine in Adobe Reader and Chrome's PDF viewer. The issue seems specific to PDFs generated by SAP (our ERP system exports in a format that uses embedded fonts heavily).

Happy to share a sample document (with sensitive data redacted) if the team wants to investigate.

Affects about 10% of our purchase order PDFs.
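In case it helps with triage, here is a rough sketch of how garbled fields could be flagged in bulk. It is not tied to the parser's API: it only looks at the extracted strings, and the names `suspect_ratio` and `looks_garbled` and the 0.2 threshold are made up for illustration. It assumes the garbling shows up as replacement characters, private-use code points, or control characters, so it will miss cases where glyphs get mapped to valid but wrong letters.

```python
import unicodedata

# Unicode general categories that rarely appear in real purchase order text:
# Co = private use, Cn = unassigned, Cc = control characters.
SUSPECT_CATEGORIES = {"Co", "Cn", "Cc"}


def suspect_ratio(text: str) -> float:
    """Return the share of non-whitespace characters that look like encoding debris."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    suspect = sum(
        1
        for c in chars
        # U+FFFD (replacement character) is category "So", so check it explicitly.
        if c == "\ufffd" or unicodedata.category(c) in SUSPECT_CATEGORIES
    )
    return suspect / len(chars)


def looks_garbled(text: str, threshold: float = 0.2) -> bool:
    """Hypothetical cutoff: flag a field when at least 20% of it is suspect."""
    return suspect_ratio(text) >= threshold
```

Running `looks_garbled` over each extracted field and counting the flagged documents would give a rough check on the 10% figure, and the same pass can be repeated after a fix to confirm the count drops.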
Thanks for reporting! We identified the issue with CIDFont encoding. Fix deployed — could you re-test?
Just tested a few SAP POs and they parse correctly now. That was fast, thanks team!