How to find hidden or invisible characters when reading a PDF with PDFBox?


I am new to PDFBox and I am trying to read a PDF line by line, but the text I get back contains hidden or invisible characters. This breaks my output: the characters visible in the PDF and the characters I read are not exactly the same, because many invisible characters are added. I tried the isEmbedded() method, but it did not help, as those characters were not embedded. If there is a way to find these hidden characters and eliminate them, please let me know. (This is my first question on Stack Overflow.)
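To make the goal concrete, here is a minimal sketch of the kind of filtering I am after. It assumes the "invisible" characters are Unicode control or format code points (zero-width space, soft hyphen, byte order mark and similar) that end up in the extracted string, and that each line is already available as a String, for example from PDFTextStripper. The class and method names (InvisibleCharFilter, isInvisible, find, strip) are hypothetical and not part of PDFBox. Text that is drawn invisibly in the PDF itself (for example a hidden OCR layer) is a different case that this sketch would not catch.

```java
import java.util.ArrayList;
import java.util.List;

public class InvisibleCharFilter {

    /** Assumption: "invisible" means a code point that normally renders nothing. */
    public static boolean isInvisible(int cp) {
        // Keep ordinary whitespace controls that carry layout meaning.
        if (cp == '\n' || cp == '\r' || cp == '\t') {
            return false;
        }
        switch (Character.getType(cp)) {
            case Character.CONTROL:             // other C0/C1 controls
            case Character.FORMAT:              // e.g. U+200B, U+00AD, U+FEFF
            case Character.PRIVATE_USE:
            case Character.UNASSIGNED:
            case Character.LINE_SEPARATOR:      // U+2028
            case Character.PARAGRAPH_SEPARATOR: // U+2029
                return true;
            default:
                return false;
        }
    }

    /** Lists each invisible code point together with its index in the line. */
    public static List<String> find(String line) {
        List<String> found = new ArrayList<>();
        for (int i = 0; i < line.length(); ) {
            int cp = line.codePointAt(i);
            if (isInvisible(cp)) {
                found.add(String.format("U+%04X at index %d", cp, i));
            }
            i += Character.charCount(cp); // step over surrogate pairs correctly
        }
        return found;
    }

    /** Returns the line with all invisible code points removed. */
    public static String strip(String line) {
        StringBuilder sb = new StringBuilder(line.length());
        line.codePoints()
            .filter(cp -> !isInvisible(cp))
            .forEach(sb::appendCodePoint);
        return sb.toString();
    }

    public static void main(String[] args) {
        // Sample line standing in for text extracted from the PDF.
        String line = "\uFEFFHel\u200Blo wor\u00ADld";
        System.out.println(find(line));
        System.out.println(strip(line));
    }
}
```

Running find on each extracted line first shows which code points are actually present, so the list of categories in isInvisible can be narrowed before anything is stripped. Note that the non-breaking space (U+00A0) is a space separator and is kept by this sketch.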





