Text comparison is not as expected

WebViewer Version:10.9.0

Do you have an issue with a specific file(s)? : No
Can you reproduce using one of our samples or online demos? : Yes
Are you using the WebViewer server? : No
Does the issue only happen on certain browsers? : No
Is your issue related to a front-end framework? : No
Is your issue related to annotations? : No

Please give a brief summary of your issue:
When comparing the PDF files, unexpected comparison results are displayed.

Please describe your issue and provide steps to reproduce it:
We implemented based on the following document:

When comparing a specific PDF files, unexpected comparison results are displayed.

  1. Changes between blocks of text on both page 1.
  2. Changes between the block of text on the old page 1 and a portion of the text on the new page 2.
  3. Changes between a portion of the text on the old page 2 and the block of text on the new page 1.
  4. Changes in a portion of the text on both page 2.

While 1 and 4 are as expected, 2 and 3 are supposed to be unnecessary as comparison targets.

・What could be the reason for mistakenly including them as a comparison target?
・Are there any precautions to avoid this situation when creating PDFs?

Please provide a link to a minimal sample where the issue is reproducible:

比較用(元原稿).pdf (243.1 KB)
比較用(変更後原稿).pdf (276.7 KB)

Hi Atsuko,

I am not able to see any issues using this text compare sample: JS Semantic PDF Files Comparison Demo | Apryse WebViewer

This is using the latest version, 10.11. Could you clarify any issues with the image below?

Best Regards,
Darian

Hi, Darian

To explain this issue, I have added red borders and red text to the results in the showcase.
By comparing the two files, four changes are presented.

1 is the comparison between A and B, which is as expected.
4 is the comparison between C and D, also as expected.
I do not understand why 2 represents the comparison between A and D, and 3 represents the comparison between B and C as changes.
I would like to be informed about the circumstances in which results 2 and 3 are displayed as differences, as I do not understand the reason for their display in this file.

Best Regards,
Atsuko

1 Like

Hello Atsuko,

Thank you for the clarification. This seems to be related to the metadata in the files. I have submitted a report to the product team about this issue. Thank you for reporting this.

2 Likes

Hello Atsuko,

The product team has informed me that unfortunately we will not be working on this issue at this time as we consider it a minor issue in text comparison.

Best Regards,
Darian

1 Like