Product: PDF SDK
Product Version: Python 3.10
Please give a brief summary of your issue:
The bounding box returned by Element.GetBBox() doesn’t match the one returned by Word.GetBBox() in the text extraction output
Please describe your issue and provide steps to reproduce it:
I am trying to match the text extraction output to the text elements. This works well for text that is not within a form object. However, matching by BBox intersection doesn’t work for elements that are within a form object.
I’m reading from the form as follows:
reader.Begin(obj, page.GetResourceDict())
# iterate through elements, grab text elements, look at element.GetBBox()
reader.End()
The bounding boxes aren’t far off from one another, so I wonder if there’s a missing transform somewhere?
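To make the suspicion concrete, here is a minimal sketch in plain Python (no SDK calls) of the check I have in mind. It assumes, without confirmation, that the boxes read from inside the form are in the form's own coordinate space and need the form's matrix applied before they can be compared with the page-space boxes from text extraction. The `Matrix` type, the helper names, and all of the numbers are hypothetical and only illustrate the idea.

```python
from typing import NamedTuple, Tuple

BBox = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


class Matrix(NamedTuple):
    """PDF-style affine matrix [a b c d e f] (hypothetical helper, not an SDK type)."""
    a: float
    b: float
    c: float
    d: float
    e: float
    f: float


def apply(m: Matrix, x: float, y: float) -> Tuple[float, float]:
    # PDF convention: [x' y' 1] = [x y 1] * M
    return (m.a * x + m.c * y + m.e, m.b * x + m.d * y + m.f)


def transform_bbox(m: Matrix, bbox: BBox) -> BBox:
    # Transform all four corners, then take the axis-aligned hull,
    # so the result stays correct under rotation or skew.
    x1, y1, x2, y2 = bbox
    corners = [apply(m, x, y) for x in (x1, x2) for y in (y1, y2)]
    xs = [p[0] for p in corners]
    ys = [p[1] for p in corners]
    return (min(xs), min(ys), max(xs), max(ys))


def intersects(a: BBox, b: BBox) -> bool:
    # Strict overlap test on both axes.
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


# Made-up values: a form matrix that only translates by (3, 15).
form_matrix = Matrix(1, 0, 0, 1, 3, 15)
element_bbox = (100, 700, 150, 712)  # assumed to be in form space
word_bbox = (103, 715, 153, 727)     # assumed to be in page space

print(intersects(element_bbox, word_bbox))
print(intersects(transform_bbox(form_matrix, element_bbox), word_bbox))
```

With these made-up numbers the raw boxes are close but do not intersect, while the transformed element box does, which is the kind of small offset I am seeing.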