PDF spec states quadrilaterals map the glyph, including ascenders and descenders. I realize quads represent:
I take it TextExtractor.Word.BBox is the most minimal level square including all 4 points above.
I am looking for the baseline. My text is typically not slanted as shown above.
The only thing I can see to do is to open an output page, render a full character set, Subtract Ascent from the top of the BBox of the sample, or add Descent (this is negative) to the bottom of the BBox of the sample. Perhaps there is an easier way in the context of line and word extraction using TextExtractor?