How Do I Extract the ActualText From a PDF

Question:

How do I extract the real content, exactly as it appears on Acrobat’s clipboard, rather than the raw glyphs?

Answer:

To extract the real content, exactly as it appears on Acrobat’s clipboard, use ActualText and disregard e_text in between Begin ActualText … End ActualText. The attached sample uses ElementReader to loop through all elements and list the actual text.
extract_actual_text_from_pdf.py (1.6 KB)