Skip to content

Can SchemeFlow read my PDF?

How SchemeFlow reads different types of PDF content.

SchemeFlow can read most PDF files, but the quality of extraction depends on how the content is stored inside the PDF.

Text-based PDFs

When content is embedded as selectable text (you can click and highlight the text when viewing the PDF), SchemeFlow will extract it with high accuracy and reliability. This is the ideal input for any SchemeFlow agent.

Image-based PDFs

When content is not embedded as text, for example a scanned page, handwritten notes, or images of text, SchemeFlow will still attempt to extract the content using optical character recognition (OCR). However, this process is less reliable and may produce errors or miss information.

Our recommendation

Use high-quality PDFs with embedded text wherever possible. The better the source file, the better your SchemeFlow output will be. With lower-quality PDFs, SchemeFlow will do its best, but you should review the results carefully before using them in your report.

Not sure what type of PDF you have?

Open the file and try selecting some text. If you can highlight individual words, it is text-based. If nothing selects, it is likely image-based.