Can SchemeFlow read my PDF?
How SchemeFlow reads different types of PDF content.
SchemeFlow can read most PDF files, but the quality of extraction depends on how the content is stored inside the PDF.
Text-based PDFs
When content is embedded as selectable text (you can click and highlight the text when viewing the PDF), SchemeFlow will extract it with high accuracy and reliability. This is the ideal input for any SchemeFlow agent.
Image-based PDFs
When content is not embedded as text (for example, scanned pages, handwritten notes, or images of text), SchemeFlow will still attempt to extract it using optical character recognition (OCR). However, this process is less reliable and may introduce errors or miss information.
Our recommendation
Use high-quality PDFs with embedded text wherever possible. The better the source file, the better your SchemeFlow output will be. With lower-quality PDFs, SchemeFlow will do its best, but you should review the results carefully before using them in your report.
Not sure what type of PDF you have?
Open the file and try selecting some text. If you can highlight individual words, it is text-based. If nothing can be selected, it is likely image-based.
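If you have many files to check, the same test can be approximated with a short script. The sketch below is not part of SchemeFlow and does not reflect how SchemeFlow reads PDFs internally. It assumes the third-party pypdf library is installed, and the function names and the 20-character threshold are illustrative choices only. It labels a file as text-based when every page yields extractable text, image-based when none do, and mixed otherwise.

```python
from typing import List


def classify_pages(page_texts: List[str], min_chars: int = 20) -> str:
    """Classify a PDF from the text extracted from each of its pages.

    min_chars is an arbitrary, illustrative threshold: a page counts as
    having embedded text if it yields at least this many characters.
    """
    if not page_texts:
        return "unknown"
    pages_with_text = sum(
        1 for text in page_texts if len((text or "").strip()) >= min_chars
    )
    if pages_with_text == len(page_texts):
        return "text-based"
    if pages_with_text == 0:
        return "image-based"
    return "mixed"


def extract_page_texts(path: str) -> List[str]:
    """Read the embedded text of each page (assumes pypdf is installed)."""
    from pypdf import PdfReader  # third-party; imported only when needed

    reader = PdfReader(path)
    return [page.extract_text() or "" for page in reader.pages]


if __name__ == "__main__":
    import sys

    # Usage: python check_pdf.py report.pdf
    if len(sys.argv) > 1:
        print(classify_pages(extract_page_texts(sys.argv[1])))
```

Treat the result as a rough guide. A scanned PDF that already carries a hidden OCR text layer will be reported as text-based, just as it would pass the manual selection test, even though that text may contain recognition errors.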
