OCR Node

Extract text from images and documents.

Overview

The OCR (Optical Character Recognition) node extracts text content from images, PDFs, and scanned documents. It converts visual text into machine-readable text that can be processed by other nodes.

Configuration

Field	Description	Required
`Files Path`	Path or URL to the file(s) to process	Yes
`Output Variable`	Variable name to store the extracted text	Yes

Supported File Types

Images: PNG, JPG, JPEG, GIF, BMP, TIFF
Documents: PDF (scanned or digital)

Using Variables

Pass file paths from previous nodes:

{{uploaded_file}}
{{scraped_image_url}}
{{document_path}}

Example Use Cases

Process Uploaded Documents

Files Path: {{user_upload.url}}
Output Variable: document_text

Extract Text from Screenshots

Files Path: {{screenshot_url}}
Output Variable: screen_text

Batch Process Images

Use with a For Loop node to process multiple files:

Files Path: {{current_image}}
Output Variable: extracted_text

Output

The extracted text is stored in your output variable:

{{document_text}}

Best Practices

Use high-quality images for better accuracy
Ensure good contrast between text and background
For multi-page PDFs, consider processing pages individually
Combine with LLM nodes to analyze extracted text