Documentation is being updated. Some sections may not reflect the latest features.
Looking for step-by-step guides? Explore Tutorials →

OCR Node

Extract text from images and documents.

Overview

The OCR (Optical Character Recognition) node extracts text content from images, PDFs, and scanned documents. It converts visual text into machine-readable text that can be processed by other nodes.

Configuration

Field Description Required
Files Path Path or URL to the file(s) to process Yes
Output Variable Variable name to store the extracted text Yes

Supported File Types

  • Images: PNG, JPG, JPEG, GIF, BMP, TIFF
  • Documents: PDF (scanned or digital)

Using Variables

Pass file paths from previous nodes:

{{uploaded_file}}
{{scraped_image_url}}
{{document_path}}

Example Use Cases

Process Uploaded Documents

Files Path: {{user_upload.url}}
Output Variable: document_text

Extract Text from Screenshots

Files Path: {{screenshot_url}}
Output Variable: screen_text

Batch Process Images

Use with a For Loop node to process multiple files:

Files Path: {{current_image}}
Output Variable: extracted_text

Output

The extracted text is stored in your output variable:

{{document_text}}

Best Practices

  • Use high-quality images for better accuracy
  • Ensure good contrast between text and background
  • For multi-page PDFs, consider processing pages individually
  • Combine with LLM nodes to analyze extracted text
AI AssistantPowered by Ubex
Beta
Ask me anything about Ubex workflows, nodes, or the API.