Unstructured Demo

Parsing Strategies

Fast

Ideal for: documents with digital (character-based) text

Ideal document types: HTML, DOCX, PPTX, XLSX, TXT

Cost: $

Example:
Input:
<html>...</html>
Output:
{ "type": "Title", "text": "Example" }

Hi Res

Ideal for: easy PDFs

Ideal document types: PDF (easy)

Cost: $$

Example:
Input:
Simple PDF content
Output:
{ "type": "Paragraph", "text": "Simple PDF content" }

Gen AI

Ideal for: difficult PDFs & images

Ideal document types: PDF (complex or scanned), Images

Cost: $$$

Example:
Input:
Complex PDF or Image
Output:
<html><body><p>Extracted content from complex PDF or image</p></body></html>
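
The guidance above can be read as a simple decision rule. The following is a minimal sketch of that rule, not part of the Unstructured API: `choose_strategy`, its `is_complex` flag, and the list of image extensions are all hypothetical, and the returned strings are just the labels used on this page. Since a file extension cannot tell an easy PDF from a complex or scanned one, the caller is assumed to supply that judgment.

```python
from pathlib import Path

# Extensions taken from the "Ideal document types" line for Fast.
FAST_EXTENSIONS = {".html", ".docx", ".pptx", ".xlsx", ".txt"}

# Assumption: the page only says "Images"; these extensions are illustrative.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def choose_strategy(filename: str, is_complex: bool = False) -> str:
    """Return the strategy label this page suggests for a file.

    `is_complex` is a hypothetical flag the caller sets for PDFs that are
    scanned or have a difficult layout; it is ignored for other file types.
    """
    ext = Path(filename).suffix.lower()
    if ext in FAST_EXTENSIONS:
        return "Fast"
    if ext in IMAGE_EXTENSIONS:
        return "Gen AI"
    if ext == ".pdf":
        # Easy PDFs go to Hi Res; complex or scanned PDFs go to Gen AI.
        return "Gen AI" if is_complex else "Hi Res"
    raise ValueError(f"No suggested strategy for file type: {ext or filename!r}")


if __name__ == "__main__":
    for name, hard in [("notes.txt", False), ("invoice.pdf", False), ("scan.pdf", True)]:
        print(name, "->", choose_strategy(name, is_complex=hard))
```

Keeping the complexity decision as an explicit argument makes the trade-off visible: the more expensive strategy ($$$) is chosen only when the caller says the document needs it.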