Welcome to Graphor

Unstructured Data into Intelligent Insights

Graphor empowers organizations to unlock the full potential of their unstructured data through state-of-the-art document parsing, intelligent data extraction, and document chat. Our platform bridges the gap between raw information and actionable knowledge, enabling you to build powerful AI applications with precision and ease.

Data Ingestion

Graphor applies state-of-the-art parsing to your documents, combining advanced OCR capabilities with intelligent text classification to transform unstructured content into structured, queryable formats.

Supported File Formats

Category	Formats
Documents	PDF, DOC, DOCX, PPT, PPTX, HTML, TXT
Tabular	CSV, TSV, XLS, XLSX
Images	PNG, JPG, JPEG, TIFF, BMP, HEIC
Video	MP4, MOV, AVI, MKV, WebM
Audio	MP3, WAV, M4A, OGG, FLAC
Markdown	MD

Multi-Source Ingestion

Beyond local file uploads, Graphor supports ingestion from multiple sources:

Web Scraping - Ingest content directly from URLs with optional crawling to capture linked pages
GitHub Repositories - Automatically process code and documentation from GitHub repos
YouTube Videos - Extract and process transcripts from YouTube videos

Data Extraction

Transform your documents into structured data with LLM-powered extraction. Define custom output schemas and provide natural language instructions to extract exactly the information you need.

Custom Schema Definition - Define output fields with types (string, number, boolean, array) and descriptions
Natural Language Instructions - Guide the extraction process with custom prompts
Page-Level Provenance - Each extracted item includes references to source pages for full traceability

Document Chat

Chat with your documents using natural language. Ask questions and get answers grounded in your ingested content.

Conversational Memory - Maintain context across multiple questions in the same conversation
Document Scoping - Focus your questions on a specific document or search across all sources
Contextual Answers - Responses are grounded in your actual document content

Core Platform Capabilities

Our platform integrates:

Advanced Data Ingestion - Process unstructured data from multiple sources with state-of-the-art OCR and classification
LLM-Powered Data Extraction - Extract structured data from documents using customizable schemas
Document Chat - Ask questions and get answers grounded in your document content
Support for Mainstream LLMs - Compatible with OpenAI, Anthropic, and open-source alternatives
Export Flexibility - Expose your systems via REST API or MCP Server integration
User-Friendly Interface - Intuitive low-code workflow for managing data, models, and deployments

Why Graphor?

Graphor stands apart with its comprehensive approach to unstructured data processing, offering:

State-of-the-art parsing - Advanced document understanding across 25+ file formats
End-to-end solution - From raw data to deployment-ready APIs
Structured extraction - Transform documents into actionable structured data
Evaluation-driven improvement - Continuous refinement through robust metrics
Flexibility and integration - Works with your existing data and AI infrastructure

Discover how Graphor can transform your approach to unstructured data and large language models.

Getting Started

Ready to harness the power of Graphor? Follow these steps to begin your journey:

Quickstart

Upload documents, extract data, and chat with your content in minutes

Guides

Dive deeper into Graphor’s core concepts and capabilities:

Data Ingestion

Import documents from multiple sources with state-of-the-art parsing and OCR

Data Extraction

Extract structured data using custom schemas and natural language instructions

Document Chat

Ask questions and get answers grounded in your document content

Get Started

Developers

Data Options

Unstructured Data into Intelligent Insights

Data Ingestion

Supported File Formats

Multi-Source Ingestion

Data Extraction

Document Chat

Core Platform Capabilities

Why Graphor?

Getting Started

Quickstart

Guides

Data Ingestion

Data Extraction

Document Chat

​Unstructured Data into Intelligent Insights

​Data Ingestion

​Supported File Formats

​Multi-Source Ingestion

​Data Extraction

​Document Chat

​Core Platform Capabilities

​Why Graphor?

​Getting Started

Quickstart

​Guides

Data Ingestion

Data Extraction

Document Chat

Unstructured Data into Intelligent Insights

Data Ingestion

Supported File Formats

Multi-Source Ingestion

Data Extraction

Document Chat

Core Platform Capabilities

Why Graphor?

Getting Started

Guides