Docs-Insights Subnet
Docs-Insights is a decentralized system built for advanced document
processing tasks. It combines multiple AI models, including vision models,
language models, vision-language models (VLMs), and OCR engines, to accurately
understand and extract information from documents. This subnet aims to offer a
powerful, open-source alternative to proprietary tools, making document
comprehension more accessible and efficient. By delivering key insights with a
single click, it significantly reduces the time and effort required for
document review.
Key Capabilities:
- Checkbox and Associated Text Detection: Currently live and operational on
  SN-84, outperforming industry standards like GPT-4 Vision and Azure Form
  Recognizer.
- Highlighted and Encircled Text Detection: Detects and extracts highlighted
  or circled text segments accurately (under development).
- Document Classification: Automatically classifies documents by type (e.g.,
  receipts, forms, letters). This feature is live on SN-84 and powered by the
  Donut model, a cutting-edge, OCR-free architecture.
- Document Parsing: Leverages powerful LLMs to extract key entities like
  names, addresses, phone numbers, and monetary values. Documents are
  intelligently segmented into logical sections for improved clarity. Live on
  SN-84.
- JSON Data Structuring: Compiles and formats extracted data into a concise,
  readable JSON file, significantly reducing document review time.
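
To make the JSON structuring step more concrete, here is a minimal Python sketch of how results from the capabilities above might be gathered into one JSON document. It is only an illustration: the class names (`Checkbox`, `DocumentInsights`), the field names, and the sample values are all assumptions made for this example, not the subnet's actual output schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Dict, List


@dataclass
class Checkbox:
    # Hypothetical shape for one detected checkbox and its associated text.
    label: str
    checked: bool


@dataclass
class DocumentInsights:
    # Hypothetical container; these field names are illustrative only.
    document_type: str  # e.g. "receipt", "form", "letter"
    entities: Dict[str, List[str]] = field(default_factory=dict)
    checkboxes: List[Checkbox] = field(default_factory=list)
    sections: List[str] = field(default_factory=list)


def to_json(insights: DocumentInsights) -> str:
    """Serialize the collected insights as indented, readable JSON."""
    return json.dumps(asdict(insights), indent=2, ensure_ascii=False)


# Made-up sample values, used purely to show the resulting structure.
example = DocumentInsights(
    document_type="form",
    entities={
        "names": ["Jane Doe"],
        "phone_numbers": ["555-0100"],
        "monetary_values": ["$120.00"],
    },
    checkboxes=[Checkbox(label="I agree to the terms", checked=True)],
    sections=["Applicant details", "Consent"],
)

if __name__ == "__main__":
    print(to_json(example))
```

Grouping entities by type and keeping checkboxes as explicit label/state pairs keeps the file easy to scan during review; a real deployment would follow whatever schema the subnet itself defines.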