Tianshu - Enterprise AI Data Preprocessing Platform
Free644 GitHub stars
Platform & FrameworkAgnosticFile System
Overview
Tianshu is an enterprise-level AI data preprocessing platform that converts PDF and Office documents to Markdown and integrates with MCP protocol AI assistants. It is designed for developers and data scientists looking for a comprehensive solution for document parsing and multimodal information extraction.