Document (PDF, Word, PPTX ...) extraction and parse API
A fast, helpful, and open-source document parser
JavaScript parser and stringifier for YAML
Python module for parsing semi-structured text into Python tables
Parse text and tables from PDF files.
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML
A fast, powerful, CommonMark compliant, extensible Markdown processor
RAG-Anything: All-in-One RAG Framework
A JavaScript library for parsing and formatting chords and chord sheets
Parse files for optimal RAG
Machine learning software for extracting information
Markdown parser, done right. 100% CommonMark support, extensions
An incremental parsing system for programming tools
Convert Notion pages, blocks, and lists of blocks to Markdown
A post-modern modal text editor
Zero-copy PDF text extraction library written in Zig
Java library for parsing and rendering CommonMark (Markdown)
Tree-sitter bindings for Emacs Lisp
Parser generator to read, process, or translate structured text
A Python tool to help extract information from structured PDFs
A Python library for AMR parsing, generation, and visualization
Fast and efficient unstructured data extraction
Semantic search and document parsing tools for the command line
A Markdown editor based on Vue
Multilingual Document Layout Parsing in a Single Vision-Language Model