OCRmyPDF adds an OCR text layer to scanned PDF files
Open Security Controls Assessment Language (OSCAL)
Tom Preston-Werner's obvious, minimal language
Video-based AI memory library. Store millions of text chunks in MP4
Always know what to expect from your data
The lxml XML toolkit for Python
Easily serialize Data Classes to and from JSON
CLI tool to filter JSON and JSON Lines data with Python syntax
CLI tool to extract (meta)data from PDF and manipulate PDF files
TikZ figures for concepts in physics/chemistry/ML
Situational Awareness Server compatible with TAK clients
A Python tool to help extracting information from structured PDFs
The social web translator
Edit PDF files with Nano Banana
Cortex Analyzers Repository
Diff JSON and JSON-like structures in Python
Yet another serialization library on top of dataclasses
tmux session manager. built on libtmux
An implementation of the JSON Schema specification for Python
pytablewriter is a Python library to write a table in various formats
The data structure for multimodal data
A fast serialization and validation library, with builtin
A simple tool for reading in poorly redacted documents
LaTeX CV generator from a YAML/JSON input file
Open-Source Python3 tool for recognizing layouts, tables, and math