Search Results for "convert word documents to html"

Showing 123 open source projects for "convert word documents to html"

View related business solutions
  • The training management software of choice for commercial training providers who want to save time and crush sales targets. Icon
    The training management software of choice for commercial training providers who want to save time and crush sales targets.

    On average, Arlo clients reduce administration by 43% and grow registrations by 76%.

    Arlo is training management software for training providers who want to save time and crush sales targets. It is a complete training management solution to promote, sell and deliver instructor-led, online and blended learning. Arlo takes care of your public training schedule, private in-house courses and ongoing training contracts, so you to manage your whole business in one system. It saves you time by automating manual processes and helps you grow with slick ecommerce and marketing tools. See for yourself by starting a free trial.
    Try for Free
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Learn More
  • 1
    DocStrange

    DocStrange

    Extract and convert data from any document, images, pdfs, word doc

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services. It is built for developers who need high-quality parsing from scans, photos, PDFs, office files, and other document sources while preserving privacy and control over the processing flow. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    rmarkdown

    rmarkdown

    Dynamic Documents for R

    R Markdown is an R package for creating dynamic, reproducible documents that combine code (R, Python, SQL, etc.), results (figures, tables), and narrative text. Built on Knitr and Pandoc, it supports generating HTML, PDF, Word, slideshows, dashboards, and more. It’s widely used in data science and reproducible reporting workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    PasteMD

    PasteMD

    Paste Markdown and AI responses into Word Excel instantly fast

    PasteMD is a lightweight desktop utility designed to streamline the process of transferring formatted content from the clipboard into office applications such as Word, WPS, and Excel. It primarily targets users who frequently copy content from AI chat tools or web pages and encounter formatting issues, especially with Markdown, tables, and LaTeX formulas. PasteMD operates from the system tray and monitors clipboard content, automatically converting Markdown or HTML into properly formatted documents using Pandoc. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    PyMuPDF

    PyMuPDF

    Python bindings for MuPDF's rendering library.

    ...It renders text with metrics and spacing accurate to within fractions of a pixel for the highest fidelity in reproducing the look of a printed page on the screen. The viewer is small, fast, yet complete. It supports many document formats, such as PDF, XPS, OpenXPS, CBZ, EPUB, and FictionBook 2. You can annotate PDF documents and fill out forms with the mobile viewers (this feature is coming soon to the desktop viewer as well). The command line tools allow you to annotate, edit, and convert documents to other formats such as HTML, SVG, PDF, and CBZ. You can also write scripts to manipulate documents using Javascript. The library is written modularly in portable C, so features can be added and removed by integrators if they so desire.
    Downloads: 19 This Week
    Last Update:
    See Project
  • Cloud data warehouse to power your data-driven innovation Icon
    Cloud data warehouse to power your data-driven innovation

    BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.

    BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
    Try for free
  • 5
    WeChatMsg

    WeChatMsg

    Project aimed at extracting, exporting, and analyzing chat records

    WeChatMsg repository hosts an open-source project aimed at extracting, exporting, and analyzing chat records from the WeChat messaging platform. It provides tools that read local WeChat database files and allow users to convert chat data into readable formats such as HTML, Word, and CSV, making it possible to inspect conversations outside the mobile app environment. Beyond simple export, the project includes mechanisms for analyzing chat histories and generating annual reports or visual summaries about messaging trends, interaction patterns, and more. ...
    Downloads: 273 This Week
    Last Update:
    See Project
  • 6
    Pandoc

    Pandoc

    The universal markup converter

    Pandoc is a universal document converter able to convert files from a multitude of markup formats into another. With Pandoc, you have a swiss-army knife of a converter, able to convert practically any markup format into any other. Pandoc contains a Haskell library for conversions as well as a command-line tool that uses this library. It can convert to and from just about anything-- lightweight markup formats, HTML formats, documentation formats, ebooks, TeX formats, word processor formats and so much more. ...
    Downloads: 260 This Week
    Last Update:
    See Project
  • 7
    bookdown

    bookdown

    Authoring Books and Technical Documents with R Markdown

    A open-source (GPL-3) R package to facilitate writing books and long-form articles/reports with R Markdown. Generate printer-ready books and ebooks from R Markdown documents. A markup language easier to learn than LaTeX, and to write elements such as section headers, lists, quotes, figures, tables, and citations. Multiple choices of output formats: PDF, LaTeX, HTML, EPUB, and Word. Possibility of including dynamic graphics and interactive applications (HTML widgets and Shiny apps) Support for languages other than R, including C/C++, Python, and SQL, etc. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    iLovePDF Api

    iLovePDF Api

    iLovePDF Rest Api - PHP Library

    ...We offer a simple and concise API Reference and Guide as well as API Libraries with their own docs too. Our infrastructure uses the best PDF technology for processing PDF files. Merge and split documents with a variety of custom options. Remove, extract or organize PDF pages as you need. Reduce the size of your PDF while maintaining its original quality and formatting. Easily convert Images, MS Word, PowerPoint and Excel files into non-editable PDF documents. Convert PDF documents to JPG images or to PDF/A format.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    DOCX Document Converter

    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    A simple, free, unlimited, secure web-based tool that converts Microsoft Word documents (.docx) into Markdown (.md/.txt) and HTML files. Perfect for developers, writers, and anyone who needs to transform .docx MS Office Word documents into web-friendly or AI context friendly formats. Unlike those other jerks on the web that charge many dollars per month for this, I made it free, unlimited and open source.
    Downloads: 13 This Week
    Last Update:
    See Project
  • Hightouch is a data and AI platform for marketing and personalization. Icon
    Hightouch is a data and AI platform for marketing and personalization.

    Marketing needs data and AI. Give them Hightouch.

    Find insights, run real-time campaigns, and build AI agents with all your data.
    Learn More
  • 10
    Unstructured.IO

    Unstructured.IO

    Open source libraries and APIs to build custom preprocessing pipelines

    The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. The use cases of unstructured revolve around streamlining and optimizing the data processing workflow for LLMs. unstructured modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and is efficient in transforming unstructured data into structured outputs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    PDF to Word Converter
    Coolmuster PDF to Word Converter turns any PDF into a fully editable Microsoft Word document — with perfect preservation of: Text styles & fonts Images & graphics Tables & columns Hyperlinks Overall layout & formatting No more re-formatting after conversion.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Extractous

    Extractous

    Fast and efficient unstructured data extraction

    Extractous is a Rust-based unstructured data extraction library focused on fast local parsing of documents and other content-heavy files. Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Huxtable

    Huxtable

    An R package to create styled tables in multiple output formats

    Huxtable is an R package to create LaTeX and HTML tables, with a friendly, modern interface. Features include control over text styling, number format, background color, borders, padding, and alignment. Cells can span multiple rows and/or columns. Tables can be manipulated with standard R subsetting or dplyr functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    LandPPT

    LandPPT

    An LLM-based presentation generation platform

    LandPPT is an open-source AI platform that automatically generates professional presentation slides using large language models. The system allows users to create complete PowerPoint presentations simply by entering a topic or uploading source documents such as PDFs, Word files, or Markdown notes. Using natural language processing and structured content generation, the platform produces presentation outlines and converts them into fully formatted slide decks. The application integrates...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    Gotenberg

    Gotenberg

    A Docker-powered stateless API for PDF files

    Gotenberg provides a developer-friendly API to interact with powerful tools like Chromium and LibreOffice for converting numerous document formats (HTML, Markdown, Word, Excel, etc.) into PDF files, and more! Thanks to Docker, you don't have to install each tool in your environments; drop the Docker image in your stack, and you're good to go! The webhook feature allows you to upload the output file to the destination of your choice. There are many options to fit your requirements, from the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17
    OpenAPI Generator

    OpenAPI Generator

    OpenAPI Generator allows generation of API client libraries

    With 50+ client generators, you can easily generate code to interact with any server which exposes an OpenAPI document. Maintainers of APIs may also automatically generate and distribute clients as part of official SDKs. Each client supports different options and features, but all templates can be replaced with your own Mustache-based templates. Getting started with server development can be tough, especially if you're evaluating technologies. We can reduce the burden when you bring your own...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 18
    PandaWiki

    PandaWiki

    AI-powered open source platform for building intelligent wiki bases

    PandaWiki is an open source knowledge base system designed to help users build intelligent documentation platforms powered by large language models. It combines traditional wiki functionality with modern AI capabilities, allowing teams and individuals to create and manage product documentation, technical manuals, FAQs, and blog-style knowledge resources. PandaWiki provides tools for managing knowledge bases through an administrative interface while also generating public-facing wiki sites...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    csv2odf

    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    ...It can be used to create spreadsheets and documents for LibreOffice, OpenOffice, Microsoft Office Excel and Word. It is open source GPL v3 and crossplatform, it can run on most operating systems that can run Python (Python is required). More details, example files, and online manual at http://csv2odf.sf.net.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 20
    MSWord2CHM is a VB.NET application for converting MS Office Word documents (doc) into the compiled help (chm) document format, using the Primary Interop Assemblies (PIA) and the 'HTML Tidy' and 'TidyForNet' libraries
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    poi-tl

    poi-tl

    Generate awesome word(docx) with template

    A better way to generate word(docx) with the template, based on Apache POI. FreeMarker or Velocity generates new html pages or configuration files based on text templates and data. poi-tl is a Word template engine that generates new documents based on Word templates and data. The Word template has rich styles. Poi-tl will perfectly retain the styles in the template in the generated documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    pdf combiner merger converter splitter

    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly open source free to use, GUI-based tool for combining, pdf to excel, pdf to word, image to pdf, zip, unzip annotate and splitting PDF files. It is easy to use, supports multiple file insert and delete and process, and allows you to adjust the order of files before combining.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    LinWriter

    LinWriter

    small efficient text editor for Windows and Linux

    LinWriter is a small robust utility that can write quick documents and notes for your every day use. Providing a friendly and intuitive interface, it offers a distraction free environment to enhance productivity for your documents and notes. Key features include a minimalistic interface and support for common formats such as TXT, Markdown, HTML and its custom format 'rtdoc', which allows rich text editing. Including are also basic formatting options, inserting graphics, bullets, numbered...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Govel

    Govel

    Lightweight and lightning fast novel writing and organizing tool

    Govel is a lightweight novel writing and organizing software designed with the intention to leave your hands where they belong: on the keyboard. There are no fancy menus, submenus, dialogues, buttons, tabs, you name it. You don't even need to use the mouse, which makes your overall workflow faster than in any other program. You can still customize the visual appearance while beeing maximum productive. And the best of all: Govel is completly free.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    bridgex

    bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    Bridgex is an open‑source graphical interface for converting files to Markdown, built in Python and based on Pyside6 (Qt for Python). Its objective is to simplify access to the Markitdown library through a straightforward, modular visual experience. Features ✨ - Cross‑platform graphical interface. - Efficient file‑to‑Markdown conversion. - Modularity: easy to adapt and extend. - Support for multiple input formats. - Lightweight editing prior to saving. Supported Formats...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB