Enterprise DNA
M MCP Servers Developer low

microsoft/markitdown

by Various

Python tool for converting files and office documents to Markdown.

Added 1 June 2026

#autogen #autogen-extension #langchain #markdown #microsoft-office #openai #pdf

Overview

Python tool that converts files and Office documents into Markdown format. Handles multiple input types including PDFs, Word docs, Excel sheets, and images, outputting clean Markdown suitable for further processing or storage.

Best for

Developers building document pipeline tools or migrating content to Markdown-based systems

Use cases

  • Converting legacy Word documents to Markdown for documentation systems
  • Batch processing spreadsheets into structured Markdown tables
  • Extracting text from PDFs while preserving basic formatting

Notes

138,078 stars on GitHub. Last updated 2026-05-26. Licensed MIT.

Pros

  • Supports diverse file formats including Office suite documents
  • High community adoption with 138k+ GitHub stars
  • Handles images and extracts text from visual content

Cons

  • Python-only, requires runtime environment setup
  • Conversion quality varies by source format complexity
  • Limited control over output formatting rules

Indexed from awesome-mcp-servers-punkpeye and enriched against its public facts.

Pairs with

Other entries in the index that connect to this one.

Pairs with: 35 entries
M MCP Dev low

0xMassi/webclaw

Various

Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.

★ 1,269 updated 3d ago
M MCP Dev low

agenticdecks/deckrun-mcp

Various

MCP server for Deckrun — generate presentation PDFs, videos, and audio from Markdown

★ 1 updated 1mo ago
M MCP Dev low

ailenshen/apple-notes-mcp

Various

Read and write Apple Notes, with Apple Notes native formatting support

★ 7 updated 1mo ago
M MCP Dev low

AIMLPM/markcrawl

Various

Fast Python web crawler for RAG and AI ingestion. Extracts clean Markdown from any site for LLMs and vector stores.

★ 2 updated 17d ago
M MCP Dev low

aparajithn/agent-scraper-mcp

Various

Web scraping MCP server for AI agents — screenshots, content extraction, structured scraping

★ 4 updated 2mo ago
M MCP Dev low

arthurpanhku/DocSentinel

Various

MCP server for AI agent for cybersecurity: automate assessment of documents, questionnaires & reports. Multi-format parsing, RAG knowledge base, risks, compliance gaps, remediations

★ 90 updated 6d ago
M MCP Dev low

AryanBV/pdf-toolkit-mcp

Various

Write-capable PDF toolkit for any MCP client: 22 tools to read, create, render, encrypt, and transform PDFs. Vision rendering for scans, form-preserving merge and split, AES-256, z

★ 6 updated 4d ago
M MCP Dev low

bch1212/agentfetch-mcp

Various

MCP server for fetching web URLs with token estimation, caching, and intelligent routing. Built for AI agents.

★ 0 updated 24d ago
M MCP Dev low

calclavia/mcp-obsidian

Various

📇 🏠 - This is a connector to allow Claude Desktop (or any MCP client) to read and search any directory containing Markdown notes (such as an Obsidian vault).

M MCP Dev low

caol64/wenyan-mcp

Various

Wenyan MCP Server lets AI automatically format Markdown articles and publish them to WeChat Official Accounts.

★ 1,235 updated 1mo ago
M MCP Dev low

danielkennedy1/pdf-tools-mcp

Various

🐍 - PDF download, view & manipulation utilities.

★ 31 updated 1y ago
M MCP Dev low

dodopayments/contextmcp

Various

Self-hosted MCP server for your documentation

★ 48 updated 2d ago
M MCP Dev low

drolosoft/go-docs-mcp

Various

📄🐹⚡ Go MCP server for multi-format document access — PDF, TXT, MD, DOCX, CSV, images. Install and Go.

★ 3 updated 15d ago
M MCP Dev low

epicsagas/alcove

Various

Alcove is an MCP server that gives AI coding agents on-demand access to your private project docs — BM25 + vector hybrid search for precision retrieval, tree-sitter code indexing s

★ 9 updated 2d ago
M MCP Dev low

Erodenn/fetch-guard

Various

Fetch URLs and return clean, LLM-ready markdown with metadata and layered prompt injection defense. Configurable timeouts, word limits, JS rendering, and link extraction. All-in-on

★ 0 updated 2mo ago
M MCP Dev low

exa-labs/exa-mcp-server

Various

Exa MCP for web search and web crawling!

★ 4,513 updated 12d ago
M MCP Dev low

exoticknight/mcp-file-merger

Various

MCP server for merging multiple files into one

★ 26 updated 9mo ago
M MCP Dev low

FacundoLucci/plsreadme

Various

FacundoLucci/plsreadme — indexed from awesome-mcp-servers-punkpeye

★ 2 updated 1mo ago
M MCP Dev low

Harry-027/JotDown

Various

An MCP Server in Rust for creating Notion pages & mdBooks with LLMs 🦀

★ 21 updated 3mo ago
M MCP Dev low

isaacphi/mcp-gdrive

Various

Model Context Protocol (MCP) Server for reading from Google Drive and editing Google Sheets

★ 281 updated 1y ago
M MCP Dev low

jinzcdev/markmap-mcp-server

Various

An MCP server for converting Markdown to interactive mind maps with export support (PNG/JPG/SVG).

★ 205 updated 2mo ago
M MCP Dev low

johannesbrandenburger/typst-mcp

Various

Typst MCP Server is an MCP (Model Context Protocol) implementation that helps AI models interact with Typst, a markup-based typesetting system. The server provides tools for conver

★ 157 updated 1mo ago
M MCP Dev low

kc23go/anybrowse

Various

Web scraping MCP server for AI agents. Real Chrome, 84% success rate. 10 free calls/day, no signup.

★ 3 updated 8d ago
M MCP Dev low

kehvinbehvin/json-mcp-filter

Various

JSON MCP server to filter only relevant data for your LLM

★ 24 updated 2mo ago
M MCP Dev low

madhan-g-p/DevDocs-MCP

Various

Documentation Authority for AI Agents based upon Devdocs

★ 11 updated 24d ago
M MCP Dev low

MarceauSolutions/md-to-pdf-mcp

Various

Convert Markdown to professional PDFs with customizable themes - MCP server for Claude Desktop

★ 3 updated 4mo ago
M MCP Dev low

mark3labs/mcp-filesystem-server

Various

Go server implementing Model Context Protocol (MCP) for filesystem operations.

★ 646 updated 6mo ago
M MCP Dev low

MobileReality/mdma

Various

Interactive documents from Markdown. Extends MD with forms, approvals, webhooks, and more — built for next gen apps

★ 13 updated 6d ago
M MCP Dev low

pskill9/website-downloader

Various

MCP server to download entire websites

★ 151 updated 1y ago
M MCP Dev low

Retio-ai/pagemap

Various

🐍 🏠 - Compresses ~100K-token HTML into 2-5K-token structured maps while preserving every actionable element. AI agents can read and interact with any web page at 97% fewer tokens

★ 32 updated 13d ago
M MCP Dev low

SecurityRonin/docx-mcp

Various

MCP server for reading and editing Word (.docx) documents with track changes, comments, footnotes, and structural validation

★ 19 updated 6d ago
M MCP Dev low

sifter-ai/sifter

Various

Sifter is an open-source, developer-first document extraction engine that turns unstructured documents — invoices, contracts, receipts, reports — into a structured, queryable datab

★ 44 updated 5d ago
M MCP Dev low

UnMarkdown/mcp-server

Various

MCP server for the Unmarkdown API: Convert markdown, manage documents, publish pages

★ 0 updated 3mo ago
M MCP Dev low

vezlo/src-to-kb

Various

Convert source code to LLM ready knowledge base

★ 33 updated 5mo ago
M MCP Dev low

Zacccck/Claude-MCP-Read-Email-Attachments

Various

Local MCP server for Claude — read and parse Outlook email attachments via Microsoft Graph API

★ 8 updated 1mo ago
Alternatives: 10 entries
M MCP Dev low

adeu

ai.adeu

docx ↔ LLM translator. Projects .docx to Markdown for editing. Projects edits back to OOXML as tracked changes. Python and Node.js implementations.

★ 86 updated 2d ago
M MCP Dev low

dodopayments/contextmcp

Various

Self-hosted MCP server for your documentation

★ 48 updated 2d ago
M MCP Dev low

just-every/mcp-read-website-fast

Various

Quickly reads webpages and converts to markdown for fast, token efficient web scraping

★ 150 updated 1mo ago
M MCP Dev low

kimwwk/repocrunch

Various

Analyze GitHub repos into structured JSON. No AI, fully deterministic.

★ 8 updated 2mo ago
M MCP Dev low

lfnovo/content-core

Various

Extract what matters from any media source

★ 152 updated 22d ago
M MCP Dev low

linxule/mineru-mcp

Various

📇 ☁️ - MCP server for MinerU document parsing API. Parse PDFs, images, DOCX, and PPTX with OCR (109 languages), batch processing (200 docs), page ranges, and local file upload. 73

★ 5 updated 27d ago
M MCP Dev low

NameetP/pdfmux

Various

PDF extraction that checks its own work. #2 reading order accuracy — zero AI, zero GPU, zero cost.

★ 66 updated 12d ago
M MCP Dev low

opendatalab/MinerU-Ecosystem

Various

opendatalab/MinerU-Ecosystem — indexed from awesome-mcp-servers-punkpeye

★ 105 updated 23d ago
M MCP Dev low

talonicdev/talonic-mcp

Various

Official Talonic MCP server. Lets AI agents extract structured data from any document via the Model Context Protocol.

★ 4 updated 2d ago
M MCP Dev low

zcaceres/markdownify-mcp

Various

A Model Context Protocol server for converting almost anything to Markdown

★ 2,714 updated 7d ago