# Web Scraper to Markdown API MCP server

Extract clean markdown from any URL. Removes boilerplate. For RAG pipelines. x402.

## Links
- Registry page: https://www.getdrio.com/mcp/io-github-br0ski777-web-scraper
- Repository: https://github.com/Br0ski777/web-scraper-x402
- Website: https://github.com/Br0ski777/web-scraper-x402

## Install
- Endpoint: https://web-scraper.api.klymax402.com/mcp
- Auth: Not captured

## Setup notes
- Remote endpoint: https://web-scraper.api.klymax402.com/mcp

## Tools
- web_scrape_to_markdown - Scrape and extract content from a URL with full JS rendering, returned as clean markdown. Alternative to Firecrawl scrape at 2.5x lower cost. Strips navigation, ads, scripts, and boilerplate — ideal for RAG pipelines and AI research agents.

1. title (string) -- page title from <title> tag
2. description (string) -- meta description
3. author (string) -- author from meta tags or schema
4. content (string) -- clean markdown body text, headings preserved
5. wordCount (number) -- total words in extracted content
6. charCount (number) -- total characters
7. url (string) -- final URL after redirects

Example output: {"title":"How to Scale APIs","description":"A guide to...","content":"# How to Scale APIs\n\nScaling requires...","wordCount":1250,"charCount":7800,"url":"https://blog.example.com/scale-apis"}

Use this BEFORE summarizing articles, building RAG corpora, researching topics from web sources, or extracting data from documentation pages. Essential for any workflow that needs to scrape and extract content from web pages as LLM input. Drop-in replacement for Firecrawl scrape.

Do NOT use for screenshots -- use capture_screenshot instead. Do NOT use for SEO audit -- use seo_audit_page instead. Do NOT use for tech stack detection -- use website_detect_tech_stack instead. Do NOT use for web search -- use web_search_query instead. Endpoint: https://web-scraper.api.klymax402.com/mcp
- web_scrape_batch - Use this when you need to extract clean content from multiple web pages at once (up to 10 URLs). Returns the same structured markdown output as web_scrape_to_markdown for each URL.

1. results (array) -- each entry has title, description, author, content, wordCount, charCount, url
2. summary -- total pages scraped, total word count, failed URLs if any

Example output: {"results":[{"url":"https://a.com","title":"Page A","wordCount":800},{"url":"https://b.com","title":"Page B","wordCount":1200}],"summary":{"total":2,"totalWords":2000,"failed":0}}

Use this FOR building research corpora, comparing content across competitor pages, or bulk documentation extraction. Essential when you have 3+ URLs to process in one workflow.

Do NOT use for single URLs -- use web_scrape_to_markdown instead. Do NOT use for SEO comparison -- use seo_audit_batch instead. Endpoint: https://web-scraper.api.klymax402.com/mcp

## Resources
Not captured

## Prompts
Not captured

## Metadata
- Owner: io.github.Br0ski777
- Version: 1.2.0
- Runtime: Sse
- Transports: HTTP
- License: Not captured
- Language: Not captured
- Stars: Not captured
- Updated: May 16, 2026
- Source: https://registry.modelcontextprotocol.io